Research November 09, 2024 OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Research November 09, 2024 Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
Research October 28, 2024 MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Research October 21, 2024 A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration
Research October 17, 2024 Looking Inward: Language Models Can Learn About Themselves by Introspection