Research June 25, 2024 Data curation via joint example selection further accelerates multimodal learning Abstract
Research June 20, 2024 Connecting the Dots - LLMs can Infer and Verbalize Latent Structure from Disparate Training Data Abstract
Research June 13, 2024 Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B Monte Carlo Trees with Llama-3 8B solve mathematics limitations of LLMs
Research June 13, 2024 Depth Anything V2 Samba architecture achieves 3.73x faster throughput with enhanced context
Research June 11, 2024 Samba - Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Samba architecture achieves 3.73x faster throughput with enhanced context
Research May 16, 2024 Chameleon - Mixed-Modal Early-Fusion Foundation Models Chameleon integrates images and text, achieving state-of-the-art performance.
Research May 16, 2024 CAT3D - Create Anything in 3D with Multi-View Diffusion Models CAT3D generates high-quality 3D content quickly with multi-view diffusion models.
Research May 15, 2024 LoRA Learns Less and Forgets Less LoRA compared to full finetuning, shows strong regularization effects..
Research April 27, 2024 DeepSeekMath Pushing the Limits of Mathematical Reasoning in Open Language Models LoRA compared to full finetuning, shows strong regularization effects..