Research September 06, 2024 Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers Abstract
Research August 20, 2024 Automating Thought of Search - A Journey Towards Soundness and Completeness Abstract
Research August 20, 2024 Transfusion - Predict the Next Token and Diffuse Images with One Multi-Modal Model Abstract
Research August 06, 2024 Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Abstract
Research August 06, 2024 Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Abstract
Research August 06, 2024 Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Abstract
Research July 22, 2024 Stretching Each Dollar- Diffusion Training from Scratch on a Micro-Budget Abstract
Research July 07, 2024 Selective Reflection-Tuning, Student-Selected Data Recycling for LLM Instruction-Tuning Abstract
Research June 25, 2024 Data curation via joint example selection further accelerates multimodal learning Abstract
Research June 20, 2024 Connecting the Dots - LLMs can Infer and Verbalize Latent Structure from Disparate Training Data Abstract
Research June 13, 2024 Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B Monte Carlo Trees with Llama-3 8B solve mathematics limitations of LLMs
Research June 13, 2024 Depth Anything V2 Samba architecture achieves 3.73x faster throughput with enhanced context
Research June 11, 2024 Samba - Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Samba architecture achieves 3.73x faster throughput with enhanced context
Research May 16, 2024 Chameleon - Mixed-Modal Early-Fusion Foundation Models Chameleon integrates images and text, achieving state-of-the-art performance.
Research May 16, 2024 CAT3D - Create Anything in 3D with Multi-View Diffusion Models CAT3D generates high-quality 3D content quickly with multi-view diffusion models.
Research May 15, 2024 LoRA Learns Less and Forgets Less LoRA compared to full finetuning, shows strong regularization effects..
Research January 04, 2022 Deep Learning Interviews - Hundreds of fully solved job interview questions from a wide range of key topics in AI Deep Learning Interview: the best preparation book for AI/ML job seekers and students. Free on ArXiv.