News August 27, 2024 Introducing LLaVA v1.5 7B on GroqCloud Introducing LLaVA v1.5 7B: The Next Level of Multimodal AI on GroqCloud
News August 27, 2024 Introducing Cerebras Inference - AI at Instant Speed Cerebras has unveiled its new AI inference solution, claiming it to be the fastest in the world, outpacing NVIDIA GPU-based clouds by 20 times and delivering industry-leading cost efficiency.
News August 27, 2024 Revolutionizing Enterprise Applications with NVIDIA's NIM Agent Blueprints NVIDIA's NIM Agent Blueprints empower enterprises to build and deploy customized generative AI applications, driving business transformation and innovation across industries.
News August 22, 2024 The Jamba 1.5 Open Model Family - The Most Powerful and Efficient Long Context Models AI21 Labs has introduced the Jamba 1.5 family of models, designed to revolutionize enterprise-level AI with unmatched speed, efficiency, and quality. The models, Jamba 1.5 Mini and Jamba 1.5 Large, are built on
News August 21, 2024 Enhancing retrieval augmented generation through drafting In the evolving landscape of AI, large language models (LLMs) have become essential for generating human-like text. However, these models often struggle with accuracy, particularly when tasked with answering complex, knowledge-intensive questions. This
News August 21, 2024 NVIDIA and Mistral AI's Mistral-NeMo-Minitron 8B Model - A Leap Forward in LLM Efficiency NVIDIA and Mistral AI have introduced the Mistral-NeMo-Minitron 8B model, an advanced large language model (LLM) that delivers exceptional accuracy across nine popular benchmarks. This model is a pruned and distilled version of
Research August 20, 2024 Automating Thought of Search - A Journey Towards Soundness and Completeness
Research August 20, 2024 Transfusion - Predict the Next Token and Diffuse Images with One Multi-Modal Model