Research September 10, 2024 LLaMA-Omni - Seamless Speech Interaction with Large Language Models Abstract
Research September 10, 2024 GroUSE - A Benchmark to Evaluate Evaluators in Grounded Question Answering Abstract
News September 10, 2024 SambaNova Launches The World's Fastest AI Platform In an exciting development for AI and machine learning, SambaNova Systems has announced the launch of SambaNova Cloud, the world’s fastest AI inference platform. Leveraging the power of its SN40L AI chip, SambaNova
Project September 09, 2024 Multiple datasources - Route selections I hope you enjoy every step so far. Until this point of our Langchain/RAG journey, we have managed to build a simple local application and a querry transformation assistant. But what happens when
News September 09, 2024 Exploring the Replit Agent - AI Power Coding for Developers Replit has long been at the forefront of integrating AI into software development, and their latest offering, the Replit Agent, is no exception. Currently available through a limited early access program, this AI-powered
News September 09, 2024 Google’s Illuminate - Transforming Academic Papers into AI-Generated Podcasts Google Labs has a long tradition of inviting users to explore innovative technologies, with notable successes like Gmail, which began as an exclusive beta. Now, Google is unveiling Illuminate, a groundbreaking project that
Research September 06, 2024 Open MAGVIT2 - An Open-Source Project Toward Democratizing Auto-regressive Visual Generation Abstract
Research September 06, 2024 Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers Abstract
News August 27, 2024 Introducing LLaVA V1.5 7B on GroqCloud Introducing LLaVA v1.5 7B: The Next Level of Multimodal AI on GroqCloud
News August 27, 2024 Introducing Cerebras Inference - AI at Instant Speed Cerebras has unveiled its new AI inference solution, claiming it to be the fastest in the world, outpacing NVIDIA GPU-based clouds by 20 times and delivering industry-leading cost efficiency.
News August 27, 2024 Revolutionizing Enterprise Applications with NVIDIA's NIM Agent Blueprints NVIDIA's NIM Agent Blueprints empower enterprises to build and deploy customized generative AI applications, driving business transformation and innovation across industries.
News August 22, 2024 The Jamba 1.5 Open Model Family-The Most Powerful and Efficient Long Context Models AI21 Labs has introduced the Jamba 1.5 family of models, designed to revolutionize enterprise-level AI with unmatched speed, efficiency, and quality. The models, Jamba 1.5 Mini and Jamba 1.5 Large, are built on
News August 21, 2024 Enhancing retrieval augmented generation through drafting In the evolving landscape of AI, large language models (LLMs) have become essential for generating human-like text. However, these models often struggle with accuracy, particularly when tasked with answering complex, knowledge-intensive questions. This
News August 21, 2024 NVIDIA and Mistral AI's Mistral-NeMo-Minitron 8B Model-A Leap Forward in LLM Efficiency NVIDIA and Mistral AI have introduced the Mistral-NeMo-Minitron 8B model, an advanced large language model (LLM) that delivers exceptional accuracy across nine popular benchmarks. This model is a pruned and distilled version of
Research August 20, 2024 Automating Thought of Search - A Journey Towards Soundness and Completeness Abstract
Research August 20, 2024 Transfusion - Predict the Next Token and Diffuse Images with One Multi-Modal Model Abstract
News August 20, 2024 Unlocking GPT-4o Fine-Tuning-A New Era for Custom AI Models Today marks a significant milestone for developers as GPT-4o, a highly anticipated AI model, opens up for fine-tuning. This feature allows developers to tailor GPT-4o for specific tasks, offering enhanced performance and cost
News August 16, 2024 Enhancing Music Recommendations with Transformers - A New Approach in YouTube Music Google presents a music recommendation ranking system that uses Transformer models to better understand the sequential nature of user actions based on the current user context.
News August 14, 2024 California's SB 1047 - A Weakened Bill on AI Safety California’s SB 1047, a bill initially aimed at preventing AI disasters, has been significantly weakened by amendments that reduce the state’s regulatory power, addressing concerns from AI firms while still holding developers liable
News August 14, 2024 Nous Research presents Hermes 3 Hermes 3 contains advanced long-term context retention and multi-turn conversation capability, complex roleplaying and internal monologue abilities, and enhanced agentic function-calling.
News August 14, 2024 Pruning and Distilling Llama 3.1 Structured weight pruning combined with knowledge distillation forms an effective and efficient strategy for obtaining progressively smaller language models from an initial larger sibling.
News August 14, 2024 Grok-2 Beta Release An early preview of Grok-2 is released, a significant step forward from X.AI's previous model Grok-1.5, featuring frontier capabilities in chat, coding, and reasoning.
News August 13, 2024 Transform your mobile device to a powerfull AI Assistant with Gemini Live. Gemini Live is available today to Advanced subscribers, along with conversational overlay on Android and even more connected apps.
News August 13, 2024 Sakana AI’s ‘AI Scientist’, Too Autonomous for Its Own Good? Sakana AI, in collaboration with scientists from the University of Oxford and the University of British Columbia, has developed an artificial intelligence system that can conduct end-to-end scientific research autonomously, called 'AI-Scientist'.
Research August 12, 2024 MoMa - Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts Abstract
Research August 06, 2024 Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Abstract
Research August 06, 2024 Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Abstract
Research August 06, 2024 Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Abstract
News August 02, 2024 Introducing GitHub Models The rise of the AI engineer with GitHub Models–bringing the power of industry leading large and small language models to GitHub's more than 100 million users directly on GitHub.
News August 01, 2024 LangGraph Studio - The first agent IDE LangGraph Studio provides a specialized agent IDE for visualizing, interacting with, and debugging complex agentic applications.
News July 31, 2024 Introducing the Galileo Hallucination Index - A New Benchmark for AI Accuracy Hallucination is a huge issue in the field of artificial intelligence. The accuracy and reliability of AI-generated content has become increasingly important in the last few years since people tend to rely on
Project July 29, 2024 RAG - Query Transformation Welcome back, I hope you enjoyed the first part of this series where we are going to explore a portion portion of RAG tool. It is higly suggested that you take a look
News July 29, 2024 Meta AI Introduces Segment Anything 2.0 - Revolutionizing Image and Video Segmentation Meta AI has once again pushed the boundaries of artificial intelligence with the release of Segment Anything 2.0 or as it is published SAM2 (Segment Anything Model). This latest iteration in image segmentation
News July 25, 2024 Llama 3.1 - Most capable model to date The recent release of Meta's Llama 3.1 marks a significant advancement in the field of open-source large language models (LLMs). As the first openly available model to rival top proprietary models, Llama 3.1
News July 25, 2024 AI achieves silver medal solving International Mathematical Olympiad problems AI Achieves Silver Medal Level in International Math Olympiad Problems: A Milestone in Computational Intelligence
News July 24, 2024 Mistral Unveils Mistral 7B - A Cutting-Edge Language Model In a significant leap for artificial intelligence, Mistral has announced the launch of Mistral 7B, a state-of-the-art language model designed to push the boundaries of what AI can achieve in natural language processing.
Project July 22, 2024 Introduction to RAG models Firstly, in case you don't know what is RAG here is an unofficial explanation. Imagine you’re on a treasure hunt, but instead of a dusty old map, you’ve got a genius guide who
Research July 22, 2024 Stretching Each Dollar- Diffusion Training from Scratch on a Micro-Budget Abstract
News July 18, 2024 GPT 4o Mini - Advancing cost-efficient intelligence In the official announcement, OpenAI, has introduced GPT-4o Mini, a new iteration in the GPT-4 series designed to deliver high-quality AI performance while being significantly more cost-effective. This latest model aims to make
Research July 07, 2024 Selective Reflection-Tuning, Student-Selected Data Recycling for LLM Instruction-Tuning Abstract
News July 07, 2024 MInference Now, you can process 1M context 10x faster in a single A100 using Long-context LLMs like LLaMA-3-8B-1M, GLM-4-1M, with even better accuracy, try MInference 1.0 right now! as stated in the announcement.
News July 06, 2024 Gen-3 Alpha opened by Runway As it is stated here Gen-3 Alpha is the first of an upcoming series of models trained by Runway on a new infrastructure built for large-scale multimodal training. It is a major improvement
News July 06, 2024 Gemini 1.5 Pro 2M context window Gemini 1.5 Pro 2M context window, code execution capabilities, and Gemma 2 are available today
News June 27, 2024 Google releases Gemma 2 Google has introduced Gemma 2, its latest generation of open AI models, aimed at enhancing research and development in artificial intelligence. With 9B and 27B parameter versions, Gemma 2 boasts significant improvements in
Research June 25, 2024 Data curation via joint example selection further accelerates multimodal learning Abstract
Research June 20, 2024 Connecting the Dots - LLMs can Infer and Verbalize Latent Structure from Disparate Training Data Abstract
News June 18, 2024 Meta releases New AI Research Models to Accelerate Innovation at Scale For over a decade, Meta's Fundamental AI Research (FAIR) team has been dedicated to advancing AI through open research. In light of rapid innovations in the field, we recognize that collaboration with the
News June 14, 2024 NVIDIA announced Nemotron 340B Nemotron-4 340B, a family of models optimized for NVIDIA NeMo and NVIDIA TensorRT-LLM, includes cutting-edge instruct and reward models, and a dataset for generative AI training.
News June 14, 2024 Google open Project During the Google I/O 2024 developer conference, Google revealed that Project IDX, its next-generation, AI-powered browser-based development environment, is now in open beta. Initially introduced in August as an invite-only service, Project IDX
Research June 13, 2024 Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B Monte Carlo Trees with Llama-3 8B solve mathematics limitations of LLMs
Research June 13, 2024 Depth Anything V2 Samba architecture achieves 3.73x faster throughput with enhanced context
Research June 11, 2024 Samba - Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Samba architecture achieves 3.73x faster throughput with enhanced context
News June 10, 2024 Apple announced partnership with ChatGPT Apple is partnering with OpenAI to put ChatGPT into Siri, the company announced at its WWDC 2024 keynote on 10th of June 2024.
News May 30, 2024 Anthropic's now lets you create bots to work for you and interact with external APIs and tools Tool use, which enables Claude to interact with external tools and APIs, is now generally available across the entire Claude 3 model family on the Anthropic Messages API, Amazon Bedrock, and Google Cloud's
News May 22, 2024 Paragon changes RAG model for your customers Integrate your multi-tenant AI SaaS with 100+ 3rd party apps with 70% less engineering.
Research May 16, 2024 Chameleon - Mixed-Modal Early-Fusion Foundation Models Chameleon integrates images and text, achieving state-of-the-art performance.
Research May 16, 2024 CAT3D - Create Anything in 3D with Multi-View Diffusion Models CAT3D generates high-quality 3D content quickly with multi-view diffusion models.
News May 16, 2024 Grok comes to Europe It has been announced that Grok AI model has expanded to Europe.
News May 16, 2024 Chatgpt new features announcements OpenAI is taking AI capabilities to another level with its new ChatGPT feature. This new update enhances the user experience by allowing ChatGPT to interact with tables, charts, and add files directly from
Research May 15, 2024 LoRA Learns Less and Forgets Less LoRA compared to full finetuning, shows strong regularization effects..
News May 14, 2024 Google Announcements Just a day after OpenAI wowed us with GPT-4o, Google decided it’s their turn to dazzle! Let's dive into the goodies unveiled at the Google IO conference.
Project Deep learning Pytorch vision December 26, 2023 Jingle or No Jingle. A Hilariously Serious Dive into PyTorch Image Classification for Santa Claus Detection. 🎄 Season's Greetings, data scientists and tech enthusiasts! In the spirit of ho-ho-hilarity and cutting-edge Christmas cheer, I present to you a Christmas-themed trip into the world of PyTorch image classification. Armed with the
Project Machine learning Nlp Bayesian statistics October 17, 2023 Uncovering Topics in BBC News with Latent Dirichlet Allocation in R Welcome everyone! Keeping in mind the post about Latent Dirichlet Allocation (in case you have not read it yet and you are interested, you can read it here), I am going explore the
Project Machine learning Nlp Bayesian statistics October 05, 2023 Topic Modelling - Latent Dirichlet Allocation Hello everyone! In this post I am going to go through an NLP subject. As you may have already read in this post's title, Topic Modelling is what I aim to explain to
Project Deep learning March 04, 2023 Neural Network ~ Predicting a Numerical Value Welcome back! As part of this introductory series on Neural Networks, we will be exploring the process of building NNs and making decisions about their topology. This includes determining the number of layers,
Research January 04, 2022 Deep Learning Interviews - Hundreds of fully solved job interview questions from a wide range of key topics in AI Deep Learning Interview: the best preparation book for AI/ML job seekers and students. Free on ArXiv.