Home > Blogs > What Is RAG? : Retrieval Augmented Generation

What Is RAG? : Retrieval Augmented Generation

Tech Explained

Diving deeper into the world of large language models (LLMs), the term RAG—Retrieval Augmented Generation is often used. Let’s break it down: What RAG is, How it functions, and Why it’s such a transformative technology in Artificial Intelligence.

How Does RAG Work?

RAG employs two main components:

Retriever: This part of the model searches through the data and pull out relevant documents.
Generator: Once the relevant documents are retrieved, the generator uses this information to craft responses or generate text.

Generation

When a user asks a question, the LLM model provides a confident response based on the parameters that it has learned during training.

The issue here is that the Response may not be accurate as data may evolve with time.

RETRIEVAL AUGMENTED

When a user asks a question, we can provide a content store, say the internet, and combine that into the prompt.

Now, the LLM can provide relevant response with evidence.

Advantages

Model hallucinates less
Model response can be accurate and positive
Model says “I don’t know” if the user’s question cannot be reliably answered.

More In Tech Explained

Tech Explained

Quick Guide To Quantization In Machine Learning

Models are compressed and downsized using techniques such as Model Quantization to address the constraints of local computing.

Tech Explained

Why you should care about MACs and FLOPs in Neural Network?

Building hardware-aware models ensures they are optimized for latency, power, and overall efficiency, making them suitable for real-world applications.

More Blogs

Tips & How-To

Accelerate Machine Learning Development With These Tools

Use a suite of tools that streamline and enhance development process. These tools have been trusted in many projects, which have accelerated Machine Learning development.

Tips & How-To

Top 20 Linux Commands for every Machine Learning Engineer

Explore the top 20 Linux commands that every Machine Learning Engineer should know to enhance productivity and streamline their work.

Trends

Exploring the Evolution Beyond Transformers: Unveiling the Power of State Space Models with Mamba

While the industry has been heavily focused on Transformers, it's exciting to see how State Space Models (SSMs) are emerging as the next-generation alternative.