Glossary

What is RAG?

RAG, or Retrieval-Augmented Generation, is the pattern of retrieving relevant source material before an LLM generates an answer.

What RAG solves

RAG improves groundedness by pulling in current documentation, articles, code, or internal knowledge before the model answers. This is especially useful when the information changes faster than the model can be retrained.

It is one of the most common ways to reduce hallucinations in product and support workflows.

What RAG does not solve by itself

RAG does not automatically remember users across sessions. It can retrieve documents, but it is not the same thing as a durable memory layer for preferences, instructions, or prior decisions.

That is why many modern agent stacks pair RAG with memory rather than choosing one or the other.

Related concepts

Use these glossary pages and commercial landing pages to move from definition to implementation.

RAG vs memory layer

A fuller comparison of when each layer matters.

Vector database

Infrastructure commonly used in RAG systems.

AI agent memory layer

The complementary layer for user continuity and persistent state.

Put the concept into production

Start building