RAG Agent Setup
RAG (Retrieval-Augmented Generation) agents use your knowledge base to provide accurate, grounded responses. They do not require fine-tuning but can be used alongside it in other modes.
๐ง Overviewโ
RAG agents retrieve relevant Q&A content at runtime, grounding responses in your indexed knowledge base.
๐งพ Form Fieldsโ
Base Modelโ
Select the foundational model used to run retrieval-augmented responses (e.g., GPT-4.1 mini
).
Enable Indexing for RAG Grounded Responsesโ
This toggle must be enabled to create a RAG agent.
Once enabled:
- Your Q&A pairs are semantically indexed.
- The agent fetches relevant answers during inference.
- Responses are grounded and source-traceable.
โ ๏ธ Important: Without enabling RAG indexing, the Create Agent button will remain disabled.
๐ก Notesโ
- RAG can be used independently or later combined with fine-tuned models.
- RAG-enabled agents automatically stay up-to-date as knowledge changes.
- Runtime usage incurs a small retrieval overhead, typically ~2ร standard model calls.