RAG Academy
An interactive sandbox for Retrieval-Augmented Generation — watch every step happen, from chunk to grounded answer.
- Role
- Solo build
- Period
- 2025 – present
- rag
- ai
- openai
- cohere
- embeddings
- vector-search
- self-hosted
- education
An interactive sandbox for Retrieval-Augmented Generation — watch every step happen, from chunk to grounded answer.
Files & links
A web app that exposes every stage of a RAG pipeline — chunk → embed → retrieve → generate — so you can see the choices that most RAG tutorials hide behind a chatbot.
Live at ragacademy.bojansavic.com — self-hosted and free to try. Sign up, run the pipeline on your own text, and watch every step. It calls real OpenAI + Cohere + GPT-4o APIs, so a light per-visitor daily cap keeps the bill honest.
Step through a full RAG loop on your own document (upload, paste, or use a bundled sample):
Every step has an inline explainer so the mechanism stays visible. Nothing in the pipeline is magic — just a chain of deterministic choices you can inspect.
Most RAG tutorials treat the pipeline as a black box: drop a PDF in, get a chatbot out. That’s fine for shipping a demo and useless for understanding why a RAG system fails or succeeds. The failures sit upstream of the LLM call — wrong chunk size, wrong overlap, wrong embedding model, wrong top-k — and you can’t debug what you can’t see.
RAG Academy is the tool I wished existed while learning this stack. Every choice at every stage is visible, swappable, and comparable. Pick a chunking strategy, watch the chunks change. Swap the embedding model, watch the similarity rankings change. Ask a question, watch retrieval pick a different chunk, watch the answer drift.
The L&D angle is why the visual design matters — you teach what you can see. The AI-for-L&D practitioner angle is why this matters in practice: the teams buying “AI-powered learning” rarely understand what their system is actually retrieving, and the pre-processing step — the one that decides most of the quality — is the one they’re least equipped to reason about.
Live and open at ragacademy.bojansavic.com — sign up and run the full loop on your own text. Because every run spends real API credits, there is a light per-visitor daily cap. Follow-up deep-dives planned on how chunking strategy alone changes answer quality.