RAG

Retrieval-augmented generation, vector stores, indexers.

70 tools

Why it matters

RAG isn't a model, it's an architecture — retrieve, augment, generate. The choice is between frameworks that orchestrate the retrieval and the vector stores underneath.

What's in here

Includes RAG frameworks (LlamaIndex, LangChain), managed vector databases (Pinecone), open-source vector stores (Weaviate, Chroma, Vespa), and hybrid-search engines.

How to pick

Pick LlamaIndex when retrieval quality is the bottleneck. Pick Pinecone for zero-ops production. Pick Weaviate or Chroma for self-hosted or budget-conscious. Pick Vespa at scale beyond a few million docs.

Quivr

RAG · Multi-model (OpenAI, Anthropic, Mistral, Gemma)

Open-source RAG framework for building custom AI assistants over your own documents in a few lines of Python.

Free· Open source (pip install quivr-core); pay only for LLM/vector-store usagedocument-qacustom-knowledge-base

RAGFlow

RAG · Multi-model

Open-source RAG engine with deep document parsing, hybrid search, and visual agent orchestration.

Freemium· Free tier; Starter $29/mo; Pro $129/mo; Enterprise customdocument-qaenterprise-search

RAGs by LlamaIndex

RAG · Multi-model (OpenAI, Anthropic, Replicate, HuggingFace)

Open-source Streamlit app that builds a custom RAG pipeline from a natural-language brief.

Free· Free, MIT-licensed; bring your own model/API keysnatural-language-rag-builderdocument-qa

Rayyan

RAG

AI-assisted systematic review platform for screening, deduplicating, and extracting data from large literature corpora.

Freemium· Free forever for basic use; enterprise custom pricingsystematic-reviewliterature-screening

Recall

RAG · Multi-model (GPT, Claude, Gemini)

AI-powered personal knowledge base that summarizes, links, and quizzes you on everything you save.

Freemium· Free tier; Premium upgrade with 30-day refundknowledge-managementvideo-summarization

Refinder AI

RAG

Chat-native enterprise search and task automation hub that lives inside Slack and Google Chat.

Freemium· Free plan available; enterprise pricing on requestenterprise-searchslack-automation

Rivestack

RAG · OpenAI embeddings (auto-embeddings)

Managed Postgres with pgvector on dedicated NVMe, pitched as a cheaper RAG backend than Pinecone or Supabase.

Freemium· Free shared tier; Solo $15/mo, Starter $35, Growth $59, Scale $99 (EU Central)rag-backendvector-search

SciSpace

RAG · Multi-model

AI research assistant that turns dense PDFs and literature reviews into searchable, citation-backed answers.

Freemium· Free tier; Premium $12/mo; Advanced $70/mo; Teams $20/user/moliterature-reviewchat-with-pdf

Scite

RAG · Multi-model

AI research assistant that grades citations as supporting, contrasting, or mentioning across 1.6B citation statements.

Freemium· Free tier; Personal ~$20/user/mo ($12 annual); Organization customliterature-reviewcitation-analysis

Singlebase Cloud

RAG · Multi-model

AI-native Firebase alternative bundling document DB, vector DB, auth, storage, and built-in AI services.

Freemium· Free tier available; paid plans scale with usagevector-searchrag-apps

SiteGPT

RAG · GPT-4 (per testimonials; not publicly specified)

Custom GPT-powered support chatbots trained on your website content and docs.

Freemium· 7-day free trial; Starter $39/mo, Growth $79/mo, Scale $259/mo, Enterprise customcustomer-supportwebsite-chatbot

SiteSpeakAI

RAG · Multi-model

Custom-trained chatbot that turns your website, docs, and PDFs into a multilingual support and lead-gen agent.

Freemium· Free tier; Starter $29/mo, Pro $79/mo, Growth $249/mo, Business $499/mowebsite-chatbotcustomer-support

Superduper

RAG · Multi-model

Enterprise AI agent orchestration that brings RAG and agents to your existing data stack without migration.

Enterprise· Free trial on Snowflake Marketplace; enterprise self-hosted pricing on requestin-database-ragagent-orchestration

TurboVec

RAG

Rust-powered vector index with 2-4 bit TurboQuant compression for SIMD-accelerated RAG search.

Free· Free, MIT licensedvector-searchrag

UltraRAG

RAG · Multi-model (MiniCPM-Embedding-Light, AgentCPM-Report, BYO LLM)

Low-code, YAML-driven RAG pipeline orchestrator with a visual UI for building and demoing retrieval systems.

Free· Open source; self-hostedrag-pipelinesknowledge-base-qa

Vanna.ai

RAG · Multi-model (Anthropic, OpenAI, Gemini, Ollama)

Open-source text-to-SQL agent that learns your schema and writes queries against your real warehouse.

Freemium· Open-source free; paid cloud tier for hosted admin featurestext-to-sqlnatural-language-bi

WeKnora

RAG · Multi-model

Tencent's open-source RAG framework that turns raw documents into a queryable knowledge base, ReAct agent, and self-maintaining wiki.

Free· Free, open-source (self-hosted)document-qaenterprise-knowledge-base

Wren AI

RAG · Multi-model (OpenAI, Anthropic, Gemini, self-hosted)

Open-source GenBI semantic layer that lets AI agents query your warehouse in natural language with governed, accurate SQL.

Freemium· OSS free; Enterprise Cloud contact salestext-to-sqlsemantic-layer

You.com

RAG · Multi-model

Web search and research APIs purpose-built for LLMs and AI agents.

Freemium· Free trial; enterprise pricing on requestweb-search-apiagent-grounding

Yuxi

RAG · Multi-model

Open-source AI agent platform that fuses agentic RAG with knowledge graphs on a LangGraph runtime.

Free· Free, MIT-licensed self-hostagentic-ragknowledge-graphs

aiPDF

RAG

Chat-with-your-documents app that ingests PDFs, EPUBs, web pages and YouTube videos with cited answers.

Freemium· Free tier; paid Playful, Dynamic and Flagship plansdocument-chatpdf-summarization

alphaXiv

RAG · Multi-model

AI reading layer over arXiv with grounded Q&A, auto-summaries, and line-by-line discussion on every preprint.

Free· Free, no signup requiredpaper-qaliterature-review