RAG

Retrieval-augmented generation, vector stores, indexers.

70 tools

Why it matters

RAG isn't a model, it's an architecture — retrieve, augment, generate. The choice is between frameworks that orchestrate the retrieval and the vector stores underneath.

What's in here

Includes RAG frameworks (LlamaIndex, LangChain), managed vector databases (Pinecone), open-source vector stores (Weaviate, Chroma, Vespa), and hybrid-search engines.

How to pick

Pick LlamaIndex when retrieval quality is the bottleneck. Pick Pinecone for zero-ops production. Pick Weaviate or Chroma for self-hosted or budget-conscious. Pick Vespa at scale beyond a few million docs.

Feast

RAG

Open-source feature store that serves consistent features to ML training and online inference, with RAG vector search built in.

Free· Free, open source (Apache 2.0); self-hostedfeature-storerag-retrieval

FinChat (Fiscal.ai)

RAG · Multi-model (proprietary finance-tuned copilot)

AI copilot for equity research that reads filings, transcripts, and KPI tables across 100,000+ public companies.

Freemium· Free; Pro $39/mo (annual) or $49/mo; Max and Enterprise API tiers aboveequity-researchearnings-call-analysis

Firecrawl

RAG

Web scraping and crawling API that returns LLM-ready markdown, JSON, or structured data from any URL.

Freemium· Free 1,000 credits/mo; paid Hobby/Standard/Growth tiers; Scale/Enterprise annualweb-scrapingrag-ingestion

FutureHouse Platform

RAG · Multi-model

Multi-agent AI research stack for scientists, with retrieval over 175M+ papers, patents, and trials.

Freemium· Free tier for academics; paid plans for higher rate limitsscientific-literature-searchautonomous-research-agent

GaliChat

RAG

No-code AI chatbot builder that trains on your website content for support and lead capture.

Freemium· Free tier (no credit card); paid plans for advanced featurescustomer-supportlead-generation

Genei

RAG · GPT-3 (per public site)

AI research assistant that summarizes PDFs and web pages and answers questions across your document library.

Freemium· 14-day free trial; Basic Â£3.99/mo; Pro Â£15.99/mopdf-summarizationresearch-assistant

Graphify

RAG · Multi-model

Open-source on-device knowledge graph engine that turns code, docs, papers, meetings and images into a queryable graph.

Free· MIT-licensed, free forever; cloud tier hinted but unpriced (waitlist)knowledge-graphcode-search

Graphiti

RAG · Multi-model

Open-source temporal knowledge graph framework for building agent memory that updates in real time.

Freemium· Open-source (Apache 2.0); managed Zep Cloud sold separatelyagent-memorytemporal-knowledge-graphs

Haystack

RAG · Multi-model

Open-source Python framework from deepset for building production RAG pipelines and LLM agents.

Freemium· Open-source free; deepset Enterprise Support and AI Platform via salesragagents

HelixDB

RAG

Unified graph-and-vector database built for AI agent memory and GraphRAG.

Freemium· Open-source core; managed cloud pricing on requestagent-memorygraphrag

Humata.ai

RAG · Multi-model

Chat-with-your-documents RAG tool with citation-backed answers across uploaded PDFs and files.

Freemium· Free (60 pages); Expert $9.99/mo; Team $49/user/mo; Enterprise on requestdocument-qaresearch-summarization

Kotaemon

RAG · Multi-model (OpenAI, LlamaCPP, any OpenAI-compatible endpoint)

Open-source RAG UI for chatting with your own documents, locally or self-hosted.

Free· Free, open-source (MIT-style); self-hosted infrastructure costs onlydocument-qaprivate-rag

LanceDB

RAG

Open-source multimodal lakehouse and vector database built for AI training and retrieval at petabyte scale.

Freemium· Open-source free; LanceDB Cloud and Enterprise via contact salesvector-searchrag

LangExtract

RAG · Multi-model (Gemini, GPT-4/4o, Ollama-hosted local models)

Google's open-source Python library for LLM-driven structured extraction from unstructured text, with source-grounded outputs.

Free· Library is free (Apache-2.0); LLM API costs depend on chosen backendstructured-extractiondocument-parsing

Langchain-Chatchat

RAG · Multi-model (GLM-4, Qwen2, Llama 3, etc. via Xinference/Ollama/LocalAI/FastChat)

Self-hostable RAG and agent framework that wires LangChain to any local open-source LLM and a knowledge base.

Free· Apache-2.0 open source; self-hosted, infra costs onlyprivate-knowledge-baseoffline-rag

MaxKB

RAG · Multi-model

Open-source enterprise RAG and agent platform with built-in workflow engine and multi-LLM support.

Freemium· Community edition free (GPLv3); paid enterprise editionenterprise-knowledge-basecustomer-support-bots

NotebookLM

RAG · Gemini 2.5

Google's source-grounded research notebook that turns your documents into chats, briefs, and AI-hosted podcasts.

Freemium· Free tier; Plus via Google One AI Premium ($19.99/mo) or Workspace add-ondocument Q&Aresearch synthesis

OneKE

RAG · Multi-model (OneKE-13B, LLaMA3, Qwen2.5, GPT, DeepSeek-R1)

Open-source multi-agent framework for schema-guided knowledge extraction from documents.

Free· Free, MIT-licensed; you pay for LLM API calls or self-hosted computeknowledge-graph-constructionnamed-entity-recognition

OpenDataLoader PDF

RAG

Open-source PDF parser built for RAG pipelines, with reading-order detection, table extraction, and bounding-box citations.

Freemium· Free (Apache 2.0); enterprise tier for PDF/UA export and visual editorpdf-parsingrag-preprocessing

PageIndex

RAG

Vectorless reasoning-based retrieval for long documents, with traceable, auditable answers.

Freemium· Free Try Now tier; enterprise pricing on requestdocument-qalong-pdf-retrieval

Pathway

RAG · Multi-model

Live data framework for production RAG and streaming ETL pipelines in Python.

Freemium· Community free (BSL 1.1, 8GB/4 cores); Scale and Enterprise tiers with license keylive-ragstreaming-etl

Perplexity AI

RAG · Multi-model (Sonar, GPT-4 class, Claude, Gemini)

Conversational answer engine that cites its sources by default.

Freemium· Free tier; Pro $20/mo or $200/yr; Enterprise from $40/user/mo; Sonar API usage-basedai-searchresearch

PostgresML

RAG · Multi-model (Llama, Mistral, open-source embeddings)

PostgreSQL extension that runs embeddings, vector search, and LLM inference inside your database.

Freemium· Open-source self-host free; managed cloud usage-based with $100 free creditsvector-searchrag

PrivateGPT

RAG · Multi-model (BYO local LLM)

Production-ready, air-gapped RAG framework for querying your documents with local LLMs.

Freemium· OSS free; Zylon enterprise contract (contact sales)private-ragchat-with-documents