📖 The AI Tool Bible

Best AI tools for retrievers

48 tools in the RAG category, filtered to retrievers.

Pinecone

Featured
RAG · Hosted vector DB (not an LLM)
8.8

Managed vector database for production-scale similarity search.

Freemium · Free starter; serverless pay-as-you-go from $0.33/1M reads · managed vector DB · production RAG

LlamaIndex

Featured
RAG · BYO (Claude / GPT / open)
8.7

Data framework for connecting LLMs to your data.

Freemium · Free open-source; LlamaCloud paid · RAG · data ingestion

Weaviate

RAG · Hosted vector DB (not an LLM)
8.4

Open-source vector DB with hybrid search and modules.

Freemium · Free open-source; cloud from $25/mo · self-hosted RAG · hybrid search

LangChain

RAG · BYO (any major LLM)
8.3

The broad LLM application framework: chains, agents, retrievers.

Freemium · Free open-source; LangSmith paid · general LLM apps · RAG

Vespa

RAG · Hosted search engine (not an LLM)
8.2

Yahoo's open-source search engine with vector + sparse retrieval.

Freemium · Free open-source; Vespa Cloud paid · large-scale search · ranking

Chroma

RAG · Hosted vector DB (not an LLM)
8.1

Embedded, developer-friendly vector store for Python.

Freemium · Free open-source; Chroma Cloud paid · prototyping · embedded RAG

Agentset

RAG · Multi-model (Claude, OpenAI, Google, xAI, Cohere, Mistral, DeepSeek)

Production-ready RAG infrastructure with agentic search, citations, and model-agnostic plumbing.

Freemium · Free 1K pages/10K retrievals; Pro $49/mo + $0.01/page; Enterprise custom · document-qa · agentic-search

AnythingLLM

RAG · Multi-model

Open-source desktop and self-hosted app that turns your documents into a private chat-and-agent workspace.

Freemium · Desktop free (MIT); self-host free; cloud paid plans · document-chat · private-rag

BGE (BAAI General Embedding)

RAG · BGE / bge-m3 / bge-reranker

Open-source embedding and reranker models from BAAI that anchor a huge share of production RAG stacks.

Free · Free, open-source (MIT-style license); self-hosted inference cost only · semantic-search · rag-retrieval

Chat With PDF by Copilot.us

RAG

Conversational PDF Q&A bundled into a multi-app productivity membership.

Freemium · 7-day free trial; paid membership covers full app suite · pdf-qa · document-summarization

ChatPDF

RAG · GPT-4o / GPT-4o-mini

Conversational Q&A over PDFs and other documents with citation-backed answers.

Freemium · Free: 2 PDFs/day; Plus plan for unlimited · pdf-qa · document-summarization

CocoIndex

RAG · Bring-your-own (embeddings + LLM)

Open-source incremental data framework that keeps RAG indexes and agent context continuously fresh.

Free · Open-source, self-hosted; bring your own infra · code-indexing · rag-pipelines

Cognee

RAG · Multi-model (Claude, OpenAI, others)

Open-source graph-memory layer that gives AI agents persistent, queryable context across sessions.

Freemium · Hobby free (1M tokens/mo); Growth $5/workspace/mo + token usage; Enterprise custom · agent-memory · knowledge-graphs

Cohere

RAG · Command, Embed, Rerank, Transcribe (proprietary)

Enterprise-grade LLM platform built for private, secure, and customizable deployment.

Enterprise · Free trial API keys; production via usage-based API pricing or enterprise contracts · enterprise-rag · semantic-search

Context Data

RAG · Multi-model

Enterprise data platform for deploying private RAG pipelines without infrastructure plumbing.

Enterprise · Contact sales · enterprise-rag · document-search

Cosmos

RAG

AI-powered archive search and reel curation for video production companies.

Enterprise · Contact sales · video archive search · sales enablement

Cube

RAG · Multi-model

Semantic layer that grounds LLM agents in your real business metrics instead of letting them hallucinate SQL.

Freemium · Cube Core open source; Cube Cloud paid, contact sales · semantic-layer · embedded-analytics

Databricks Vector Search

RAG · Multi-model (BYO embeddings or Databricks-hosted)

Managed hybrid vector search that lives inside the Databricks lakehouse and auto-syncs with your source tables.

Enterprise · Consumption-based via Databricks; free trial available · rag-retrieval · hybrid-search

DeepSearcher

RAG · Multi-model (DeepSeek, OpenAI o1/o3-mini, Claude, Llama, others)

Open-source agentic RAG framework for private enterprise data, built by the Zilliz/Milvus team.

Free · Free, Apache 2.0; bring your own LLM and vector DB costs · enterprise-rag · agentic-search

Elicit

RAG · Claude Opus 4.5

AI research assistant that searches, screens, and extracts data from 138M+ academic papers at scale.

Freemium · Free tier; paid Plus, Pro, and Enterprise plans · literature-review · systematic-review

Emergent Mind

RAG · Undisclosed

AI-curated arXiv discovery layer that summarizes frontier papers and aggregates social discussion around them.

Freemium · Free Basic; Pro $10/mo annual ($12 mo); Max $25/mo annual ($30 mo) · arxiv-discovery · paper-summarization

Epsilla

RAG · Multi-model

Agent-as-a-Service platform with managed RAG and a no-code builder for vertical enterprise AI.

Freemium · Free; Starter $29/mo; Professional $249/mo; AI Concierge $2,499/mo; Enterprise custom · enterprise-rag · ai-agents

Exa

RAG · Proprietary neural + keyword search

Web search API built for AI agents, with structured outputs and token-efficient highlights.

Freemium · Free playground; paid usage-based plans; enterprise on request · agent-web-search · rag-retrieval

Explainpaper

RAG · Undisclosed (tiered basic vs. advanced)

AI reading companion that decodes dense academic papers by highlighting and chatting with the PDF.

Freemium · Free; Pro $16/mo with 7-day trial · paper-reading · research-summaries

Feast

RAG

Open-source feature store that serves consistent features to ML training and online inference, with RAG vector search built in.

Free · Free, open source (Apache 2.0); self-hosted · feature-store · rag-retrieval

FinChat (Fiscal.ai)

RAG · Multi-model (proprietary finance-tuned copilot)

AI copilot for equity research that reads filings, transcripts, and KPI tables across 100,000+ public companies.

Freemium · Free; Pro $39/mo (annual) or $49/mo; Max and Enterprise API tiers above · equity-research · earnings-call-analysis

Firecrawl

RAG

Web scraping and crawling API that returns LLM-ready markdown, JSON, or structured data from any URL.

Freemium · Free 1,000 credits/mo; paid Hobby/Standard/Growth tiers; Scale/Enterprise annual · web-scraping · rag-ingestion

FutureHouse Platform

RAG · Multi-model

Multi-agent AI research stack for scientists, with retrieval over 175M+ papers, patents, and trials.

Freemium · Free tier for academics; paid plans for higher rate limits · scientific-literature-search · autonomous-research-agent

GaliChat

RAG

No-code AI chatbot builder that trains on your website content for support and lead capture.

Freemium · Free tier (no credit card); paid plans for advanced features · customer-support · lead-generation

Genei

RAG · GPT-3 (per public site)

AI research assistant that summarizes PDFs and web pages and answers questions across your document library.

Freemium · 14-day free trial; Basic £3.99/mo; Pro £15.99/mo · pdf-summarization · research-assistant

Graphify

RAG · Multi-model

Open-source on-device knowledge graph engine that turns code, docs, papers, meetings and images into a queryable graph.

Free · MIT-licensed, free forever; cloud tier hinted but unpriced (waitlist) · knowledge-graph · code-search

Graphiti

RAG · Multi-model

Open-source temporal knowledge graph framework for building agent memory that updates in real time.

Freemium · Open-source (Apache 2.0); managed Zep Cloud sold separately · agent-memory · temporal-knowledge-graphs

Haystack

RAG · Multi-model

Open-source Python framework from deepset for building production RAG pipelines and LLM agents.

Freemium · Open-source free; deepset Enterprise Support and AI Platform via sales · rag · agents

HelixDB

RAG

Unified graph-and-vector database built for AI agent memory and GraphRAG.

Freemium · Open-source core; managed cloud pricing on request · agent-memory · graphrag

Humata.ai

RAG · Multi-model

Chat-with-your-documents RAG tool with citation-backed answers across uploaded PDFs and files.

Freemium · Free (60 pages); Expert $9.99/mo; Team $49/user/mo; Enterprise on request · document-qa · research-summarization

Kotaemon

RAG · Multi-model (OpenAI, LlamaCPP, any OpenAI-compatible endpoint)

Open-source RAG UI for chatting with your own documents, locally or self-hosted.

Free · Free, open-source (MIT-style); self-hosted infrastructure costs only · document-qa · private-rag

LanceDB

RAG

Open-source multimodal lakehouse and vector database built for AI training and retrieval at petabyte scale.

Freemium · Open-source free; LanceDB Cloud and Enterprise via contact sales · vector-search · rag

LangExtract

RAG · Multi-model (Gemini, GPT-4/4o, Ollama-hosted local models)

Google's open-source Python library for LLM-driven structured extraction from unstructured text, with source-grounded outputs.

Free · Library is free (Apache-2.0); LLM API costs depend on chosen backend · structured-extraction · document-parsing

Langchain-Chatchat

RAG · Multi-model (GLM-4, Qwen2, Llama 3, etc. via Xinference/Ollama/LocalAI/FastChat)

Self-hostable RAG and agent framework that wires LangChain to any local open-source LLM and a knowledge base.

Free · Apache-2.0 open source; self-hosted, infra costs only · private-knowledge-base · offline-rag

MaxKB

RAG · Multi-model

Open-source enterprise RAG and agent platform with built-in workflow engine and multi-LLM support.

Freemium · Community edition free (GPLv3); paid enterprise edition · enterprise-knowledge-base · customer-support-bots

NotebookLM

RAG · Gemini 2.5

Google's source-grounded research notebook that turns your documents into chats, briefs, and AI-hosted podcasts.

Freemium · Free tier; Plus via Google One AI Premium ($19.99/mo) or Workspace add-on · document Q&A · research synthesis

OneKE

RAG · Multi-model (OneKE-13B, LLaMA3, Qwen2.5, GPT, DeepSeek-R1)

Open-source multi-agent framework for schema-guided knowledge extraction from documents.

Free · Free, MIT-licensed; you pay for LLM API calls or self-hosted compute · knowledge-graph-construction · named-entity-recognition

OpenDataLoader PDF

RAG

Open-source PDF parser built for RAG pipelines, with reading-order detection, table extraction, and bounding-box citations.

Freemium · Free (Apache 2.0); enterprise tier for PDF/UA export and visual editor · pdf-parsing · rag-preprocessing

PageIndex

RAG

Vectorless reasoning-based retrieval for long documents, with traceable, auditable answers.

Freemium · Free Try Now tier; enterprise pricing on request · document-qa · long-pdf-retrieval

Pathway

RAG · Multi-model

Live data framework for production RAG and streaming ETL pipelines in Python.

Freemium · Community free (BSL 1.1, 8GB/4 cores); Scale and Enterprise tiers with license key · live-rag · streaming-etl

Perplexity AI

RAG · Multi-model (Sonar, GPT-4 class, Claude, Gemini)

Conversational answer engine that cites its sources by default.

Freemium · Free tier; Pro $20/mo or $200/yr; Enterprise from $40/user/mo; Sonar API usage-based · ai-search · research

PostgresML

RAG · Multi-model (Llama, Mistral, open-source embeddings)

PostgreSQL extension that runs embeddings, vector search, and LLM inference inside your database.

Freemium · Open-source self-host free; managed cloud usage-based with $100 free credits · vector-search · rag

PrivateGPT

RAG · Multi-model (BYO local LLM)

Production-ready, air-gapped RAG framework for querying your documents with local LLMs.

Freemium · OSS free; Zylon enterprise contract (contact sales) · private-rag · chat-with-documents