Best AI tools for retrievers
48 tools in the RAG category, filtered to retrievers.
Pinecone
FeaturedManaged vector database for production-scale similarity search.
LlamaIndex
FeaturedData framework for connecting LLMs to your data.
Weaviate
Open-source vector DB with hybrid search and modules.
LangChain
The broad LLM application framework — chains, agents, retrievers.
Vespa
Yahoo's open-source search engine with vector + sparse retrieval.
Chroma
Embedded, developer-friendly vector store for Python.
Agentset
Production-ready RAG infrastructure with agentic search, citations, and model-agnostic plumbing.
AnythingLLM
Open-source desktop and self-hosted app that turns your documents into a private chat-and-agent workspace.
BGE (BAAI General Embedding)
Open-source embedding and reranker models from BAAI that anchor a huge share of production RAG stacks.
Chat With PDF by Copilot.us
Conversational PDF Q&A bundled into a multi-app productivity membership.
ChatPDF
Conversational Q&A over PDFs and other documents with citation-backed answers.
CocoIndex
Open-source incremental data framework that keeps RAG indexes and agent context continuously fresh.
Cognee
Open-source graph-memory layer that gives AI agents persistent, queryable context across sessions.
Cohere
Enterprise-grade LLM platform built for private, secure, and customizable deployment.
Context Data
Enterprise data platform for deploying private RAG pipelines without infrastructure plumbing.
Cosmos
AI-powered archive search and reel curation for video production companies.
Cube
Semantic layer that grounds LLM agents in your real business metrics instead of letting them hallucinate SQL.
Databricks Vector Search
Managed hybrid vector search that lives inside the Databricks lakehouse and auto-syncs with your source tables.
DeepSearcher
Open-source agentic RAG framework for private enterprise data, built by the Zilliz/Milvus team.
Elicit
AI research assistant that searches, screens, and extracts data from 138M+ academic papers at scale.
Emergent Mind
AI-curated arXiv discovery layer that summarizes frontier papers and aggregates social discussion around them.
Epsilla
Agent-as-a-Service platform with managed RAG and a no-code builder for vertical enterprise AI.
Exa
Web search API built for AI agents, with structured outputs and token-efficient highlights.
Explainpaper
AI reading companion that decodes dense academic papers by highlighting and chatting with the PDF.
Feast
Open-source feature store that serves consistent features to ML training and online inference, with RAG vector search built in.
FinChat (Fiscal.ai)
AI copilot for equity research that reads filings, transcripts, and KPI tables across 100,000+ public companies.
Firecrawl
Web scraping and crawling API that returns LLM-ready markdown, JSON, or structured data from any URL.
FutureHouse Platform
Multi-agent AI research stack for scientists, with retrieval over 175M+ papers, patents, and trials.
GaliChat
No-code AI chatbot builder that trains on your website content for support and lead capture.
Genei
AI research assistant that summarizes PDFs and web pages and answers questions across your document library.
Graphify
Open-source on-device knowledge graph engine that turns code, docs, papers, meetings and images into a queryable graph.
Graphiti
Open-source temporal knowledge graph framework for building agent memory that updates in real time.
Haystack
Open-source Python framework from deepset for building production RAG pipelines and LLM agents.
HelixDB
Unified graph-and-vector database built for AI agent memory and GraphRAG.
Humata.ai
Chat-with-your-documents RAG tool with citation-backed answers across uploaded PDFs and files.
Kotaemon
Open-source RAG UI for chatting with your own documents, locally or self-hosted.
LanceDB
Open-source multimodal lakehouse and vector database built for AI training and retrieval at petabyte scale.
LangExtract
Google's open-source Python library for LLM-driven structured extraction from unstructured text, with source-grounded outputs.
Langchain-Chatchat
Self-hostable RAG and agent framework that wires LangChain to any local open-source LLM and a knowledge base.
MaxKB
Open-source enterprise RAG and agent platform with built-in workflow engine and multi-LLM support.
NotebookLM
Google's source-grounded research notebook that turns your documents into chats, briefs, and AI-hosted podcasts.
OneKE
Open-source multi-agent framework for schema-guided knowledge extraction from documents.
OpenDataLoader PDF
Open-source PDF parser built for RAG pipelines, with reading-order detection, table extraction, and bounding-box citations.
PageIndex
Vectorless reasoning-based retrieval for long documents, with traceable, auditable answers.
Pathway
Live data framework for production RAG and streaming ETL pipelines in Python.
Perplexity AI
Conversational answer engine that cites its sources by default.
PostgresML
PostgreSQL extension that runs embeddings, vector search, and LLM inference inside your database.
PrivateGPT
Production-ready, air-gapped RAG framework for querying your documents with local LLMs.