📖 The AI Tool Bible

AI tools tagged Supports Ollama

42 tools matching this tag.model

All tags →

Continue

Coding · BYO (any OpenAI-compatible API + Ollama for local)
7.9

Open-source, self-hostable VS Code/JetBrains AI assistant.

Free· Free / open-source; you pay model costsself-hostedopen source

AgenticSeek

Agents · Bring-your-own local LLM (Ollama / llama.cpp compatible)

Open-source local-first AI agent that browses the web, writes code, and runs tasks without sending anything to the cloud.

Free· Free and open source; optional SerpApi key for enhanced searchlocal-ai-agentautonomous-web-browsing

AnythingLLM

RAG · Multi-model

Open-source desktop and self-hosted app that turns your documents into a private chat-and-agent workspace.

Freemium· Desktop free (MIT); self-host free; cloud paid plansdocument-chatprivate-rag

AstrBot

Agents · Multi-model (OpenAI, Anthropic, Gemini, DeepSeek, Ollama, Dify, Coze)

Open-source agentic AI assistant that bridges chat platforms like Telegram, Discord, and QQ with any LLM and a 1000+ plugin ecosystem.

Free· Free, open-source (AGPL-3.0); self-hosted, you pay your own LLM API costs.chatbotgroup-chat-assistant

Browser Use

Agents · Multi-model (BYO LLM; Claude in hosted Box)

Open-source browser automation harness and cloud platform for LLM agents that drive real websites.

Freemium· Free open-source library; cloud usage-based with free tier and enterprise plansweb-automationscraping

Browser Use Web UI

Agents · Multi-model

Gradio web UI for running browser-use AI agents in a real or persistent Chrome session.

Free· Free, open-source; bring your own LLM API keysbrowser-automationweb-agents

Cherry Studio

Agents · Multi-model

Open-source desktop AI client that wires 300+ LLMs into one chat, knowledge-base, and agent workspace.

Free· Free and open source; bring your own API keysmulti-model chatlocal knowledge base

GPTLocalhost

Writing · Bring-your-own (Ollama, LM Studio, llama.cpp, Foundry Local, etc.)

Run local LLMs directly inside Microsoft Word without sending text to the cloud.

Freemium· Free tier (512-char limit); paid monthly subscription and lifetime license availableprivate draftingoffline writing assistant

Hermes One

Agents · Multi-model (BYO via OpenRouter/OpenAI/Anthropic/Gemini/xAI/Ollama)

Open-source desktop AI agent with a self-improving learning loop and multi-platform messaging connectors.

Free· Free, MIT-licensed; you pay your own model inference costsautonomous-agentschat-ops

IntelliBar

Writing · Multi-model (GPT-4o, Claude 3.5, Gemini, o1, Llama, DeepSeek, Ollama)

Native macOS menu-bar client that talks to every major LLM with your own API keys.

Freemium· Free evaluation + one-time license; you pay model providers directly via your own API keysmulti-model-chatmenu-bar-assistant

Jan

Writing · Multi-model (local open-weights + OpenAI/Claude/Gemini via API)

Open-source desktop ChatGPT alternative that runs local LLMs and routes to cloud providers from one app.

Free· Free and open source; bring-your-own keys for cloud modelslocal-llm-chatprivate-ai-assistant

KNIME

Agents · Multi-model (OpenAI, Anthropic, Gemini, Ollama)

Visual node-based data science platform with built-in connectors for OpenAI, Anthropic, Gemini, and local LLMs.

Freemium· Free open-source desktop; Team and Business Hub plans paiddata-pipelinesllm-workflows

Kiln AI

Evaluation · Multi-model

Open-source workbench for building, evaluating, and fine-tuning AI agents across 190+ models.

Freemium· Free Individual tier; Team (request access); Enterprise (custom)llm-evaluationfine-tuning

Kilo Code

Coding · Multi-model (500+ via BYO keys or routing)

Open-source agentic coding assistant for VS Code, JetBrains, and the terminal with bring-your-own-key routing across 500+ models.

Freemium· Free tier; Kilo Pass subscription; BYO-keys with zero markupai-pair-programmingcode-review

Kotaemon

RAG · Multi-model (OpenAI, LlamaCPP, any OpenAI-compatible endpoint)

Open-source RAG UI for chatting with your own documents, locally or self-hosted.

Free· Free, open-source (MIT-style); self-hosted infrastructure costs onlydocument-qaprivate-rag

LLM by Datasette

Coding · Multi-model

A CLI and Python library for running prompts against any LLM provider and logging everything to SQLite.

Free· Free and open source (Apache 2.0); pay underlying model providers separatelycli-promptingprompt-logging

LLaMA Factory

Fine-tuning · Multi-model (LLaMA, Mistral, Qwen, Gemma, Phi, LLaVA, ChatGLM, Yi)

Open-source, no-code WebUI for fine-tuning 100+ open LLMs with LoRA, QLoRA, DPO, and PPO.

Free· Free, open-source (Apache-2.0); self-hostedlora-fine-tuningqlora

LM Studio

Agents · Multi-model (gpt-oss, Qwen3, Gemma, DeepSeek-R1, Llama, others)

Desktop app for discovering, downloading, and running open-weight LLMs locally with an OpenAI-compatible server.

Freemium· Free for personal and commercial use; paid LM Studio for Work / Enterprise tierlocal-llm-inferenceprivate-chat

LangExtract

RAG · Multi-model (Gemini, GPT-4/4o, Ollama-hosted local models)

Google's open-source Python library for LLM-driven structured extraction from unstructured text, with source-grounded outputs.

Free· Library is free (Apache-2.0); LLM API costs depend on chosen backendstructured-extractiondocument-parsing

Langchain-Chatchat

RAG · Multi-model (GLM-4, Qwen2, Llama 3, etc. via Xinference/Ollama/LocalAI/FastChat)

Self-hostable RAG and agent framework that wires LangChain to any local open-source LLM and a knowledge base.

Free· Apache-2.0 open source; self-hosted, infra costs onlyprivate-knowledge-baseoffline-rag

LibreChat

Writing · Multi-model (OpenAI, Anthropic, Google, AWS Bedrock, Azure, Ollama, and others)

Open-source, self-hostable ChatGPT-style frontend that brings every major LLM provider under one roof.

Free· Free and open source; self-hosted (you pay model providers for API usage)multi-model chatself-hosted chatgpt

Llama

Fine-tuning · Llama 4 (Maverick, Scout), Llama 3.3/3.2/3.1

Meta's open-weight LLM family covering 1B mobile models up to 405B frontier and natively multimodal 10M-context Llama 4 variants.

Freemium· Weights free under Llama Community License; partner API inference ~$0.19-$0.49 per 1M tokensself-hosted-llmfine-tuning

Llama 3

Writing · Llama 3 / 3.1 (8B, 70B, 405B)

Meta's open-weights LLM family that put serious frontier-adjacent models in everyone's hands.

Free· Weights free under Meta Llama Community License; inference cost via self-hosting or 3rd-party providerschatlong-context reasoning

Manifest

Agents · Multi-model

Open-source LLM router that fans your agent traffic across providers and your existing AI subscriptions.

Freemium· Open-source self-host is free; managed cloud in early accessllm-routingcost-control

Msty

Writing · Multi-model (Ollama, Claude, GPT, Gemini, others)

Privacy-first desktop AI workspace that runs local and cloud models side by side.

Freemium· Free Studio tier; paid upgrades for Studio and Claw (see in-app pricing)local-llm-chatmulti-model-comparison

NovelCrafter

Writing · Multi-model (BYO key: OpenAI, Anthropic, Gemini, Mistral, OpenRouter, Ollama)

Bring-your-own-key novel-writing workspace with a wiki-style codex and multi-model AI assistance.

Freemium· From $4/mo; 21-day free trial (no credit card); BYO AI API keysnovel-writingworldbuilding

Ollama

Coding · Multi-model (Llama, Qwen, Gemma, DeepSeek, Mistral, Phi, etc.)

The de facto runtime for running open-weights LLMs locally, now with a paid cloud tier for bigger models.

Freemium· Free local; Pro $20/mo; Max $100/molocal-llmself-hosted-inference

PandasAI

Coding · Multi-model (via LiteLLM)

Conversational data analysis library that turns natural-language questions into pandas, SQL and chart code.

Freemium· OSS library free (MIT); managed cloud and enterprise self-hosted are contact-salesdata-analysisnatural-language-sql

Pathway

RAG · Multi-model

Live data framework for production RAG and streaming ETL pipelines in Python.

Freemium· Community free (BSL 1.1, 8GB/4 cores); Scale and Enterprise tiers with license keylive-ragstreaming-etl

Pieces

Coding · Multi-model (BYO OpenAI, Anthropic, Gemini, Ollama)

On-device long-term memory layer that feeds your last nine months of work context into any LLM or IDE assistant.

Freemium· Individual free; Teams contact salesdeveloper-memorycode-snippets

PrivateGPT

RAG · Multi-model (BYO local LLM)

Production-ready, air-gapped RAG framework for querying your documents with local LLMs.

Freemium· OSS free; Zylon enterprise contract (contact sales)private-ragchat-with-documents

PyGPT

Writing · Multi-model (GPT-5, Claude, Gemini, Grok, DeepSeek, Mistral, Ollama)

Open-source desktop AI assistant that wires every major LLM provider into one local app with agents, vision, and a code interpreter.

Free· Free and open-source (MIT); bring your own provider API keysdesktop-ai-assistantchat-with-files

PySpur

Agents · Multi-model

Open-source agent builder with a drag-and-drop canvas, Python escape hatch, and a built-in test harness.

Freemium· Open-source (Apache 2.0); managed Cloud coming soonagent-orchestrationagent-evaluation

Qwen

Writing · Qwen3 / Qwen-Image / Qwen-MT / Qwen3Guard

Alibaba's open-weight foundation model family covering chat, vision, image generation, translation, and safety classification.

Freemium· Open weights free; hosted API priced per-token via Alibaba Cloud DashScopechatreasoning

Skyvern

Agents · Multi-model (OpenAI, Anthropic, Gemini, Ollama)

AI browser agent that automates web workflows from natural-language instructions, with CAPTCHA and 2FA handling built in.

Freemium· Free tier with 5,000 monthly credits; paid and enterprise plans availablebrowser-automationdata-extraction

TypingMind

Writing · Multi-model (GPT, Claude, Gemini, Mistral, DeepSeek, local)

BYOK chat frontend that puts GPT, Claude, Gemini, and local models behind one polished UI.

Paid· One-time license from ~$39; Custom/Team self-host plans highermulti-model-chatprompt-library

Unsloth

Fine-tuning · Llama, Mistral, Gemma, Qwen, GLM (multi-model)

Open-source LLM fine-tuning toolkit with custom kernels that train 2-30x faster and use up to 90% less VRAM.

Freemium· Free open-source; Pro and Enterprise contact saleslora-finetuningqlora

Vanna.ai

RAG · Multi-model (Anthropic, OpenAI, Gemini, Ollama)

Open-source text-to-SQL agent that learns your schema and writes queries against your real warehouse.

Freemium· Open-source free; paid cloud tier for hosted admin featurestext-to-sqlnatural-language-bi

WeKnora

RAG · Multi-model

Tencent's open-source RAG framework that turns raw documents into a queryable knowledge base, ReAct agent, and self-maintaining wiki.

Free· Free, open-source (self-hosted)document-qaenterprise-knowledge-base

llmfit

Evaluation · Multi-model

Terminal tool that scores hundreds of open LLMs against your actual CPU, RAM, and GPU and tells you which ones will run well.

Free· Free, MIT-licensedlocal-llm-selectionhardware-benchmarking

n8n

Agents · Multi-model

Source-available workflow automation with first-class AI-agent and RAG building blocks.

Freemium· Free self-host; Cloud Starter ~$24/mo; Enterprise contact salesai-agentsworkflow-automation

oMLX

Coding · Multi-model (Qwen, Llama, Mistral, Gemma, DeepSeek, MiniMax, GLM)

Native macOS LLM inference server built on MLX, with paged SSD KV caching for Apple Silicon agents.

Free· Free, Apache 2.0 open sourcelocal-llm-inferencecoding-agents