📖 The AI Tool Bible

AI tools tagged Supports Vision

37 tools matching this tag.model

All tags →

GPT-4o

Featured
Writing · GPT-4o
9.4

OpenAI's multimodal flagship behind ChatGPT.

Freemium· Free tier; Plus $20/mo; Pro $200/mogeneral writingsummarization

Pinecone

Featured
RAG · Hosted vector DB (not an LLM)
8.8

Managed vector database for production-scale similarity search.

Freemium· Free starter; serverless pay-as-you-go from $0.33/1M readsmanaged vector DBproduction RAG

OpenAI Fine-tuning

Fine-tuning · GPT-4o-mini / GPT-3.5
8.4

Fine-tune GPT-4o-mini and friends on your own data.

Paid· Training $25/1M tokens; inference at standard ratesstyleformat

AgentMail

Agents

API-first email provider purpose-built for autonomous AI agents.

Freemium· Free (3 inboxes); Developer $20/mo; Startup $200/mo; Enterprise customagent-emailinbox-provisioning

Apple Intelligence

Writing · Apple Foundation Models + Private Cloud Compute; optional ChatGPT

Apple's on-device AI layer baked into iOS, iPadOS, macOS, and visionOS with a rebuilt Siri, writing tools, and image generation.

Free· Free with supported Apple hardware; no subscriptionwriting-assistancesummarization

Brandmark

Image Generation · Undisclosed (proprietary generative AI + curated assets)

AI logo maker that spits out a full brand kit from a name and a few keywords.

Paid· Free previews; one-time purchase tiers (historically ~$25-$175)logo-designbrand-identity

CompassRank

Evaluation · Multi-model

Public leaderboard from the OpenCompass project ranking open and closed LLMs across 100+ benchmarks.

Free· Free leaderboard; OpenCompass toolkit is Apache 2.0 open sourcellm-benchmarkingmodel-selection

DeepSeek

Coding · DeepSeek R1, V3, V2, Coder V2, VL

Chinese AI lab shipping open-weight reasoning models that punch well above their API price.

Freemium· Free web chat; API priced per million tokens (significantly cheaper than GPT/Claude)reasoningcode-generation

Domino Data Lab

Agents · Multi-model

Enterprise AI platform for building, deploying, and governing models and agents at scale.

Enterprise· Contact sales; enterprise contracts onlyenterprise mlopsagentic ai

Edge Impulse

Fine-tuning · Multi-model (TF Lite Micro, custom DSP blocks)

End-to-end platform for training and deploying ML models on microcontrollers, sensors, and other edge hardware.

Freemium· Free developer tier; paid Professional and Enterprise plans (contact sales)edge-aitinyml

Forefront

Fine-tuning · Multi-model (Mistral-7B, Mixtral, Phi-2)

Fine-tune and serve open-source LLMs on your own data without managing GPUs.

Paid· Usage-based per token (e.g. Phi-2 $0.0006/1k, Mixtral $0.004/1k)fine-tuningopen-source-llms

Geniusrise

Agents · Multi-model

Open-source framework for building, deploying, and scaling AI microservices across text, vision, and audio.

Free· Free, open source; self-hostedinference-servingfine-tuning

Genmo

Video · Mochi 1

Open-source text-to-video model (Mochi 1) with a hosted playground for turning prompts into short clips.

Freemium· Free playground; open weights available; paid tiers not clearly disclosedtext-to-videogenerative-video

Grok

Writing · Grok (xAI)

xAI's conversational assistant with real-time X integration and a distinctly less-filtered personality.

Freemium· Free tier on X; X Premium from ~$8/mo; SuperGrok and API pay-as-you-goconversational-aireal-time-search

Groq

Coding · Multi-model (Llama, Mixtral, Gemma, Qwen, Whisper)

Custom-silicon LPU inference platform serving open models at GPU-trouncing latency via an OpenAI-compatible API.

Freemium· Free API key with rate limits; per-token paid tiers; enterprise contractslow-latency inferencevoice agents

Headroom

Agents · Model-agnostic (Anthropic, OpenAI, Vertex, Bedrock, Azure, 100+ via LiteLLM)

Open-source context compression layer that strips 70-95% of boilerplate before it hits your LLM.

Free· Apache 2.0 open source; free for commercial usetoken-compressionagent-context

Hermes Agent

Agents · Multi-model (300+ via Nous Portal)

Open-source multi-channel AI agent with persistent memory, scheduling, and subagent delegation from Nous Research.

Freemium· MIT-licensed self-host free; managed Portal tiers Free/Plus/Super/Ultra with creditspersonal-assistanttask-automation

LangFast

Evaluation · Multi-model

No-signup LLM playground for testing, comparing, and versioning prompts against your own API keys.

Paid· One-time lifetime ~$60-$120; 14-day money-backprompt-testingprompt-versioning

Llama

Fine-tuning · Llama 4 (Maverick, Scout), Llama 3.3/3.2/3.1

Meta's open-weight LLM family covering 1B mobile models up to 405B frontier and natively multimodal 10M-context Llama 4 variants.

Freemium· Weights free under Llama Community License; partner API inference ~$0.19-$0.49 per 1M tokensself-hosted-llmfine-tuning

LooksMax AI

Image Generation · Proprietary computer vision model

AI-powered facial attractiveness analyzer that rates your looks and suggests cosmetic improvements.

Freemium· Free basic rating; paid plans for detailed analysisface-analysisattractiveness-rating

MineContext

Agents · Doubao-Seed-1.6-flash (default); OpenAI-compatible

Open-source desktop agent that watches your screen and proactively surfaces summaries, todos, and notes.

Free· Free and open-source (Apache 2.0); bring your own model API keysscreen-context-capturedaily-summaries

OpenSandbox

Agents · Model-agnostic

Open-source sandbox infrastructure for running AI-generated code, agents, and browsers in isolated Docker or Kubernetes environments.

Free· Open source (Apache 2.0); managed pricing not disclosedcode-executionagent-sandboxing

PageAgent

Agents · Bring-your-own (Qwen, GPT, Claude, etc.)

An in-page JavaScript GUI agent that drives web interfaces with natural language, no headless browser required.

Free· Free, MIT-licensed; LLM costs are whatever provider you wire inweb-automationai-copilots

PyGPT

Writing · Multi-model (GPT-5, Claude, Gemini, Grok, DeepSeek, Mistral, Ollama)

Open-source desktop AI assistant that wires every major LLM provider into one local app with agents, vision, and a code interpreter.

Free· Free and open-source (MIT); bring your own provider API keysdesktop-ai-assistantchat-with-files

Quix

Agents

Agentic AI platform that generates adaptive test plans for hardware engineering and manufacturing teams.

Enterprise· Contact sales (book a demo)hardware-testingmanufacturing-analytics

Qwen

Writing · Qwen3 / Qwen-Image / Qwen-MT / Qwen3Guard

Alibaba's open-weight foundation model family covering chat, vision, image generation, translation, and safety classification.

Freemium· Open weights free; hosted API priced per-token via Alibaba Cloud DashScopechatreasoning

Qwen Chat

Writing · Qwen3 family (Qwen3-Max, Qwen-VL, Qwen-Coder, Qwen-Image)

Alibaba's flagship chatbot fronting the Qwen family of open-weight LLMs, with vision, code, and image generation in one UI.

Free· Free with sign-in; paid API access via Alibaba Cloud DashScopechatreasoning

Rivestack

RAG · OpenAI embeddings (auto-embeddings)

Managed Postgres with pgvector on dedicated NVMe, pitched as a cheaper RAG backend than Pinecone or Supabase.

Freemium· Free shared tier; Solo $15/mo, Starter $35, Growth $59, Scale $99 (EU Central)rag-backendvector-search

Soundful

Audio · Proprietary (human-aided AI)

Template-driven AI music generator that spits out royalty-free, commercially licensable tracks in seconds.

Freemium· Free tier; Plus/Pro/Business monthly per-user; Enterprise on requestbackground-musiccontent-creator-audio

StarOps

Coding · Multi-model

AI-native platform engineering engine that provisions and manages cloud infrastructure from natural-language prompts.

Freemium· Free tier; paid from $199/mo; custom enterpriseinfrastructure-automationkubernetes-management

Together AI Fine-tuning

Fine-tuning · Multi-model (any Hugging Face open-source model)

Managed fine-tuning platform for open-source LLMs and vision models with LoRA, full fine-tuning, and RL support.

Paid· Usage-based; cost estimator in-product, no public price listllm-fine-tuningvision-fine-tuning

Unsloth

Fine-tuning · Llama, Mistral, Gemma, Qwen, GLM (multi-model)

Open-source LLM fine-tuning toolkit with custom kernels that train 2-30x faster and use up to 90% less VRAM.

Freemium· Free open-source; Pro and Enterprise contact saleslora-finetuningqlora

Velda

Fine-tuning

Serverless GPU orchestration that runs AI training and batch jobs without Docker or Kubernetes.

Freemium· Free monthly credits on Velda Cloud; Enterprise contact salesdistributed-trainingbatch-inference

Veritone aiWARE

Agents · Multi-model

Enterprise AI operating system that orchestrates hundreds of cognitive engines through low-code workflows.

Enterprise· Contact salesworkflow-automationmedia-intelligence

VisualWebArena

Evaluation · Model-agnostic (GPT-4V, Gemini, Claude, open VLMs)

Open benchmark for evaluating multimodal web agents on realistic visual browsing tasks.

Free· Free and open source (MIT-style research release)multimodal-agent-evalweb-browsing-benchmark

WhisperAPI

Audio · OpenAI Whisper

Hosted OpenAI Whisper transcription with a pay-as-you-go API and drop-in web dashboard.

Paid· Pay-as-you-go credits; $5 for 20 credits, down to ~$0.10/credit in bulkaudio-transcriptionvideo-subtitles

editGPT

Writing · ChatGPT (GPT-4 / GPT-3.5)

Browser extension that turns ChatGPT into a tracked-changes proofreader for your documents.

Freemium· Free tier; Premium around $9.99/moproofreadingcopy-editing