📖 The AI Tool Bible

AI tools tagged For Data Scientists

48 tools matching this tag.audience

All tags →

Gemini Advanced

Writing · Gemini 2.5 Pro
9.0

Google's flagship — strong at math, long context, and Workspace integration.

Paid· $20/mo via Google One AI PremiumWorkspace integrationmath

Agent Lightning

Agents · Multi-model

Microsoft's open-source trainer that fine-tunes AI agents with RL and prompt optimization, framework-agnostic.

Free· Free, MIT-licensed open sourceagent-trainingreinforcement-learning

Amazon SageMaker

Agents · Multi-model

AWS's end-to-end platform for building, training, and deploying machine learning models and AI agents at enterprise scale.

Paid· Pay-as-you-go; free tier available for new AWS accountsmodel-trainingmodel-deployment

Athina AI

Evaluation · Multi-model

Collaborative LLM evaluation and observability platform for teams shipping AI features to production.

Freemium· Starter free (10k logs/mo); Pro & Enterprise customllm-evaluationprompt-management

Beam

Coding

Serverless GPU infrastructure for AI workloads with sub-second cold starts and bring-your-own-cloud support.

Freemium· $30 free credit refreshed monthly; usage-based beyond thatgpu-inferenceagent-sandboxes

BentoML

Agents · Multi-model

Open-source framework and managed platform for serving and scaling AI models in production.

Freemium· OSS free (Apache 2.0); managed Bento cloud has free tier + usage-based pricingmodel-servingllm-inference

CAMEL-AI

Agents · Multi-model

Open-source Python framework for building multi-agent systems and synthetic data pipelines.

Free· Free, open-source; pay for the underlying LLM API callsmulti-agent-systemssynthetic-data-generation

Chassis

Agents

Open-source tool that auto-packages ML models into production-ready Docker containers with a prediction API.

Free· Free, open source (Apache-style community project)model-packagingedge-deployment

Count

Agents · Multi-model (Anthropic, OpenAI, Google)

Collaborative AI-powered data canvas that blends SQL, Python, and natural-language agents for team analytics.

Freemium· Free; Pro $49/editor/mo; Scale $69/editor/mo (15-seat min); Enterprise customcollaborative-analyticsai-data-exploration

CustomPod

Audio

Turns your chosen news sources, RSS feeds, and inboxes into a personalized daily AI podcast.

Freemium· Free tier (manual generation); Pro $4.99/mopersonal podcastnews briefing

DVC

Coding

Git-style version control for datasets, ML models, and experiment pipelines.

Free· Free and open source; lakeFS Enterprise available for large-scale deploymentsdata-versioningml-experiment-tracking

DagsHub

Fine-tuning

GitHub-style collaboration platform for ML datasets, experiments, and models with MLflow and DVC under the hood.

Freemium· Free Individual tier; Team $99-$119/user/mo; Enterprise customexperiment-trackingdata-versioning

Dataiku

Agents · Multi-model (LLM Mesh: OpenAI, Anthropic, Bedrock, Vertex, OSS)

Enterprise AI platform unifying data, ML, LLMs, and agents under one governed workflow.

Enterprise· Free Community edition + Cloud trial; paid tiers quote-basedenterprise-aiagent-orchestration

Emergent Mind

RAG · Undisclosed

AI-curated arXiv discovery layer that summarizes frontier papers and aggregates social discussion around them.

Freemium· Free Basic; Pro $10/mo annual ($12 mo); Max $25/mo annual ($30 mo)arxiv-discoverypaper-summarization

Explainpaper

RAG · Undisclosed (tiered basic vs. advanced)

AI reading companion that decodes dense academic papers by highlighting and chatting with the PDF.

Freemium· Free; Pro $16/mo with 7-day trialpaper-readingresearch-summaries

FedML

Fine-tuning · Bring-your-own (PyTorch, Hugging Face)

Distributed training, fine-tuning, and serving platform with federated learning roots.

Freemium· Open-source library free; managed GPU usage pay-as-you-gofine-tuningdistributed-training

Geniusrise

Agents · Multi-model

Open-source framework for building, deploying, and scaling AI microservices across text, vision, and audio.

Free· Free, open source; self-hostedinference-servingfine-tuning

Google AI Studio

Coding · Gemini 2.5 Pro / Flash, Imagen, Veo

Browser-based playground and API console for prototyping with Google's Gemini models.

Freemium· Free tier with rate limits; paid via Gemini API usage-based pricingprompt-prototypinggemini-api-keys

Gorilla

Agents · gorilla-openfunctions-v2 (6.91B)

Open-source LLM purpose-built for function calling and API invocation across thousands of tools.

Free· Free and Apache 2.0; self-hostedfunction-callingtool-use

Guild AI

Agents · Multi-model (bring your own)

Control plane for deploying, governing, and auditing AI agents in production.

Freemium· Free 100 runs/mo; Individual $20/mo; Team $200/mo; Enterprise customagent deploymentagent governance

H2O AutoML

Fine-tuning · H2O-3 (GBM, XGBoost, GLM, DRF, Deep Learning, Stacked Ensembles)

Open-source automated machine learning that handles feature engineering, model selection, and stacked ensembling out of the box.

Free· Free and open-source (Apache 2.0); paid Driverless AI sold separatelyautomltabular-ml

Harmonai

Audio · Dance Diffusion / Stable Audio family

Open-source generative audio lab from Stability AI building diffusion models for music production.

Free· Free open-source models and code; no hosted product on this sitemusic-generationsound-design

Hugging Face AutoTrain

Fine-tuning · Multi-model (Hugging Face Hub)

No-code fine-tuning and training pipeline that spins up state-of-the-art models on the Hugging Face Hub.

Paid· Per-minute billing based on hardware tier; self-hosted OSS version is freellm-fine-tuningtext-classification

Iguazio

Agents · Multi-model

Enterprise MLOps and GenAI platform for taking models from notebook to production at scale.

Enterprise· Contact sales; free trial availablemlopsllm-fine-tuning

Inspect AI

Evaluation · Multi-model

Open-source LLM evaluation framework from the UK AI Security Institute with 200+ built-in benchmarks.

Free· Free and open source (MIT-style license); you pay only for underlying model API usage.llm-benchmarkingagent-evaluation

Jina Serve

Agents

Open-source Python framework for serving multimodal AI models as scalable gRPC/HTTP microservices.

Freemium· Open-source free; Jina AI Cloud hosting paidmodel-servingmultimodal-pipelines

KNIME

Agents · Multi-model (OpenAI, Anthropic, Gemini, Ollama)

Visual node-based data science platform with built-in connectors for OpenAI, Anthropic, Gemini, and local LLMs.

Freemium· Free open-source desktop; Team and Business Hub plans paiddata-pipelinesllm-workflows

Kiln AI

Evaluation · Multi-model

Open-source workbench for building, evaluating, and fine-tuning AI agents across 190+ models.

Freemium· Free Individual tier; Team (request access); Enterprise (custom)llm-evaluationfine-tuning

Kubeflow

Agents · Multi-framework (PyTorch, JAX, XGBoost, TensorFlow)

Open-source toolkit for running the full ML lifecycle on Kubernetes.

Free· Free and open source; commercial distributions and managed offerings priced separately by vendorsml-pipelinesdistributed-training

LLM Stats

Evaluation · Multi-model

Live leaderboard and side-by-side comparison hub for 300+ frontier LLMs across reasoning, coding, and multimodal benchmarks.

Free· Free to browse; underlying model usage billed by each providermodel-comparisonbenchmark-tracking

Ludwig

Fine-tuning · Multi-model (PyTorch + HuggingFace Transformers)

Declarative, YAML-driven deep learning framework for fine-tuning LLMs and multi-modal models without writing training loops.

Free· Free, Apache 2.0 open sourcellm-fine-tuningmulti-modal-training

LynxKite

Agents · Multi-model (LLM agents + GNNs + NVIDIA BioNeMo)

No-code AI orchestration platform built for graph-native pipelines in drug discovery and enterprise analytics.

Enterprise· Contact sales; no public pricingdrug-discoverygraph-neural-networks

MMagic

Image Generation · Multi-model (Stable Diffusion, ControlNet, StyleGAN, GANs, diffusion)

OpenMMLab's research-grade toolbox for image and video generation, restoration, and editing.

Free· Free and open source (Apache 2.0)text-to-imagesuper-resolution

NotebookLM

RAG · Gemini 2.5

Google's source-grounded research notebook that turns your documents into chats, briefs, and AI-hosted podcasts.

Freemium· Free tier; Plus via Google One AI Premium ($19.99/mo) or Workspace add-ondocument Q&Aresearch synthesis

ONNX

Fine-tuning

Open standard for representing and exchanging machine learning models across frameworks and runtimes.

Free· Free and open source (Apache-2.0); Linux Foundation AI projectmodel-interchangeedge-deployment

OpenAI Evals

Evaluation · OpenAI GPT models (extensible)

OpenAI's open-source framework for benchmarking LLMs against a shared registry of evaluations.

Free· Free (MIT); you pay OpenAI API costs for eval runsllm-benchmarkingregression-testing

OpenBB

Agents · Multi-model (Agent Rita is model-agnostic)

Open-source financial workspace where analysts and AI agents share the same governed data.

Freemium· Free Community Edition; paid Pro tiers via pro.openbb.cofinancial-researchinvestment-workflows

OpenSandbox

Agents · Model-agnostic

Open-source sandbox infrastructure for running AI-generated code, agents, and browsers in isolated Docker or Kubernetes environments.

Free· Open source (Apache 2.0); managed pricing not disclosedcode-executionagent-sandboxing

Optuna

Fine-tuning

Open-source Python framework for automated hyperparameter optimization across any ML stack.

Free· Free and open source (MIT)hyperparameter-tuningml-experiment-tracking

PandasAI

Coding · Multi-model (via LiteLLM)

Conversational data analysis library that turns natural-language questions into pandas, SQL and chart code.

Freemium· OSS library free (MIT); managed cloud and enterprise self-hosted are contact-salesdata-analysisnatural-language-sql

Paperspace Gradient

Fine-tuning · Bring-your-own (PyTorch, TensorFlow, Hugging Face)

End-to-end MLOps platform with GPU notebooks, training jobs, and model deployment, now folded into DigitalOcean.

Freemium· Free notebook tier; paid Pro/Growth plans + per-second GPU billingmodel-trainingfine-tuning

PyCaret

Coding · Multi-model (scikit-learn, XGBoost, LightGBM, CatBoost)

Low-code Python AutoML library that wraps scikit-learn, XGBoost, LightGBM and friends behind a few-line API.

Free· Free and open-source (MIT license)automlclassification

RAPIDS

Coding

NVIDIA's open-source suite of GPU-accelerated drop-in replacements for pandas, scikit-learn, and NetworkX.

Free· Free and open sourcegpu-dataframesml-training

Ray Tune

Fine-tuning

Open-source Python library for distributed hyperparameter tuning at any scale.

Free· Open-source (Apache 2.0); managed via Anyscale offers a $100 starting credithyperparameter-tuningdistributed-training

Recommenders

Coding · Multi-algorithm (ALS, xDeepFM, others)

Open-source Python library with classical and deep-learning algorithms for building recommendation systems.

Free· Free and open-source (MIT License)recommendation-systemscollaborative-filtering

RunPod

Fine-tuning · Bring-your-own (any open-weight or custom model)

On-demand GPU cloud and serverless inference platform built specifically for AI workloads.

Paid· Pay-per-second GPU rental; H100 from ~$1.89/hr, consumer GPUs from ~$0.20/hrllm-fine-tuninggpu-rental

Runcell

Coding · Multi-model (GPT, Claude, Gemini)

Jupyter-native AI agent built for multi-week ML and data science projects.

Freemium· Free Hobby tier with monthly credits; paid plans for more credits and frontier modelsjupyter-notebooksdata-science

Sematic

Agents

Open-source Python-first orchestrator for ML training pipelines from laptop to cloud.

Freemium· Open-source free; managed/enterprise tier on requestml-pipelinestraining-orchestration