PySpur
Open-source agent builder with a drag-and-drop canvas, Python escape hatch, and a built-in test harness.
Pick PySpur if you want a self-hostable, git-friendly agent builder that takes evaluation as seriously as orchestration.
Skip it if you need a hosted, zero-ops SaaS today or a deep catalog of pre-built integrations.
PySpur is an open-source framework for building, testing, and deploying AI agents. It pairs a visual drag-and-drop workflow editor with first-class Python so engineers can sketch a pipeline graphically, then drop into code for custom tools, evaluators, or nodes. Agents are exported as JSON, which makes them easy to diff, code-review, and roll out via the same git flow you already use for the rest of your stack.
The differentiator is the emphasis on the iteration loop. Most agent frameworks (LangGraph, CrewAI, etc.) leave you to bolt on your own test rig; PySpur ships test-case definition, batch evaluation, and failure inspection in the core product, which is closer in spirit to LangSmith than to a pure orchestration library. It's Apache-2.0 licensed with 5,500+ GitHub stars, installs via pip, and runs equally well on a laptop, a cloud VM, or your own hardware. A managed Cloud tier and Python/TypeScript SDKs are flagged as forthcoming.
PySpur is model-agnostic: you wire up whichever LLM provider the node calls, so you can mix Claude, GPT, open-weight models, or local Ollama within one graph. It's a sensible pick for teams that want a self-hosted alternative to closed visual builders like Flowise Cloud or Relevance AI, particularly where the agent graph needs to live next to the rest of your application code.
PySpur is one of the more thoughtful open-source agent frameworks because it treats the test loop as a first-class citizen, not a tutorial appendix. The visual-plus-Python model is the right compromise, and Apache 2.0 plus JSON exports keep it from feeling like a walled garden. The trade-off is youth: docs and integrations are still catching up to the big-name competitors.
— The AI Tool Bible editorial team
Pros
- ✅ Apache-2.0 licensed and pip-installable, runs fully self-hosted
- ✅ Visual canvas plus Python escape hatch, no lock-in to a DSL
- ✅ Built-in test cases and failure inspection, not an afterthought
- ✅ Agents export as JSON so they diff cleanly in git
Cons
- ⚠️ Managed cloud and official SDKs are still on the roadmap
- ⚠️ Smaller ecosystem than LangGraph or LlamaIndex
- ⚠️ Visual builders can hide complexity that bites at scale
Use cases
Explore related
Compare with similar tools
All in Agents →LangGraph
FeaturedStateful, graph-based agent orchestration from LangChain.
CrewAI
FeaturedPython framework for multi-agent orchestration.
Claude Agent SDK
Anthropic's official SDK for building autonomous Claude agents.
Manus
Generalist agent for research, code, and web tasks.
Devin
Cognition Labs' "autonomous software engineer" agent.
AutoGPT
Open-source platform for building autonomous AI agents.