📖 The AI Tool Bible

SWE-agent

Open-source autonomous agent framework that lets LLMs fix GitHub issues and find security vulnerabilities by using a custom agent-computer interface.

Free· Free and open-source; you pay your own LLM API costsAgentsMulti-model (GPT-4o, Claude Sonnet, DeepSeek, local via LiteLLM)
Visit website →
Best for

Pick SWE-agent if you're a researcher or engineer who wants a hackable, reproducible coding-agent baseline you can wire to any LLM.

Skip if

Skip it if you want a plug-and-play hosted coding assistant - reach for Cursor, Devin, or Cline instead.

SWE-agent is an open-source autonomous coding agent built by researchers at Princeton and Stanford that turns a language model into a software engineer. It gives the LLM a tightly-scoped agent-computer interface (ACI) for navigating repos, editing files, running tests, and iterating on fixes, and it was one of the first agents to post serious numbers on SWE-bench - including state-of-the-art results among open-source projects when it launched. A sibling mode, EnIGMA, retargets the same loop at offensive-security CTF challenges.

It's aimed at researchers and engineers who want a hackable agent rather than a polished SaaS. Configuration is a single YAML file, you bring your own API key (GPT-4o, Claude Sonnet, DeepSeek, or local models via LiteLLM), and you run it yourself - there's no hosted product, no pricing page, no signup. The team has explicitly moved focus to mini-swe-agent, a 100-line successor, and labels the original repo as maintenance-only, so treat SWE-agent as a stable research baseline rather than an actively-developed product.

It integrates with GitHub issues, Docker-sandboxed execution, and the SWE-bench evaluation harness, and is widely cited as a reference implementation for tool-using coding agents. Caveat: LLM API costs are on you, and on real-world tasks they add up quickly.

Editor's take

SWE-agent is a foundational piece of the open coding-agent canon and still worth studying, but the team has moved on to mini-swe-agent and says so on the front page. Use it as a research reference or a starting fork; for daily-driver coding, pick something actively maintained.

— The AI Tool Bible editorial team

Pros

  • Open-source under MIT with a strong research pedigree (Princeton/Stanford)
  • Model-agnostic via LiteLLM - swap GPT-4o, Claude, or local models freely
  • Reproducible SWE-bench harness makes it a credible baseline for agent research
  • EnIGMA mode extends the same loop to CTF-style security tasks

Cons

  • ⚠️ Officially in maintenance mode; team now recommends mini-swe-agent
  • ⚠️ No hosted product, GUI, or managed service - CLI and YAML only
  • ⚠️ LLM API costs on long-horizon tasks can be significant
  • ⚠️ Setup requires Docker and comfort with Python tooling

Use cases

github-issue-fixingautonomous-codingswe-benchctf-securityagent-research

Explore related

Compare with similar tools

All in Agents