Skip to content

ARUNAGIRINATHAN-K/awesome-ai-agents-2026

Repository files navigation

Awesome AI Agents 2026

Awesome GitHub stars PRs Welcome Last Commit Tools Listed License: CC0

The most comprehensive, structured guide to AI agent frameworks, tools, and resources.
Updated weekly. Compared side-by-side. Built for developers who ship.

Explore the StackRead the DocsSubmit a Tool

Contents

Orchestration Frameworks

  • Agency Swarm - Orchestrates multi-agent systems built on the OpenAI Assistants API with role-based collaboration (🏷️ Python OpenAI SDK).
  • AgentScope - Alibaba multi-agent framework with distributed deployment and fault tolerance for production use (🏷️ Python Alibaba SDK).
  • Agno - Multi-agent framework with a runtime and control plane for managing agent deployments at scale (Python FastAPI Cloud) (🏷️ Python FastAPI Cloud).
  • Anthropic SDK - Official Claude SDK with tool use, computer control, and streaming for building Claude-native agents (🏷️ Python Anthropic SDK).
  • AutoGen - Event-driven multi-agent framework merged with Semantic Kernel for production workflows (🏷️ Python Semantic Kernel SDK).
  • AWS Bedrock Agents - Managed AWS infrastructure for agents with Bedrock models, enterprise compliance, and auto-scaling deployment (🏷️ Python AWS Cloud [Enterprise] [SOC2]).
  • CAMEL - Role-based multi-agent simulation framework for collaborative reasoning and task completion (🏷️ Python Research SDK).
  • CopilotKit - The open-source AI copilot platform for React, allowing developers to embed custom AI agents directly into their applications (🏷️ TypeScript React SDK).
  • CrewAI - Production multi-agent framework with 5K+ GitHub stars, role-based collaboration, async execution, and 1500+ company adoptions (🏷️ Python LangChain SDK [Enterprise]).
  • DSPy - Programmatically optimizes LLM prompts and chains using machine learning instead of manual prompt engineering (🏷️ Python Stanford SDK).
  • Ontheia - Self-hosted, open-source AI agent platform with Chain Engine for visual workflow automation, MCP-native tool integration, multi-provider support (Claude, OpenAI, Gemini, Ollama), long-term memory via pgvector, multi-user RBAC, and GDPR-compliant architecture (🏷️ TypeScript Self-hosted MCP Docker [Enterprise] [GDPR]).
  • Google ADK - Modular agent dev kit integrating Gemini and Vertex AI natively (🏷️ Python Vertex AI SDK).
  • Haystack - Production-ready AI orchestration framework focused on building customizable LLM applications and RAG pipelines (🏷️ Python RAG SDK).
  • LangGraph - Enterprise framework for stateful, graph-based agent workflows (🏷️ Python LangChain SDK).
  • Letta - Formerly MemGPT. Stateful agents with built-in long-term memory and a REST API server (🏷️ Python REST SDK).
  • LightAgent - Lightweight Python agent framework with memory, MCP/SSE integration, reusable Skills, Tree-of-Thought planning, streaming, and LightSwarm multi-agent collaboration (🏷️ Python MCP SDK Multi-Agent).
  • LlamaIndex - The leading framework for connecting LLMs to your data, with powerful indexing and retrieval capabilities (🏷️ Python RAG SDK).
  • LangSmith Fleet - No-code agent orchestration platform with visual workflow builder, real-time monitoring, and enterprise deployment (🏷️ Cloud LangChain Platform [Enterprise]).
  • Mastra - Opinionated TypeScript framework with RAG, observability, and MCP support built in (🏷️ TypeScript Next.js SDK).
  • MicroAgent - Lightweight agent framework with self-editing prompts and code for minimal-overhead agent loops (🏷️ TypeScript Lightweight SDK).
  • Modus - Serverless framework for high-throughput agent workloads with minimal cold starts (🏷️ WebAssembly Hypermode Serverless).
  • Open-AutoGLM - Open-source phone agent model and framework for building mobile device automation agents (🏷️ Python Mobile SDK).
  • OpenAI Agents SDK - Lightweight multi-agent SDK with tracing and guardrails from OpenAI (🏷️ Python OpenAI SDK).
  • PraisonAI - Production multi-agent framework with self-reflection, MCP integration, and workflow automation (🏷️ Python Multi-Agent CLI).
  • PydanticAI - Type-safe agent framework from the Pydantic team with a FastAPI-style developer experience (🏷️ Python Pydantic SDK).
  • Semantic Kernel - Microsoft enterprise SDK for Python, C#, and Java with modular plugins, memory, and goal planning (🏷️ C# Microsoft SDK).
  • Smolagents - Hugging Face code-first framework where agents write and execute Python instead of JSON tool calls (🏷️ Python Hugging Face SDK).
  • Strands Agents SDK - AWS model-driven agent SDK with native Bedrock integration (🏷️ Python AWS Bedrock SDK).
  • Upsonic - Minimalist agent framework with MCP support and rapid setup for quick prototyping (🏷️ Python MCP SDK).
  • Vercel AI SDK - Streaming-first primitives for AI UIs with React Server Components and edge runtime support (🏷️ TypeScript Next.js SDK).
  • VoltAgent - TypeScript agent framework with built-in observability and a self-improving context engine (🏷️ TypeScript React SDK).
  • Vertex AI Agents - Google Cloud enterprise agent framework with native Gemini integration, scalable infrastructure, and compliance certifications (🏷️ Python Google Cloud [Enterprise] [GDPR]).

framework-flow

Coding Agents

  • Aider - Terminal-first pair programmer that edits code in local repos, preserves Git history, and supports multi-file changes (🏷️ Python CLI Local).
  • Amazon Q Developer - AWS-native AI coding assistant with Lambda, CloudWatch, infrastructure support, and security scanning (🏷️ Python AWS IDE).
  • AutoGPT - Mature autonomous agent platform with Forge framework and public benchmarks for evaluating agent capabilities (🏷️ Python Forge CLI).
  • Claude Code - Terminal-first agentic coding from Anthropic with Computer Use integration, multi-file edits, persistent shell sessions, Git operations, and fine-tuning support (🏷️ TypeScript CLI Local [Anthropic]).
  • Cline - Autonomous coding agent in your IDE that creates/edits files, runs commands, and uses the browser with permission-gated steps (🏷️ TypeScript VS Code IDE).
  • CodeRabbit - AI-powered PR reviewer with inline suggestions, security scanning, and automatic code quality checks (🏷️ Cloud GitHub Web).
  • Codex CLI - OpenAI's lightweight, open-source terminal coding agent with fast execution and strong benchmark scores (🏷️ Python CLI Local).
  • Codex-CLI - CLI tool that turns natural language commands into Bash, ZShell, and PowerShell equivalents (🏷️ C# CLI Local).
  • Continue - Source-controlled AI checks enforceable in CI, powered by the open-source Continue CLI (🏷️ TypeScript VS Code IDE).
  • Cursor - AI-native IDE with Cursor Cloud Agents supporting parallel task execution, $29.3B valuation, 100K+ enterprise users, and advanced agentic loops with real-time cursor control (🏷️ TypeScript Electron Desktop [Enterprise]).
  • Devika - Open-source AI software engineer that understands high-level instructions and writes full-featured code autonomously (🏷️ Python Docker Web).
  • Devin - Fully autonomous AI software engineer showing 8-12x efficiency gains in production (Nubank case study), cloud sandbox with persistent state, GitHub integration, and enterprise adoption growing 40% MoM (🏷️ Cloud Web SaaS [Enterprise]).
  • gemini-cli - Open-source AI agent that brings the power of Gemini directly into your terminal (🏷️ Go CLI Local).
  • GitHub Copilot - Agent Mode in VS Code with Copilot Workspace for issue-to-PR workflows across Claude, GPT, and Gemini (🏷️ TypeScript VS Code IDE).
  • Goose - Open-source extensible AI agent that goes beyond code suggestions, installs, executes, edits, and tests with any LLM (🏷️ Rust CLI Local).
  • JetBrains AI - Deep AI integration across all JetBrains IDEs with context-aware completions and refactoring (🏷️ Kotlin JetBrains IDE).
  • Kiro - Spec-driven development agent that writes specs, auto-generates tasks, implements code, and automates DevOps workflows (🏷️ Cloud AWS IDE).
  • Open Interpreter - Execute code locally via natural-language model instructions with a ChatGPT-like interface (🏷️ Python CLI Local).
  • opencode - Open-source coding agent available as a desktop application with a visual interface (🏷️ TypeScript Electron Desktop).
  • OpenHands - AI-driven development platform that writes, tests, and deploys code autonomously (🏷️ Python Docker Web).
  • PR-Agent - Open-source AI PR reviewer that auto-describes, reviews, and improves pull requests (🏷️ Python GitHub CLI).
  • Qodo - AI code review platform with context-aware PR validation and security analysis (🏷️ Cloud Enterprise Web).
  • RooCode - Cline fork with structured modes and reduced hallucinations for more reliable code generation (🏷️ TypeScript VS Code IDE).
  • Snyk Code - AI-powered security scanner with real-time vulnerability detection in agent-generated code (🏷️ Cloud Security Platform).
  • Sourcegraph Cody - AI coding assistant that excels at large codebases with an enterprise context engine (🏷️ TypeScript Enterprise IDE).
  • SWE-agent - Takes a GitHub issue and tries to automatically fix it. Also used for cybersecurity and competitive coding (🏷️ Python Docker CLI).
  • Tabnine - Privacy-first AI code completion with on-premise deployment and codebase fine-tuning options (🏷️ TypeScript Enterprise IDE).
  • TaskWeaver - A code-first agent framework from Microsoft for seamlessly planning and executing data analytics tasks (🏷️ Python Jupyter CLI).
  • Windsurf - AI-native IDE with Cascade agent for multi-step autonomous tasks and team workflows (🏷️ TypeScript Electron Desktop).

Memory and Context

  • Acontext - Manages agent skills and long-term memory as a layered data structure for persistent context (🏷️ Python SDK Local).
  • Chroma - Lightweight, embeddable vector store for building memory-augmented AI agents with fast semantic retrieval (🏷️ Python TypeScript SDK).
  • cognee - Knowledge engine for AI agent memory, set up in 6 lines of code with graph-based knowledge extraction (🏷️ Python Neo4j SDK).
  • Cortex Memory - Full-stack solution for agent memory covering extraction, vector search, and optimization (🏷️ Python Vector DB SDK).
  • graphiti - Build real-time knowledge graphs for AI agents with automatic entity extraction and linking (🏷️ Python Knowledge Graph SDK).
  • LanceDB - Serverless vector search database embedded directly in the agent process with no infrastructure needed (🏷️ Rust Python SDK).
  • Langmem - Helps agents learn and adapt from their interactions over time with persistent memory (🏷️ Python LangChain SDK).
  • Mem0 - Memory layer for AI applications with long-term, short-term, and semantic memory extraction (🏷️ Python Vector Cloud).
  • Memoir - Git-like versioned semantic memory for AI agents with branching, commits, and cryptographic integrity over hierarchical paths (🏷️ Python Git-like SDK).
  • Memvid - Replace complex RAG pipelines with a serverless, single-file memory layer for instant retrieval (🏷️ Python Multimodal SDK).
  • Milvus - Scales vector search to billions of embeddings for large-scale agent knowledge bases (🏷️ Go Python Platform).
  • Motorhead - Manages conversation context windows for agents with automatic background summarization (🏷️ Rust Python SDK).
  • Pathway - Live data RAG engine with real-time streaming for agents that need up-to-the-second knowledge (🏷️ Python Streaming SDK).
  • Pinecone - Managed vector database with agent namespaces for multi-tenant isolation, hybrid search (vector + keyword), serverless auto-scaling, and $11B valuation (🏷️ Cloud Vector DB API [Enterprise] [SOC2]).
  • Qdrant - High-performance vector similarity search engine with rich payload filtering for agent memory (🏷️ Rust Python SDK).
  • RAGFlow - Open-source RAG engine with agent capabilities and deep document understanding for knowledge bases (🏷️ Python RAG Platform).
  • SimpleMem - Efficient lifelong memory for LLM agents supporting both text and multimodal inputs (🏷️ Python SDK Local).
  • Supermemory - Extremely fast and scalable memory engine and API designed for the AI era (🏷️ TypeScript Vector DB Web).
  • Weaviate - Stores and searches vector embeddings with hybrid keyword and semantic retrieval for agent knowledge (🏷️ Go Python Platform).
  • Zep - Enriches agent long-term memory with automatic summarization, entity extraction, and search (🏷️ Python TypeScript SDK).

Multi-Agent Systems

  • AgentVerse - Framework for building custom multi-agent environments to accomplish collaborative tasks (🏷️ Python Benchmark SDK).
  • EvoAgentX - Evaluates and evolves agentic workflows over time using automatic optimization (🏷️ Python Optimization SDK).
  • Hivemoot - Autonomous agent teams that collaboratively build software on GitHub (🏷️ Python GitHub SDK).
  • MetaGPT - Simulates a full software company workflow from requirements to PRs using role-playing agents (🏷️ Python Role-Playing SDK).
  • P2PCLAW - Decentralized scientific research network with 14+ autonomous agents featuring P2P mesh, IPFS storage, and Lean 4 formal verification, self-hosted with Docker Compose (🏷️ Python P2P Self-Hosted).
  • Swarm - Lightweight framework for agent handoffs, context variables, and function calling patterns from OpenAI (🏷️ Python OpenAI SDK).
  • Swarms Framework - Multi-agent orchestration for production use cases with scalability and reliability at its core (🏷️ Python Production SDK).
  • Yao Agents - Complete local AI execution platform with 30+ Experts, autonomous Robot orchestration via 5-stage Pipeline, API-based Robot-to-Robot calling, MCP support, and multi-channel messaging (🏷️ Go TypeScript Docker MCP Self-Hosted WeChat Feishu).

Agent Communication Protocols

The protocol layer that enables agents to discover tools, communicate with each other, and interoperate across ecosystems.

Protocol Purpose Creator Status
MCP Agent-to-tool Anthropic Standard
A2A Agent-to-agent Google Growing
ACP Agent communication IBM/BeeAI Early

MCP (Model Context Protocol)

  • A2A Protocol - Google's open protocol enabling AI agents to communicate, collaborate, and delegate tasks across frameworks (🏷️ Protobuf Standard RPC).
  • Arcade AI - Tool-use platform with authentication, authorization, and logging for agent-tool interactions (🏷️ Python Cloud API).
  • Composio - Integration platform with 250+ pre-built tool connectors for AI agents and LLMs (🏷️ TypeScript Cloud API).
  • Docker MCP - Docker's MCP gateway CLI plugin for running MCP servers in isolated containers (🏷️ Go Docker CLI).
  • HCS Agent Protocol - Hedera open standards for agent identity with trustless P2P communication and 187K+ verified agents (🏷️ TypeScript Hedera Protocol).
  • Hyper - Open-source Agent Skills and a hosted MCP connecting agents to 200+ marketing integrations across paid ads, SEO, analytics, social, and image and video generation, with a human-approval gate on every action (🏷️ Cloud MCP Marketing).
  • MCP Registry - Official Model Context Protocol specification and server implementations for standardized tool access (🏷️ JSON Standard Registry).
  • mcp-nest - Unified Model Context Protocol (MCP) server for executing code and managing files (🏷️ Python MCP CLI).
  • NotFair - Hosted Google Ads MCP server for diagnosing, optimizing, and executing campaign changes via the Google Ads API with a human-approval gate (🏷️ Cloud MCP Marketing).
  • Toolhouse - Cloud-hosted tool infrastructure for agents with optimized execution and low-latency access (🏷️ Python Cloud API).
  • Zapier MCP Server - Connect agents to 7,000+ app integrations via MCP, powered by Zapier's automation platform (🏷️ Cloud Zapier API).
  • zero-api-key-web-search - Free web search toolkit for AI agents with no API keys, MCP server support (🏷️ Python MCP Search).

Browser and Computer Use Agents

  • Agent S2 - Open-source GUI automation framework for building desktop and browser agents with vision (🏷️ Python Vision SDK).
  • AgentQL - AI-powered web scraping and automation with a semantic query language for page elements (🏷️ Python Playwright SDK).
  • Airtop - Enterprise-grade cloud browser infrastructure for AI agent automation at scale (🏷️ Cloud Enterprise Platform).
  • Browser Use - Open-source framework to let LLMs navigate and interact with any website programmatically (🏷️ Python Playwright SDK).
  • Browserbase - Scalable headless browser infrastructure purpose-built for running AI agents in production (🏷️ Cloud Infrastructure Platform).
  • Dia Browser - AI-native browser from Atlassian and the Browser Company with built-in agent workflows (🏷️ Cloud Desktop Browser).
  • Fellou - Transparent agentic browser with visual workflow editing and persistent agentic memory (🏷️ Cloud Desktop Browser).
  • Genspark - On-device AI browser with 169+ local models that runs without internet connection (🏷️ Cloud Desktop Browser).
  • Grok Computer - Upcoming xAI desktop agent with mouse control and application automation capabilities (🏷️ Cloud Desktop Platform).
  • LaVague - Large Action Model framework to turn natural language instructions into browser automation (🏷️ Python Selenium SDK).
  • Manus - Autonomous digital employee with Browser Operator extension acquired by Meta for web-based task execution (🏷️ Cloud Meta Platform).
  • MultiOn - Reliable web automation API with CAPTCHA handling for building production browser agents (🏷️ Cloud API Platform).
  • OpenAI Operator - ChatGPT autonomous web agent with human checkpoints using Computer Use Agent technology (🏷️ Cloud OpenAI Platform).
  • Playwright MCP - MCP server wrapping Playwright for seamless browser automation integration with LLM agents (🏷️ TypeScript MCP SDK).
  • Skyvern - Production browser automation with 21.5K GitHub stars, 10M+ executed workflows, vision-first automation, and sub-300ms response times (🏷️ Python Playwright Cloud).
  • Stagehand - AI web browsing framework built on Playwright with natural-language selectors and actions (🏷️ TypeScript Playwright SDK).

Agent Tooling and Infrastructure

Sandboxes, web scrapers, browser automation, and networking layers that agents depend on.

  • AgentDock - Framework for building and deploying production-ready AI agents with composable node architecture (🏷️ Python Docker Platform).
  • codex-profiles - Bash CLI for switching OpenAI Codex CLI and Desktop profiles with isolated CODEX_HOME directories (🏷️ Bash CLI Local).
  • Crawl4AI - Extracts structured data from web pages using LLM-friendly output formats optimized for agent ingestion (🏷️ Python Playwright SDK).
  • Docling - Parses PDFs, DOCX, and slides into structured text with deep layout understanding for document agents (🏷️ Python PDF SDK).
  • E2B - Cloud sandboxes for AI agents to run code securely in isolated environments (🏷️ TypeScript Cloud Sandbox).
  • Engram - Universal bridge for multi-protocol AI agent systems with automated semantic mapping (🏷️ Python Networking SDK).
  • Firecrawl - Web scraping API built for LLMs that converts websites to clean, structured markdown (🏷️ TypeScript Cloud API).
  • Jina Reader - Converts any URL to LLM-ready clean text via a simple API prefix for agent ingestion (🏷️ TypeScript Cloud API).
  • LlamaParse - GenAI-native document parser designed to extract complex tables and layouts for RAG pipelines (🏷️ Python Cloud API).
  • Marker - Converts PDF documents to markdown with high accuracy for tables, equations, and figures (🏷️ Python PDF CLI).
  • Notte - Browser automation engine optimized for production AI pipelines (🏷️ Python Browser SDK).
  • Pilot Protocol - Networking stack for distributed agent systems with encrypted tunnels (🏷️ Go Networking SDK).
  • Playwright - Automates Chromium, Firefox, and WebKit browsers with a single cross-language API for agent-driven testing (🏷️ TypeScript Python SDK).
  • ScrapeGraphAI - Python web-scraping library that uses LLMs to build intelligent scraping pipelines from natural-language instructions (🏷️ Python LangChain SDK).
  • Surya - Runs OCR and layout detection on documents in 90+ languages for multilingual document agents (🏷️ Python PDF CLI).
  • Tavily - Search API purpose-built for LLM agents providing real-time, accurate web data with source citations (🏷️ Python Cloud API).
  • traceAI - OpenTelemetry-native tracing for LLM and agent apps with 50+ framework integrations (🏷️ Python TypeScript OpenTelemetry SDK).
  • Unstructured - Ingests and preprocesses documents across 25+ file types for downstream LLM and agent pipelines (🏷️ Python Pipeline SDK).

Low and No-Code Builders

  • Activepieces - Open-source Zapier alternative with AI-powered automation nodes and visual workflow builder (🏷️ TypeScript Docker Web).
  • AgentGPT - Deploy AI agents in the browser with zero local setup required (🏷️ TypeScript Next.js Web).
  • Dify - Open-source LLM app development platform with visual workflow builder and RAG orchestration (🏷️ Python FastAPI Web).
  • FastAgency - Deploys multi-agent workflows as production APIs with minimal configuration and setup (🏷️ Python FastAPI SDK).
  • Flowise - Open-source drag-and-drop UI to build customized LLM flows, RAG pipelines, and agent systems (🏷️ TypeScript React Web).
  • Langflow - Visual drag-and-drop builder for LLM workflows, RAG agents, and multi-step pipelines (🏷️ Python FastAPI Web).
  • Lindy - No-code AI agent platform with 3000+ app integrations for business workflow automation (🏷️ Cloud No-Code Platform).
  • Make - Visual workflow automation platform with AI capabilities and drag-and-drop scenario builder (🏷️ Cloud No-Code Platform).
  • n8n - Open-source workflow automation with AI agent nodes combining visual and code-based orchestration (🏷️ TypeScript Docker Platform).
  • Relevance AI - No-code AI agent builder for sales, support, and research use cases with team collaboration (🏷️ Cloud No-Code Platform).
  • Rivet - Visual AI workflow builder with drag-and-drop interface for designing agent pipelines (🏷️ TypeScript React Desktop).
  • Temporal - Durable execution platform for long-running agent workflows with automatic retry and state persistence (🏷️ Go Production Platform).
  • Wordware - Web-hosted IDE where domain experts collaborate with AI engineers to build agent workflows (🏷️ TypeScript Next.js Web).
  • Zapier AI - Connects 7000+ apps with natural language workflow creation for no-code agent automation (🏷️ Cloud No-Code Platform).

Voice and Multimodal Agents

  • Agentset - Production RAG platform with reasoning, hybrid search, and full multimodal support (🏷️ Python Multimodal Platform).
  • LiveKit Agents - Framework for building real-time, multimodal AI agents with voice, video, and data channels (🏷️ Python LiveKit SDK).
  • Nuance AI - Enterprise speech and conversational AI platform for clinical and contact-center workflows with HIPAA-capable deployments (🏷️ Cloud Voice Enterprise [HIPAA]).
  • Google Cloud Speech-to-Text v2 - Google Cloud streaming and batch speech recognition API v2 with improved accuracy, streaming, and noise suppression for real-time agent pipelines (🏷️ Cloud Google STT).
  • Pipecat - Production-grade voice AI framework with sub-250ms latency, WebRTC support, multimodal (voice+vision+text), real-time streaming, and 70+ language support (🏷️ Python WebRTC SDK).
  • Rasa - Open-source conversational AI framework with self-hosted NLU training and dialogue management (🏷️ Python NLU SDK).
  • simulate-sdk - Persona- and scenario-driven SDK for simulating voice and text AI agents (🏷️ Python Voice Simulation SDK).
  • Vapi - Platform for building voice AI agents with low-latency speech-to-speech capabilities (🏷️ Python Cloud API).
  • Vocode - Open-source framework for building voice-based LLM agent applications with streaming support (🏷️ Python Voice SDK).
  • Wispr Flow - Voice orchestration platform for multimodal AI agents with 50+ language support, workflow building, and enterprise integrations (🏷️ Cloud Voice Platform [Enterprise]).

Safety Guardrails and Observability

  • Agent OS - Kernel architecture for governing autonomous AI agents with policy enforcement (🏷️ Python Policy SDK).
  • AgentDoG - Diagnostic guardrails that analyze full agent execution trajectories to detect instruction hijacking and tool misuse (🏷️ Python Diagnostics SDK).
  • AgentGuard - Runtime observability and guardrails for AI agents with loop detection and anomaly alerts (🏷️ Python CyberArk SDK).
  • agenttrace - Local-first TUI for AI coding agent session observability with tokens, cost, latency, tool failures, anomalies, reports, diffs, and CI health gates (🏷️ Go Observability CLI).
  • APort Agent Guardrails - Pre-action authorization plugin for agent frameworks with policy-based access control (🏷️ Python Authorization SDK).
  • Patronus AI LYNX - Hallucination detection system beating GPT-4 baselines, with specialized testing for agent outputs and LLM-generated content quality (🏷️ Cloud Testing SDK).
  • Arize Phoenix - Open-source observability platform built on OpenTelemetry for tracing, evaluating, and debugging AI agents (🏷️ Python OpenTelemetry Platform).
  • Braintrust - Eval-driven development platform with experiment tracking and prompt optimization for agent quality (🏷️ TypeScript Evaluation Platform).
  • ElevenAgents - Voice agent platform from ElevenLabs for customer support automation with HIPAA compliance and multi-language support (🏷️ Cloud Voice Platform [HIPAA] [Enterprise]).
  • DriftGuard - Semantic memory guardrails using causal graphs to prevent agents from repeating past failures (🏷️ Python Memory SDK).
  • Guardrails AI - Adds structural, type, and quality guarantees to LLM outputs for reliable agent responses (🏷️ Python Validation SDK).
  • Helicone - Open-source LLM observability with one-line integration for cost tracking and prompt analytics (🏷️ TypeScript Observability Platform).
  • Lakera Guard - Real-time protection against prompt injection, data leakage, and toxicity in agent interactions (🏷️ Cloud Security API).
  • Laminar - Open-source observability and analytics platform purpose-built for the full lifecycle of AI agents (🏷️ Rust Observability Platform).
  • Langfuse - Open-source LLM observability platform for tracing, prompt versioning, and LLM-as-a-judge evaluations (🏷️ TypeScript Next.js Platform).
  • LangSmith - LangChain platform for tracing, testing, and evaluating agent performance with production monitoring (🏷️ Cloud LangChain Platform).
  • LLM Guard - Security toolkit for scanning LLM inputs and outputs to prevent prompt injection and data leaks (🏷️ Python Security SDK).
  • Logfire - Python-native observability from the Pydantic team with deep integration for high-performance agent monitoring (🏷️ Python Pydantic Platform).
  • NeMo Guardrails - NVIDIA programmable guardrails toolkit for controlling and securing LLM-powered agent conversations (🏷️ Python NVIDIA SDK).
  • Orchard Kit - Modules for agent runtime security, self-audit trails, and collective cognition patterns (🏷️ Python Security SDK).
  • OWASP Top 10 for Agentic Apps - Security framework covering goal hijacking, tool misuse, and cascading failure mitigations for agents (🏷️ Policy Security Framework).
  • Rebuff - Self-hardening prompt injection detection system for securing agent inputs against adversarial attacks (🏷️ Python Security SDK).
  • ai-evaluation - LLM evaluation framework with 50+ metrics, LLM-as-Judge, and guardrail scanners (jailbreak, PII, injection) (🏷️ Python Evaluation Guardrails SDK).
  • Future AGI - Self-hostable end-to-end agent engineering platform with tracing, evals, guardrails, and gateway (🏷️ Python Platform Self-hosted).

Agent Interfaces and UIs

Frontend workspaces and chat interfaces with built-in agent plugins and tool-use capabilities.

  • AnythingLLM - All-in-one AI application with RAG, agents, and multi-model support for desktop and Docker (🏷️ TypeScript Docker Desktop).
  • DB-GPT - Data interaction platform with local LLM support for 100% private database and analytics agents (🏷️ Python Database Web).
  • LibreChat - Self-hosted multi-model chat interface supporting all major AI providers with access control (🏷️ TypeScript Docker Web).
  • LobeHub - Modern platform for hybrid work and AI-driven collaboration with extensible agent teams and rapid integration (🏷️ TypeScript Next.js Web).
  • LobeChat - Modern, open-source AI chat framework with a massive plugin ecosystem for autonomous agent capabilities (🏷️ TypeScript Next.js Web).
  • OpenWebUI - Extensible local AI interface with built-in RAG, tool use, and support for multi-agent workflows (🏷️ TypeScript Svelte Web).

Agent Deployment and Hosting

  • AWS Bedrock AgentCore - Managed AWS infrastructure for Bedrock-based agents with compliance, scaling, and monitoring built in (🏷️ TypeScript AWS Platform).
  • Cerebras Inference - Fastest LLM inference delivering 1000+ tokens per second on Llama 3.3 70B with a free tier (🏷️ Cloud Inference API).
  • Fireworks AI - Serverless LLM inference with fine-tuning, RAG support, and free credits for rapid prototyping (🏷️ Cloud Inference API).
  • Groq Cloud - Ultra-fast LPU-based LLM inference for Mixtral, Llama, and Gemma with a free API tier (🏷️ Cloud Inference API).
  • Modal - Serverless GPU compute purpose-built for AI workloads with fast cold starts and Python-native deployment (🏷️ Python Serverless Cloud).
  • Northflank - Full-stack platform with GPU orchestration, Git-based CI/CD, and bring-your-own-cloud support (🏷️ Kubernetes Cloud Platform).
  • Railway - One-click deploy from GitHub with persistent volumes and databases for stateful agent deployments (🏷️ Nix Cloud Platform).
  • Together AI - Inference API hosting 200+ open models with fast generation and a free tier for developers (🏷️ Cloud Inference API).
  • Trigger.dev - Background job platform with cron, webhook, and event triggers purpose-built for long-running agent tasks (🏷️ TypeScript Serverless Cloud).

Agent Evaluation and Benchmarks

  • AgentBench - Comprehensive benchmark for evaluating LLMs as agents across 8 distinct environments (🏷️ Python Benchmark SDK).
  • ARC-AGI-2 - Frontier benchmark for measuring general intelligence capabilities in AI agents beyond pattern matching (🏷️ Python Benchmark Research).
  • GAIA Benchmark - Benchmark for General AI Assistants measuring real-world reasoning and tool use (🏷️ JSON Benchmark Dataset).
  • Inspect AI - Framework for evaluating large language models with composable tasks and scoring (🏷️ Python Evaluation SDK).
  • SWE-bench - Benchmark for evaluating LLMs on real-world software engineering tasks from GitHub issues (🏷️ Python Docker Benchmark).
  • WebArena - Benchmark for web agent evaluation using real websites with realistic task completion metrics (🏷️ Python Benchmark Web).

Agent Testing & Debugging

Tools and frameworks focused on testing, debugging and evaluating agent workflows in production.

Tool Debugging Testing Eval Latency Notes
agenttrace Advanced Basic Local Session observability, tokens/cost/latency (see Safety Guardrails)
Inspect AI API Composable evaluation framework (see Agent Evaluation)
Patronus AI LYNX Cloud Hallucination detection for agent outputs
Braintrust Evals Basic Cloud Eval-driven development and experiment tracking
pytest-Agents Local Pytest fixtures and runners for agent unit tests
Agent Mutation Testing Local Generates adversarial mutations to test agent robustness
Cortex Debugger Local Step-through debugging primitives for agent workflows
Agent Coverage Tools Local Coverage analysis for agent branches and tool calls

Notes: Several of these projects already appear elsewhere in this document (agenttrace, Inspect AI, Patronus). New local docs link to conceptual testing tools not yet listed elsewhere.

Industry-Specific Agents

Curated list of vertical agent solutions for finance, healthcare, legal, manufacturing, and government.

Finance

  • Anthropic Finance Agents - Domain-specialized agents for earnings analysis and portfolio insight (🏷️ Cloud Finance Enterprise).
  • JPMorgan AlphaBlue - Trading and market analysis agent integrations for institutional workflows (🏷️ Cloud Finance Enterprise).
  • Goldman Sachs Marquee AI - Market data and analytics agents built on Marquee platform (🏷️ Cloud Finance Enterprise).
  • Morgan Stanley AdvisorBot - Financial advisory assistant for advisors and retail clients (🏷️ Cloud Finance Enterprise).

Healthcare

  • Epic AI Assistant (https://www.epic.com) - Clinical documentation and workflow agents integrated with Epic HER (🏷️ Cloud Healthcare Enterprise).
  • Nuance Healthcare AI - Speech and clinical AI solutions for documentation and radiology (🏷️ Cloud Healthcare Enterprise [HIPAA]).
  • Tempus AI - Oncology research and clinical decision support agents (🏷️ Cloud Healthcare Research).

Legal

  • LexisNexis AI - Document review and legal research agents (🏷️ Cloud Legal Enterprise).
  • Westlaw AI-Assisted Research - Case law analysis and brief drafting agents (🏷️ Cloud Legal Enterprise).
  • ROSS Intelligence - Contract analysis and legal research agents (🏷️ Cloud Legal Platform).

Manufacturing & Industrial

  • Siemens AI Ops - Factory-floor optimization and predictive maintenance agents (🏷️ Cloud Industrial Enterprise).
  • GE Predix Agents - Equipment monitoring and incident prediction agents for industrial fleets (🏷️ Cloud Industrial Enterprise).

Government & Compliance

  • Anthropic Government Agents - Policy analysis and public sector agents for regulated workflows (🏷️ Cloud Government Enterprise).
  • Tax & Insurance Agent Platforms - Generic category placeholder for compliance-focused tax and underwriting agents (🏷️ Cloud Compliance Enterprise).

Learning Resources

  • AgentBench: Evaluating LLMs as Agents - The benchmark paper for evaluating LLMs as agents across diverse environments (🏷️ Paper ArXiv PDF).
  • AI Agents in LangGraph - Short course on building production agents with LangGraph by Andrew Ng's platform (🏷️ Course DeepLearning.ai Web).
  • AI Engineering by Chip Huyen - Comprehensive guide on AI systems design and deployment covering agent architecture patterns (🏷️ Book O'Reilly Print).
  • Anthropic Cookbook - Collection of Claude agent recipes and integration patterns from Anthropic (🏷️ Guide Anthropic GitHub).
  • Building Effective Agents - Anthropic's guide on agent design patterns, evaluation strategies, and production best practices (🏷️ Guide Anthropic Web).
  • Hugging Face Agents Course - Open-source course on building AI agents using Hugging Face tools and models (🏷️ Course HuggingFace Web).
  • LATS: Language Agent Tree Search - Combines Monte Carlo tree search with LLM reasoning for complex multi-step planning tasks (🏷️ Paper Python Research).
  • LLM Powered Autonomous Agents - Deep breakdown of LLM-powered agent components: planning, memory, and tool use (🏷️ Blog Weng Web).
  • Microsoft GenAI for Beginners - 21-lesson course on generative AI concepts and agent development from Microsoft (🏷️ Course Microsoft GitHub).
  • OpenAI Cookbook - Practical guides and recipes for building with OpenAI APIs including agent patterns (🏷️ Guide OpenAI GitHub).
  • Prompt Engineering Guide - Community-maintained guide covering prompt engineering techniques and agent strategies (🏷️ Guide DAIR Web).
  • ReAct: Synergizing Reasoning and Acting in Language Models - The foundational paper behind the ReAct prompting pattern used in most agent frameworks (🏷️ Paper ArXiv PDF).
  • Reflexion - Research framework letting agents learn from past mistakes via iterative verbal self-reflection loops (🏷️ Paper Python Research).
  • Tree of Thoughts - Explores multiple parallel reasoning paths before committing to a final answer for complex problems (🏷️ Paper Python Research).

Agent Communication

Capabilities that let agents send messages, notifications, and reports across channels.

  • Apprise - Sends agent notifications to 100+ services including Slack, Discord, and email from a single Python interface (🏷️ Python Multi-Channel SDK).
  • Discord.py - Lets agents interact with Discord channels and servers for team-facing real-time communication (🏷️ Python Discord SDK).
  • FastAPI-Mail - Adds async email sending to FastAPI-based agent services with SMTP and OAuth2 support (🏷️ Python FastAPI SDK).
  • Novu - Routes agent notifications across email, SMS, push, and chat from a unified API and workflow engine (🏷️ TypeScript Python Platform).
  • Ntfy - Pushes real-time agent notifications to phones and desktops via a dead-simple HTTP API (🏷️ Go Self-hosted API).
  • Resend - Sends transactional emails from agent workflows with a clean, developer-first API (🏷️ TypeScript Cloud API).
  • Slack Bolt - Enables agents to send, receive, and react to Slack messages with event-driven listeners (🏷️ Python Slack SDK).
  • The Colony - Provides an agent-only social network with a REST API, MCP server, and A2A agent-card so agents can post, comment, and DM each other autonomously (🏷️ Cloud MCP API Platform).
  • Twilio - Sends SMS and voice calls from agent workflows to any phone number worldwide (🏷️ Python Cloud API).

Data Pipeline and Workflow

Frameworks for orchestrating data transformations and long-running agent-driven workflows.

  • AI for Database - Connects to any database for plain English queries with self-refreshing dashboards and automated workflows (🏷️ Cloud Database Platform).
  • Burr - Tracks and manages multi-step agent state machines with built-in observability and persistence (🏷️ Python StateGraph SDK).
  • Dagster - Manages data assets and pipelines with built-in lineage tracking for data-intensive agent systems (🏷️ Python Pipeline Platform).
  • GPT Researcher - Conducts multi-source autonomous web research and produces cited, structured long-form reports (🏷️ Python LangChain Web).
  • Hamilton - Defines modular data transformations as Python functions wired automatically into a DAG for agent pipelines (🏷️ Python DAG SDK).
  • Hex AI - Collaborative data platform with AI-powered analysis and notebook-based data exploration for teams (🏷️ Cloud Data Platform).
  • Julius AI - Upload CSV or Excel files and analyze data using natural language questions for instant insights (🏷️ Cloud Data Web).
  • PandasAI - Chat with your data using natural language queries that convert to Pandas and SQL operations (🏷️ Python Data SDK).
  • Prefect - Orchestrates agent workflows and data pipelines with retries, caching, and built-in observability (🏷️ Python Pipeline Platform).
  • Signals CLI - Intent signal CLI detecting LinkedIn engagers, keyword posters, and funding events with JSON output for agent pipelines (🏷️ Cloud CLI Platform).
  • STORM - Generates Wikipedia-style long-form articles by autonomously researching and synthesizing multiple sources (🏷️ Python Stanford Research).

Secure Execution Environments

Infrastructure for running agent-generated code safely in isolated, sandboxed, or serverless environments.

💡 Quick Decision Guide:

  • Fastest hosted sandbox: E2B, Daytona
  • Self-hosted VM isolation: Firecracker, Kata Containers, gVisor
  • Container-based: Docker, Podman, Sysbox
  • Serverless: AWS Lambda, Google Cloud Run, Cloudflare Workers
Tool Isolation Cold Start Languages
E2B VM (Firecracker) ~300ms Python, TS, Go
Daytona Container/VM ~2s Python, TS, Go
Firecracker microVM ~125ms REST API
Docker Namespace+cgroup ~500ms All
AWS Lambda Firecracker VM ~200ms Python, TS, Go
Cloudflare Workers V8 isolate ~0ms TS, Rust (WebAssembly)
  • agentbox - Docker for AI Agents — package, test, version, and govern AI agents across every framework (🏷️ Python Docker CLI).
  • agentnotary - Notarize, govern, and audit AI agents — cryptographic seal, runtime guard, EU AI Act docs, and adversarial fuzzer (🏷️ Python Compliance CLI).
  • AWS Lambda - Serverless function runtime with Firecracker isolation for executing short-lived agent tasks at scale (🏷️ Python AWS Serverless).
  • Cloudflare Workers - Ultra-low-latency V8-isolate serverless runtime at the edge for lightweight agent task execution (🏷️ TypeScript Cloudflare Serverless).
  • CodeSandbox SDK - Provides forked, snapshotted microVM environments for real-time collaborative agent code execution (🏷️ TypeScript Python SDK).
  • Daytona - Self-hostable git-based developer environment manager for sandboxed agent code execution with fast provisioning (🏷️ Python TypeScript Platform).
  • Docker - Standard container runtime providing namespace and cgroup isolation for packaging and running agent workloads (🏷️ Go Docker Platform).
  • Firecracker - Lightweight microVM technology with 125ms boot time used by AWS Lambda and E2B under the hood (🏷️ Rust AWS Infrastructure).
  • Google Cloud Run - Fully managed serverless platform with gVisor isolation for running stateless agent containers at scale (🏷️ Python GCP Serverless).
  • gVisor - User-space kernel that intercepts syscalls to provide VM-level security with container-level overhead (🏷️ Go Google Infrastructure).
  • Kata Containers - Combines VM-level security with the developer experience of containers using lightweight VMs (🏷️ Go OCI Infrastructure).
  • Podman - Daemonless, rootless container engine ideal for running untrusted agent-generated code securely (🏷️ Go Rootless CLI).
  • Sysbox - Enhanced container runtime that enables Docker-in-Docker securely for nested agent execution environments (🏷️ Go Docker Infrastructure).

Creative AI Agents

Tools for generating images, video, music, audio, and 3D assets using AI models.

Image Generation

Generator Strength Open Source Pricing
Midjourney v7 Artistic quality No $10-120/mo
FLUX 2 Photorealism Yes Free / API
Stable Diffusion Full control Yes Free (OSS)
Ideogram v3 Text rendering No Free / $7+/mo
Google Imagen 4 Highest fidelity No API
  • Adobe Firefly 3 - Generates commercially safe images from text prompts using a model trained exclusively on licensed data (🏷️ Cloud Adobe CC Web).
  • DALL-E 3.5 - Generates detailed images from text prompts with 95% text accuracy integrated directly into ChatGPT (🏷️ Cloud OpenAI API).
  • FLUX 2 - Open-weight photorealism model producing 4K output at 6x faster generation speed than its predecessor (🏷️ Python Open-Weight API).
  • Google Imagen 4 - Produces state-of-the-art photorealistic images from text via Google DeepMind accessible through AI Studio (🏷️ Cloud Google API).
  • Ideogram v3 - Renders readable text inside generated images with near-zero spelling errors for logos and marketing (🏷️ Cloud Freemium Web).
  • Leonardo AI - Multi-model image platform with Realtime Canvas and 3D gaming asset generation now owned by Canva (🏷️ Cloud Canva Web).
  • Midjourney v7 - Produces the highest artistic quality images with unmatched aesthetics via Discord and web interface (🏷️ Cloud Discord Web).
  • Recraft - Design-first image generator with native vector art output and brand consistency tools for designers (🏷️ Cloud Freemium Web).
  • Stable Diffusion 3.5 - Open-source foundation model with full ControlNet, LoRA, and ComfyUI ecosystem for custom pipelines (🏷️ Python Open-Source Local).

Video Generation

Generator Max Length Resolution Open Source
Sora 2 25 sec 1080p No
Google Veo 3.1 Variable 4K No
Runway Gen-4.5 Variable 1080p+ No
Kling 3.0 2 min 4K No
Wan 2.1 Unlimited Variable Yes
HunyuanVideo Unlimited Variable Yes
  • Google Veo 3.1 - Generates the highest cinematic quality video with native audio from text prompts via Google DeepMind (🏷️ Cloud Google API).
  • HaiLuo AI - Budget video generation platform offering 10 free videos per day with HD output quality (🏷️ Cloud Freemium Web).
  • HunyuanVideo - Open-source video generation model from Tencent that runs on consumer-grade GPUs (🏷️ Python Open-Source Local).
  • Kling 3.0 - Generates 4K video up to 2 minutes long with native audio at the most competitive pricing available (🏷️ Cloud Freemium Web).
  • LTX Video - Commercially safe open-source video generation model trained on fully licensed data (🏷️ Python Open-Source Local).
  • Luma Dream Machine - Creates physics-accurate 4K HDR video scenes with 3D-style output for cinematic quality (🏷️ Cloud Freemium Web).
  • Pika 2.5 - Beginner-friendly video generation tool with fast turnaround and intuitive editing controls (🏷️ Cloud Freemium Web).
  • Runway Gen-4.5 - Professional video generation with Motion Brush and the highest benchmark scores for visual quality (🏷️ Cloud Pro Web).
  • Seedance 2.0 - Quad-modal input video generator with built-in lip-sync and 2K resolution output (🏷️ Cloud Freemium Web).
  • Sora 2 - Generates 25-second cinematic video clips with strong narrative coherence from text prompts by OpenAI (🏷️ Cloud OpenAI Web).
  • Wan 2.1 - Open-source self-hosted video generation model with unlimited length and no per-video costs (🏷️ Python Open-Source Local).

Music and Audio Generation

  • ElevenLabs Music - Generates vocals and instrumentals with stem separation and per-section editing for audio production (🏷️ Cloud ElevenLabs Web).
  • Meta AudioCraft - Open-source audio generation suite including MusicGen and AudioGen with fine-grained prompt control (🏷️ Python Open-Source Local).
  • Stable Audio - Generates instrumental audio tracks with a commercial license included by default for all outputs (🏷️ Cloud Stability AI Web).
  • Suno - Generates complete songs with vocals, instruments, and lyrics from a single text prompt (🏷️ Cloud Freemium Web).
  • Udio - Produces high-fidelity music with fine-grained sectional control over generation and arrangement (🏷️ Cloud Freemium Web).

3D and Design Generation

  • Meshy - Generates 3D models from text or images for game assets and product visualization with multi-format export (🏷️ Cloud Freemium Web).
  • Tripo AI - Produces fast 3D model generation from text or images with multi-format export for rapid prototyping (🏷️ Cloud Freemium Web).
  • Vizcom - Renders industrial design sketches into photorealistic 3D visuals in real-time for product designers (🏷️ Cloud Pro Web).

Customer Support and CRM Agents

AI agents that automate customer support, CRM workflows, sales outreach, and ticket resolution at scale.

Support Automation

Platform Auto-Resolution Best For
Intercom Fin 50%+ tickets SaaS teams
Ada 60%+ tickets Complex resolution
Zendesk AI 30%+ tickets Enterprise
Freshdesk Freddy 25%+ tickets SMBs
  • Ada - Resolves 60% of customer support tickets automatically with complex multi-turn query understanding (🏷️ Cloud Enterprise Platform).
  • Assembled - Routes support tickets with workforce-aware scheduling and intelligent handoff to human agents (🏷️ Cloud Enterprise Platform).
  • Dixa - CRM-first conversational support platform with AI-powered routing and customer context enrichment (🏷️ Cloud Enterprise Platform).
  • Freshdesk Freddy AI - Auto-triages and routes support tickets with smart AI suggestions for budget-conscious SMB teams (🏷️ Cloud Freshworks Platform).
  • Intercom Fin - Resolves 50% of SaaS support tickets by learning directly from your help center and knowledge base (🏷️ Cloud Intercom Platform).
  • Zendesk AI - Automates 30% of enterprise support tickets with deep integration into the existing Zendesk ecosystem (🏷️ Cloud Zendesk Platform).

AI-Powered CRM Platforms

  • Creatio - No-code CRM platform with pre-built AI agents for sales, marketing, and service automation (🏷️ Cloud No-Code Platform).
  • Dynamics 365 Copilot - Drafts, summarizes, and translates across Office 365 with deep Microsoft ecosystem integration (🏷️ Cloud Microsoft Platform).
  • Monday CRM Lexi - AI sales agent with automated lead sourcing, qualification, and pipeline management for SMBs (🏷️ Cloud Monday Platform).
  • Pipedrive AI - AI-powered email drafting, deal prioritization, and smart reporting for small sales teams (🏷️ Cloud Pipedrive Platform).
  • Salesforce Einstein - Enterprise AI with predictions, autonomous agents, and CRM automation across the Salesforce platform (🏷️ Cloud Salesforce Platform).
  • Salesmate - CRM with AI-powered call summarization, lead qualification, and pipeline automation for small teams (🏷️ Cloud CRM Platform).
  • ServiceNow AI Agents - Orchestrates AI agents across IT, HR, and CRM workflows for enterprise service management (🏷️ Cloud ServiceNow Platform).
  • XMACNA Funcionarios Digitais - Provides AI/IA agents for WhatsApp, phone, CRM updates, lead qualification, follow-up, and human handoff (🏷️ Cloud CRM Sales WhatsApp).
  • Zoho CRM Zia - Predictive scoring, sentiment analysis, and voice commands for sales automation across Zoho products (🏷️ Cloud Zoho Platform).

Sales and Outreach Agents

  • Apollo.io - AI prospecting platform with 275M+ contacts, lead scoring, and automated email sequencing (🏷️ Cloud Freemium Platform).
  • Clay - Enriches leads from 70+ data providers and generates hyper-personalized outreach at scale (🏷️ Cloud Pro Platform).
  • Instantly - Generates AI cold emails with smart sender rotation and built-in domain warmup for deliverability (🏷️ Cloud Pro Platform).
  • Lavender - Coaches email writing in real-time with AI response scoring and recipient intelligence (🏷️ Cloud Freemium Platform).
  • Overloop CLI - AI outbound CLI agent that sources 450M+ contacts and runs email plus LinkedIn campaigns with JSON output (🏷️ Cloud CLI Platform).

Voice Agent Platforms

Platforms for building, deploying, and scaling voice-based AI agents across calls, support, and live conversations.

Platform Latency HIPAA Best For
ElevenLabs 75ms Yes Industry benchmark
Vapi Low No Developer-first
Bland AI Medium Yes Outbound call automation
PolyAI Low Yes Enterprise scale
  • AgentLine - Telephony infrastructure for AI agents — provision phone numbers, make/receive calls, and manage voice pipelines via API (🏷️ Cloud Telephony API).
  • AssemblyAI - Speech-to-text API with speaker diarization, sentiment analysis, and summarization for voice agent pipelines (🏷️ Cloud STT API).
  • Bland AI - Automates outbound phone calls at scale with SOC2 and HIPAA compliance and CRM integration (🏷️ Cloud Telephony API).
  • Deepgram - Sub-300ms speech-to-text and text-to-speech APIs purpose-built for real-time voice agent pipelines (🏷️ Cloud STT/TTS API).
  • ElevenLabs - Industry-leading voice AI with 75ms latency, Conversational AI 2.0, RAG, and HIPAA compliance (🏷️ Cloud Voice Platform).
  • HeyGen - Creates AI talking avatars with voice cloning and lip-sync for video-based agent interactions (🏷️ Cloud Avatar Web).
  • PolyAI - Enterprise voice AI platform for natural multi-turn conversations with high-volume call handling (🏷️ Cloud Enterprise Platform).
  • Retell AI - Builds human-like voice agents with multi-language telephony support and low-latency responses (🏷️ Cloud Telephony API).
  • Synthesia - Generates AI video avatars that speak in 120+ languages for training and communication agents (🏷️ Cloud Avatar Web).
  • Synthflow - No-code voice agent builder with pre-built templates for SMBs to deploy phone agents quickly (🏷️ Cloud No-Code Platform).
  • Voiceflow - No-code builder for voice and chat agents with visual conversation design and team collaboration (🏷️ Cloud No-Code Platform).

Deep Research Agents

AI platforms that conduct autonomous multi-step research, synthesize findings from multiple sources, and generate structured reports with citations.

Agent Context Window Approach
Claude Research 200K tokens Multi-step with citations
ChatGPT Research 128K tokens Extended reasoning + browse
Gemini Research 1M tokens Google Search + KG
Perplexity Pro Variable Real-time cited search
  • CAJAL - Local AI agent that generates publication-ready IMRaD scientific papers with verified arXiv citations and AI tribunal scoring (🏷️ Python Ollama Local).
  • ChatGPT Deep Research - Conducts extended reasoning with web browsing to produce structured research reports with Canvas output (🏷️ Cloud OpenAI Web).
  • Claude Deep Research - Performs multi-step investigation with verified source citations and 200K token context window (🏷️ Cloud Anthropic Web).
  • DeerFlow - Multi-agent research system from ByteDance with planning and execution loops for autonomous investigation (🏷️ Python Open-Source Research).
  • Gemini Deep Research - Leverages Google Search and Knowledge Graph integration with 1M token context for breadth-first research (🏷️ Cloud Google Web).
  • Perplexity Pro - Provides real-time search answers with inline citations and follow-up threads for iterative research (🏷️ Cloud Freemium Web).
  • Perplexity Personal Computer - Autonomous research agent with vision capabilities, real-time search integration, and document analysis for comprehensive research workflows (🏷️ Cloud Research Consumer).

Prompt-to-App Builders

Platforms that generate full-stack applications, components, or web apps from natural language prompts.

Builder Output Hosting
Bolt.new Full-stack app Browser/deploy
Lovable Full web app Auto-deploy
v0 by Vercel React components Vercel
Replit Agent Full-stack Replit
  • Bolt.new - Generates full-stack web applications in the browser from natural language prompts with instant deploy (🏷️ Cloud Freemium Web).
  • Dyad - Privacy-first open-source prompt-to-app builder that generates full-stack applications locally (🏷️ TypeScript Open-Source Local).
  • Google Project IDX - AI-powered cloud IDE with free Claude Opus access for building full projects across all frameworks (🏷️ Cloud Google Web).
  • Lovable - Iterates from chat to deployed web application with auto-deploy and real-time preview (🏷️ Cloud Freemium Web).
  • PlayCode Agent - Browser-only lightweight web app creation tool with AI-assisted code generation (🏷️ Cloud Pro Web).
  • Replit Agent - Builds and deploys full-stack projects from prompts with integrated hosting and collaboration (🏷️ Cloud Freemium Web).
  • v0 by Vercel - Generates React and Tailwind UI components from text descriptions with one-click Vercel deployment (🏷️ Cloud Freemium Web).

Multi-Agent Consumer Platforms

All-in-one AI platforms providing access to agents, tools, and models through consumer-facing interfaces without writing code.

Platform Context Flagship Features
ChatGPT 128K GPTs, Deep Research, Canvas, Agent Mode
Claude 200K Computer Use, MCP, Cowork
Gemini 1M Deep Think, Gems, NotebookLM
Grok 131K Real-time X data, Grok Build
  • ChatGPT - General-purpose AI platform with GPTs, Deep Research, Canvas, and Agent Mode across the widest plugin ecosystem (🏷️ Cloud OpenAI Web).
  • Claude - Reasoning-focused AI with Computer Use, MCP integration, and Chrome browsing for coding and long document analysis (🏷️ Cloud Anthropic Web).
  • Coze - No-code agent builder with a marketplace of 100+ custom agents and plugin integrations (🏷️ Cloud ByteDance Web).
  • Gemini - Google's multimodal AI with 1M token context, Deep Think, Gems, and NotebookLM for workspace integration (🏷️ Cloud Google Web).
  • Grok - Real-time AI with live X data access, Grok Build for 8-agent parallel code generation, and image generation (🏷️ Cloud xAI Web).
  • Meta AI - Llama-powered AI integrated across WhatsApp, Messenger, and Instagram for conversational assistance (🏷️ Cloud Meta Web).
  • Microsoft Copilot - AI assistant integrated into Office 365, Teams, and Power Platform for enterprise productivity workflows (🏷️ Cloud Microsoft Web).
  • Sistava - AI agent orchestration platform for deploying multi-channel agents across messaging, voice, and APIs with full Computer Use capabilities on your own OS (🏷️ Cloud Sistava Web).

Open-Source Models for Agents

Open-weight LLMs optimized for agentic tasks including tool use, planning, multi-step reasoning, and code generation.

Model Organization Parameters Agent Strength
Llama 4 Meta 8B-405B+ Strong tool use
Qwen 3 Alibaba 0.6B-235B MCP-native, multilingual
DeepSeek V3/R1 DeepSeek 671B MoE 68x cheaper than GPT-4
Gemma 3 Google 1B-27B Efficient edge agents
  • Baichuan 3 - Chinese-optimized open model family with strong domain performance for multilingual agent deployments (🏷️ Python Open-Source Local).
  • Command R+ - Cohere's 104B parameter model optimized specifically for RAG and enterprise tool use in agent workflows (🏷️ Cloud Cohere API).
  • DeepSeek V3 - 671B MoE model that delivers GPT-4 level reasoning at 68x lower cost for budget-conscious agent deployments (🏷️ Python Open-Source Local).
  • Falcon 3 - Efficient open model from TII UAE with strong multilingual support across 10B to 180B parameter variants (🏷️ Python Open-Source Local).
  • Gemma 3 - Efficient multimodal model from Google in 1B to 27B sizes ideal for edge and on-device agents (🏷️ Python Google Local).
  • Gemma 4 - Consumer and IoT-optimized model from Google in 2B to 31B sizes with E2B and E4B variants (🏷️ Python Google Local).
  • GLM-4 - Zhipu AI's 744B MoE model achieving the lowest hallucination rate and 77.8% on SWE-bench (🏷️ Python Open-Source Local).
  • InternLM 3 - Shanghai AI Lab model with strong long-context reasoning capabilities for complex agent tasks (🏷️ Python Open-Source Local).
  • Llama 4 - Meta's flagship open model family with Scout and Maverick variants and strong tool use capabilities (🏷️ Python Meta Local).
  • Mistral Large 2 - Leading European open model at 123B parameters with strong function calling for agent workflows (🏷️ Python Mistral API).
  • Phi-4 - Microsoft's compact 14B parameter model delivering strong reasoning for on-device and edge agent deployments (🏷️ Python Microsoft Local).
  • Qwen 3 - MCP-native multilingual model family from Alibaba with thinking mode across 0.6B to 235B sizes (🏷️ Python Alibaba Local).
  • Qwen 3.6-Plus - Agentic-focused model with 1M context window, repo-level coding, and MCP-native tool use (🏷️ Python Alibaba Local).
  • StarCoder 2 - Code-focused open model trained on 600+ programming languages for specialized coding agents (🏷️ Python BigCode Local).
  • Yi-Lightning - 01.AI's multilingual model with long context and fast inference across 6B to 200B parameter sizes (🏷️ Python Open-Source Local).

Market Stats and Benchmarks 2026

Key data points on the AI agent market covering adoption, growth, valuations, and fastest-growing open-source projects.

Market Size

Year Market Size Growth
2024 $5.1B Baseline
2025 $7.6B +49% YoY
2026 $10.91B +43% YoY
2027 $16.8B +54% YoY
2030 $52.63B 46.3% CAGR

Adoption Rates

Metric Value
Orgs with agents in production 57%
Devs using AI coding tools regularly 85%
Companies planning agent deployment 72%
Enterprises with dedicated agent budget 41%

Top Use Cases

Use Case Share
Customer Service 26.5%
Research and Analysis 24.4%
Workflow Automation 18.0%
Code Generation 15.0%
Data Processing 10.0%

Fastest Growing Open-Source Projects

Project Stars (Jan 2026) Stars (May 2026) Growth
Browser Use 10K 50K 5x
Playwright MCP 1K 10K 10x
DeerFlow 5K 25K 5x

Notable Valuations

Company Valuation
OpenAI $90B+
Cursor $29.3B
Anthropic $20B
ElevenLabs $11B
Mistral $6B
Hugging Face $4.5B
  • AI Agent Market Report 2026 - Comprehensive market analysis projecting the AI agent market to reach $52.63B by 2030 at 46.3% CAGR (🏷️ Report Market Research).
  • State of AI Agents Survey - Enterprise survey showing 57% of organizations now have AI agents in production workflows (🏷️ Report Survey Research).

Local and Self-Hosted AI

Tools for running LLMs locally and self-hosting AI agent platforms with full privacy and control.

  • GPT4All - Open-source local chat application that runs LLMs on consumer hardware without cloud dependencies (🏷️ C++ Desktop Local).
  • Jan - Open-source ChatGPT alternative that runs 100% offline with local model management and privacy guarantees (🏷️ TypeScript Electron Desktop).
  • Llamafile - Distributes LLMs as single executable files with zero setup required from Mozilla (🏷️ C++ Portable Local).
  • llama.cpp - C/C++ LLM inference engine supporting CPU, GPU, and Apple Silicon as the foundation of local AI (🏷️ C++ Cross-Platform Local).
  • LM Studio - Desktop application for running local LLMs with a polished UI across all major platforms (🏷️ TypeScript Electron Desktop).
  • LocalAI - Drop-in OpenAI API replacement that runs models locally without requiring a GPU (🏷️ Go Docker Local).
  • Ollama - Run LLMs locally with a dead-simple CLI interface and 162K+ GitHub stars (🏷️ Go CLI Local).
  • vLLM - High-throughput LLM serving engine with PagedAttention for production-grade local inference (🏷️ Python CUDA Local).
  • Yao Agents - Local-first AI execution platform with Docker sandbox isolation, BYOK model configuration, MCP support, 5-stage Pipeline, and multi-platform messaging via WeChat, Feishu, DingTalk, Telegram, and Discord (🏷️ Go TypeScript Docker Desktop Self-Hosted).

AI Governance and Compliance

Frameworks and tools for AI risk management, regulatory compliance, and governance as EU AI Act obligations take effect August 2026.

  • Credo AI - End-to-end AI governance platform with EU AI Act policy packs and model inventory management (🏷️ Cloud Governance Platform).
  • EU AI Act - Official EU regulatory framework classifying AI systems by risk tier from unacceptable to minimal risk (🏷️ Policy Regulation EU).
  • IBM watsonx.governance - Enterprise AI risk, compliance, and model monitoring platform for regulated industries (🏷️ Cloud IBM Platform).
  • Nobulex - Cryptographic receipts for AI agent actions with two Ed25519 signatures per action, hash-chained for tamper-evident audit trails (🏷️ TypeScript Security Cryptography).
  • NIST AI RMF - US framework for AI risk management covering Govern, Map, Measure, and Manage functions (🏷️ Policy Framework US).
  • OneTrust AI Governance - Risk classification, consent management, and compliance workflows for AI agent deployments (🏷️ Cloud Compliance Platform).
  • Project Glasswing - Industry consortium (11-company initiative) focused on AI safety, red-teaming, and shared benchmarks for agent risk mitigation (🏷️ Consortium Safety Research).

Cybersecurity Agents

AI agents specialized in penetration testing, vulnerability discovery, threat detection, and security analysis.

  • Microsoft Security Copilot - Enterprise threat detection and incident response AI integrated across Microsoft security products (🏷️ Cloud Microsoft Platform).
  • PentestGPT - GPT-powered penetration testing tool with automated reasoning for vulnerability assessment (🏷️ Python Security CLI).
  • YAWNING TITAN - Graph-based cybersecurity simulation environment for training and testing defensive AI agents (🏷️ Python Simulation Research).

Healthcare and Therapy Agents

AI agents for mental health support, cognitive training, and therapy-adjacent applications with clinical validation.

  • Akili Interactive - FDA-cleared video game-based digital medicine for ADHD using neuroplasticity-focused cognitive training (🏷️ Cloud FDA-Cleared Medical).
  • Elomia - Clinician-designed AI therapy chatbot with natural conversation flow and anonymous interaction by design (🏷️ Cloud Mental Health Web).
  • Ginger - On-demand mental healthcare platform combining AI coaching with live therapists using CBT and DBT approaches (🏷️ Cloud Mental Health Mobile).
  • Headspace Health - Meditation and mental health platform with CBT-based courses, sleep programs, and clinical partnerships (🏷️ Cloud Wellness Mobile).
  • Replika - AI companion with Rogerian-style support, persistent memory, and mood tracking for social wellbeing (🏷️ Cloud Companion Mobile).
  • Talkspace AI - AI-assisted therapy matching platform with human therapist backup for personalized mental health care (🏷️ Cloud Therapy Mobile).
  • Tess by X2AI - SMS-based therapy coach using CBT with clinical validation showing 28% depression reduction (🏷️ Cloud Mental Health SMS).
  • Woebot - CBT-based mental health chatbot that is FDA-cleared and Stanford-validated for therapeutic conversations (🏷️ Cloud FDA-Cleared Mobile).
  • Wysa - NHS-approved AI mental health companion using CBT and DBT with anonymous design principles (🏷️ Cloud Mental Health Mobile).
  • Youper - Emotional health assistant using CBT and ACT with Stanford-tested clinical effectiveness for mood tracking (🏷️ Cloud Mental Health Mobile).

Newsletters and Communities

Curated newsletters, podcasts, and communities for staying current with AI agent development.

  • AI Engineering Newsletter - AI engineering podcast and newsletter by Swyx and Alessio covering agent architectures and tooling (🏷️ Newsletter Podcast Web).
  • aibtc.news - Bitcoin-focused agent news platform with bounties and classifieds for the agent economy (🏷️ Newsletter Bitcoin Web).
  • Awesome Agents Newsletter - Weekly curated tools and reviews covering the latest in AI agent development (🏷️ Newsletter Weekly Web).
  • r/ClaudeAI - Reddit community for Claude users sharing agent workflows, prompts, and integration patterns (🏷️ Community Reddit Forum).
  • r/LangChain - Reddit community for agent developers using LangChain, LangGraph, and related frameworks (🏷️ Community Reddit Forum).
  • r/LocalLLaMA - Reddit community for self-hosted LLM users sharing local deployment and agent setup guides (🏷️ Community Reddit Forum).
  • The Rundown AI - Daily AI digest reaching 600K+ subscribers with concise coverage of agent news and launches (🏷️ Newsletter Daily Web).

Changelog

See CHANGELOG.md for the full update history.

Contributing

Your contributions are what keep this list useful. Read Contributing.md for the entry format, inclusion criteria, and style guide.

Star History

Star History Chart

About

Awesome AI Agents for 2026 - 300+ AI Agents, Frameworks & Coding • Creative • Voice • Research • Enterprise. Comparison guides, benchmarks & deep dives

Topics

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors