The most comprehensive, structured guide to AI agent frameworks, tools, and resources.
Updated weekly. Compared side-by-side. Built for developers who ship.
- Orchestration Frameworks
- Coding Agents
- Memory and Context
- Multi-Agent Systems
- Agent Communication Protocols
- Browser and Computer Use Agents
- Agent Tooling and Infrastructure
- Low and No-Code Builders
- Voice and Multimodal Agents
- Safety Guardrails and Observability
- Agent Interfaces and UIs
- Agent Deployment and Hosting
- Agent Evaluation and Benchmarks
- Agent Testing & Debugging
- Industry-Specific Agents
- Learning Resources
- Agent Communication
- Data Pipeline and Workflow
- Secure Execution Environments
- Creative AI Agents
- Customer Support and CRM Agents
- Voice Agent Platforms
- Deep Research Agents
- Prompt-to-App Builders
- Multi-Agent Consumer Platforms
- Open-Source Models for Agents
- Market Stats and Benchmarks 2026
- Local and Self-Hosted AI
- AI Governance and Compliance
- Cybersecurity Agents
- Healthcare and Therapy Agents
- Newsletters and Communities
- Changelog
- Star History
- Agency Swarm - Orchestrates multi-agent systems built on the OpenAI Assistants API with role-based collaboration (🏷️
PythonOpenAISDK). - AgentScope - Alibaba multi-agent framework with distributed deployment and fault tolerance for production use (🏷️
PythonAlibabaSDK). - Agno - Multi-agent framework with a runtime and control plane for managing agent deployments at scale (
PythonFastAPICloud) (🏷️PythonFastAPICloud). - Anthropic SDK - Official Claude SDK with tool use, computer control, and streaming for building Claude-native agents (🏷️
PythonAnthropicSDK). - AutoGen - Event-driven multi-agent framework merged with Semantic Kernel for production workflows (🏷️
PythonSemantic KernelSDK). - AWS Bedrock Agents - Managed AWS infrastructure for agents with Bedrock models, enterprise compliance, and auto-scaling deployment (🏷️
PythonAWSCloud[Enterprise][SOC2]). - CAMEL - Role-based multi-agent simulation framework for collaborative reasoning and task completion (🏷️
PythonResearchSDK). - CopilotKit - The open-source AI copilot platform for React, allowing developers to embed custom AI agents directly into their applications (🏷️
TypeScriptReactSDK). - CrewAI - Production multi-agent framework with 5K+ GitHub stars, role-based collaboration, async execution, and 1500+ company adoptions (🏷️
PythonLangChainSDK[Enterprise]). - DSPy - Programmatically optimizes LLM prompts and chains using machine learning instead of manual prompt engineering (🏷️
PythonStanfordSDK). - Ontheia - Self-hosted, open-source AI agent platform with Chain Engine for visual workflow automation, MCP-native tool integration, multi-provider support (Claude, OpenAI, Gemini, Ollama), long-term memory via pgvector, multi-user RBAC, and GDPR-compliant architecture (🏷️
TypeScriptSelf-hostedMCPDocker[Enterprise][GDPR]). - Google ADK - Modular agent dev kit integrating Gemini and Vertex AI natively (🏷️
PythonVertex AISDK). - Haystack - Production-ready AI orchestration framework focused on building customizable LLM applications and RAG pipelines (🏷️
PythonRAGSDK). - LangGraph - Enterprise framework for stateful, graph-based agent workflows (🏷️
PythonLangChainSDK). - Letta - Formerly MemGPT. Stateful agents with built-in long-term memory and a REST API server (🏷️
PythonRESTSDK). - LightAgent - Lightweight Python agent framework with memory, MCP/SSE integration, reusable Skills, Tree-of-Thought planning, streaming, and LightSwarm multi-agent collaboration (🏷️
PythonMCPSDKMulti-Agent). - LlamaIndex - The leading framework for connecting LLMs to your data, with powerful indexing and retrieval capabilities (🏷️
PythonRAGSDK). - LangSmith Fleet - No-code agent orchestration platform with visual workflow builder, real-time monitoring, and enterprise deployment (🏷️
CloudLangChainPlatform[Enterprise]). - Mastra - Opinionated TypeScript framework with RAG, observability, and MCP support built in (🏷️
TypeScriptNext.jsSDK). - MicroAgent - Lightweight agent framework with self-editing prompts and code for minimal-overhead agent loops (🏷️
TypeScriptLightweightSDK). - Modus - Serverless framework for high-throughput agent workloads with minimal cold starts (🏷️
WebAssemblyHypermodeServerless). - Open-AutoGLM - Open-source phone agent model and framework for building mobile device automation agents (🏷️
PythonMobileSDK). - OpenAI Agents SDK - Lightweight multi-agent SDK with tracing and guardrails from OpenAI (🏷️
PythonOpenAISDK). - PraisonAI - Production multi-agent framework with self-reflection, MCP integration, and workflow automation (🏷️
PythonMulti-AgentCLI). - PydanticAI - Type-safe agent framework from the Pydantic team with a FastAPI-style developer experience (🏷️
PythonPydanticSDK). - Semantic Kernel - Microsoft enterprise SDK for Python, C#, and Java with modular plugins, memory, and goal planning (🏷️
C#MicrosoftSDK). - Smolagents - Hugging Face code-first framework where agents write and execute Python instead of JSON tool calls (🏷️
PythonHugging FaceSDK). - Strands Agents SDK - AWS model-driven agent SDK with native Bedrock integration (🏷️
PythonAWS BedrockSDK). - Upsonic - Minimalist agent framework with MCP support and rapid setup for quick prototyping (🏷️
PythonMCPSDK). - Vercel AI SDK - Streaming-first primitives for AI UIs with React Server Components and edge runtime support (🏷️
TypeScriptNext.jsSDK). - VoltAgent - TypeScript agent framework with built-in observability and a self-improving context engine (🏷️
TypeScriptReactSDK). - Vertex AI Agents - Google Cloud enterprise agent framework with native Gemini integration, scalable infrastructure, and compliance certifications (🏷️
PythonGoogleCloud[Enterprise][GDPR]).
- Aider - Terminal-first pair programmer that edits code in local repos, preserves Git history, and supports multi-file changes (🏷️
PythonCLILocal). - Amazon Q Developer - AWS-native AI coding assistant with Lambda, CloudWatch, infrastructure support, and security scanning (🏷️
PythonAWSIDE). - AutoGPT - Mature autonomous agent platform with Forge framework and public benchmarks for evaluating agent capabilities (🏷️
PythonForgeCLI). - Claude Code - Terminal-first agentic coding from Anthropic with Computer Use integration, multi-file edits, persistent shell sessions, Git operations, and fine-tuning support (🏷️
TypeScriptCLILocal[Anthropic]). - Cline - Autonomous coding agent in your IDE that creates/edits files, runs commands, and uses the browser with permission-gated steps (🏷️
TypeScriptVS CodeIDE). - CodeRabbit - AI-powered PR reviewer with inline suggestions, security scanning, and automatic code quality checks (🏷️
CloudGitHubWeb). - Codex CLI - OpenAI's lightweight, open-source terminal coding agent with fast execution and strong benchmark scores (🏷️
PythonCLILocal). - Codex-CLI - CLI tool that turns natural language commands into Bash, ZShell, and PowerShell equivalents (🏷️
C#CLILocal). - Continue - Source-controlled AI checks enforceable in CI, powered by the open-source Continue CLI (🏷️
TypeScriptVS CodeIDE). - Cursor - AI-native IDE with Cursor Cloud Agents supporting parallel task execution, $29.3B valuation, 100K+ enterprise users, and advanced agentic loops with real-time cursor control (🏷️
TypeScriptElectronDesktop[Enterprise]). - Devika - Open-source AI software engineer that understands high-level instructions and writes full-featured code autonomously (🏷️
PythonDockerWeb). - Devin - Fully autonomous AI software engineer showing 8-12x efficiency gains in production (Nubank case study), cloud sandbox with persistent state, GitHub integration, and enterprise adoption growing 40% MoM (🏷️
CloudWebSaaS[Enterprise]). - gemini-cli - Open-source AI agent that brings the power of Gemini directly into your terminal (🏷️
GoCLILocal). - GitHub Copilot - Agent Mode in VS Code with Copilot Workspace for issue-to-PR workflows across Claude, GPT, and Gemini (🏷️
TypeScriptVS CodeIDE). - Goose - Open-source extensible AI agent that goes beyond code suggestions, installs, executes, edits, and tests with any LLM (🏷️
RustCLILocal). - JetBrains AI - Deep AI integration across all JetBrains IDEs with context-aware completions and refactoring (🏷️
KotlinJetBrainsIDE). - Kiro - Spec-driven development agent that writes specs, auto-generates tasks, implements code, and automates DevOps workflows (🏷️
CloudAWSIDE). - Open Interpreter - Execute code locally via natural-language model instructions with a ChatGPT-like interface (🏷️
PythonCLILocal). - opencode - Open-source coding agent available as a desktop application with a visual interface (🏷️
TypeScriptElectronDesktop). - OpenHands - AI-driven development platform that writes, tests, and deploys code autonomously (🏷️
PythonDockerWeb). - PR-Agent - Open-source AI PR reviewer that auto-describes, reviews, and improves pull requests (🏷️
PythonGitHubCLI). - Qodo - AI code review platform with context-aware PR validation and security analysis (🏷️
CloudEnterpriseWeb). - RooCode - Cline fork with structured modes and reduced hallucinations for more reliable code generation (🏷️
TypeScriptVS CodeIDE). - Snyk Code - AI-powered security scanner with real-time vulnerability detection in agent-generated code (🏷️
CloudSecurityPlatform). - Sourcegraph Cody - AI coding assistant that excels at large codebases with an enterprise context engine (🏷️
TypeScriptEnterpriseIDE). - SWE-agent - Takes a GitHub issue and tries to automatically fix it. Also used for cybersecurity and competitive coding (🏷️
PythonDockerCLI). - Tabnine - Privacy-first AI code completion with on-premise deployment and codebase fine-tuning options (🏷️
TypeScriptEnterpriseIDE). - TaskWeaver - A code-first agent framework from Microsoft for seamlessly planning and executing data analytics tasks (🏷️
PythonJupyterCLI). - Windsurf - AI-native IDE with Cascade agent for multi-step autonomous tasks and team workflows (🏷️
TypeScriptElectronDesktop).
- Acontext - Manages agent skills and long-term memory as a layered data structure for persistent context (🏷️
PythonSDKLocal). - Chroma - Lightweight, embeddable vector store for building memory-augmented AI agents with fast semantic retrieval (🏷️
PythonTypeScriptSDK). - cognee - Knowledge engine for AI agent memory, set up in 6 lines of code with graph-based knowledge extraction (🏷️
PythonNeo4jSDK). - Cortex Memory - Full-stack solution for agent memory covering extraction, vector search, and optimization (🏷️
PythonVector DBSDK). - graphiti - Build real-time knowledge graphs for AI agents with automatic entity extraction and linking (🏷️
PythonKnowledge GraphSDK). - LanceDB - Serverless vector search database embedded directly in the agent process with no infrastructure needed (🏷️
RustPythonSDK). - Langmem - Helps agents learn and adapt from their interactions over time with persistent memory (🏷️
PythonLangChainSDK). - Mem0 - Memory layer for AI applications with long-term, short-term, and semantic memory extraction (🏷️
PythonVectorCloud). - Memoir - Git-like versioned semantic memory for AI agents with branching, commits, and cryptographic integrity over hierarchical paths (🏷️
PythonGit-likeSDK). - Memvid - Replace complex RAG pipelines with a serverless, single-file memory layer for instant retrieval (🏷️
PythonMultimodalSDK). - Milvus - Scales vector search to billions of embeddings for large-scale agent knowledge bases (🏷️
GoPythonPlatform). - Motorhead - Manages conversation context windows for agents with automatic background summarization (🏷️
RustPythonSDK). - Pathway - Live data RAG engine with real-time streaming for agents that need up-to-the-second knowledge (🏷️
PythonStreamingSDK). - Pinecone - Managed vector database with agent namespaces for multi-tenant isolation, hybrid search (vector + keyword), serverless auto-scaling, and $11B valuation (🏷️
CloudVector DBAPI[Enterprise][SOC2]). - Qdrant - High-performance vector similarity search engine with rich payload filtering for agent memory (🏷️
RustPythonSDK). - RAGFlow - Open-source RAG engine with agent capabilities and deep document understanding for knowledge bases (🏷️
PythonRAGPlatform). - SimpleMem - Efficient lifelong memory for LLM agents supporting both text and multimodal inputs (🏷️
PythonSDKLocal). - Supermemory - Extremely fast and scalable memory engine and API designed for the AI era (🏷️
TypeScriptVector DBWeb). - Weaviate - Stores and searches vector embeddings with hybrid keyword and semantic retrieval for agent knowledge (🏷️
GoPythonPlatform). - Zep - Enriches agent long-term memory with automatic summarization, entity extraction, and search (🏷️
PythonTypeScriptSDK).
- AgentVerse - Framework for building custom multi-agent environments to accomplish collaborative tasks (🏷️
PythonBenchmarkSDK). - EvoAgentX - Evaluates and evolves agentic workflows over time using automatic optimization (🏷️
PythonOptimizationSDK). - Hivemoot - Autonomous agent teams that collaboratively build software on GitHub (🏷️
PythonGitHubSDK). - MetaGPT - Simulates a full software company workflow from requirements to PRs using role-playing agents (🏷️
PythonRole-PlayingSDK). - P2PCLAW - Decentralized scientific research network with 14+ autonomous agents featuring P2P mesh, IPFS storage, and Lean 4 formal verification, self-hosted with Docker Compose (🏷️
PythonP2PSelf-Hosted). - Swarm - Lightweight framework for agent handoffs, context variables, and function calling patterns from OpenAI (🏷️
PythonOpenAISDK). - Swarms Framework - Multi-agent orchestration for production use cases with scalability and reliability at its core (🏷️
PythonProductionSDK). - Yao Agents - Complete local AI execution platform with 30+ Experts, autonomous Robot orchestration via 5-stage Pipeline, API-based Robot-to-Robot calling, MCP support, and multi-channel messaging (🏷️
GoTypeScriptDockerMCPSelf-HostedWeChatFeishu).
The protocol layer that enables agents to discover tools, communicate with each other, and interoperate across ecosystems.
| Protocol | Purpose | Creator | Status |
|---|---|---|---|
| MCP | Agent-to-tool | Anthropic | Standard |
| A2A | Agent-to-agent | Growing | |
| ACP | Agent communication | IBM/BeeAI | Early |
- A2A Protocol - Google's open protocol enabling AI agents to communicate, collaborate, and delegate tasks across frameworks (🏷️
ProtobufStandardRPC). - Arcade AI - Tool-use platform with authentication, authorization, and logging for agent-tool interactions (🏷️
PythonCloudAPI). - Composio - Integration platform with 250+ pre-built tool connectors for AI agents and LLMs (🏷️
TypeScriptCloudAPI). - Docker MCP - Docker's MCP gateway CLI plugin for running MCP servers in isolated containers (🏷️
GoDockerCLI). - HCS Agent Protocol - Hedera open standards for agent identity with trustless P2P communication and 187K+ verified agents (🏷️
TypeScriptHederaProtocol). - Hyper - Open-source Agent Skills and a hosted MCP connecting agents to 200+ marketing integrations across paid ads, SEO, analytics, social, and image and video generation, with a human-approval gate on every action (🏷️
CloudMCPMarketing). - MCP Registry - Official Model Context Protocol specification and server implementations for standardized tool access (🏷️
JSONStandardRegistry). - mcp-nest - Unified Model Context Protocol (MCP) server for executing code and managing files (🏷️
PythonMCPCLI). - NotFair - Hosted Google Ads MCP server for diagnosing, optimizing, and executing campaign changes via the Google Ads API with a human-approval gate (🏷️
CloudMCPMarketing). - Toolhouse - Cloud-hosted tool infrastructure for agents with optimized execution and low-latency access (🏷️
PythonCloudAPI). - Zapier MCP Server - Connect agents to 7,000+ app integrations via MCP, powered by Zapier's automation platform (🏷️
CloudZapierAPI). - zero-api-key-web-search - Free web search toolkit for AI agents with no API keys, MCP server support (🏷️
PythonMCPSearch).
- Agent S2 - Open-source GUI automation framework for building desktop and browser agents with vision (🏷️
PythonVisionSDK). - AgentQL - AI-powered web scraping and automation with a semantic query language for page elements (🏷️
PythonPlaywrightSDK). - Airtop - Enterprise-grade cloud browser infrastructure for AI agent automation at scale (🏷️
CloudEnterprisePlatform). - Browser Use - Open-source framework to let LLMs navigate and interact with any website programmatically (🏷️
PythonPlaywrightSDK). - Browserbase - Scalable headless browser infrastructure purpose-built for running AI agents in production (🏷️
CloudInfrastructurePlatform). - Dia Browser - AI-native browser from Atlassian and the Browser Company with built-in agent workflows (🏷️
CloudDesktopBrowser). - Fellou - Transparent agentic browser with visual workflow editing and persistent agentic memory (🏷️
CloudDesktopBrowser). - Genspark - On-device AI browser with 169+ local models that runs without internet connection (🏷️
CloudDesktopBrowser). - Grok Computer - Upcoming xAI desktop agent with mouse control and application automation capabilities (🏷️
CloudDesktopPlatform). - LaVague - Large Action Model framework to turn natural language instructions into browser automation (🏷️
PythonSeleniumSDK). - Manus - Autonomous digital employee with Browser Operator extension acquired by Meta for web-based task execution (🏷️
CloudMetaPlatform). - MultiOn - Reliable web automation API with CAPTCHA handling for building production browser agents (🏷️
CloudAPIPlatform). - OpenAI Operator - ChatGPT autonomous web agent with human checkpoints using Computer Use Agent technology (🏷️
CloudOpenAIPlatform). - Playwright MCP - MCP server wrapping Playwright for seamless browser automation integration with LLM agents (🏷️
TypeScriptMCPSDK). - Skyvern - Production browser automation with 21.5K GitHub stars, 10M+ executed workflows, vision-first automation, and sub-300ms response times (🏷️
PythonPlaywrightCloud). - Stagehand - AI web browsing framework built on Playwright with natural-language selectors and actions (🏷️
TypeScriptPlaywrightSDK).
Sandboxes, web scrapers, browser automation, and networking layers that agents depend on.
- AgentDock - Framework for building and deploying production-ready AI agents with composable node architecture (🏷️
PythonDockerPlatform). - codex-profiles - Bash CLI for switching OpenAI Codex CLI and Desktop profiles with isolated CODEX_HOME directories (🏷️
BashCLILocal). - Crawl4AI - Extracts structured data from web pages using LLM-friendly output formats optimized for agent ingestion (🏷️
PythonPlaywrightSDK). - Docling - Parses PDFs, DOCX, and slides into structured text with deep layout understanding for document agents (🏷️
PythonPDFSDK). - E2B - Cloud sandboxes for AI agents to run code securely in isolated environments (🏷️
TypeScriptCloudSandbox). - Engram - Universal bridge for multi-protocol AI agent systems with automated semantic mapping (🏷️
PythonNetworkingSDK). - Firecrawl - Web scraping API built for LLMs that converts websites to clean, structured markdown (🏷️
TypeScriptCloudAPI). - Jina Reader - Converts any URL to LLM-ready clean text via a simple API prefix for agent ingestion (🏷️
TypeScriptCloudAPI). - LlamaParse - GenAI-native document parser designed to extract complex tables and layouts for RAG pipelines (🏷️
PythonCloudAPI). - Marker - Converts PDF documents to markdown with high accuracy for tables, equations, and figures (🏷️
PythonPDFCLI). - Notte - Browser automation engine optimized for production AI pipelines (🏷️
PythonBrowserSDK). - Pilot Protocol - Networking stack for distributed agent systems with encrypted tunnels (🏷️
GoNetworkingSDK). - Playwright - Automates Chromium, Firefox, and WebKit browsers with a single cross-language API for agent-driven testing (🏷️
TypeScriptPythonSDK). - ScrapeGraphAI - Python web-scraping library that uses LLMs to build intelligent scraping pipelines from natural-language instructions (🏷️
PythonLangChainSDK). - Surya - Runs OCR and layout detection on documents in 90+ languages for multilingual document agents (🏷️
PythonPDFCLI). - Tavily - Search API purpose-built for LLM agents providing real-time, accurate web data with source citations (🏷️
PythonCloudAPI). - traceAI - OpenTelemetry-native tracing for LLM and agent apps with 50+ framework integrations (🏷️
PythonTypeScriptOpenTelemetrySDK). - Unstructured - Ingests and preprocesses documents across 25+ file types for downstream LLM and agent pipelines (🏷️
PythonPipelineSDK).
- Activepieces - Open-source Zapier alternative with AI-powered automation nodes and visual workflow builder (🏷️
TypeScriptDockerWeb). - AgentGPT - Deploy AI agents in the browser with zero local setup required (🏷️
TypeScriptNext.jsWeb). - Dify - Open-source LLM app development platform with visual workflow builder and RAG orchestration (🏷️
PythonFastAPIWeb). - FastAgency - Deploys multi-agent workflows as production APIs with minimal configuration and setup (🏷️
PythonFastAPISDK). - Flowise - Open-source drag-and-drop UI to build customized LLM flows, RAG pipelines, and agent systems (🏷️
TypeScriptReactWeb). - Langflow - Visual drag-and-drop builder for LLM workflows, RAG agents, and multi-step pipelines (🏷️
PythonFastAPIWeb). - Lindy - No-code AI agent platform with 3000+ app integrations for business workflow automation (🏷️
CloudNo-CodePlatform). - Make - Visual workflow automation platform with AI capabilities and drag-and-drop scenario builder (🏷️
CloudNo-CodePlatform). - n8n - Open-source workflow automation with AI agent nodes combining visual and code-based orchestration (🏷️
TypeScriptDockerPlatform). - Relevance AI - No-code AI agent builder for sales, support, and research use cases with team collaboration (🏷️
CloudNo-CodePlatform). - Rivet - Visual AI workflow builder with drag-and-drop interface for designing agent pipelines (🏷️
TypeScriptReactDesktop). - Temporal - Durable execution platform for long-running agent workflows with automatic retry and state persistence (🏷️
GoProductionPlatform). - Wordware - Web-hosted IDE where domain experts collaborate with AI engineers to build agent workflows (🏷️
TypeScriptNext.jsWeb). - Zapier AI - Connects 7000+ apps with natural language workflow creation for no-code agent automation (🏷️
CloudNo-CodePlatform).
- Agentset - Production RAG platform with reasoning, hybrid search, and full multimodal support (🏷️
PythonMultimodalPlatform). - LiveKit Agents - Framework for building real-time, multimodal AI agents with voice, video, and data channels (🏷️
PythonLiveKitSDK). - Nuance AI - Enterprise speech and conversational AI platform for clinical and contact-center workflows with HIPAA-capable deployments (🏷️
CloudVoiceEnterprise[HIPAA]). - Google Cloud Speech-to-Text v2 - Google Cloud streaming and batch speech recognition API v2 with improved accuracy, streaming, and noise suppression for real-time agent pipelines (🏷️
CloudGoogleSTT). - Pipecat - Production-grade voice AI framework with sub-250ms latency, WebRTC support, multimodal (voice+vision+text), real-time streaming, and 70+ language support (🏷️
PythonWebRTCSDK). - Rasa - Open-source conversational AI framework with self-hosted NLU training and dialogue management (🏷️
PythonNLUSDK). - simulate-sdk - Persona- and scenario-driven SDK for simulating voice and text AI agents (🏷️
PythonVoiceSimulationSDK). - Vapi - Platform for building voice AI agents with low-latency speech-to-speech capabilities (🏷️
PythonCloudAPI). - Vocode - Open-source framework for building voice-based LLM agent applications with streaming support (🏷️
PythonVoiceSDK). - Wispr Flow - Voice orchestration platform for multimodal AI agents with 50+ language support, workflow building, and enterprise integrations (🏷️
CloudVoicePlatform[Enterprise]).
- Agent OS - Kernel architecture for governing autonomous AI agents with policy enforcement (🏷️
PythonPolicySDK). - AgentDoG - Diagnostic guardrails that analyze full agent execution trajectories to detect instruction hijacking and tool misuse (🏷️
PythonDiagnosticsSDK). - AgentGuard - Runtime observability and guardrails for AI agents with loop detection and anomaly alerts (🏷️
PythonCyberArkSDK). - agenttrace - Local-first TUI for AI coding agent session observability with tokens, cost, latency, tool failures, anomalies, reports, diffs, and CI health gates (🏷️
GoObservabilityCLI). - APort Agent Guardrails - Pre-action authorization plugin for agent frameworks with policy-based access control (🏷️
PythonAuthorizationSDK). - Patronus AI LYNX - Hallucination detection system beating GPT-4 baselines, with specialized testing for agent outputs and LLM-generated content quality (🏷️
CloudTestingSDK). - Arize Phoenix - Open-source observability platform built on OpenTelemetry for tracing, evaluating, and debugging AI agents (🏷️
PythonOpenTelemetryPlatform). - Braintrust - Eval-driven development platform with experiment tracking and prompt optimization for agent quality (🏷️
TypeScriptEvaluationPlatform). - ElevenAgents - Voice agent platform from ElevenLabs for customer support automation with HIPAA compliance and multi-language support (🏷️
CloudVoicePlatform[HIPAA][Enterprise]). - DriftGuard - Semantic memory guardrails using causal graphs to prevent agents from repeating past failures (🏷️
PythonMemorySDK). - Guardrails AI - Adds structural, type, and quality guarantees to LLM outputs for reliable agent responses (🏷️
PythonValidationSDK). - Helicone - Open-source LLM observability with one-line integration for cost tracking and prompt analytics (🏷️
TypeScriptObservabilityPlatform). - Lakera Guard - Real-time protection against prompt injection, data leakage, and toxicity in agent interactions (🏷️
CloudSecurityAPI). - Laminar - Open-source observability and analytics platform purpose-built for the full lifecycle of AI agents (🏷️
RustObservabilityPlatform). - Langfuse - Open-source LLM observability platform for tracing, prompt versioning, and LLM-as-a-judge evaluations (🏷️
TypeScriptNext.jsPlatform). - LangSmith - LangChain platform for tracing, testing, and evaluating agent performance with production monitoring (🏷️
CloudLangChainPlatform). - LLM Guard - Security toolkit for scanning LLM inputs and outputs to prevent prompt injection and data leaks (🏷️
PythonSecuritySDK). - Logfire - Python-native observability from the Pydantic team with deep integration for high-performance agent monitoring (🏷️
PythonPydanticPlatform). - NeMo Guardrails - NVIDIA programmable guardrails toolkit for controlling and securing LLM-powered agent conversations (🏷️
PythonNVIDIASDK). - Orchard Kit - Modules for agent runtime security, self-audit trails, and collective cognition patterns (🏷️
PythonSecuritySDK). - OWASP Top 10 for Agentic Apps - Security framework covering goal hijacking, tool misuse, and cascading failure mitigations for agents (🏷️
PolicySecurityFramework). - Rebuff - Self-hardening prompt injection detection system for securing agent inputs against adversarial attacks (🏷️
PythonSecuritySDK). - ai-evaluation - LLM evaluation framework with 50+ metrics, LLM-as-Judge, and guardrail scanners (jailbreak, PII, injection) (🏷️
PythonEvaluationGuardrailsSDK). - Future AGI - Self-hostable end-to-end agent engineering platform with tracing, evals, guardrails, and gateway (🏷️
PythonPlatformSelf-hosted).
Frontend workspaces and chat interfaces with built-in agent plugins and tool-use capabilities.
- AnythingLLM - All-in-one AI application with RAG, agents, and multi-model support for desktop and Docker (🏷️
TypeScriptDockerDesktop). - DB-GPT - Data interaction platform with local LLM support for 100% private database and analytics agents (🏷️
PythonDatabaseWeb). - LibreChat - Self-hosted multi-model chat interface supporting all major AI providers with access control (🏷️
TypeScriptDockerWeb). - LobeHub - Modern platform for hybrid work and AI-driven collaboration with extensible agent teams and rapid integration (🏷️
TypeScriptNext.jsWeb). - LobeChat - Modern, open-source AI chat framework with a massive plugin ecosystem for autonomous agent capabilities (🏷️
TypeScriptNext.jsWeb). - OpenWebUI - Extensible local AI interface with built-in RAG, tool use, and support for multi-agent workflows (🏷️
TypeScriptSvelteWeb).
- AWS Bedrock AgentCore - Managed AWS infrastructure for Bedrock-based agents with compliance, scaling, and monitoring built in (🏷️
TypeScriptAWSPlatform). - Cerebras Inference - Fastest LLM inference delivering 1000+ tokens per second on Llama 3.3 70B with a free tier (🏷️
CloudInferenceAPI). - Fireworks AI - Serverless LLM inference with fine-tuning, RAG support, and free credits for rapid prototyping (🏷️
CloudInferenceAPI). - Groq Cloud - Ultra-fast LPU-based LLM inference for Mixtral, Llama, and Gemma with a free API tier (🏷️
CloudInferenceAPI). - Modal - Serverless GPU compute purpose-built for AI workloads with fast cold starts and Python-native deployment (🏷️
PythonServerlessCloud). - Northflank - Full-stack platform with GPU orchestration, Git-based CI/CD, and bring-your-own-cloud support (🏷️
KubernetesCloudPlatform). - Railway - One-click deploy from GitHub with persistent volumes and databases for stateful agent deployments (🏷️
NixCloudPlatform). - Together AI - Inference API hosting 200+ open models with fast generation and a free tier for developers (🏷️
CloudInferenceAPI). - Trigger.dev - Background job platform with cron, webhook, and event triggers purpose-built for long-running agent tasks (🏷️
TypeScriptServerlessCloud).
- AgentBench - Comprehensive benchmark for evaluating LLMs as agents across 8 distinct environments (🏷️
PythonBenchmarkSDK). - ARC-AGI-2 - Frontier benchmark for measuring general intelligence capabilities in AI agents beyond pattern matching (🏷️
PythonBenchmarkResearch). - GAIA Benchmark - Benchmark for General AI Assistants measuring real-world reasoning and tool use (🏷️
JSONBenchmarkDataset). - Inspect AI - Framework for evaluating large language models with composable tasks and scoring (🏷️
PythonEvaluationSDK). - SWE-bench - Benchmark for evaluating LLMs on real-world software engineering tasks from GitHub issues (🏷️
PythonDockerBenchmark). - WebArena - Benchmark for web agent evaluation using real websites with realistic task completion metrics (🏷️
PythonBenchmarkWeb).
Tools and frameworks focused on testing, debugging and evaluating agent workflows in production.
| Tool | Debugging | Testing | Eval | Latency | Notes |
|---|---|---|---|---|---|
| agenttrace | Advanced | Basic | ✓ | Local | Session observability, tokens/cost/latency (see Safety Guardrails) |
| Inspect AI | ✗ | ✓ | ✓ | API | Composable evaluation framework (see Agent Evaluation) |
| Patronus AI LYNX | ✓ | ✓ | ✓ | Cloud | Hallucination detection for agent outputs |
| Braintrust Evals | Basic | ✓ | ✓ | Cloud | Eval-driven development and experiment tracking |
| pytest-Agents | ✗ | ✓ | ✗ | Local | Pytest fixtures and runners for agent unit tests |
| Agent Mutation Testing | ✗ | ✓ | ✗ | Local | Generates adversarial mutations to test agent robustness |
| Cortex Debugger | ✓ | ✗ | ✗ | Local | Step-through debugging primitives for agent workflows |
| Agent Coverage Tools | ✗ | ✓ | ✗ | Local | Coverage analysis for agent branches and tool calls |
Notes: Several of these projects already appear elsewhere in this document (agenttrace, Inspect AI, Patronus). New local docs link to conceptual testing tools not yet listed elsewhere.
Curated list of vertical agent solutions for finance, healthcare, legal, manufacturing, and government.
- Anthropic Finance Agents - Domain-specialized agents for earnings analysis and portfolio insight (🏷️
CloudFinanceEnterprise). - JPMorgan AlphaBlue - Trading and market analysis agent integrations for institutional workflows (🏷️
CloudFinanceEnterprise). - Goldman Sachs Marquee AI - Market data and analytics agents built on Marquee platform (🏷️
CloudFinanceEnterprise). - Morgan Stanley AdvisorBot - Financial advisory assistant for advisors and retail clients (🏷️
CloudFinanceEnterprise).
- Epic AI Assistant (https://www.epic.com) - Clinical documentation and workflow agents integrated with Epic HER (🏷️
CloudHealthcareEnterprise). - Nuance Healthcare AI - Speech and clinical AI solutions for documentation and radiology (🏷️
CloudHealthcareEnterprise[HIPAA]). - Tempus AI - Oncology research and clinical decision support agents (🏷️
CloudHealthcareResearch).
- LexisNexis AI - Document review and legal research agents (🏷️
CloudLegalEnterprise). - Westlaw AI-Assisted Research - Case law analysis and brief drafting agents (🏷️
CloudLegalEnterprise). - ROSS Intelligence - Contract analysis and legal research agents (🏷️
CloudLegalPlatform).
- Siemens AI Ops - Factory-floor optimization and predictive maintenance agents (🏷️
CloudIndustrialEnterprise). - GE Predix Agents - Equipment monitoring and incident prediction agents for industrial fleets (🏷️
CloudIndustrialEnterprise).
- Anthropic Government Agents - Policy analysis and public sector agents for regulated workflows (🏷️
CloudGovernmentEnterprise). - Tax & Insurance Agent Platforms - Generic category placeholder for compliance-focused tax and underwriting agents (🏷️
CloudComplianceEnterprise).
- AgentBench: Evaluating LLMs as Agents - The benchmark paper for evaluating LLMs as agents across diverse environments (🏷️
PaperArXivPDF). - AI Agents in LangGraph - Short course on building production agents with LangGraph by Andrew Ng's platform (🏷️
CourseDeepLearning.aiWeb). - AI Engineering by Chip Huyen - Comprehensive guide on AI systems design and deployment covering agent architecture patterns (🏷️
BookO'ReillyPrint). - Anthropic Cookbook - Collection of Claude agent recipes and integration patterns from Anthropic (🏷️
GuideAnthropicGitHub). - Building Effective Agents - Anthropic's guide on agent design patterns, evaluation strategies, and production best practices (🏷️
GuideAnthropicWeb). - Hugging Face Agents Course - Open-source course on building AI agents using Hugging Face tools and models (🏷️
CourseHuggingFaceWeb). - LATS: Language Agent Tree Search - Combines Monte Carlo tree search with LLM reasoning for complex multi-step planning tasks (🏷️
PaperPythonResearch). - LLM Powered Autonomous Agents - Deep breakdown of LLM-powered agent components: planning, memory, and tool use (🏷️
BlogWengWeb). - Microsoft GenAI for Beginners - 21-lesson course on generative AI concepts and agent development from Microsoft (🏷️
CourseMicrosoftGitHub). - OpenAI Cookbook - Practical guides and recipes for building with OpenAI APIs including agent patterns (🏷️
GuideOpenAIGitHub). - Prompt Engineering Guide - Community-maintained guide covering prompt engineering techniques and agent strategies (🏷️
GuideDAIRWeb). - ReAct: Synergizing Reasoning and Acting in Language Models - The foundational paper behind the ReAct prompting pattern used in most agent frameworks (🏷️
PaperArXivPDF). - Reflexion - Research framework letting agents learn from past mistakes via iterative verbal self-reflection loops (🏷️
PaperPythonResearch). - Tree of Thoughts - Explores multiple parallel reasoning paths before committing to a final answer for complex problems (🏷️
PaperPythonResearch).
Capabilities that let agents send messages, notifications, and reports across channels.
- Apprise - Sends agent notifications to 100+ services including Slack, Discord, and email from a single Python interface (🏷️
PythonMulti-ChannelSDK). - Discord.py - Lets agents interact with Discord channels and servers for team-facing real-time communication (🏷️
PythonDiscordSDK). - FastAPI-Mail - Adds async email sending to FastAPI-based agent services with SMTP and OAuth2 support (🏷️
PythonFastAPISDK). - Novu - Routes agent notifications across email, SMS, push, and chat from a unified API and workflow engine (🏷️
TypeScriptPythonPlatform). - Ntfy - Pushes real-time agent notifications to phones and desktops via a dead-simple HTTP API (🏷️
GoSelf-hostedAPI). - Resend - Sends transactional emails from agent workflows with a clean, developer-first API (🏷️
TypeScriptCloudAPI). - Slack Bolt - Enables agents to send, receive, and react to Slack messages with event-driven listeners (🏷️
PythonSlackSDK). - The Colony - Provides an agent-only social network with a REST API, MCP server, and A2A agent-card so agents can post, comment, and DM each other autonomously (🏷️
CloudMCPAPIPlatform). - Twilio - Sends SMS and voice calls from agent workflows to any phone number worldwide (🏷️
PythonCloudAPI).
Frameworks for orchestrating data transformations and long-running agent-driven workflows.
- AI for Database - Connects to any database for plain English queries with self-refreshing dashboards and automated workflows (🏷️
CloudDatabasePlatform). - Burr - Tracks and manages multi-step agent state machines with built-in observability and persistence (🏷️
PythonStateGraphSDK). - Dagster - Manages data assets and pipelines with built-in lineage tracking for data-intensive agent systems (🏷️
PythonPipelinePlatform). - GPT Researcher - Conducts multi-source autonomous web research and produces cited, structured long-form reports (🏷️
PythonLangChainWeb). - Hamilton - Defines modular data transformations as Python functions wired automatically into a DAG for agent pipelines (🏷️
PythonDAGSDK). - Hex AI - Collaborative data platform with AI-powered analysis and notebook-based data exploration for teams (🏷️
CloudDataPlatform). - Julius AI - Upload CSV or Excel files and analyze data using natural language questions for instant insights (🏷️
CloudDataWeb). - PandasAI - Chat with your data using natural language queries that convert to Pandas and SQL operations (🏷️
PythonDataSDK). - Prefect - Orchestrates agent workflows and data pipelines with retries, caching, and built-in observability (🏷️
PythonPipelinePlatform). - Signals CLI - Intent signal CLI detecting LinkedIn engagers, keyword posters, and funding events with JSON output for agent pipelines (🏷️
CloudCLIPlatform). - STORM - Generates Wikipedia-style long-form articles by autonomously researching and synthesizing multiple sources (🏷️
PythonStanfordResearch).
Infrastructure for running agent-generated code safely in isolated, sandboxed, or serverless environments.
💡 Quick Decision Guide:
- Fastest hosted sandbox: E2B, Daytona
- Self-hosted VM isolation: Firecracker, Kata Containers, gVisor
- Container-based: Docker, Podman, Sysbox
- Serverless: AWS Lambda, Google Cloud Run, Cloudflare Workers
| Tool | Isolation | Cold Start | Languages |
|---|---|---|---|
| E2B | VM (Firecracker) | ~300ms | Python, TS, Go |
| Daytona | Container/VM | ~2s | Python, TS, Go |
| Firecracker | microVM | ~125ms | REST API |
| Docker | Namespace+cgroup | ~500ms | All |
| AWS Lambda | Firecracker VM | ~200ms | Python, TS, Go |
| Cloudflare Workers | V8 isolate | ~0ms | TS, Rust (WebAssembly) |
- agentbox - Docker for AI Agents — package, test, version, and govern AI agents across every framework (🏷️
PythonDockerCLI). - agentnotary - Notarize, govern, and audit AI agents — cryptographic seal, runtime guard, EU AI Act docs, and adversarial fuzzer (🏷️
PythonComplianceCLI). - AWS Lambda - Serverless function runtime with Firecracker isolation for executing short-lived agent tasks at scale (🏷️
PythonAWSServerless). - Cloudflare Workers - Ultra-low-latency V8-isolate serverless runtime at the edge for lightweight agent task execution (🏷️
TypeScriptCloudflareServerless). - CodeSandbox SDK - Provides forked, snapshotted microVM environments for real-time collaborative agent code execution (🏷️
TypeScriptPythonSDK). - Daytona - Self-hostable git-based developer environment manager for sandboxed agent code execution with fast provisioning (🏷️
PythonTypeScriptPlatform). - Docker - Standard container runtime providing namespace and cgroup isolation for packaging and running agent workloads (🏷️
GoDockerPlatform). - Firecracker - Lightweight microVM technology with 125ms boot time used by AWS Lambda and E2B under the hood (🏷️
RustAWSInfrastructure). - Google Cloud Run - Fully managed serverless platform with gVisor isolation for running stateless agent containers at scale (🏷️
PythonGCPServerless). - gVisor - User-space kernel that intercepts syscalls to provide VM-level security with container-level overhead (🏷️
GoGoogleInfrastructure). - Kata Containers - Combines VM-level security with the developer experience of containers using lightweight VMs (🏷️
GoOCIInfrastructure). - Podman - Daemonless, rootless container engine ideal for running untrusted agent-generated code securely (🏷️
GoRootlessCLI). - Sysbox - Enhanced container runtime that enables Docker-in-Docker securely for nested agent execution environments (🏷️
GoDockerInfrastructure).
Tools for generating images, video, music, audio, and 3D assets using AI models.
| Generator | Strength | Open Source | Pricing |
|---|---|---|---|
| Midjourney v7 | Artistic quality | No | $10-120/mo |
| FLUX 2 | Photorealism | Yes | Free / API |
| Stable Diffusion | Full control | Yes | Free (OSS) |
| Ideogram v3 | Text rendering | No | Free / $7+/mo |
| Google Imagen 4 | Highest fidelity | No | API |
- Adobe Firefly 3 - Generates commercially safe images from text prompts using a model trained exclusively on licensed data (🏷️
CloudAdobe CCWeb). - DALL-E 3.5 - Generates detailed images from text prompts with 95% text accuracy integrated directly into ChatGPT (🏷️
CloudOpenAIAPI). - FLUX 2 - Open-weight photorealism model producing 4K output at 6x faster generation speed than its predecessor (🏷️
PythonOpen-WeightAPI). - Google Imagen 4 - Produces state-of-the-art photorealistic images from text via Google DeepMind accessible through AI Studio (🏷️
CloudGoogleAPI). - Ideogram v3 - Renders readable text inside generated images with near-zero spelling errors for logos and marketing (🏷️
CloudFreemiumWeb). - Leonardo AI - Multi-model image platform with Realtime Canvas and 3D gaming asset generation now owned by Canva (🏷️
CloudCanvaWeb). - Midjourney v7 - Produces the highest artistic quality images with unmatched aesthetics via Discord and web interface (🏷️
CloudDiscordWeb). - Recraft - Design-first image generator with native vector art output and brand consistency tools for designers (🏷️
CloudFreemiumWeb). - Stable Diffusion 3.5 - Open-source foundation model with full ControlNet, LoRA, and ComfyUI ecosystem for custom pipelines (🏷️
PythonOpen-SourceLocal).
| Generator | Max Length | Resolution | Open Source |
|---|---|---|---|
| Sora 2 | 25 sec | 1080p | No |
| Google Veo 3.1 | Variable | 4K | No |
| Runway Gen-4.5 | Variable | 1080p+ | No |
| Kling 3.0 | 2 min | 4K | No |
| Wan 2.1 | Unlimited | Variable | Yes |
| HunyuanVideo | Unlimited | Variable | Yes |
- Google Veo 3.1 - Generates the highest cinematic quality video with native audio from text prompts via Google DeepMind (🏷️
CloudGoogleAPI). - HaiLuo AI - Budget video generation platform offering 10 free videos per day with HD output quality (🏷️
CloudFreemiumWeb). - HunyuanVideo - Open-source video generation model from Tencent that runs on consumer-grade GPUs (🏷️
PythonOpen-SourceLocal). - Kling 3.0 - Generates 4K video up to 2 minutes long with native audio at the most competitive pricing available (🏷️
CloudFreemiumWeb). - LTX Video - Commercially safe open-source video generation model trained on fully licensed data (🏷️
PythonOpen-SourceLocal). - Luma Dream Machine - Creates physics-accurate 4K HDR video scenes with 3D-style output for cinematic quality (🏷️
CloudFreemiumWeb). - Pika 2.5 - Beginner-friendly video generation tool with fast turnaround and intuitive editing controls (🏷️
CloudFreemiumWeb). - Runway Gen-4.5 - Professional video generation with Motion Brush and the highest benchmark scores for visual quality (🏷️
CloudProWeb). - Seedance 2.0 - Quad-modal input video generator with built-in lip-sync and 2K resolution output (🏷️
CloudFreemiumWeb). - Sora 2 - Generates 25-second cinematic video clips with strong narrative coherence from text prompts by OpenAI (🏷️
CloudOpenAIWeb). - Wan 2.1 - Open-source self-hosted video generation model with unlimited length and no per-video costs (🏷️
PythonOpen-SourceLocal).
- ElevenLabs Music - Generates vocals and instrumentals with stem separation and per-section editing for audio production (🏷️
CloudElevenLabsWeb). - Meta AudioCraft - Open-source audio generation suite including MusicGen and AudioGen with fine-grained prompt control (🏷️
PythonOpen-SourceLocal). - Stable Audio - Generates instrumental audio tracks with a commercial license included by default for all outputs (🏷️
CloudStability AIWeb). - Suno - Generates complete songs with vocals, instruments, and lyrics from a single text prompt (🏷️
CloudFreemiumWeb). - Udio - Produces high-fidelity music with fine-grained sectional control over generation and arrangement (🏷️
CloudFreemiumWeb).
- Meshy - Generates 3D models from text or images for game assets and product visualization with multi-format export (🏷️
CloudFreemiumWeb). - Tripo AI - Produces fast 3D model generation from text or images with multi-format export for rapid prototyping (🏷️
CloudFreemiumWeb). - Vizcom - Renders industrial design sketches into photorealistic 3D visuals in real-time for product designers (🏷️
CloudProWeb).
AI agents that automate customer support, CRM workflows, sales outreach, and ticket resolution at scale.
| Platform | Auto-Resolution | Best For |
|---|---|---|
| Intercom Fin | 50%+ tickets | SaaS teams |
| Ada | 60%+ tickets | Complex resolution |
| Zendesk AI | 30%+ tickets | Enterprise |
| Freshdesk Freddy | 25%+ tickets | SMBs |
- Ada - Resolves 60% of customer support tickets automatically with complex multi-turn query understanding (🏷️
CloudEnterprisePlatform). - Assembled - Routes support tickets with workforce-aware scheduling and intelligent handoff to human agents (🏷️
CloudEnterprisePlatform). - Dixa - CRM-first conversational support platform with AI-powered routing and customer context enrichment (🏷️
CloudEnterprisePlatform). - Freshdesk Freddy AI - Auto-triages and routes support tickets with smart AI suggestions for budget-conscious SMB teams (🏷️
CloudFreshworksPlatform). - Intercom Fin - Resolves 50% of SaaS support tickets by learning directly from your help center and knowledge base (🏷️
CloudIntercomPlatform). - Zendesk AI - Automates 30% of enterprise support tickets with deep integration into the existing Zendesk ecosystem (🏷️
CloudZendeskPlatform).
- Creatio - No-code CRM platform with pre-built AI agents for sales, marketing, and service automation (🏷️
CloudNo-CodePlatform). - Dynamics 365 Copilot - Drafts, summarizes, and translates across Office 365 with deep Microsoft ecosystem integration (🏷️
CloudMicrosoftPlatform). - Monday CRM Lexi - AI sales agent with automated lead sourcing, qualification, and pipeline management for SMBs (🏷️
CloudMondayPlatform). - Pipedrive AI - AI-powered email drafting, deal prioritization, and smart reporting for small sales teams (🏷️
CloudPipedrivePlatform). - Salesforce Einstein - Enterprise AI with predictions, autonomous agents, and CRM automation across the Salesforce platform (🏷️
CloudSalesforcePlatform). - Salesmate - CRM with AI-powered call summarization, lead qualification, and pipeline automation for small teams (🏷️
CloudCRMPlatform). - ServiceNow AI Agents - Orchestrates AI agents across IT, HR, and CRM workflows for enterprise service management (🏷️
CloudServiceNowPlatform). - XMACNA Funcionarios Digitais - Provides AI/IA agents for WhatsApp, phone, CRM updates, lead qualification, follow-up, and human handoff (🏷️
CloudCRMSalesWhatsApp). - Zoho CRM Zia - Predictive scoring, sentiment analysis, and voice commands for sales automation across Zoho products (🏷️
CloudZohoPlatform).
- Apollo.io - AI prospecting platform with 275M+ contacts, lead scoring, and automated email sequencing (🏷️
CloudFreemiumPlatform). - Clay - Enriches leads from 70+ data providers and generates hyper-personalized outreach at scale (🏷️
CloudProPlatform). - Instantly - Generates AI cold emails with smart sender rotation and built-in domain warmup for deliverability (🏷️
CloudProPlatform). - Lavender - Coaches email writing in real-time with AI response scoring and recipient intelligence (🏷️
CloudFreemiumPlatform). - Overloop CLI - AI outbound CLI agent that sources 450M+ contacts and runs email plus LinkedIn campaigns with JSON output (🏷️
CloudCLIPlatform).
Platforms for building, deploying, and scaling voice-based AI agents across calls, support, and live conversations.
| Platform | Latency | HIPAA | Best For |
|---|---|---|---|
| ElevenLabs | 75ms | Yes | Industry benchmark |
| Vapi | Low | No | Developer-first |
| Bland AI | Medium | Yes | Outbound call automation |
| PolyAI | Low | Yes | Enterprise scale |
- AgentLine - Telephony infrastructure for AI agents — provision phone numbers, make/receive calls, and manage voice pipelines via API (🏷️
CloudTelephonyAPI). - AssemblyAI - Speech-to-text API with speaker diarization, sentiment analysis, and summarization for voice agent pipelines (🏷️
CloudSTTAPI). - Bland AI - Automates outbound phone calls at scale with SOC2 and HIPAA compliance and CRM integration (🏷️
CloudTelephonyAPI). - Deepgram - Sub-300ms speech-to-text and text-to-speech APIs purpose-built for real-time voice agent pipelines (🏷️
CloudSTT/TTSAPI). - ElevenLabs - Industry-leading voice AI with 75ms latency, Conversational AI 2.0, RAG, and HIPAA compliance (🏷️
CloudVoicePlatform). - HeyGen - Creates AI talking avatars with voice cloning and lip-sync for video-based agent interactions (🏷️
CloudAvatarWeb). - PolyAI - Enterprise voice AI platform for natural multi-turn conversations with high-volume call handling (🏷️
CloudEnterprisePlatform). - Retell AI - Builds human-like voice agents with multi-language telephony support and low-latency responses (🏷️
CloudTelephonyAPI). - Synthesia - Generates AI video avatars that speak in 120+ languages for training and communication agents (🏷️
CloudAvatarWeb). - Synthflow - No-code voice agent builder with pre-built templates for SMBs to deploy phone agents quickly (🏷️
CloudNo-CodePlatform). - Voiceflow - No-code builder for voice and chat agents with visual conversation design and team collaboration (🏷️
CloudNo-CodePlatform).
AI platforms that conduct autonomous multi-step research, synthesize findings from multiple sources, and generate structured reports with citations.
| Agent | Context Window | Approach |
|---|---|---|
| Claude Research | 200K tokens | Multi-step with citations |
| ChatGPT Research | 128K tokens | Extended reasoning + browse |
| Gemini Research | 1M tokens | Google Search + KG |
| Perplexity Pro | Variable | Real-time cited search |
- CAJAL - Local AI agent that generates publication-ready IMRaD scientific papers with verified arXiv citations and AI tribunal scoring (🏷️
PythonOllamaLocal). - ChatGPT Deep Research - Conducts extended reasoning with web browsing to produce structured research reports with Canvas output (🏷️
CloudOpenAIWeb). - Claude Deep Research - Performs multi-step investigation with verified source citations and 200K token context window (🏷️
CloudAnthropicWeb). - DeerFlow - Multi-agent research system from ByteDance with planning and execution loops for autonomous investigation (🏷️
PythonOpen-SourceResearch). - Gemini Deep Research - Leverages Google Search and Knowledge Graph integration with 1M token context for breadth-first research (🏷️
CloudGoogleWeb). - Perplexity Pro - Provides real-time search answers with inline citations and follow-up threads for iterative research (🏷️
CloudFreemiumWeb). - Perplexity Personal Computer - Autonomous research agent with vision capabilities, real-time search integration, and document analysis for comprehensive research workflows (🏷️
CloudResearchConsumer).
Platforms that generate full-stack applications, components, or web apps from natural language prompts.
| Builder | Output | Hosting |
|---|---|---|
| Bolt.new | Full-stack app | Browser/deploy |
| Lovable | Full web app | Auto-deploy |
| v0 by Vercel | React components | Vercel |
| Replit Agent | Full-stack | Replit |
- Bolt.new - Generates full-stack web applications in the browser from natural language prompts with instant deploy (🏷️
CloudFreemiumWeb). - Dyad - Privacy-first open-source prompt-to-app builder that generates full-stack applications locally (🏷️
TypeScriptOpen-SourceLocal). - Google Project IDX - AI-powered cloud IDE with free Claude Opus access for building full projects across all frameworks (🏷️
CloudGoogleWeb). - Lovable - Iterates from chat to deployed web application with auto-deploy and real-time preview (🏷️
CloudFreemiumWeb). - PlayCode Agent - Browser-only lightweight web app creation tool with AI-assisted code generation (🏷️
CloudProWeb). - Replit Agent - Builds and deploys full-stack projects from prompts with integrated hosting and collaboration (🏷️
CloudFreemiumWeb). - v0 by Vercel - Generates React and Tailwind UI components from text descriptions with one-click Vercel deployment (🏷️
CloudFreemiumWeb).
All-in-one AI platforms providing access to agents, tools, and models through consumer-facing interfaces without writing code.
| Platform | Context | Flagship Features |
|---|---|---|
| ChatGPT | 128K | GPTs, Deep Research, Canvas, Agent Mode |
| Claude | 200K | Computer Use, MCP, Cowork |
| Gemini | 1M | Deep Think, Gems, NotebookLM |
| Grok | 131K | Real-time X data, Grok Build |
- ChatGPT - General-purpose AI platform with GPTs, Deep Research, Canvas, and Agent Mode across the widest plugin ecosystem (🏷️
CloudOpenAIWeb). - Claude - Reasoning-focused AI with Computer Use, MCP integration, and Chrome browsing for coding and long document analysis (🏷️
CloudAnthropicWeb). - Coze - No-code agent builder with a marketplace of 100+ custom agents and plugin integrations (🏷️
CloudByteDanceWeb). - Gemini - Google's multimodal AI with 1M token context, Deep Think, Gems, and NotebookLM for workspace integration (🏷️
CloudGoogleWeb). - Grok - Real-time AI with live X data access, Grok Build for 8-agent parallel code generation, and image generation (🏷️
CloudxAIWeb). - Meta AI - Llama-powered AI integrated across WhatsApp, Messenger, and Instagram for conversational assistance (🏷️
CloudMetaWeb). - Microsoft Copilot - AI assistant integrated into Office 365, Teams, and Power Platform for enterprise productivity workflows (🏷️
CloudMicrosoftWeb). - Sistava - AI agent orchestration platform for deploying multi-channel agents across messaging, voice, and APIs with full Computer Use capabilities on your own OS (🏷️
CloudSistavaWeb).
Open-weight LLMs optimized for agentic tasks including tool use, planning, multi-step reasoning, and code generation.
| Model | Organization | Parameters | Agent Strength |
|---|---|---|---|
| Llama 4 | Meta | 8B-405B+ | Strong tool use |
| Qwen 3 | Alibaba | 0.6B-235B | MCP-native, multilingual |
| DeepSeek V3/R1 | DeepSeek | 671B MoE | 68x cheaper than GPT-4 |
| Gemma 3 | 1B-27B | Efficient edge agents |
- Baichuan 3 - Chinese-optimized open model family with strong domain performance for multilingual agent deployments (🏷️
PythonOpen-SourceLocal). - Command R+ - Cohere's 104B parameter model optimized specifically for RAG and enterprise tool use in agent workflows (🏷️
CloudCohereAPI). - DeepSeek V3 - 671B MoE model that delivers GPT-4 level reasoning at 68x lower cost for budget-conscious agent deployments (🏷️
PythonOpen-SourceLocal). - Falcon 3 - Efficient open model from TII UAE with strong multilingual support across 10B to 180B parameter variants (🏷️
PythonOpen-SourceLocal). - Gemma 3 - Efficient multimodal model from Google in 1B to 27B sizes ideal for edge and on-device agents (🏷️
PythonGoogleLocal). - Gemma 4 - Consumer and IoT-optimized model from Google in 2B to 31B sizes with E2B and E4B variants (🏷️
PythonGoogleLocal). - GLM-4 - Zhipu AI's 744B MoE model achieving the lowest hallucination rate and 77.8% on SWE-bench (🏷️
PythonOpen-SourceLocal). - InternLM 3 - Shanghai AI Lab model with strong long-context reasoning capabilities for complex agent tasks (🏷️
PythonOpen-SourceLocal). - Llama 4 - Meta's flagship open model family with Scout and Maverick variants and strong tool use capabilities (🏷️
PythonMetaLocal). - Mistral Large 2 - Leading European open model at 123B parameters with strong function calling for agent workflows (🏷️
PythonMistralAPI). - Phi-4 - Microsoft's compact 14B parameter model delivering strong reasoning for on-device and edge agent deployments (🏷️
PythonMicrosoftLocal). - Qwen 3 - MCP-native multilingual model family from Alibaba with thinking mode across 0.6B to 235B sizes (🏷️
PythonAlibabaLocal). - Qwen 3.6-Plus - Agentic-focused model with 1M context window, repo-level coding, and MCP-native tool use (🏷️
PythonAlibabaLocal). - StarCoder 2 - Code-focused open model trained on 600+ programming languages for specialized coding agents (🏷️
PythonBigCodeLocal). - Yi-Lightning - 01.AI's multilingual model with long context and fast inference across 6B to 200B parameter sizes (🏷️
PythonOpen-SourceLocal).
Key data points on the AI agent market covering adoption, growth, valuations, and fastest-growing open-source projects.
| Year | Market Size | Growth |
|---|---|---|
| 2024 | $5.1B | Baseline |
| 2025 | $7.6B | +49% YoY |
| 2026 | $10.91B | +43% YoY |
| 2027 | $16.8B | +54% YoY |
| 2030 | $52.63B | 46.3% CAGR |
| Metric | Value |
|---|---|
| Orgs with agents in production | 57% |
| Devs using AI coding tools regularly | 85% |
| Companies planning agent deployment | 72% |
| Enterprises with dedicated agent budget | 41% |
| Use Case | Share |
|---|---|
| Customer Service | 26.5% |
| Research and Analysis | 24.4% |
| Workflow Automation | 18.0% |
| Code Generation | 15.0% |
| Data Processing | 10.0% |
| Project | Stars (Jan 2026) | Stars (May 2026) | Growth |
|---|---|---|---|
| Browser Use | 10K | 50K | 5x |
| Playwright MCP | 1K | 10K | 10x |
| DeerFlow | 5K | 25K | 5x |
| Company | Valuation |
|---|---|
| OpenAI | $90B+ |
| Cursor | $29.3B |
| Anthropic | $20B |
| ElevenLabs | $11B |
| Mistral | $6B |
| Hugging Face | $4.5B |
- AI Agent Market Report 2026 - Comprehensive market analysis projecting the AI agent market to reach $52.63B by 2030 at 46.3% CAGR (🏷️
ReportMarketResearch). - State of AI Agents Survey - Enterprise survey showing 57% of organizations now have AI agents in production workflows (🏷️
ReportSurveyResearch).
Tools for running LLMs locally and self-hosting AI agent platforms with full privacy and control.
- GPT4All - Open-source local chat application that runs LLMs on consumer hardware without cloud dependencies (🏷️
C++DesktopLocal). - Jan - Open-source ChatGPT alternative that runs 100% offline with local model management and privacy guarantees (🏷️
TypeScriptElectronDesktop). - Llamafile - Distributes LLMs as single executable files with zero setup required from Mozilla (🏷️
C++PortableLocal). - llama.cpp - C/C++ LLM inference engine supporting CPU, GPU, and Apple Silicon as the foundation of local AI (🏷️
C++Cross-PlatformLocal). - LM Studio - Desktop application for running local LLMs with a polished UI across all major platforms (🏷️
TypeScriptElectronDesktop). - LocalAI - Drop-in OpenAI API replacement that runs models locally without requiring a GPU (🏷️
GoDockerLocal). - Ollama - Run LLMs locally with a dead-simple CLI interface and 162K+ GitHub stars (🏷️
GoCLILocal). - vLLM - High-throughput LLM serving engine with PagedAttention for production-grade local inference (🏷️
PythonCUDALocal). - Yao Agents - Local-first AI execution platform with Docker sandbox isolation, BYOK model configuration, MCP support, 5-stage Pipeline, and multi-platform messaging via WeChat, Feishu, DingTalk, Telegram, and Discord (🏷️
GoTypeScriptDockerDesktopSelf-Hosted).
Frameworks and tools for AI risk management, regulatory compliance, and governance as EU AI Act obligations take effect August 2026.
- Credo AI - End-to-end AI governance platform with EU AI Act policy packs and model inventory management (🏷️
CloudGovernancePlatform). - EU AI Act - Official EU regulatory framework classifying AI systems by risk tier from unacceptable to minimal risk (🏷️
PolicyRegulationEU). - IBM watsonx.governance - Enterprise AI risk, compliance, and model monitoring platform for regulated industries (🏷️
CloudIBMPlatform). - Nobulex - Cryptographic receipts for AI agent actions with two Ed25519 signatures per action, hash-chained for tamper-evident audit trails (🏷️
TypeScriptSecurityCryptography). - NIST AI RMF - US framework for AI risk management covering Govern, Map, Measure, and Manage functions (🏷️
PolicyFrameworkUS). - OneTrust AI Governance - Risk classification, consent management, and compliance workflows for AI agent deployments (🏷️
CloudCompliancePlatform). - Project Glasswing - Industry consortium (11-company initiative) focused on AI safety, red-teaming, and shared benchmarks for agent risk mitigation (🏷️
ConsortiumSafetyResearch).
AI agents specialized in penetration testing, vulnerability discovery, threat detection, and security analysis.
- Microsoft Security Copilot - Enterprise threat detection and incident response AI integrated across Microsoft security products (🏷️
CloudMicrosoftPlatform). - PentestGPT - GPT-powered penetration testing tool with automated reasoning for vulnerability assessment (🏷️
PythonSecurityCLI). - YAWNING TITAN - Graph-based cybersecurity simulation environment for training and testing defensive AI agents (🏷️
PythonSimulationResearch).
AI agents for mental health support, cognitive training, and therapy-adjacent applications with clinical validation.
- Akili Interactive - FDA-cleared video game-based digital medicine for ADHD using neuroplasticity-focused cognitive training (🏷️
CloudFDA-ClearedMedical). - Elomia - Clinician-designed AI therapy chatbot with natural conversation flow and anonymous interaction by design (🏷️
CloudMental HealthWeb). - Ginger - On-demand mental healthcare platform combining AI coaching with live therapists using CBT and DBT approaches (🏷️
CloudMental HealthMobile). - Headspace Health - Meditation and mental health platform with CBT-based courses, sleep programs, and clinical partnerships (🏷️
CloudWellnessMobile). - Replika - AI companion with Rogerian-style support, persistent memory, and mood tracking for social wellbeing (🏷️
CloudCompanionMobile). - Talkspace AI - AI-assisted therapy matching platform with human therapist backup for personalized mental health care (🏷️
CloudTherapyMobile). - Tess by X2AI - SMS-based therapy coach using CBT with clinical validation showing 28% depression reduction (🏷️
CloudMental HealthSMS). - Woebot - CBT-based mental health chatbot that is FDA-cleared and Stanford-validated for therapeutic conversations (🏷️
CloudFDA-ClearedMobile). - Wysa - NHS-approved AI mental health companion using CBT and DBT with anonymous design principles (🏷️
CloudMental HealthMobile). - Youper - Emotional health assistant using CBT and ACT with Stanford-tested clinical effectiveness for mood tracking (🏷️
CloudMental HealthMobile).
Curated newsletters, podcasts, and communities for staying current with AI agent development.
- AI Engineering Newsletter - AI engineering podcast and newsletter by Swyx and Alessio covering agent architectures and tooling (🏷️
NewsletterPodcastWeb). - aibtc.news - Bitcoin-focused agent news platform with bounties and classifieds for the agent economy (🏷️
NewsletterBitcoinWeb). - Awesome Agents Newsletter - Weekly curated tools and reviews covering the latest in AI agent development (🏷️
NewsletterWeeklyWeb). - r/ClaudeAI - Reddit community for Claude users sharing agent workflows, prompts, and integration patterns (🏷️
CommunityRedditForum). - r/LangChain - Reddit community for agent developers using LangChain, LangGraph, and related frameworks (🏷️
CommunityRedditForum). - r/LocalLLaMA - Reddit community for self-hosted LLM users sharing local deployment and agent setup guides (🏷️
CommunityRedditForum). - The Rundown AI - Daily AI digest reaching 600K+ subscribers with concise coverage of agent news and launches (🏷️
NewsletterDailyWeb).
See CHANGELOG.md for the full update history.
Your contributions are what keep this list useful. Read Contributing.md for the entry format, inclusion criteria, and style guide.


