A curated list of awesome AI agents, LLM frameworks, autonomous agent systems, and multi-agent orchestration tools for building intelligent applications.
AI agents are autonomous systems powered by large language models (LLMs) that can perceive, reason, plan, and act to achieve goals. This list covers frameworks, tools, and resources for building production-ready AI agent systems.
- Claude Opus 4.5 - Anthropic's best model for agents, scoring 80.9% on SWE-bench
- OpenAI Operator - Browser automation agent in research preview
- MCP (Model Context Protocol) - Emerging standard for tool integration
- Devin GA - Cognition's AI software engineer now generally available
- Multi-agent systems - Production-ready frameworks becoming mainstream
- Multi-Agent Frameworks
- Agent Orchestration
- LLM Integration
- Tool & Function Calling
- Memory & State Management
- Agent Development Platforms
- Production & Deployment
- Learning Resources
- Research Papers
- AutoGen - Microsoft's framework for building multi-agent conversational systems.
- CrewAI - Framework for orchestrating role-playing autonomous AI agents.
- LangGraph - Build stateful multi-actor applications with LLMs using graph-based workflows.
- Swarm - OpenAI's educational framework for exploring lightweight multi-agent orchestration.
- AgentScope - Framework that simplifies building multi-agent applications with LLMs.
- Devin - Cognition's autonomous AI software engineer, now generally available.
- OpenHands - Open-source autonomous coding agent (formerly OpenDevin).
- AutoGPT - Experimental autonomous GPT-4 agent that chains tasks.
- BabyAGI - AI-powered task management system using OpenAI and vector databases.
- GPT Engineer - Specify what you want to build; the AI asks clarifying questions, then builds it.
- MetaGPT - Multi-agent framework that assigns different roles to GPTs to form collaborative software entities.
- Goose - Block/Square's open-source AI agent with an extensible toolkit system.
- OpenAI Operator - OpenAI's browser automation agent (research preview).
- Stagehand - AI browser automation built on Playwright with act/extract/observe primitives.
- browser-use - Open-source Python framework for AI web automation.
- Skyvern - Computer vision-based browser automation API.
- Anthropic Computer Use - Claude's native computer use capability for GUI automation.
- LangChain - Framework for building LLM applications through composability.
- Semantic Kernel - Microsoft's SDK for integrating LLMs with conventional programming languages.
- Haystack - End-to-end framework for building production-ready LLM applications.
- Composio - Integration platform for AI agents with 150+ tools.
- TaskWeaver - Code-first framework for building LLM-powered autonomous agents.
- AgentGPT - Assemble, configure, and deploy autonomous AI agents in your browser.
- ix - Autonomous GPT-4 agent platform.
- OpenAI API - GPT-5, GPT-4o, o1/o3 reasoning models with function calling.
- Anthropic Claude - Claude Opus 4.5 (best for agents), Claude 3.5 Sonnet with 200K context.
- Google Gemini - Gemini 3 Pro with 1M+ context and function calling.
- Ollama - Run open-source LLMs locally (Llama 3, Mistral, Qwen, etc.).
- vLLM - High-throughput LLM serving with PagedAttention.
- Gorilla - LLM connected to a massive number of APIs.
- ToolLLM - Framework for helping large language models master 16,000+ real-world APIs.
- OpenAI Functions - Native function calling in GPT-4 and GPT-3.5.
- LangChain Tools - Pre-built tools for agents.
- Transformers Agents - Hugging Face toolkit for building agents.
- n8n - Workflow automation with AI capabilities and 400+ integrations.
- Zapier AI Actions - Expose Zapier actions to LLMs.
- Flowise - Visual builder for AI workflows.
- Pinecone - Vector database for LLM applications.
- Weaviate - Open-source vector search engine.
- Qdrant - Vector similarity search engine with extended filtering support.
- Chroma - Open-source embedding database.
- Milvus - Cloud-native vector database.
- Mem0 - Memory layer for AI agents.
- Zep - Long-term memory for AI assistants.
- LangMem - Memory management for LangChain agents.
- Flowise - Drag-and-drop UI to build LLM flows.
- Dify - LLMOps platform for building AI applications.
- PromptFlow - Tooling that streamlines the development cycle of LLM applications.
- Botpress - Open-source conversational AI platform.
- LangSmith - Platform for debugging, testing, and monitoring LLM applications.
- Weights & Biases - Developer tools for ML experiment tracking.
- MLflow - Platform for ML lifecycle management.
- Ray - Distributed computing framework for ML workloads.
- BentoML - Unified framework for building, shipping, and scaling ML services.
- Seldon Core - ML deployment on Kubernetes.
- LangFuse - Open-source LLM engineering platform.
- Phoenix - AI observability and evaluation platform.
- Helicone - Open-source LLM observability platform.
- DeepLearning.AI Courses - AI agent development courses.
- LangChain Academy - Official LangChain learning platform.
- Microsoft Learn - Semantic Kernel tutorials.
- Building LLM Apps - O'Reilly guide to LLM application development.
- Prompt Engineering Guide - Comprehensive prompt engineering resource.
- LangChain Discord - LangChain community.
- AutoGPT Discord - AutoGPT development community.
- r/LocalLLaMA - Subreddit for local LLM deployment.
- ReAct: Synergizing Reasoning and Acting in Language Models - Foundation for agent reasoning patterns.
- Toolformer: Language Models Can Teach Themselves to Use Tools - Self-taught tool use.
- Generative Agents: Interactive Simulacra of Human Behavior - Stanford's agent simulation research.
- AutoGPT: An Autonomous GPT-4 Experiment - Foundational autonomous agent work.
Contributions are welcome! Please read the contribution guidelines first.