Skip to content

DavidZWZ/Awesome-Deep-Research

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

77 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ€– Awesome Agentic Deep Research Resources

Awesome arXiv Maintenance Contribution Welcome Code

Oryx Video-ChatGPT

Welcome to Awesome-Deep-Research! πŸš€ This repository serves as your comprehensive guide to the cutting-edge world of Agentic Deep Research. We've meticulously curated a collection of resources for you.

DeepResearch Framework

Whether you're a researcher, developer, or enthusiast, this repository is your gateway to exploring the fascinating intersection of artificial intelligence and autonomous agents. For a detailed analysis of the changing paradigm in information search, check out our position paper: From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents πŸ“„, which outlines existing domain trends and future directions. For researchers interested in the broader intersection of RAG and Reasoning, we also recommend exploring our comprehensive collection at Awesome-RAG-Reasoning πŸ”₯πŸ”₯πŸ”₯.

Table of Contents

Industry-Leading Products

  • Gemini Deep Research: Google's advanced research assistant for deep analysis (December 11, 2024)
  • Deep Research: OpenAI's deep research platform [API Guide] (February 2, 2025)
  • Perplexity Deep Research: Perplexity's product for in-depth research and analysis (February 14, 2025)
  • Grok Agents: xAI's autonomous DeepSearch agents powered by Grok-3 (February 19, 2025)
  • Copilot Researcher: Researcher and Analyst in Microsoft 365 Copilot (March 25, 2025)
  • Research: Anthropic's research platform to find and reason with information (April 15, 2025)
  • Manus: Advanced research and analysis platform (March 6, 2025)
  • 🦌 DeerFlow: ByteDance's research and analysis solution (May 9, 2025)
  • Deep Research: Alibaba's Qwen-powered research assistant (May 14, 2025)
  • Kimi-Researcher: Moonshot's research assistant powered by Kimi (June 20, 2025)

Open-Source Implementations

Latest Research Papers

πŸ”₯πŸ”₯πŸ”₯ This section showcases the most recent and impactful research papers in the field of Agentic Deep Research. Each paper represents a significant advancement in the development of autonomous research agents, search capabilities, and reasoning frameworks. The papers are organized chronologically, with the most recent publications at the top. Key areas covered include:

  • πŸ€– Agentic frameworks for deep research
  • πŸ” Search-enhanced reasoning models
  • 🌐 Web agents for deep research
  • πŸ”„ Reasoning and retrieval-augmented generation
  • πŸ“Š Multimodal deep research

πŸš€πŸš€πŸš€ Stay tuned for the hottest breakthroughs in the field!

Title Date & Code Base model Optimization Search Engine Agent Architecture Training Dataset Evaluation Dataset
Search-o1: Agentic Search-Enhanced Large Reasoning Models 2025/01/09 GitHub stars QwQ-32B-Preview Prompting Web Search Single-Agent – GPQA, MATH500, AMC2023, AIME2024, LiveCodeBench, NQ, TriviaQA, HotpotQA, 2WikiMultiHopQA, MuSiQue, Bamboogle
Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research 2025/02/07 GitHub stars N/A Prompting Web Search Multi-Agent – GPQA
AutoAgent: A Fully-Automated and Zero-Code Framework for LLM Agents 2025/02/18 GitHub stars Claude3.5-Sonnet Prompting Web Search Multi-Agent – GAIA
Beyond Outlining: Heterogeneous Recursive Planning for Adaptive Long-form Writing with Language Models 2025/03/11 GitHub stars GPT-4o, Claude3.5-Sonnet Prompting Web Search Multi-Agent – TELL ME A STORY, WildSeek
Open Deep Search: Democratizing Search with Open-source Reasoning Agents 2025/03/26 GitHub stars Llama3.1-70B, Deepseek-R1 Prompting Web Search Single-Agent – SimpleQA, FRAME
Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents 2025/05/17 GitHub stars Qwen2.5-14B, Qwen2.5-7B Prompting Local Retrieval Single-Agent – Musique, NQ, 2WikiMultiHopQA, HotpotQA, Bamboogle, StrategyQA
Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework 2025/06/03 Claude3.7-Sonnet, GPT-4o-mini, Qwen3-235B-A22B, Qwen2.5-VL-72B-Instruct Prompting Web Search Multi-Agent – Pew Research, Our World in Data, Open Knowledge Foundation
VideoDeepResearch: Long Video Understanding With Agentic Tool Using 2025/06/12 GitHub stars GPT-4o, Gemini1.5-pro, Qwen2.5-VL-72B-Instruct Prompting Local Retrieval Multi-Agent – MLVU, Video‑MME, LVBench, LongVideoBench
RAG-Gym: Systematic Optimization of Language Agents for Retrieval-Augmented Generation 2025/05/31 GitHub stars Llama3.1-8B-Instruct, Qwen2.5-7B-Instruct, GPT-4o-mini SFT, RL(PPO, DPO) Local Retrieval Single-Agent HotpotQA, MedQA HotpotQA, 2Wiki, Bamboogle, MedQA
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning 2025/03/07 GitHub stars Qwen2.5-7B-Base, Llama3.1-8B-Instruct SFT, RL(GRPO, Reinforce++) Web Search, Local Retrieval Single-Agent HotpotQA, 2WikiMultiHopQA HotpotQA, 2WikiMultiHopQA, Musique, Bamboogle
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning 2025/03/12 GitHub stars Qwen2.5-7B-Instruct, Qwen2.5-7B-Base, Qwen2.5-3B-Instruct, Qwen2.5-3B-Base RL(PPO, GRPO) Web Search Single-Agent NQ, HotpotQA NQ, TriviaQA, PopQA, HotpotQA, 2WikiMultiHopQA, Musique, Bamboogle
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning 2025/03/25 GitHub stars Qwen2.5-7B-Instruct, Qwen2.5-32B-Instruct RL(GRPO) Web Search Single-Agent MuSiQue HotpotQA, 2WikiMultiHopQA, Musique, Bamboogle
DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments 2025/03/26 GitHub stars Qwen2.5-7B-Instruct RL(GRPO) Web Search Multi-Agent NQ, TQ, HotpotQA, 2WikiMultiHopQA MuSiQue, Bamboogle, PopQA, NQ, TQ, HotpotQA, 2WikiMultiHopQA
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs 2025/04/11 GitHub stars Pangu Ultra-135B SFT, RL Local Retrieval Single-Agent – –
Webthinker: Empowering large reasoning models with deep research capability 2025/04/30 GitHub stars GPT-o1, GPT-o3, Deepseek-R1, QwQ-32B, Qwen2.5-32B-Instruct RL(DPO) Web Search Single-Agent SuperGPQA, WebWalkerQA, OpenThoughts, NaturalReasoning, NuminaMath GPQA, GAIA, WebWalkerQA, Humanity’s Last Exam
ZeroSearch: Incentivize the Search Capability of LLMs without Searching 2025/05/07 GitHub stars Qwen2.5-3B-Base, Qwen2.5-7B-Base, Qwen2.5-7B-Instruct, Qwen2.5-3B-Instruct, Llama3.2-3B-Instruct, Llama3.2-3B-Base RL(Reinforce, GRPO, PPO) Web Search Single-Agent NQ, HotpotQA NQ, TriviaQA, PopQA, HotpotQA, 2WikiMultiHopQA, Musique, Bamboogle
Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent 2025/05/12 GitHub stars Qwen2.5-3B-Instruct, Qwen2.5-7B-Instruct RL(GRPO) Local Retrieval Single-Agent NQ, HotpotQA PopQA, 2WikiMultihopQA
s3 - Efficient Yet Effective Search Agent Training via RL 2025/05/20 GitHub stars Qwen2.5-7B-Instruct RL(PPO) Local Retrieval Single-Agent NQ, HotpotQA NQ, TriviaQA, PopQA, HotpotQA, 2wiki, Musique, MedQA-US, MedMCQA, PubMedQA, BioASQ-Y/N, MMLU-Med
Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning 2025/05/22 GitHub stars Qwen2.5-7B-Instruct RL(DPO) Local Retrieval Single-Agent PopQA, HotpotQA, 2WikiMultihopQA PopQA, HotpotQA, 2WikiMultiHopQA, Bamboogle, MuSiQue
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning 2025/05/22 GitHub stars Qwen2.5-7B-Instruct SFT, RL Local Retrieval Single-Agent HotpotQA, 2WikiMultiHopQA HotpotQA, 2WikiMultiHopQA, Musique, Bamboogle
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning 2025/05/22 GitHub stars Qwen2.5-3B, Llama3.1-8B SFT, RL(M-GRPO) Web Search Single-Agent WebArena-Lite, WebArena WebArena-Lite, WebArena
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis 2025/05/25 GitHub stars Qwen2.5-7B-Instruct, Qwen2.5-32B-Instruct, DeepseekDistilled-Qwen2.5-32B, QwQ-32B SFT Web Search Single-Agent NQ, SimpleQA, HotpotQA, 2WikiMultiHopQA, MuSiQue, MultiHopRAG Bamboogle, FRAMES, GAIA, NQ, SimpleQA, HotpotQA, 2WikiMultiHopQA, MuSiQue, MultiHopRAG
MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability 2025/05/27 GitHub stars Llama3.1-8B, Llama3.2-3B, Llama3.2-1B, Llama3, Qwen2.5-7B, Qwen2.5-3B, Qwen2.5-1.5B, Qwen2.5 SFT, RL(DAPO) Local Retrieval Multi-Agent HotpotQA HotpotQA, FanoutQA, Musique, 2WikiMultiHopQA, Bamboogle, FreshQA
MMSearch-R1: Incentivizing LMMs to Search 2025/06/25 GitHub stars Qwen2.5-VL-7B RL(GRPO) Web Search Single-Agent VQA, MetaClip, FVQA, InfoSeek FVQA-test, InfoSeek, MMSearch, SimpleVQA, LiveVQA
Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward 2025/08/18 GitHub stars Qwen2.5-7B RL(GRPO) Web Search Single-Agent NQ, SimpleQA, HotpotQA, 2WikiMultiHopQA, MuSiQue, MultiHopRAG Bamboogle, NQ, SimpleQA, HotpotQA, 2WikiMultiHopQA, MuSiQue, MultiHopRAG

Benchmarks and Applications

Benchmarks Plot

  • Humanity's Last Exam [Paper] [Code] GitHub Repo stars
  • BrowseComp: A Simple Yet Challenging Benchmark for Browsing Agents [Paper] [Code] GitHub Repo stars
  • BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese '[Paper]' [Code] GitHub Repo stars
  • DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents [Paper] [Code] GitHub Repo stars
  • MedBrowseComp: Benchmarking Medical Deep Research and Computer Use [Paper] [Code] GitHub Repo stars
  • Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge [Paper] [Code] GitHub Repo stars

Contributing and Citations

🀝 We welcome contributions to expand this comprehensive collection of Agentic Deep Research resources!

πŸ“ How to Contribute

Adding New Research Papers and Benchmarks:

  • Submit an issue with the paper details (title, arXiv link, all the categories in our paper table, and GitHub repo if available)
  • Or create a pull request with the paper added to the research papers table or the benchmarks section

Adding New Open-Source Implementations and New Products:

  • Submit an issue with the repository details (name, description, release data, GitHub link if available)
  • Or create a pull request with the implementation added to the open-source and products section

πŸ“– Citation

πŸ”₯πŸ”₯πŸ”₯ If you find this repository useful, please cite our papers:

@article{zhang2025web,
  title={From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents},
  author={Zhang, Weizhi and Li, Yangning and Bei, Yuanchen and Luo, Junyu and Wan, Guancheng and Yang, Liangwei and Xie, Chenxuan and Yang, Yuyao and Huang, Wei-Chieh and Miao, Chunyu and others},
  journal={arXiv preprint arXiv:2506.18959},
  year={2025}
}

@article{li2025towards,
  title={Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs},
  author={Li, Yangning and Zhang, Weizhi and Yang, Yuyao and Huang, Wei-Chieh and Wu, Yaozu and Luo, Junyu and Bei, Yuanchen and Zou, Henry Peng and Luo, Xiao and Zhao, Yusheng and others},
  journal={arXiv preprint arXiv:2507.09477},
  year={2025}
}

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •