GPT Researcher

An autonomous LLM agent that conducts comprehensive web and local research on any topic, producing detailed, cited long-form reports to address hallucination and bias in current AI models.

One Minute Overview#

GPT Researcher is an autonomous AI agent that acts more like a human analyst than a simple chatbot. It performs research by formulating questions, crawling multiple sources in parallel, filtering bias, and aggregating data into a comprehensive report with citations (often 2000+ words). It addresses the issues of outdated knowledge and hallucination in standard LLMs, making it ideal for scenarios requiring accurate, source-verified information.

Core Value: Automates weeks of manual research into minutes, delivering verifiable, cited, and objective factual reports to drastically boost information efficiency.

Quick Start#

Installation Difficulty: Medium - Requires Python environment and API keys (OpenAI + Tavily).

# 1. Clone the project
git clone https://github.com/assafelovic/gpt-researcher.git
cd gpt-researcher

# 2. Install dependencies
pip install -r requirements.txt

# 3. Configure API keys (in .env file or export)
export OPENAI_API_KEY="Your OpenAI Key"
export TAVILY_API_KEY="Your Tavily Search Key"

# 4. Start the server
python -m uvicorn main:app --reload

Is this suitable for me?

✅ Content Creators/Analysts: Need to quickly understand new topics and generate cited drafts.

✅ Investors/Researchers: Need multi-dimensional fact-checking on companies or trends.

❌ Casual Chat: If you just need quick chit-chat or simple Q&A, standard ChatGPT is sufficient.

❌ No API Budget: Running the agent requires LLM and Search API credits, incurring small costs.

Core Capabilities#

1. Deep Research & Aggregation - Solves Information Bias#

GPT Researcher doesn't rely on a single source; it automatically crawls and aggregates information from over 20 different websites and resources in parallel. Value: By cross-referencing data points, it significantly reduces bias and errors common in single-source results, ensuring objective conclusions.

2. Auto-Citation & Sourcing - Solves AI Hallucination#

Every key fact in the generated report includes a source link, with exports available in Markdown, PDF, and Word formats. Value: Users can verify information with a single click, which is critical for academic, business, or serious content creation.

3. Deep Research Mode - Vertical Exploration#

Features a "tree-like exploration" pattern that drills down into sub-topics like an expert, rather than just skimming the surface. Value: For complex topics, it generates near-expert level depth reports, providing a comprehensive view.

4. Hybrid Web & Local Retrieval#

Research isn't limited to the internet; it can ingest and analyze local files like PDFs, Word docs, and Excel sheets alongside web data. Value: Businesses can combine internal proprietary data with external web intelligence for comprehensive market analysis.

Tech Stack & Integration#

Languages: Python (Backend), TypeScript/JavaScript (Frontend) Core Architecture:

FastAPI: High-performance web service framework.
LangGraph: Orchestrates the multi-agent workflow (Planner, Executor, Publisher).
Next.js: Provides a modern, production-grade frontend interface.

Integration Methods:

Code Library (PIP): Can be embedded directly into Python scripts as gpt-researcher.
MCP Protocol: Supports Model Context Protocol to connect with GitHub, databases, etc.
Docker: Full containerized deployment setup available.

Ecosystem & Extensions#

MCP Server: A dedicated server is provided to allow AI clients like Claude Desktop to utilize GPT Researcher's capabilities directly.
Multi-Agent Assistant: Built on LangGraph, allowing specialized agents to collaborate (e.g., one for searching, one for writing).
Frontend Options: Offers both a lightweight HTML interface and a feature-rich NextJS application.

Maintenance Status#

Development Activity: Active. The project recently added "Deep Research" mode and MCP integration; updates are frequent.
Community Response: Large Discord community; high star count. It is currently one of the most popular AI Agent projects on GitHub.
Documentation Quality: Comprehensive. Provides complete docs from installation to API references and tutorials.

Commercial & Licensing#

License: Apache-2.0

✅ Commercial Use: Allowed
✅ Modification: Allowed
⚠️ Disclaimer: This is an experimental project provided "as-is". Generated content is for academic/reference purposes only and not professional advice (medical, legal, financial).

One Minute Overview#

Quick Start#

Core Capabilities#

1. Deep Research & Aggregation - Solves Information Bias#

2. Auto-Citation & Sourcing - Solves AI Hallucination#

3. Deep Research Mode - Vertical Exploration#

4. Hybrid Web & Local Retrieval#

Tech Stack & Integration#

Ecosystem & Extensions#

Maintenance Status#

Commercial & Licensing#

Related Projects

Zylos Core

verl

Kalshi AI Trading Bot

STAY UPDATED