Research Agent CLI

A multi-phase pipeline for automated academic research. The system discovers papers, constructs a vector-indexed library, generates structured documents with inline citations, and iterates through peer review until acceptance criteria are met.

Supports configurable private PDF sources for institutions with licensed access.

Architecture

flowchart TB
    subgraph INPUT
        Topic[Research Topic]
    end

    subgraph PLANNING["Phase 1: Planning"]
        Plan[LLM Decomposition]
        Plan --> Questions[Research Questions]
        Plan --> Concepts[Key Concepts]
        Plan --> ArgMap[Argument Map]
        Plan --> Queries[Search Queries]
    end

    subgraph DISCOVERY["Phase 2: Discovery"]
        direction TB
        SS[Semantic Scholar API]
        Exa[Exa.ai Neural Search]
        Cite[Citation Networks]
        Queries --> SS
        Queries --> Exa
        Queries --> Cite
        SS --> Papers[Paper Candidates]
        Exa --> Papers
        Cite --> Papers
    end

    subgraph ACQUISITION["Phase 3: Acquisition"]
        direction TB
        ArXiv[ArXiv]
        Unpay[Unpaywall]
        Private[Private Sources]
        Papers --> ArXiv
        Papers --> Unpay
        Papers --> Private
        ArXiv --> PDFs[PDF Collection]
        Unpay --> PDFs
        Private --> PDFs
    end

    subgraph INDEXING["Phase 4: Indexing"]
        PaperQA[PaperQA2 Chunking]
        Qdrant[(Qdrant Vector DB)]
        PDFs --> PaperQA
        PaperQA --> Qdrant
    end

    subgraph TOOLS["Shared Tool Registry"]
        T1[query_library]
        T2[discover_papers]
        T3[add_paper]
        T4[fuzzy_cite]
        T5[validate_citations]
        T6[literature_sheet]
    end

    subgraph DRAFTING["Phase 5: Drafting Agent"]
        direction TB
        DraftAgent[ReAct Loop]
        DraftAgent --> Draft[Typst Draft]
    end

    subgraph REVIEW["Phase 6-7: Review & Revision Loop"]
        ReviewAgent[Peer Review]
        Draft --> ReviewAgent
        
        subgraph Checks
            C1[Citation Validity]
            C2[Claim Grounding]
            C3[Coverage Gaps]
            C4[Counter-Arguments]
        end
        
        ReviewAgent --> Checks
        Checks --> Verdict{Verdict}
        Verdict -->|ACCEPTED| Final[Finalize]
        Verdict -->|REVISIONS| Limit{Count < Max?}
        Limit -->|Yes| ReviseAgent[Reviser Agent]
        Limit -->|No| Final
        ReviseAgent --> DraftAgent
    end


    subgraph FINALIZATION
        Final --> BibFilter[Filter Bibliography]
        Final --> StarHash[Generate Star Hash]
        Final --> Compile[Typst Compile]
    end

    subgraph OUTPUT
        Compile --> PDF[main.pdf]
        BibFilter --> Bib[refs.bib]
        Compile --> Artifacts[artifacts/]
    end

    subgraph NOTIFICATIONS["Live Notifications"]
        UI[Terminal UI]
        MacOS[MacOS Alerts]
        Telegram[Telegram Bot]
    end

    Topic --> Plan
    Qdrant --> DraftAgent
    Qdrant --> ReviewAgent
    Qdrant --> ReviseAgent

    %% Tool connections
    TOOLS <--> DraftAgent
    TOOLS <--> ReviewAgent
    TOOLS <--> ReviseAgent

    %% Notification connections
    DraftAgent -.-> UI
    ReviewAgent -.-> UI
    ReviseAgent -.-> UI
    Final -.-> MacOS
    Final -.-> Telegram
    PDF -.-> Telegram

    style PLANNING fill:#1a1a2e
    style DISCOVERY fill:#16213e
    style ACQUISITION fill:#1a1a2e
    style INDEXING fill:#16213e
    style DRAFTING fill:#1a1a2e
    style REVIEW fill:#16213e
    style FINALIZATION fill:#1a1a2e
    style TOOLS fill:#232634,stroke:#8aadf4
    style NOTIFICATIONS fill:#232634,stroke:#a6da95
    style Checks fill:#232634,stroke:#f5a97f

Installation

git clone https://github.com/gbrlpzz/research-agent-cli.git
cd research-agent-cli
python3 -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt
cp .env.example .env  # Configure API keys

Typst is required for PDF compilation:

brew install typst  # macOS
# Or download from https://github.com/typst/typst/releases

Authentication & Costs

The agent supports two modes of operation:

API Key Mode: Standard billing via GEMINI_API_KEY or OPENAI_API_KEY.
"Free Cost" Mode (Recommended): Use your existing Gemini/Google quota via OAuth.

Setting up "Free Cost" Mode

This mode allows you to use Gemini models (and Antigravity models) without incurring per-token API costs, leveraging your plan's included quota.

Login: Run the OAuth script:

research gemini-login
# Follow the browser prompt to authenticate with your Google Cloud Project

Verify:
```
research gemini-status
```
Run: The agent will auto-detect OAuth credentials and switch to "Free Cost" mode (budget set to HIGH by default).
Antigravity Users: Run research antigravity-login to access internal models.

Usage

research agent "Attention mechanisms in transformer architectures"

Output directory structure:

reports/<timestamp>_<topic>/
├── main.typ              # Typst source
├── main.pdf              # Compiled document
├── refs.bib              # Bibliography (cited papers only)
├── literature_sheet.csv  # Paper metadata
└── artifacts/            # Intermediate outputs (plans, drafts, reviews)

Command Reference

Command	Function
`research agent <topic>`	Execute full pipeline
`research agent -i <topic>`	Interactive mode (model selection, iteration limits)
`research agent --resume <path>`	Resume from checkpoint
`research qa <question>`	Query library via RAG
`research <query>`	Search Semantic Scholar
`research add <id>`	Add paper by DOI or arXiv ID
`research cite [query]`	Fuzzy-match citation keys
`research exa <query>`	Neural search via Exa.ai
`research gemini-login`	Authenticate with Gemini OAuth (Free Quota)
`research antigravity-login`	Authenticate with Antigravity OAuth

Agent Options

Options:
  -i, --interactive           Configuration menu before execution
  --resume PATH               Resume from existing report directory
  -r, --revisions N           Maximum peer review rounds (default: 3)
  --reviewers N               Parallel reviewer count (default: 1)
  --budget {low,balanced,high}  Cost/quality tradeoff (default: low)
  --reasoning-model MODEL     LLM for planning, drafting, review
  --rag-model MODEL           LLM for PaperQA queries
  --embedding-model MODEL     Embedding model for indexing

Budget Modes

Mode	Model Selection	Citation Target	Strategy
`low`	gemini-2.5-flash	10+	Query existing library first; discover only for gaps
`balanced`	Mixed (flash/pro)	12+	Query first; discover when gaps identified
`high`	gemini-3-pro	15+	Proactive discovery; traverse citation networks

Configuration

Environment variables (.env):

# Required: at least one LLM API key
GEMINI_API_KEY=<key>
OPENAI_API_KEY=<key>

# Model routing (optional)
RESEARCH_REASONING_MODEL=gemini/gemini-2.5-flash
RESEARCH_RAG_MODEL=gemini/gemini-2.5-flash
RESEARCH_EMBEDDING_MODEL=gemini/text-embedding-004


# Discovery APIs (optional)
SEMANTIC_SCHOLAR_API_KEY=<key>
EXA_API_KEY=<key>

# Telegram notifications (optional)
TELEGRAM_BOT_TOKEN=<token>
TELEGRAM_CHAT_ID=<chat_id>

# Tuning parameters (optional)
AGENT_MAX_ITERATIONS=50
REVISION_MAX_ITERATIONS=25
MAX_REVIEWER_ITERATIONS=15
API_TIMEOUT_SECONDS=120

Terminal Interface

The agent features a live TUI dashboard showing:

Header: Topic, model, cost, elapsed time
Status: Current phase, LLM/Embedding token breakdown
Logs: Scrolling activity log

The UI persists throughout all phases and closes cleanly on completion or error.

Notifications

MacOS

Native notifications are sent automatically:

Success: Banner notification when research completes
Failure: Modal alert if an error occurs

Telegram

Configure TELEGRAM_BOT_TOKEN and TELEGRAM_CHAT_ID in .env to receive:

Live phase updates on your phone
Final PDF delivered to your chat

Project Structure

research-agent-cli/
├── bin/research                # CLI entry point
├── scripts/
│   ├── agent.py                # Pipeline orchestration
│   ├── phases/
│   │   ├── orchestrator.py     # Model selection, cost tracking
│   │   ├── planner.py          # Research decomposition
│   │   ├── drafter.py          # Document generation
│   │   ├── reviewer.py         # Peer review
│   │   ├── reviser.py          # Revision handling
│   │   └── tool_registry.py    # Tool definitions
│   ├── tools/                  # Tool implementations
│   ├── qa.py                   # RAG interface
│   └── utils/                  # LLM wrapper, prompts, Typst utilities
├── library/                    # PDF storage
├── reports/                    # Generated outputs
├── templates/typst-template/   # Document template (submodule)
└── master.bib                  # Master bibliography

Dependencies

Component	Function
PaperQA2	RAG over academic papers
Qdrant	Vector storage
LiteLLM	Multi-provider LLM routing
Papis	Bibliography management
Typst	Document compilation
Semantic Scholar API	Paper discovery
Star Hash	Astronomic document fingerprint

Limitations

Requires LLM API access (Gemini or OpenAI)
PDF availability varies by source
Generated citations require manual verification
Semantic Scholar and Exa.ai impose rate limits
Optimized for English-language literature

License

Apache License 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 161 Commits
.venv		.venv
bin		bin
reports		reports
research-agent-telegram-bot		research-agent-telegram-bot
scripts		scripts
templates		templates
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
papis.config		papis.config
requirements.txt		requirements.txt
run_agent.command		run_agent.command

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Research Agent CLI

Architecture

Installation

Authentication & Costs

Setting up "Free Cost" Mode

Usage

Command Reference

Agent Options

Budget Modes

Configuration

Terminal Interface

Notifications

MacOS

Telegram

Project Structure

Dependencies

Limitations

License

About

Uh oh!

Releases

Packages

Languages

License

gbrlpzz/research-agent-cli

Folders and files

Latest commit

History

Repository files navigation

Research Agent CLI

Architecture

Installation

Authentication & Costs

Setting up "Free Cost" Mode

Usage

Command Reference

Agent Options

Budget Modes

Configuration

Terminal Interface

Notifications

MacOS

Telegram

Project Structure

Dependencies

Limitations

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages