AI Intern Agent

An autonomous Go-based engineering assistant that reads JIRA tickets assigned to it, analyzes the target repository, generates and applies code changes using an AI provider (Anthropic Claude or local LLMs via Ollama), opens a GitHub Pull Request, and updates the JIRA ticket status.

Overview

  • Ticketing integration (JIRA) to fetch and update tickets
  • Repository integration (Git/GitHub) to branch, commit, push, and open PRs
  • AI provider facade with Anthropic and Ollama implementations that plan code changes from the ticket description and repo context
  • Smart context selection using keyword extraction and file scoring to optimize token usage
  • Orchestrator with a pipeline-like workflow per ticket
  • Configurable concurrency and repository context limits

Architecture

  • internal/orchestrator/: Coordinates the end-to-end workflow
    • coordinator.go: Ticket processing loop, worker pool, pipeline
    • branch.go: Branch naming utilities (sanitization, prefixing)
  • internal/ticketing/: Ticketing service facade
    • jira/: Concrete JIRA client implementation
  • internal/repository/: Repository service facade
    • github/: Concrete GitHub client based on go-git and go-github
  • internal/ai/: AI facade and shared types
    • agent/: Agent interface and shared types (CodeChange, UsageMetrics)
    • agent/anthropic/: Anthropic Claude provider
    • agent/ollama/: Ollama local LLM provider
    • context_builder.go: Builds a compact repo context for prompting with smart file selection
  • internal/provider/: Provider factory for AI agent instantiation
  • internal/indexer/: Smart context selection components
    • indexer.go: Builds and manages file index for the repository
    • keywords.go: Extracts keywords from ticket descriptions (file paths, identifiers)
    • scorer.go: Scores files based on keyword relevance with tiered matching
    • parser.go: Extracts minimal context from Go files (signatures without implementations)
  • internal/config/: Configuration loading and validation
  • internal/util/: Shared utility functions
  • cmd/agent/: Entry-point wiring

Data flow (simplified)

  1. Orchestrator fetches tickets from JIRA
  2. Prepares repository (clone/sync, switch to base branch)
  3. For each ticket:
    • Creates a feature branch
    • Builds smart repo context: extracts keywords from ticket, scores files by relevance, selects top matches
    • Calls AI agent to plan changes based on minimal context (function signatures, not implementations)
    • Applies changes, commits, pushes
    • Creates a PR and updates ticket status to Done
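A minimal sketch of this per-ticket flow as a pipeline of steps, assuming hypothetical type and step names (the real steps live in internal/orchestrator/coordinator.go and may differ):

    package orchestrator

    import (
        "context"
        "fmt"
    )

    type Ticket struct{ Key string } // hypothetical minimal ticket type

    type namedStep struct {
        name string
        run  func(ctx context.Context, t *Ticket) error
    }

    // processTicket runs each step in order and stops at the first failure,
    // so a failed step never results in a half-finished PR.
    func processTicket(ctx context.Context, t *Ticket, steps []namedStep) error {
        for _, s := range steps {
            if err := s.run(ctx, t); err != nil {
                return fmt.Errorf("ticket %s: step %q: %w", t.Key, s.name, err)
            }
        }
        return nil
    }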

Smart Context Selection

The agent uses an intelligent context selection system to optimize token usage while providing relevant code context to the AI:

How it works (hedged Go sketches of the scoring and signature extraction follow this list):

  1. Keyword Extraction (keywords.go): Extracts technical terms from ticket description

    • File paths: internal/auth/login.go
    • Identifiers: camelCase, PascalCase, snake_case variables/functions
    • Filters common stop words (the, and, with, etc.)
  2. File Scoring (scorer.go): Ranks files by relevance using tiered matching

    • Exact path match: +15 points
    • Path contains keyword: +8 points
    • Path segment matches: +5 points
    • Segment contains keyword: +2 points
    • Category multipliers: core (1.5x), config (1.2x), test (0.7x), doc (0.5x)
  3. Minimal Context Extraction (parser.go): For Go files, extracts only:

    • Package name and imports
    • Type definitions
    • Function signatures (no implementation bodies)
    • Results in 60-80% token reduction vs full file content
  4. Graceful Fallback: If index unavailable or no keywords extracted, falls back to simple context builder
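Steps 1 and 2 can be condensed into a hedged Go sketch. The tier values are the ones listed above; the regexes and the ExtractKeywords/Score names are illustrative, not the actual keywords.go/scorer.go API:

    package indexer

    import (
        "regexp"
        "strings"
    )

    var (
        pathRe  = regexp.MustCompile(`[\w./-]+\.\w+`)             // e.g. internal/auth/login.go
        identRe = regexp.MustCompile(`[A-Za-z_][A-Za-z0-9_]{2,}`) // camelCase, PascalCase, snake_case
        stop    = map[string]bool{"the": true, "and": true, "with": true} // abbreviated stop list
    )

    // ExtractKeywords pulls file paths and identifiers out of a ticket description.
    func ExtractKeywords(desc string) []string {
        seen := map[string]bool{}
        var out []string
        for _, re := range []*regexp.Regexp{pathRe, identRe} {
            for _, m := range re.FindAllString(desc, -1) {
                k := strings.ToLower(m)
                if !stop[k] && !seen[k] {
                    seen[k] = true
                    out = append(out, k)
                }
            }
        }
        return out
    }

    // Score ranks one file path against the keywords using the tiers above:
    // exact path +15, path contains +8, segment equals +5, segment contains +2.
    func Score(path string, keywords []string, categoryMultiplier float64) float64 {
        p := strings.ToLower(path)
        segs := strings.FieldsFunc(p, func(r rune) bool { return r == '/' || r == '.' })
        var score float64
        for _, kw := range keywords {
            switch {
            case p == kw:
                score += 15
            case strings.Contains(p, kw):
                score += 8
            default:
                for _, s := range segs {
                    if s == kw {
                        score += 5
                    } else if strings.Contains(s, kw) {
                        score += 2
                    }
                }
            }
        }
        return score * categoryMultiplier // core 1.5, config 1.2, test 0.7, doc 0.5
    }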
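Step 3 maps naturally onto the standard go/ast tooling: parse the file, drop every function body, and print what is left. A sketch assuming a hypothetical MinimalContext helper (the real parser.go may differ):

    package indexer

    import (
        "go/ast"
        "go/parser"
        "go/printer"
        "go/token"
        "io"
    )

    // MinimalContext writes path's package clause, imports, type declarations,
    // and function signatures (with bodies stripped) to w.
    func MinimalContext(path string, w io.Writer) error {
        fset := token.NewFileSet()
        file, err := parser.ParseFile(fset, path, nil, parser.ParseComments)
        if err != nil {
            return err
        }
        for _, decl := range file.Decls {
            if fn, ok := decl.(*ast.FuncDecl); ok {
                fn.Body = nil // strip the implementation, keep the signature
            }
        }
        return printer.Fprint(w, fset, file)
    }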

Benefits:

  • 70% cost reduction: From ~$0.15 to ~$0.045 per ticket (based on testing)
  • Better relevance: Only includes files actually mentioned or related to the ticket
  • Faster processing: Less context means faster AI responses

Design Patterns and Practices

  • Interface-based architecture: ticketing, repository, and ai use interfaces with DI-friendly services
  • Provider facade: AI is abstracted behind the agent.Agent interface; Anthropic and Ollama are concrete implementations under internal/ai/agent/
  • Pipeline/Chain of Responsibility: Orchestrator breaks the workflow into small, testable steps
  • Worker Pool: concurrency capped at MaxConcurrentTickets using a bounded semaphore (sketched after this list)
  • Configuration-driven: Environment variable based config with defaults; supports WorkingDir, BaseBranch, BranchPrefix
  • Repository context limiting: Restricts number and size of files included in the AI prompt to control token usage
  • Guardrails: Skips PR creation if no effective changes detected
  • Logging: Consistent structured logging via a logger package
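A minimal sketch of that bounded-semaphore worker pool, assuming string ticket keys and a hypothetical runWorkers helper (the real loop lives in internal/orchestrator/coordinator.go):

    package orchestrator

    import "sync"

    // runWorkers processes tickets with at most maxConcurrent in flight,
    // mirroring the MAX_CONCURRENT_TICKETS cap.
    func runWorkers(tickets []string, maxConcurrent int, process func(string)) {
        sem := make(chan struct{}, maxConcurrent) // bounded semaphore
        var wg sync.WaitGroup
        for _, key := range tickets {
            wg.Add(1)
            sem <- struct{}{} // blocks while maxConcurrent workers are busy
            go func(key string) {
                defer wg.Done()
                defer func() { <-sem }() // release the slot
                process(key)
            }(key)
        }
        wg.Wait()
    }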

Requirements

  • Go 1.22+ (latest recommended)
  • Access tokens for JIRA and GitHub
  • AI Provider (choose one):
    • Anthropic Claude: an API key (ANTHROPIC_API_KEY)
    • Ollama: a local server with a pulled code model (see Configuration below)

Quick Start

  1. Clone the repo and install dependencies:

    go mod tidy
  2. Initialize sample config files:

    go run cmd/agent/main.go --init
  3. Copy .env.example, fill in the values, and export the variables (or keep them in a .env file)

  4. (Optional but recommended) Build the file index for smart context selection:

    go run cmd/agent/main.go --build-index

    This creates a .ai-intern/file_index.json in your repository with metadata about all files. The agent will use this for intelligent file selection.

  5. Run the agent:

    go run cmd/agent/main.go

Configuration

Environment variables (examples):

  • AI Provider (required):

    • AI_PROVIDER: "anthropic" or "ollama" (default: "anthropic")
  • Anthropic (required if AI_PROVIDER=anthropic):

    • ANTHROPIC_API_KEY: your Anthropic API key
  • Ollama (required if AI_PROVIDER=ollama):

    • OLLAMA_BASE_URL: Ollama server URL (default: "http://localhost:11434")
    • OLLAMA_MODEL: Model name (e.g., "qwen2.5-coder:7b", "deepseek-coder:6.7b")
    • See Ollama Setup Guide for installation and model recommendations
  • JIRA:

    • JIRA_URL, JIRA_EMAIL, JIRA_API_TOKEN, JIRA_PROJECT_KEY
    • Status transitions: a transition mapping can be supplied in code or config (YAML or env) to control how ticket statuses move
  • GitHub:

    • GITHUB_TOKEN, GITHUB_OWNER, GITHUB_REPO
  • Agent:

    • AGENT_USERNAME, POLLING_INTERVAL (e.g., 30s), MAX_CONCURRENT_TICKETS
    • WORKING_DIR (default ./workspace)
    • BASE_BRANCH (default main)
    • BRANCH_PREFIX (e.g., feature)

Quick Config Examples

Using Anthropic Claude:

AI_PROVIDER=anthropic
ANTHROPIC_API_KEY=sk-ant-...

Using Local Ollama:

AI_PROVIDER=ollama
OLLAMA_BASE_URL=http://localhost:11434
OLLAMA_MODEL=qwen2.5-coder:7b
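A fuller example covering JIRA, GitHub, and agent settings (every value below is a placeholder):

# AI provider
AI_PROVIDER=anthropic
ANTHROPIC_API_KEY=sk-ant-...

# JIRA
JIRA_URL=https://your-org.atlassian.net
JIRA_EMAIL=agent@example.com
JIRA_API_TOKEN=...
JIRA_PROJECT_KEY=PROJ

# GitHub
GITHUB_TOKEN=ghp_...
GITHUB_OWNER=your-org
GITHUB_REPO=your-repo

# Agent
AGENT_USERNAME=ai-intern
POLLING_INTERVAL=30s
MAX_CONCURRENT_TICKETS=2
WORKING_DIR=./workspace
BASE_BRANCH=main
BRANCH_PREFIX=feature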

How It Works

  • The orchestrator loops on POLLING_INTERVAL:
    • Prepares the local repo (clone/sync, switch to base)
    • Spawns up to MAX_CONCURRENT_TICKETS workers
    • Each worker processes a ticket end-to-end and marks it done when the PR is created
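A sketch of that outer loop, assuming hypothetical pollLoop and runBatch names (the real loop is in internal/orchestrator/coordinator.go):

    package orchestrator

    import (
        "context"
        "time"
    )

    // pollLoop runs one batch immediately, then again on every tick,
    // until the context is cancelled.
    func pollLoop(ctx context.Context, interval time.Duration, runBatch func(context.Context)) {
        ticker := time.NewTicker(interval)
        defer ticker.Stop()
        for {
            runBatch(ctx) // fetch tickets, prepare repo, spawn workers
            select {
            case <-ctx.Done():
                return
            case <-ticker.C:
            }
        }
    }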

Extensibility

  • AI Providers: Implement the agent.Agent interface and register it with the factory (see internal/provider/factory.go); a provider skeleton follows this list
    • Existing: Anthropic Claude (agent/anthropic/), Ollama (agent/ollama/)
    • Easy to add: OpenAI, Azure OpenAI, custom APIs
  • Ticketing Systems: Implement ticketing.TicketingClient and create a TicketingService
  • VCS Providers: Implement repository.RepositoryClient and wrap in RepositoryService
  • Pipeline Steps: Add steps to processTicket or refactor into discrete handlers
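A skeleton for a new provider; the agent.Agent method set and CodeChange shape shown here are assumptions, so check internal/ai/agent for the real definitions:

    package openai // hypothetical new provider under internal/ai/agent/openai/

    import "context"

    // Assumed shapes; the real agent.Agent and agent.CodeChange may differ.
    type CodeChange struct {
        Path    string
        Content string
    }

    type Agent interface {
        PlanChanges(ctx context.Context, ticketDesc, repoContext string) ([]CodeChange, error)
    }

    type Provider struct{ apiKey, model string }

    func New(apiKey, model string) *Provider { return &Provider{apiKey: apiKey, model: model} }

    // PlanChanges would call the provider's API and map the response into
    // structured CodeChanges; left as a stub here.
    func (p *Provider) PlanChanges(ctx context.Context, ticketDesc, repoContext string) ([]CodeChange, error) {
        return nil, nil
    }

    var _ Agent = (*Provider)(nil) // compile-time interface check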

Supported AI Providers

    Provider          Type       Cost                Setup            Best For
    ----------------  ---------  ------------------  ---------------  ---------------------------------
    Anthropic Claude  Cloud API  $3-15 per M tokens  API key          Production, best quality
    Ollama            Local LLM  Free                Install + model  Development, privacy, high volume

See docs/OLLAMA_SETUP.md for Ollama installation and docs/MULTI_PROVIDER_PLAN.md for architecture details.

Testing

  • Unit tests for clients and orchestrator helpers are encouraged
  • For mocks, install mockgen (go install go.uber.org/mock/mockgen@latest) and generate interface mocks; an example directive follows
  • Run tests:
    go test ./...
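For example, a go:generate directive for a hypothetical ticketing interface file (paths and names are illustrative):

    //go:generate mockgen -source=client.go -destination=mocks/client_mock.go -package=mocks

Then regenerate and test:

    go generate ./...
    go test ./...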

Contribution Guidelines

See CONTRIBUTING.md for detailed guidelines on branching, coding style, commit messages, and PR checks.

Roadmap

  • Base branch auto-detection for checkout (currently only applied as a PR-creation fallback)
  • Configurable repo context limits (files/bytes) - Implemented with CONTEXT_MAX_FILES and smart selection
  • Intelligent file selection - Implemented with keyword extraction and file scoring
  • Multi-provider AI support - Implemented with Anthropic Claude and Ollama
  • Exponential backoff with jitter for all remote calls
  • Per-repo locking for concurrent ticket processing across the same repository
  • Additional AI providers (OpenAI, Azure OpenAI) and ticketing/VCS integrations
  • Incremental index updates (currently rebuilds entire index each time)
