GitForAI

Semantic Memory Infrastructure for AI Coding Assistants

Stop wasting tokens. Give your AI agents Git history context through vector embeddings and semantic search.

The Problem

AI coding assistants waste tokens and miss context:

15-20K tokens to answer simple questions about codebase history
No understanding of "why" code evolved the way it did
Hallucinations from missing historical context
High token costs for enterprise teams

The Solution

GitForAI transforms Git history into queryable semantic memory:

from gitforai import GitForAI

# Initialize with your repo
git_memory = GitForAI("/path/to/repo")

# Ask questions in natural language
results = git_memory.query("How does authentication work?")
# Returns relevant commits with 85% fewer tokens

# Track file evolution
history = git_memory.track_file("auth.py")
# See how a file changed over time

# Find similar changes
commit_hash = "abc123"  # replace with a commit hash from your repo
similar = git_memory.find_similar(commit_hash)
# Discover related work

Features

🎯 85% token reduction - Semantic search returns only relevant context
🔒 Privacy-first - Local embeddings, no API keys required
⚡ Fast - Sub-second semantic search with ChromaDB
🐳 Self-hostable - Docker setup included
🔌 Pluggable - Extensible architecture for custom integrations
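As a rough worked example of what the 85% figure means in practice, the sketch below applies it to the 15-20K token range quoted under The Problem. The function name is illustrative and not part of GitForAI; the calculation simply takes the README's own numbers at face value.

```python
def tokens_after_reduction(baseline_tokens: int, reduction_pct: int = 85) -> int:
    """Tokens remaining after cutting `reduction_pct` percent from a baseline."""
    if not 0 <= reduction_pct <= 100:
        raise ValueError("reduction_pct must be between 0 and 100")
    # Integer arithmetic avoids floating-point rounding surprises.
    return baseline_tokens * (100 - reduction_pct) // 100


for baseline in (15_000, 20_000):
    print(baseline, "->", tokens_after_reduction(baseline))
```

Under these assumptions, a question that currently costs 15,000 to 20,000 tokens would cost roughly 2,250 to 3,000.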

Quick Start

Installation

# Install from PyPI
pip install gitforai

# Or install from source
git clone https://github.com/git-for-ai/gitforai.git
cd gitforai
pip install -e .

Index Your Repository

# Index repository with local embeddings (zero cost, offline)
gitforai index /path/to/repo

# Search your codebase semantically
gitforai search "authentication bug fixes"

# Get detailed commit info
gitforai analyze abc123 --diffs
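If you want to drive these commands from a script, one option is to build the argument lists in Python and hand them to subprocess. This is a minimal sketch: the helper names are hypothetical, and only the subcommands and the --diffs flag shown above are assumed to exist.

```python
import subprocess
from typing import List


def index_cmd(repo_path: str) -> List[str]:
    """Argument list for `gitforai index <repo>`."""
    return ["gitforai", "index", repo_path]


def search_cmd(query: str) -> List[str]:
    """Argument list for `gitforai search <query>`."""
    return ["gitforai", "search", query]


def analyze_cmd(commit: str, diffs: bool = False) -> List[str]:
    """Argument list for `gitforai analyze <commit>`, optionally with --diffs."""
    cmd = ["gitforai", "analyze", commit]
    if diffs:
        cmd.append("--diffs")
    return cmd


def run(cmd: List[str]) -> str:
    """Run a command and return its stdout (needs the gitforai CLI on PATH)."""
    return subprocess.run(cmd, check=True, capture_output=True, text=True).stdout


# Printing the commands here rather than running them keeps the sketch side-effect free.
print(search_cmd("authentication bug fixes"))
print(analyze_cmd("abc123", diffs=True))
```

Passing arguments as a list rather than a single shell string means queries containing spaces or quotes need no extra escaping.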

Docker (Recommended)

# Pull and run
docker pull gitforai/core
docker run -v "$(pwd)":/repo gitforai/core index /repo

# Or use docker-compose
docker-compose up

How It Works

Extract - Parse Git commits, diffs, and file changes
Embed - Generate semantic embeddings (local, no API cost)
Index - Store in ChromaDB vector database
Query - Natural language search returns relevant context

Result: AI agents get exactly the context they need, without wasting tokens on irrelevant code.
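To make the four steps concrete, here is a toy, standard-library-only sketch of the same embed, index, and query flow. It is not GitForAI's implementation: word counts stand in for sentence-transformers embeddings, a plain dict stands in for ChromaDB, and the sample commits are invented. Word counts match on shared vocabulary only, so this shows the shape of the pipeline rather than true semantic search.

```python
import math
import re
from collections import Counter

# Hypothetical sample data standing in for the "Extract" step.
COMMITS = [
    {"hash": "a1", "message": "Fix login token expiry bug in authentication"},
    {"hash": "b2", "message": "Add dark mode toggle to settings page"},
    {"hash": "c3", "message": "Refactor authentication middleware for OAuth"},
]


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts (a stand-in for a real model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(commits):
    """Index step: map each commit hash to its embedding."""
    return {c["hash"]: embed(c["message"]) for c in commits}


def query(index, question: str, top_k: int = 2):
    """Query step: rank commits by similarity, dropping zero-score matches."""
    q = embed(question)
    scored = sorted(((cosine(q, vec), h) for h, vec in index.items()), reverse=True)
    return [h for score, h in scored[:top_k] if score > 0]


index = build_index(COMMITS)
print(query(index, "authentication bug"))
```

Only the top-ranked commits are returned, which is the mechanism behind the token savings: the assistant receives a handful of relevant commits instead of the full history.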

Platform Integrations

Platform-specific adapters are available separately.

Use Cases

For Individual Developers

Understand unfamiliar codebases quickly
Find relevant commits when debugging
Learn from code evolution patterns
Reduce AI assistant token costs

For Teams

Onboard new developers faster
Share institutional knowledge automatically
Improve code review quality
Standardize AI context across the team

For Enterprises

Reduce token costs by 85%
Self-host for data privacy
Integrate with existing AI tools
SOC2/GDPR compliant

Documentation

User Guide - Complete usage guide
API Reference - Python API documentation
Docker Guide - Self-hosting with Docker
Contributing - How to contribute

Architecture

Extraction - GitPython for repository parsing
Embeddings - sentence-transformers (local, free) or OpenAI (optional)
Vector DB - ChromaDB for semantic search
CLI - Typer for command-line interface

See CLAUDE.md for detailed architecture.

Development

# Clone and setup
git clone https://github.com/git-for-ai/gitforai.git
cd gitforai
python -m venv venv
source venv/bin/activate
pip install -e ".[dev]"

# Run tests
pytest

# Run with coverage
pytest --cov=src/gitforai --cov-report=html

# Format code
black src/ tests/

# Lint
ruff check src/ tests/

Configuration

Create a .env file:

# Embedding provider (default: local, zero cost)
EMBEDDING_PROVIDER=local  # or "openai" for maximum quality
EMBEDDING_MODEL=all-MiniLM-L6-v2  # 384 dims, 80MB, free

# Vector database
VECTORDB_PROVIDER=chroma
VECTORDB_PERSIST_DIR=~/.gitforai/vectordb

# Optional: OpenAI API key (only if using OpenAI embeddings)
OPENAI_API_KEY=your-key-here

Default settings use local embeddings:

Works offline
Zero API cost
No API keys required
~88% of OpenAI quality
Auto-downloads ~80MB model on first use
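The sketch below shows one way these settings and their defaults could be read in Python. It is an illustration only: the Settings class and load_settings helper are hypothetical and do not describe how GitForAI loads its configuration. The variable names and default values come from the example above, and the API key check is an assumption based on the key being needed only for OpenAI embeddings.

```python
import os
from dataclasses import dataclass
from pathlib import Path
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    embedding_provider: str
    embedding_model: str
    vectordb_provider: str
    vectordb_persist_dir: Path
    openai_api_key: Optional[str]


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Build Settings from environment variables (hypothetical helper)."""
    env = os.environ if env is None else env
    provider = env.get("EMBEDDING_PROVIDER", "local")
    api_key = env.get("OPENAI_API_KEY") or None
    # Assumption: a key is only needed when the OpenAI provider is selected.
    if provider == "openai" and api_key is None:
        raise ValueError("OPENAI_API_KEY is required when EMBEDDING_PROVIDER=openai")
    persist_dir = Path(env.get("VECTORDB_PERSIST_DIR", "~/.gitforai/vectordb"))
    return Settings(
        embedding_provider=provider,
        embedding_model=env.get("EMBEDDING_MODEL", "all-MiniLM-L6-v2"),
        vectordb_provider=env.get("VECTORDB_PROVIDER", "chroma"),
        vectordb_persist_dir=persist_dir.expanduser(),  # resolve the leading ~
        openai_api_key=api_key,
    )


settings = load_settings({})  # empty mapping -> all defaults
print(settings.embedding_provider, settings.embedding_model)
```

Passing a mapping instead of always reading os.environ keeps the helper easy to test without touching the real environment.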

Why Open Source?

We believe semantic Git history should be accessible to all developers. The core extraction, embedding, and indexing logic is open source (MIT license).

Commercial offerings:

Managed cloud hosting (Pro/Team tiers)
Platform-specific integrations
Enterprise features (SSO, RBAC, audit logs)
Professional support

See gitforai.com for commercial options.

Performance

Indexing: 1000 commits in <5 minutes
Query: Semantic search in <500ms
Accuracy: Relevant results in top 5 for 90% of queries
Cost: $0.00 with local embeddings (vs $0.10/1000 commits with OpenAI)
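For a sense of scale, the quoted OpenAI rate can be extrapolated to larger repositories. This sketch assumes cost grows linearly with commit count, which is a simplification: real cost would depend on how much text each commit contributes. The function name is illustrative.

```python
from decimal import Decimal

# Rate quoted in the list above: $0.10 per 1,000 commits with OpenAI embeddings.
OPENAI_COST_PER_1000_COMMITS = Decimal("0.10")


def estimated_openai_cost(num_commits: int) -> Decimal:
    """Estimated embedding cost in USD, scaling the quoted rate linearly."""
    if num_commits < 0:
        raise ValueError("num_commits must be non-negative")
    cost = OPENAI_COST_PER_1000_COMMITS * num_commits / 1000
    return cost.quantize(Decimal("0.01"))  # round to whole cents


for n in (1_000, 25_000, 100_000):
    print(f"{n:>7} commits: ${estimated_openai_cost(n)}")
```

At this rate, even a 100,000-commit history would come to about $10.00 with OpenAI embeddings, against $0.00 with the local default.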

License

MIT License - See LICENSE file for details.

Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Support

Issues: GitHub Issues
Discussions: GitHub Discussions
Email: support@gitforai.com
Docs: docs.gitforai.com

Roadmap

Community

⭐ Star this repo if you find it useful
🐛 Report bugs via GitHub Issues
💡 Request features via GitHub Discussions
🤝 Contribute - see CONTRIBUTING.md

Built with ❤️ by the GitForAI team

Semantic memory for the AI coding revolution
