thebinij/RAG-project-learning
RAG Project Learning - FastAPI AI Assistant with Vector Search

A powerful Retrieval-Augmented Generation (RAG) system built with FastAPI and open-source technologies, featuring an interactive chat interface that can answer questions based on your document knowledge base.

🚀 Features

  • FastAPI Backend: Modern, fast, async-first web framework with automatic API documentation
  • Interactive Chat Interface: Web-based chat UI with streaming responses
  • Vector Database: ChromaDB for efficient semantic search
  • Document Processing: Automatic chunking and embedding of documents
  • Semantic Search: Sentence transformers for intelligent document retrieval
  • API Documentation: Interactive Swagger UI and ReDoc with configurable endpoints
  • Production Ready: Structured logging, health checks, Docker deployment
  • Open Source: Built entirely with open-source technologies to avoid vendor lock-in
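Under the hood, semantic search ranks stored document embeddings by their similarity to the query embedding. A minimal sketch of that ranking step, using toy vectors in place of real sentence-transformers output (vector values and function names here are illustrative, not the project's actual API):

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def retrieve(query_vec, doc_vecs, top_k=2):
    """Return indices of the top_k document vectors most similar to the query."""
    ranked = sorted(range(len(doc_vecs)),
                    key=lambda i: cosine(query_vec, doc_vecs[i]),
                    reverse=True)
    return ranked[:top_k]

# Toy 3-dimensional "embeddings" standing in for sentence-transformers output.
docs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.9, 0.1, 0.0]]
query = [1.0, 0.05, 0.0]
print(retrieve(query, docs))  # → [0, 2]: the two documents closest to the query
```

In the real system, ChromaDB performs this ranking over persisted embeddings; the retrieved chunks are then passed to the LLM as context.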

๐Ÿ—๏ธ Project Structure

rag-project-learning/
├── app/                          # Main FastAPI package
│   ├── __init__.py               # Package initialization
│   ├── app.py                    # FastAPI application definition
│   ├── api/                      # API routers
│   │   ├── v1/                   # API version 1
│   │   │   └── chat.py           # Chat endpoints
│   │   └── visualizer/           # Visualizer endpoints
│   │       └── routes.py
│   ├── core/                     # Core configuration and engines
│   │   ├── config.py             # Settings management
│   │   ├── logging.py            # Logging configuration
│   │   └── engines/              # Core engine modules
│   │       ├── __init__.py
│   │       ├── vector_engine.py  # Vector database operations
│   │       ├── chat_engine.py    # RAG chat engine
│   │       └── document_processor.py # Document processing
│   ├── schemas/                  # Pydantic models
│   │   ├── chat.py               # Chat schemas
│   │   └── visualizer.py         # Visualizer schemas
│   ├── services/                 # Business logic layer
│   │   ├── vector_service.py     # Vector database operations
│   │   ├── chat_service.py       # Chat/RAG operations
│   │   └── document_service.py   # Document processing
│   └── scripts/                  # Utility scripts
│       ├── ingest_documents.py   # Document ingestion script
│       └── init_vectordb.py      # Vector database initialization
├── static/                       # Frontend assets (CSS, JavaScript)
│   ├── css/
│   └── js/
├── templates/                    # HTML templates
├── data/                         # Data storage
│   ├── knowledge-docs/           # Document storage
│   ├── vector_db/                # Vector database
│   └── costs/                    # Cost tracking database
├── chroma_db/                    # Vector database storage (auto-generated, gitignored)
├── main.py                       # Entry point (python main.py)
├── start_production.py           # Production entry point
├── Pipfile                       # Python dependencies
├── Pipfile.lock                  # Locked dependency versions
├── Dockerfile                    # Production Docker image
├── docker-compose.yml            # Docker Compose for deployment
├── .dockerignore                 # Docker build exclusions
└── README.md                     # Project documentation

๐Ÿ› ๏ธ Prerequisites

  • Python 3.12 (as specified in Pipfile)
  • pipenv for dependency management
  • Docker and Docker Compose for deployment (optional)
  • Git for version control

📦 Installation & Setup

1. Clone the Repository

git clone <your-repo-url>
cd rag-project-learning

2. Install pipenv (if not already installed)

pip install pipenv

3. Install Dependencies

pipenv install

This will install all required packages:

  • fastapi - Modern, fast web framework
  • uvicorn[standard] - ASGI server with production features
  • chromadb - Vector database for embeddings
  • sentence-transformers - Text embedding models
  • openai - OpenAI API integration
  • structlog - Structured logging
  • rich - Rich console output
  • pydantic - Data validation and settings management

4. Activate Virtual Environment

pipenv shell

🚀 Running the Application

Development Mode

# Run with auto-reload
python main.py

# Or run as a module
python -m app

# Or use uvicorn directly
uvicorn app.app:app --reload --host 0.0.0.0 --port 5252

Production Mode

# Use production startup script
python start_production.py

# Or run production module directly
python -m app.production

# Or use uvicorn with production settings
uvicorn app.app:app --host 0.0.0.0 --port 5252 --workers 1 --log-level info

Docker Deployment

# Build and run with Docker Compose
docker-compose up --build

# Or build and run manually
docker build -t legendarycorp-ai-assistant .
docker run -p 5252:5252 -e OPENAI_API_KEY=your_key legendarycorp-ai-assistant

🧹 Project Organization

The project follows a clean, modular FastAPI structure:

  • app/ - Main application package (Python code only)
    • app/app.py - FastAPI application definition
    • app/api/ - API endpoints and routers
    • app/core/ - Configuration, logging, and core engines
    • app/services/ - Business logic layer
    • app/schemas/ - Pydantic data models
    • app/scripts/ - Utility scripts for document processing
  • static/ - Frontend assets (CSS, JavaScript)
  • templates/ - HTML templates

Running Scripts

# Document ingestion
cd app/scripts
python ingest_documents.py

# Vector database initialization
python init_vectordb.py
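ingest_documents.py's exact logic isn't reproduced here, but the chunking step named in the Features section can be sketched as a simple overlapping splitter (function name and chunk sizes are illustrative, not the project's actual parameters):

```python
def chunk_text(text, chunk_size=200, overlap=50):
    """Split text into overlapping character chunks.

    The overlap keeps sentences that straddle a chunk boundary
    retrievable from at least one chunk.
    """
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break
    return chunks

sample = "x" * 500
print(len(chunk_text(sample)))  # → 3 chunks of up to 200 chars, 50-char overlap
```

Each chunk would then be embedded and stored in ChromaDB so retrieval can return passage-sized context rather than whole documents.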

🔧 Configuration

Environment Variables

Create a .env file in the project root by copying the example file:

cp .env.example .env

Then edit the .env file and update the values according to your environment. The most important variables to set are:

  • OPENAI_API_KEY - Your OpenAI API key (required for chat functionality)
  • DEBUG - Set to false in production
  • LOG_LEVEL - Set to INFO or WARNING in production

Note: Never commit .env files to version control. See .env.example for the complete list of available environment variables.
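Settings are managed in app/core/config.py; its exact contents aren't shown here, but the pattern of reading these variables can be sketched with the standard library (field names mirror the .env variables above; the class itself is illustrative):

```python
import os
from dataclasses import dataclass, field

@dataclass
class Settings:
    """Minimal environment-driven settings reader (a sketch, not the
    project's actual config class)."""
    openai_api_key: str = field(
        default_factory=lambda: os.environ.get("OPENAI_API_KEY", ""))
    debug: bool = field(
        default_factory=lambda: os.environ.get("DEBUG", "false").lower() == "true")
    log_level: str = field(
        default_factory=lambda: os.environ.get("LOG_LEVEL", "INFO"))

os.environ["DEBUG"] = "false"  # simulate a production .env value
settings = Settings()
print(settings.debug, settings.log_level)
```

Reading every value through one settings object keeps environment handling in a single place, which is what makes the production/development switch below possible.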

Production Settings

The application automatically detects production vs development environments:

  • Development: Auto-reload, debug logging, single worker
  • Production: No reload, structured logging, multiple workers, health checks

๐ŸŒ Application URLs

Once running, access the application at:

  • Chat Interface: http://localhost:5252/ - Main AI chat interface
  • API Documentation: http://localhost:5252/redoc - Interactive API docs (ReDoc) ⚙️
  • Health Check: http://localhost:5252/health - Application health status
  • ChromaDB Visualizer: http://localhost:5252/visualizer - Database visualization dashboard

โš™๏ธ Configurable: ReDoc is enabled by default and can be disabled via ENABLE_REDOC environment variable

๐Ÿณ Docker Deployment

Quick Start

# Start with Docker Compose
docker-compose up --build

# View logs
docker-compose logs -f ai-assistant

# Stop services
docker-compose down

Production Deployment

# Build production image
docker build -t legendarycorp-ai-assistant:latest .

# Run with production settings
docker run -d \
  --name ai-assistant \
  -p 5252:5252 \
  -e LOG_LEVEL=INFO \
  -e CHROMA_DB_VISUALIZER=true \
  -e OPENAI_API_KEY=your_key \
  -v $(pwd)/chroma_db:/app/chroma_db \
  -v $(pwd)/data/knowledge-docs:/app/data/knowledge-docs \
  legendarycorp-ai-assistant:latest

Docker Features

  • Multi-stage build for optimized image size
  • Non-root user for security
  • Health checks for monitoring
  • Volume mounting for persistent data
  • Environment variable configuration
  • Production-ready uvicorn settings

📊 Monitoring & Logging

Structured Logging

The application uses structlog for production-grade logging:

import structlog

logger = structlog.get_logger()
logger.info("Application started", port=5252, environment="production")

Health Checks

# Check application health
curl http://localhost:5252/health

# Response
{
  "status": "healthy",
  "timestamp": "2024-01-01T12:00:00Z"
}
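A handler producing the response body above can be sketched as follows (the actual /health implementation may differ; the function name is illustrative):

```python
from datetime import datetime, timezone

def health_payload():
    """Build the health-check response body shown above, with a
    UTC timestamp in ISO-8601 'Z' form."""
    return {
        "status": "healthy",
        "timestamp": datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ"),
    }

print(health_payload()["status"])  # → healthy
```

Docker's HEALTHCHECK and external monitors both poll this endpoint, so it should stay fast and dependency-free.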

Metrics

  • Request/response times
  • Error rates
  • Memory usage
  • ChromaDB statistics
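A minimal in-process way to track the request counts, error rates, and latencies listed above (a sketch only; the project's actual metrics collection is not shown in this README):

```python
import time
from collections import defaultdict

class Metrics:
    """Tiny per-endpoint metrics store: request counts, error counts,
    and cumulative latency."""

    def __init__(self):
        self.requests = defaultdict(int)
        self.errors = defaultdict(int)
        self.latency = defaultdict(float)

    def record(self, endpoint, seconds, ok=True):
        self.requests[endpoint] += 1
        self.latency[endpoint] += seconds
        if not ok:
            self.errors[endpoint] += 1

    def avg_latency(self, endpoint):
        n = self.requests[endpoint]
        return self.latency[endpoint] / n if n else 0.0

metrics = Metrics()
metrics.record("/api/v1/chat", 0.1)
metrics.record("/api/v1/chat", 0.3)
print(metrics.avg_latency("/api/v1/chat"))
```

In production you would typically export such counters to a scraper (e.g. the Prometheus integration listed under Future Enhancements) instead of keeping them in process memory.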

🧪 Testing

Run Tests

# Install dev dependencies
pipenv install --dev

# Run all tests
pytest

# Run with coverage
pytest --cov=app --cov-report=html

API Testing

# Test chat endpoint
curl -X POST "http://localhost:5252/api/v1/chat" \
  -H "Content-Type: application/json" \
  -d '{"message": "What is the company policy on remote work?"}'

# Test streaming endpoint
curl -X POST "http://localhost:5252/api/v1/chat/stream" \
  -H "Content-Type: application/json" \
  -d '{"message": "Tell me about employee benefits"}'

🚀 Performance & Scaling

Async by Default

FastAPI provides excellent performance with async/await:

  • Concurrent requests handling
  • Non-blocking I/O operations
  • Efficient streaming responses
  • WebSocket support for real-time features
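The streaming responses mentioned above rest on async generators: the server yields tokens as they arrive instead of buffering the whole answer. A self-contained asyncio sketch of that pattern (no FastAPI required; names and delay are illustrative):

```python
import asyncio

async def stream_tokens(answer, delay=0.0):
    """Yield an answer token by token, as a streaming chat endpoint might."""
    for token in answer.split():
        await asyncio.sleep(delay)  # stand-in for per-token model latency
        yield token

async def collect(answer):
    """Gather the streamed tokens into a list (a client's view of the stream)."""
    return [tok async for tok in stream_tokens(answer)]

print(asyncio.run(collect("streaming keeps the event loop free")))
```

Because each `await` yields control back to the event loop, one worker can interleave many in-flight chat streams, which is where FastAPI's concurrency advantage comes from.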

Production Optimizations

  • Multiple workers with uvicorn
  • Connection pooling for databases
  • Caching strategies for embeddings
  • Load balancing ready

Scaling Options

# Scale with multiple workers
uvicorn app.app:app --host 0.0.0.0 --port 5252 --workers 4

# Use Gunicorn for more control
gunicorn app.app:app -w 4 -k uvicorn.workers.UvicornWorker --bind 0.0.0.0:5252

🔒 Security Features

  • CORS middleware configuration
  • Input validation with Pydantic
  • Rate limiting ready
  • Authentication ready (can be added)
  • HTTPS support

📈 Production Checklist

  • Set DEBUG=false
  • Configure LOG_LEVEL=INFO or higher
  • Set proper CORS_ORIGINS
  • Use production database
  • Docker: Use production Dockerfile
  • Monitoring: Enable health checks
  • Logging: Configure structured logging
  • Security: Review CORS and authentication

๐Ÿ› Troubleshooting

Common Issues

  1. Port conflicts: Change port in .env or Docker configuration
  2. Memory issues: Reduce worker count or increase container memory
  3. ChromaDB errors: Check volume permissions and database initialization
  4. Logging issues: Verify LOG_LEVEL environment variable

Debug Mode

# Enable debug logging
export LOG_LEVEL=DEBUG
python main.py

Docker Debugging

# View container logs
docker-compose logs -f ai-assistant

# Access container shell
docker-compose exec ai-assistant bash

# Check container health
docker inspect legendarycorp-ai-assistant

🔮 Future Enhancements

  • Authentication & Authorization
  • Rate Limiting
  • Redis Caching
  • Database Migrations
  • Kubernetes Deployment
  • Prometheus Metrics
  • Grafana Dashboards
  • CI/CD Pipeline

📄 License

This project is open source. Please check the license file for specific terms.

๐Ÿค Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Add tests
  5. Submit a pull request

📞 Support

For issues and questions:

  1. Check the troubleshooting section
  2. Review existing issues
  3. Create a new issue with detailed information

Built with ❤️ using FastAPI and open-source technologies
