A sophisticated Retrieval-Augmented Generation (RAG) chatbot built with Node.js and Express, featuring real-time document processing, semantic search, and multiple LLM provider support.
Aaryan Choudhary
- Multi-Format Document Support: PDF, DOCX, TXT, CSV, HTML, JSON
- Advanced Text Processing: Intelligent chunking with overlap and semantic boundaries (see the sketch after this list)
- Real-time Communication: WebSocket-based streaming responses
- Multiple LLM Providers: OpenAI GPT, Google Gemini, Anthropic Claude, Cohere, HuggingFace
- Semantic Search: Vector-based document retrieval with cosine similarity
- Professional UI: Clean, responsive interface with drag-and-drop file upload
- Configurable Settings: Adjustable similarity thresholds and result limits
- Source Attribution: Transparent citation of information sources
- Rate Limiting: Built-in protection against abuse
- Comprehensive Logging: Detailed system monitoring and debugging
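To illustrate the chunking approach listed above, here is a simplified character-based splitter with overlap. The sizes follow the technical specifications later in this README (500-1000 characters with 100-character overlap); the real `documentProcessor.js` additionally snaps chunks to semantic boundaries, which this sketch omits:

```javascript
// Simplified sketch of fixed-size chunking with overlap; illustrative only,
// not the actual documentProcessor.js implementation.
function chunkText(text, chunkSize = 1000, overlap = 100) {
  const chunks = [];
  for (let start = 0; start < text.length; start += chunkSize - overlap) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // last chunk covers the tail
  }
  return chunks;
}
```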
- Backend: Node.js, Express.js
- Real-time: WebSocket (ws)
- Document Processing: pdf-parse, mammoth, csv-parser
- Vector Operations: Custom implementation with cosine similarity (see the sketch after this list)
- Security: Helmet, CORS, rate limiting
- Frontend: Vanilla JavaScript, TailwindCSS
- Architecture: Modular service-based design
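The cosine-similarity core of the vector search fits in a few lines; this is an illustrative version, not necessarily the exact code in `vectorStore.js`:

```javascript
// Cosine similarity between two equal-length embedding vectors:
// dot(a, b) / (|a| * |b|), in the range [-1, 1].
function cosineSimilarity(a, b) {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB) || 1); // guard against zero vectors
}
```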
```
rag-qa-chatbot/
├── src/
│   ├── services/
│   │   ├── index.js               # Service initialization
│   │   ├── documentProcessor.js   # Document parsing and chunking
│   │   ├── embeddingService.js    # Text embedding generation
│   │   ├── vectorStore.js         # Vector storage and similarity search
│   │   ├── llmService.js          # LLM provider integration
│   │   └── retrievalService.js    # RAG pipeline orchestration
│   ├── routes/
│   │   ├── index.js               # Route registration
│   │   ├── documents.js           # Document management endpoints
│   │   ├── chat.js                # Chat and query endpoints
│   │   └── admin.js               # Administrative endpoints
│   ├── middleware/
│   │   ├── rateLimiter.js         # Rate limiting middleware
│   │   ├── errorHandler.js        # Global error handling
│   │   └── auth.js                # Authentication middleware
│   ├── websocket/
│   │   └── chatSocket.js          # WebSocket message handling
│   └── utils/
│       ├── logger.js              # Logging utilities
│       └── textProcessor.js       # Text processing utilities
├── public/
│   ├── index.html                 # Main application interface
│   ├── css/
│   │   └── styles.css             # Application styling
│   └── js/
│       └── app.js                 # Frontend application logic
├── uploads/                       # Document storage directory
├── .env.example                   # Environment configuration template
├── .gitignore                     # Git ignore rules
├── package.json                   # Project dependencies and scripts
├── server.js                      # Main application entry point
└── README.md                      # Project documentation
```
- Node.js (v18 or higher)
- npm or yarn package manager
1. Clone the repository

   ```bash
   git clone https://github.com/yourusername/rag-qa-chatbot.git
   cd rag-qa-chatbot
   ```

2. Install dependencies

   ```bash
   npm install
   ```

3. Configure environment variables

   ```bash
   cp .env.example .env
   ```

   Edit the `.env` file with your API keys:

   ```bash
   # Required for enhanced features
   OPENAI_API_KEY=your_openai_api_key_here
   GOOGLE_API_KEY=your_google_api_key_here
   ANTHROPIC_API_KEY=your_anthropic_api_key_here
   COHERE_API_KEY=your_cohere_api_key_here
   HUGGINGFACE_API_KEY=your_huggingface_api_key_here

   # Server configuration
   PORT=3000
   NODE_ENV=development

   # Security
   RATE_LIMIT_WINDOW_MS=900000
   RATE_LIMIT_MAX_REQUESTS=100
   ```

4. Start the application

   ```bash
   npm start
   ```

5. Access the application: open your browser and navigate to http://localhost:3000
For a minimal installation:

```bash
npm run install-minimal
npm start
```

For development with automatic reloading:

```bash
npm install -g nodemon
npm run dev
```

- Via Web Interface: Drag and drop files onto the upload zone or click to browse
- Supported Formats: PDF, DOCX, TXT, CSV, HTML, JSON files
- Automatic Processing: Documents are automatically chunked and indexed
- Natural Language Queries: Ask questions in plain English
- Context-Aware Responses: Answers include source citations
- Real-time Streaming: Watch responses generate in real-time
- Follow-up Questions: Maintain conversation context
Access the settings panel to adjust:
- Retrieval Count: Number of relevant chunks to retrieve (1-20)
- Similarity Threshold: Minimum relevance score (0.0-1.0)
- Streaming Mode: Enable/disable real-time response streaming
- LLM Provider: Switch between different AI models
Document endpoints:

- `POST /api/documents/upload` - Upload and process documents
- `GET /api/documents/` - List all processed documents
- `DELETE /api/documents/:id` - Remove a document and its chunks

Chat endpoints (see the client example after this list):

- `POST /api/chat/query` - Send a query and receive a response
- `POST /api/chat/stream` - Streaming query endpoint
- `POST /api/chat/related` - Get suggested follow-up questions

Admin endpoints:

- `POST /api/admin/llm/provider` - Change the LLM provider
- `GET /api/admin/stats` - System statistics
- `POST /api/admin/clear` - Clear all documents and chat history
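As a quick illustration, the query endpoint can be called from any JavaScript client. This sketch reuses the request body shown in the testing section below; the exact response shape depends on the server implementation:

```javascript
// Minimal client for the query endpoint (Node 18+ ships a global fetch).
async function askQuestion(message) {
  const response = await fetch('http://localhost:3000/api/chat/query', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ message }),
  });
  if (!response.ok) throw new Error(`Request failed: ${response.status}`);
  return response.json();
}

askQuestion('What is this document about?').then(console.log).catch(console.error);
```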
Example chat request:

```json
{
  "type": "chat",
  "data": {
    "message": "What is the main topic?",
    "sessionId": "session-id",
    "options": {
      "topK": 5,
      "threshold": 0.7
    }
  }
}
```

Example chat response:

```json
{
  "type": "chat_response",
  "data": {
    "message": "Based on the documents...",
    "sources": [
      {
        "filename": "document.pdf",
        "preview": "relevant text excerpt",
        "similarity": 0.85
      }
    ]
  }
}
```
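For illustration, a browser-side client could exchange these messages as follows; the WebSocket URL is an assumption and should be adjusted to the actual server setup:

```javascript
// Sketch of a browser client using the message shapes shown above.
const socket = new WebSocket('ws://localhost:3000'); // URL is an assumption

socket.addEventListener('open', () => {
  socket.send(JSON.stringify({
    type: 'chat',
    data: {
      message: 'What is the main topic?',
      sessionId: 'session-id',
      options: { topK: 5, threshold: 0.7 },
    },
  }));
});

socket.addEventListener('message', (event) => {
  const { type, data } = JSON.parse(event.data);
  if (type === 'chat_response') {
    console.log(data.message, data.sources);
  }
});
```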
| Variable | Description | Default | Required |
|---|---|---|---|
| `PORT` | Server port | 3000 | No |
| `NODE_ENV` | Environment mode | development | No |
| `OPENAI_API_KEY` | OpenAI API key | - | Optional |
| `GOOGLE_API_KEY` | Google Gemini API key | - | Optional |
| `ANTHROPIC_API_KEY` | Anthropic Claude API key | - | Optional |
| `COHERE_API_KEY` | Cohere API key | - | Optional |
| `HUGGINGFACE_API_KEY` | HuggingFace API key | - | Optional |
- Maximum File Size: 50MB per document
- Chunk Size: 500-1000 characters with 100 character overlap
- Vector Dimensions: 384 (sentence-transformers compatible)
- Maximum Documents: No hard limit (memory dependent)
- Supported Languages: Multi-language support via LLM providers
The application follows a modular, service-oriented architecture; a sketch of the request flow appears after this list:
- Service Layer: Core business logic and data processing
- Route Layer: HTTP endpoint handling and validation
- WebSocket Layer: Real-time communication management
- Middleware Layer: Cross-cutting concerns (auth, logging, rate limiting)
- Frontend Layer: User interface and client-side logic
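To make the flow concrete, here is a hedged sketch of how the retrieval service might tie the layers together. The method names (`embed`, `search`, `generate`) and the shape of the returned chunks are assumptions for the sketch, not the actual `retrievalService.js` API:

```javascript
// Illustrative RAG pipeline wiring; service method names are assumed.
const embeddingService = require('./src/services/embeddingService');
const vectorStore = require('./src/services/vectorStore');
const llmService = require('./src/services/llmService');

async function answerQuery(query, { topK = 5, threshold = 0.7 } = {}) {
  // 1. Embed the user query into the same vector space as the document chunks.
  const queryVector = await embeddingService.embed(query);
  // 2. Retrieve the most similar chunks above the similarity threshold.
  const chunks = await vectorStore.search(queryVector, { topK, threshold });
  // 3. Build a context block and ask the LLM to answer from it.
  const context = chunks.map((c) => c.text).join('\n---\n');
  const answer = await llmService.generate(
    `Answer using only this context:\n${context}\n\nQuestion: ${query}`
  );
  return { answer, sources: chunks };
}
```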
- New Document Format: Extend the `documentProcessor.js` service (see the sketch after this list)
- New LLM Provider: Add the integration to `llmService.js`
- Custom Embedding: Modify `embeddingService.js`
- UI Enhancements: Update the files in the `public/` directory
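For example, supporting a new format could look roughly like this. How parsers are registered inside `documentProcessor.js` is an assumption of this sketch:

```javascript
// Hypothetical parser for a new format (Markdown). The registration
// mechanism shown in the comment is assumed, not the actual internals.
const fs = require('fs/promises');

async function parseMarkdown(filePath) {
  const raw = await fs.readFile(filePath, 'utf8');
  // Strip common Markdown syntax so only plain text is chunked.
  return raw.replace(/[#*_`>]/g, '');
}

// e.g. documentProcessor might map file extensions to parser functions:
// parsers['.md'] = parseMarkdown;
```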
```bash
# Run basic functionality test
npm test

# Test document upload
curl -X POST -F "document=@test.pdf" http://localhost:3000/api/documents/upload

# Test query endpoint
curl -X POST -H "Content-Type: application/json" \
  -d '{"message":"What is this document about?"}' \
  http://localhost:3000/api/chat/query
```

Run in production mode:

```bash
NODE_ENV=production npm start
```

A sample Dockerfile for container deployment:

```dockerfile
FROM node:18-alpine
WORKDIR /app
COPY package*.json ./
RUN npm ci --only=production
COPY . .
EXPOSE 3000
CMD ["npm", "start"]
```

The application can be deployed to any Node.js hosting platform:
- Heroku
- Vercel
- Railway
- Digital Ocean
- AWS Elastic Beanstalk
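For the Docker route shown above, the image can be built and run with `docker build -t rag-qa-chatbot .` and `docker run -p 3000:3000 --env-file .env rag-qa-chatbot` (the image name is arbitrary).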
- Port Already in Use (`Error: listen EADDRINUSE :::3000`): Change `PORT` in `.env` or kill the process using port 3000
- Large File Upload Fails (`Error: File too large`): Adjust the file size limits in `server.js`
- API Key Errors (`Error: Unauthorized - Invalid API key`): Verify the API keys in your `.env` file
- Memory Issues with Large Documents (`Error: JavaScript heap out of memory`): Increase the Node.js memory limit (e.g. `node --max-old-space-size=4096 server.js`) or process documents in smaller chunks
- Enable compression middleware for faster response times (see the sketch after this list)
- Implement document caching for frequently accessed files
- Use connection pooling for database operations
- Configure appropriate rate limiting based on usage patterns
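The first and last items above map directly onto standard Express middleware. A minimal sketch, assuming the `compression` and `express-rate-limit` packages and the defaults from `.env.example` (the project's own `rateLimiter.js` may differ):

```javascript
const express = require('express');
const compression = require('compression');
const rateLimit = require('express-rate-limit');

const app = express();
app.use(compression()); // gzip responses for faster transfers
app.use(rateLimit({
  windowMs: Number(process.env.RATE_LIMIT_WINDOW_MS) || 900000, // 15-minute window
  max: Number(process.env.RATE_LIMIT_MAX_REQUESTS) || 100,      // requests per IP per window
}));

app.listen(process.env.PORT || 3000);
```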
- Fork the repository
- Create a feature branch (`git checkout -b feature/new-feature`)
- Commit your changes (`git commit -am 'Add new feature'`)
- Push to the branch (`git push origin feature/new-feature`)
- Create a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Built with modern Node.js and Express.js
- Vector similarity search implementation
- Multiple LLM provider integrations
- Professional UI design with TailwindCSS
For questions, issues, or feature requests, please open an issue on GitHub or contact the developer.
Aaryan Choudhary - Professional RAG Q&A Chatbot System