Credit Risk Analyzer 🎯

An AI-powered credit risk assessment system with explainable predictions using machine learning and SHAP values.

🚀 Quick Start

Your application is already running!

Frontend: http://localhost:8082
Backend API: http://localhost:8000
API Docs: http://localhost:8000/docs

Open http://localhost:8082 in your browser to start analyzing credit risk!

✨ Features

Backend

✅ Machine Learning Model: Gradient Boosted Trees (LightGBM/XGBoost/CatBoost)
✅ Explainable AI: SHAP values for feature importance
✅ RESTful API: FastAPI with automatic documentation
✅ Data Pipeline: Automated preprocessing and feature engineering
✅ Model Evaluation: Comprehensive metrics and visualizations
✅ Class Imbalance Handling: Proper handling of imbalanced datasets
✅ CORS Support: Ready for frontend integration

Frontend

✅ Modern UI: React with TypeScript and Tailwind CSS
✅ Real-time Validation: Form validation with immediate feedback
✅ Risk Visualization: Clear display of risk levels and factors
✅ Explainable Results: Human-readable explanations for predictions
✅ Responsive Design: Works on desktop and mobile
✅ Error Handling: Graceful error messages and loading states

🏗 Architecture

┌─────────────────┐      HTTP/REST      ┌─────────────────┐
│                 │ ←─────────────────→ │                 │
│  React Frontend │                     │  FastAPI Backend│
│  (Port 8082)    │                     │  (Port 8000)    │
│                 │                     │                 │
└─────────────────┘                     └────────┬────────┘
                                                 │
                                                 │ Loads
                                                 ↓
                                        ┌─────────────────┐
                                        │ ML Model +      │
                                        │ SHAP Explainer  │
                                        │ Artifacts       │
                                        └─────────────────┘

Tech Stack

Backend:

Python 3.8+
FastAPI (Web framework)
LightGBM/XGBoost/CatBoost (ML models)
SHAP (Explainability)
Pandas, NumPy (Data processing)
Scikit-learn (Preprocessing)

Frontend:

React 18+
TypeScript
Tailwind CSS
Vite (Build tool)
Shadcn/ui (UI components)

🎯 Getting Started

Prerequisites

Python 3.8+
Node.js 16+
npm or yarn

Installation

Clone the repository

cd /path/to/CreditScore

Install Backend Dependencies

cd backend
pip install -r requirements.txt

Install Frontend Dependencies

npm install

Training the Model

python train_model.py

This will:

Load and preprocess the training data
Train multiple model variants (LightGBM, XGBoost, CatBoost)
Evaluate and select the best model
Save model artifacts to backend/artifacts/
Generate evaluation plots

Running the Application

Terminal 1 - Backend:

cd backend
python -m uvicorn app.main:app --host 0.0.0.0 --port 8000 --reload

Terminal 2 - Frontend:

npm run dev

Access the application at http://localhost:8082

💡 Usage

Web Interface

Navigate to http://localhost:8082
Fill in the credit assessment form with applicant details
Click "Analyze Credit Risk"
View the prediction results:
- Default probability (0-100%)
- Risk label (LOW/MEDIUM/HIGH)
- Top risk factors with explanations

API Usage

Health Check:

curl http://localhost:8000/api/health

Predict Credit Risk:

curl -X POST http://localhost:8000/api/predict \
  -H "Content-Type: application/json" \
  -d '{
    "age": 35,
    "annual_income": 60000,
    "debt_to_income_ratio": 0.45,
    "revolving_utilization": 0.6,
    "open_credit_lines": 5,
    "delinquencies_2yrs": 2,
    "dependents": 1,
    "fico_score": 720,
    "loan_amount": 25000,
    "employment_length": 5
  }'

Get Model Schema:

curl http://localhost:8000/api/schema

📚 API Documentation

Interactive API documentation is available at:

Swagger UI: http://localhost:8000/docs
ReDoc: http://localhost:8000/redoc

Endpoints

GET /api/health

Returns the health status of the API and model loading status.

Response:

{
  "status": "ok",
  "model_loaded": true,
  "version": "1.0.0"
}

POST /api/predict

Predicts credit default probability for an applicant.

Request Body:

{
  "age": 35,
  "annual_income": 60000,
  "debt_to_income_ratio": 0.45,
  "revolving_utilization": 0.6,
  "open_credit_lines": 5,
  "delinquencies_2yrs": 2,
  "dependents": 1,
  "fico_score": 720,
  "loan_amount": 25000,
  "employment_length": 5
}

Response:

{
  "default_probability": 0.665,
  "risk_label": "HIGH",
  "top_factors": [
    {
      "feature": "loan_to_income_ratio",
      "impact": 0.471,
      "direction": "increases_risk",
      "human_readable_reason": "Higher loan-to-income ratio increases default risk"
    }
  ],
  "model_version": "1.0"
}

GET /api/schema

Returns model metadata and feature definitions.

🤖 Model Information

Training Data

Source: loan_processed_data.csv
Target: Binary classification (default vs. non-default)
Features: 14 features including FICO score, income, DTI, utilization, etc.

Model Type

Gradient Boosted Trees (best of LightGBM, XGBoost, CatBoost selected based on validation performance)

Features Used

FICO Score (300-850)
Annual Income ($)
Debt-to-Income Ratio (0-1)
Revolving Utilization (0-1)
Open Credit Lines (count)
Delinquencies in Last 2 Years (count)
Loan Amount ($)
Employment Length (years)
Age (years)
Dependents (count)
Monthly Income (derived)
Loan-to-Income Ratio (derived)
High Utilization Flag (derived)
Term Length (months)

Risk Thresholds

LOW: < 33% default probability
MEDIUM: 33% - 66% default probability
HIGH: ≥ 66% default probability

Model Performance

Check backend/artifacts/model_evaluation_results.csv for detailed metrics including:

ROC AUC
Precision
Recall
F1 Score
KS Statistic

🧪 Testing

Test Cases

See TESTING_RESULTS.md for comprehensive test results.

Example Test Cases:

Low Risk:

{
  "age": 45, "annual_income": 85000, "debt_to_income_ratio": 0.25,
  "revolving_utilization": 0.30, "open_credit_lines": 8,
  "delinquencies_2yrs": 0, "dependents": 2, "fico_score": 780,
  "loan_amount": 15000, "employment_length": 10
}

Expected: ~23% default probability, LOW risk

High Risk:

{
  "age": 28, "annual_income": 35000, "debt_to_income_ratio": 0.65,
  "revolving_utilization": 0.95, "open_credit_lines": 3,
  "delinquencies_2yrs": 4, "dependents": 0, "fico_score": 580,
  "loan_amount": 30000, "employment_length": 1
}

Expected: ~82% default probability, HIGH risk

🚀 Deployment

Production Checklist

Before deploying to production:

Deployment Options

Backend:

AWS EC2 / Lambda
Google Cloud Run
Azure App Service
Heroku
DigitalOcean

Frontend:

Vercel
Netlify
AWS S3 + CloudFront
GitHub Pages

📁 Project Structure

CreditScore/
├── backend/                    # Backend API
│   ├── app/
│   │   ├── main.py            # FastAPI application
│   │   ├── config.py          # Configuration
│   │   ├── schemas.py         # Pydantic models
│   │   ├── preprocessing.py   # Data preprocessing
│   │   ├── inference.py       # Model inference & SHAP
│   │   └── utils.py           # Utilities
│   ├── training/
│   │   ├── data_loader.py     # Data loading
│   │   ├── train_model.py     # Model training
│   │   └── evaluate_model.py  # Model evaluation
│   ├── artifacts/             # Model artifacts
│   ├── requirements.txt       # Python dependencies
│   └── README.md             # Backend docs
├── src/                       # Frontend source
│   ├── pages/
│   │   └── Assessment.tsx     # Main page
│   ├── services/
│   │   └── api.ts            # API service
│   └── components/           # UI components
├── data/                      # Training data
├── train_model.py            # Training script
├── start_backend.py          # Backend launcher
├── QUICKSTART.md             # Quick start guide
├── PROJECT_STATUS.md         # Project status
├── TESTING_RESULTS.md        # Test results
└── README.md                 # This file

🤝 Contributing

Contributions are welcome! Areas for improvement:

Authentication & Authorization: Add user management
Database Integration: Store predictions and user data
Model Monitoring: Track model performance over time
Batch Processing: Support CSV uploads for bulk predictions
Advanced Features: A/B testing, model versioning, etc.
Mobile App: Native iOS/Android applications
Advanced Visualizations: More detailed charts and graphs

📄 License

This project is licensed under the MIT License.

📞 Support

For questions or issues:

Check the documentation files (QUICKSTART.md, PROJECT_STATUS.md, TESTING_RESULTS.md)
Review the API documentation at http://localhost:8000/docs
Check the backend logs in the terminal
Check the browser console for frontend errors

🎉 Acknowledgments

Built with FastAPI, React, and modern ML tools
SHAP library for explainable AI
Gradient boosting libraries (LightGBM, XGBoost, CatBoost)
Shadcn/ui for beautiful UI components

🚀 Start using your Credit Risk Analyzer at http://localhost:8082

Last updated: October 27, 2025

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
backend		backend
public		public
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
PROJECT_DESCRIPTION.md		PROJECT_DESCRIPTION.md
PROJECT_REPORT.md		PROJECT_REPORT.md
QUICKSTART.md		QUICKSTART.md
README.md		README.md
components.json		components.json
eslint.config.js		eslint.config.js
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
start_backend.py		start_backend.py
tailwind.config.ts		tailwind.config.ts
test_backend.py		test_backend.py
test_integration.py		test_integration.py
train_model.py		train_model.py
tsconfig.app.json		tsconfig.app.json
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

anish00700/Credit-Risk-Analyzer

Folders and files

Latest commit

History

Repository files navigation