ashmi8/End_to_End_DataScience_Project

End to End Data Science Project — Wine Quality

End-to-end ML pipeline for predicting wine quality, with automated ingestion → validation → transformation → training → evaluation, plus a simple Flask UI for inference.

Overview

Project Structure

.
├─ main.py
├─ app.py
├─ config/
│  └─ config.yaml
├─ params.yaml
├─ schema.yaml
├─ src/datascience/
│  ├─ constants/__init__.py
│  ├─ config/configuration.py
│  ├─ entity/config_entity.py
│  ├─ utils/common.py
│  ├─ components/
│  │  ├─ data_ingestion.py
│  │  ├─ data_validation.py
│  │  ├─ data_transformation.py
│  │  ├─ model_trainer.py
│  │  └─ model_evaluation.py
│  └─ pipeline/
│     ├─ data_ingestion_pipeline.py
│     ├─ data_validation_pipeline.py
│     ├─ data_transformation_pipeline.py
│     ├─ model_trainer_pipeline.py
│     ├─ model_evaluation_pipeline.py
│     └─ prediction_pipeline.py
├─ templates/
│  ├─ index.html
│  └─ results.html
└─ artifacts/ (generated)

Configuration

Pipelines

Setup

python -m venv env
source env/bin/activate
pip install -r requirements.txt
cp .env.example .env  # optional, for MLflow credentials if needed

How to Run (Training)

python main.py
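
The stage order named in the introduction (ingestion, validation, transformation, training, evaluation) can be pictured as a simple sequential runner. The following is only a minimal sketch of that idea, not the repository's actual main.py: the `run_stages` helper and the placeholder stage callables are hypothetical names introduced for illustration.

```python
import logging
from typing import Callable, List, Tuple

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("pipeline")

Stage = Tuple[str, Callable[[], None]]


def run_stages(stages: List[Stage]) -> List[str]:
    """Run each stage in order; stop at the first failure.

    Returns the names of the stages that completed successfully.
    """
    completed: List[str] = []
    for name, stage in stages:
        logger.info(">>> stage %s started", name)
        try:
            stage()
        except Exception:
            # Log with traceback, then re-raise so later stages do not run
            # against missing or partial artifacts.
            logger.exception("stage %s failed", name)
            raise
        logger.info("<<< stage %s completed", name)
        completed.append(name)
    return completed


if __name__ == "__main__":
    # Hypothetical placeholders; in the real project each entry would call
    # the corresponding class under src/datascience/pipeline.
    def placeholder() -> None:
        pass

    run_stages([
        ("data_ingestion", placeholder),
        ("data_validation", placeholder),
        ("data_transformation", placeholder),
        ("model_trainer", placeholder),
        ("model_evaluation", placeholder),
    ])
```

Re-raising on failure keeps a broken early stage from silently feeding stale artifacts into training or evaluation.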

Generated artifacts are written under artifacts/.

How to Run (Web UI)

python app.py  # serves on http://0.0.0.0:8080

Guidelines for Extending

  1. Update schemas in schema.yaml when adding/removing columns.
  2. Adjust paths/artifacts in config/config.yaml when changing storage layout.
  3. Tune hyperparameters in params.yaml.
  4. Add new components under src/datascience/components, then wire them into src.datascience.config.configuration.ConfigurationManager and into the pipelines under src/datascience/pipeline.
  5. Keep utility helpers in src/datascience/utils/common.py for I/O and serialization.
  6. Ensure the MLflow tracking URI is set in src.datascience.config.configuration.ConfigurationManager.get_model_evaluation_config if remote tracking is used.
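
To illustrate why step 1 matters, here is a minimal sketch of a column check against a schema. It assumes schema.yaml maps column names to dtypes, which the README does not confirm; `check_columns`, `SchemaReport`, and `EXAMPLE_SCHEMA` are hypothetical names, and the schema is hard-coded as a dict here so the example runs without a YAML parser.

```python
from typing import Dict, Iterable, List, NamedTuple


class SchemaReport(NamedTuple):
    ok: bool
    missing: List[str]      # columns in the schema but absent from the data
    unexpected: List[str]   # columns in the data but absent from the schema


def check_columns(actual_columns: Iterable[str], expected: Dict[str, str]) -> SchemaReport:
    """Compare the data's column names with the expected schema keys."""
    actual = list(actual_columns)
    actual_set = set(actual)
    missing = [col for col in expected if col not in actual_set]
    unexpected = [col for col in actual if col not in expected]
    return SchemaReport(ok=not missing and not unexpected,
                        missing=missing, unexpected=unexpected)


# Illustrative schema only; the real column names and dtypes live in schema.yaml.
EXAMPLE_SCHEMA = {"alcohol": "float64", "pH": "float64", "quality": "int64"}

if __name__ == "__main__":
    report = check_columns(["alcohol", "pH", "sulphates"], EXAMPLE_SCHEMA)
    print(report)
```

With a check of this kind, a column that is added or removed without a matching schema.yaml update is reported as `unexpected` or `missing` instead of passing through unnoticed.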

License

This project is licensed under GPL-3.0; see LICENSE.
