Skip to content

This repository contains a variety of Data Science projects developed for learning and practical applications. It includes end-to-end workflows such as data preprocessing, EDA, visualization, machine learning model development, evaluation, and deployment-ready code. Each project is structured for clarity and reproducibility.

Notifications You must be signed in to change notification settings

randeepraj2003/Data-Science-Projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Science Projects

Python Machine Learning AI Data Science

A curated collection of end-to-end Data Science projects, demonstrating hands-on skills in data analysis, machine learning, and AI.


Skills Dashboard

Category Skills & Tools
Programming Python, NumPy, Pandas, SQL
Visualization Matplotlib, Seaborn, Plotly, Tableau
Machine Learning Decision Trees, Random Forest, SVM, KNN, Logistic Regression
Advanced ML / AI XGBoost, LightGBM, Neural Networks, MLP, Deep Learning
Data Science EDA, Feature Engineering, PCA, Hypothesis Testing, Statistics
NLP & Text Mining Naive Bayes, Text Processing, Sentiment Analysis
Time Series & Forecasting ARIMA, Prophet
Recommender Systems Collaborative Filtering, Content-Based Filtering
Tools Git, Jupyter Notebook, VS Code, Google Colab

Projects Overview

# Project Focus / Techniques
1 Association Rules Market Basket Analysis, Apriori Algorithm
2 Basics of Python 1 & 2 Python fundamentals & scripting
3 Basic Statistics Descriptive & Inferential Statistics
4 Chi-Square Test Categorical Data, Hypothesis Testing
5 Clustering K-Means, Hierarchical Clustering
6 Decision Tree Classification & Regression
7 EDA 1 & 2 Data Visualization, Insights Extraction
8 Hypothesis Testing t-tests, ANOVA, p-values
9 KNN Classification, Distance Metrics
10 LightGBM & XGBoost Gradient Boosting, Hyperparameter Tuning
11 Logistic Regression Binary & Multiclass Classification
12 Multilayer Perceptron (MLP) Neural Networks for tabular data
13 Naive Bayes & Text Mining NLP, Sentiment Analysis
14 Neural Networks Deep Learning, TensorFlow / PyTorch
15 Random Forest Ensemble Learning, Feature Importance
16 PCA Dimensionality Reduction
17 Recommendation System Collaborative & Content-Based Filtering
18 SVM Classification, Kernel Methods
19 Time Series Analysis Forecasting Models, ARIMA, Prophet

Project Workflow

  1. Data Collection & Cleaning – transform raw data into analysis-ready format.
  2. EDA & Visualization – discover patterns, trends, and anomalies.
  3. Feature Engineering & Selection – extract meaningful features.
  4. Modeling – build, tune, and evaluate classical & advanced ML models.
  5. Insights & Reporting – actionable business insights from data.

Why This Repository?

1.FAANG-Level Skills: Demonstrates data science, ML, AI, and problem-solving skills.

  1. End-to-End Projects: From raw data to deployable insights.

  2. Clean & Reproducible Code: Well-commented, structured, and ready for production.

  3. Real-World Applications: Projects emulate business and analytical challenges.

About

This repository contains a variety of Data Science projects developed for learning and practical applications. It includes end-to-end workflows such as data preprocessing, EDA, visualization, machine learning model development, evaluation, and deployment-ready code. Each project is structured for clarity and reproducibility.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published