Welcome to my GitHub portfolio!
I am a Data Science student with a strong interest in data analysis, machine learning, and big data technologies.
This repository showcases my academic projects, technical skills, and hands-on experience with real datasets.
- Program: Applied Data Science
- Location: Ottawa-Gatineau, Canada
- Interests: Data analysis, Machine Learning, Big Data, ETL pipelines
- Tools I enjoy working with: Python, SQL, Spark, Pandas, Power BI
- Python (Pandas, NumPy, Scikit-learn)
- SQL
- PySpark (RDD & DataFrames)
- Hadoop (HDFS)
- Data cleaning & preprocessing
- Exploratory Data Analysis (EDA)
- Classification & Regression models
- Clustering (K-Means, DBSCAN)
- Dimensionality reduction (PCA, MCA)
- Jupyter Notebook
- Git & GitHub
- VS Code
- Docker (basic usage)
- PowerPoint & technical reporting
- Objective: Predict clients likely to default on credit card payments
- Techniques: EDA, PCA, K-Means clustering, classification models
- Tools: Python, Pandas, Scikit-learn
- Objective: Analyze large-scale taxi trip data
- Techniques: Spark DataFrames, aggregations, correlations
- Tools: PySpark, Hadoop
- Objective: Classify SMS messages as spam or ham
- Techniques: TF-IDF, Logistic Regression, SVM, MLP
- Metrics: Accuracy, Precision, Recall, F1-score
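
As a quick illustration of the metrics listed for the spam classifier, here is a minimal sketch of how accuracy, precision, recall, and F1-score follow from prediction counts. The function name `binary_metrics` and the labels are hypothetical and invented for this example; they are not taken from the project code, which would more typically rely on scikit-learn's metrics module.

```python
def binary_metrics(y_true, y_pred, positive="spam"):
    """Return accuracy, precision, recall and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    # Confusion-matrix counts with `positive` as the class of interest.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn

    # Each ratio falls back to 0.0 when its denominator is zero.
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical labels, invented for illustration only.
y_true = ["spam", "ham", "spam", "ham", "ham", "spam", "ham", "ham"]
y_pred = ["spam", "ham", "ham", "ham", "spam", "spam", "ham", "ham"]

scores = binary_metrics(y_true, y_pred)
for name, value in scores.items():
    print(f"{name}: {value:.3f}")
```

With these made-up labels there are 2 true positives, 1 false positive, 1 false negative, and 4 true negatives, so accuracy is 0.75 while precision, recall, and F1 are each 2/3. Reporting all four matters for spam detection because ham messages usually outnumber spam, and accuracy alone can hide a weak spam recall.
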
(More projects available in the repository folders)
- Clean and well-structured project folders
- Jupyter notebooks with explanations
- Data visualizations and interpretations
- Reports and presentation slides
- GitHub: https://github.com/traore19
- LinkedIn: https://www.linkedin.com/in/traore-fadimatou-648162347/
- Email: traorefadimatou21@gmail.com
Feel free to explore my projects and connect with me!

