Dataset: Criteo's CTR prediction challenge
- Full dataset (all variables)
- Reduced dataset (dimensionality reduction with PCA)
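
The second dataset variant above could be produced along these lines. This is only a minimal sketch: the synthetic matrix stands in for the Criteo features, and the 95% explained-variance threshold and the standardization step are assumptions, not settings taken from this project.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the numeric feature matrix (assumption: 13 columns).
rng = np.random.default_rng(0)
X_full = rng.normal(size=(200, 13))

# Standardize first so that no single feature dominates the variance.
X_scaled = StandardScaler().fit_transform(X_full)

# Keep the smallest number of components explaining at least 95% of the
# variance (assumed threshold; a fixed integer would also work).
pca = PCA(n_components=0.95, svd_solver="full")
X_reduced = pca.fit_transform(X_scaled)

print(X_full.shape, X_reduced.shape)
```

Both `X_scaled` (all variables) and `X_reduced` (PCA) can then be passed to the same set of algorithms for comparison.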
Algorithms:
- Logistic Regression
- Random Forest
- K-Nearest Neighbors
- Naive Bayes
- AdaBoost
- Linear Discriminant Analysis
- Gradient Boosting
- Neural Networks (Multilayer Perceptron)
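
As a rough illustration, the algorithms listed above can be compared in a single loop. The sketch below makes several assumptions: the data is synthetic, the hyperparameters are mostly library defaults, `GaussianNB` is chosen as the Naive Bayes variant, and scikit-learn's `MLPClassifier` stands in for the Keras network so that every model shares one interface.

```python
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.ensemble import (AdaBoostClassifier, GradientBoostingClassifier,
                              RandomForestClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier

# Synthetic binary classification data standing in for the CTR dataset.
X, y = make_classification(n_samples=400, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y)

# One estimator per algorithm; settings are illustrative, not tuned.
models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Random Forest": RandomForestClassifier(n_estimators=50, random_state=0),
    "K-Nearest Neighbors": KNeighborsClassifier(),
    "Naive Bayes": GaussianNB(),
    "AdaBoost": AdaBoostClassifier(random_state=0),
    "Linear Discriminant Analysis": LinearDiscriminantAnalysis(),
    "Gradient Boosting": GradientBoostingClassifier(random_state=0),
    "Multilayer Perceptron": MLPClassifier(max_iter=500, random_state=0),
}

# Fit each model and score it by area under the ROC curve on held-out data.
auc = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores = model.predict_proba(X_test)[:, 1]
    auc[name] = roc_auc_score(y_test, scores)

for name, value in auc.items():
    print(f"{name}: AUC = {value:.3f}")
```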
Parameter tuning: YES
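
One common way to tune parameters is a cross-validated grid search. The sketch below shows the idea for logistic regression only; the grid values, the 3-fold split, and the ROC AUC scoring are assumptions for illustration, not the settings used in this project.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

# Synthetic data standing in for the CTR dataset.
X, y = make_classification(n_samples=300, n_features=10, random_state=0)

# Hypothetical grid over the inverse regularization strength C.
param_grid = {"C": [0.01, 0.1, 1.0, 10.0]}

# Evaluate every grid point with 3-fold cross-validation, scored by ROC AUC.
search = GridSearchCV(LogisticRegression(max_iter=1000), param_grid,
                      scoring="roc_auc", cv=3)
search.fit(X, y)

print(search.best_params_, round(search.best_score_, 3))
```

The same pattern applies to the other algorithms by swapping the estimator and its parameter grid.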
Output:
- ROC curves (png)
- Performance summary for each algorithm (csv)
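
The two outputs above could be generated roughly as follows. This is a sketch for a single model on synthetic data; the file names, the temporary output directory, and the summary columns (`algorithm`, `auc`) are hypothetical choices.

```python
import os
import tempfile

import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is required
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score, roc_curve
from sklearn.model_selection import train_test_split

# Synthetic data and one fitted model as a stand-in for the full comparison.
X, y = make_classification(n_samples=400, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y)
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
scores = model.predict_proba(X_test)[:, 1]

# ROC curve points and the area under the curve.
fpr, tpr, _ = roc_curve(y_test, scores)
auc = roc_auc_score(y_test, scores)

# Hypothetical output locations.
out_dir = tempfile.mkdtemp()
png_path = os.path.join(out_dir, "roc_logistic_regression.png")
csv_path = os.path.join(out_dir, "summary.csv")

# Save the ROC curve as a PNG, with the chance diagonal for reference.
fig, ax = plt.subplots()
ax.plot(fpr, tpr, label=f"Logistic Regression (AUC = {auc:.3f})")
ax.plot([0, 1], [0, 1], linestyle="--", label="Chance")
ax.set_xlabel("False positive rate")
ax.set_ylabel("True positive rate")
ax.legend()
fig.savefig(png_path)
plt.close(fig)

# Save one summary row per algorithm as CSV.
summary = pd.DataFrame([{"algorithm": "Logistic Regression", "auc": auc}])
summary.to_csv(csv_path, index=False)
```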
Programming Language: Python
Main Libraries:
- pandas
- NumPy
- scikit-learn (sklearn)
- Keras
- Matplotlib