GitHub - giobbu/collaborative-data-imputation: Data imputation with collaborative filtering and latent factor models for wind farms time series data

Collaborative-Data-Imputation

Wind Power Data Reconstruction

In power system operations and electricity markets, missing data is a pervasive challenge in practice. Missing observations can arise from sensor faults, communication failures, or maintenance outages. This issue becomes particularly critical when large-scale, data-driven approaches are applied to point and probabilistic wind power forecasting, where data quality directly affects model performance and therefore decision making.

To address this, data imputation techniques—such as k-nearest neighbors (k-NN) and factor models—are commonly employed to reconstruct incomplete datasets before training forecasting models. Effective imputation ensures data completeness and consistency, which are essential for the reliability and accuracy of modern machine-learning–based forecasting methods.

MLflow Experiments

MLflow is used to systematically compare and evaluate missing-data imputation algorithms, making it easier to identify the best-performing approach for a given dataset.

Install UV (Dependency Manager)
```
pip install uv
```
Install Project Dependencies

Install all required dependencies, including MLflow::
```
uv sync
```
Start Mlflow server

Launch the MLflow UI locally:
```
uv run mlflow ui
```
Run the Experiments

Set paramaters to test in config.py
```
nano config.py
```
Execute the experiment pipeline:
```
uv run main.py
```
View Experiment Results:

Open your browser and navigato to http://127.0.0.1:5000.

From the MLflow UI, you can explore:
- Experiment runs
- Model parameters and hyperparameters
- Evaluation metrics
- Logged artifacts (e.g., reconstructed datasets and plots)

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
data		data
experiments		experiments
img		img
source		source
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
.secrets.baseline		.secrets.baseline
LICENSE		LICENSE
README.md		README.md
config.py		config.py
main.py		main.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Collaborative-Data-Imputation

Wind Power Data Reconstruction

MLflow Experiments

About

Uh oh!

Releases

Uh oh!

Languages

License

giobbu/collaborative-data-imputation

Folders and files

Latest commit

History

Repository files navigation

Collaborative-Data-Imputation

Wind Power Data Reconstruction

MLflow Experiments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Uh oh!

Languages