Skip to content
View EttoreRocchi's full-sized avatar
💻
4 8 15 16 23 42 EXECUTE
💻
4 8 15 16 23 42 EXECUTE

Block or report EttoreRocchi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
EttoreRocchi/README.md

Hi, I'm Ettore!

PhD student building interpretable ML pipelines for omics data analysis.

GitHub followers LinkedIn Google Scholar Scopus

About Me

I'm a PhD student at the University of Bologna with a background in Physics. I work at the intersection of machine learning and biomedicine, building interpretable, reproducible, and scalable pipelines for omics data analysis. I'm part of the Multi-Omics and Health-Care Data Analytics Unit at Sant'Orsola Hospital in Bologna.

Tech Stack

Python scikit-learn PyTorch Bash Snakemake Nextflow

What I Work On

  • ML & Deep Learning in Biomedicine - Feature engineering, multi-omics integration, and interpretable models for clinical and biological prediction tasks
  • Algorithms for Biomedical Data - Building computational tools, statistical models, and network-based approaches to simulate and analyze biological systems
  • Mass Spectrometry - ML methods for predicting antimicrobial resistance from MALDI-TOF mass spectra
  • Metagenomics - Network-based modeling of microbial communities for microbiome profiling, simulation, and pathogen detection
  • Genomics - Analyzing structural variants, APOBEC-style mutations, and computational approaches to CRISPR genome editing

Featured Projects

  • combatlearn - Scikit-learn compatible ComBat for batch effect correction in high-dimensional data. Useful for harmonizing data across studies.

  • ResPredAI - ML pipeline for predicting antimicrobial resistance from demographic and clinical data.

  • MaldiAMRKit - Toolkit to read and preprocess MALDI-TOF mass spectra for AMR analyses.

  • CATS - Compare Cas9 nucleases by detecting overlapping PAM sites and enabling allele-specific targeting of disease mutations, with ClinVar integration.

  • CAMISIM-BrokenStick - Extension of CAMISIM to simulate metagenomic sequencing data using a broken stick model.

  • APOBECSeeker - Pipeline for identifying APOBEC-style mutations from multiple sequence alignment.

Selected Publications

Organization

I'm a member of Physics4MedicineLab, the Applied Physics group within the Department of Medicine at Alma Mater Studiorum - University of Bologna.

Get in Touch

Pinned Loading

  1. combatlearn combatlearn Public

    The ComBat algorithm for a learning framework (scikit-learn compatible)

    Python 5

  2. ResPredAI ResPredAI Public

    Implementation of the pipeline described in the work "Artificial intelligence model to predict resistances in Gram-negative bloodstream infections" by Bonazzetti et al., npj Digit. Med. 8, 319 (2025)

    Python 5

  3. MaldiAMRKit MaldiAMRKit Public

    Comprehensive toolkit for MALDI-TOF mass spectrometry data preprocessing for antimicrobial resistance (AMR) prediction purposes

    Jupyter Notebook 2

  4. Physics4MedicineLab/CATS Physics4MedicineLab/CATS Public

    Comparing Cas Activities by Target Superimposition (CATS)

    Python 2

  5. Physics4MedicineLab/APOBECSeeker Physics4MedicineLab/APOBECSeeker Public

    Pipeline for the identification of APOBEC-style mutations from multiple sequence alignment

    Python 1