Chromatin undergoes dynamic structural changes throughout the cell cycle to regulate DNA accessibility, replication, and segregation. These transformations ensure that the genome remains transcriptionally active when required, is accurately replicated, and properly segregated during cell division. While the general principles of chromatin conformation changes are gradually being uncovered, much less is known about the specific structural transitions that may drive chromatin organization throughout the cell cycle.
Our project aims to comprehensively integrate multi-omics datasets—including gene expression, chromatin accessibility, and contact maps—to advance the understanding of cell cycle phase-specific chromatin dynamics. Coupled with 3D genome modeling, our approach seeks to uncover novel and distinct chromatin conformational changes along the trajectory of cell cycle progression and other genomic processes such as cellular maturation. We employ state-of-the-art 3D modeling techniques that incorporate temporal information from the cell cycle, alongside cutting-edge methods for multi-omics integration, clustering, and pattern recognition. This combined strategy is designed to reveal common principles and patterns underlying chromatin structural organization within the cell nucleus.
The method was primarily tested on the ChAIR data set from (Chai et al. 2025) which combines contacts from ChIA-PET methodology, scRNA-seq and scATAC-seq obtained simultaneously for thousands of cells along the cell cycle trajectory. Our algorithms are also compatible with other similar multi-omics data sets. Similarily the 3D modeling strategy ChromMovie is applicable to different types of single cell contact data.
ArchR
scHiCyclePred
CIRCLET
ChromMovie
- Load statistics file after Seurat – only filtered cells left, which have both the ATAC and RNAseq signal. Infor,mation about Phase of the cells and Pseudotime.
- Loading EnsemblID 2 Gene Name mapping – Gene names downloaded friom Ensembl Biomart.
- Getting gene counts per cell from cellranger BAM file – parsing CB and GX fields (from uniq UMIs)
- Creating expression matrix for filtered cells – Merge metadata with expression matrix.
- Normalization per read counts per cell – Normalize gene counts per cell to 10,000 (Counts per 10k).
- Aggregation by Phase and Pseudotime – Compute average gene expression per cell cycle phase.
- ATACseq script first takes ATAC seq signal file and annotate it with gene regions +- 500 bp from gene TSS.
- The ATAC singals as 1/0 are normalized per cell and scaled by 10K factor, same as in scRNAseq analysis.
- The ATAC signal data are filtered to contain only the cells same as in the Seurat stat file, so have same cells as in RNAseq analysis.
- The ATAC signal data ate aggregated by Metacell (the mean signal per Metacell, so Pseudotime).,
- The ATAC-seq and RNA-seq data are combined together for common genes.
graph TD;
RNA(scRNA-seq)-->|Seurat, Scanpy|T(multidimentional tensor);
ATAC(scATAC-seq)-->|ArchR|T;
HiC(sc contacts)-->|scHiCyclePred, CIRCLET|T;
HiC(sc contacts)-->|ChromMovie|3D(3D models);
3D-->STdata(spatiotemporal data);
T-->STdata;
T-->classification(Classification);
STdata-->Cluster(clusterization, AE, etc.);
Cluster-->FDA(FDA);
In order to analyse your structures saved in the same format as the output of ChromMovie, start with the following preprocessing:
python -m modeling_00_preprocess --input "path/to/files_cif"This preprocessing, creates pandas data frames for further analysis. Subsequently, the motifs are extracted from this data the following way:
python -m modeling_01_extract_motifs --input "path/to/files_pd" --output "path/to/results_kmers" -l 10 -w 5Finally, the clustering analysis and the centroids of each cluster can be extracted with a script:
python -m modeling_02_clustering --input "path/to/results_kmers/result.npy" --output "path/to/results_centroid" --nc 5 --seed 100 -l 10 -w 5the -nc parameter represents the number of clusters to be identified, -l represents the length of the motifs and -w its width, that is the number of consecutive frames to be considered as part of the motif.
The output to our methods consists of a number of structures representing typical chromatin conformational changes detected in the studied case. The files are saved in the .cif format with additional .gif visualization files.
| Cluster 1 | Cluster 2 | Cluster 3 | ... |
|---|---|---|---|
![]() |
![]() |
![]() |
... |
- Model validation
- Parameter tuning
- Better integration of the multi-omics data (scRNA-seq, scATAC-seq, scChIA-PET,...)
- Explore better characterizations of chromatin conformational changes
- Explore better clustering algorithms
- Explore the algorithms for determining representative structures via Functional Data Analysis (FDA) and other methods
Krzysztof Banecki - team lead, 3D modeling, chromatin structure analysis
Rafał Wóycicki - RNA-seq analysis, ATAC-seq analysis
Alicja M. Olszewska - RNA-seq analysis
Konrad Dębski - ATAC-seq analysis
Jatin Shrinet - scChIA-PET analysis
- Haoxi Chai, Xingyu Huang, Guangzhou Xiong, Jiaxiang Huang, Katarzyna Karolina Pels, Lingyun Meng, Jin Han, Dongmei Tang, Guanjing Pan, Liang Deng, Qin Xiao, Xiaotao Wang, Meng Zhang, Krzysztof Banecki, Dariusz Plewczynski, Chia-Lin Wei, Yijun Ruan, Tri-omic mapping revealed concerted dynamics of 3D epigenome and transcriptome in brain cells, Nature Methods (2025). https://doi.org/10.1038/s41592-025-02658-7
- Kursa M., Rudnicki W., "Feature Selection with the Boruta Package" Journal of Statistical Software, Vol. 36, Issue 11, Sep 2010, https://doi.org/10.18637/jss.v036.i11
- Krzysztof H. Banecki, Haoxi Chai, Yijun Ruan, Dariusz Plewczynski, ChromMovie: A Molecular Dynamics Approach for Simultaneous Modeling of Chromatin Conformation Changes from Multiple Single-Cell Hi-C Maps, bioRxiv, 2025, https://doi.org/10.1101/2025.05.16.654550


