Skip to content

SFGLab/Team8_PhaseSpecific_Chromatin_Changes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

87 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Team8: PhaseSpecific Chromatin Changes

🎯Project Objectives

Chromatin undergoes dynamic structural changes throughout the cell cycle to regulate DNA accessibility, replication, and segregation. These transformations ensure that the genome remains transcriptionally active when required, is accurately replicated, and properly segregated during cell division. While the general principles of chromatin conformation changes are gradually being uncovered, much less is known about the specific structural transitions that may drive chromatin organization throughout the cell cycle.

Our project aims to comprehensively integrate multi-omics datasets—including gene expression, chromatin accessibility, and contact maps—to advance the understanding of cell cycle phase-specific chromatin dynamics. Coupled with 3D genome modeling, our approach seeks to uncover novel and distinct chromatin conformational changes along the trajectory of cell cycle progression and other genomic processes such as cellular maturation. We employ state-of-the-art 3D modeling techniques that incorporate temporal information from the cell cycle, alongside cutting-edge methods for multi-omics integration, clustering, and pattern recognition. This combined strategy is designed to reveal common principles and patterns underlying chromatin structural organization within the cell nucleus.

📦 Dataset

The method was primarily tested on the ChAIR data set from (Chai et al. 2025) which combines contacts from ChIA-PET methodology, scRNA-seq and scATAC-seq obtained simultaneously for thousands of cells along the cell cycle trajectory. Our algorithms are also compatible with other similar multi-omics data sets. Similarily the 3D modeling strategy ChromMovie is applicable to different types of single cell contact data.

🛠 Tools & Dependencies

ArchR
scHiCyclePred
CIRCLET
ChromMovie

🧠 Methodology

Aggregation of average gene expresion per Phase and Pseudotime.

  1. Load statistics file after Seurat – only filtered cells left, which have both the ATAC and RNAseq signal. Infor,mation about Phase of the cells and Pseudotime.
  2. Loading EnsemblID 2 Gene Name mapping – Gene names downloaded friom Ensembl Biomart.
  3. Getting gene counts per cell from cellranger BAM file – parsing CB and GX fields (from uniq UMIs)
  4. Creating expression matrix for filtered cells – Merge metadata with expression matrix.
  5. Normalization per read counts per cell – Normalize gene counts per cell to 10,000 (Counts per 10k).
  6. Aggregation by Phase and Pseudotime – Compute average gene expression per cell cycle phase.

Adding ATACseq data to RNAseq:

  1. ATACseq script first takes ATAC seq signal file and annotate it with gene regions +- 500 bp from gene TSS.
  2. The ATAC singals as 1/0 are normalized per cell and scaled by 10K factor, same as in scRNAseq analysis.
  3. The ATAC signal data are filtered to contain only the cells same as in the Seurat stat file, so have same cells as in RNAseq analysis.
  4. The ATAC signal data ate aggregated by Metacell (the mean signal per Metacell, so Pseudotime).,
  5. The ATAC-seq and RNA-seq data are combined together for common genes.

📉 Hackathon work flowchart

graph TD;
    RNA(scRNA-seq)-->|Seurat, Scanpy|T(multidimentional tensor);
    ATAC(scATAC-seq)-->|ArchR|T;
    HiC(sc contacts)-->|scHiCyclePred, CIRCLET|T;
    HiC(sc contacts)-->|ChromMovie|3D(3D models);
    3D-->STdata(spatiotemporal data);
    T-->STdata;
    T-->classification(Classification);
    STdata-->Cluster(clusterization, AE, etc.);
    Cluster-->FDA(FDA);
Loading

⚡ Example use

In order to analyse your structures saved in the same format as the output of ChromMovie, start with the following preprocessing:

python -m modeling_00_preprocess --input "path/to/files_cif"

This preprocessing, creates pandas data frames for further analysis. Subsequently, the motifs are extracted from this data the following way:

python -m modeling_01_extract_motifs --input "path/to/files_pd" --output "path/to/results_kmers" -l 10 -w 5

Finally, the clustering analysis and the centroids of each cluster can be extracted with a script:

python -m modeling_02_clustering --input  "path/to/results_kmers/result.npy" --output "path/to/results_centroid" --nc 5 --seed 100 -l 10 -w 5

the -nc parameter represents the number of clusters to be identified, -l represents the length of the motifs and -w its width, that is the number of consecutive frames to be considered as part of the motif.

🧬 Output

The output to our methods consists of a number of structures representing typical chromatin conformational changes detected in the studied case. The files are saved in the .cif format with additional .gif visualization files.

Cluster 1 Cluster 2 Cluster 3 ...
centroid_0 centroid_5 centroid_3 ...

🏗 Future steps

  1. Model validation
  2. Parameter tuning
  3. Better integration of the multi-omics data (scRNA-seq, scATAC-seq, scChIA-PET,...)
  4. Explore better characterizations of chromatin conformational changes
  5. Explore better clustering algorithms
  6. Explore the algorithms for determining representative structures via Functional Data Analysis (FDA) and other methods

🧾 Contributions

Krzysztof Banecki - team lead, 3D modeling, chromatin structure analysis

Rafał Wóycicki - RNA-seq analysis, ATAC-seq analysis

Alicja M. Olszewska - RNA-seq analysis

Konrad Dębski - ATAC-seq analysis

Jatin Shrinet - scChIA-PET analysis

📌 References

  1. Haoxi Chai, Xingyu Huang, Guangzhou Xiong, Jiaxiang Huang, Katarzyna Karolina Pels, Lingyun Meng, Jin Han, Dongmei Tang, Guanjing Pan, Liang Deng, Qin Xiao, Xiaotao Wang, Meng Zhang, Krzysztof Banecki, Dariusz Plewczynski, Chia-Lin Wei, Yijun Ruan, Tri-omic mapping revealed concerted dynamics of 3D epigenome and transcriptome in brain cells, Nature Methods (2025). https://doi.org/10.1038/s41592-025-02658-7
  2. Kursa M., Rudnicki W., "Feature Selection with the Boruta Package" Journal of Statistical Software, Vol. 36, Issue 11, Sep 2010, https://doi.org/10.18637/jss.v036.i11
  3. Krzysztof H. Banecki, Haoxi Chai, Yijun Ruan, Dariusz Plewczynski, ChromMovie: A Molecular Dynamics Approach for Simultaneous Modeling of Chromatin Conformation Changes from Multiple Single-Cell Hi-C Maps, bioRxiv, 2025, https://doi.org/10.1101/2025.05.16.654550

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 5

Languages