README

Purpose

main.py executes the full embedding → aggregation → ROI-level classification pipeline for multiplexed imaging (mxIF) datasets.
It loads cell tables and ROI labels, builds per-ROI graphs, computes node embeddings, aggregates them into ROI-level embeddings, and trains/evaluates ROI classifiers.

Inputs

The workflow is driven by a YAML config file. Main input components include:

1. Cell-level data

.rds or .csv file containing per-cell x, y, phenotype, roi_id, cell_id.
Column names are mapped through cell_columns.

2. ROI-level labels

CSV mapping roi_id → roi_label.
Optionally supports patient_id and subject-level labels.

3. Graph configuration

Graph type: knn or radius
Parameters such as knn_k or radius.

4. ROI-supervision configuration

Embedding parameter search space
Aggregation method (mean / attention / pooling)
ROI classifier and CV settings.

Outputs

All results are written under the directory specified by --outdir. Key outputs:

Data

dataframes/df.csv — merged and validated cell table with ROI/subject labels.

Graphs

graphs/graph_dict_<type>.pkl — per-ROI graphs.
graphs/G_all_<type>.pkl — disconnected union graph.

Embeddings & Classification

Cached node embeddings.

ROI embeddings stored under:

evaluate/roi_supervised_best/<metric>/roi_embedding.mat

Best hyperparameters (best_roi_supervision.yaml).
Trained classifier objects.

Logs & Config

logs/run_*.log — timestamped logs.
config/resolved_config.yaml — exact config used in the run.

Pipeline Summary

Load and validate cell + ROI/subject labels
Build per-ROI graphs
Construct global union graph
Run supervised search over embedding + aggregation hyperparameters
Compute best node embeddings
Aggregate node embeddings into ROI-level vectors
Train and export ROI-level classifiers
Save embeddings, parameters, and diagnostics

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
__pycache__		__pycache__
config		config
notebooks		notebooks
plot_overlay		plot_overlay
src		src
test		test
.gitignore		.gitignore
README.md		README.md
analyze_from_csv.py		analyze_from_csv.py
concat_roi.py		concat_roi.py
environment.yml		environment.yml
interprete.py		interprete.py
logs_to_csv.py		logs_to_csv.py
main.py		main.py
pipe.sh		pipe.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

README

Purpose

Inputs

1. Cell-level data

2. ROI-level labels

3. Graph configuration

4. ROI-supervision configuration

Outputs

Data

Graphs

Embeddings & Classification

Logs & Config

Pipeline Summary

About

Uh oh!

Releases

Packages

Languages

dimi-lab/TMA_Graph_Embedding

Folders and files

Latest commit

History

Repository files navigation

README

Purpose

Inputs

1. Cell-level data

2. ROI-level labels

3. Graph configuration

4. ROI-supervision configuration

Outputs

Data

Graphs

Embeddings & Classification

Logs & Config

Pipeline Summary

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages