AA table visualization

This pipeline is for filtering, summarizing, and visualizing amino acid changes that have occurred across a sequencing dataset, and takes amino acid tables as input. These amino acid tables can come from a variety of sources, including:

miptools (for MIP amplicon data): github.com/bailey-lab/miptools
seekdeep (for illumina amplicon data): github.com/bailey-lab/seekdeep_AA
nanopore (for nanopore amplicon or whole genome data): github.com/simkin-bioinformatics/clair3-nanopore

Inputs and Outputs:

This program takes as input three tables with samples as rows, mutations as columns, and read counts at the intersections, where read counts represent coverage, reference count, or alternate count, respectively, in each of the three tables.

A fourth table gives GPS coordinates where each sample was collected, as well as a geographic label to use for aggregating samples (e.g. aggregating samples that came from the same city, county, region, district, or health facility together and counting the number of samples from the location that had a mutation divided by the total number of samples that had sequencing data available).

This program outputs an interactive html file that can be zoomed or panned as needed, with the ability to hover over locations and see the prevalence of a mutation at each location. The program also exports svg files suitable for publication.

Installation:

You will need a copy of conda in order to manage dependencies. We strongly recommend using 'mamba' derivatives of conda, as these provide built-in strict version control when downloading packages, have more robust package solving, and provide quicker package download times. The easiest way to obtain mamba is with micromamba, available here: https://mamba.readthedocs.io/en/latest/installation/micromamba-installation.html The relevant command is here:

"${SHELL}" <(curl -L micro.mamba.pm/install.sh)

You can obtain a copy of this git repository with this command:

git clone https://github.com/simkin-bioinformatics/AA_table_visualization.git

After changing directory to the cloned AA_table_visualization folder, you can cd into the folder (if you're unsure whether you're in the correct folder, make sure the folder you cd into contains an environment.yaml file) and run this command to build a conda environment that contains all package dependencies (you can substitute micromamba or conda instead of mamba depending on which type of conda you have):

mamba env create -f environment.yaml

(optional - useful if your system times out or throws errors during the static image graphing step). Install chrome for plotly by activating your environment and running the included install script:

mamba activate aa_table_visualization

plotly_get_chrome

Usage:

You'll need to have coverage, alternate, and reference AA tables copied into the github folder that you cloned, along with a metadata table that includes columns with sample names, latitudes, longitudes, and location names that you would like to use to group your samples together (e.g. geographic region, district, or health facility names)

activate the environment with:

mamba activate aa_table_visualization

Launch the jupyter notebook with:

jupyter lab variant_graphing.ipynb

make sure to open the jupyter notebook in a web browser (e.g. using the link included in the output messages), and open the variant_graphing.ipynb file (e.g. by double clicking it). Follow the instructions in the notebook carefully. An example usage visualizing a dataset from Tanzania is pre-filled out in the notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
input		input
seekdeep_input_data		seekdeep_input_data
src		src
.gitignore		.gitignore
README.md		README.md
environment.yaml		environment.yaml
variant_graphing.ipynb		variant_graphing.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AA table visualization

Inputs and Outputs:

Installation:

Usage:

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

simkin-bioinformatics/AA_table_visualization

Folders and files

Latest commit

History

Repository files navigation

AA table visualization

Inputs and Outputs:

Installation:

Usage:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages