Doc - How to contribute new modes #443

keiran-rowell-unsw · 2026-01-16T04:33:18Z

A Pipeline structure and metrics precis, to guide addition of any new proteinfold --modes.

Designed to quickly orient new contributors of the main places to edit to add a newly release protein structure prediction program, not provide fine-grained implementation details.

…examples

github-actions · 2026-01-16T04:35:03Z

`nf-core pipelines lint` overall result: Passed ✅ ⚠️

Posted for pipeline commit 371a4be

+| ✅ 327 tests passed       |+
#| ❔   4 tests were ignored |#
#| ❔   1 tests had warnings |#
!| ❗  33 tests had warnings |!

Details

❗ Test warnings:

files_exist - File not found: conf/igenomes.config
files_exist - File not found: conf/igenomes_ignored.config
pipeline_todos - TODO string in base.config: Check the defaults for all processes
pipeline_todos - TODO string in base.config: Customise requirements for specific processes.
pipeline_todos - TODO string in methods_description_template.yml: #Update the HTML below to your preferred methods description, e.g. add publication citation for this pipeline
pipeline_todos - TODO string in main.nf: Optionally add in-text citation tools to this list.
pipeline_todos - TODO string in main.nf: Optionally add bibliographic entries to this list.
pipeline_todos - TODO string in main.nf: Only uncomment below if logic in toolCitationText/toolBibliographyText has been filled!
pipeline_todos - TODO string in usage.md: Add documentation about anything specific to running your pipeline. For general topics, please point to (and add to) the main nf-core website.
pipeline_todos - TODO string in nextflow.config: Specify any additional parameters here
schema_description - No description provided in schema for parameter: rosettafold2na_uniref30_link
schema_description - No description provided in schema for parameter: rosettafold2na_bfd_link
schema_description - No description provided in schema for parameter: rosettafold2na_pdb100_link
schema_description - No description provided in schema for parameter: rosettafold2na_weights_link
schema_description - No description provided in schema for parameter: rfam_full_region_link
schema_description - No description provided in schema for parameter: rfam_cm_link
schema_description - No description provided in schema for parameter: rnacentral_rfam_annotations_link
schema_description - No description provided in schema for parameter: rnacentral_id_mapping_link
schema_description - No description provided in schema for parameter: rnacentral_sequences_link
schema_description - No description provided in schema for parameter: rosettafold2na_uniref30_path
schema_description - No description provided in schema for parameter: rosettafold2na_bfd_path
schema_description - No description provided in schema for parameter: rosettafold2na_pdb100_path
schema_description - No description provided in schema for parameter: rosettafold2na_weights_path
local_component_structure - post_processing.nf in subworkflows/local should be moved to a SUBWORKFLOW_NAME/main.nf structure
local_component_structure - prepare_rosettafold_all_atom_dbs.nf in subworkflows/local should be moved to a SUBWORKFLOW_NAME/main.nf structure
local_component_structure - prepare_rosettafold2na_dbs.nf in subworkflows/local should be moved to a SUBWORKFLOW_NAME/main.nf structure
local_component_structure - prepare_alphafold3_dbs.nf in subworkflows/local should be moved to a SUBWORKFLOW_NAME/main.nf structure
local_component_structure - prepare_colabfold_dbs.nf in subworkflows/local should be moved to a SUBWORKFLOW_NAME/main.nf structure
local_component_structure - aria2_uncompress.nf in subworkflows/local should be moved to a SUBWORKFLOW_NAME/main.nf structure
local_component_structure - prepare_esmfold_dbs.nf in subworkflows/local should be moved to a SUBWORKFLOW_NAME/main.nf structure
local_component_structure - prepare_helixfold3_dbs.nf in subworkflows/local should be moved to a SUBWORKFLOW_NAME/main.nf structure
local_component_structure - prepare_boltz_dbs.nf in subworkflows/local should be moved to a SUBWORKFLOW_NAME/main.nf structure
local_component_structure - prepare_alphafold2_dbs.nf in subworkflows/local should be moved to a SUBWORKFLOW_NAME/main.nf structure

❔ Tests ignored:

files_unchanged - File ignored due to lint config: .github/CONTRIBUTING.md
files_unchanged - File ignored due to lint config: .github/workflows/linting.yml
actions_schema_validation - actions_schema_validation
multiqc_config - multiqc_config

❔ Tests fixed:

rocrate_readme_sync - Mismatch fixed: RO-Crate description updated from README.md.

✅ Tests passed:

files_exist - File found: .gitattributes
files_exist - File found: .gitignore
files_exist - File found: .nf-core.yml
files_exist - File found: .prettierignore
files_exist - File found: .prettierrc.yml
files_exist - File found: CHANGELOG.md
files_exist - File found: CITATIONS.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_exist - File found: nextflow_schema.json
files_exist - File found: nextflow.config
files_exist - File found: README.md
files_exist - File found: .github/.dockstore.yml
files_exist - File found: .github/CONTRIBUTING.md
files_exist - File found: .github/ISSUE_TEMPLATE/bug_report.yml
files_exist - File found: .github/ISSUE_TEMPLATE/config.yml
files_exist - File found: .github/ISSUE_TEMPLATE/feature_request.yml
files_exist - File found: .github/PULL_REQUEST_TEMPLATE.md
files_exist - File found: .github/workflows/branch.yml
files_exist - File found: .github/workflows/nf-test.yml
files_exist - File found: .github/actions/get-shards/action.yml
files_exist - File found: .github/actions/nf-test/action.yml
files_exist - File found: .github/workflows/linting_comment.yml
files_exist - File found: .github/workflows/linting.yml
files_exist - File found: assets/email_template.html
files_exist - File found: assets/email_template.txt
files_exist - File found: assets/sendmail_template.txt
files_exist - File found: assets/nf-core-proteinfold_logo_light.png
files_exist - File found: conf/modules.config
files_exist - File found: conf/test.config
files_exist - File found: conf/test_full.config
files_exist - File found: docs/images/nf-core-proteinfold_logo_light.png
files_exist - File found: docs/images/nf-core-proteinfold_logo_dark.png
files_exist - File found: docs/output.md
files_exist - File found: docs/README.md
files_exist - File found: docs/README.md
files_exist - File found: docs/usage.md
files_exist - File found: nf-test.config
files_exist - File found: tests/default.nf.test
files_exist - File found: main.nf
files_exist - File found: assets/multiqc_config.yml
files_exist - File found: conf/base.config
files_exist - File found: .github/workflows/awstest.yml
files_exist - File found: .github/workflows/awsfulltest.yml
files_exist - File found: modules.json
files_exist - File found: ro-crate-metadata.json
files_exist - File not found check: .github/ISSUE_TEMPLATE/bug_report.md
files_exist - File not found check: .github/ISSUE_TEMPLATE/feature_request.md
files_exist - File not found check: .github/workflows/push_dockerhub.yml
files_exist - File not found check: .markdownlint.yml
files_exist - File not found check: .nf-core.yaml
files_exist - File not found check: .yamllint.yml
files_exist - File not found check: bin/markdown_to_html.r
files_exist - File not found check: conf/aws.config
files_exist - File not found check: docs/images/nf-core-proteinfold_logo.png
files_exist - File not found check: lib/Checks.groovy
files_exist - File not found check: lib/Completion.groovy
files_exist - File not found check: lib/NfcoreTemplate.groovy
files_exist - File not found check: lib/Utils.groovy
files_exist - File not found check: lib/Workflow.groovy
files_exist - File not found check: lib/WorkflowMain.groovy
files_exist - File not found check: lib/WorkflowProteinfold.groovy
files_exist - File not found check: parameters.settings.json
files_exist - File not found check: pipeline_template.yml
files_exist - File not found check: Singularity
files_exist - File not found check: lib/nfcore_external_java_deps.jar
files_exist - File not found check: .travis.yml
nextflow_config - Found nf-schema plugin
nextflow_config - Config variable found: manifest.name
nextflow_config - Config variable found: manifest.nextflowVersion
nextflow_config - Config variable found: manifest.description
nextflow_config - Config variable found: manifest.version
nextflow_config - Config variable found: manifest.homePage
nextflow_config - Config variable found: timeline.enabled
nextflow_config - Config variable found: trace.enabled
nextflow_config - Config variable found: report.enabled
nextflow_config - Config variable found: dag.enabled
nextflow_config - Config variable found: process.cpus
nextflow_config - Config variable found: process.memory
nextflow_config - Config variable found: process.time
nextflow_config - Config variable found: params.outdir
nextflow_config - Config variable found: params.input
nextflow_config - Config variable found: manifest.mainScript
nextflow_config - Config variable found: timeline.file
nextflow_config - Config variable found: trace.file
nextflow_config - Config variable found: report.file
nextflow_config - Config variable found: dag.file
nextflow_config - Config variable (correctly) not found: params.nf_required_version
nextflow_config - Config variable (correctly) not found: params.container
nextflow_config - Config variable (correctly) not found: params.singleEnd
nextflow_config - Config variable (correctly) not found: params.igenomesIgnore
nextflow_config - Config variable (correctly) not found: params.name
nextflow_config - Config variable (correctly) not found: params.enable_conda
nextflow_config - Config variable (correctly) not found: params.max_cpus
nextflow_config - Config variable (correctly) not found: params.max_memory
nextflow_config - Config variable (correctly) not found: params.max_time
nextflow_config - Config variable (correctly) not found: params.validationFailUnrecognisedParams
nextflow_config - Config variable (correctly) not found: params.validationLenientMode
nextflow_config - Config variable (correctly) not found: params.validationSchemaIgnoreParams
nextflow_config - Config variable (correctly) not found: params.validationShowHiddenParams
nextflow_config - Config variable (correctly) not found: validation.failUnrecognisedParams
nextflow_config - Config variable (correctly) not found: validation.failUnrecognisedHeaders
nextflow_config - Config timeline.enabled had correct value: true
nextflow_config - Config report.enabled had correct value: true
nextflow_config - Config trace.enabled had correct value: true
nextflow_config - Config dag.enabled had correct value: true
nextflow_config - Config manifest.name began with nf-core/
nextflow_config - Config variable manifest.homePage began with https://github.com/nf-core/
nextflow_config - Config dag.file ended with .html
nextflow_config - Config variable manifest.nextflowVersion started with >= or !>=
nextflow_config - Config manifest.version ends in dev: 1.2.0dev
nextflow_config - Config params.custom_config_version is set to master
nextflow_config - Config params.custom_config_base is set to https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Lines for loading custom profiles found
nextflow_config - nextflow.config contains configuration profile test
nextflow_config - Config default value correct: params.mode= alphafold2
nextflow_config - Config default value correct: params.uniref30_prefix= UniRef30_2023_02
nextflow_config - Config default value correct: params.alphafold2_max_template_date= 2038-01-19
nextflow_config - Config default value correct: params.alphafold2_mode= split_msa_prediction
nextflow_config - Config default value correct: params.alphafold2_model_preset= monomer_ptm
nextflow_config - Config default value correct: params.alphafold2_params_prefix= alphafold_params_2022-12-06
nextflow_config - Config default value correct: params.colabfold_model_preset= alphafold2_ptm
nextflow_config - Config default value correct: params.colabfold_num_recycles= 3
nextflow_config - Config default value correct: params.colabfold_use_amber= true
nextflow_config - Config default value correct: params.colabfold_use_templates= true
nextflow_config - Config default value correct: params.esmfold_num_recycles= 4
nextflow_config - Config default value correct: params.esmfold_model_preset= monomer
nextflow_config - Config default value correct: params.helixfold3_precision= bf16
nextflow_config - Config default value correct: params.helixfold3_infer_times= 4
nextflow_config - Config default value correct: params.helixfold3_max_template_date= 2038-01-19
nextflow_config - Config default value correct: params.custom_config_version= master
nextflow_config - Config default value correct: params.custom_config_base= https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Config default value correct: params.alphafold2_bfd_link= https://storage.googleapis.com/alphafold-databases/casp14_versions/bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt.tar.gz
nextflow_config - Config default value correct: params.alphafold2_small_bfd_link= https://storage.googleapis.com/alphafold-databases/reduced_dbs/bfd-first_non_consensus_sequences.fasta.gz
nextflow_config - Config default value correct: params.alphafold2_params_link= https://storage.googleapis.com/alphafold/alphafold_params_2022-12-06.tar
nextflow_config - Config default value correct: params.alphafold2_mgnify_link= https://ftp.ebi.ac.uk/pub/databases/metagenomics/peptide_database/2024_04/mgy_clusters.fa.gz
nextflow_config - Config default value correct: params.alphafold2_pdb70_link= https://wwwuser.gwdguser.de/~compbiol/data/hhsuite/databases/hhsuite_dbs/pdb70_from_mmcif_220313.tar.gz
nextflow_config - Config default value correct: params.alphafold2_pdb_mmcif_link= rsync.rcsb.org::ftp_data/structures/divided/mmCIF/
nextflow_config - Config default value correct: params.alphafold2_pdb_obsolete_link= https://files.wwpdb.org/pub/pdb/data/status/obsolete.dat
nextflow_config - Config default value correct: params.alphafold2_uniref30_link= https://wwwuser.gwdguser.de/~compbiol/uniclust/2023_02/UniRef30_2023_02_hhsuite.tar.gz
nextflow_config - Config default value correct: params.alphafold2_uniref90_link= https://ftp.ebi.ac.uk/pub/databases/uniprot/uniref/uniref90/uniref90.fasta.gz
nextflow_config - Config default value correct: params.alphafold2_pdb_seqres_link= https://files.wwpdb.org/pub/pdb/derived_data/pdb_seqres.txt
nextflow_config - Config default value correct: params.alphafold2_uniprot_sprot_link= https://ftp.ebi.ac.uk/pub/databases/uniprot/current_release/knowledgebase/complete/uniprot_sprot.fasta.gz
nextflow_config - Config default value correct: params.alphafold2_uniprot_trembl_link= https://ftp.ebi.ac.uk/pub/databases/uniprot/current_release/knowledgebase/complete/uniprot_trembl.fasta.gz
nextflow_config - Config default value correct: params.alphafold2_bfd_path= null/bfd/*
nextflow_config - Config default value correct: params.alphafold2_small_bfd_path= null/small_bfd/*
nextflow_config - Config default value correct: params.alphafold2_params_path= null/params/alphafold_params_2022-12-06/*
nextflow_config - Config default value correct: params.alphafold2_mgnify_path= null/mgnify/*
nextflow_config - Config default value correct: params.alphafold2_pdb70_path= null/pdb70/**
nextflow_config - Config default value correct: params.alphafold2_pdb_mmcif_path= null/pdb_mmcif/mmcif_files
nextflow_config - Config default value correct: params.alphafold2_pdb_obsolete_path= null/pdb_mmcif/obsolete.dat
nextflow_config - Config default value correct: params.alphafold2_uniref30_path= null/uniref30/*
nextflow_config - Config default value correct: params.alphafold2_uniref90_path= null/uniref90/*
nextflow_config - Config default value correct: params.alphafold2_pdb_seqres_path= null/pdb_seqres/*
nextflow_config - Config default value correct: params.alphafold2_uniprot_path= null/uniprot/*
nextflow_config - Config default value correct: params.alphafold3_small_bfd_link= https://storage.googleapis.com/alphafold-databases/v3.0/bfd-first_non_consensus_sequences.fasta.zst
nextflow_config - Config default value correct: params.alphafold3_mgnify_link= https://storage.googleapis.com/alphafold-databases/v3.0/mgy_clusters_2022_05.fa.zst
nextflow_config - Config default value correct: params.alphafold3_pdb_mmcif_link= https://storage.googleapis.com/alphafold-databases/v3.0/pdb_2022_09_28_mmcif_files.tar.zst
nextflow_config - Config default value correct: params.alphafold3_uniref90_link= https://storage.googleapis.com/alphafold-databases/v3.0/uniref90_2022_05.fa.zst
nextflow_config - Config default value correct: params.alphafold3_pdb_seqres_link= https://storage.googleapis.com/alphafold-databases/v3.0/pdb_seqres_2022_09_28.fasta.zst
nextflow_config - Config default value correct: params.alphafold3_uniprot_link= https://storage.googleapis.com/alphafold-databases/v3.0/uniprot_all_2021_04.fa.zst
nextflow_config - Config default value correct: params.alphafold3_rnacentral_link= https://storage.googleapis.com/alphafold-databases/v3.0/rnacentral_active_seq_id_90_cov_80_linclust.fasta.zst
nextflow_config - Config default value correct: params.alphafold3_nt_rna_link= https://storage.googleapis.com/alphafold-databases/v3.0/nt_rna_2023_02_23_clust_seq_id_90_cov_80_rep_seq.fasta.zst
nextflow_config - Config default value correct: params.alphafold3_rfam_link= https://storage.googleapis.com/alphafold-databases/v3.0/rfam_14_9_clust_seq_id_90_cov_80_rep_seq.fasta.zst
nextflow_config - Config default value correct: params.alphafold3_small_bfd_path= null/small_bfd/*
nextflow_config - Config default value correct: params.alphafold3_params_path= null/params/*
nextflow_config - Config default value correct: params.alphafold3_mgnify_path= null/mgnify/*
nextflow_config - Config default value correct: params.alphafold3_pdb_mmcif_path= null/pdb_mmcif/mmcif_files
nextflow_config - Config default value correct: params.alphafold3_uniref90_path= null/uniref90/*
nextflow_config - Config default value correct: params.alphafold3_pdb_seqres_path= null/pdb_seqres/*
nextflow_config - Config default value correct: params.alphafold3_uniprot_path= null/uniprot/*
nextflow_config - Config default value correct: params.alphafold3_rnacentral_path= null/rnacentral/*
nextflow_config - Config default value correct: params.alphafold3_nt_rna_path= null/nt_rna/*
nextflow_config - Config default value correct: params.alphafold3_rfam_path= null/rfam/*
nextflow_config - Config default value correct: params.colabfold_db_link= https://opendata.mmseqs.org/colabfold/colabfold_envdb_202108.db.tar.gz
nextflow_config - Config default value correct: params.colabfold_uniref30_link= https://opendata.mmseqs.org/colabfold/uniref30_2302.db.tar.gz
nextflow_config - Config default value correct: params.colabfold_envdb_path= null/colabfold_envdb/*
nextflow_config - Config default value correct: params.colabfold_uniref30_path= null/colabfold_uniref30/*
nextflow_config - Config default value correct: params.esmfold_3B_v1= https://dl.fbaipublicfiles.com/fair-esm/models/esmfold_3B_v1.pt
nextflow_config - Config default value correct: params.esm2_t36_3B_UR50D= https://dl.fbaipublicfiles.com/fair-esm/models/esm2_t36_3B_UR50D.pt
nextflow_config - Config default value correct: params.esm2_t36_3B_UR50D_contact_regression= https://dl.fbaipublicfiles.com/fair-esm/regression/esm2_t36_3B_UR50D-contact-regression.pt
nextflow_config - Config default value correct: params.esmfold_params_path= null/params/*
nextflow_config - Config default value correct: params.boltz_ccd_link= https://huggingface.co/boltz-community/boltz-1/resolve/main/ccd.pkl
nextflow_config - Config default value correct: params.boltz_model_link= https://huggingface.co/boltz-community/boltz-1/resolve/main/boltz1_conf.ckpt
nextflow_config - Config default value correct: params.boltz2_aff_link= https://huggingface.co/boltz-community/boltz-2/resolve/main/boltz2_aff.ckpt
nextflow_config - Config default value correct: params.boltz2_conf_link= https://huggingface.co/boltz-community/boltz-2/resolve/main/boltz2_conf.ckpt
nextflow_config - Config default value correct: params.boltz2_mols_link= https://huggingface.co/boltz-community/boltz-2/resolve/main/mols.tar
nextflow_config - Config default value correct: params.boltz_ccd_path= null/params/ccd.pkl
nextflow_config - Config default value correct: params.boltz_model_path= null/params/boltz1_conf.ckpt
nextflow_config - Config default value correct: params.boltz2_aff_path= null/params/boltz2_aff.ckpt
nextflow_config - Config default value correct: params.boltz2_conf_path= null/params/boltz2_conf.ckpt
nextflow_config - Config default value correct: params.boltz2_mols_path= null/params/mols/
nextflow_config - Config default value correct: params.rosettafold2na_uniref30_link= http://wwwuser.gwdg.de/~compbiol/uniclust/2020_06/UniRef30_2020_06_hhsuite.tar.gz
nextflow_config - Config default value correct: params.rosettafold2na_bfd_link= https://bfd.mmseqs.com/bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt.tar.gz
nextflow_config - Config default value correct: params.rosettafold2na_pdb100_link= https://files.ipd.uw.edu/pub/RoseTTAFold/pdb100_2021Mar03.tar.gz
nextflow_config - Config default value correct: params.rosettafold2na_weights_link= https://files.ipd.uw.edu/dimaio/RF2NA_apr23.tgz
nextflow_config - Config default value correct: params.rfam_full_region_link= ftp://ftp.ebi.ac.uk/pub/databases/Rfam/CURRENT/Rfam.full_region.gz
nextflow_config - Config default value correct: params.rfam_cm_link= ftp://ftp.ebi.ac.uk/pub/databases/Rfam/CURRENT/Rfam.cm.gz
nextflow_config - Config default value correct: params.rnacentral_rfam_annotations_link= ftp://ftp.ebi.ac.uk/pub/databases/RNAcentral/current_release/rfam/rfam_annotations.tsv.gz
nextflow_config - Config default value correct: params.rnacentral_id_mapping_link= ftp://ftp.ebi.ac.uk/pub/databases/RNAcentral/current_release/id_mapping/id_mapping.tsv.gz
nextflow_config - Config default value correct: params.rnacentral_sequences_link= ftp://ftp.ebi.ac.uk/pub/databases/RNAcentral/current_release/sequences/rnacentral_species_specific_ids.fasta.gz
nextflow_config - Config default value correct: params.rosettafold2na_uniref30_path= null/UniRef30_2020_06/*
nextflow_config - Config default value correct: params.rosettafold2na_bfd_path= null/bfd/*
nextflow_config - Config default value correct: params.rosettafold2na_pdb100_path= null/pdb100/*
nextflow_config - Config default value correct: params.rosettafold2na_weights_path= null/params/network/weights/RF2NA_apr23.pt
nextflow_config - Config default value correct: params.rosettafold2na_rna_path= null/RNA/*
nextflow_config - Config default value correct: params.publish_dir_mode= copy
nextflow_config - Config default value correct: params.max_multiqc_email_size= 25.MB
nextflow_config - Config default value correct: params.validate_params= true
nextflow_config - Config default value correct: params.pipelines_testdata_base_path= https://raw.githubusercontent.com/nf-core/test-datasets/
nextflow_config - Config default value correct: params.rosettafold_all_atom_uniref30_link= https://wwwuser.gwdguser.de/~compbiol/uniclust/2023_02/UniRef30_2023_02_hhsuite.tar.gz
nextflow_config - Config default value correct: params.rosettafold_all_atom_bfd_link= https://bfd.mmseqs.com/bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt.tar.gz
nextflow_config - Config default value correct: params.rosettafold_all_atom_pdb100_link= https://files.ipd.uw.edu/pub/RoseTTAFold/pdb100_2021Mar03.tar.gz
nextflow_config - Config default value correct: params.rosettafold_all_atom_paper_weights_link= http://files.ipd.uw.edu/pub/RF-All-Atom/weights/RFAA_paper_weights.pt
nextflow_config - Config default value correct: params.rosettafold_all_atom_uniref30_path= null/uniref30/*
nextflow_config - Config default value correct: params.rosettafold_all_atom_bfd_path= null/bfd/*
nextflow_config - Config default value correct: params.rosettafold_all_atom_pdb100_path= null/pdb100/*
nextflow_config - Config default value correct: params.rosettafold_all_atom_paper_weights_path= null/params/RFAA_paper_weights.pt
nextflow_config - Config default value correct: params.helixfold3_init_models_path= null/params/HelixFold3-240814.pdparams
nextflow_config - Config default value correct: params.helixfold3_uniclust30_path= null/uniref30/*
nextflow_config - Config default value correct: params.helixfold3_ccd_preprocessed_path= null/params/ccd_preprocessed_etkdg.pkl.gz
nextflow_config - Config default value correct: params.helixfold3_rfam_path= null/rfam/Rfam-14.9_rep_seq.fasta
nextflow_config - Config default value correct: params.helixfold3_bfd_path= null/bfd/*
nextflow_config - Config default value correct: params.helixfold3_small_bfd_path= null/small_bfd/*
nextflow_config - Config default value correct: params.helixfold3_uniprot_path= null/uniprot/*
nextflow_config - Config default value correct: params.helixfold3_pdb_seqres_path= null/pdb_seqres/*
nextflow_config - Config default value correct: params.helixfold3_uniref90_path= null/uniref90/*
nextflow_config - Config default value correct: params.helixfold3_mgnify_path= null/mgnify/*
nextflow_config - Config default value correct: params.helixfold3_pdb_mmcif_path= null/pdb_mmcif/mmcif_files
nextflow_config - Config default value correct: params.helixfold3_maxit_src_path= null/maxit-v11.200-prod-src
nextflow_config - Config default value correct: params.helixfold3_obsolete_path= null/pdb_mmcif/obsolete.dat
nextflow_config - Config default value correct: params.helixfold3_init_models_link= https://paddlehelix.bd.bcebos.com/HelixFold3/params/HelixFold3-params-240814.zip
nextflow_config - Config default value correct: params.helixfold3_uniclust30_link= https://wwwuser.gwdguser.de/~compbiol/uniclust/2023_02/UniRef30_2023_02_hhsuite.tar.gz
nextflow_config - Config default value correct: params.helixfold3_ccd_preprocessed_link= https://paddlehelix.bd.bcebos.com/HelixFold3/CCD/ccd_preprocessed_etkdg.pkl.gz
nextflow_config - Config default value correct: params.helixfold3_rfam_link= https://paddlehelix.bd.bcebos.com/HelixFold3/MSA/Rfam-14.9_rep_seq.fasta
nextflow_config - Config default value correct: params.helixfold3_bfd_link= https://storage.googleapis.com/alphafold-databases/casp14_versions/bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt.tar.gz
nextflow_config - Config default value correct: params.helixfold3_small_bfd_link= https://storage.googleapis.com/alphafold-databases/reduced_dbs/bfd-first_non_consensus_sequences.fasta.gz
nextflow_config - Config default value correct: params.helixfold3_pdb_seqres_link= https://files.wwpdb.org/pub/pdb/derived_data/pdb_seqres.txt
nextflow_config - Config default value correct: params.helixfold3_uniref90_link= ftp://ftp.uniprot.org/pub/databases/uniprot/uniref/uniref90/uniref90.fasta.gz
nextflow_config - Config default value correct: params.helixfold3_mgnify_link= https://ftp.ebi.ac.uk/pub/databases/metagenomics/peptide_database/2024_04/mgy_clusters.fa.gz
nextflow_config - Config default value correct: params.helixfold3_pdb_mmcif_link= rsync.rcsb.org::ftp_data/structures/divided/mmCIF/
nextflow_config - Config default value correct: params.helixfold3_uniprot_sprot_link= ftp://ftp.ebi.ac.uk/pub/databases/uniprot/current_release/knowledgebase/complete/uniprot_sprot.fasta.gz
nextflow_config - Config default value correct: params.helixfold3_uniprot_trembl_link= ftp://ftp.ebi.ac.uk/pub/databases/uniprot/current_release/knowledgebase/complete/uniprot_trembl.fasta.gz
nextflow_config - Config default value correct: params.helixfold3_obsolete_link= https://files.rcsb.org/pub/pdb/data/status/obsolete.dat
nextflow_config - Config default value correct: params.helixfold3_maxit_src_link= https://proteinfold-dataset.s3.amazonaws.com/test-data/db/helixfold3/maxit-v11.200-prod-src.tar.gz
nf_test_content - 'tests/alphafold2_download.nf.test' contains outdir parameter
nf_test_content - 'tests/alphafold2_download.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/alphafold2_download.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/colabfold_local.nf.test' contains outdir parameter
nf_test_content - 'tests/colabfold_local.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/colabfold_local.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/colabfold_download.nf.test' contains outdir parameter
nf_test_content - 'tests/colabfold_download.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/colabfold_download.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/default.nf.test' contains outdir parameter
nf_test_content - 'tests/default.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/default.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/alphafold3.nf.test' contains outdir parameter
nf_test_content - 'tests/alphafold3.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/alphafold3.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/alphafold2_split.nf.test' contains outdir parameter
nf_test_content - 'tests/alphafold2_split.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/alphafold2_split.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/colabfold_webserver.nf.test' contains outdir parameter
nf_test_content - 'tests/colabfold_webserver.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/colabfold_webserver.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/esmfold.nf.test' contains outdir parameter
nf_test_content - 'tests/esmfold.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/esmfold.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/split_fasta.nf.test' contains outdir parameter
nf_test_content - 'tests/split_fasta.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/split_fasta.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/nextflow.config' contains modules_testdata_base_path
nf_test_content - 'tests/nextflow.config' contains pipelines_testdata_base_path
nf_test_content - 'nf-test.config' sets a testsDir
nf_test_content - 'nf-test.config' sets a workDir
nf_test_content - 'nf-test.config' sets a configFile
files_unchanged - .gitattributes matches the template
files_unchanged - .prettierrc.yml matches the template
files_unchanged - CODE_OF_CONDUCT.md matches the template
files_unchanged - LICENSE matches the template
files_unchanged - .github/.dockstore.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/bug_report.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/config.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/feature_request.yml matches the template
files_unchanged - .github/PULL_REQUEST_TEMPLATE.md matches the template
files_unchanged - .github/workflows/branch.yml matches the template
files_unchanged - .github/workflows/linting_comment.yml matches the template
files_unchanged - assets/email_template.html matches the template
files_unchanged - assets/email_template.txt matches the template
files_unchanged - assets/sendmail_template.txt matches the template
files_unchanged - assets/nf-core-proteinfold_logo_light.png matches the template
files_unchanged - docs/images/nf-core-proteinfold_logo_light.png matches the template
files_unchanged - docs/images/nf-core-proteinfold_logo_dark.png matches the template
files_unchanged - docs/README.md matches the template
files_unchanged - .gitignore matches the template
files_unchanged - .prettierignore matches the template
actions_nf_test - '.github/workflows/nf-test.yml' is triggered on expected events
actions_nf_test - '.github/workflows/nf-test.yml' checks minimum NF version
actions_awstest - '.github/workflows/awstest.yml' is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml does not use -profile test
readme - README Nextflow minimum version badge matched config. Badge: 25.10.2, Config: 25.10.2
readme - README nf-core template version badge found.
readme - README Zenodo placeholder was replaced with DOI.
pipeline_if_empty_null - No ifEmpty(null) strings found
plugin_includes - No wrong validation plugin imports have been found
pipeline_name_conventions - Name adheres to nf-core convention
template_strings - Did not find any Jinja template strings (0 files)
schema_lint - Schema lint passed
schema_lint - Schema title + description lint passed
schema_lint - Input mimetype lint passed: 'text/csv'
schema_params - Schema matched params returned from nextflow config
system_exit - No System.exit calls found
merge_markers - No merge markers found in pipeline files
modules_json - Only installed modules found in modules.json
modules_structure - modules directory structure is correct 'modules/nf-core/TOOL/SUBTOOL'
local_component_structure - local modules directory structure is correct 'modules/local/TOOL/SUBTOOL'
base_config - conf/base.config found and not ignored.
modules_config - conf/modules.config found and not ignored.
modules_config - UNTAR found in conf/modules.config and Nextflow scripts.
modules_config - ARIA2 found in conf/modules.config and Nextflow scripts.
modules_config - MULTIQC found in conf/modules.config and Nextflow scripts.
modules_config - FOLDSEEK_EASYSEARCH found in conf/modules.config and Nextflow scripts.
nfcore_yml - Repository type in .nf-core.yml is valid: pipeline
nfcore_yml - nf-core version in .nf-core.yml is set to the latest version: 3.5.1
rocrate_readme_sync - RO-Crate description matches the README.md.

Run details

nf-core/tools version 3.5.1
Run at 2026-01-16 04:56:21

keiran-rowell-unsw · 2026-01-16T04:48:56Z

@JoseEspinosa happy to edit however or knock back, just would have found this helpful getting up to speed on nf-core/proteinfold structure

JoseEspinosa · 2026-01-18T18:18:26Z

@JoseEspinosa happy to edit however or knock back, just would have found this helpful getting up to speed on nf-core/proteinfold structure

Will take a look tomorrow. Thanks!

JoseEspinosa

Just added some suggestions and corrected few "tyops" 😛
Feel free to reject the ones you think are not ok.
This is really awesome @keiran-rowell-unsw ! 🚀

JoseEspinosa · 2026-01-19T17:11:36Z

HOWTO_CONTRIBUTE_NEW_MODES.md

@@ -0,0 +1,128 @@
+## Guidance on how to add a new --mode (i.e. structure prediction software) to ProteinFold


Suggested change

## Guidance on how to add a new --mode (i.e. structure prediction software) to ProteinFold

## Adding structure prediction modes to nf-core/proteinfold

This section provides guidance on adding new structure prediction modes, implemented via the `--mode` option, to nf-core/proteinfold.

JoseEspinosa · 2026-01-19T17:17:07Z

HOWTO_CONTRIBUTE_NEW_MODES.md

+
+### Contributing
+
+One of the great advantages of an `nf-core` pipeline is the community can add new protein structure prediction modules as they are released, while still leveraging the workflow infrastructure and reports developed for `proteinfold`.


Suggested change

One of the great advantages of an `nf-core` pipeline is the community can add new protein structure prediction modules as they are released, while still leveraging the workflow infrastructure and reports developed for `proteinfold`.

One of the great advantages of an `nf-core` pipeline is that the community can extend workflows to add new functionalities. In nf-core/proteinfold, this allows adding new protein structure prediction modules as they are released, while still leveraging the existing workflow infrastructure and reporting.

JoseEspinosa · 2026-01-19T17:18:38Z

HOWTO_CONTRIBUTE_NEW_MODES.md

+
+One of the great advantages of an `nf-core` pipeline is the community can add new protein structure prediction modules as they are released, while still leveraging the workflow infrastructure and reports developed for `proteinfold`.
+
+Please consider writing some code to become a [nf-core contributor](https://nf-co.re/contributors) and expand the pipeline! Reach out to a maintainer of contributor for guidance :)


Maybe also mention the #proteinfold_dev slack channel? I think it would be easy to people to just write a message on slack and we are all there

JoseEspinosa · 2026-01-19T17:27:22Z

HOWTO_CONTRIBUTE_NEW_MODES.md

+  - `"""script block"""`:
+    - `program`: the script block calls the program from the Nextflow shell with the programs typical `--flags`, in whatever form (`binary` or `script.py`) the program is distributed from its codebase repository.
+    - `extract_metrics.py`: accesses the canonical data output formats from the structure prediction program and returns a core set of plain text `.tsv` metric files.
+- `bin/extract_metrics.py`: a globally accessible program to go from serialised data -> `.tsv` plaintext. Currently runs particular extraction logic functions based upon file format (`.pkl`, `.json`, `.npz`). However, as the commnity adds more `--mode`s to the pipeline, different programs could use the same compressed output format. In which case `extract_metrics.py` should be refactored to match based on the passing the `--mode` to `extract_metrics.py`.


Suggested change

- `bin/extract_metrics.py`: a globally accessible program to go from serialised data -> `.tsv` plaintext. Currently runs particular extraction logic functions based upon file format (`.pkl`, `.json`, `.npz`). However, as the commnity adds more `--mode`s to the pipeline, different programs could use the same compressed output format. In which case `extract_metrics.py` should be refactored to match based on the passing the `--mode` to `extract_metrics.py`.

- `bin/extract_metrics.py`: a globally accessible program to go from serialised data into `.tsv` plaintext. It currently applies format specific extraction logic for `.pkl`, `.json` and `.npz` files. However, as the community adds more `--mode`s to the pipeline, different programs could use the same compressed output format. In which case `extract_metrics.py` should be refactored to match based on the passing the `--mode` to `extract_metrics.py`.

JoseEspinosa · 2026-01-19T18:25:00Z

HOWTO_CONTRIBUTE_NEW_MODES.md

+
+### Process labelling
+
+At the top of a module's `RUN_[MODE_NAME]`{} process there are a series of labels that allow the `nextflow.config` to pass the job to the approriate resources on the compute cluster. `label 'process_gpu'` is very useful to specify this is the AI inference stage requiring GP-GPU grunt -- whereas other processes can have default labels that request CPU resources and, once finished, will naturally cascade onto GPUs due to Nextflow's dataflow paradigm.


Suggested change

At the top of a module's `RUN_[MODE_NAME]`{} process there are a series of labels that allow the `nextflow.config` to pass the job to the approriate resources on the compute cluster. `label 'process_gpu'` is very useful to specify this is the AI inference stage requiring GP-GPU grunt -- whereas other processes can have default labels that request CPU resources and, once finished, will naturally cascade onto GPUs due to Nextflow's dataflow paradigm.

At the top of a module's `RUN_[MODE_NAME]`{} process, there are a series of labels that allow the `nextflow.config` to pass the job to the appropriate resources on the compute cluster. `label 'process_gpu'` is very useful to specify the AI inference stages requiring GPU-intensive computation. Other processes can use default labels that request CPU resources and, once finished, will naturally cascade onto GPU-enabled steps due to Nextflow's dataflow paradigm.

JoseEspinosa · 2026-01-19T18:28:49Z

HOWTO_CONTRIBUTE_NEW_MODES.md

+
+### Processable structure prediction metrics
+
+Metrics from AlphaFold-inspired protein strucutre prediction programs are structured in two ways: tabular or as a matrix (PAE values)


Suggested change

Metrics from AlphaFold-inspired protein strucutre prediction programs are structured in two ways: tabular or as a matrix (PAE values)

Metrics from AlphaFold-inspired protein structure prediction programs are structured in two ways: tabular or as a matrix (PAE values)

JoseEspinosa · 2026-01-19T18:30:23Z

HOWTO_CONTRIBUTE_NEW_MODES.md

+
+When contributing a new mode to `proteinfold`, functionality should be added to `extract_metrics.py` to access the canonical ouput files of the new program, and extract data into compliant `.tsv` files that can be easily processed by downstream plotting and MultiQC functions.
+
+Metrics files are **0 indexed**.


Suggested change

Metrics files are **0 indexed**.

> [!WARNING]

> Metrics files are **0 indexed**.

JoseEspinosa · 2026-01-19T18:33:50Z

HOWTO_CONTRIBUTE_NEW_MODES.md

+
+#### pLDDT (`{meta.id}_plddt.tsv`)
+
+Confidence values per residue, rounded to 2 decimal places. Each ranked result gets its own column. [For all-atom modules, atomic token confidences are processed to a naive mean value across the residue]


Suggested change

Confidence values per residue, rounded to 2 decimal places. Each ranked result gets its own column. [For all-atom modules, atomic token confidences are processed to a naive mean value across the residue]

Confidence values per residue, rounded to 2 decimal places. Each ranked result gets its own column (for all-atom modules, atomic token confidences are processed to a naive mean value across the residue).

JoseEspinosa · 2026-01-19T18:34:21Z

HOWTO_CONTRIBUTE_NEW_MODES.md

+
+#### (i)pTM (`{meta.id}_[i]ptm.tsv`)
+
+(i)pTM scores, rounded to 3 decimal places, listed by the rank number. [Currently unsorted]


Suggested change

(i)pTM scores, rounded to 3 decimal places, listed by the rank number. [Currently unsorted]

(i)pTM scores, rounded to 3 decimal places, listed by the rank number (currently unsorted).

keiran-rowell-unsw added 4 commits January 12, 2026 16:31

Deploy multiqc-proteinfold as Module in dev

10f6d21

Add local Multiqc python module

ac6a315

contrib mode file created

03a1a5e

How to contribute new modes precis with editing locations and metric …

37cb7d9

…examples

keiran-rowell-unsw added the documentation Improvements or additions to documentation label Jan 16, 2026

Remove accidental addition of my bulk proteinfold multqic module code

3c8dc6a

keiran-rowell-unsw marked this pull request as ready for review January 16, 2026 04:39

Make prettier happy

60a22c5

keiran-rowell-unsw added 2 commits January 16, 2026 15:50

module output: indentation

cd8cf4b

Unindent script block

371a4be

JoseEspinosa self-requested a review January 18, 2026 18:18

JoseEspinosa requested changes Jan 19, 2026

View reviewed changes

		@@ -0,0 +1,128 @@
		## Guidance on how to add a new --mode (i.e. structure prediction software) to ProteinFold

-## Guidance on how to add a new --mode (i.e. structure prediction software) to ProteinFold
+## Adding structure prediction modes to nf-core/proteinfold
+This section provides guidance on adding new structure prediction modes, implemented via the `--mode` option, to nf-core/proteinfold.


		### Contributing

		One of the great advantages of an `nf-core` pipeline is the community can add new protein structure prediction modules as they are released, while still leveraging the workflow infrastructure and reports developed for `proteinfold`.


		One of the great advantages of an `nf-core` pipeline is the community can add new protein structure prediction modules as they are released, while still leveraging the workflow infrastructure and reports developed for `proteinfold`.

		Please consider writing some code to become a [nf-core contributor](https://nf-co.re/contributors) and expand the pipeline! Reach out to a maintainer of contributor for guidance :)


		### Process labelling

		At the top of a module's `RUN_[MODE_NAME]`{} process there are a series of labels that allow the `nextflow.config` to pass the job to the approriate resources on the compute cluster. `label 'process_gpu'` is very useful to specify this is the AI inference stage requiring GP-GPU grunt -- whereas other processes can have default labels that request CPU resources and, once finished, will naturally cascade onto GPUs due to Nextflow's dataflow paradigm.


		### Processable structure prediction metrics

		Metrics from AlphaFold-inspired protein strucutre prediction programs are structured in two ways: tabular or as a matrix (PAE values)


		When contributing a new mode to `proteinfold`, functionality should be added to `extract_metrics.py` to access the canonical ouput files of the new program, and extract data into compliant `.tsv` files that can be easily processed by downstream plotting and MultiQC functions.

		Metrics files are 0 indexed.

	Metrics files are 0 indexed.
	> [!WARNING]
	> Metrics files are 0 indexed.


		#### pLDDT (`{meta.id}_plddt.tsv`)

		Confidence values per residue, rounded to 2 decimal places. Each ranked result gets its own column. [For all-atom modules, atomic token confidences are processed to a naive mean value across the residue]


		#### (i)pTM (`{meta.id}_[i]ptm.tsv`)

		(i)pTM scores, rounded to 3 decimal places, listed by the rank number. [Currently unsorted]

Doc - How to contribute new modes #443

Are you sure you want to change the base?

Doc - How to contribute new modes #443

Conversation

keiran-rowell-unsw commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

nf-core pipelines lint overall result: Passed ✅ ⚠️

❗ Test warnings:

❔ Tests ignored:

❔ Tests fixed:

✅ Tests passed:

Run details

Uh oh!

keiran-rowell-unsw commented Jan 16, 2026

Uh oh!

JoseEspinosa commented Jan 18, 2026

Uh oh!

JoseEspinosa left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

keiran-rowell-unsw commented Jan 16, 2026 •

edited

Loading

github-actions bot commented Jan 16, 2026 •

edited

Loading

`nf-core pipelines lint` overall result: Passed ✅ ⚠️

JoseEspinosa left a comment •

edited

Loading