feat: Add tirmite search command for ensemble search with hit merging#37
Draft
feat: Add tirmite search command for ensemble search with hit merging#37
tirmite search command for ensemble search with hit merging#37Conversation
Co-authored-by: Adamtaranto <2160099+Adamtaranto@users.noreply.github.com>
Co-authored-by: Adamtaranto <2160099+Adamtaranto@users.noreply.github.com>
Copilot
AI
changed the title
[WIP] Add ensemble search feature for HMM construction
feat: Add Jan 21, 2026
tirmite search command for ensemble search with hit merging
Owner
|
@copilot # TODO: Add info logging message with full commands for external cmd line tools before calling: i.e makeblastdb, blastn, nhmmer, any other external tools. TODO: Add separate independent '--max-evalue' for use with running nhmmer and blastn, or for filtering pre-calculated results from nhmmer and blastn. i.e Add '--blast-max-evalue' and '--hmm-max-evalue'TODO:
|
- Add separate --blast-max-evalue and --hmm-max-evalue options - Fix BLAST output format parsing by using standard format 6 - Add info logging with full command strings for BLAST and nhmmer - Update output format to BLAST tabular format 6 for tirmite pair compatibility Co-authored-by: Adamtaranto <2160099+Adamtaranto@users.noreply.github.com>
Contributor
Author
Addressed all items in commit 796f78f:
|
Owner
|
@copilot - [ ] Add '--keep-temp' option so that results in tmp dir are not cleaned up at end of run.
|
- Add --keep-temp option to retain temp files after run - Add --logfile option for writing logs to file - Add --blast-db option for pre-built BLAST databases - Add --lengths-file option for query lengths with precomputed results - Add --word-size option for BLAST word size (default: 4) - Add --genome-list option for multiple genome inputs - Support gzipped genome files via prepare_genome_file - Auto-extract query lengths from FASTA and HMM files Co-authored-by: Adamtaranto <2160099+Adamtaranto@users.noreply.github.com>
Contributor
Author
Implemented all requested features in commit 96076d3:
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
tirmite searchCLI entrypoint for ensemble search functionalitysrc/tirmite/cli/ensemble_search.pywith:cli.pyReviewer feedback addressed:
--blast-max-evalueand--hmm-max-evalueoptions--keep-tempoption to retain temp directory after run--logfileoption for logging to file--blast-dboption--lengths-fileoption for query lengths when using precomputed results--word-sizeoption for BLAST word size (default: 4)--genome-listoption for multiple genome inputsSecurity Summary
No security vulnerabilities were detected by CodeQL analysis.
Original prompt
💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.