Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add Casanovo-DB Functionality (#325)
* begin adding tests for annotate mode * add basic test for annotate mode * added test case for annotate mode and modified method * very rough sketch of db upgrade (untested) * small upgrades to documentation * better output formatting * all tests added * remove minor debugging print statement * Generate new screengrabs with rich-codex * remove excess info logs, add monkeypatch to tests * mp fix * fix line lengths and modify test * Generate new screengrabs with rich-codex * justins requested fixes * added minor changes as requested by Wout * partial fixes requested by wout. Lots of subclassing removed * documentation fixes and starting to cleanup batching code * cleaned up on_predict_batch_end, TODOs for calc_mz * add proper calc_mz calculation with depthcharge * rough implementation * tested implementation of db search * fix for issue with 0 candidates * minor fixes added * reordered and renamed variables for consistency * casanovo-db full working version with code simplification * Generate new screengrabs with rich-codex * fix batching issues * small fixes regarding documentation, import syntax, etc. * add proteindatabase * Generate new screengrabs with rich-codex * finish proteindatabase * all comments addressed * new comments addressed * final adjustments added * minor changes regarding formatting and small efficiency boosts * changes before reformatting config * replace all occurences of "max_length" with "max_peptide_len" * added nonspecific digestion * minor comments * full branch comments addressed * Generate new screengrabs with rich-codex * updated and fixed failed tests * add mztab validation to dbsearch test * lint fix * fix integration test * fix unit tests * force fix test * clean up test_digest_fasta_enzyme * adjust test_digest_fasta_mods * allows top_match filtering for casanovo-db * change default value for protein value in PepSpecMatch * reverse issues with decoder * update test and remove logging statement * db_utils fixes * updates to dataloaders, model_runner, and model.py * near final changes for all but db_utils * line length fixes * Minor refactoring and type hint fixes * Use mask for more efficient candidate filtering * Reorder methods in logical order * Fix unit tests * Directly generate DB peptides as DataFrame * Fix type hints and line lengths * Generate new screengrabs with rich-codex * Refactor batching to avoid code repetition * More minor refactoring * Reformat with black * Minor fix * Fix output name crash * Fix AA score masking * Fix PSM export * Less verbose logging of skipped peptides * Appropriate end-of-run reporting * Fix PSM export from de novo * Generalize end-of-run reporting * Log additional information on spectra with no matching candidates * Fix linting issue * Fix some testing warnings * Log digestion settings * Reduce logging level for spectra without candidates * Round peptide masses for consistent sorting * Fox linting * Remove superfluous PSM export * Update changelog --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Wout Bittremieux <[email protected]>
- Loading branch information