Eliminate evaluate Command #359

Lilferrit · 2024-07-31T18:26:31Z

Eliminated the evaluate command in favor of a --evaluate command line option for the sequence command. Evaluation metrics will still be logged to the console as before if the --evaluate options is set. The model (Spec2Pep) will also log predictions to its out_writer in validation mode similar as it does in prediction mode.

bittremieux

This works, but I find the changes in model.py a bit messy and unsatisfying.

If the goal of the evaluation is to get the peptide and amino acid metrics, can this not be simplified by:

First just do standard predictions.
After the whole inference has been finished, you then have all of the peptides for each spectrum. Then (at a higher level than inside of the model) you can read the peptide sequences from the annotated MGF separately, and compare these to each other.
This removes all of the validation related complexity from the model and should simplify the flow of the data considerably.
The evaluation part is also more maintainable, and it would for example be much easier to add another data source for evaluation (e.g. an mzTab or CSV file with "ground truth" rather than an annotated MGF).

Note that it's slightly different from the current validation approach, because that also gives the loss, which you wouldn't have in this approach. However, it seems to me that the loss is not that informative anyway, and not something a user would expect to get when specifying evaluate during prediction.

The best approach to tackle this should probably be discussed.

casanovo/casanovo.py

casanovo/denovo/dataloaders.py

casanovo/denovo/model.py

casanovo/denovo/model_runner.py

Lilferrit · 2024-08-06T00:11:58Z

This works, but I find the changes in model.py a bit messy and unsatisfying.

If the goal of the evaluation is to get the peptide and amino acid metrics, can this not be simplified by:

First just do standard predictions.

After the whole inference has been finished, you then have all of the peptides for each spectrum. Then (at a higher level than inside of the model) you can read the peptide sequences from the annotated MGF separately, and compare these to each other.

This removes all of the validation related complexity from the model and should simplify the flow of the data considerably.

The evaluation part is also more maintainable, and it would for example be much easier to add another data source for evaluation (e.g. an mzTab or CSV file with "ground truth" rather than an annotated MGF).

Note that it's slightly different from the current validation approach, because that also gives the loss, which you wouldn't have in this approach. However, it seems to me that the loss is not that informative anyway, and not something a user would expect to get when specifying evaluate during prediction.

The best approach to tackle this should probably be discussed.

I agree that this approach makes more sense. Tentatively what I'm thinking is that ModelRunner should handle the validation metrics calculation using the model output and the spectra annotations, using the functionality in evaluate.py. This would probably leave the actual model itself unchanged.

* bug report template * punctuation, hardware description item * Restrict NumPy to pre-2.0 (#344) * Restrict NumPy to pre-2.0 * Update changelog * Update paper reference (#361) --------- Co-authored-by: Lilferrit <[email protected]>

Lilferrit · 2024-08-06T21:16:21Z

I reimplemented evaluate mode by having ModelRunner handle the evaluation instead of using the validation functionality of the Spec2Pep model itself. My only concern at this point is how to go about testing the evaluation metric calculations. The simplest solution I can think of is to have log_metrics also return the aa_precision and pep_precision in addition to the logging operation. From there I could introduce some unit tests.

casanovo/denovo/model_runner.py

codecov · 2024-08-06T21:44:20Z

Codecov Report

Attention: Patch coverage is 97.22222% with 1 line in your changes missing coverage. Please review.

Project coverage is 94.26%. Comparing base (a46f995) to head (2d882b7).
Report is 2 commits behind head on dev.

Files	Patch %	Lines
casanovo/denovo/model_runner.py	96.15%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##              dev     #359      +/-   ##
==========================================
+ Coverage   94.03%   94.26%   +0.23%     
==========================================
  Files          12       12              
  Lines        1022     1029       +7     
==========================================
+ Hits          961      970       +9     
+ Misses         61       59       -2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

bittremieux

My only concern at this point is how to go about testing the evaluation metric calculations. The simplest solution I can think of is to have log_metrics also return the aa_precision and pep_precision in addition to the logging operation. From there I could introduce some unit tests.

I don't think that we need to check the actual metric values. We have dedicated unit tests that already do that.

What should be added is some tests that verify correct behavior without/with evaluation specified based on different types of input files (annotated vs simple MGF, mzML).

casanovo/casanovo.py

casanovo/denovo/model_runner.py

Lilferrit · 2024-08-07T21:29:18Z

What should be added is some tests that verify correct behavior without/with evaluation specified based on different types of input files (annotated vs simple MGF, mzML).

Sounds good, I'll look into getting some tests implemented for these cases.

Lilferrit · 2024-08-07T23:17:58Z

I did some experimenting while trying to set up the test cases, and it looks like in the current implementation of ModelRunner._get_index when the argument annotated is set to true unannotated peak files (for the most part) are silently ignored. From a quick glance this ultimately stems from the behavior of _get_peak_filenames. However, in the case of unannotated mgf files it looks like there is an uncaught exception from the Depth Charge library that gets kicked up to the ModelRunner when attempting to construct an AnnotatedSpectrumIndex. I've added the error itself to the end of this comment.

Imo silently ignoring unannotated files is not desirable behavior in the case of running model evaluation post sequencing (this also means that the unannotated files would simply not get sequenced). The best way that comes to mind of getting around this issue is to check that all of the peak files are annotated before sequencing begins, and throwing an exception if any of them aren't. However I'm not sure if there's a quick and easy way to do this.

I put the test cases in progress on the branch elim-eval-tests if anyone wants to take a look.

casanovo\denovo\model_runner.py:183: in predict
    test_index = self._get_index(peak_path, evaluate, "")
casanovo\denovo\model_runner.py:420: in _get_index
    return Index(index_fname, filenames, valid_charge=valid_charge)
D:\anaconda3\envs\casanovo_env\lib\site-packages\depthcharge\data\hdf5.py:446: in __init__
    super().__init__(
D:\anaconda3\envs\casanovo_env\lib\site-packages\depthcharge\data\hdf5.py:104: in __init__
    self.add_file(ms_file)
D:\anaconda3\envs\casanovo_env\lib\site-packages\depthcharge\data\hdf5.py:231: in add_file
    group.create_dataset(
D:\anaconda3\envs\casanovo_env\lib\site-packages\h5py\_hl\group.py:183: in create_dataset
    dsid = dataset.make_new_dset(group, shape, dtype, data, name, **kwds)
D:\anaconda3\envs\casanovo_env\lib\site-packages\h5py\_hl\dataset.py:166: in make_new_dset
    dset_id.write(h5s.ALL, h5s.ALL, data)
h5py\\_objects.pyx:54: in h5py._objects.with_phil.wrapper
    ???
h5py\\_objects.pyx:55: in h5py._objects.with_phil.wrapper
    ???
h5py\\h5d.pyx:282: in h5py.h5d.DatasetID.write
    ???
h5py\\_proxy.pyx:147: in h5py._proxy.dset_rw
    ???
h5py\\_conv.pyx:442: in h5py._conv.str2vlen
    ???
h5py\\_conv.pyx:96: in h5py._conv.generic_converter
    ???
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

>   ???
E   TypeError: Can't implicitly convert non-string objects to strings

h5py\\_conv.pyx:247: TypeError

bittremieux · 2024-08-08T07:47:35Z

Imo silently ignoring unannotated files is not desirable behavior in the case of running model evaluation post sequencing (this also means that the unannotated files would simply not get sequenced).

I agree.

The best way that comes to mind of getting around this issue is to check that all of the peak files are annotated before sequencing begins, and throwing an exception if any of them aren't. However I'm not sure if there's a quick and easy way to do this.

That's tricky, because there's indeed no elegant way to do this, so I'm not really in favor of trying to hack this in.

Instead, giving better error messages is a good starting point. Then at least users will know what the problem is and how to fix it if they want to run evaluation.

* save best model * save best model * updated unit tests * remove save top k config item * added save_top_k to deprecated config options * changelog entry * test case, formatting * requested changes

* bug report template * punctuation, hardware description item * Restrict NumPy to pre-2.0 (#344) * Restrict NumPy to pre-2.0 * Update changelog * Update paper reference (#361) --------- Co-authored-by: Lilferrit <[email protected]>

Lilferrit · 2024-08-14T21:39:24Z

I added some light error handling such that if the AnnotatedSpectrumIndex throws a TypeError an additional error message will be thrown pointing the user in the direction of there being an unannotated MGF file in the validation peak path list. I think this provides a nice balance between diagnosing the reason for the error without relying too heavily on the underlying implementation of Depth Charge.

bittremieux

A few more final tweaks.

casanovo/denovo/model_runner.py

tests/test_integration.py

CHANGELOG.md

Lilferrit and others added 6 commits July 30, 2024 15:03

prediction output in model eval mode

4e24904

eliminate eval command, introduce -e flag for predict command

82018cf

adapted unit test to new model runner and model functionality

c59edce

updated documentation

b9f843a

removed log and result files

6716db6

Generate new screengrabs with rich-codex

dccb729

Lilferrit requested a review from bittremieux July 31, 2024 18:50

bittremieux requested changes Aug 2, 2024

View reviewed changes

bittremieux linked an issue Aug 2, 2024 that may be closed by this pull request

Eliminate the eval command #355

Closed

Update paper reference (#361)

5879b0a

bittremieux and others added 4 commits August 6, 2024 08:47

Bug report template (#360)

f792527

* bug report template * punctuation, hardware description item * Restrict NumPy to pre-2.0 (#344) * Restrict NumPy to pre-2.0 * Update changelog * Update paper reference (#361) --------- Co-authored-by: Lilferrit <[email protected]>

upgrade codecove to v4 (#364)

bf1e5a3

implemen eval mode at model runner level, fix unit test

4c14b94

merge dev

8d3ceba

Lilferrit commented Aug 6, 2024

View reviewed changes

casanovo/denovo/model_runner.py Show resolved Hide resolved

Lilferrit and others added 4 commits August 6, 2024 14:22

CLI documentation

0862a7c

Generate new screengrabs with rich-codex

8461608

Merge branch 'main' into elim-eval

059119e

Merge branch 'elim-eval' of github.com:Noble-Lab/casanovo into elim-eval

87ad500

bittremieux requested changes Aug 7, 2024

View reviewed changes

casanovo/casanovo.py Outdated Show resolved Hide resolved

casanovo/denovo/model_runner.py Outdated Show resolved Hide resolved

casanovo/denovo/model_runner.py Show resolved Hide resolved

casanovo/denovo/model_runner.py Outdated Show resolved Hide resolved

Lilferrit added 2 commits August 7, 2024 14:07

merge conflict

a2b50c1

requested changes

148d32a

github-actions bot and others added 2 commits August 7, 2024 21:29

Generate new screengrabs with rich-codex

1981f13

evaluation test cases

6ffa3f8

Lilferrit and others added 22 commits August 9, 2024 13:30

verify annotated mgf files

7b9557b

verify annotated mgf files

5dd591f

Merge branch 'elim-eval' of github.com:Noble-Lab/casanovo into elim-eval

9c90aee

Merge branch 'elim-eval' of github.com:Noble-Lab/casanovo into elim-eval

c188df3

Generate new screengrabs with rich-codex

4b3d1a4

Merge branch 'elim-eval' of github.com:Noble-Lab/casanovo into elim-eval

34fb4d1

Save best model (#365)

ba58668

* save best model * save best model * updated unit tests * remove save top k config item * added save_top_k to deprecated config options * changelog entry * test case, formatting * requested changes

prediction output in model eval mode

bd8ceba

eliminate eval command, introduce -e flag for predict command

d4326b1

adapted unit test to new model runner and model functionality

b43121e

updated documentation

d9b6f48

removed log and result files

f441034

implemen eval mode at model runner level, fix unit test

cd9bfe5

CLI documentation

5cb3e21

Bug report template (#360)

0bb617d

* bug report template * punctuation, hardware description item * Restrict NumPy to pre-2.0 (#344) * Restrict NumPy to pre-2.0 * Update changelog * Update paper reference (#361) --------- Co-authored-by: Lilferrit <[email protected]>

requested changes

7be64ed

evaluation test cases

b5862b5

file warnings, evaluation tests

d20494c

fixed ubuntu specific test case bug

9647321

verify annotated mgf files

c4cd147

removed mgf annotation verification

31cc133

AnnotatedSpectrumIndex type error

695c739

Lilferrit requested a review from bittremieux August 15, 2024 04:27

bittremieux requested changes Aug 20, 2024

View reviewed changes

casanovo/denovo/model_runner.py Outdated Show resolved Hide resolved

tests/test_integration.py Outdated Show resolved Hide resolved

tests/test_integration.py Outdated Show resolved Hide resolved

CHANGELOG.md Outdated Show resolved Hide resolved

requested changes, changelog entry

2d882b7

Lilferrit requested a review from bittremieux August 21, 2024 05:33

bittremieux approved these changes Aug 21, 2024

View reviewed changes

Lilferrit merged commit 67939b8 into dev Aug 21, 2024
7 checks passed

Lilferrit deleted the elim-eval branch August 21, 2024 21:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eliminate evaluate Command #359

Eliminate evaluate Command #359

Lilferrit commented Jul 31, 2024

bittremieux left a comment •

edited

Loading

Lilferrit commented Aug 6, 2024

Lilferrit commented Aug 6, 2024 •

edited

Loading

codecov bot commented Aug 6, 2024 •

edited

Loading

bittremieux left a comment

Lilferrit commented Aug 7, 2024

Lilferrit commented Aug 7, 2024

bittremieux commented Aug 8, 2024

Lilferrit commented Aug 14, 2024

bittremieux left a comment

Eliminate evaluate Command #359

Eliminate evaluate Command #359

Conversation

Lilferrit commented Jul 31, 2024

bittremieux left a comment • edited Loading

Choose a reason for hiding this comment

Lilferrit commented Aug 6, 2024

Lilferrit commented Aug 6, 2024 • edited Loading

codecov bot commented Aug 6, 2024 • edited Loading

Codecov Report

bittremieux left a comment

Choose a reason for hiding this comment

Lilferrit commented Aug 7, 2024

Lilferrit commented Aug 7, 2024

bittremieux commented Aug 8, 2024

Lilferrit commented Aug 14, 2024

bittremieux left a comment

Choose a reason for hiding this comment

bittremieux left a comment •

edited

Loading

Lilferrit commented Aug 6, 2024 •

edited

Loading

codecov bot commented Aug 6, 2024 •

edited

Loading