
Manual/semi-automatic performance regression checking #356

Merged

Conversation

MarionBWeinzierl
Collaborator

@MarionBWeinzierl MarionBWeinzierl commented Nov 12, 2024

Description

This branch includes code to run profiling on PyRealm and to check whether performance has degraded between two commits.

Fixes #256

Type of change

  • New feature (non-breaking change which adds functionality)
  • Optimization (back-end change that speeds up the code)
  • Bug fix (non-breaking change which fixes an issue)

Key checklist

  • Make sure you've run the pre-commit checks: $ pre-commit run -a
  • All tests pass: $ poetry run pytest

Further checks

  • Code is commented, particularly in hard-to-understand areas
  • Tests added that prove fix is effective or that feature works

@MarionBWeinzierl MarionBWeinzierl self-assigned this Nov 12, 2024
@MarionBWeinzierl MarionBWeinzierl linked an issue Nov 12, 2024 that may be closed by this pull request
@MarionBWeinzierl MarionBWeinzierl marked this pull request as draft November 12, 2024 17:13
@codecov-commenter

codecov-commenter commented Nov 12, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.09%. Comparing base (1f315ba) to head (955528d).
Report is 469 commits behind head on develop.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop     #356      +/-   ##
===========================================
+ Coverage    95.29%   96.09%   +0.80%     
===========================================
  Files           28       35       +7     
  Lines         1720     2766    +1046     
===========================================
+ Hits          1639     2658    +1019     
- Misses          81      108      +27     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@MarionBWeinzierl
Collaborator Author

MarionBWeinzierl commented Nov 18, 2024

The scripts currently lack any error checking etc., but before I do more, here are some questions, @davidorme:

  1. Is it ok to have this as a bash script calling Python, or would you rather have a Python script (potentially calling some bash for the git parts)?
  2. This script uses the existing pytest-profile functionality. As you mentioned on Slack, we could also just call cProfile. I am wondering whether this affects the precision of the results.
  3. It seems that there is a bit of fluctuation that we would need to account for with a tolerance and/or larger datasets to reduce the impact of overhead (or maybe we need to focus on certain core functions?). Running the current script on the same two commit hashes three times gives, for example, the following results:

[screenshots: timing comparison output from three runs on the same pair of commits]

@davidorme
Collaborator

The scripts currently lack any error checking etc., but before I do more, here are some questions, @davidorme:

  1. Is it ok to have this as a bash script calling Python, or would you rather have a Python script (potentially calling some bash for the git parts)?

I think it would be better to do this all in Python (maybe using GitPython?) but that's mostly to avoid issues for currently hypothetical Windows-based developers. So, for the moment, I think this is fine.
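For what it's worth, a minimal sketch of what the git-handling part could look like with GitPython. The function name, output handling and the exact pytest invocation are placeholders, and a real script would also need the error checking mentioned above:

```python
import shutil
import subprocess

from git import Repo  # GitPython


def profile_commit(repo_path: str, commit: str, outfile: str) -> None:
    """Check out a commit, run the profiling tests and keep the combined profile."""
    repo = Repo(repo_path)
    original = repo.head.commit.hexsha  # remember where we started
    try:
        repo.git.checkout(commit)
        # Run the profiling test suite; pytest-profiling writes .prof files.
        subprocess.run(
            ["poetry", "run", "pytest", "-m", "profiling", "--profile"],
            cwd=repo_path,
            check=True,
        )
        # pytest-profiling stores its merged output as prof/combined.prof by default.
        shutil.copy(f"{repo_path}/prof/combined.prof", outfile)
    finally:
        repo.git.checkout(original)  # always restore the original checkout
```

Called once per commit hash, this would leave two .prof files to compare afterwards.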

  2. This script uses the existing pytest-profile functionality. As you mentioned on Slack, we could also just call cProfile. I am wondering whether this affects the precision of the results.

I don't think it should. IIUC, pytest-profiling just wraps exactly the same operation.
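In other words, something like the sketch below (not the script as it stands) should give essentially the same numbers as the pytest-profile run, since both routes go through cProfile:

```python
import cProfile
import pstats

import pytest

# Profile the same test selection that the script exercises via pytest-profiling.
profiler = cProfile.Profile()
profiler.enable()
pytest.main(["-m", "profiling"])
profiler.disable()

# Print the top 20 functions by cumulative time.
pstats.Stats(profiler).sort_stats("cumulative").print_stats(20)
```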

  3. It seems that there is a bit of fluctuation that we would need to account for with a tolerance and/or larger datasets to reduce the impact of overhead (or maybe we need to focus on certain core functions?). Running the current script on the same two commit hashes three times gives, for example, the following results:

Yeah - I think that increasing the load is probably the way to go, at least initially. The other thing is to dive deeper into the results to find which functions have changed, but that's probably a new PR.
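For that later PR, one possible way to find which functions changed would be to compare per-function cumulative times between two saved profiles using pstats. A rough sketch only; the file names and the 10% threshold are placeholders:

```python
import pstats


def per_function_times(path: str) -> dict:
    """Map (file, line, function) tuples to cumulative time from a saved profile."""
    # Raw entries are (primitive calls, total calls, tottime, cumtime, callers).
    raw = pstats.Stats(path).stats
    return {func: cumtime for func, (cc, nc, tt, cumtime, callers) in raw.items()}


old = per_function_times("old.prof")
new = per_function_times("new.prof")

# Flag functions whose cumulative time grew by more than 10%, slowest first.
for func, old_time in sorted(old.items(), key=lambda item: -item[1]):
    new_time = new.get(func)
    if new_time is not None and old_time > 0 and new_time > 1.1 * old_time:
        print(f"{func}: {old_time:.3f}s -> {new_time:.3f}s")
```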

@MarionBWeinzierl
Collaborator Author

Talking about overheads... I noticed that the most expensive parts in the list were those associated with running pytest itself:
[screenshot: profiling output with the pytest machinery at the top of the list]

I had assumed, when I created the simplified version, that these parts would be roughly similar in each run and therefore would not need excluding, but I was wrong. So I reinstated the exclude parameter to make sure those are thrown out. This is what I get now, running the same thing three times:
[screenshots: timing comparison output from three runs with the pytest overhead excluded]

Given that the two commits I am comparing are quite similar, I would argue that we might still want a tolerance of, say, 5% or so.
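As a concrete illustration of that kind of check, here is a sketch that compares the total runtimes of two saved profiles against a 5% tolerance. The file names and the exact threshold are placeholders, not decisions:

```python
import pstats
import sys

TOLERANCE = 0.05  # fail if the new profile is more than 5% slower overall

old_total = pstats.Stats("old.prof").total_tt
new_total = pstats.Stats("new.prof").total_tt

if new_total > old_total * (1 + TOLERANCE):
    print(f"Possible performance regression: {old_total:.2f}s -> {new_total:.2f}s")
    sys.exit(1)

print(f"Within tolerance: {old_total:.2f}s -> {new_total:.2f}s")
```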

@MarionBWeinzierl MarionBWeinzierl marked this pull request as ready for review November 25, 2024 14:14
@MarionBWeinzierl
Collaborator Author

Do we want this in the CI (falling over if the code gets 5% slower), or as a manual thing?

local profiling and benchmarking.

See the [profiling and benchmarking page](https://pyrealm.readthedocs.io/en/latest/development/profiling_and_benchmarking.md)
Collaborator Author

That link does not work. Looking at https://github.com/ImperialCollegeLondon/pyrealm/blob/develop/docs/source/development/profiling_and_benchmarking.md , a lot of that information is also no longer valid. We will need to adapt that when we have decided how to proceed (i.e., whether to keep the old benchmarking code, if and when to run the new code automatically, etc.).

Collaborator Author

@davidorme would you prefer me to use this to replace the old run_benchmarking.py, or to leave the old script in and keep this as simple_benchmarking.py? That decision will also influence how the above-mentioned documentation is extended or adapted/rewritten.

CONTRIBUTING.md (outdated review thread, resolved)
@j-emberton
Collaborator

j-emberton commented Dec 2, 2024

This issue was found when trying to run ./performance_regression_checking.sh locally on an M2 Mac.

[screenshot of the error output]

Appears to be a Mac/Linux incompatibility.

The specific issue was fixed by converting
poetry run /usr/bin/time -v pytest -m "profiling" --profile-svg
to
poetry run /usr/bin/time -l pytest -m "profiling" --profile-svg
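If the wrapper eventually moves to Python, as discussed above, one way to sidestep this would be to pick the flag from the platform, since GNU time on Linux takes -v while BSD time on macOS takes -l. A rough sketch only:

```python
import platform
import subprocess

# BSD time (macOS) uses -l for resource usage; GNU time (Linux) uses -v.
time_flag = "-l" if platform.system() == "Darwin" else "-v"

subprocess.run(
    ["poetry", "run", "/usr/bin/time", time_flag,
     "pytest", "-m", "profiling", "--profile-svg"],
    check=True,
)
```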

@davidorme
Collaborator

davidorme commented Dec 3, 2024

Something is up with the profiling in the CI. The profiling YAML is re-running the whole of pyrealm_ci.yaml under the test job. I think we can probably simply add the profiling job as a new job under the docs job in pyrealm_ci.yaml, which avoids having to import the previous workflow and ensures that the previous testing runs?

Also, the purpose of this job is just to check that the profiling script works? We only need it to run, so we can drop all of the graphviz stuff from the job and make it faster and cleaner.

I've pushed an update to set up what I think works? We'll see if it does!

@davidorme
Collaborator

I think that's cleaner? It's not really doing a different workflow, it's adding a new CI test to the standard test and build suite, so it seems reasonable to just add it there.

With that change, I think we can delete:

.github/workflows/pyrealm_profiling_after_push.yaml
.github/workflows/pyrealm_profiling_on_approve.yaml
.github/workflows/pyrealm_profiling_without_benchmarking.yaml

Collaborator

@davidorme davidorme left a comment

LGTM - this works for me and I think it is a much saner approach. We need to work on the docs, and possibly on moving things into Python and adding functionality, but this gets us back on track.

CONTRIBUTING.md (outdated review thread, resolved)
@MarionBWeinzierl
Collaborator Author

I have created #358 as a follow-up regarding the old code and documentation. I am merging this in now, as an incremental improvement.

@MarionBWeinzierl MarionBWeinzierl merged commit fda0906 into develop Dec 3, 2024
13 checks passed
@MarionBWeinzierl MarionBWeinzierl deleted the issue256-manual-performance-regression-checking branch December 3, 2024 13:49
Development

Successfully merging this pull request may close these issues.

Develop manual performance regression checking
4 participants