Replace pystan models with numpyro #377

jack89roberts · 2021-07-17T17:37:29Z

See:

- Hangs if running multithreaded, possibly because bpl-next uses jax which is itself uses multiprocessing. See jax-ml/jax#1805 but changing multiprocessing in AIrsenal gives errors about sqlalchemy session not being pickle-able. - bpl-next predictions occasionally have nan values

review-notebook-app · 2021-07-23T21:00:49Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

… working

jack89roberts · 2021-08-07T16:18:57Z

Hopefully have a working team model now, but could do with someone checking what I've done to avoid NaNs and for adding new teams.

Also have an idea to get around the multiprocessing issue - we should be able to pre-generate score probabilities for each fixture and pass those to each thread rather than the team model (may also be a bit quicker as we'd only compute them once per fixture rather than once per player like we do currently)

though actually looks like very limited benefit over single threaded

…titute/AIrsenal into feature/313-pyro

jack89roberts · 2021-08-07T21:02:08Z

Nice work Nick!

Quick comparison before/after new player model (but using bpl-next in both cases):

Stan player model:

==================================================
PREDICTED TOP 5 PLAYERS FOR GAMEWEEK(S) [1, 2, 3, 4, 5]:
==================================================
GK:
1. Alisson Ramses Becker, 21.38pts (£6.0m, LIV)
2. Emiliano Martínez, 21.24pts (£5.5m, AVL)
3. Dean Henderson, 20.28pts (£5.0m, MUN)
4. Illan Meslier, 19.69pts (£5.0m, LEE)
5. Hugo Lloris, 19.56pts (£5.5m, TOT)
-------------------------
DEF:
1. Trent Alexander-Arnold, 28.34pts (£7.5m, LIV)
2. Andrew Robertson, 25.72pts (£7.0m, LIV)
3. Lucas Digne, 20.69pts (£5.5m, EVE)
4. Nathaniel Phillips, 20.48pts (£4.5m, LIV)
5. Rúben Santos Gato Alves Dias, 20.11pts (£6.0m, MCI)
-------------------------
MID:
1. Mohamed Salah, 31.83pts (£12.5m, LIV)
2. Sadio Mané, 28.63pts (£12.0m, LIV)
3. Bruno Miguel Borges Fernandes, 27.17pts (£12.0m, MUN)
4. Heung-Min Son, 25.54pts (£10.0m, TOT)
5. Mason Greenwood, 21.23pts (£7.5m, MUN)
-------------------------
FWD:
1. Harry Kane, 27.45pts (£12.5m, TOT)
2. Jamie Vardy, 27.02pts (£10.5m, LEI)
3. Gabriel Fernando de Jesus, 26.97pts (£8.5m, MCI)
4. Kelechi Iheanacho, 25.02pts (£7.5m, LEI)
5. Roberto Firmino, 23.17pts (£9.0m, LIV)
-------------------------

NumPyro player model:

==================================================
PREDICTED TOP 5 PLAYERS FOR GAMEWEEK(S) [1, 2, 3, 4, 5]:
==================================================
GK:
1. Alisson Ramses Becker, 21.38pts (£6.0m, LIV)
2. Emiliano Martínez, 21.24pts (£5.5m, AVL)
3. Dean Henderson, 20.28pts (£5.0m, MUN)
4. Illan Meslier, 19.69pts (£5.0m, LEE)
5. Hugo Lloris, 19.56pts (£5.5m, TOT)
-------------------------
DEF:
1. Trent Alexander-Arnold, 29.34pts (£7.5m, LIV)
2. Andrew Robertson, 26.32pts (£7.0m, LIV)
3. Nathaniel Phillips, 21.18pts (£4.5m, LIV)
4. Lucas Digne, 21.04pts (£5.5m, EVE)
5. Rúben Santos Gato Alves Dias, 20.64pts (£6.0m, MCI)
-------------------------
MID:
1. Mohamed Salah, 33.57pts (£12.5m, LIV)
2. Sadio Mané, 30.80pts (£12.0m, LIV)
3. Bruno Miguel Borges Fernandes, 28.37pts (£12.0m, MUN)
4. Heung-Min Son, 27.06pts (£10.0m, TOT)
5. Mason Greenwood, 23.01pts (£7.5m, MUN)
-------------------------
FWD:
1. Gabriel Fernando de Jesus, 29.71pts (£8.5m, MCI)
2. Harry Kane, 28.02pts (£12.5m, TOT)
3. Jamie Vardy, 27.98pts (£10.5m, LEI)
4. Roberto Firmino, 26.30pts (£9.0m, LIV)
5. Kelechi Iheanacho, 26.26pts (£7.5m, LEI)
-------------------------

New model seems to like Jesus and Firmino a decent chunk more than the old model (NumPyro scores also look higher in general).

nbarlowATI · 2021-08-08T13:28:45Z

Interesting!!
I'm going to tidy up a bit (flake8 etc.) and add a new function to the model to return the p(score), p(assist) etc. given a player_id. I'm still not entirely sure what it's doing for new players where it doesn't have any previous data...

…titute/AIrsenal into feature/313-pyro

jack89roberts · 2021-08-08T20:50:42Z

I think we just predict 0 for new players as they have 0 recent minutes, so they probably don't make it as far as the player model.

The model seems to predict Firmino and Salah to have very similar score/assist probabilities - I'm wondering if it's something to do with priors (we have a position-dependent prior and Salah scores much more than an average midfielder but Firmino (maybe) much less than an average striker - perhaps the prior is dragging them both towards the mean).

nbarlowATI · 2021-08-09T08:26:40Z

Hmm... that would make sense, but not sure why it would be any different to what it was with the Stan model...?

For the new players, I think you're right - it's the minutes that gives them zero. I looked at what the model itself predicts for their p(score) etc. though, and it's very close to the position-dependent averages, which I guess is what we want.

jack89roberts added 2 commits July 17, 2021 18:22

WIP: start creating an interface for bpl-next

b038aeb

jack89roberts changed the title ~~WIP: Replace bpl (pystan) team model with bpl-next (numpyro)~~ [WIP] Replace bpl (pystan) team model with bpl-next (numpyro) Jul 19, 2021

Create team_model_bpl_next.ipynb

0a3d7a2

nbarlowATI added 2 commits August 1, 2021 11:53

skeleton (wrong) player model in numpyro

a133d50

update to numpyro_player_model notebook - getting closer to something…

caf3c5d

… working

jack89roberts changed the title ~~[WIP] Replace bpl (pystan) team model with bpl-next (numpyro)~~ [WIP] Replace pystan models with numpyro Aug 6, 2021

jack89roberts added 3 commits August 6, 2021 21:48

update bpl-next version

eabebf2

1st hopefully fully working team model with bpl next

8c23fc2

remove unused imports

7880702

nbarlowATI and others added 4 commits August 7, 2021 20:28

first go at implementation of player_model in numpyro

5e76ff2

remove pystan from requirements.txt

50ce771

fix multithreaded predictions

de99edf

though actually looks like very limited benefit over single threaded

Merge branch 'feature/313-pyro' of https://github.com/alan-turing-ins…

4e92e5e

…titute/AIrsenal into feature/313-pyro

nbarlowATI added 10 commits August 8, 2021 15:25

formatting

a03c098

Merge branch 'feature/313-pyro' of https://github.com/alan-turing-ins…

16ba985

…titute/AIrsenal into feature/313-pyro

remove stan player model

5560c31

update tests for predictions

f489a9d

add function to get probs for given player_id

64dd77d

remove unused imports

9658a0c

remove unused function

ed83642

remove stan-related functions from setup.py

4ccd11a

remove stan-related functions from setup.py

cda3586

remove some unused imports to keep flake8 happy

bdf76e6

Remove a few remaining pystan references

c2f672c

nbarlowATI marked this pull request as ready for review August 10, 2021 13:44

nbarlowATI merged commit cab7eb2 into develop Aug 10, 2021

callummole mentioned this pull request Aug 10, 2021

Experiment with using Pyro instead of Stan? #313

Closed

jack89roberts changed the title ~~[WIP] Replace pystan models with numpyro~~ Replace pystan models with numpyro Aug 10, 2021

jack89roberts deleted the feature/313-pyro branch August 10, 2021 14:02

jack89roberts restored the feature/313-pyro branch August 10, 2021 14:03

jack89roberts deleted the feature/313-pyro branch August 10, 2021 14:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace pystan models with numpyro #377

Replace pystan models with numpyro #377

jack89roberts commented Jul 17, 2021 •

edited

Loading

review-notebook-app bot commented Jul 23, 2021

jack89roberts commented Aug 7, 2021

jack89roberts commented Aug 7, 2021 •

edited

Loading

nbarlowATI commented Aug 8, 2021

jack89roberts commented Aug 8, 2021

nbarlowATI commented Aug 9, 2021

Replace pystan models with numpyro #377

Replace pystan models with numpyro #377

Conversation

jack89roberts commented Jul 17, 2021 • edited Loading

review-notebook-app bot commented Jul 23, 2021

jack89roberts commented Aug 7, 2021

jack89roberts commented Aug 7, 2021 • edited Loading

nbarlowATI commented Aug 8, 2021

jack89roberts commented Aug 8, 2021

nbarlowATI commented Aug 9, 2021

jack89roberts commented Jul 17, 2021 •

edited

Loading

jack89roberts commented Aug 7, 2021 •

edited

Loading