Check Fixed target DIS #2170

giacomomagni · 2024-10-10T15:42:05Z

This PR is to implement the fixed target dis datasets from hepdata.

Summary

Experiment	Reproducibility	Note	TODO
BCDMS	✅	Two new observables, which superseede the old ones.
CHORUS	✅	Kinematics and central data match. Last point added manually. Minor change in Q2
EMC	✅	Kinematics and central data match. Branching ratio of 0.82 wrt to Hepdata. Add syst of 15%
NMC	🟡	The ratio dataset matches (except for uncertainties). ⚠️ $\sigma$ normalisation not clear for the P dataset. Two new observables, which are alternative to the old ones.
NUTEV	🟡	Implementation matches with Mason theses. Br and acceptance corrections where provided directly by the author. Minor change in Q2	acc_err not used ? where is the normalization error coming from ?
SLAC	🔴	Numbers are consistent with Phys.Lett.B 282 (1992) 475, but the source is unavailable. See comment below.

Common questions

⚠️ Old implementation used the breakdown of systematics when available. HepData use the combined one.
How do we want to proceed? NO, only in the legacy data.
⚠️ Do we want to store the information about the old systematics somewhere? NO, only in the legacy data
⚠️ Do we want to store the old rawdata for SLAC ? NO, only in the legacy data

New variants

Dataset	hepdata	arxiv	Action	Motivation	Uncertainties	Variants
BCDMS_NC_NOTFIXED_P	link	link	no action taken, replaced by new observables	The old implementation (data and kinematic) coincides with the R=0, not averaged on $\sqrt{s}$ values. However the averaged and not averaged tables do not have the same kinematic.	Breakdown of syst. not given in HepData
BCDMS_NC_NOTFIXED_D	link	link	no action taken, replaced by new observables	The old implementation (data and kinematic) coincides with the R=0, not averaged on $\sqrt{s}$ values. However the averaged and not averaged tables do not have the same kinematic.	Breakdown of syst. not given in HepData
CHORUS_CC_NOTFIXED_PB	link	link	new variant	Kinematics and central value match	Breakdown of syst. not given in HepData	`hepdata`
EMC_NC_250GEV_DW	link	link	new variant	Kinematics and central match value	Old implementation has a 0.15 % syst not given in HepData. + nuclear u.	`rzero`
NMC_NC_NOTFIXED	link	link	new variant	Kinematics and central value match	Breakdown of syst. not given in HepData	`hepdata`
NMC_NC_NOTFIXED_P	link	link	no action taken, see new observables	Kinematics match, $\sigma$ was computed in the old buildmaster from R and F2	Full breakdown of systematics included in legacy.
NUTEV_CC_NOTFIXED_PB		link	new variant	Kinematics and central value match	copied from old buildmaster	`hepdata` with the updated BR and no nuclear uncertanties
SLAC_NC_NOTFIXED_P			no action taken. metadata amended.
SLAC_NC_NOTFIXED_D			no action taken. metadata amended.

New HEPDATA Observables

Observable	hepdata	arxiv	Motivation	Variants
BCDMS_NC_NOTFIXED_P_EM-F2-HEPDATA	link	link	Averaged values on $\sqrt{s}$ has different kinematics.	`rzero`, `rqcd`
BCDMS_NC_NOTFIXED_D_EM-F2-HEPDATA	link	link	Averaged values on $\sqrt{s}$ has different kinematics.	`rzero`, `rqcd`
NMC_NC_NOTFIXED_P_EM-F2-HEPDATA	link	link	Old implementation was $\sigma$, not $F_2$, breakdown of syst. not given
NMC_NC_NOTFIXED_D_EM-F2-HEPDATA	link	link	Not available before (possible double counting), breakdown of syst. not given

giacomomagni · 2024-10-10T15:46:25Z

Okay here we have the first interesting case:
central data and and kinematics (even if I swap sqrts with y) do not match exactly.
What do we want to do? New dataset ? New observable ? @scarlehoff @enocera

Other question:
do we really want to store the hepdata tables or maybe automatically download them ?

enocera · 2024-10-10T16:03:56Z

My opinion is as follows:

implement a new data set, consistent with the information on Hepdata, given that the legacy data set is there, we could study later the impact of the "new" dataset w.r.t. the old;
we want to download and store the tables.

scarlehoff · 2024-10-10T17:58:43Z

Should we do a complete new dataset or just have a legacy variant?

(this is a new situation due to the kinematics change so a decision has to be made!! I'm personally partial to have either a variant or an observable even if the legacy didn't come from the same paper)

giacomomagni · 2024-10-10T18:20:34Z

So the legacy and the hepdata version should come from the same reference.
I'd be in favor of creating a variant (with different kinematic, unc and data) so this way it will be clear that it's completely exclusive wrt to the legacy one, and they would belong under the same observable and dataset as they should.

But I'm not sure if kinematic variants are easy to support (or already working)

scarlehoff · 2024-10-10T18:24:45Z

But I'm not sure if kinematic variants are easy to support (or already working)

It might be working and if not it's very easy to support so no problem

giacomomagni · 2024-10-10T18:26:00Z

It might be working and if not it's very easy to support so no problem

But are you sure? for instance it will requires also different FKtables, ecc...

scarlehoff · 2024-10-11T05:34:34Z

Theory and data/uncertainties are supported by variants (at the beginning it was only for uncertainties but then we realized data and theory might need to change for these legacy cases).

We decided to create a new dataset if e.g. the number of points changed.

validphys2/src/validphys/commondataparser.py

This reverts commit 7dcd327.

scarlehoff · 2024-10-16T14:30:46Z

https://www.slac.stanford.edu/pubs/slacpubs/5250/slac-pub-5442.pdf
https://www.slac.stanford.edu/pubs/slacreports/reports11/slac-r-357.pdf

giacomomagni · 2024-12-02T16:17:20Z

As agrred with @enocera we have added some modifications to the EMC and CHORUS data w.r.t the plain hepdata implementation.

For Chorus. We have decided to retain in the hepdata variant also CHORUSISOTARGCOR and CHORUSQEDRADCOR, computed exactly as in the legacy variant. The first uncertainty is due the isoscalar target correction interpreted as uncertainty, while the secondo one takes into account possible QED effects (QED radiation correction interpreted as uncertainty).
For EMC. We added to the hepdata implementation an addtional 15% "general" systematical uncertainty, which was taken as from buidmaster. We also take into account a 1.2% error from the Branching ration.

This should conclude the revision of these datasets. So the PR is ready for rewiev.

scarlehoff · 2024-12-02T17:50:27Z

Thanks @giacomomagni should we merge #2192 here before doing any checks?

giacomomagni · 2024-12-02T21:22:18Z

Thanks @giacomomagni should we merge #2192 here before doing any checks?

sure, we can do it!

scarlehoff · 2024-12-03T07:36:41Z

A quick look seems ok (I have to check the failing fitbot, it seems vp crashed?) but I have two questions:

Why is there no default variant? I would say that we want to use the dataset without having to specify variant: xx. Just select the variant that you consider the "most correct" and promote it to be the default.

1.b Shouldn't we have a variant: dw for some of the nuclear uncertainties or in those cases we just need to use the legacy?

Wouldn't it make more sense (for organizational purposes) to have the _HEPDATA datasets as a separate obaervable within the same folder as the others? I guess this is somewhat irrelevant, but given that the others are just a port I think they can live safely in the same folder.

RE the other PR, actually it might be possible to merge it directly to master so on second thoughts better finish it and merge the other by itself since it is basically harmless.

giacomomagni · 2024-12-03T08:39:26Z

A quick look seems ok (I have to check the failing fitbot, it seems vp crashed?) but I have two questions:

Why is there no default variant? I would say that we want to use the dataset without having to specify variant: xx. Just select the variant that you consider the "most correct" and promote it to be the default.

1.b Shouldn't we have a variant: dw for some of the nuclear uncertainties or in those cases we just need to use the legacy?

which dataset are you referring exactly?

Wouldn't it make more sense (for organizational purposes) to have the _HEPDATA datasets as a separate obaervable within the same folder as the others? I guess this is somewhat irrelevant, but given that the others are just a port I think they can live safely in the same folder.

This is not poissbile because the kinematic is not the same, or the number of points was different, see the tables at the top of the PR...

RE the other PR, actually it might be possible to merge it directly to master so on second thoughts better finish it and merge the other by itself since it is basically harmless.

okay

scarlehoff · 2024-12-03T08:42:06Z

which dataset are you referring exactly?

All of them, you left dataset_uncertainties empty for all of them (including the ones that are purely new)

This is not poissbile because the kinematic is not the same, or the number of points was different, see the tables at the top of the PR...

You can have two different observables with different kinematics. I'm not talking about variants. So you have BCDMS_NC_NOTFIXED_D_EM-F2 and BCDMS_NC_NOTFIXED_D_EM-F2-HEPDATA, both under BCDMS_NC_NOTFIXED_D.

giacomomagni · 2024-12-03T08:45:13Z

which dataset are you referring exactly?

All of them, you left dataset_uncertainties empty for all of them (including the ones that are purely new)

Okay sorry, let me fix it and rerun the fitbot.

You can have two different observables with different kinematics. I'm not talking about variants. So you have BCDMS_NC_NOTFIXED_D_EM-F2 and BCDMS_NC_NOTFIXED_D_EM-F2-HEPDATA, both under BCDMS_NC_NOTFIXED_D.

But here we're going in loop:

#2170 (comment)
#2170 (comment)

scarlehoff · 2024-12-03T08:48:52Z

Okay sorry, let me fix it and rerun the fitbot.

I'm not sure what the problem of the fitbot is

But here we're going in loop:

No, we just meant different things by dataset*, I think you meant a "set of data" while I meant really a dataset name that you would put in a runcard.
What I'm suggesting is having two datasets: BCDMS_NC_NOTFIXED_D_EM-F2 and BCDMS_NC_NOTFIXED_D_EM-F2-HEPDATA both part of the set BCDMS_NC_NOTFIXED_D.

*I know, the terminology has become impossible, probably the per-folder organization was not a good idea but now it's too late.

nnpdf_data/nnpdf_data/commondata/BCDMS_NC_NOTFIXED_D/metadata.yaml

…ematics Exterminate k1,k2,k3 from Fixed target data

github-actions · 2024-12-04T11:33:18Z

Greetings from your nice fit 🤖 !
I have good news for you, I just finished my tasks:

Fit Name: NNBOT-5e49a24cb-2024-12-04
Fit Report wrt master: https://vp.nnpdf.science/jh6DywkkTDiQYOM-80xVdQ==
Fit Report wrt latest stable reference: https://vp.nnpdf.science/es-mfMAcTSybbiSqmfyfAg==
Fit Data: https://data.nnpdf.science/fits/NNBOT-5e49a24cb-2024-12-04.tar.gz

Check the report carefully, and please buy me a ☕ , or better, a GPU 😉!

init commit

d3259f2

giacomomagni marked this pull request as draft October 10, 2024 15:42

giacomomagni self-assigned this Oct 10, 2024

giacomomagni added the data toolchain label Oct 10, 2024

giacomomagni linked an issue Oct 10, 2024 that may be closed by this pull request

Revisit implementation of all DIS #2073

Closed

minor fixes

93f191c

attempt to extend variants to support different kinematics

7dcd327

scarlehoff reviewed Oct 11, 2024

View reviewed changes

validphys2/src/validphys/commondataparser.py Outdated Show resolved Hide resolved

giacomomagni added 10 commits October 11, 2024 11:47

Revert "attempt to extend variants to support different kinematics"

08a2565

This reverts commit 7dcd327.

move new implementation to new dataset

7a8d2e3

implement rzero and rqcd variants averaged on s

2f6eb90

improve variant error msg

b20f5e2

adding deuteron data

a6eac00

add EMC from Hepdata

15be1d2

minor fixes

ceeedfd

implement NMC F2 d/p ratio

e180dbb

minor fixes

cf2b1b3

add NMC F2_p and F2_d

99f772d

minor on fks names

7124d22

Radonirinaunimi mentioned this pull request Oct 18, 2024

Implementation of ATLAS_Z0J_8TEV PT-Y and PT-M in the new format #2169

Merged

giacomomagni added 2 commits October 22, 2024 15:48

add hepdata source for chorus

792ee55

fix BR in EMC data

7b55cac

giacomomagni added 4 commits November 28, 2024 17:32

Merge branch 'master' into check_fixed_target_dis

ae11351

Merge branch 'master' into check_fixed_target_dis

f3a9685

Add some legacy syst erro for EMC

483188b

add CHORUSQEDRADCOR and CHORUSISOTARGCOR

1cf1051

giacomomagni marked this pull request as ready for review December 2, 2024 16:17

giacomomagni requested review from scarlehoff, Radonirinaunimi and enocera December 2, 2024 16:17

giacomomagni added the run-fit-bot Starts fit bot from a PR. label Dec 2, 2024

giacomomagni added 3 commits December 3, 2024 16:17

NMC and BCDMS as new observables

c348992

utils fix

3660158

make fktables names more uniform

67ca8d4

scarlehoff reviewed Dec 3, 2024

View reviewed changes

nnpdf_data/nnpdf_data/commondata/BCDMS_NC_NOTFIXED_D/metadata.yaml Show resolved Hide resolved

scarlehoff and others added 3 commits December 4, 2024 07:49

yet another change to appease plotoptions

fa3f92b

exterminate k1,k2,k3 from legacy data

57b5304

Merge pull request #2192 from NNPDF/clean_legacy_fixed_target_dis_kin…

eac6d99

…ematics Exterminate k1,k2,k3 from Fixed target data

scarlehoff added run-fit-bot Starts fit bot from a PR. and removed run-fit-bot Starts fit bot from a PR. labels Dec 4, 2024

scarlehoff approved these changes Dec 4, 2024

View reviewed changes

scarlehoff merged commit 3298b56 into master Dec 4, 2024
9 checks passed

scarlehoff deleted the check_fixed_target_dis branch December 4, 2024 14:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Check Fixed target DIS #2170

Check Fixed target DIS #2170

giacomomagni commented Oct 10, 2024 •

edited

Loading

giacomomagni commented Oct 10, 2024 •

edited

Loading

enocera commented Oct 10, 2024 •

edited

Loading

scarlehoff commented Oct 10, 2024

giacomomagni commented Oct 10, 2024 •

edited

Loading

scarlehoff commented Oct 10, 2024

giacomomagni commented Oct 10, 2024 •

edited

Loading

scarlehoff commented Oct 11, 2024

scarlehoff commented Oct 16, 2024 •

edited

Loading

giacomomagni commented Dec 2, 2024

scarlehoff commented Dec 2, 2024

giacomomagni commented Dec 2, 2024

scarlehoff commented Dec 3, 2024

giacomomagni commented Dec 3, 2024

scarlehoff commented Dec 3, 2024

giacomomagni commented Dec 3, 2024

scarlehoff commented Dec 3, 2024

github-actions bot commented Dec 4, 2024

Check Fixed target DIS #2170

Check Fixed target DIS #2170

Conversation

giacomomagni commented Oct 10, 2024 • edited Loading

Summary

Common questions

New variants

New HEPDATA Observables

giacomomagni commented Oct 10, 2024 • edited Loading

enocera commented Oct 10, 2024 • edited Loading

scarlehoff commented Oct 10, 2024

giacomomagni commented Oct 10, 2024 • edited Loading

scarlehoff commented Oct 10, 2024

giacomomagni commented Oct 10, 2024 • edited Loading

scarlehoff commented Oct 11, 2024

scarlehoff commented Oct 16, 2024 • edited Loading

giacomomagni commented Dec 2, 2024

scarlehoff commented Dec 2, 2024

giacomomagni commented Dec 2, 2024

scarlehoff commented Dec 3, 2024

giacomomagni commented Dec 3, 2024

scarlehoff commented Dec 3, 2024

giacomomagni commented Dec 3, 2024

scarlehoff commented Dec 3, 2024

github-actions bot commented Dec 4, 2024

giacomomagni commented Oct 10, 2024 •

edited

Loading

giacomomagni commented Oct 10, 2024 •

edited

Loading

enocera commented Oct 10, 2024 •

edited

Loading

giacomomagni commented Oct 10, 2024 •

edited

Loading

giacomomagni commented Oct 10, 2024 •

edited

Loading

scarlehoff commented Oct 16, 2024 •

edited

Loading