Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Check Fixed target DIS #2170

Merged
merged 38 commits into from
Dec 4, 2024
Merged

Check Fixed target DIS #2170

merged 38 commits into from
Dec 4, 2024

Conversation

giacomomagni
Copy link
Contributor

@giacomomagni giacomomagni commented Oct 10, 2024

This PR is to implement the fixed target dis datasets from hepdata.

Summary

Experiment Reproducibility Note TODO
BCDMS Two new observables, which superseede the old ones.
CHORUS Kinematics and central data match. Last point added manually. Minor change in Q2
EMC Kinematics and central data match. Branching ratio of 0.82 wrt to Hepdata. Add syst of 15%
NMC 🟡 The ratio dataset matches (except for uncertainties). ⚠️ $\sigma$ normalisation not clear for the P dataset. Two new observables, which are alternative to the old ones.
NUTEV 🟡 Implementation matches with Mason theses. Br and acceptance corrections where provided directly by the author. Minor change in Q2 acc_err not used ? where is the normalization error coming from ?
SLAC 🔴 Numbers are consistent with Phys.Lett.B 282 (1992) 475, but the source is unavailable. See comment below.

Common questions

  • ⚠️ Old implementation used the breakdown of systematics when available. HepData use the combined one.
    How do we want to proceed? NO, only in the legacy data.
  • ⚠️ Do we want to store the information about the old systematics somewhere? NO, only in the legacy data
  • ⚠️ Do we want to store the old rawdata for SLAC ? NO, only in the legacy data

New variants

Dataset hepdata arxiv Action Motivation Uncertainties Variants
BCDMS_NC_NOTFIXED_P link link no action taken, replaced by new observables The old implementation (data and kinematic) coincides with the R=0, not averaged on $\sqrt{s}$ values. However the averaged and not averaged tables do not have the same kinematic. Breakdown of syst. not given in HepData
BCDMS_NC_NOTFIXED_D link link no action taken, replaced by new observables The old implementation (data and kinematic) coincides with the R=0, not averaged on $\sqrt{s}$ values. However the averaged and not averaged tables do not have the same kinematic. Breakdown of syst. not given in HepData
CHORUS_CC_NOTFIXED_PB link link new variant Kinematics and central value match Breakdown of syst. not given in HepData hepdata
EMC_NC_250GEV_DW link link new variant Kinematics and central match value Old implementation has a 0.15 % syst not given in HepData. + nuclear u. rzero
NMC_NC_NOTFIXED link link new variant Kinematics and central value match Breakdown of syst. not given in HepData hepdata
NMC_NC_NOTFIXED_P link link no action taken, see new observables Kinematics match, $\sigma$ was computed in the old buildmaster from R and F2 Full breakdown of systematics included in legacy.
NUTEV_CC_NOTFIXED_PB link new variant Kinematics and central value match copied from old buildmaster hepdata with the updated BR and no nuclear uncertanties
SLAC_NC_NOTFIXED_P no action taken. metadata amended.
SLAC_NC_NOTFIXED_D no action taken. metadata amended.

New HEPDATA Observables

Observable hepdata arxiv Motivation Variants
BCDMS_NC_NOTFIXED_P_EM-F2-HEPDATA link link Averaged values on $\sqrt{s}$ has different kinematics. rzero, rqcd
BCDMS_NC_NOTFIXED_D_EM-F2-HEPDATA link link Averaged values on $\sqrt{s}$ has different kinematics. rzero, rqcd
NMC_NC_NOTFIXED_P_EM-F2-HEPDATA link link Old implementation was $\sigma$, not $F_2$, breakdown of syst. not given
NMC_NC_NOTFIXED_D_EM-F2-HEPDATA link link Not available before (possible double counting), breakdown of syst. not given

@giacomomagni giacomomagni marked this pull request as draft October 10, 2024 15:42
@giacomomagni
Copy link
Contributor Author

giacomomagni commented Oct 10, 2024

Okay here we have the first interesting case:
central data and and kinematics (even if I swap sqrts with y) do not match exactly.
What do we want to do? New dataset ? New observable ? @scarlehoff @enocera

Other question:
do we really want to store the hepdata tables or maybe automatically download them ?

@enocera
Copy link
Contributor

enocera commented Oct 10, 2024

My opinion is as follows:

  • implement a new data set, consistent with the information on Hepdata, given that the legacy data set is there, we could study later the impact of the "new" dataset w.r.t. the old;
  • we want to download and store the tables.

@giacomomagni giacomomagni linked an issue Oct 10, 2024 that may be closed by this pull request
@scarlehoff
Copy link
Member

Should we do a complete new dataset or just have a legacy variant?

(this is a new situation due to the kinematics change so a decision has to be made!! I'm personally partial to have either a variant or an observable even if the legacy didn't come from the same paper)

@giacomomagni
Copy link
Contributor Author

giacomomagni commented Oct 10, 2024

So the legacy and the hepdata version should come from the same reference.
I'd be in favor of creating a variant (with different kinematic, unc and data) so this way it will be clear that it's completely exclusive wrt to the legacy one, and they would belong under the same observable and dataset as they should.

But I'm not sure if kinematic variants are easy to support (or already working)

@scarlehoff
Copy link
Member

But I'm not sure if kinematic variants are easy to support (or already working)

It might be working and if not it's very easy to support so no problem

@giacomomagni
Copy link
Contributor Author

giacomomagni commented Oct 10, 2024

It might be working and if not it's very easy to support so no problem

But are you sure? for instance it will requires also different FKtables, ecc...

@scarlehoff
Copy link
Member

Theory and data/uncertainties are supported by variants (at the beginning it was only for uncertainties but then we realized data and theory might need to change for these legacy cases).

We decided to create a new dataset if e.g. the number of points changed.

@giacomomagni
Copy link
Contributor Author

As agrred with @enocera we have added some modifications to the EMC and CHORUS data w.r.t the plain hepdata implementation.

  • For Chorus. We have decided to retain in the hepdata variant also CHORUSISOTARGCOR and CHORUSQEDRADCOR, computed exactly as in the legacy variant. The first uncertainty is due the isoscalar target correction interpreted as uncertainty, while the secondo one takes into account possible QED effects (QED radiation correction interpreted as uncertainty).

  • For EMC. We added to the hepdata implementation an addtional 15% "general" systematical uncertainty, which was taken as from buidmaster. We also take into account a 1.2% error from the Branching ration.

This should conclude the revision of these datasets. So the PR is ready for rewiev.

@giacomomagni giacomomagni marked this pull request as ready for review December 2, 2024 16:17
@giacomomagni giacomomagni added the run-fit-bot Starts fit bot from a PR. label Dec 2, 2024
@scarlehoff
Copy link
Member

Thanks @giacomomagni should we merge #2192 here before doing any checks?

@giacomomagni
Copy link
Contributor Author

Thanks @giacomomagni should we merge #2192 here before doing any checks?

sure, we can do it!

@scarlehoff
Copy link
Member

A quick look seems ok (I have to check the failing fitbot, it seems vp crashed?) but I have two questions:

  1. Why is there no default variant? I would say that we want to use the dataset without having to specify variant: xx. Just select the variant that you consider the "most correct" and promote it to be the default.

1.b Shouldn't we have a variant: dw for some of the nuclear uncertainties or in those cases we just need to use the legacy?

  1. Wouldn't it make more sense (for organizational purposes) to have the _HEPDATA datasets as a separate obaervable within the same folder as the others? I guess this is somewhat irrelevant, but given that the others are just a port I think they can live safely in the same folder.

RE the other PR, actually it might be possible to merge it directly to master so on second thoughts better finish it and merge the other by itself since it is basically harmless.

@giacomomagni
Copy link
Contributor Author

A quick look seems ok (I have to check the failing fitbot, it seems vp crashed?) but I have two questions:

  1. Why is there no default variant? I would say that we want to use the dataset without having to specify variant: xx. Just select the variant that you consider the "most correct" and promote it to be the default.

1.b Shouldn't we have a variant: dw for some of the nuclear uncertainties or in those cases we just need to use the legacy?

which dataset are you referring exactly?

  1. Wouldn't it make more sense (for organizational purposes) to have the _HEPDATA datasets as a separate obaervable within the same folder as the others? I guess this is somewhat irrelevant, but given that the others are just a port I think they can live safely in the same folder.

This is not poissbile because the kinematic is not the same, or the number of points was different, see the tables at the top of the PR...

RE the other PR, actually it might be possible to merge it directly to master so on second thoughts better finish it and merge the other by itself since it is basically harmless.

okay

@scarlehoff
Copy link
Member

which dataset are you referring exactly?

All of them, you left dataset_uncertainties empty for all of them (including the ones that are purely new)

This is not poissbile because the kinematic is not the same, or the number of points was different, see the tables at the top of the PR...

You can have two different observables with different kinematics. I'm not talking about variants. So you have BCDMS_NC_NOTFIXED_D_EM-F2 and BCDMS_NC_NOTFIXED_D_EM-F2-HEPDATA, both under BCDMS_NC_NOTFIXED_D.

@giacomomagni
Copy link
Contributor Author

which dataset are you referring exactly?

All of them, you left dataset_uncertainties empty for all of them (including the ones that are purely new)

Okay sorry, let me fix it and rerun the fitbot.

You can have two different observables with different kinematics. I'm not talking about variants. So you have BCDMS_NC_NOTFIXED_D_EM-F2 and BCDMS_NC_NOTFIXED_D_EM-F2-HEPDATA, both under BCDMS_NC_NOTFIXED_D.

But here we're going in loop:

#2170 (comment)
#2170 (comment)

@scarlehoff
Copy link
Member

Okay sorry, let me fix it and rerun the fitbot.

I'm not sure what the problem of the fitbot is

But here we're going in loop:

No, we just meant different things by dataset*, I think you meant a "set of data" while I meant really a dataset name that you would put in a runcard.
What I'm suggesting is having two datasets: BCDMS_NC_NOTFIXED_D_EM-F2 and BCDMS_NC_NOTFIXED_D_EM-F2-HEPDATA both part of the set BCDMS_NC_NOTFIXED_D.

*I know, the terminology has become impossible, probably the per-folder organization was not a good idea but now it's too late.

@scarlehoff scarlehoff added run-fit-bot Starts fit bot from a PR. and removed run-fit-bot Starts fit bot from a PR. labels Dec 4, 2024
Copy link

github-actions bot commented Dec 4, 2024

Greetings from your nice fit 🤖 !
I have good news for you, I just finished my tasks:

Check the report carefully, and please buy me a ☕ , or better, a GPU 😉!

@scarlehoff scarlehoff merged commit 3298b56 into master Dec 4, 2024
9 checks passed
@scarlehoff scarlehoff deleted the check_fixed_target_dis branch December 4, 2024 14:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data toolchain run-fit-bot Starts fit bot from a PR.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Revisit implementation of all DIS
4 participants