You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Update:
There will no disk space saved most likely. Integers in TTrees will get int32, bools are bool and floating point numbers of C++ float will become float32 automatically using uproot (see below).
Converting all float32 columns to float64 columns makes the pickled pandas.DataFrames ~1.3 times as large both using lz4 compression.
If datatypes for
pandas.DataFrame
are not specified, a query operation with negative values most likely fails (see also: https://stackoverflow.com/questions/50400843/using-negative-numbers-in-pandas-dataframe-query-expression)Hence, we cannot directly apply custom cuts with negative values using
pandas.DataFrame.query
which is a major problem.Furthermore, specifying datatypes might save diskspace!
This needs to be done at the first processing stage when TTrees are converted and pickled.
The text was updated successfully, but these errors were encountered: