Potential data structures for 3d data #733
jgriesfeller
started this conversation in
Ideas
Replies: 2 comments 2 replies
-
My thoughts: Data Structure2d pandas.DataFrame. (Mutable) Columns of the arrayMost basic form (lon, lat, alt, time, obs)
A ColocatedData would follow a similar basic form with an additional column for model data and columns for the associated metadata Metadata
|
Beta Was this translation helpful? Give feedback.
2 replies
-
Before we discuss, we should clarify what kinds of data we need to store:
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In preparation for tomorrow's (16 Sept 2022) PointCloudData meeting I thought it's a good idea to note some of my basic ideas.
Data structure:
The most basic data structure is a 2d numpy array of np.float. One row is one time step, the columns hold the data.
Columns of data array: (some are optional):
time bounds are NaN for instantaneous data, altitude is either the station's altitude or the altitude of the measurement (for 3d measurements)
Metadata structure (one entry per station)
optimisations for speed
in order to speed up common usage patterns indexes can be used. Common selections are based on:
additional data structures: Indexes
Things to remember
ndarray is immutable (
np.append
/np.resize
copy the data to a new grown array), the so avoid growing and shrinking where possible. If needed add data in chunks.Start with an array size that is likely big enough to hold all data
Beta Was this translation helpful? Give feedback.
All reactions