Remove batch-static tensor from dataset class and models #13

joeloskarsson · 2024-03-18T08:14:02Z

The batch-static tensor contained forcing that differed between initialization times, but stayed static for all lead times of a forecast. For the MEPS data we used this for the land-water-mask, as this could be different throughout the year, but we could not produce separate values per lead time (as all other forcing).

This PR removes the batch-static features as an explicit extra input. The motivation is:

Having such input features is quite a rare and a highly specific case.
If such inputs exists, it is better to just treat them as any other type of forcing. Then the values have to be repeated over the temporal dimension, but this can either be handled in pre-processing or easily in the Dataset class. In this PR the MEPS Dataset class is changed to take this approach.
Needing to pass around the batch-static features clutter up the code. For most dataset they would not be used, requiring constant special checks for if they are None.

This PR changes:

Bake the batch-static features into the normal forcing in the MEPS Dataset class.
Change the Dataset class to only return 3 tensors per sample (init, target, forcing).
Remove the batch-static tensor from being extracted from the batch and passed around in the graph-based models. This while making sure that input dimensions line up so older checkpoints can still be loaded correctly.

joeloskarsson · 2024-03-18T08:17:07Z

neural_lam/models/base_graph_model.py

@@ -115,7 +112,6 @@ def predict_step(
            (
                prev_state,
                prev_prev_state,
-                batch_static_features,


Note that the batch-static features are now put as the first feature dimension in forcing. Earlier they were stacked right on top of forcing in this tensor. This results in no change to how grid_features looks like for a sample. Importantly, this means that models trained before this PR can be loaded and works without any problems.

joeloskarsson · 2024-03-18T08:20:50Z

@sadamov Hope it's ok that I put you to review PRs like this :) I think it's valuable to get a second pair of eyes to look at the changes, and also good for you to get an update on small things I am changing.

The changes to the MEPS Dataset class here are not very important, this is really motivated by moving away from things being too specific for that data.

sadamov

I agree with both the general direction of and the explicit changes to the codebase.

In general, making the dataloader more flexible and reducing the complexity of input feature types, allows for easier onboarding of new collaborators.
I tested the explicit changes with the meps_example dataset and training is successful without batch_static_features.

joeloskarsson · 2024-03-18T09:40:12Z

Thanks for taking a look!

I just realized I forgot to change create_parameter_weights.py, as the Dataset class is used in there also. Will fix that (should only be a tiny change of index) and then merge.

…Dataset

Squashed commit of the following: commit b0050b9 Author: Joel Oskarsson <[email protected]> Date: Mon Mar 18 10:56:45 2024 +0100 Remove batch-static tensor from dataset class and models (mllam#13) * Bake the batch-static features into the normal forcing in the MEPS Dataset class. * Change the Dataset class to only return 3 tensors per sample (init, target, forcing). * Remove the batch-static tensor from being extracted from the batch and passed around in the graph-based models. This while making sure that input dimensions line up so older checkpoints can still be loaded correctly. commit 0669ff4 Author: Joel Oskarsson <[email protected]> Date: Thu Feb 29 11:50:27 2024 +0100 Re-define RMSE metric to take sqrt after sample averaging (mllam#10)

joeloskarsson added 2 commits March 17, 2024 18:37

Change batch-static forcing features to be baked into dynamic forcing

65f00ea

Fix formatting with pre-commit

2d9afd1

joeloskarsson commented Mar 18, 2024

View reviewed changes

joeloskarsson requested a review from sadamov March 18, 2024 08:17

sadamov approved these changes Mar 18, 2024

View reviewed changes

Change create_parameter_weights script to correctly use batches from …

fe46758

…Dataset

joeloskarsson merged commit b0050b9 into main Mar 18, 2024
1 check passed

joeloskarsson deleted the remove_batch_static branch March 18, 2024 09:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove batch-static tensor from dataset class and models #13

Remove batch-static tensor from dataset class and models #13

joeloskarsson commented Mar 18, 2024

joeloskarsson Mar 18, 2024

joeloskarsson commented Mar 18, 2024

sadamov left a comment

joeloskarsson commented Mar 18, 2024

Remove batch-static tensor from dataset class and models #13

Remove batch-static tensor from dataset class and models #13

Conversation

joeloskarsson commented Mar 18, 2024

joeloskarsson Mar 18, 2024

Choose a reason for hiding this comment

joeloskarsson commented Mar 18, 2024

sadamov left a comment

Choose a reason for hiding this comment

joeloskarsson commented Mar 18, 2024