
RuntimeError - Please reduce your request rate #650

Open · norlandrhagen opened this issue Dec 4, 2024 · 5 comments


@norlandrhagen

Hi there 👋

Deep within pangeo-forge-recipes we're seeing this error crop up when writing to a GCS bucket. It seems to happen with multiple gcsfs versions (2024.10.0, 2024.09.0, etc.):

RuntimeError: gcsfs.retry.HttpError: The object <path>/chirps-global-daily.zarr/time/0 exceeded the rate limit for object mutation operations (create, update, and delete). Please reduce your request rate. See https://cloud.google.com/storage/docs/gcs429.,

I don't have an MRE, but was wondering if there are any gcsfs knobs for controlling the request rate when writing Zarr chunks?

Thanks in advance!

cc @jbusecke

@martindurant (Member)

gcsfs.retry has the code to decide what to do with various error states. Obviously, this one should be caught in the retryable errors list, which will result in retries with exponential backoff, just what you need.

There's no general way to coordinate the number of requests across processes, and it's the total rate on the bucket that counts. All you can do is limit the number of concurrent requests per batch, see fsspec.asyn._get_batch_size for the relevant config values.
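For orientation, here is a rough sketch of that retry-with-backoff pattern (illustrative only; the real logic lives in gcsfs.retry, and the retry count and sleep constants here are made up):

```python
# Sketch of "retry only transient errors, with exponential backoff" -
# not gcsfs's actual implementation.
import asyncio
import random


async def call_with_backoff(coro_fn, *args, retries=6, is_retriable=lambda exc: False):
    for attempt in range(retries):
        try:
            return await coro_fn(*args)
        except Exception as exc:
            # give up on the last attempt, or if the error isn't transient
            if attempt == retries - 1 or not is_retriable(exc):
                raise
            # back off exponentially (capped), with a little jitter
            await asyncio.sleep(min(1.7**attempt + random.random(), 30))
```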

@norlandrhagen (Author)

Thanks for the response @martindurant!

I haven't touched the gcsfs/fsspec internals, so please excuse some possibly obvious questions!

...this one should be caught in the retryable errors list, which will result in retries with exponential backoff, just what you need.

Given the error above (line 117, in validate_response: raise HttpError(error) -> RuntimeError: gcsfs.retry.HttpError: ...), would you suggest adding HttpError to the RETRIABLE_EXCEPTIONS list?

RETRIABLE_EXCEPTIONS = (...)  # defined in gcsfs.retry

There's no general way to coordinate the number of requests across processes, and it's the total rate on the bucket that counts. All you can do is limit the number of concurrent requests per batch, see fsspec.asyn._get_batch_size for the relevant config values.

Do you have any tips for setting this _get_batch_size for gcsfs/fsspec? It would be nice to pass it in via the pangeo-forge-recipes FSSpecTarget fsspec_kwargs. This is what I tried:

import fsspec 
import gcsfs 

fsspec.config.conf = {'gather_batch_size':17}

fs = fsspec.filesystem('gs')
fs.batch_size 
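For reference, fsspec's async filesystems also accept a batch_size argument at construction time (that is where the fs.batch_size attribute above comes from). A hedged sketch, assuming GCSFileSystem forwards the kwarg through to fsspec's AsyncFileSystem and that fsspec_kwargs ends up as filesystem constructor kwargs:

```python
# Cap per-batch concurrency on the filesystem instance itself.
# The value 17 is purely illustrative.
import fsspec

fs = fsspec.filesystem("gs", batch_size=17)
print(fs.batch_size)  # -> 17

# Hypothetically, the same dict could be passed as fsspec_kwargs / storage options:
# {"batch_size": 17}
```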

@martindurant (Member)

would you suggest adding HttpError to the RETRIABLE_EXCEPTIONS list

No, HttpError on its own is far too general - but we could test the specific HttpError to see whether it's a "slow down" (429) response. I see that code 429 is already listed in the status codes we retry, so more specifics on what the server actually sent would be good.
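For illustration, a hedged sketch of the kind of check meant here, not gcsfs's actual internals; it assumes the raised HttpError carries the HTTP status as a .code attribute (check what the exception really holds in your gcsfs version):

```python
# Retry only when the server signals a transient / slow-down condition.
from gcsfs.retry import HttpError

RETRIABLE_STATUS_CODES = {408, 429, 500, 502, 503, 504}  # illustrative set


def is_slow_down(exc: BaseException) -> bool:
    """Return True if the server asked us to back off (HTTP 429 or similar)."""
    return isinstance(exc, HttpError) and getattr(exc, "code", None) in RETRIABLE_STATUS_CODES
```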

@martindurant (Member)

fsspec.config.conf['gather_batch_size'] = 17

is probably how you want to phrase it, so that copies of the dict are updated too. You can also put this in config files (any "~/.config/fsspec/*.json").
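A hedged end-to-end sketch of both options (the value 17 and the file name batch.json are illustrative):

```python
# Option 1: mutate the existing config dict in-process, before building the fs.
import fsspec

fsspec.config.conf["gather_batch_size"] = 17  # cap concurrent calls per batch
fs = fsspec.filesystem("gs")

# Option 2: persist it in an fsspec config file, e.g. ~/.config/fsspec/batch.json
# (any *.json in that directory is merged into fsspec.config.conf on import):
#   {"gather_batch_size": 17}
```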

@martindurant (Member)

more specifics on what the server actually sent would be good.

Did you see this again? Is it possible to establish what the HTTP error looked like, to make sure we retry it correctly in the future?
