This repository contains the code that performs daily exports of the Open Food Facts database.
It fetches the JSON dumps and generate and pushes the Parquet export to Hugging Face.
We want to move all exports from the main off server to this new service to reduce load on the main server and to make exports more robust.
Two services are currently running:
- a scheduler, whose only role is to trigger the export service every day
- rq workers, that take care of all tasks: JSONL file download, Parquet export,...
The following exports are yet to migrate:
- CSV (en, fr)
- RDF (en, fr)
- daily diffs
We're also considering to move JSONL and MongoDB exports from Product Opener to this service, so that Product Opener does not handle exports anymore.