Skip to content

A service that takes care of daily exports of the database, in various formats

License

Notifications You must be signed in to change notification settings

openfoodfacts/openfoodfacts-exports

Repository files navigation

Open Food Facts exports

This repository contains the code that performs daily exports of the Open Food Facts database.

It fetches the JSON dumps and generate and pushes the Parquet export to Hugging Face.

We want to move all exports from the main off server to this new service to reduce load on the main server and to make exports more robust.

Architecture

Two services are currently running:

  • a scheduler, whose only role is to trigger the export service every day
  • rq workers, that take care of all tasks: JSONL file download, Parquet export,...

The following exports are yet to migrate:

  • CSV (en, fr)
  • RDF (en, fr)
  • daily diffs

We're also considering to move JSONL and MongoDB exports from Product Opener to this service, so that Product Opener does not handle exports anymore.