Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EIS Fire Ingest tweaking #156

Open
5 tasks
mccabete opened this issue Jul 24, 2024 · 2 comments
Open
5 tasks

EIS Fire Ingest tweaking #156

mccabete opened this issue Jul 24, 2024 · 2 comments

Comments

@mccabete
Copy link

Contact Details

[email protected]

URL/DOI

N/A

Data License Identifier

N/A

Data Location

s3://maap-ops-workspace/shared/gsfc_landslides/FEDSoutput-s3-conus/

Size Estimate

N/A

Number of Items

N/A

Description

This is the EIS Fire layers that are being exported to the API at https://nasa-impact.github.io/veda-docs/notebooks/quickstarts/wfs.html

Collection Creation Notebook

N/A

Item Creation Notebook

N/A

Checklist

Any additional info you think is relevant, possibly including spatial or temporal subset if applicable?

I am looking for more understanding and control over what data gets fed into the API. Right now, I am a little hazy on the workflow, but want some redundancy with @ranchodeluxe who has been doing all the API support up until now. These are the situations that come up that I would like enough tools/agency over the ingest to respond to:

Situation 1: "The API isn't updating -- why?"

This has come up a few times. I can help check on the data generation on the back-end, but don' have enough transparency to check on any of the issue that crop up after the data get generated. This is usually time sensitive, because we only catch it when we go look in the data for some big fire.

Situation 2: "We need to add a new region to the API real fast"

If a new place starts having an extreme fire season, we try to point the algorithm there. We have gotten a lot of help with spinning up our algorithm in new places, but (I'm at least) still in the dark about how to then get that data into the API as a new feature layer. This also is usually time sensitive because we are trying to spin up measurements of the fire season as it's happening.

Situation 3: "We want to label and organize the data differently before we export it to the public"

There are some columns that make sense to keep around for researchers, but are too much information for the API. Also, we're working with more public-facing systems now (FIRMS) and may need to tweak our column names, or generate new columns now that we are working with a different community. We could use more opportunities to maintain how the data change between data generation and API.

Tagging @eorland (who is also interested in this) and @smohiudd.

To Do

  • Open PR for publishing those datasets to the Staging API
  • Notify QA / move ticket to QA state
  • Once approved, merge and close
@smohiudd
Copy link
Contributor

smohiudd commented Jul 24, 2024

@mccabete here is the script that runs in AWS Eventbridge to trigger new a ingest as assets are added to the buckets. I was able to find the IAC that creates that function (it wasn't created manually in the console as I originally thought): https://github.com/NASA-IMPACT/veda-features-api/tree/main/terraform/veda-wfs3/functions/s3_event_bridge_to_sfn_execute

Actions going forward for @mccabete :

  • confirm if the buckets noted in the script above are correct. Create a PR from here if there are changes needed.
  • confirm in if there are new collections needed or column name changes in the Airflow ingest handler. Please start a pull request and we can review.

@ranchodeluxe
Copy link

@mccabete: Can you set up a time today or tomorrow with me (and maybe one intern)? Then we can pair on doing this work in v2 and v3. This should help flesh out your questions above and we can backfill any existing documentation as part of this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants