EIS Fire Ingest tweaking #156

mccabete · 2024-07-24T16:06:11Z

Contact Details

[email protected]

URL/DOI

N/A

Data License Identifier

N/A

Data Location

s3://maap-ops-workspace/shared/gsfc_landslides/FEDSoutput-s3-conus/

Size Estimate

N/A

Number of Items

N/A

Description

These are the EIS Fire layers that are being exported to the API at https://nasa-impact.github.io/veda-docs/notebooks/quickstarts/wfs.html

Collection Creation Notebook

N/A

Item Creation Notebook

N/A

Checklist

Files are valid COGs. Use rio cogeo validate
COGs appear to be correct

Any additional info you think is relevant, possibly including spatial or temporal subset if applicable?

I am looking for more understanding of, and control over, what data gets fed into the API. Right now I am a little hazy on the workflow, and I want some redundancy with @ranchodeluxe, who has been doing all the API support up until now. These are the situations that come up, and I would like enough tools and agency over the ingest to respond to them:

Situation 1: "The API isn't updating -- why?"

This has come up a few times. I can help check on the data generation on the back-end, but I don't have enough transparency to check on any of the issues that crop up after the data get generated. This is usually time sensitive, because we only catch it when we go look in the data for some big fire.
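One way to make this check routine is to compare the newest timestamp in the output bucket with the newest timestamp the API serves. The sketch below covers only the comparison step. The function names and the 12-hour tolerance are assumptions for illustration, and fetching the two timestamps (from a bucket listing and an API query) is not shown.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical tolerance: how far the API may trail the output bucket
# before the ingest is worth investigating.
TOLERANCE = timedelta(hours=12)


def ingest_lag(newest_output, newest_api_feature):
    """Return how far the API trails the newest generated output."""
    return newest_output - newest_api_feature


def is_stale(newest_output, newest_api_feature, tolerance=TOLERANCE):
    """True when the API trails the output bucket by more than the tolerance."""
    return ingest_lag(newest_output, newest_api_feature) > tolerance


# Example values only; real ones would come from the bucket and the API.
newest_output = datetime(2024, 7, 24, 12, 0, tzinfo=timezone.utc)
newest_api_feature = datetime(2024, 7, 22, 12, 0, tzinfo=timezone.utc)
print(is_stale(newest_output, newest_api_feature))
```

Run on a schedule, a check like this would flag a stalled ingest before someone goes looking for a specific fire.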

Situation 2: "We need to add a new region to the API real fast"

If a new place starts having an extreme fire season, we try to point the algorithm there. We have gotten a lot of help with spinning up our algorithm in new places, but I, at least, am still in the dark about how to then get that data into the API as a new feature layer. This is also usually time sensitive, because we are trying to spin up measurements of the fire season as it's happening.

Situation 3: "We want to label and organize the data differently before we export it to the public"

There are some columns that make sense to keep around for researchers, but are too much information for the API. Also, we're working with more public-facing systems now (FIRMS) and may need to tweak our column names, or generate new columns, now that we are working with a different community. We could use more opportunities to control how the data change between data generation and the API.
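As a rough illustration of what that control could look like, the sketch below keeps a single mapping from internal column names to public ones and drops everything not listed. Every column name here is hypothetical and not taken from the actual fire outputs, and where such a step would sit in the ingest is an open question.

```python
# Hypothetical mapping: internal column name -> public column name.
# Columns that are not listed are dropped from the public export.
PUBLIC_COLUMNS = {
    "fireid": "fire_id",
    "farea": "area_km2",
    "t": "observed_at",
}


def to_public(record):
    """Keep only the mapped columns and rename them for the public API."""
    return {
        public: record[internal]
        for internal, public in PUBLIC_COLUMNS.items()
        if internal in record
    }


# Example record with one research-only column that should not be exported.
row = {"fireid": 17, "farea": 2.5, "t": "2024-07-24T00:00:00Z", "debug_flag": True}
print(to_public(row))
```

Keeping the mapping in one place means a rename requested by a new community is a one-line change that can be reviewed in a pull request.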

Tagging @eorland (who is also interested in this) and @smohiudd.

To Do

Open PR for publishing those datasets to the Staging API
Notify QA / move ticket to QA state
Once approved, merge and close


smohiudd · 2024-07-24T21:06:14Z

@mccabete here is the script that runs in AWS EventBridge to trigger a new ingest as assets are added to the buckets. I was able to find the IaC that creates that function (it wasn't created manually in the console as I originally thought): https://github.com/NASA-IMPACT/veda-features-api/tree/main/terraform/veda-wfs3/functions/s3_event_bridge_to_sfn_execute
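For orientation, the general shape of such a trigger filter might look like the sketch below. This is not the linked script. It assumes the function receives S3 "Object Created" events in the EventBridge format, where the bucket name and object key sit under `detail`. The `WATCHED` mapping and `should_trigger` are hypothetical, and the bucket and prefix are taken from the Data Location field in this issue, not from the deployed configuration.

```python
# Hypothetical allowlist: bucket name -> key prefix that should start an ingest.
# Taken from this issue's Data Location; the deployed values may differ.
WATCHED = {
    "maap-ops-workspace": "shared/gsfc_landslides/FEDSoutput-s3-conus/",
}


def should_trigger(event):
    """Return True when an S3 object event matches a watched bucket and prefix."""
    detail = event.get("detail", {})
    bucket = detail.get("bucket", {}).get("name")
    key = detail.get("object", {}).get("key", "")
    prefix = WATCHED.get(bucket)
    return prefix is not None and key.startswith(prefix)


# Example event trimmed to the fields used above; the file name is made up.
example_event = {
    "detail": {
        "bucket": {"name": "maap-ops-workspace"},
        "object": {"key": "shared/gsfc_landslides/FEDSoutput-s3-conus/example.fgb"},
    }
}
print(should_trigger(example_event))
```

If the real function follows a similar pattern, adding a new region would mostly be a matter of extending the watched buckets or prefixes.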

Actions going forward for @mccabete :

confirm if the buckets noted in the script above are correct. Create a PR from here if there are changes needed.
confirm if there are new collections needed or column name changes in the Airflow ingest handler. Please start a pull request and we can review.

ranchodeluxe · 2024-07-29T13:55:58Z

@mccabete: Can you set up a time today or tomorrow with me (and maybe one intern)? Then we can pair on doing this work in v2 and v3. This should help flesh out your questions above, and we can backfill any existing documentation as part of this.
