Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Invalid / partial XML granule_metadata.xml files #6

Open
piyushrpt opened this issue Jun 9, 2023 · 5 comments
Open

Invalid / partial XML granule_metadata.xml files #6

piyushrpt opened this issue Jun 9, 2023 · 5 comments
Labels
bug Something isn't working

Comments

@piyushrpt
Copy link

We are noticing an increased number of partial / corrupted and invalid XML metadata files over the last few days. The imagery is fine but when we try to retrieve information from the granule_metadata.xml files these are usually truncated/ corrupted compared to the ones in the original scihub granules. Possibly in an issue with copying these out in chunks from the original source

Example: S2A_19VDG_20230604_0_L2A
https://earth-search.aws.element84.com/v1/collections/sentinel-2-l2a/items/S2A_19VDG_20230604_0_L2A
https://sentinel-cogs.s3.us-west-2.amazonaws.com/sentinel-s2-l2a-cogs/19/V/DG/2023/6/S2A_19VDG_20230604_0_L2A/granule_metadata.xml

@piyushrpt
Copy link
Author

At current count, number of impacted granules are more than 3500 over the last week.

@piyushrpt
Copy link
Author

Looks like another 750-800 scenes have this issue since the last time I reported. Is there an alternate way to handle this - other than going back to the scihub granules for the xml metadata. This is holding up analytics pipelines that rely on detailed metadata from the xml file.

@piyushrpt
Copy link
Author

Any updates on this?

This had also been reported earlier here: https://github.com/cirrus-geo/cirrus-earth-search/issues/39

@matthewhanson
Copy link
Member

@piyushrpt Currently resolving the other issues but will be looking at this problem this week.

@tonykgill
Copy link

Hi folks,

I'm doing some backprocessing for tiles over Australia and am seeing a few truncated metadata files too. List attached. I'll add more if I find any. The problem seems to be constrained to early June.

extract-2023-09-01T00_15_01.758Z.csv

This is not holding us up. We fall back to using the metadata in the Frankfurt s3 bucket, s3://sentinel-s2-l2a/tiles, which is fine for the corresponding files.

Tony

@gadomski gadomski added the bug Something isn't working label Sep 1, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants