-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New gbifIDs issued for unchanged occurrence records #1105
Comments
Ping @MattBlissett. If there's a relatively quick fix & reharvest because issues are discovered to be a result of processing datasetKey e88fdc62-ea3b-41a3-8967-b3780f602c0b and 85714c48-f762-11e1-a439-00145eb45e9a & not originating at their sources, I'll hold off on a Bionomia refresh. Otherwise, I'll proceed and swallow the ~20k broken links. |
Hi @dshorthouse I can't diagnose this fully tonight, but I looked quickly at the last example. It looks like something might have changed at the source and produced a broken archive. If you look at the DwC-A for the dataset you will find the CSV headers and the metadata are contradictory (swapped In the CSV the first columns read:
but in the meta.xml:
Because the Could this explain other anomalies you see please? (Pinging @jholetschek too) |
Thanks @timrobertson100 for the prompt response. What you discovered appears to be the source of some new |
Hi Tim, David, yes, there was a bug in the DwC archive generation in BioCASe, which caused the entries occurrenceStatus and occurrenceID in But the other dataset (https://www.gbif.org/dataset/e88fdc62-ea3b-41a3-8967-b3780f602c0b) doesn't have a DwC endpoint, and the contents hasn't been changed in years, it's a static dataset. Cheers |
For some BioCASE-based datasets that lack source occurrenceIDs such as https://www.gbif.org/dataset/e88fdc62-ea3b-41a3-8967-b3780f602c0b, I've seen re-issuance of new gbifIDs upon recent re-harvest but apparently no significant/relevant new adjustments or enhancements were made to records in their source. For instance, this occurrence record is deprecated https://www.gbif.org/occurrence/1946076733 whereas it has been issued a new gbifID, https://www.gbif.org/occurrence/4981443192. In other instances, it looks like the interpretation of
occurrenceID
andoccurrenceStatus
has been mistakenly swapped, resulting in an issuance of new gbifIDs. See https://www.gbif.org/occurrence/144842729 and what appears to be its replacement https://www.gbif.org/occurrence/4979384474 by way of example.The text was updated successfully, but these errors were encountered: