You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It took me a while to get a grip on the data. I ended up writing an HTML scraper to get the JSON out of Zenodo, it is not ideal, but the Wikidata bot now works. I cherry-picked a few treatments from the main plazi website and created the following wikidata items: https://w.wiki/5A3h
There are more treatments added, which did not show up in the query above. This was because of some issues with publication links in their associated publications. This query will show all treatments currently in Wikidata: https://w.wiki/5A3i .
I can now add more treatments to wikidata given a set of plazi UUIDs. The bot uses both the RDF and the JSON from zenodo. I would able to rely on the RDF only if the following changes are made to the RDF:
Add the DOI of the scientific publication associated with the treatment. In some cases it often contains zenodo intermediate DOIs, which need to be resolved through the json.
Add the locations to the RDF. The second item the bot takes from the JSON is the location coordinates.
Use URIs and rdfs:label in the RDF. The taxonomic tree, currently uses literals for the different clades. For each clade the complete parent branch is repeated. Can this be simplified by changing the clades from strings to URIs? as in this example:"
The next step is to request a bot account and/or permission to do this on scale. But I propose to first discuss the current schema on Wikidata and make some possible adaptations.
Cheers,
Andra
The text was updated successfully, but these errors were encountered:
Hi Donat,
It took me a while to get a grip on the data. I ended up writing an HTML scraper to get the JSON out of Zenodo, it is not ideal, but the Wikidata bot now works. I cherry-picked a few treatments from the main plazi website and created the following wikidata items: https://w.wiki/5A3h
There are more treatments added, which did not show up in the query above. This was because of some issues with publication links in their associated publications. This query will show all treatments currently in Wikidata: https://w.wiki/5A3i .
I can now add more treatments to wikidata given a set of plazi UUIDs. The bot uses both the RDF and the JSON from zenodo. I would able to rely on the RDF only if the following changes are made to the RDF:
<http://taxon-concept.plazi.org/id/Animalia/Brighstoneus_simmondsi_Lockwood_2021> a dwcFP:TaxonConcept ;
Would become:
The next step is to request a bot account and/or permission to do this on scale. But I propose to first discuss the current schema on Wikidata and make some possible adaptations.
Cheers,
Andra
The text was updated successfully, but these errors were encountered: