Skip to content

Latest commit

 

History

History
88 lines (58 loc) · 2.34 KB

data_about.md

File metadata and controls

88 lines (58 loc) · 2.34 KB

Data Files

Located in the data directory.

Original RDF ontology file.

Ontology exported to csv and converted to txt. Each row has an entity and all its properties, delimited by the vertical pipe |.

all_owl.txt converted to comma-separated excel file to read the data easier.

Q identifiers for every entity in the local Wikibase instance Formatted as:

{ 'label': (id number as integer without Q) }

P identifiers for properties imported from Wikidata. RFrmatted as:

{ 'label': (id number as integer without P) }

Entities for each category formatted as json objects. Object structure is as follows:

label : {
    original_property : [
        "value(s)" ,
        "as" , 
        "list"
    ],
    Q : "Q Value or None",
    wiki: {
        wikidata_property : [ 
            "P_value" , 
            { 
                value_label : "Q Value or None" 
            }
        ]
    }
}

Entities folder contains any edited/refined versions of the json files where relevant. The old_versions directory contains the originals.

Output of any errors that occur during different steps.

Step 2-1: prop_import_errors.txt and entity_import_errors.txt

Step 2-2: prop_import_errors_people.txt and entity_import_errors_people.txt

Step 2-3: collection_import_errors.txt

Step 4: wiki_prop_errors.txt

All imports (to record non-unique labels): disambiguate.txt

Properties from Wikidata extracted from the entity json files. People properties in properties_export_people.csv, others in properties_export.csv. Formatted as:

Property Label PID Type Described Usage Count

The files properties_people.xlsx and properties_other.xslx contain this data for sorting and cutting out properties that won't be added to the Wikibase. There are also columns for local identifiers and the property description to copy from Wikibase.

The final properties to be imported are in props_for_import.csv, formatted as:

Label Description