An Automatically Constructed Knowledge Graph for the Manuscripts of Giacomo Leopardi

This repository contains the data and source code for a prototype Knowledge Extraction application for the manuscripts of Giacomo Leopardi preserved at the Cambridge University Digital Library.

The automatically extracted Knowledge Graph is available in this Turtle File.

Pipeline

The Knowledge Graph was extracted using Babelscape's REBEL together with ChatGPT, which was used to pre-process the Italian text into natural-language triples. An illustration of our pipeline is presented below.
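As a rough sketch of the final stage of such a pipeline, the snippet below turns REBEL-style linearized output into (head, relation, tail) triples and serializes them as N-Triples lines, which are also valid Turtle. It assumes the `<triplet> head <subj> tail <obj> relation` layout described on the REBEL model card. The function names, the `http://example.org/` namespace, and the sample string are hypothetical, and the ChatGPT pre-processing step is not shown. The scripts in this repository may work differently.

```python
from typing import List, Tuple
from urllib.parse import quote

Triple = Tuple[str, str, str]  # (head, relation, tail)


def parse_rebel_output(text: str) -> List[Triple]:
    """Parse REBEL-style linearized text into (head, relation, tail) triples.

    Assumed layout: <triplet> HEAD <subj> TAIL <obj> RELATION, where further
    '<subj> TAIL <obj> RELATION' groups reuse the most recent HEAD.
    """
    # Drop sequence markers that a decoder may leave in the text.
    for special in ("<s>", "</s>", "<pad>"):
        text = text.replace(special, " ")

    triples: List[Triple] = []
    fields = {"head": [], "tail": [], "relation": []}
    state = None  # name of the field that the next plain token belongs to

    def emit() -> None:
        # Store a triple only when all three fields have been filled in.
        if all(fields.values()):
            triples.append(
                (
                    " ".join(fields["head"]),
                    " ".join(fields["relation"]),
                    " ".join(fields["tail"]),
                )
            )

    for token in text.split():
        if token == "<triplet>":
            emit()
            fields = {"head": [], "tail": [], "relation": []}
            state = "head"
        elif token == "<subj>":
            emit()
            fields["tail"], fields["relation"] = [], []
            state = "tail"
        elif token == "<obj>":
            fields["relation"] = []
            state = "relation"
        elif state is not None:
            fields[state].append(token)
    emit()
    return triples


def to_ntriples(triples: List[Triple], base: str = "http://example.org/") -> List[str]:
    """Serialize triples as N-Triples lines under a hypothetical namespace."""

    def iri(kind: str, label: str) -> str:
        # Percent-encode the label so that the IRI stays syntactically valid.
        return f"<{base}{kind}/{quote(label.replace(' ', '_'))}>"

    return [
        f"{iri('entity', h)} {iri('relation', r)} {iri('entity', t)} ."
        for h, r, t in triples
    ]


if __name__ == "__main__":
    # Hand-written sample in the assumed format, not actual model output.
    sample = (
        "<s><triplet> Giacomo Leopardi <subj> Recanati <obj> place of birth "
        "<subj> Zibaldone <obj> notable work</s>"
    )
    for line in to_ntriples(parse_rebel_output(sample)):
        print(line)
```

Keeping the parser separate from the serializer means extracted triples can be filtered or linked to external identifiers before any RDF is written.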

About

The source code and data published here are the result of research for the digitization project on the manuscripts of Giacomo Leopardi, carried out by the National Center of Leopardian Studies and the University of Macerata.

For additional information, you can contact:

Cristian Santini ([email protected])

