Skip to content

V9.0.0 - new data, summaries and entity salience

Compare
Choose a tag to compare
@amir-zeldes amir-zeldes released this 02 Feb 18:55
· 257 commits to master since this release
5f724df
  • 20 documents added including more conversational data (total tokens: 203,879)
  • Abstractive summaries for each document in metadata
  • Annotations for most salient entities in each document
  • Foreign language tags identify individual source languages
  • New process for reconstructing Reddit text data in top-level folders (see README.md)
  • Many corrections to all annotation layers