geoffbacon/verrius

Verrius

Part-of-speech tagger and lemmatizer for Latin.

To do

  • POS tagging
    • Postprocessing
    • Evaluation
      • Errors
    • Output and evaluation script
    • Facilitate engagement
    • Lint and clean up
    • Preprocessing
      • Start/end sentence boundaries
    • Re-train
      • Increase token embedding, character embedding and hidden sizes
      • Fix batch size to 8
  • Lemmatization
    • Preprocessing
  • External unlabelled data
  • External labelled data
  • Data augmentation methods
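
The "Start/end sentence boundaries" preprocessing item could be sketched roughly as follows. This is only an illustration under stated assumptions: the marker strings `<s>` and `</s>`, the placeholder tag `BOUNDARY`, and the function name `add_boundaries` are all hypothetical and are not taken from this repository.

```python
from typing import List, Optional, Tuple

# Hypothetical marker strings and tag; the repository does not say which it uses.
BOS = "<s>"
EOS = "</s>"
BOUNDARY_TAG = "BOUNDARY"


def add_boundaries(
    tokens: List[str], tags: Optional[List[str]] = None
) -> Tuple[List[str], Optional[List[str]]]:
    """Wrap one tokenized sentence (and, if given, its tags) in boundary markers."""
    padded_tokens = [BOS, *tokens, EOS]
    if tags is None:
        return padded_tokens, None
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")
    # Boundary positions get a placeholder tag so both sequences stay aligned.
    padded_tags = [BOUNDARY_TAG, *tags, BOUNDARY_TAG]
    return padded_tokens, padded_tags


if __name__ == "__main__":
    toks, tgs = add_boundaries(["puella", "rosam", "amat"], ["NOUN", "NOUN", "VERB"])
    print(toks)
    print(tgs)
```

If markers like these are added before training, the placeholder tag positions would normally be excluded again before scoring, so that they do not inflate the reported numbers.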

Current POS score

0.957

Current lemmatization score

0.905
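
The scores above do not name their metric. Assuming they are token-level accuracy (an assumption, not something this README states), a minimal evaluation sketch might look like the following. The function names `token_accuracy` and `confusions` are hypothetical; the second helper is one possible starting point for the "Errors" item in the to-do list.

```python
from collections import Counter
from typing import List, Sequence, Tuple


def _aligned_pairs(
    gold: Sequence[Sequence[str]], predicted: Sequence[Sequence[str]]
) -> List[Tuple[str, str]]:
    """Flatten two parallel lists of sentences into (gold, predicted) label pairs."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must contain the same number of sentences")
    pairs: List[Tuple[str, str]] = []
    for gold_sent, pred_sent in zip(gold, predicted):
        if len(gold_sent) != len(pred_sent):
            raise ValueError("each sentence pair must have the same number of tokens")
        pairs.extend(zip(gold_sent, pred_sent))
    return pairs


def token_accuracy(
    gold: Sequence[Sequence[str]], predicted: Sequence[Sequence[str]]
) -> float:
    """Fraction of tokens whose predicted label equals the gold label."""
    pairs = _aligned_pairs(gold, predicted)
    if not pairs:
        raise ValueError("cannot score an empty dataset")
    return sum(g == p for g, p in pairs) / len(pairs)


def confusions(
    gold: Sequence[Sequence[str]], predicted: Sequence[Sequence[str]]
) -> Counter:
    """Count each (gold, predicted) pair that disagrees, for error analysis."""
    return Counter((g, p) for g, p in _aligned_pairs(gold, predicted) if g != p)


if __name__ == "__main__":
    gold = [["NOUN", "NOUN", "VERB"], ["VERB"]]
    pred = [["NOUN", "ADJ", "VERB"], ["VERB"]]
    print(token_accuracy(gold, pred))            # 0.75
    print(confusions(gold, pred).most_common())  # [(('NOUN', 'ADJ'), 1)]
```

The same functions work for lemmatization if the label sequences hold lemmas instead of tags.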
