POS Tagger with Stanza pipeline #2

dxv2k · 2020-12-05T10:20:20Z

DATA ANNOTATION: Penn Treebank 2 format

WORKFLOW with Stanza:
Document (in CoNLL-U format, or converted to it) -> Sentence Segmentation -> Tokenization and Multi-word Token (MWT) Expansion -> POS Tagging
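A minimal sketch of this workflow with Stanza's Python API, assuming the English default models and raw-text input. The helper names `xpos_pairs` and `tag_with_stanza` are hypothetical and only for illustration; `stanza.download`, `stanza.Pipeline`, and the `sentences` / `words` / `xpos` attributes come from Stanza itself.

```python
def xpos_pairs(doc):
    """Collect (token text, XPOS tag) pairs, one list per sentence.

    Works on any object that exposes .sentences, each with .words,
    where every word has .text and .xpos (as a Stanza Document does).
    """
    return [[(word.text, word.xpos) for word in sent.words]
            for sent in doc.sentences]


def tag_with_stanza(text, lang="en"):
    """Run sentence segmentation, tokenization and POS tagging on raw text."""
    import stanza  # imported lazily so this file loads without Stanza installed

    stanza.download(lang)  # fetches the pre-trained models on first use
    nlp = stanza.Pipeline(lang=lang, processors="tokenize,pos")
    return xpos_pairs(nlp(text))
```

Calling `tag_with_stanza("Dogs bark. Cats sleep.")` should return one list of (token, tag) pairs per sentence; the first call needs network access to download the models.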

NOTICE:

We work with English, so multi-word token (MWT) expansion is not needed.
To train or evaluate with Stanza, the data must be in CoNLL-U format.
Use Stanza's pre-trained tokenizer and POS tagger as a baseline to compare against the Viterbi algorithm, using the XPOS field (Penn Treebank tags).
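Neural pipeline in Stanza: the POS tagger is a bidirectional LSTM with biaffine scoring. (The maximum entropy cyclic dependency network is the older Stanford CoreNLP tagger, not Stanza's.)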
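The comparison on the XPOS field could be sketched roughly as follows. This is a pure-Python illustration, not the project's actual evaluation code: `read_conllu_xpos`, `tag_accuracy`, the sample sentence, and the "Viterbi" output are all hypothetical. It relies only on the CoNLL-U column order (ID, FORM, LEMMA, UPOS, XPOS, ...), where XPOS is the fifth tab-separated column.

```python
def read_conllu_xpos(text):
    """Parse CoNLL-U text into sentences of (FORM, XPOS) pairs."""
    sentences, current = [], []
    for line in text.splitlines():
        if not line.strip():          # a blank line ends a sentence
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):      # sentence-level comment
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
        if "-" in cols[0] or "." in cols[0]:
            continue                  # skip MWT ranges and empty nodes
        current.append((cols[1], cols[4]))   # FORM, XPOS
    if current:
        sentences.append(current)
    return sentences


def tag_accuracy(gold, predicted):
    """Token-level accuracy between two lists of per-sentence tag lists."""
    if len(gold) != len(predicted):
        raise ValueError("sentence counts differ")
    correct = total = 0
    for g_sent, p_sent in zip(gold, predicted):
        if len(g_sent) != len(p_sent):
            raise ValueError("token counts differ; tokenization must match")
        correct += sum(g == p for g, p in zip(g_sent, p_sent))
        total += len(g_sent)
    return correct / total if total else 0.0


# Hypothetical gold data and tagger output, for illustration only.
SAMPLE = (
    "# text = Dogs bark .\n"
    "1\tDogs\tdog\tNOUN\tNNS\t_\t2\tnsubj\t_\t_\n"
    "2\tbark\tbark\tVERB\tVBP\t_\t0\troot\t_\t_\n"
    "3\t.\t.\tPUNCT\t.\t_\t2\tpunct\t_\t_\n"
)
gold_tags = [[tag for _, tag in sent] for sent in read_conllu_xpos(SAMPLE)]
viterbi_tags = [["NNS", "NN", "."]]   # made-up Viterbi output
accuracy = tag_accuracy(gold_tags, viterbi_tags)
```

The same `tag_accuracy` call can score Stanza's XPOS output against the gold tags, provided both taggers run on identical tokenization.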

Essential libraries and other components will be listed later.

dxv2k added the documentation label Dec 5, 2020
