Is it possible to incorporate POS tag info to aid alignment? #53

stelmath · 2023-03-02T10:16:21Z

Hello and many thanks for sharing the project

I have an open question/discussion: would it be possible to incorporate the POS information of each token during training? For example, by having a new loss function that tries to minimize POS tag mismatching from source to target token. This comes from the idea that if a source token is a Noun in the source language, it will most likely also be a Noun in the target language. Same would go for Verbs etc. or other high-level POS tags. What are your thoughts on this?

Thank you

zdou0830 · 2023-03-05T04:36:55Z

Hello, thank you for the suggestion! yes I think incorporating the POS tag information into training may improve the model performance. maybe you can start by doing this at inference time (e.g. enforcing the extracted aligned word pairs to have the same POS) and see if the results can improve, then investigate potential training objectives.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is it possible to incorporate POS tag info to aid alignment? #53

Is it possible to incorporate POS tag info to aid alignment? #53

stelmath commented Mar 2, 2023

zdou0830 commented Mar 5, 2023

Is it possible to incorporate POS tag info to aid alignment? #53

Is it possible to incorporate POS tag info to aid alignment? #53

Comments

stelmath commented Mar 2, 2023

zdou0830 commented Mar 5, 2023