Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is it possible to incorporate POS tag info to aid alignment? #53

Open
stelmath opened this issue Mar 2, 2023 · 1 comment
Open

Is it possible to incorporate POS tag info to aid alignment? #53

stelmath opened this issue Mar 2, 2023 · 1 comment

Comments

@stelmath
Copy link

stelmath commented Mar 2, 2023

Hello and many thanks for sharing the project

I have an open question/discussion: would it be possible to incorporate the POS information of each token during training? For example, by having a new loss function that tries to minimize POS tag mismatching from source to target token. This comes from the idea that if a source token is a Noun in the source language, it will most likely also be a Noun in the target language. Same would go for Verbs etc. or other high-level POS tags. What are your thoughts on this?

Thank you

@zdou0830
Copy link
Collaborator

zdou0830 commented Mar 5, 2023

Hello, thank you for the suggestion! yes I think incorporating the POS tag information into training may improve the model performance. maybe you can start by doing this at inference time (e.g. enforcing the extracted aligned word pairs to have the same POS) and see if the results can improve, then investigate potential training objectives.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants