Long words - right bucket size/parameters ? #196

entenbein · 2020-07-29T08:34:11Z

Hi folks,

I trained a model for German and now I'm struggling with predicted output for longer words (e.g. 39 letters ≙ 34 phones, yeah German...). Meaning for the predicted words the last phones are repeated over and over again.

So for training I set max_length=50. The results got better but there are some phone repetitions still.

How do the other to bucket parameters influence the predicted transcriptions?

Thanks alot!

The text was updated successfully, but these errors were encountered:

nshmyrev · 2020-07-29T08:50:58Z

You'd better try something modern transformer architecture, not seq2seq.

entenbein · 2020-07-29T08:53:55Z

Alright, which ones would you suggest?

nshmyrev · 2020-07-29T08:56:33Z

Maybe https://github.com/hajix/G2P

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Long words - right bucket size/parameters ? #196

Long words - right bucket size/parameters ? #196

entenbein commented Jul 29, 2020

nshmyrev commented Jul 29, 2020

entenbein commented Jul 29, 2020

nshmyrev commented Jul 29, 2020

Long words - right bucket size/parameters ? #196

Long words - right bucket size/parameters ? #196

Comments

entenbein commented Jul 29, 2020

nshmyrev commented Jul 29, 2020

entenbein commented Jul 29, 2020

nshmyrev commented Jul 29, 2020