Is it possible to add effects similar to Expressive speech-to-speech translation (S2ST)? #3288

meowlgm · 2023-11-23T01:44:46Z

meowlgm
Nov 23, 2023

Please check out this project: https://github.com/keonlee9420/STYLER.
This project can transfer the prosodic attributes of the source speech to the target speech, preserving the intonation, speech rate, emotional style, while only changing the content of speech. However, it seems that this project only supports the implementation within the same language.

Therefore, is it possible to combine this effect with XTTS, meaning first converting the source speech to text through ASR, translating the text into the desired target language, then synthesizing the text into the target language speech using XTTS, while transferring the prosodic attributes of the source speech during the synthesis process, ultimately achieving an effect similar to Expressive speech-to-speech translation (S2ST)?

Shayano · 2024-06-06T05:44:05Z

Shayano
Jun 6, 2024

The request is old but have you tried or found another solution to be able to do it?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is it possible to add effects similar to Expressive speech-to-speech translation (S2ST)? #3288

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Is it possible to add effects similar to Expressive speech-to-speech translation (S2ST)? #3288

meowlgm Nov 23, 2023

Replies: 1 comment

Shayano Jun 6, 2024

meowlgm
Nov 23, 2023

Shayano
Jun 6, 2024