From 69a163177dff15b66d8c457cc0f0c470e40145f4 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Tim=20von=20K=C3=A4nel?= <35628149+dunky11@users.noreply.github.com> Date: Thu, 14 Jul 2022 19:03:17 +0200 Subject: [PATCH] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 3546f34..8393679 100644 --- a/README.md +++ b/README.md @@ -1,4 +1,4 @@ -# VoiceSmith [WIP] +# VoiceSmith [Work in Progress] VoiceSmith makes it possible to train and infer on both single and multispeaker models without any coding experience. It fine-tunes a pretty solid text to speech pipeline based on a modified version of [DelightfulTTS](https://arxiv.org/abs/2110.12612) and [UnivNet](https://arxiv.org/abs/2106.07889) on your dataset. Both models were pretrained on a proprietary 5000 speaker dataset. It also provides some tools for dataset preprocessing like automatic text normalization.