Few-shot multilingual TTS with RVC and VITS

A tool for training multilingual speech synthesis from a small dataset (5-10 minutes of audio) using RVC. Most speech synthesis models require vast amounts of data, but that much data is not always available. This repository started with the idea of "Then why don't we clone a dataset and use it?"

0. Process

RVC training with a small dataset
Dataset cloning with the trained RVC model
VITS training
Inference

1. Pre-requisites

Python >= 3.8
Download RVC-VITS.zip and unzip it.
Install the Python requirements; see requirements.txt.
1. You may need to install espeak first: apt-get install espeak
Install requirements.txt and torch by running the setup script:

./set_env.sh

Put the dataset into the rvc_dataset directory using the following file structure. In this experiment, I used 50 wav files from the LJSpeech dataset (330 seconds of audio).

rvc_dataset
├───ljs
│   ├───LJ001-0001.wav
│   ├───LJ001-0002.wav
│   ├───...
│   └───LJ001-0050.wav
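
Because this approach depends on only a few minutes of audio, it can be useful to check how many files and how much audio a speaker folder actually holds before training. The following is a minimal sketch, assuming the files are uncompressed PCM WAV that Python's standard wave module can read. The function name summarize_speaker_dir is hypothetical and not part of this repository.

```python
import wave
from pathlib import Path


def summarize_speaker_dir(speaker_dir):
    """Return (number of .wav files, total duration in seconds) for one speaker folder."""
    wav_paths = sorted(Path(speaker_dir).glob("*.wav"))
    total_seconds = 0.0
    for path in wav_paths:
        # Duration of one file = number of frames / frames per second.
        with wave.open(str(path), "rb") as wav_file:
            total_seconds += wav_file.getnframes() / wav_file.getframerate()
    return len(wav_paths), total_seconds


if __name__ == "__main__":
    # Path taken from the layout above; adjust the speaker name as needed.
    count, seconds = summarize_speaker_dir("rvc_dataset/ljs")
    print(f"{count} wav files, {seconds:.1f} seconds of audio")
```

For the layout above, the count should come out at 50 files if the folder matches the experiment described here.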

2. Training

./train_rvc.sh ljs 500
# To train Korean TTS, change ja to ko (ja -> Japanese, ko -> Korean, en -> English)
./make_dataset.sh ljs ja
./train_vits.sh ljs
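
The three commands above run in a fixed order, and the language code passed to make_dataset.sh selects the target language. As a rough sketch, assuming the scripts are invoked from the repository root with exactly the arguments shown above, a small Python helper could build the command list and reject an unknown language code before anything runs. The names build_pipeline, run_pipeline, SUPPORTED_LANGUAGES, and rvc_arg are hypothetical; the repository itself only ships the shell and batch scripts.

```python
import subprocess

# Language codes named in the comment above (ja, ko, en).
SUPPORTED_LANGUAGES = {"ja": "Japanese", "ko": "Korean", "en": "English"}


def build_pipeline(speaker, language, rvc_arg=500):
    """Return the three training commands, in order, as argument lists.

    rvc_arg is passed through unchanged as the second argument of
    train_rvc.sh (500 in the example above); its meaning is defined
    by that script, not by this sketch.
    """
    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"unsupported language code: {language!r}")
    return [
        ["./train_rvc.sh", speaker, str(rvc_arg)],
        ["./make_dataset.sh", speaker, language],
        ["./train_vits.sh", speaker],
    ]


def run_pipeline(commands):
    """Run each command in turn, stopping at the first failure."""
    for command in commands:
        subprocess.run(command, check=True)


if __name__ == "__main__":
    # Dry run: print the commands for a Korean model instead of executing them.
    for command in build_pipeline("ljs", "ko"):
        print(" ".join(command))
```

Passing the result of build_pipeline to run_pipeline would execute the scripts; the dry run above only prints them, which makes it easy to confirm the speaker name and language code first.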

3. Inference

See vits/inference.ipynb

3.5 Inference Voice Sample

See ljs_ja_voice

Test Datasets

Language	Name	Link
Korean	KSS	https://www.kaggle.com/datasets/bryanpark/korean-single-speaker-speech-dataset
Japanese	JSUT	https://sites.google.com/site/shinnosuketakamichi/publication/jsut
English	LJSPEECH	https://keithito.com/LJ-Speech-Dataset/
