MultiSpeech

This is a PyTorch implementation of MultiSpeech: Multi-Speaker Text to Speech with Transformer.

Train on your data

To train the model on your own data, follow the steps below.

1. Data preprocessing

Prepare your data as a PSV (pipe-separated values) file with the columns shown below. The file itself must not contain the header line.

speaker_id|audio_path|text|duration
0|file/to/file.wav|the text in that file|3.2

The speaker ID must be an integer, and speaker IDs start from 0.
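The format rules above can be checked before training. The following is a minimal sketch, not part of this repository: the function name `validate_manifest_lines` and the specific checks (four pipe-separated fields, integer speaker ID, non-empty text, positive numeric duration) are assumptions drawn from the description above.

```python
def validate_manifest_lines(lines):
    """Return a list of (line_number, problem) tuples for PSV manifest lines.

    Hypothetical helper: checks only the rules stated in this README.
    Line number 0 is used for problems that concern the file as a whole.
    """
    problems = []
    speaker_ids = set()
    for lineno, line in enumerate(lines, start=1):
        line = line.rstrip("\n")
        if not line:
            continue  # ignore blank lines
        fields = line.split("|")
        if len(fields) != 4:
            problems.append((lineno, f"expected 4 fields, got {len(fields)}"))
            continue
        speaker_id, audio_path, text, duration = fields
        # Speaker ID must be a non-negative integer.
        try:
            sid = int(speaker_id)
            if sid < 0:
                raise ValueError
            speaker_ids.add(sid)
        except ValueError:
            problems.append((lineno, "speaker_id is not a non-negative integer"))
        if not audio_path.strip():
            problems.append((lineno, "audio_path is empty"))
        if not text.strip():
            problems.append((lineno, "text is empty"))
        # Duration must parse as a positive number.
        try:
            if not float(duration) > 0:
                raise ValueError
        except ValueError:
            problems.append((lineno, "duration is not a positive number"))
    if speaker_ids and min(speaker_ids) != 0:
        problems.append((0, "speaker ids do not start from 0"))
    return problems


# Usage (the file name is taken from the training command in this README):
#     with open("train_data.txt", encoding="utf-8") as f:
#         for lineno, problem in validate_manifest_lines(f):
#             print(lineno, problem)
```

A header line left in the file is reported as well, because its first field is not an integer.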

Make sure all audio files are mono. Convert any file that is not mono before training.
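One way to check and convert files is sketched below using only the Python standard library. It is an illustration under stated assumptions, not the repository's own tooling: the helpers `count_channels` and `downmix_to_mono` are hypothetical names, and the downmix handles 16-bit PCM WAV files only, averaging the channels of each frame.

```python
import array
import sys
import wave


def count_channels(src):
    """Return the channel count of a WAV file (str path or binary file object)."""
    with wave.open(src, "rb") as reader:
        return reader.getnchannels()


def downmix_to_mono(src, dst):
    """Average all channels of a 16-bit PCM WAV and write a mono WAV to dst.

    Hypothetical helper; src and dst may be str paths or binary file objects.
    """
    with wave.open(src, "rb") as reader:
        n_channels = reader.getnchannels()
        sample_width = reader.getsampwidth()
        frame_rate = reader.getframerate()
        frames = reader.readframes(reader.getnframes())
    if sample_width != 2:
        raise ValueError("this sketch only handles 16-bit PCM audio")

    samples = array.array("h")
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV samples are stored little-endian

    # Samples are interleaved per frame: average each group of n_channels.
    mono = array.array(
        "h",
        (
            sum(samples[i:i + n_channels]) // n_channels
            for i in range(0, len(samples), n_channels)
        ),
    )
    if sys.byteorder == "big":
        mono.byteswap()

    with wave.open(dst, "wb") as writer:
        writer.setnchannels(1)
        writer.setsampwidth(2)
        writer.setframerate(frame_rate)
        writer.writeframes(mono.tobytes())


# Usage (paths are placeholders):
#     if count_channels("stereo.wav") != 1:
#         downmix_to_mono("stereo.wav", "mono.wav")
```

Files in other formats or bit depths would need a dedicated audio tool or library instead.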

2. Set up the development environment

Create a virtual environment

python -m venv env

Activate the environment

source env/bin/activate

Install the required dependencies

pip install -r requirements.txt

3. Training

Update the config file if needed

Train the model

python train.py --train_path train_data.txt --test_path test_data.txt --checkpoint_dir outdir --epoch 100 --batch_size 64
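The command above expects separate training and test files. If you start from a single manifest, a split along the following lines may help. This is a rough sketch and not part of the repository: `split_manifest`, `test_fraction`, `seed`, and the input name `all_data.txt` are all hypothetical, and a plain random split does not guarantee that every speaker appears in both sets.

```python
import random


def split_manifest(lines, test_fraction=0.1, seed=0):
    """Shuffle manifest lines and return (train_rows, test_rows).

    Hypothetical helper; test_fraction and seed are illustrative defaults.
    """
    rows = [line.rstrip("\n") for line in lines if line.strip()]
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    rng.shuffle(rows)
    # Keep at least one test row whenever there is more than one row.
    n_test = max(1, int(len(rows) * test_fraction)) if len(rows) > 1 else 0
    return rows[n_test:], rows[:n_test]


# Usage (all_data.txt is a placeholder; the output names match the command above):
#     with open("all_data.txt", encoding="utf-8") as f:
#         train_rows, test_rows = split_manifest(f)
#     with open("train_data.txt", "w", encoding="utf-8") as f:
#         f.write("\n".join(train_rows) + "\n")
#     with open("test_data.txt", "w", encoding="utf-8") as f:
#         f.write("\n".join(test_rows) + "\n")
```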
