Pytorch code for following paper:
- Title : Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models (Accepted for ICASSP 2024, available here)
- Autor : Ju-ho Kim, Jungwoo Heo, Hyun-seo Shin, Chan-yeong Lim and Ha-Jin Yu
- We used 'nvcr.io/nvidia/pytorch:21.04-py3' image of Nvidia GPU Cloud for conducting our experiments.
- Run 'build.sh' file to make docker image
./docker/build.sh
- Run 'interactive.sh' file to activate docker container
- Note that you must modify the mapping path before running the 'interactive.sh' file
./docker/interactive.sh
- We used VoxCeleb1 dataset for training and test.
- For noisy test, we used the MUSAN, Nonspeech100, and VOiCES datasets.
- Each downloaded dataset should be mapped to the 'data' folder in docker environment.
python3 code/diff_sv/main.py
Please cite this paper if you make use of the code.
@article{kim2023diff,
title={Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models},
author={Kim, Ju-ho and Heo, Jungwoo and Shin, Hyun-seo and Lim, Chan-yeong and Yu, Ha-Jin},
journal={arXiv preprint arXiv:2309.08320},
year={2023}
}