Name		Name	Last commit message	Last commit date
parent directory ..
LICENSE		LICENSE
README.md		README.md
audio_utils.py		audio_utils.py
mixed.wav		mixed.wav
output.wav		output.wav
ref-voice.wav		ref-voice.wav
voicefilter.py		voicefilter.py

README.md

VoiceFilter

Input

Audio file

Mixed audio

mixed.mov

Reference audio for d-vector

ref-voice.mov

Input an audio file that is spoken by multiple people and an audio file that contains the voices of the people you want to extract. The voice of one person is extracted and output.

Output

Audio file

Estimated audio

output.mov

Ground truth

ground_truth.mov

(Audio from http://swpark.me/voicefilter/)

Usage

Automatically downloads the onnx and prototxt files on the first run. It is necessary to be connected to the Internet while downloading.

For the sample wav,

$ python3 voicefilter.py --input mixed.wav --reference_file ref-voice.wav

If you want to specify the mixed audio, put the file path after the --input option, and to specify the reference audio, put the file path after the --reference_file option.
You can use --savepath option to change the name of the output file to save.

$ python3 voicefilter.py --input MIXED_WAV --reference_file REFERENCE_WAV --savepath SAVE_PATH

Reference

VoiceFilter

Framework

Pytorch

Model Format

ONNX opset=11

Netron

embedder.onnx.prototxt
model.onnx.prototxt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

voicefilter

voicefilter

README.md

VoiceFilter

Input

Output

Usage

Reference

Framework

Model Format

Netron

Files

voicefilter

Directory actions

More options

Directory actions

More options

Latest commit

History

voicefilter

Folders and files

parent directory

README.md

VoiceFilter

Input

Output

Usage

Reference

Framework

Model Format

Netron