This is a 288x288 version of the Wav2Lip model.

Test environment

Ubuntu 22.04, Python 3.10.13

The original repo: https://github.com/Rudrabha/Wav2Lip

Some features I will implement here:

input size 288x288
PReLU
LeakyReLU
Gradient penalty
Wasserstein Loss
[] wav2lip_384
[] wav2lip_512
[] syncnet_192
[] syncnet_384
[] 2TUnet instead of the simple U-Net used in the original Wav2Lip: https://arxiv.org/abs/2210.15374
[] MSG-UNet: https://github.com/laxmaniron/MSG-U-Net
[] SAM-UNet: https://github.com/1343744768/Multiattention-UNet
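
The Wasserstein loss and gradient penalty items above can be sketched roughly as follows. This is a minimal illustration in PyTorch, not the code in wloss_hq_wav2lip_train.py: `TinyCritic`, `gradient_penalty`, `critic_loss`, `generator_loss` and the penalty weight of 10 are all hypothetical names and values chosen for the example, and the critic is a toy stand-in for the real visual quality discriminator.

```python
import torch
import torch.nn as nn


class TinyCritic(nn.Module):
    """Hypothetical stand-in for a discriminator; NOT the repo's model."""

    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 8, kernel_size=4, stride=4),   # 288 -> 72
            nn.LeakyReLU(0.2),
            nn.Conv2d(8, 16, kernel_size=4, stride=4),  # 72 -> 18
            nn.PReLU(),
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
            nn.Linear(16, 1),  # unbounded score, no sigmoid
        )

    def forward(self, x):
        return self.net(x)


def gradient_penalty(critic, real, fake):
    """Penalise deviation of the critic's gradient norm from 1
    on random interpolations between real and generated frames."""
    batch = real.size(0)
    eps = torch.rand(batch, 1, 1, 1, device=real.device)
    mixed = eps * real.detach() + (1.0 - eps) * fake.detach()
    mixed.requires_grad_(True)
    scores = critic(mixed)
    (grads,) = torch.autograd.grad(
        outputs=scores,
        inputs=mixed,
        grad_outputs=torch.ones_like(scores),
        create_graph=True,  # keep the graph so the penalty can be backpropagated
    )
    grad_norm = grads.reshape(batch, -1).norm(2, dim=1)
    return ((grad_norm - 1.0) ** 2).mean()


def critic_loss(critic, real, fake, gp_weight=10.0):
    """Wasserstein critic loss plus gradient penalty (gp_weight is an assumed value)."""
    wasserstein = critic(fake.detach()).mean() - critic(real).mean()
    return wasserstein + gp_weight * gradient_penalty(critic, real, fake)


def generator_loss(critic, fake):
    """The generator tries to raise the critic's score on generated frames."""
    return -critic(fake).mean()
```

In a training loop along these lines, `critic_loss` would be minimised with the critic's optimizer and `generator_loss` added to the generator's reconstruction and sync losses.
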
I trained my own model on the AVSpeech dataset and then fine-tuned it (transfer learning) on my private dataset.
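
A fine-tuning step of this kind might look like the sketch below. It is only an illustration under assumptions: `build_model` is a hypothetical toy network standing in for the real generator, `load_for_finetuning` is a made-up helper name, the checkpoint is assumed to store its weights under a `"state_dict"` key, and the learning rate of 1e-5 is an arbitrary small value, not the one used for this repository.

```python
import torch
import torch.nn as nn


def build_model():
    # Hypothetical stand-in; the real generator is defined in the repo's models folder.
    return nn.Sequential(
        nn.Conv2d(3, 8, kernel_size=3, padding=1),
        nn.PReLU(),
        nn.Conv2d(8, 3, kernel_size=3, padding=1),
    )


def load_for_finetuning(model, checkpoint_path, lr=1e-5):
    """Load pretrained weights and build an optimizer with a small learning rate."""
    checkpoint = torch.load(checkpoint_path, map_location="cpu")
    # Assumption: weights are stored under "state_dict"; fall back to a bare state dict.
    state = checkpoint["state_dict"] if "state_dict" in checkpoint else checkpoint
    model.load_state_dict(state)
    optimizer = torch.optim.Adam(
        [p for p in model.parameters() if p.requires_grad], lr=lr
    )
    return model, optimizer
```

Training then continues on the new dataset with the returned optimizer; a lower learning rate than in the first stage is a common choice so the pretrained weights are adjusted gradually.
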

Citing

To cite this repository:

@misc{Wav2Lip,
  author={Rudrabha},
  title={Wav2Lip: Accurately Lip-syncing Videos In The Wild},
  year={2020},
  url={https://github.com/Rudrabha/Wav2Lip}
}

Folders and files

checkpoints
evaluation
face_detection
models
results
temp
weights
.gitignore
README.md
audio.py
clear_data.py
collect_avspeech.py
color_syncnet_train.py
convert2fps.py
hparams.py
hq_wav2lip_train.py
inference.py
preprocess.py
requirements.txt
wav2lip_train.py
wloss_hq_wav2lip_train.py

monk-after-90s/wav2lip_288x288
