torchvggish-gpu

A PyTorch re-implementation of VGGish [1], Google Research's feature embedding frontend for audio classification models, with GPU support. This code is based entirely on torchvggish [2].

Usage

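import torch
from torchvggish_gpu import vggish
import vggish_input

# Use the GPU when CUDA is available; otherwise fall back to the CPU
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

# Create the model, move it to the chosen device, and switch to inference mode
embedding_model = vggish()
embedding_model.to(device)
embedding_model.eval()

# Convert the audio file into input examples on the same device as the model
example = vggish_input.wavfile_to_examples("bus_chatter.wav")
example = example.to(device)

# Disable gradient tracking, since only the embeddings are needed
with torch.no_grad():
    audio_embeddings = embedding_model(example)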
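
To sanity-check the tensors produced by the snippet above, it can help to estimate the expected shapes from the clip duration. The sketch below is not part of this repository: the constants are assumed to match the upstream VGGish defaults (16 kHz audio, 25 ms window, 10 ms hop, 0.96 s non-overlapping examples, 64 mel bins, 128-dimensional embeddings), and the (N, 1, 96, 64) input layout is likewise an assumption. The helper names `count_frames` and `expected_shapes` are hypothetical; confirm the values against vggish_params.py.

```python
# Framing constants assumed to match the upstream VGGish defaults;
# confirm against vggish_params.py before relying on them.
SAMPLE_RATE = 16000            # Hz
STFT_WINDOW_SECONDS = 0.025    # 25 ms analysis window
STFT_HOP_SECONDS = 0.010       # 10 ms hop between spectrogram frames
EXAMPLE_WINDOW_SECONDS = 0.96  # length of one input example
EXAMPLE_HOP_SECONDS = 0.96     # examples do not overlap
NUM_MEL_BINS = 64
EMBEDDING_SIZE = 128


def count_frames(num_items, window, hop):
    """Number of complete windows of length `window` taken every `hop` items."""
    if num_items < window:
        return 0
    return 1 + (num_items - window) // hop


def expected_shapes(duration_seconds):
    """Return (input_shape, embedding_shape) for a clip of the given length."""
    num_samples = int(round(duration_seconds * SAMPLE_RATE))
    window = int(round(STFT_WINDOW_SECONDS * SAMPLE_RATE))
    hop = int(round(STFT_HOP_SECONDS * SAMPLE_RATE))
    spectrogram_frames = count_frames(num_samples, window, hop)

    # Spectrogram frames arrive at 1 / hop frames per second.
    frames_per_second = 1.0 / STFT_HOP_SECONDS
    example_window = int(round(EXAMPLE_WINDOW_SECONDS * frames_per_second))
    example_hop = int(round(EXAMPLE_HOP_SECONDS * frames_per_second))
    num_examples = count_frames(spectrogram_frames, example_window, example_hop)

    input_shape = (num_examples, 1, example_window, NUM_MEL_BINS)
    embedding_shape = (num_examples, EMBEDDING_SIZE)
    return input_shape, embedding_shape


if __name__ == "__main__":
    print(expected_shapes(10.0))
```

Under these assumptions, a 10-second clip yields 10 examples, so `example.shape` would be (10, 1, 96, 64) and `audio_embeddings.shape` would be (10, 128). A clip shorter than 0.96 s yields no examples at all, which is worth checking for before calling the model.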

[1] S. Hershey et al., ‘CNN Architectures for Large-Scale Audio Classification’, in International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017. Available: https://arxiv.org/abs/1609.09430, https://ai.google/research/pubs/pub45611

[2] H. Taylor et al., ‘Pytorch port of Google Research's VGGish model used for extracting audio features’, v0.1, Sep. 27, 2019. Available: https://github.com/harritaylor/torchvggish/releases/tag/v0.1
