Key-Word Spotting using Deep Neural Networks

Project repository for the Human Data Analytics course exam, Data Science Master Degree, Padova AA 2020-2021.

Project Overview

The Keyword Spotting (KWS) task consists in the detection of a certain predetermined set of keywords from a stream of user utterances. Deep learning models have proved to give highly accurate results, while remaining lightweight and suitable for running in mobile devices. In this project, a variety deep neural architectures for KWS are tested. Specifically, I focused on architectures based on the attention mechanism. All models are trained on the Google Speech Commands dataset, on the 12kws task and the 35kws task.

A detailed report of all the work can be found in the project report, which is placed in project_report/project_report.pdf .

Code Details

In order to run the code, the dataset folder (downloadable here) must be placed inside the data folder, and must have name speech_commands_v0.02.

All proposed models are defined in models.py. To train all the models, run the train_models.py python script. All the hyperparameters are defined in hyperparameters.py.

Two Jupyter Notebooks are provided:

Input Pipeline.ipynb : contains a demonstration of how the input pipeline for the project works.
Models Evaluation.ipynb : here, all the weights from the trained models are loaded, in order to evaluate them. At the end, all the code to produce the plots present in the project report is provided. To run this notebook, the models folder must contan the weights of the trained models. To get them, one must train all the models by executing the train_models.py file. Alternatively, they can be found here.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
data/binaries		data/binaries
models		models
project_report		project_report
.gitignore		.gitignore
Input Pipeline.ipynb		Input Pipeline.ipynb
Models Evaluation.ipynb		Models Evaluation.ipynb
custom_layers.py		custom_layers.py
hyperparams.py		hyperparams.py
input_pipeline.py		input_pipeline.py
metrics.py		metrics.py
models.py		models.py
readme.md		readme.md
train_models.py		train_models.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Key-Word Spotting using Deep Neural Networks

Project Overview

Code Details

About

Releases

Packages

Languages

rmazzier/HDA-Project-Key-Word-Spotting

Folders and files

Latest commit

History

Repository files navigation

Key-Word Spotting using Deep Neural Networks

Project Overview

Code Details

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages