Identifying emotions from voice. Based on this blog post
Relevant resources:
- LibROSA
- Understanding the Mel Spectrogram
- The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS)
- Toronto emotional speech set (TESS)
- The LJ Speech Dataset
- How To Apply Machine Learning And Deep Learning Methods to Audio Analysis
- Musical Genre Classification with Convolutional Neural Networks
- Deep Learning Approaches for Understanding Simple Speech Commands
- DeepMind's Tacotron-2 TensorFlow implementation