Audio Modelling #9

ClashLuke · 2022-04-30T08:36:48Z

There are multiple ways we could go about modelling audio. For example, we could tokenise sounds or audio snippets and autoregressively predict the next token. Whether the audio tokens come from a VQGAN or discrete Fourier transformation doesn't matter to the model but could change the performance of our generation a lot. This issue is about finding out how to model sound and develop an end-to-end pipeline to develop a prototype and see how it works.

ClashLuke added the research Creative project that might fail but could give high returns label Apr 30, 2022

ClashLuke changed the title ~~Audio Modeling~~ Audio Modelling Apr 30, 2022

ClashLuke mentioned this issue Apr 30, 2022

Tokenizing Phonetics #10

Open

ClashLuke added the ML Requires machine-learning knowledge (can be built up on the fly) label Apr 30, 2022

This was referenced May 15, 2022

Alternative Sampling Methods #42

Merged

Long-Range-Arena Evaluation #49

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Audio Modelling #9

Audio Modelling #9

ClashLuke commented Apr 30, 2022

Audio Modelling #9

Audio Modelling #9

Comments

ClashLuke commented Apr 30, 2022