Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A voice and speech (Spanish) corpus of patients who underwent upper airway surgery in pre- and post-operative states #27

Open
aguerrerolopez opened this issue Nov 7, 2024 · 0 comments

Comments

@aguerrerolopez
Copy link

Hi again!

I recommend also to add this dataset:
https://zenodo.org/records/11654546

The data set comprises 3,800 speech audio files of 3 types of upper respiratory tract surgeries and 1 control set. The dataset has an average of 35.51 +- 5.91 audio recordings per patient. It provides valuable resources to the scientific community to systematically investigate the objective effects of upper respiratory tract surgery on voice and speech.

This data set is a complete corpus comprising data from 107 Spanish Castilian speakers. This corpus encompasses voice and speech recordings from both control speakers and patients who underwent upper airway surgical procedures in pre- and post-operative stages. The surgeries in focus include Tonsillectomy, Functional Endoscopic Sinus Surgery, and Septoplasty, all consistently performed by a single surgeon.

There is a paper where the dataset is described:
https://www.nature.com/articles/s41597-024-03540-5

and there is also a github repo where code can be found to preprocess the data and launch some machine learning experiments:
https://github.com/BYO-UPM/CUCO_Database

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant