Skip to content

The Parkinson's Disease Detection project utilizes the Oxford Parkinson's Disease dataset with 197 instances and 23 real-valued attributes. Conducting classification tasks, the project employs machine learning models to discriminate between healthy individuals and those with Parkinson's disease.

License

Notifications You must be signed in to change notification settings

amiruzzaman1/Parkinson-Disease-Detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

Parkinson-Disease-Detection

The Parkinson's Disease Detection project utilizes the Oxford Parkinson's Disease dataset with 197 instances and 23 real-valued attributes. Conducting classification tasks, the project employs machine learning models to discriminate between healthy individuals and those with Parkinson's disease based on biomedical voice measurements, contributing to the diagnosis and understanding of the disease.

Dataset Overview

The Parkinson's Disease dataset consists of biomedical voice measurements from 31 individuals, 23 with Parkinson's disease (PD). There are 197 instances, with each row corresponding to a voice recording. The dataset includes 23 real-valued attributes, such as vocal fundamental frequency measures, variation in frequency and amplitude, noise-to-tonal component ratios, and nonlinear complexity measures. The goal is to discriminate between healthy subjects (status=0) and those with PD (status=1). Recorded by Max Little in collaboration with the National Centre for Voice and Speech, the dataset is valuable for classification tasks. It is in ASCII CSV format and has been utilized in a study focusing on telemonitoring PD.

image image image image image

Dataset Link: Parkinson's Disease Dataset

Colab Notebook

Open In Colab

Models and Results

Model Accuracy Precision (Class 0) Recall (Class 0) F1-Score (Class 0) Precision (Class 1) Recall (Class 1) F1-Score (Class 1)
SVM 0.87 0.67 0.57 0.62 0.91 0.94 0.92
Random Forest 0.95 1.00 0.71 0.83 0.94 1.00 0.97
K-Nearest Neighbors 0.95 1.00 0.71 0.83 0.94 1.00 0.97
Logistic Regression 0.90 1.00 0.43 0.60 0.89 1.00 0.94
Decision Tree 0.92 0.83 0.71 0.77 0.94 0.97 0.95
Gradient Boosting 0.92 0.83 0.71 0.77 0.94 0.97 0.95
LSTM 0.95 1.00 0.71 0.83 0.94 1.00 0.97
BiLSTM 0.95 1.00 0.71 0.83 0.94 1.00 0.97
CNN 0.87 0.75 0.43 0.55 0.89 0.97 0.93
Naive Bayes 0.72 0.33 0.57 0.42 0.89 0.75 0.81
CNN + GRU 0.85 0.57 0.57 0.57 0.91 0.91 0.91
CNN + LSTM 0.87 0.62 0.71 0.67 0.94 0.91 0.92
LSTM + GRU 0.95 1.00 0.71 0.83 0.94 1.00 0.97
LSTM + GRU (ROS) 1.00 1.00 1.00 1.00 1.00 1.00 1.00
CNN + LSTM (ROS) 1.00 1.00 1.00 1.00 1.00 1.00 1.00
XGB-NN Hybrid (ROS) 1.00 1.00 1.00 1.00 1.00 1.00 1.00
XGB-NN Hybrid (Undersample) 1.00 1.00 1.00 1.00 1.00 1.00 1.00
CNN + LSTM (Undersample) 0.85 0.79 1.00 0.88 1.00 0.67 0.80
LSTM + GRU (Undersample) 0.70 0.73 0.73 0.73 0.67 0.67 0.67

Note: ROS stands for Random Oversampling, and the same applies for Undersample.

About

The Parkinson's Disease Detection project utilizes the Oxford Parkinson's Disease dataset with 197 instances and 23 real-valued attributes. Conducting classification tasks, the project employs machine learning models to discriminate between healthy individuals and those with Parkinson's disease.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published