Malaria Detection using Cell Images

How to Run the Code

Prepare the data by using the data_download.ipynb notebook found in the 'Data Download' directory.
- Set the target height and width using the parameters at the top of the notebook.
- Running the notebook creates a Data directory containing the original cell images and a Resized_data_ directory containing the resized images.
Label the data using the labelling.ipynb notebook found in the 'Data Labelling' directory.
- It will save a CSV of relative filenames and labels in the specified directory.
Create the train and test splits using the train_test_split.ipynb notebook.
Modeling scripts are in the 'Modeling' directory.
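
The labelling and splitting steps above can be sketched in plain Python. This is only an illustration, not the code in the notebooks. The function names (`collect_labels`, `write_labels_csv`, `stratified_split`), the class folder names `Parasitized` and `Uninfected`, the `.png` extension, and the CSV column names are all assumptions made for the example.

```python
import csv
import random
from collections import defaultdict
from pathlib import Path

# Assumed layout (not confirmed by the repository): one subfolder per class.
CLASS_LABELS = {"Parasitized": 1, "Uninfected": 0}


def collect_labels(data_root):
    """Return (relative_path, label) pairs for every PNG in each class folder."""
    root = Path(data_root)
    rows = []
    for class_name, label in CLASS_LABELS.items():
        for image_path in sorted((root / class_name).glob("*.png")):
            rows.append((image_path.relative_to(root).as_posix(), label))
    return rows


def write_labels_csv(rows, csv_path):
    """Write the (filename, label) pairs to a CSV file with a header row."""
    with open(csv_path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["filename", "label"])
        writer.writerows(rows)


def stratified_split(rows, test_fraction=0.2, seed=0):
    """Split rows into train and test lists, class by class."""
    by_label = defaultdict(list)
    for row in rows:
        by_label[row[1]].append(row)
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train, test = [], []
    for label in sorted(by_label):
        group = by_label[label]
        rng.shuffle(group)
        n_test = round(len(group) * test_fraction)
        test.extend(group[:n_test])
        train.extend(group[n_test:])
    return train, test
```

Splitting each class separately keeps the proportion of parasitized and non-parasitized cells roughly equal in both sets, which matters when accuracy is the metric being compared.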

Contributors

Srishti Singh, [email protected]
Shreya Bhatia, [email protected]
Madhava Krishna, [email protected]
Harshit Goyal, [email protected]

Motivation

Malaria is a life-threatening disease that affects many people worldwide and is spread through the bites of infected Anopheles mosquitoes. Earlier studies have shown that agreement between physicians on the severity of the disease in a given patient's sample is very low. Preliminary computer-aided detection can therefore be of great value for faster and more reliable diagnosis. We aim to create a classifier that distinguishes parasitized from non-parasitized cells to aid medical professionals in this task.

Related Work

Pan et al. (2018) created a model based on deep CNN architectures. Using data augmentation, they obtained accuracies of over 90% on the training and validation samples.
Raihan and Nahid (2021) created a model based on boosted trees with feature engineering and determined feature importance using SHapley Additive exPlanations (SHAP).
Fuhad et al. (2020) implemented a CNN-based model that achieved accuracy over 99% while remaining computationally efficient.

Suggested Outcomes

Automating the diagnosis process can make diagnosis more accurate and, as a result, holds the possibility of providing dependable healthcare to places with limited resources. We aim to implement several classification algorithms and to search for parameters that balance training time, computational complexity, and performance. We will apply transformations, feature engineering, and feature extraction to the dataset. We will train machine learning models such as SVMs, logistic regression, decision trees, and random forests, and compare the performance of all models. We also intend to try grayscale conversion and observe how it changes the behavior of the models.
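
As a rough illustration of the grayscale step, the sketch below converts RGB pixels to a single luma channel using the ITU-R BT.601 weights (0.299, 0.587, 0.114). The weighting the project actually uses is not stated here, so this choice is an assumption, and `to_grayscale` and `flatten_features` are hypothetical helper names.

```python
def to_grayscale(rgb_image):
    """Convert rows of (R, G, B) pixels in 0-255 to rows of 0-255 luma values.

    Uses the ITU-R BT.601 luma weights; this is an assumed choice.
    """
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in rgb_image
    ]


def flatten_features(gray_image):
    """Flatten a grayscale image into one feature vector scaled to [0, 1]."""
    return [value / 255 for row in gray_image for value in row]


# A 2x2 example image: white, black, pure red, pure blue.
image = [
    [(255, 255, 255), (0, 0, 0)],
    [(255, 0, 0), (0, 0, 255)],
]
gray = to_grayscale(image)
features = flatten_features(gray)
```

Grayscale conversion cuts the number of input features per image to a third, which reduces training cost for models such as SVMs but discards the color information that staining provides.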

Project Proposal

The project proposal is available as Project_proposal_group_17.pdf in the repository.

madhava20217/Malaria-Detection-from-Cells
