- Replicated a cutting-edge NLP model from the 2017 paper "PubMed 200k RCT" to classify sentences in medical abstracts sequentially, using the dataset of ~200,000 labelled Randomised Controlled Trial (RCT) abstracts to enhance literature review efficiency.
- Developed and iterated through multiple model architectures including TF-IDF classifiers, deep learning models with various embeddings, and multimodal models, culminating in a final model that significantly aids in structured abstract skimming.
- Integrated advanced preprocessing and modelling techniques, including Python and spaCy for text segmentation and neural network models for sentence classification, aiming to implement the model in practical applications like browser extensions for real-time literature structuring.
-
Notifications
You must be signed in to change notification settings - Fork 0
Replicated a cutting-edge NLP model from the 2017 paper "PubMed 200k RCT" to classify sentences in medical abstracts sequentially, using the dataset of ~200,000 labelled Randomised Controlled Trial (RCT) abstracts to enhance literature review efficiency. Developed and iterated through multiple model architectures including TF-IDF classifiers...
wahidulalamriyad/skimlit
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Replicated a cutting-edge NLP model from the 2017 paper "PubMed 200k RCT" to classify sentences in medical abstracts sequentially, using the dataset of ~200,000 labelled Randomised Controlled Trial (RCT) abstracts to enhance literature review efficiency. Developed and iterated through multiple model architectures including TF-IDF classifiers...
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published