Skip to content

cretaceousmart/Drug_Property_Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Drug_Property_Prediction

Hi, I'm using Transformer, GNN, and Tree model(Random Forest) to predict the drug property. Thanks for giving me this opportunity to slove real-world problems since I haven’t gotten in torch with Medical ML problems and Transformer. I will show you what I’ve done and what I found.

  • Due to the limited time, there're a lot of things that can be improved, I'm looking forward to having a furthur discussion with you!

  • Outcome: All the experiment's outcomes are shown in the .ipynb. Since I don't have a nice GPU, I put all the training processes on Colab. I will suggest you check them by the below link:

  • What interesting things I found:

    • Data Shuffle is neccessary in this task
    • Featurization (Drug structure embedding) is important (but I didn't choose the featurizer carefully due time limited)
    • Batch size need to be small (like 32) and epoch number need to be large (like 300)
    • Transformer is trained fastest due Parallel Computing
    • Model comparasion:It's hard to compare since I don't spend much time in tunning the parameters
  • About model:

    • For Transformer: I build it from scratch without using any other library. In the beginning, I don’t know how to embed the drug structure in a proper way. So for the Transformer, I just use a very naive NLP method for featurization (Treat the drug input such as "CC(=O)Nc1ccccc1" as a sentence, maybe missed the information about the drug structure ). I don't spend much time tuning the parameters but it achieves the comparable performance as the implementation in Deep Purpose.

      • image
      • image
    • For GNN, I use the implementation from Deep Purpose and plot the ROC curve.

    • For Tree model, I use the featurizer from DeepChem and use scikit-learn to build model,then plot the ROC curve.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published