Skip to content

CBIIT/NCI-DOE-Collab-Pilot1-Tumor-Classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NCI-DOE-Collab-Pilot1-Tumor-Classifier

Description:

The Tumor Classifier capability (TC1) shows how to train and use a neural network model to classify tumor types from molecular features (such as RNA-Seq expressions) provided in Genomics Data Commons (GDC).

User Community:

Researchers interested in cancer susceptibility/histology; classification of diseases for oncology; cancer biology.

Usability:

Data scientists can train the provided untrained model on their own data, or use the trained model to classify the provided test samples. The provided scripts use data that have been downloaded from GDC and normalized.

Uniqueness:

Researchers have commonly used machine learning to classify molecular data. This capability shows how you can use neural networks in classification of genomic profiles without downsampling the provided expressions.

Impact:

TC1 can augment existing data quality control methods.

Components:

  • Untrained model:

    • The untrained neural network model is defined in tc1_baseline_keras2.py. The model architecture is available in JSON and YAML formats in MoDaC.
  • Data:

    • The processed training and test data are in MoDaC.
  • Trained model:

    • The trained model weights file, tc1.model.h5, is available in MoDaC.

Technical Details:

Refer to this README.

Releases

No releases published

Packages

No packages published

Languages