PharmGKB_VCF-DataAnalysis

PharmGKB DB usage in VCF files

This script uses R with VCF file specific preparation of VCF required
VCF should contains lines from #CHROM as starting
Formation of certain VCF format can be done in linux terminal or also by manual removal

Usage

Step 1:

cat input-properVCF-file.vcf | grep -v "##" > Prepared.vcf

This can also be done by removing all lines that has ## as starting in the VCF file

Step 2:

Rscript PharmGKB_VCF_analysis.R Prepared.vcf output-file-name

Make sure VCF file is annotated properly for getting ID. The complete script depends on ID column data.
Make sure VCF format is proper with starting line is from #CHROM line as usually present.
Make sure to provide input file name properly with extension and provide output file name alone

Clinical.Annotation.ID	Gene	Level.of.Evidence	Score	Phenotype.Category	Drug	Phenotype	CHROM	POS	QUAL	INFO	REF	ALT	Annotation.Text

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
PharmGKB_VCF_analysis.R		PharmGKB_VCF_analysis.R
Pharmacogenetics.csv		Pharmacogenetics.csv
README.md		README.md
SNP_ALLELE-pharmGCK.csv		SNP_ALLELE-pharmGCK.csv
model-output.csv		model-output.csv