Skip to content

Junkdnalab/acg_2024

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Applied Computational Genomics

Winter Trimester 2024

Course Description

The goal of this course is to initiate bench scientists on the Ph.D. track to master basic skills in bioinformatics and computational genomics. These skills are a prerequisite to a wide range of fields in modern molecular biology and are often overlooked. During this course we will acquire basic computer literacy needed to perform analyses of next generation sequencing experiments in all their modern forms. We will cover basic unix, project management with version control, principal components and clustering, visualization and color, single cell technologies, spatial omics, microbiome, and a brief survey of the fundamentals of Machine Learning. We will empower students with basic skills to advocate for and analyze their own data.

Course Director

Dennis Hazelett, PhD
Assistant Professor in Computational Biomedicine
G-577 Pacific Design Center, Green bldg.
West Hollywood
Office: (424) 315-4412
Dennis.hazelett at csmc.edu

Instructors and guest lecturers:

Simon Coetzee (simon.coetzee at csmc.edu)
Ivan Vujkovic-Cvijin (Ivan.Vujkovic-Cvijin at csmc.edu)
Kyoung-Jae Won (KyoungJae.Won at cshs.org)
Ryan Urbanowicz

Time: 9:00 AM – 10:30 AM Tuesdays (outlook calendar invitation)

Location: Picasso Room PDCG

Location of pre-recorded lectures

Some lectures in the course may be taught using the “inverted” method, in which pre-recorded lectures are augmented by structured class discussions and demonstrations. These will be made available at least 1 week prior to the relevant topic week.

Office hours: by appointment

Prerequisites

There are no prerequisites to take this class other than the requirements of admission to Cedars-Sinai’s graduate program.

Class Format

Classes are held every Tuesday from 9:00-10:30 AM in Picasso on the 5th floor of PDC green building (PDC-G). Classes are graded 20% attendance and contribution, 80% homework assignments and problem sets. Many lectures will be done in follow-along demonstration or mixed lecture and demonstration style. At the beginning of each lecture we will spend time addressing important questions from the 5-minute feedback (see Expectations, below). Students will be provided with a virtual machine for the duration of the course that has sufficient resources to complete any assignments. If desired each student may use their own resources with some limited support from course staff. There will be at most one homework assignment per lecture, often with required reading. Every homework assignment is due 1 week from the date of assignment at midnight. Office hours will be granted by availability and first come first serve EXCEPT ON THE DUE DATE. Early in the class, students will learn to use github, a critically important skill for maintaining open science publication standards. For the remainder of the course, completed homework assignments will be pushed to each student’s private repository (“repo”) on the github website which is to be shared with the course instructors only.

Expectations

Attendance is required no exceptions. You may obtain permission ahead of time or with extenuating circumstances after the fact from the graduate school (email to Emma Yates-Kassler). Homework assignments are required on time (see schedule below) regardless of attendance or for half credit one lecture late. Unexcused absences result in an incomplete grade.

After each lecture, a link will be posted to provide feedback. Attendance will be recorded by the completion of the "5-minute feedback". There will be three questions, with space for brief answers:

• What was the most important concept or take-home message you learned from today's class?
• What concepts or topics from today's class do you not understand well enough?
• What suggestions do you have for improving or enhancing today's class?

Frequently asked questions or topics of confusion will be addressed in the subsequent lecture.

Course Evaluation

A letter grade is assigned at the end of the course according to the following breakdown: 80% assignments 20% lecture attendance and feedback

Assignments

There is one assignment for each lecture and no exams. Late assignments will be given a maximum of 50% of full credit up to one lecture after the original due date. When homework is not assigned, points will be given for attendance. Additional readings may be indicated on the syllabus. Each question set is worth 80 points and will be graded based on completeness. Links to lecture slides and readings will be provided in the syllabi below.

Academic Honesty:

It is expected that students in a PhD program produce only original work representing their own ideas. This extends to use of published or public sources as well as the work of other students and colleagues. Instances of likely (not proven) plagiarism of code or other intellectual property or prose will be submitted to the graduate program curriculum committee for potential sanctions (read the student handbook for further details), which could include dismissal from the program.

Students with Disabilities

We will provide reasonable accommodations to students with disabilities. If this applies to you, please make an appointment with the course instructors (Dennis & Simon) as soon as possible to discuss your needs.

Class Schedule

Resources:

The Visual Display of Quantitative Information by Edward Tufte

Resources for learning basic Linux:

linuxjourney.com
Linux for Hackers by OccupyTheWeb (Author)

Resources/links for git:

Atlassian Git course on Coursera
git documentation

Resources:

Article: What is version control?
Poetry Foundation: "The Jabberwocky" (by Lewis Carroll)

Resources:

The best video to introduce a computational biology course.

Homework

Homework for bedtools and GenomicRanges

feedback form

compiling

02/20/2023 (7) NGS and File Formats

Analysis File

analysis_lecture8.R

Homework

Homework for File Formats

02/27/2023 (8) Single cell I

03/05/2023 (8) Single cell II

PCA short lecture (slides).

03/12/2023 (9) Single Cell III

Analysis File

split_seurat.rda

integration_speedup.rda

03/19/2023 (10) Spatial Omics I

Spatial Transcriptomics (slides).
Visium dataset (google drive link).

03/26/2023 (11) Spatial Omics II

Spatial Omics II - Course Pack

As soon as possible please run the following commands:

(This is for Mimas) for other machines the terminal code may be different.

In the terminal (not the R Console)

sudo apt update
sudo apt install patch

Then run these in the R Console:

BiocManager::install(c("devtools", "hdf5r"))
BiocManager::install("patrickCNMartin/Vesalius")

If you see the following:

These packages have more recent versions available.
It is recommended to update all of them.
Which would you like to update?

1: All                          
2: CRAN packages only           
3: None

select 1: All

04/02/2023 (11) Introduction to microbiome analysis (slides)

Assignment: Read the Dada2 tutorial, and instead of performing it on the dataset described in the tutorial, perform it on the following dataset all the way up to and including section “Assign taxonomy”. This dataset comprises FASTQ files of 16S rRNA sequencing data from a cohort of C. difficile-infected patients which we will analyze in the next class.

To capture all your outputs, set a path to your desired output directory and run the following commands after performing the Dada2 tutorial:

outpath<-"~/path/to/your/output/directory/"
write.table(table(nchar(getSequences(seqtab))), paste(outpath,"/table_nchar.xls",sep=""),sep="\t")
saveRDS(seqtab, paste(outpath,"/seqtab.rds",sep=""))
write.csv(t(seqtab), paste(outpath,"/seqtab.csv",sep=""))
write.csv(rowSums(seqtab.nochim)[order(rowSums(seqtab.nochim))],file=paste(outpath, "/biomsummary.csv",sep=""))
write.csv(taxa, paste(outpath,"/taxa.csv",sep=""))

The file named “biomsummary” has read counts for each sample. Email me (Ivan.Vujkovic-Cvijin at csmc.edu) this file before the next class!

04/09/2023 (12) Microbiome analysis I (slides)

Metadata and R scripts

Assignment due night of 4/15/2023

04/16/2023 (13) Microbiome analysis II (slides)

R code

Assignment due 4/21

bayesian

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published