Cell type annotation of pbmc_unsorted_3k_filtered

Download the data

#Download 10x Multi-ome data
wget https://cf.10xgenomics.com/samples/cell-arc/1.0.0/pbmc_unsorted_3k/pbmc_unsorted_3k_filtered_feature_bc_matrix.tar.gz
tar -xzf pbmc_unsorted_3k_filtered_feature_bc_matrix.tar.gz
rm pbmc_unsorted_3k_filtered_feature_bc_matrix.tar.gz

#Download annotated Seurat object (https://satijalab.org/seurat/vignettes.html)
wget https://dl.dropbox.com/s/3f3p5nxrn5b3y4y/pbmc_10k_v3.rds
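
Before moving on to R, it can help to confirm that the archive was unpacked as expected. The sketch below assumes the tarball extracts to a directory named filtered_feature_bc_matrix (the path passed to Read10X further down) and that it contains the three gzipped files usually found in a 10x filtered matrix directory. The function name check_10x_dir is a hypothetical helper, not part of any 10x or Seurat tool.

```shell
# Hypothetical helper: verify that a 10x matrix directory contains the three
# files Read10X normally looks for (assumed names for Cell Ranger v3+ style output).
check_10x_dir() {
    dir="$1"
    missing=0
    for f in barcodes.tsv.gz features.tsv.gz matrix.mtx.gz; do
        # -s is true only if the file exists and is not empty
        if [ ! -s "$dir/$f" ]; then
            echo "missing or empty: $dir/$f" >&2
            missing=1
        fi
    done
    # Exit status 0 when all files are present, 1 otherwise
    return "$missing"
}
```

Calling `check_10x_dir filtered_feature_bc_matrix` after the tar step returns a non-zero status and lists the affected files if the download or extraction was incomplete.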

Predict cell types in R

library(Seurat)

#read data
multiome.data = Read10X(data.dir = "filtered_feature_bc_matrix")
multiome_RNA_Seurat = CreateSeuratObject(counts = multiome.data[['Gene Expression']], project = "MultiOmeRna", min.cells = 3, min.features = 200)

#Note: filtering cells at this step (min.features = 200) leaves the removed cells without an annotation. To annotate every cell, set min.features = 0 instead.

annotated_Seurat = readRDS("pbmc_10k_v3.rds")

#normalize data (can take a long time)
#keep both objects in a plain list: query first, reference second
pbmc.list = list(multiome_RNA_Seurat, annotated_Seurat)
for (i in seq_along(pbmc.list)) {
    pbmc.list[[i]] = SCTransform(pbmc.list[[i]], verbose = TRUE)
}

#select top 3000 highly variable features
pbmc.features = SelectIntegrationFeatures(object.list = pbmc.list, nfeatures = 3000)

#prepare for integration
pbmc.list = PrepSCTIntegration(object.list = pbmc.list, anchor.features = pbmc.features, verbose = TRUE)

#we want to query the multiome data for cell types in the annotated dataset
pbmc.query = pbmc.list[[1]]
pbmc.reference = pbmc.list[[2]]

#find transfer anchors
pbmc.anchors = FindTransferAnchors(reference = pbmc.reference, query = pbmc.query, dims = 1:30, normalization.method = 'SCT', features = pbmc.features)

#predict cell types based on transfer anchors
predictions = TransferData(anchorset = pbmc.anchors, refdata = pbmc.reference$celltype, dims = 1:30)
pbmc.query = AddMetaData(pbmc.query, metadata = predictions)

#check if cell type annotation makes sense
pbmc.query = RunPCA(pbmc.query, features = pbmc.features)
pbmc.query = RunUMAP(pbmc.query, dims = 1:30)
pdf('UMAP_w_Predicted_cell_types.pdf')
#print() makes sure the plot is drawn to the PDF even when the script is run via source()
print(DimPlot(pbmc.query, reduction = "umap", group.by = 'predicted.id', label = TRUE))
dev.off()

#write meta_data to file
write.csv(predictions, file = 'CellType_annotation_pbmc_unsorted_3k.csv', row.names = TRUE)

The file CellType_annotation_pbmc_unsorted_3k.csv contains the predicted annotation for the "PBMC from a healthy donor - no cell sorting (3k)" dataset.
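
A quick tally of cells per predicted type can be made directly from the CSV. This is a minimal sketch that assumes the layout produced by write.csv on the TransferData output: a header row, cell barcodes in the first column, and predicted.id in the second. It also assumes that no cell type label contains a comma. The function name count_predicted_types is hypothetical.

```shell
# Hypothetical helper: count cells per predicted cell type in the annotation CSV.
# Assumes column 2 is predicted.id and that labels contain no embedded commas.
count_predicted_types() {
    # Skip the header, keep column 2, strip the quotes added by write.csv,
    # then count identical labels and list the most frequent first.
    tail -n +2 "$1" | cut -d, -f2 | tr -d '"' | sort | uniq -c | sort -rn
}
```

Running `count_predicted_types CellType_annotation_pbmc_unsorted_3k.csv` prints one line per label with its cell count, which gives a rough check of the label distribution alongside the UMAP.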

SeppeDeWinter/CellTypeAnnotation_labelTransfer

Folders and files

Latest commit

History

Repository files navigation

Cell type annotation of pbmc_unsorted_3k_filtered

Download the data

Predict cell types in R

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages