data-scripts

Scripts to demultiplex or pre-process user data to get into Cellenics.

Utils.R

Contains utilities for the other functions.

filter_empty_drops.R

Usage:

Download samples using aws s3 cp s3://biomage-originals-production/PROJECTUUID input --recursive
Copy samples table (Pure json as opposed to dynamodb JSON) into samples.json
run "python3 rename_samples.py"
Open data-scripts.rproj and load renv dependencies renv::restore()
Use the filter_empty_drops function to filter all samples in the input dir

hto_demultiplex.R

Usage:

Follow usage instructions for filter_empty_drops.R to filter all samples in the input dir
Use the hto_demux function to demultiplex all samples in the out dir, which were previously filtered by the filter_empty_drops function

cellset extraction

To extract cellsets, you only need an experiment ID, and the index of the cellset in the cellsets file. The cellset index is composed from the cellset class, as listed below, and the cellset number inside each class (with 1-based indexing).

1 = louvain
2 = scratchpad
3 = samples
4 = metadata tracks

The easiest way to get this is to use Rstudio's list viewer: View(parsed_json_object). You can download the cellset file using the download_cellset_file function, import it with jsonlite::read_json and explore it with the list viewer.

After getting the positions, The function extract_cellset will do everything automagically, returning a subsetted seurat object.

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
renv		renv
.Rprofile		.Rprofile
.gitignore		.gitignore
FCA_convert_loom_to_10x.py		FCA_convert_loom_to_10x.py
README.md		README.md
appendix_non-standard_h5_files.Rmd		appendix_non-standard_h5_files.Rmd
cellset_extraction.R		cellset_extraction.R
convert_10x_to_parse.R		convert_10x_to_parse.R
convert_csv_count_matrices.R		convert_csv_count_matrices.R
convert_hdf5_to_10x.Rmd		convert_hdf5_to_10x.Rmd
convert_kbout_to_10x.R		convert_kbout_to_10x.R
convert_parse_to_10x.R		convert_parse_to_10x.R
data-scripts.code-workspace		data-scripts.code-workspace
demultiplex_10x_files.R		demultiplex_10x_files.R
demultiplex_csv.R		demultiplex_csv.R
demultiplex_seurat_obj.R		demultiplex_seurat_obj.R
filter_empty_drops.R		filter_empty_drops.R
hto_demultiplex.R		hto_demultiplex.R
process_fastq.ipynb		process_fastq.ipynb
remove_cellsets.Rmd		remove_cellsets.Rmd
rename_folders.py		rename_folders.py
renv.lock		renv.lock
utils.R		utils.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

data-scripts

Utils.R

filter_empty_drops.R

hto_demultiplex.R

cellset extraction

About

Releases

Packages

Contributors 4

Languages

biomage-org/data-scripts

Folders and files

Latest commit

History

Repository files navigation

data-scripts

Utils.R

filter_empty_drops.R

hto_demultiplex.R

cellset extraction

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages