4. QC and normalization
4.0 Input datasets
library(SingleCellExperiment)
library(scater)
options(stringsAsFactors = FALSE)mx <- read.table("GSE71008.NvsCRC.reads.txt", sep = "\t")
anno <- read.table("GSE71008.NvsCRC.anno.txt", sep = "\t", header=T)
# assign class
anno$Class <- "NC"
anno[which(anno$Stage=="1S"),]$Class <- "S1"
anno[which(anno$Stage=="2S"),]$Class <- "S2"
anno[which(anno$Stage=="3S"),]$Class <- "S3"
anno[which(anno$Stage=="4S"),]$Class <- "S4"head(mx[,1:5]) s1S1 s1S2 s1S3 s2S1 s2S2
A-NT2RP7011570 11 10 6 0 9
C-ADG04260 1 1 1 1 1
C-ADG07684 2 3 5 3 5
C-ASTRO3000154 9 9 10 5 11
C-BRACE2001543 0 0 0 0 0
C-BRACE2001954 1 2 1 1 14.1 Sample QC
library size

detected genes

Control genes

Sample filtering
4.2 Gene QC
gene expression

Gene filtering
4.3 Save data
4.4 Visualization
The PCA plot



The tSNE map



4.5 Normalization
Systematic biases

Size factor & global-scale normalization

CPM


TMM (edgeR)


SF (DESeq2)


scran



Optional: rank-based methods

Scripts and Code
Software/Packages
References
Last updated