Petukhov V, Igolkina A, Rydbirk R, Mei S, Christoffersen L, Kharchenko P, Khodosevich K
viktor.petukhov@pm.me
Case samples
Control samples
What is going on in our data?
What is going on in our data?
Align samples
scVI, Conos, ..., Seurat
See the review from the Theis lab
Align samples
Perform joint annotation
Align samples
Perform joint annotation
Run differential expression
Align samples
Perform joint annotation
Run differential expression
Run Gene Ontology analysis
Align samples
Perform joint annotation
Run differential expression
Run Gene Ontology analysis
Compare cell type proportions
Align samples
Perform joint annotation
Run differential expression
Run Gene Ontology analysis
Compare cell type proportions
There are up to hundreds significant GO terms per type
There are up to 1000 significant DE genes per type
Compositional analysis
Gene expression analysis
Gene expression analysis
Compositional analysis
Cluster-based
Cluster-free
Control
Multiple sclerosis
Problem: changes are not independent
*Credit to Anna Igolkina
Problem: changes are not independent
*Credit to Anna Igolkina
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
Neurons
Glia
Control
Multiple sclerosis
Effect size
Significance
L2-L3 EN
L2-L3 EN
L2-L3 EN
Color by batch
Aggregated across all cell types
separation
L2-L3 EN
control
epilepsy
control
epilepsy
Differential expression
PDE10A
MS
Control
NCKAP5
MS
Control
Gene expression analysis
Compositional analysis
Cluster-based
Cluster-free
Control
Multiple sclerosis
No significance!
No significance!
Effect size
Significance
Condition
Condition
Sex
Condition
Sex
Protocol
L2_Cux2_Lamp5 programs
L2_Cux2_Lamp5 programs
Use top DE genes instead of p-cutoff!
Epilepsy dataset
Cancer dataset
Use pseudo-bulk DE, not single-cell methods!
Single-cell-based methods fail even on a theoretical level
Konstantin Khodosevich lab
Peter Kharchenko
Co-authors
viktor.petukhov@pm.me
Gene expression analysis
Compositional analysis
Cluster-based
Cluster-free
Control
Multiple sclerosis