Luisa Cutillo, l.cutillo@leeds.ac.uk, University of Leeds
in collaboration with
Bailey Andrew, and David Westhead, UoL
Examples:
undirected graphical model
X1
X3
X2
X4
X5
X6
X7
X8
As
Vertices V
Edges E
Global Markov Property:
the absence of an edge between nodes in A and in B corresponds to conditional independence of the random vectors A, B given the separating set C
special case of conditional independence graph where
Edges weights E ∝ Precision matrix
Vertices V
partial correlation
(Dempster, 72) it encodes the conditional independence structure
The sparsity pattern of Θ expresses conditional independence relations encoded in the corresponding GGM
Conditional Independence = SPARSITY!
Genes 1, 2, 3
IEstimated gene-gene Pearson correlation coefficients (a) with their respective full order partial correlation coefficients (b) for single-cell RNA sequencing data of melanoma metastases (Tirosh et al., 2016).
Sparsity assumption => max graph degree d<<p
is reasonable in many contexts!
Example: Gene interaction networks
However the " Bet on Sparsity principle" introduced Tibshirani 2001, "In praise of sparsity and convexity":
(...no procedure does well in dense problems!)
Graphical Lasso (Friedman, Hastie, Tibshirani 2008):
imposes an penalty for the estimation of
Features
Samples
Data
Single cell data
extract the conditional independence structure between genes and cells, inferring a network both at genes level and at cells level.
Cells
Genes
| 2 | ... | 10 |
|---|---|---|
| : | ... | : |
| 5 | ... | 7 |
We need a general framework that models conditional independence relationships between features and data points together.
Preserves the matrix structure by using a Kronecker sum (KS) for the precision matrixes
KS => Cartesian product of graphs' adjacency matrix
(eg. Frames x Pixels)
(Kalaitzis et al. (2013))
Limitations:
https://github.com/luisacutillo78/Scalable_Bigraphical_Lasso.git
If we remove the regularization, we need only 1 eigendecomposition!
In place of regularization, use thresholding
We may be interested in graph representations of the people, species, and metabolites.
GmGm addresses this problem!
Tensors (i.e. modalities) sharing an axis will be drawn independently from a Kronecker-sum normal distribution and parameterized by the same precision matrix
2025
2025
Genes
Cells
| 2 | ... | 10 |
|---|---|---|
| : | ... | : |
| 5 | ... | 7 |
Dependency between genes
Dependency between cells
X
Y
X
Y
Z
The Strong Product differentiate and learns:
single sample
parameters!
We can impose a specific structure on
Genes
Cells
| 2 | ... | 10 |
|---|---|---|
| : | ... | : |
| 5 | ... | 7 |
Strong Product model: We add these together!
Synthetic data ground truth
(here y=1, x=2.5)
with A, B, C are Barabasi-Albert random graphs with 500 nodes each
Our Strong Product is the only one that learns both types of column graphs
288 mouse embryo stem cell scRNA-seq dataset from Buettner et al. (2015), limited to 167 genes mitosis-related genes cell cycle labelled (G1,S,G2M).
Assortativity:
measures tendency of cells within a stage to connect
1 (tend to connect)
0 (no tendency)
-1 (tend not connect)
Instructor: Bailey Andrew, University of Leeds
Ask questions if stuck or curious!