High-dimensional latent structure in visual cortex responses to natural images

Brice Ménard

Raj Magesh Gauthaman

Michael F. Bonner

Theories should be as simple as possible ...

F \propto \frac{M m}{r^2}

Newtonian gravity

\vdots

x_1

x_2

x_n

\Sigma

\sigma

w_1

w_2

w_n

abstract neuron

R_{\mu \nu} - \frac{1}{2}R g_{\mu \nu} + \Lambda g_{\mu \nu} \propto T_{\mu \nu}

Theories should be as simple as possible ...
... but no simpler!

general relativity

Hodgkin-Huxley model

We often simplify the primate visual system ...

dimensionality reduction

PCA

NMF

ICA

... yielding valuable insights ...

faces

words

bodies

scenes

food

social interactions

yeet

eigendecomposition

kayfabe

... but perhaps we oversimplify it.

large-scale datasets

high-SNR neuroimaging

modern analysis techniques

Implications for visual processing

high-dimensional code

low-dimensional code

explains many invariances observed in visual cortex

readily interpretable!

high representational capacity, expressive

makes learning new tasks easier

Is the visual code low-dimensional?

" [...] the topographies in VT cortex that support a wide range of stimulus distinctions, including distinctions among responses to complex video segments, can be modeled with 35 basis functions"

Haxby et al., 2011, 2020

Computational goal: compressing dimensionality?

~87 dimensions of object representations in monkey IT

"A progressive shrinkage in the intrinsic dimensionality of object response manifolds at higher cortical levels might simplify the task of discriminating different objects or object categories."

Lehky et al., 2014, 2016

But high-dimensional coding has benefits!

detecting high-dimensional codes requires large datasets!

high representational capacity, expressive

makes learning new tasks easier

Fusi et al. (2016)

Elmoznino & Bonner (2024)

Does human vision use a

low- or high-dimensional code?

The Natural Scenes fMRI dataset

Allen et al. (2022)

8 subjects

7 T fMRI

10,000 images (repeated 3×)

from the COCO dataset

"Have you seen this image before?"

continuous recognition

Ideal dataset to characterize dimensionality

Allen et al. (2022)

very large-scale
complex naturalistic stimuli

How can we estimate dimensionality?

same geometry,

new perspective!

ambient
dimensions

voxel 1

voxel 2

rotation

latent
dimensions

Key statistic: the covariance spectrum

latent dimensions sorted by variance

variance along the dimension

low-dimensional code

high-dimensional code

variance along each dimension

stimulus-related signal

trial-specific noise

Should we just apply PCA?

Cross-decomposition: a better PCA

generalizes

... across stimulus repetitions

... to novel images

X_\text{train}

Y_\text{train}

trial 1

trial 2

neurons (or) voxels

stimuli

\text{cov}\left(X_\text{train}, Y_\text{train}\right) = U \Sigma V^\top

Learn latent dimensions

Step 1

Step 2

Evaluate reliable variance

\hat{\Sigma} = \text{cov}\left(X_\text{test} U, Y_\text{test} V\right)

X_\text{test}

Y_\text{test}

If there is no stimulus-related signal, expected value = 0

PCA vs cross-decomposition

logarithmic binning + 8-fold cross-validation

Our covariance spectra

latent dimensions sorted by variance on the training set

reliable variance on the test set

\text{covariance} \propto \left(\text{rank}\right)^\alpha

no small "core subset"
thousands of dimensions!
limited by dataset size

\text{covariance} \propto \left(\text{rank}\right)^\alpha

A power-law covariance spectrum

more data

more dimensions?

Zipf's law: the classic power law

\text{word frequency} \propto \left(\text{rank}\right)^{-1}

Using only the 200 most frequent words...

i love sci-fi and am willing to put up with a lot. Sci-fi movies/tv are usually underfunded, under-appreciated and misunderstood. I tried to like this, I really did, but it is to good tv sci-fi as babylon 5 is to star trek (the original). Silly prosthetics, cheap cardboard sets, stilted dialogues, cg that doesn't match the background, and painfully one-dimensional characters cannot be overcome with a'sci-fi'setting. (I'm sure there are those of you out there who think babylon 5 is good sci-fi tv. It's not. It's clichéd and uninspiring.) while us viewers might like emotion and character development, sci-fi is a genre that does not take itself seriously (cf. Star trek). [...]

i love and to up with a lot. Are, and. I to like this, I really did, but it is to good as is to (the).,,, that doesn't the, and characters be with a''. (I'm there are those of you out there who think is good. It's not. It's and.) while us like and character, is a that does not take [...]