1
2, 1
1
light enters your eyes
1
signals travel to visual cortex
2
cortical processing
3
perceptual experience
4
“ambient”
dimensions
neural response
stimulus
vector of neuron responses
neuron 1
neuron 4
neuron N
...
all axes are orthogonal (90°)
neuron 2
neuron 1
neuron 3
neuron 4
...
neuron N
neuron 2
neuron 1
neuron 3
neuron 4
...
neuron N
has many people
has no people
“latent”
dimension
all axes are orthogonal (90°)
“animacy” dimension in human visual cortex
“food” dimension in human visual cortex
66 dimensions of mental object representations (inferred from behavioral data)
When does this approach make sense?
data manifold spans a low-dimensional subspace
data spans both ambient dimensions
data lives in a 1-D subspace
“latent dimension”
When does this approach make sense?
When does this approach make sense?
Catalog all the latent dimensions!
data manifold spans a low-dimensional subspace
“ [...] the topographies in ventrotemporal cortex (VTC) that support a wide range of stimulus distinctions [...] can be modeled with 35 basis functions”
“Thus, our fMRI data are sufficient to recover semantic spaces for individual subjects that consist of 6 to 8 dimensions.”
66 dimensions
Principal Component Analysis
Non-negative Matrix Factorization
Sparse Positive Similarity Embedding
| dimensions | stimuli | system | method | |
|---|---|---|---|---|
| Haxby et al. (2011) | 35 | movies | human ventrotemporal cortex | PCA |
| Huth et al. (2012) | 6-8 (4 shared) | movies | human cortex | PCA |
| Lehky et al. (2014) | 60 (~87 estimate) | objects | macaque inferotemporal cortex | PCA |
| Tarhan et al. (2020) | 5 | visual actions | human cortex | k-means |
| Hebart et al. (2020) | 49 | objects | human behavior | SPoSE |
| Khosla et al. (2022) | 20 (5 reliable) | scenes | human ventral visual stream | NMF |
| Hebart et al. (2023) | 66 | objects | human behavior | SPoSE |
A lot of converging evidence that visual representations are low-dimensional across stimuli, species, and methods
~87 dimensions of object representations in monkey inferotemporal (IT) cortex
“A progressive shrinkage in the intrinsic dimensionality of object response manifolds at higher cortical levels might simplify the task of discriminating different objects or object categories.”
neural representations
mental representations
What if dimensionality doesn't saturate with dataset size?
high representational capacity, expressive
makes learning new tasks easier
Could visual representations be more high-dimensional than previously found?
Visual cortex representations are high-dimensional.
1
They are self-similar over a huge range of spatial scales.
2
The same geometry underlies mental representations of images.
3
Gauthaman, Menard & Bonner, in preparation
8 subjects
7 T fMRI
"Have you seen this image before?"
continuous recognition
Credit: NordicNeuroLab
functional magnetic resonance imaging
resolution
~1 mm
~3 s
coarse measure of neural activity
very large-scale
naturalistic stimuli
complex scenes
rotation
Principal Component Analysis
rank 1
rank 2
latent dimensions
ambient dimensions
high variance
low variance
latent dimensions sorted by variance
variance along the dimension
low-dimensional code
high-dimensional code
no variance along other dimensions
variance along many dimensions
"core subset" of relevant dimensions
variance along each dimension
stimulus-related signal
trial-specific noise
generalizes
... across stimulus repetitions
... to novel images
trial 1
trial 2
neurons (or) voxels
stimuli
Learn latent dimensions
Step 1
Step 2
Evaluate reliable variance
If there is no reliable signal in the data, expected value = 0
the (cross-)covariance matrix
its singular value decomposition
"splitting a matrix into the sum of rank-1 matrices"
... but cross-decomposition
has an expected value of 0.
PCA
cross-decomposition
reliable variance on the test set
latent dimensions sorted by variance on the training set
vs
binned logarithmically to increase SNR
artifact of analyzing a large ROI?
High-dimensional across multiple levels of the visual hierarchy
| dimensions | stimuli | system | method | |
|---|---|---|---|---|
| Huth et al. (2012) | 6-8 (4 shared) | movies | human cortex | PCA |
| Khosla et al. (2022) | 20 (5 reliable) | scenes | human ventral visual stream | NMF |
vs
reliable, and shared
reliable, but idiosyncratic
generalizes
... across stimulus repetitions
... to novel images
... across participants
subject 1
subject 2
neurons (or) voxels
stimuli
Learn latent dimensions
Step 1
Step 2
Evaluate reliable variance
reliable, and shared
No, they represent real high-dimensional signal!
1
vs
vs
striking universality across individuals
high-dimensional
power-law covariance
6 mice
calcium imaging
~10,000 V1 neurons
~2,800 natural images
Similar neural population statistics
Why does this happen?
also shared across individuals!
Visual cortex representations are high-dimensional.
1
They are self-similar over a huge range of spatial scales.
2
The same geometry underlies mental representations of images.
3
Gauthaman, Menard & Bonner, in preparation
Learn latent dimensions
Step 1
Each is a linear combination of voxels
voxel 1
voxel 4
voxel N
...
low-rank
high-rank
1
10
100
1,000
large spatial scale
small spatial scale
They show a rank-dependent spatial pattern.
1
30 mm
We can measure the characteristic spatial scale of each latent dimension.
human
mouse
10,000 neurons
primary visual cortex (V1)
human
mouse
Most variance in the data is on large spatial scales.
1-D Fourier basis
2-D Fourier basis
Answer: when data are translationally invariant
number of neurons in human V1
spacing between neurons
a code that leverages all latent dimensions
spatial binning
high-resolution data
simulated low-resolution data
2
High-resolution neuroimaging probes low-variance dimensions
Invariance to spatial scale explains similar findings across fields
Visual cortex representations are high-dimensional.
1
They are self-similar over a huge range of spatial scales.
2
The same geometry underlies mental representations of images.
3
Gauthaman, Menard & Bonner, in preparation
~26,000 images
1,854 categories
nameable, picturable nouns
What is the dimensionality of these neural representations of objects?
We replicate our earlier findings in the Natural Scenes Dataset.
increasingly abstract object representations
Which is the odd-one-out?
... and used to learn an embedding capturing the mental representations
that participants use when performing the odd-one-out task
What is the underlying structure of these mental representations?
Covariance decays as a power-law
Latent dimensions have significant correlations
There are power-law spectra in both cortical and behavioral data, but are they the same underlying representation?
generalizes
... to novel images
... across experiments
mental
neural
neurons (or) voxels
stimuli
Learn latent dimensions
Step 1
Step 2
Evaluate reliable variance
Systematic increase in shared dimensionality between neural and mental representations!
increasingly abstract object representations
Only 66 dimensions of mental representations were detected.
Scaling up the dataset doesn't help much!
49-D to 66-D
A potential cause: task complexity?
between-category triplets
(only coarse distinctions required)
within-category triplets
(fine distinctions required)
also has power-law covariance structure
coarse distinctions
fine distinctions
VGG-16 deep neural network
?
DNN
Yes, object representations increase in dimensionality when a DNN is forced to learn a more difficult task.
Visual cortex representations are high-dimensional.
1
They are self-similar over a huge range of spatial scales.
2
The same geometry underlies mental representations of images.
3
power-law covariance spectra
shared across subjects
remarkably similar across species
invariant to spatial scale
shared neural and mental representations
higher ranks encode finer distinctions
The Bonner Lab
The Isik Lab
The research presented in this dissertation was supported in part by a
“Thank you for paying us to do
what we would gladly pay to do.”
– S P Arun, Vision Lab, Indian Institute of Science
across species, in systems neuroscience
artificial neural networks
out-of-distribution classification accuracy
adversarial robustness
outside the visual system
prefrontal cortex
hand kinematics
Catalog all the latent dimensions!
Understand the generative mechanisms!
Not helpful to interpret every latent dimension
We want theory!
trial 1
trial 2
neurons (or) voxels
stimuli
Limitations of cross-decomposition:
We need new tools:
More variety in
For example,
also has power-law covariance structure
Using artificial vision systems to study the role of dimensionality, especially with causal manipulations
deep neural network
How are other scale-free aspects of cortex
related to the scale-free representations we find here?
Chihye (Kelsey) Han
Are humans behaviorally sensitive to low-variance visual information present in the tail of the spectrum?
How much information about the stimulus can be encoded?
How stable is the representation to irrelevant perturbations?
low-D
high-D
robustness
expressivity
i love sci-fi and am willing to put up with a lot. Sci-fi movies/tv are usually underfunded, under-appreciated and misunderstood. I tried to like this, I really did, but it is to good tv sci-fi as babylon 5 is to star trek (the original). Silly prosthetics, cheap cardboard sets, stilted dialogues, cg that doesn't match the background, and painfully one-dimensional characters cannot be overcome with a'sci-fi'setting. (I'm sure there are those of you out there who think babylon 5 is good sci-fi tv. It's not. It's clichéd and uninspiring.) while us viewers might like emotion and character development, sci-fi is a genre that does not take itself seriously (cf. Star trek). [...]
i love and to up with a lot. Are, and. I to like this, I really did, but it is to good as is to (the).,,, that doesn't the, and characters be with a''. (I'm there are those of you out there who think is good. It's not. It's and.) while us like and character, is a that does not take [...]