Presenter: Jakob

Theory journal club, 07.01.2022

NeurIPS 2020

Do brains and CNNs use the same

(or at least similar) decision strategies?

classification accuracy is not sufficient to decide (in particular when accuracy is high): "two systems may achieve similar accuracy with very different strategies"
better measure: error consistency
- measure whether two systems make not just similar numbers of errors (~accuracy) but make similar mistakes per stimulus (~find the same individual stimuli difficult or easy)
- applicable to: algo-algo, human-human, algo-human

system1: 75%

system2: 75%

system1: 100%

system2: 50%

system1: 50%

system2: 100%

Error consistency

Consider two systems i and j:

How much do their decisions overlap?

observed overlap:

C^\text{obs}_{i,j} = \frac{e_{ij}}{n}

e_{ij}

number of equal responses

error consistency

\kappa = \frac{c^\text{obs}_{i,j} - c^\text{exp}_{i,j}}{1 - c^\text{exp}_{i,j}}

remove expected overlap

for high expected overlap additional overlap is more relevant

accuracy

expected overlap:

C^\text{exp}_{i,j} = p_ip_j + (1-p_i)(1-p_j)

(binomial)

Do better ImageNet models make more human-like errors?

Do better ImageNet models make more human-like errors (o.o.d)?

How is error consistency influenced by model architecture?

consistency by chance

Recurrence to the rescue!

"We developed CORnet-S, a shallow ANN with four anatomically mapped areas and recurrent connectivity, guided by Brain-Score, a new large-scale composite of neural and behavioral benchmarks for quantifying the functional fidelity of models of the primate ventral visual stream. Despite being significantly shallower than most models, CORnet-S is the top model on Brain-Score and outperforms similarly compact models on ImageNet. Moreover, our extensive analyses of CORnet-S circuitry variants reveal that recurrence is the main predictive factor of both Brain-Score and ImageNet top-1 performance. Finally, we report that the temporal evolution of the CORnet-S "IT" neural population resembles the actual monkey IT population dynamics. Taken together, these results establish CORnet-S, a compact, recurrent ANN, as the current best model of the primate ventral visual stream."

Kubelius ... DiCarlo: Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs [NeurIPS 2019 (oral)]

Do brains and CNNs use the same

(or at least similar) decision strategies?

Error consistency

Do better ImageNet models make more human-like errors?

Do better ImageNet models make more human-like errors (o.o.d)?

How is error consistency influenced by model architecture?

Recurrence to the rescue!

Recurrence to the rescue!

Recurrence to the rescue!

Conclusion

What now?

Beyond accuracy

Beyond accuracy

jakobj

Do brains and CNNs use the same

(or at least similar) decision strategies?

Error consistency

Do better ImageNet models make more human-like errors?

Do better ImageNet models make more human-like errors (o.o.d)?

How is error consistency influenced by model architecture?

Recurrence to the rescue!

Recurrence to the rescue!

Recurrence to the rescue!

Conclusion

What now?

Beyond accuracy

More from jakobj