One Hundred Neural Networks and Brains Watching Videos: Lessons from Alignment

Christina Sartzetaki
Gemma Roig
Cees G.M. Snoek
Iris I.A. Groen

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

What can we learn from comparing video models to human brains, arguably the most efficient and effective video processing systems in existence? Our work takes a step towards answering this question by performing the first large-scale benchmarking of deep video models on representational alignment to the human brain, using publicly available models and a recently released video brain imaging (fMRI) dataset. We disentangle four factors of variation in the models (temporal modeling, classification task, architecture, and training dataset) that affect alignment to the brain, which we measure by conducting Representational Similarity Analysis across multiple brain regions and model layers. We show that temporal modeling is key for alignment to brain regions involved in early visual processing, while a relevant classification task is key for alignment to higher-level regions. Moreover, we identify clear differences between the brain scoring patterns across layers of CNNs and Transformers, and reveal how training dataset biases transfer to alignment with functionally selective brain areas. Additionally, we uncover a negative correlation of computational complexity to brain alignment. Measuring a total of 99 neural networks and 10 human brains watching videos, we aim to forge a path that widens our understanding of temporal and semantic video representations in brains and machines, ideally leading towards more efficient video models and more mechanistic explanations of processing in the human brain.

Version published to 10.1101/2024.12.05.626975v1 on bioRxiv
Dec 9, 2024

Brain-Guided Convolutional Neural Networks Reveal Task-Specific Representations in Scene Processing

This article has 6 authors:
1. Bruce C. Hansen
2. Michelle R. Greene
3. Henry A.S. Lewinsohn
4. Audrey E. Kris
5. Sophie Smyth
6. Binghui Tang
This article has no evaluationsLatest version Jan 8, 2025
Brain-Guided Convolutional Neural Networks Reveal Task-Specific Representations in Scene Processing

This article has 6 authors:
1. Bruce C. Hansen
2. Michelle R. Greene
3. Henry A.S. Lewinsohn
4. Audrey E. Kris
5. Sophie Smyth
6. Binghui Tang
This article has no evaluationsLatest version Jan 13, 2025
MortX: A Domain Generalization Benchmark for Mouse Cortex Segmentation and Registration

This article has 4 authors:
1. Asim Iqbal
2. Romesa Khan
3. Edith M. Schneider Gasser
4. Theofanis Karayannis
This article has no evaluationsLatest version Dec 1, 2024

Listed in

Abstract

Article activity feed

Related articles

Brain-Guided Convolutional Neural Networks Reveal Task-Specific Representations in Scene Processing

Brain-Guided Convolutional Neural Networks Reveal Task-Specific Representations in Scene Processing

MortX: A Domain Generalization Benchmark for Mouse Cortex Segmentation and Registration