The Impact of Scene Context on Visual Object Recognition: Comparing Humans, Monkeys, and Computational Models

Sara Djambazovska
Anaa Zafer
Hamidreza Ramezanpour
Gabriel Kreiman
Kohitij Kar

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

During natural vision, we rarely see objects in isolation but rather embedded in rich and complex contexts. Understanding how the brain recognizes objects in natural scenes by integrating contextual information remains a key challenge. To elucidate neural mechanisms compatible with human visual processing, we need an animal model that behaves similarly to humans, so that inferred neural mechanisms can provide hypotheses relevant to the human brain. Here we assessed whether rhesus macaques could model human context-driven object recognition by quantifying visual object identification abilities across variations in the amount, quality, and congruency of contextual cues. Behavioral metrics revealed strikingly similar context-dependent patterns between humans and monkeys. However, neural responses in the inferior temporal (IT) cortex of monkeys that were never explicitly trained to discriminate objects in context, as well as current artificial neural network models, could only partially explain this cross-species correspondence. The shared behavioral variance unexplained by context-naive neural data or computational models highlights fundamental knowledge gaps. Our findings demonstrate an intriguing alignment of human and monkey visual object processing that defies full explanation by either brain activity in a key visual region or state-of-the-art models.

Version published to 10.1101/2024.05.27.596127 on bioRxiv
Jun 1, 2024

A Context-Sensitive Neural Hierarchy for Evaluating Temporal Structure in Primate Vocalizations

This article has 5 authors:
1. Ding Cui
2. Margaret Loewith
3. Audrey Dureux
4. Alessandro Zanini
5. Stefan Everling
This article has no evaluationsLatest version Nov 14, 2025
Predictive vision-language integration in the human visual cortex

This article has 5 authors:
1. Shurui Li
2. Zheyu Jin
3. Ru-Yuan Zhang
4. Shi Gu
5. Yuanning Li
This article has no evaluationsLatest version Nov 4, 2025
Two cortical mechanisms for natural audiovisual processing

This article has 2 authors:
1. Subha Nawer Pushpita
2. Leila Wehbe
This article has no evaluationsLatest version Nov 6, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Context-Sensitive Neural Hierarchy for Evaluating Temporal Structure in Primate Vocalizations

Predictive vision-language integration in the human visual cortex

Two cortical mechanisms for natural audiovisual processing