Do monkeys see the way we do? Qualitative similarities and differences between monkey and human perception
Curation statements for this article:-
Curated by eLife
eLife Assessment
This valuable study presents a comprehensive comparison of human and macaque monkey behavior across a range of visual perceptual phenomena. The use of a unified oddball visual search paradigm enables direct cross-species comparison while minimizing task-related confounds. It provides solid evidence that visual perception is largely similar between these two species, with some interesting exceptions. These insights into qualitative and quantitative differences between species are relevant for evaluating macaques as a model organism for understanding human vision.
This article has been Reviewed by the following groups
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
- Evaluated articles (eLife)
Abstract
Monkeys are widely used as model organisms for human vision and cognition. While their anatomy and physiology strongly correspond to humans, it is unclear to what extent their perception matches with ours. Previous studies have evaluated specific aspects of perception, after extensive training on customized tasks that could have altered their perception. To resolve these issues, we trained monkeys to perform an oddball visual search task on natural images, tested them on carefully controlled images to detect a variety of perceptual phenomena, and compared them with humans. This revealed a number of qualitative similarities and differences. Like humans, monkeys showed similar object relations, Weber’s law and amodal completion. However, unlike humans, monkeys did not show mirror confusion or a global advantage. These findings represent a first comprehensive evaluation of visual perception in monkeys and humans, revealing the limitations of monkeys as a model organism for human vision.
Article activity feed
-
eLife Assessment
This valuable study presents a comprehensive comparison of human and macaque monkey behavior across a range of visual perceptual phenomena. The use of a unified oddball visual search paradigm enables direct cross-species comparison while minimizing task-related confounds. It provides solid evidence that visual perception is largely similar between these two species, with some interesting exceptions. These insights into qualitative and quantitative differences between species are relevant for evaluating macaques as a model organism for understanding human vision.
-
Reviewer #1 (Public review):
Summary:
The authors set out to conduct a behavioral comparison of macaque and human vision across a wide range of visual properties. Such a comparison is critical for evaluating the use of macaques as a model system for understanding human vision and the underlying neural mechanisms. This goal represents a unique endeavour since prior studies have typically focused on only highly specific tasks. While the authors found consistent coarse representational structure for objects, evidence for Weber's Law and amodal completion, there was divergence for mirror image confusion and the use of global or local image properties.
Strengths:
There are three major strengths of the study. First, the authors employed a behavioral paradigm (oddball search) that allowed them to test multiple perceptual phenomena without …
Reviewer #1 (Public review):
Summary:
The authors set out to conduct a behavioral comparison of macaque and human vision across a wide range of visual properties. Such a comparison is critical for evaluating the use of macaques as a model system for understanding human vision and the underlying neural mechanisms. This goal represents a unique endeavour since prior studies have typically focused on only highly specific tasks. While the authors found consistent coarse representational structure for objects, evidence for Weber's Law and amodal completion, there was divergence for mirror image confusion and the use of global or local image properties.
Strengths:
There are three major strengths of the study. First, the authors employed a behavioral paradigm (oddball search) that allowed them to test multiple perceptual phenomena without having to train the macaques on the specific type of stimuli tested. Second, humans and macaques could be tested in an identical manner. Third, the authors tested a range of different visual properties and phenomena, allowing a broader comparison between species.
There are also some weaknesses to the study (described below), but that doesn't change the fact that the authors have demonstrated and validated a novel approach for systematic and comprehensive comparisons of vision across species.
Weaknesses:
The weaknesses of the study arise in part because of the breadth of the work. In cases where there are divergences between the two species, it would be helpful to know what might account for such divergence, to have more depth. Is it really a species difference, or could there be a different account? For example, does the difference in mirror image confusion arise because the stimuli were objects that would have been highly familiar to the humans but not the macaques? Further, the authors often used small sets of stimuli (e.g. 8 objects only in the test of object similarity; a small set of highly specific occlusion stimuli), and how well the findings will generalize beyond those stimuli is unclear.
The authors discuss the implications of training macaques to perform specific tasks on specific stimuli in comparing across species. While I agree that extensive training in monkeys could change perception, it is important to also consider that humans have been extensively trained through the types of visual tasks we conduct throughout our lives, so I'm not sure it is universally true that the best comparison is between humans and untrained monkeys. But this just consideration just highlights the general problem of comparing across species.
-
Reviewer #2 (Public review):
Summary:
The macaque monkey is often considered as the animal model of choice to study the neural correlates of visual perception, due to the close similarities to humans in terms of anatomy, physiology and behaviour (Van Essen and Dierker, 2007; DiCarlo et al., 2012; Roelfsema and Treue, 2014; Picaud et al., 2019; Van Essen et al., 2019; Hesse and Tsao, 2020). Quite some studies have been performed to compare the behaviour of macaque monkeys and humans on visual perception tasks. However, it remains difficult to compare the results of these studies as the methods that are used differ significantly between these studies. Furthermore, behavioural studies of macaque monkeys often involve extensive training as the tasks were relatively hard, making it difficult to compare the results with humans, who generally …
Reviewer #2 (Public review):
Summary:
The macaque monkey is often considered as the animal model of choice to study the neural correlates of visual perception, due to the close similarities to humans in terms of anatomy, physiology and behaviour (Van Essen and Dierker, 2007; DiCarlo et al., 2012; Roelfsema and Treue, 2014; Picaud et al., 2019; Van Essen et al., 2019; Hesse and Tsao, 2020). Quite some studies have been performed to compare the behaviour of macaque monkeys and humans on visual perception tasks. However, it remains difficult to compare the results of these studies as the methods that are used differ significantly between these studies. Furthermore, behavioural studies of macaque monkeys often involve extensive training as the tasks were relatively hard, making it difficult to compare the results with humans, who generally require very little training. The authors present a set of experiments to compare visual perception between macaque monkeys and humans, using the exact same behavioral task that is easy to learn and therefore requires very little training. As expected, they overall find that the two species behave similarly. However, they find a number of interesting exceptions.
Strengths:
A major strength of the current study is the relatively large number of tasks that were tested in the same subjects. This is made possible by using the oddball visual search task, which macaque monkeys can learn very quickly. This means that few trials are sufficient to obtain a significant difference between conditions, minimizing learning effects. Although this type of task has been used in previous studies (Sablé-Meyer et al., 2021), the current manuscript makes better use of the advantages and explains them more explicitly.
In addition, the study finds a number of interesting differences between macaque monkeys and humans. In particular, while humans can dissociate horizontally mirrored images better than vertically mirrored images, monkeys show no difference between these two conditions (Experiment 4). Also, while humans dissociate images better based on the global shape of a stimulus, monkeys dissociate images better based on local elements of a stimulus (Experiments 5 and 6). Although these findings are largely a replication of previous results, they have not yet been studied together with other tasks within the same individual subjects, and the low number of trials avoids any learning effects.
Weaknesses:
A weakness of the study is that while the objects that were used can be considered to be familiar to humans, they are not familiar to macaque monkeys.
In Experiment 4, humans can be expected to have 3D representations of familiar objects such as a Roman helmet or an office chair. Humans can therefore be expected to have view-invariant representations of these objects, predominantly for rotations around the vertical axis of the object (as movements are most common in the horizontal plane). This can explain why only humans confuse objects more often when mirrored vertically than when mirrored horizontally.
Similarly, in Experiment 5, humans can be expected to be familiar with abstract geometric shapes such as squares and circles, while monkeys likely are not. This could explain why monkeys find it hard to recognize these geometric shapes in the global shape of the stimuli, even when thin grey lines are drawn to connect the local elements that constitute the global shape (Experiment 6). Instead, the combination of local shapes can be expected to form a texture that might be more easily recognized by the monkeys.
More generally, as proposed by Fagot et al, it might well be that monkeys tend to conceive stimuli as a combination of low-level visual features, instead of as references to objects in the outside world, as humans have learned to do (Fagot et al., 2010). This line of critique would be relevant to take into account.
Another weakness could be that only three monkeys are tested, while 24 human subjects are tested. According to some theoretical work, a finding in 3 animals is not sufficient to make a claim about an animal species (Fries and Maris, 2022). However, it seems that the results are largely consistent between the different monkeys. Moreover, the results generally agree with the results from previous literature.
The conclusions by the authors are therefore largely supported by the results. Some results could be strengthened by additional experiments, or at least a more extensive discussion of the potential weaknesses.
The potential impact of the paper is significant, as a start of a comprehensive comparison of visual perception between humans and macaque monkeys, which can inspire other labs to contribute to. This comparison can also be extended to other animal species (e.g. crows and rodents), as well as to different types of artificial neural networks (Leibo et al., 2018). -
Reviewer #3 (Public review):
Summary:
In this study, Cherian and colleagues compare visual perception between humans and monkeys using a common oddball visual search task across a battery of perceptual phenomena. By keeping the task constant and varying only the stimulus sets, the authors aim to isolate perceptual similarities and differences between species. Across six experiments, they report that monkeys and humans share similarities in coarse object representations, Weber's law, and amodal completion, but differ in mirror confusion and global/local processing.
Strengths:
A major strength of the study is the unified experimental framework. The authors designed the experiments such that the task procedures are identical across conditions and species, differing only in the images shown. This is a significant methodological advantage, …
Reviewer #3 (Public review):
Summary:
In this study, Cherian and colleagues compare visual perception between humans and monkeys using a common oddball visual search task across a battery of perceptual phenomena. By keeping the task constant and varying only the stimulus sets, the authors aim to isolate perceptual similarities and differences between species. Across six experiments, they report that monkeys and humans share similarities in coarse object representations, Weber's law, and amodal completion, but differ in mirror confusion and global/local processing.
Strengths:
A major strength of the study is the unified experimental framework. The authors designed the experiments such that the task procedures are identical across conditions and species, differing only in the images shown. This is a significant methodological advantage, as it minimizes task-related confounds that often complicate cross-species and cross-experiment comparisons. As a result, observed similarities and differences can be more directly attributed to perceptual processes rather than differences in training or task demands. This allows for a more comprehensive evaluation of visual perception than is typical in the literature, where individual studies often focus on a single effect with specialized training. The data are carefully collected, and the analyses are systematic and appropriate for the questions posed.
Weaknesses:
Despite its strengths, the study is largely descriptive and provides limited mechanistic or theoretical explanation for the observed similarities and differences. While the authors document several convergences and divergences between humans and monkeys, there is relatively little discussion of why these patterns arise or how they relate to existing theories of visual processing. As a result, it is difficult to assess the broader implications or generalizability of the findings beyond the specific task and stimuli used.
Relatedly, the rationale for selecting the particular set of perceptual phenomena is not fully developed. Some tasks appear motivated by prior work comparing humans and deep neural networks, but it is unclear whether this set constitutes a representative or theoretically grounded sampling of visual perception. Without a clearer justification, it is difficult to interpret the absence or presence of specific effects (e.g., mirror confusion or global advantage) as reflecting fundamental species similarities/differences.
-
-
-