Exposure to naturalistic occlusion promotes generalized, human-like robustness in deep neural networks

David D Coggan
Frank Tong

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Human object recognition is robust to challenging conditions, such as when one’s view of an object is fragmented due to an occluding foreground object. In comparison, deep neural networks (DNNs) are typically more susceptible to occlusion, suggesting that human vision relies on distinct mechanisms. Here, we investigated the role of visual diet in the emergence of these mechanisms by asking whether human-like robustness might arise in DNNs when trained with image datasets that better reflect the properties of occlusion in natural vision. We trained convolutional and transformer DNNs to classify clear images only, images augmented with artificial occluders (i.e., geometric shapes) or natural occluders (objects segmented from photographs). We then evaluated DNN occlusion robustness and compared their performance profiles with 30 human participants. We found that DNNs trained with artificial occluders remained vulnerable to natural occlusion and exhibited less human-like performance than those trained with natural occlusion. Our findings suggest that human robustness to visual occlusion arises from learning to disentangle natural objects from each other rather than simply learning to recognize objects from partial views. They also imply that commonly used forms of artificial occlusion are unsuitable for the evaluation or promotion of robustness to real-world occlusion in DNNs.

Version published to 10.64898/2026.04.23.720370 on bioRxiv
Apr 27, 2026

A Unified Account of Lightness Illusions via Edge-Based Reconstruction of Natural Images

This article has 3 authors:
1. Srijani Saha
2. Talia Konkle
3. George A. Alvarez
This article has no evaluationsLatest version Apr 10, 2026
Deep Learning-Based Framework for Filtering Objectionable Scenes in Cartoon Videos

This article has 8 authors:
1. Irshad Ullah
2. Sameed ur Rehman
3. Wajahat Akbar
4. Altaf Hussain
5. Raaz Waheeb Attar
6. Ruzat Ullah
7. Tariq Hussain
8. Amal Hassan Alhazmi
This article has no evaluationsLatest version Apr 16, 2026
From visual appearance to material categories

This article has 6 authors:
1. Chenxi Liao
2. Filipp Schmidt
3. Jacob Raleigh Cheeseman
4. Masataka Sawayama
5. Roland Fleming
6. Bei Xiao
This article has no evaluationsLatest version Apr 17, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Unified Account of Lightness Illusions via Edge-Based Reconstruction of Natural Images

Deep Learning-Based Framework for Filtering Objectionable Scenes in Cartoon Videos

From visual appearance to material categories