Flexible computation of object motion and depth based on viewing geometry inferred from optic flow
Abstract
Vision is an active process. We move our eyes and head to acquire useful information and to track objects of interest. While these movements are essential for many behaviors, they greatly complicate the analysis of retinal image motion—the image motion of an object reflects both how that object moves in the world and how the eye moves relative to the scene. Our brain must account for the visual consequences of self-motion to accurately perceive the 3D layout and motion of objects in the scene. Traditionally, compensation for eye movements (e.g., smooth pursuit) has been modeled as a simple vector subtraction process. While these models are effective for pure eye rotations and 2D scenes, we show that they fail to apply to more natural viewing geometries involving combinations of eye rotation and translation. We develop theoretical predictions for how perception of object motion and depth should depend on the observer’s inferred viewing geometry. Through psychophysical experiments, we demonstrate novel perceptual biases that manifest when different viewing geometries are simulated by optic flow, in the absence of physical eye movements. Remarkably, these biases occur automatically, without training or feedback, and are well predicted by our theoretical framework. A neural network model trained to perform the same tasks exhibits neural response patterns similar to those observed in macaque area MT, suggesting a possible neural basis for these adaptive computations. Our findings demonstrate that the visual system automatically infers viewing geometry from optic flow and flexibly attributes components of image motion to either self-motion or depth structure according to the inferred geometry. Our findings unify previously separate bodies of work by showing that the visual consequences of self-motion play a crucial role in computing object motion and depth, thus enabling the visual system to adaptively perceive a dynamic 3D environment.
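The failure of simple vector subtraction under combined rotation and translation can be illustrated with a minimal numerical sketch. All names and values below are illustrative assumptions, not from the paper: self-motion due to pure eye rotation adds a (locally) depth-independent component to image motion, whereas translation adds a component scaled by inverse depth (motion parallax), so subtracting a single eye-velocity vector cannot compensate at all depths.

```python
import numpy as np

def retinal_flow(object_motion, eye_rotation, eye_translation, depth):
    """Image motion of a point (small-angle sketch): the object's own
    motion, plus a depth-independent rotational flow component, plus a
    translational component that scales with inverse depth."""
    return object_motion + eye_rotation + eye_translation / depth

object_motion = np.array([1.0, 0.0])   # deg/s, true object motion (assumed)
eye_rotation  = np.array([0.0, 2.0])   # uniform pursuit-induced flow
eye_trans     = np.array([3.0, 0.0])   # translation-induced flow at unit depth

# Pure eye rotation: subtracting the eye-velocity signal recovers the
# object's motion exactly, regardless of its depth.
flow = retinal_flow(object_motion, eye_rotation, np.zeros(2), depth=2.0)
print(flow - eye_rotation)             # equals object_motion

# Rotation plus translation: the same subtraction leaves a residual
# that depends on the object's depth, so no single subtracted vector
# compensates for all depths simultaneously.
for depth in (1.0, 2.0, 4.0):
    flow = retinal_flow(object_motion, eye_rotation, eye_trans, depth)
    residual = (flow - eye_rotation) - object_motion
    print(depth, residual)             # residual = eye_trans / depth
```

The depth-dependent residual is exactly the component that, in the framework described above, the visual system must attribute either to object motion or to depth structure, depending on the viewing geometry it infers from optic flow.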