Rethinking Image Quality Assessment through the Lens of Task Utility in Embodied Settings

Jirong Zha
Yemin Wang
Xiangmin Yi
Siqi Peng
Yingfeng Chen
Chen Gao
Xinlei Chen

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Image quality assessment (IQA) underpins embodied imaging pipelines by judging whether visual quality satisfies downstream tasks, yet most methods learn task-agnostic scores aligned with generic human ratings on static benchmarks. This objective mismatches the embodied and interactive settings, where image adequacy depends on task goals, context, and action requirements that shape an agent’s decisions. We argue that IQA should shift from score regression to goal-conditioned judgment defined by the utility of embodied tasks. Such utility-aware assessment demands models with strong reasoning, grounding, and tool-use capabilities, as enabled by multimodal large language models (MLLMs) agent. We advocate rethinking IQA from the perspective of embodied task utility and outline benchmarks, evaluation protocols, and research directions for developing MLLM-based embodied IQA agents.

Version published to 10.31224/6844
Apr 23, 2026

A hard look at TVA’s promises: Comparing the same quantitative estimates of attention across two tasks

This article has 2 authors:
1. Kai Biermeier
2. Ingrid Scharlau
This article has no evaluationsLatest version Apr 17, 2026
Perceiving Uncertainty: how visual encoding, socially mediated doubt, and task complexity influence human decision-making

This article has 3 authors:
1. John R. Taylor
2. Stafford van Putten
3. Christopher J. Stanton
This article has no evaluationsLatest version Apr 16, 2026
Diffusion-based stimulus optimization reveals functional organization across higher visual cortex

This article has 5 authors:
1. Margaret M. Henderson
2. Andrew F. Luo
3. Sungjoon Park
4. Michael J. Tarr
5. Leila Wehbe
This article has no evaluationsLatest version May 15, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A hard look at TVA’s promises: Comparing the same quantitative estimates of attention across two tasks

Perceiving Uncertainty: how visual encoding, socially mediated doubt, and task complexity influence human decision-making

Diffusion-based stimulus optimization reveals functional organization across higher visual cortex