A method for automatically generating semantic information distribution maps of images

Abstract

Semantic information in images—such as meaningful, recognizable regions or objects that are incongruent with the overall scene—can effectively capture attention. The meaning map is currently the primary method for mapping the distribution of semantic information across visual stimuli. However, this approach relies on subjective human ratings of semantic meaningfulness and requires extensive manual annotation of each image prior to analysis. To address these limitations and enable rapid, efficient, and reproducible generation of semantic distribution maps for any given image, this paper proposes an automated method for constructing semantic information maps using multimodal large language models (MLLMs). Our approach generates two types of semantic maps: local semantic information maps, which quantify the semantic content at each spatial location within an image, and global semantic maps, which assess the contextual relevance of local regions to the overall scene. Additionally, the method can generate distribution maps of visual information associated with specific concepts. We argue that this method substantially improves the precision and flexibility with which semantic information in visual stimuli can be controlled, measured, and manipulated, thereby advancing research in visual attention and visual language processing. The method is currently under further refinement.
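The abstract does not specify the pipeline, but a local semantic map of the kind described could plausibly be assembled by tiling an image into patches, asking an MLLM to rate each patch's semantic meaningfulness, and arranging the scores into a grid. The sketch below illustrates only that scaffolding; all function names are hypothetical, and the MLLM query is stubbed with a simple variance proxy so the example runs end to end.

```python
import random

def rate_patch_meaningfulness(patch):
    # Hypothetical stand-in for the MLLM query: in the actual method each
    # patch would be sent to a multimodal large language model with a prompt
    # asking for a semantic-meaningfulness rating. A pixel-variance proxy
    # substitutes here so the pipeline is runnable.
    mean = sum(patch) / len(patch)
    return sum((v - mean) ** 2 for v in patch) / len(patch)

def local_semantic_map(image, patch_size=2):
    """Tile a 2-D grid of pixel values into patches and score each patch."""
    rows = len(image) // patch_size
    cols = len(image[0]) // patch_size
    scores = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            patch = [image[i * patch_size + di][j * patch_size + dj]
                     for di in range(patch_size) for dj in range(patch_size)]
            scores[i][j] = rate_patch_meaningfulness(patch)
    # Normalize to [0, 1] so maps are comparable across images.
    flat = [v for row in scores for v in row]
    lo, hi = min(flat), max(flat)
    if hi > lo:
        scores = [[(v - lo) / (hi - lo) for v in row] for row in scores]
    return scores

random.seed(0)
image = [[random.random() for _ in range(8)] for _ in range(8)]
smap = local_semantic_map(image, patch_size=2)
print(len(smap), len(smap[0]))  # 4 4
```

A global semantic map would differ mainly in the prompt: rather than rating a patch in isolation, the model would be asked how relevant the patch is to the scene as a whole, so each query would include both the patch and the full image as context.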
