HGACH: Hypergraph Attention Convolutional Hashing for Semi-supervised Cross-modal Retrieval

Abstract

The semi-supervised paradigm, which combines a small number of labeled instances with a large number of unlabeled ones, has recently garnered significant attention in cross-modal retrieval. It is particularly advantageous because it not only harnesses the supervisory signals from labeled data but also exploits the latent information embedded in unlabeled samples. Nevertheless, previous works predominantly focus on pairwise relationships between instances and overlook higher-order relationships among samples, and consequently fail to fully exploit the underlying structure of the data. To bridge this gap, we propose a novel hypergraph attention convolutional hashing (HGACH) method for semi-supervised cross-modal retrieval. Unlike prior works, HGACH incorporates a dedicated mechanism that prioritizes unlabeled samples, enabling the model to capture complex higher-order dependencies via hypergraph attention convolutional networks. This allows the interdependencies between modalities to be modeled more faithfully and significantly boosts retrieval accuracy. In addition, a robust similarity matrix is carefully designed to explicitly model both inter-modality and intra-modality distances, which is essential for correctly identifying similarities between different types of data. Furthermore, for labeled instances, we propose a supervised loss that preserves similarity according to the given labels, ensuring that the model's predictions are consistent with the labeled data. Experimental results demonstrate that HGACH outperforms existing state-of-the-art (SOTA) methods in retrieval performance, showcasing its effectiveness on complex cross-modal retrieval tasks. The code is available at https://anonymous.4open.science/r/HGACH-1D7F/.
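
To make the hypergraph attention convolution mentioned in the abstract concrete, the sketch below shows a minimal, self-contained layer in PyTorch. It is an illustrative assumption rather than the authors' implementation (their code is at the repository linked above): it follows the standard hypergraph propagation rule X' = Dv^{-1/2} H W De^{-1} H^T Dv^{-1/2} X Theta, with the fixed hyperedge weights W replaced by attention scores computed from node and hyperedge features. The class name HypergraphAttentionConv, the mean-pooled hyperedge features, and all tensor shapes are hypothetical choices made for the example.

import torch
import torch.nn as nn
import torch.nn.functional as F

class HypergraphAttentionConv(nn.Module):
    """Minimal sketch of a hypergraph attention convolution layer (not the HGACH code)."""

    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.theta = nn.Linear(in_dim, out_dim, bias=False)   # node feature transform
        self.att = nn.Linear(2 * out_dim, 1, bias=False)      # (node, hyperedge) scoring

    def forward(self, x, incidence):
        # x: (N, in_dim) node features; incidence: (N, E) binary incidence matrix H
        x = self.theta(x)                                      # (N, out_dim)
        # Hyperedge features as the mean of their member nodes (an illustrative choice).
        edge_deg = incidence.sum(dim=0).clamp(min=1)           # (E,)
        edge_feat = (incidence.t() @ x) / edge_deg.unsqueeze(1)
        # Attention score for every (node, hyperedge) incidence pair.
        n, e = incidence.shape
        pair = torch.cat([x.unsqueeze(1).expand(n, e, -1),
                          edge_feat.unsqueeze(0).expand(n, e, -1)], dim=-1)
        scores = F.leaky_relu(self.att(pair).squeeze(-1))      # (N, E)
        scores = scores.masked_fill(incidence == 0, float('-inf'))
        att = torch.softmax(scores, dim=1)                     # attention over hyperedges
        att = torch.nan_to_num(att)                            # nodes belonging to no hyperedge
        h = incidence * att                                    # attention-weighted incidence
        # Normalised two-step propagation: nodes -> hyperedges -> nodes.
        node_deg = h.sum(dim=1).clamp(min=1e-6)
        edge_deg = h.sum(dim=0).clamp(min=1e-6)
        msg = (h.t() @ (x / node_deg.sqrt().unsqueeze(1))) / edge_deg.unsqueeze(1)
        out = (h @ msg) / node_deg.sqrt().unsqueeze(1)
        return F.relu(out)

# Toy usage: 6 samples grouped into 3 hyperedges.
if __name__ == "__main__":
    x = torch.randn(6, 32)
    H = torch.tensor([[1, 0, 0], [1, 1, 0], [0, 1, 0],
                      [0, 1, 1], [0, 0, 1], [1, 0, 1]], dtype=torch.float)
    layer = HypergraphAttentionConv(32, 16)
    print(layer(x, H).shape)  # torch.Size([6, 16])

In a cross-modal setting, such a layer would be applied to joint image-text features, with hyperedges grouping labeled and unlabeled samples that share neighborhood structure; the resulting node embeddings would then feed the hashing heads.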