Motion-driven adaptive frame selection strategy for video action recognition

Hao Ding
Chen Guo
Jing Sun
Xiaoping Jiang
Hongling Shi
Jianjin Li

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Frame selection is a fundamental issue in video action recognition. It aims to minimize temporal redundancy and reduce computational cost. Current frame sampling strategies either rely on US based on motion, lacking emphasis on discriminative frames, or employ complex learning models or additional modal information, compromising generalizability. To address these challenges, this paper presents an adaptive frame selection strategy. It filters redundant frames through motion information and models relationships between each frame and others, thereby predicting the significance of each frame. This strategy combines the advantages of motion prior information and supervised learning. During training, frame importance-related constraints are integrated, guiding frames selection with strong discriminative features as inputs for the action recognition network. This frame selection method is integrated with backbone network structures such as TDN, GCTDN, AIM, and tested on three action datasets, Diving-48, UCF101 and HMDB51. The improvement on action recognition achieved is 4.4% on the Diving-48 dataset, 1.9% on the UCF101 dataset and 2.3% on HMDB51 dataset. Experimental results demonstrate that our selection strategy can be integrated with state-of-the-art action recognition models, leading to improved recognition performance.

Version published to 10.1186/s13640-025-00675-2
Jun 3, 2025
Version published to 10.21203/rs.3.rs-5495634/v1 on Research Square
Jan 9, 2025

Enhancing Action Recognition via Dynamic Cross-Frame Differential Modeling

This article has 5 authors:
1. Qiuhong Tian
2. Tiancheng Chen
3. Lizao Zhang
4. Ziyu Yang
5. Fei Zeng
This article has no evaluationsLatest version Aug 18, 2025
TransMODAL: A Dual-Stream Transformer with Adaptive Co-Attention for Efficient Human Action Recognition

This article has 3 authors:
1. Majid Joudaki
2. Mehdi Imani
3. Hamid R. Arabnia
This article has no evaluationsLatest version Aug 21, 2025
Diving Performance Analysis with 3D Motion Knowledge Hypergraphs

This article has 4 authors:
1. Jingbo Wang
2. Yifan Xie
3. Yitao Xie
4. Hongyu Xiao
This article has no evaluationsLatest version Sep 8, 2025

Listed in

Abstract

Article activity feed

Related articles

Enhancing Action Recognition via Dynamic Cross-Frame Differential Modeling

TransMODAL: A Dual-Stream Transformer with Adaptive Co-Attention for Efficient Human Action Recognition

Diving Performance Analysis with 3D Motion Knowledge Hypergraphs