Multi-Object Tracking with Integrated Appearance and Mamba-Based Motion Features

Abstract

Multi-object tracking plays a vital role in computer vision and is widely applied in domains such as surveillance, autonomous driving, and video analytics. In real-world scenarios, fast object motion and frequent occlusions often lead to blurred or missing appearance information, resulting in reduced detection accuracy and failed trajectory association. Such challenges severely compromise the robustness and accuracy of tracking. To address these problems, this study proposes MTrack, a novel multi-object tracking framework that integrates both appearance and motion cues. Inspired by state-space modeling, we first design a motion feature module that models object movements in both the horizontal and vertical directions across frames using the Mamba state-space model, capturing global motion features. Next, we design a motion-appearance coupling module that effectively integrates the appearance features from the previous frame with the modeled motion features, thereby enhancing the model's ability to localize objects when appearance cues are unreliable. MTrack achieves strong performance on multiple benchmarks, including MOT17, MOT20, DanceTrack, and SportsMOT, and significantly outperforms existing methods on key metrics such as HOTA and IDF1, demonstrating both its effectiveness and generalization capability.
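The two modules described above can be illustrated with a toy sketch. The code below is not the paper's implementation: it replaces the Mamba block with a plain diagonal linear state-space recurrence (h_t = A h_{t-1} + B u_t, y_t = C h_t) over per-frame box displacements, and the learned coupling module with a simple weighted blend. All function names, dimensions, and the fusion rule are assumptions for illustration only.

```python
import numpy as np

def ssm_motion_features(deltas, state_dim=8, seed=0):
    """Encode a track's per-frame motion (dx, dy) with a diagonal linear
    state-space scan: h_t = A * h_{t-1} + B @ u_t, y_t = C @ h_t.
    Toy stand-in for the paper's Mamba-based motion feature module."""
    rng = np.random.default_rng(seed)
    # Stable per-channel decay keeps the recurrence bounded.
    A = np.exp(-rng.uniform(0.1, 1.0, state_dim))
    B = rng.normal(size=(state_dim, deltas.shape[1]))
    C = rng.normal(size=(state_dim,))
    h = np.zeros(state_dim)
    outputs = []
    for u in deltas:              # sequential scan over frames
        h = A * h + B @ u
        outputs.append(C @ h)
    return np.array(outputs), h   # per-frame readouts, final motion state

def couple_motion_appearance(appearance, motion_state, alpha=0.5):
    """Hypothetical fusion of the previous frame's appearance embedding
    with the motion state; the paper's coupling module is learned."""
    m = np.resize(motion_state, appearance.shape)
    return alpha * appearance + (1.0 - alpha) * m

# Example: a track moving roughly one unit right per frame.
deltas = np.array([[1.0, 0.5], [1.1, 0.4], [0.9, 0.6]])
readouts, state = ssm_motion_features(deltas)
fused = couple_motion_appearance(np.ones(8), state)
```

When appearance is occluded or blurred, `fused` still carries the motion state, which is the intuition behind the coupling module: localization can fall back on modeled dynamics when visual cues degrade.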
