CTM-DETR——Frequency-Aware and Statistically Guided Transformer for Early Infrared Forest Fire Detection

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Early-stage infrared forest fire detection is severely hindered by strong background thermal interference and extremely weak fire radiation signals. Existing methods mainly rely on spatial-domain modeling and overlook the frequency-domain characteristics of flame thermal radiation, limiting robustness in complex environments. To address this challenge, we propose CTM-DETR, an end-to-end detection framework tailored for infrared forest fire monitoring. A frequency-aware backbone, CGlobalFilter, is introduced to explicitly model thermal radiation priors by performing real-spectrum filtering in the frequency domain, effectively suppressing non-fire thermal disturbances. In addition, a statistics-guided linear attention mechanism (TSSA) is designed, which replaces conventional pairwise similarity calculations with token-level second-order statistics, reducing attention complexity from O(N²) to O(N) while preserving global contextual modeling. To mitigate sample imbalance, a Matching-Aware Loss (MAL) is incorporated to adaptively reweight samples based on matching quality. Experiments on a constructed infrared forest fire dataset show that CTM-DETR surpasses RT-DETR, achieving a 3.1% mAP50 improvement, with 15.6% fewer parameters and 17.8% lower computational cost. Beyond performance gains, this work provides new insights into the frequency-domain and statistical properties of infrared flame radiation and offers a transferable paradigm for thermal imaging-based perception tasks.

Article activity feed