CTM-DETR——Frequency-Aware and Statistically Guided Transformer for Early Infrared Forest Fire Detection
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Early-stage infrared forest fire detection is severely hindered by strong background thermal interference and extremely weak fire radiation signals. Existing methods mainly rely on spatial-domain modeling and overlook the frequency-domain characteristics of flame thermal radiation, limiting robustness in complex environments. To address this challenge, we propose CTM-DETR, an end-to-end detection framework tailored for infrared forest fire monitoring. A frequency-aware backbone, CGlobalFilter, is introduced to explicitly model thermal radiation priors by performing real-spectrum filtering in the frequency domain, effectively suppressing non-fire thermal disturbances. In addition, a statistics-guided linear attention mechanism (TSSA) is designed, which replaces conventional pairwise similarity calculations with token-level second-order statistics, reducing attention complexity from O(N²) to O(N) while preserving global contextual modeling. To mitigate sample imbalance, a Matching-Aware Loss (MAL) is incorporated to adaptively reweight samples based on matching quality. Experiments on a constructed infrared forest fire dataset show that CTM-DETR surpasses RT-DETR, achieving a 3.1% mAP50 improvement, with 15.6% fewer parameters and 17.8% lower computational cost. Beyond performance gains, this work provides new insights into the frequency-domain and statistical properties of infrared flame radiation and offers a transferable paradigm for thermal imaging-based perception tasks.