FMT: Foundation Model-based Transformer for Remote Sensing Change Detection

xianran zhang
Zhengpeng Li
Jiansheng Wu

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Change detection is a popular topic in the field of remote sensing, aiming to detect significant changes between bi-temporal images. With the technological advancements, advanced satellites capture more complex geographical information, making change detection more challenging. Existing models often use convolutional networks and Transformers to learn changes between bi-temporal images, but they often fail to fully utilize the knowledge and scalability of the foundation model, neglecting the importance of filtering invariant background information, which leads to unfiltered tokens interfering with model performance. In this work, we demonstrate the advantages of the foundation model and the necessity of token filtering. We propose a Foundation Model-based Transformer for Remote Sensing Change Detection (FMT). We introduce a collaborative feature extraction module that utilises a modified ResNet18 and a frozen foundation model. We also propose a multi-scale cross-axis attention fusion module that combines general features extracted by the foundation model with ResNet18 backbone network features. Additionally, we design an anchor token filtering module that uses algorithms such as TVConv, k-means, and top-k to filter change region anchor tokens based on a fuzzy prediction map and background information. Subsequently, the relationships between tokens are learned through a self-attention mechanism, and finally, a dual cross-attention module is used to interact with original and enhanced features, generating a prediction map with a convolutional decoder. The FMT was evaluated on the WHU‑CD, LEVIR‑CD, and DSIFN datasets, demonstrating superior performance compared to existing models.

Version published to 10.21203/rs.3.rs-7112910/v1 on Research Square
Nov 17, 2025

Research on Meteorological Recognition Method Based on Improved Spatio-Temporal Two-Stream Network Model

This article has 5 authors:
1. Shi-Qi Yuan
2. Yu Cao
3. Zhong Tao
4. Xu Cheng
5. Xiaodong Wang
This article has no evaluationsLatest version Nov 18, 2025
Attention-Enhanced YOLOv8s Framework for Accurate Small-Scale Landslide Detection from Remote Sensing Imagery

This article has 9 authors:
1. Yan-chang Jia
2. Shu-yan Hua
3. Hong-fei Wang
4. Tong Jiang
5. Qi-qi Zhao
6. Wen-long Ma
7. Tian-bao Niu
8. Zhi-hao Chen
9. Ya-hui Liang
This article has no evaluationsLatest version Nov 4, 2025
Remote Sensing Image Change Detection Using Deep Learning Techniques: A Comprehensive Survey

This article has 8 authors:
1. Tao Lei
2. Shuxin Zhang
3. Shaoxiong Lin
4. Tongfei Liu
5. Zhiyong Lv
6. Tao Gao
7. Maoguo Gong
8. Asoke K. Nandi
This article has no evaluationsLatest version Oct 21, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Research on Meteorological Recognition Method Based on Improved Spatio-Temporal Two-Stream Network Model

Attention-Enhanced YOLOv8s Framework for Accurate Small-Scale Landslide Detection from Remote Sensing Imagery

Remote Sensing Image Change Detection Using Deep Learning Techniques: A Comprehensive Survey