ISNet: Towards Improving Separability for Remote Sensing Image Change Detection

Deep learning has substantially advanced remote sensing image change detection by extracting discriminative hierarchical features. However, because increasingly high-resolution remote sensing images offer abundant spatial details but limited spectral information, conventional backbone networks tend to produce blurry boundaries between different semantics in the hierarchical features. This explains why most false alarms in the final predictions are distributed around change boundaries. To alleviate the problem, we focus on feature refinement and propose a deep learning network with improved separability (ISNet). ISNet benefits from two strategies for refining bitemporal feature hierarchies: 1) margin maximization, which clarifies the gap between changed and unchanged semantics, and 2) a targeted arrangement of attention mechanisms, which directs the use of channel attention (CA) and spatial attention (SA) for highlighting semantic and positional information, respectively. Specifically, we insert CA modules into weight-sharing backbone networks to facilitate semantic-specific feature extraction. The semantic boundaries in the extracted bitemporal hierarchical features are then clarified by margin maximization modules, followed by SA modules that enhance positional change responses. A top-down fusion pathway ensures that the final refined features cover multiscale representations and have strong separability for remote sensing image change detection. Extensive experimental evaluations demonstrate that ISNet achieves state-of-the-art performance on the LEVIR-CD, SYSU-CD, and Season-Varying datasets in terms of overall accuracy (OA), Intersection over Union (IoU), and F1 score. Code is available at https://github.com/xingronaldo/ISNet.
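
For readers unfamiliar with the three reported metrics, the following is a minimal sketch of how OA, IoU, and F1 are commonly computed for the changed class of a binary change map. It uses the standard confusion-matrix definitions and is not taken from the ISNet repository; the function name change_detection_metrics and the toy masks are hypothetical and serve only as illustration.

```python
def change_detection_metrics(pred, target):
    """Compute OA, IoU, and F1 for the 'changed' class.

    pred and target are equally sized 2-D lists of 0 (unchanged) / 1 (changed).
    Hypothetical helper for illustration; not part of the ISNet code base.
    """
    tp = fp = fn = tn = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p == 1 and t == 1:
                tp += 1          # change correctly detected
            elif p == 1 and t == 0:
                fp += 1          # false alarm
            elif p == 0 and t == 1:
                fn += 1          # missed change
            else:
                tn += 1          # unchanged pixel correctly rejected

    total = tp + fp + fn + tn
    union = tp + fp + fn
    # Guard against empty inputs or masks without any changed pixel.
    oa = (tp + tn) / total if total else 0.0
    iou = tp / union if union else 0.0
    f1 = 2 * tp / (2 * tp + fp + fn) if union else 0.0
    return {"OA": oa, "IoU": iou, "F1": f1}


# Toy 2x3 masks (made up for this example).
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
metrics = change_detection_metrics(pred, target)
print(metrics)
```

In this toy case there are two true positives, one false alarm, one missed change, and two true negatives, so IoU is 2/4 and OA and F1 are both 4/6. Because IoU and F1 ignore true negatives, they are more sensitive than OA to errors near change boundaries when unchanged pixels dominate the image.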