Skip to main content
PLOS One logoLink to PLOS One
. 2026 Feb 23;21(2):e0342350. doi: 10.1371/journal.pone.0342350

A fine-grained evaluation framework for urban land cover change based on feature monitoring with remotely sensed imagery

Qiang Liu 1, Jiachen Guo 1, Chuanxing Zheng 2,*, Feng Ling 3, Zhixiang Da 4, Wenlong Song 5, Fengjiao Zhao 1, Jijian Lian 1
Editor: Chong Xu6
PMCID: PMC12928421  PMID: 41729865

Abstract

Against the backdrop of accelerating global climate change and urbanization, urban land cover change has emerged as a critical indicator for understanding the dynamic evolution of cities and the transformation of urban ecosystems. This study proposes a data-driven framework for fine-scale urban land cover change assessment based on the UASFNet model, enabling high-precision evaluation of urban land cover dynamics. The approach first performs preprocessing and co-registration of bi-temporal remote sensing images from the study area, and applies the trained UASFNet model to identify urban land cover types and extract land cover information for each temporal phase. The Analytic Hierarchy Process (AHP) is then employed to determine the weights of various indicator factors. By integrating building disturbance, greenbelt disturbance, and road disturbance indices, the framework quantitatively evaluates the intensity of land cover change at both pixel and regional scales. Experimental results across three benchmark datasets, consisting of high-resolution sub-meter RGB urban remote sensing imagery, demonstrate that UASFNet achieves superior segmentation accuracy, with mean Intersection over Union (mIoU) values of 91.52%, 93.31%, and 88.90%, substantially outperforming several state-of-the-art baseline models. Spatial analysis of the Langfang urban area (2017–2023) reveals a marked increase in impervious surface coverage (+16.86%) and a sharp decline in greenbelt (−40%), with the urban landscape exhibiting a multi-core, belt-like expansion pattern oriented toward newly developed districts. The proposed framework not only enhances the interpretability and generalization of remote sensing models in complex urban environments but also provides a scalable analytical tool to support urban spatial planning, ecological conservation, and sustainable city governance.

1. Introduction

With the continued intensification of global urbanization and climate change, the increasing frequency of extreme weather events has become one of the most prominent environmental challenges [14]. Observational evidence indicates that global mean temperature has risen by approximately 1.1 °C over the past century [5], accompanied by a significant increase in both the intensity and frequency of extreme precipitation events [6,7]. Against this backdrop, impervious surfaces within urban areas have expanded rapidly, profoundly altering the physical structure of regional underlying surfaces [812]. According to the United Nations World Urbanization Prospects (2022), the global proportion of urban population has increased from 30% in 1950 to 57% in 2020 and is projected to exceed 68% by 2050 [13]. The urbanization process in China has been particularly remarkable, with the urbanization rate soaring from 17.9% in 1978 to around 67% by 2025, while urban construction land area has expanded nearly fivefold [14]. This rapid transformation has led to large-scale replacement of natural surfaces such as greenbelts and wetlands by impervious materials, resulting in a pronounced spatial reorganization of regional land cover structures. Therefore, establishing a scientifically robust and fine-scale urban land cover assessment framework capable of accurately capturing the spatial distribution and temporal evolution of surface cover types has become an urgent need for understanding urban environmental change and advancing spatial governance.

In recent years, urban land cover change has become a central topic in global environmental change and sustainable development research [1517]. Existing studies have primarily focused on the identification of urban expansion patterns [18,19], the analysis of land-use driving mechanisms [20,21], and the evaluation of landscape evolution dynamics [2224]. Earlier approaches mainly relied on statistical data or low-resolution remote sensing imagery, using indicators such as land-use transition matrices, landscape metrics, and temporal change rates to characterize urban expansion [25]. However, these methods are limited in both spatial resolution and classification accuracy, making it difficult to capture the fine structural and dynamic features of complex urban surfaces [26]. With the rapid advancement of high-resolution remote sensing data and deep learning technologies, research efforts have increasingly shifted toward fine-grained classification and dynamic monitoring of urban surfaces using multi-temporal, multi-scale, and multi-source imagery [2730]. In particular, the introduction of deep learning methods has greatly enhanced the automation and accuracy of land-cover feature recognition [28,31]. Although models such as UNet [32], DeepLab [33], and SegFormer [34] have achieved notable progress in remote sensing image segmentation, most existing studies remain focused on natural surfaces or general suburban areas [3539]. Fine-scale identification and change detection within typical urban cores—characterized by dense building clusters, intricate road networks, and interwoven greenbelts—are still relatively underexplored.

Urban land cover change not only reflects the rate and direction of spatial expansion but is also directly linked to the evolution of ecological security patterns and the vulnerability of urban systems [11,40,41]. In existing studies, mainstream assessment approaches typically rely on indicator-based frameworks and weighting analyses [42]—employing techniques such as the AHP [43], Entropy Weight Method [44], and Fuzzy Comprehensive Evaluation [45]—to quantify the magnitude of land-use change and its ecological implications. These methods have been widely applied in regional-scale analyses of urban vulnerability, sustainability, and landscape pattern dynamics [46]. However, conventional approaches that depend on statistical data or low spatiotemporal resolution imagery are insufficient to capture subtle surface disturbances and spatial heterogeneity associated with urbanization processes. By leveraging pixel-level land cover change results derived from deep learning models as the primary analytical driver, a fine-scale urban surface change assessment framework can be established to accurately identify and quantify urban land cover dynamics. However, existing studies often treat fine-grained land cover mapping and change assessment as relatively independent tasks. Deep learning–based segmentation models are mainly evaluated in terms of pixel-level accuracy, while indicator-based change assessment frameworks commonly rely on coarse-resolution or aggregated inputs. As a result, the linkage between boundary-aware urban land cover recognition and disturbance-level interpretation remains weak, particularly in dense urban cores. This gap highlights the need for an integrated framework that explicitly connects fine-scale semantic segmentation with weighted, interpretable urban land cover change assessment.

This study aims to propose a data-driven framework for fine-scale urban land cover change assessment based on UASFNet, designed to support urban spatial governance and ecological optimization decisions while revealing the dynamic evolution of surface systems during urbanization. The potential academic contributions of this study are as follows: (i) An Urban Adaptive Shared-feature Attention Network (UASFNet) is developed for fine-grained urban land cover mapping, and its effectiveness in enhancing boundary delineation and inter-class separability is systematically evaluated against representative semantic segmentation baselines in complex urban scenes;(ii) A weighted disturbance integration scheme based on the Analytic Hierarchy Process (AHP) is introduced to fuse building, road, and greenbelt change indicators, enabling interpretable and spatially coherent urban land cover change assessment;(iii) A multi-scale evaluation framework combining pixel-level classification and region-level aggregation is proposed to improve the characterization and interpretability of urban land cover change patterns beyond pixel-only analyses. The structure of this paper is organized as follows: The Materials and Methods section describes the materials and methods, including the proposed methodological framework and data preparation. The Results section presents the results of urban land cover classification and change analysis based on the UASFNet model. The Discussion section discusses the performance, applicability, and implications of the proposed approach. The Conclusion section concludes the study.

2. Materials and methods

2.1. Fine-scale assessment framework for urban land cover change

This study proposes a data-driven framework for fine-scale urban land cover change assessment based on the UASFNet architecture (Fig 1). The framework consists of four main stages. First, bi-temporal high-resolution remote sensing images of the study area are preprocessed and co-registered to ensure spatial consistency between different observation periods. These images serve as the input data for pixel-level land cover classification. Second, the trained UASFNet model is applied to the preprocessed imagery to identify urban land cover types and generate fine-grained land cover maps for each temporal phase. The resulting classification outputs provide the basis for subsequent change analysis. Third, building disturbance, greenbelt disturbance, and road disturbance indicators are derived from the classification results. The Analytic Hierarchy Process (AHP) is employed to determine the relative weights of these indicators, enabling the integration of heterogeneous change information. Finally, the weighted disturbance indicators are aggregated to evaluate the comprehensive intensity and spatial distribution of urban land cover change across the study area.

Fig 1. Overall framework of the proposed fine-scale urban land cover change assessment method.

Fig 1

All elements were created by the authors.

2.2. Data preparation

2.2.1. Study area.

In the experiments, to evaluate the segmentation performance and computational efficiency of the proposed algorithmic model, a comparative study was conducted using three high-resolution RGB image datasets obtained from different geographic regions and satellite sources: (a) the Langfang dataset, (b) the Potsdam dataset, and (c) the Guiyang dataset. Fig 2 illustrates the geographical locations of the study areas and the corresponding images for the three datasets.

Fig 2. Location map of the study area.

Fig 2

(A) the Langfang dataset (China). (B) the Potsdam dataset (Germany). (C) the Guiyang dataset (China). The global location map was generated using the Natural Earth dataset, which is in the public domain. Representative remote sensing imagery was obtained from the Copernicus Data Space Ecosystem (Copernicus Sentinel data), the ISPRS Potsdam dataset and processed by the authors. All map elements and annotations were created by the authors.

2.2.2. Database construction.

Fig 3 presents sample images and corresponding semantic label maps from the Potsdam datasets. The detailed parameters of each dataset are as follows:

Fig 3. Example image patches and corresponding ground truth labels from the Potsdam dataset.

Fig 3

The figure was generated by the authors for visualization purposes.

  • (a) Langfang Dataset: This dataset was constructed using RGB three-band remote sensing images of the Langfang Economic Development Zone with a spatial resolution of 0.6 m. Four land-use classes were annotated using GIS software, and the dataset was partitioned into training, validation, and test sets using a spatial block segmentation method at a ratio of 70%/ 10%/ 20%, respectively.

  • (b) Potsdam Dataset: This dataset is derived from ultra–high-resolution TOP imagery with a ground sampling distance (GSD) of 5 cm. The Potsdam region is known for its complex building layouts and dense urban structures. The dataset covers an area of 3.42 km² and includes pixel-level annotations for six semantic categories. It has become a standard benchmark for semantic segmentation research. In this study, an improved four-class semantic labeling scheme—including buildings, greenbelts, roads, and others—was adopted to better suit urban analysis tasks, resulting in 2299 training, 605 validation, and 1694 test image patches of size 1024 × 1024.

  • (c) Guiyang Dataset: This dataset covers the urban core area of Guiyang City, China, representing a typical mountainous urban environment. It was constructed following the same data preparation, annotation protocol, and data partition strategy as the Langfang dataset, ensuring consistency and comparability across datasets.

For all datasets, standard preprocessing procedures were applied prior to model training and inference to ensure consistency across multi-source and multi-temporal imagery. These procedures include geometric co-registration between different acquisition periods, spatial resampling to a unified spatial resolution, and normalization of RGB values. No aggressive radiometric correction or handcrafted feature extraction was introduced, in order to preserve original spatial patterns. The same preprocessing pipeline was applied to all datasets and baseline models to ensure a fair and unbiased comparison [47,48]. All remote sensing imagery used in this study was obtained from publicly accessible datasets under open data policies, including Copernicus Sentinel data and the ISPRS Potsdam dataset. All maps and visualizations were generated by the authors. No third-party online basemap services were used.

2.3. Structure of the urban adaptive shared-feature attention network (UASFNet)

In urban land cover change detection, different surface objects exhibit pronounced spatial heterogeneity and semantic coupling relationships. Urban features such as buildings, roads, and greenbelts differ substantially in spatial scale and morphological characteristics, and their spectral representations in imagery often display strong local discontinuities and abrupt boundary transitions. To address these challenges, this study proposes an Urban Adaptive Shared Feature Attention (UASF) module that achieves adaptive fusion of semantic consistency, spatial continuity, and structural stability through a unified mapping mechanism (See S1 Table).

Given an input feature X ∈ RB×C×H×W, the UASF module can be formulated within the joint domain Z=ϑ of the feature space and the geometric space ϑ as follows:

FUASF(X)=Φproj(A(X)KUASF(X))+X (1)

Here, KUASF(X) denotes the semantic–structural consistency kernel mapping, A(X) represents the global adaptive gating operator, and Φproj(·) refers to the channel reconstruction and projection mapping. This formulation defines the overall mapping relationship of the module in the form of operator composition, enabling UASF to achieve unified modeling of semantic aggregation and dynamic modulation within the joint feature–structure space. Unlike conventional residual structures, the joint mapping mechanism of UASF not only performs nonlinear feature reconstruction but also establishes a synergistic linkage between semantic and geometric representations at the operator level.

First, the core of the UASF module is the semantic–structural consistency kernel function KUASF(X), which simultaneously computes correlations within both the feature domain and the gradient domain to capture long-range semantic dependencies and local boundary characteristics. Its formal representation is given as:

KUASF(X)=Softmax[ϕ(X)ϕ(X)Tφ(X)]Θ(X) (2)

Here, ϕ(X) denotes the shared semantic embedding obtained through depthwise separable convolution, which is used to extract spatial contextual features; φ(X) represents the structure modulation function based on the Sobel gradient operator, designed to characterize object boundary information; Θ(X) is the linear reconstruction operator responsible for generating the fused feature representation.

After completing the joint modeling of semantic and structural features, the UASF module adaptively balances the contribution weights of different feature domains through a global gating mechanism. The gating operator A(X) is defined as:

A(X)=Γgate(GAP(X))=Softmax[W2σ(W1GAP(X))] (3)

Here, GAP(X) denotes the global average pooling operation, Γgate(·) represents the dynamic gating function composed of two fully connected layers, σ(·) is the nonlinear activation function, and W1 and W2 are the learnable weight matrices. Through this mechanism, the model dynamically adjusts the weighting between semantic and structural features during the fusion process according to the global feature distribution, thereby achieving adaptive responses under varying levels of spatial complexity.

The semantic–structural consistency kernel encourages the alignment of semantic representations with geometric cues, which is beneficial for preserving object boundaries and reducing confusion between visually similar land cover classes. The gradient-domain formulation highlights local structural variations, such as edges and shape transitions, that are particularly relevant for fine-scale urban patterns. The global gating operator adaptively modulates the relative contributions of semantic and structural information across spatial locations, thereby promoting spatial coherence while limiting redundant feature propagation.

At the architectural level, the UASF module is embedded into the bottleneck and decoder stages of the U-Net backbone, forming the core framework of UASFNet. By aggregating global semantic features during the encoding phase and restoring spatial boundary details during the decoding phase, UASFNet establishes a robust adaptive coupling between global and local representations. This design not only effectively mitigates issues of object overlap and boundary discontinuity commonly observed in high-resolution imagery but also provides, from a theoretical perspective, an interpretable end-to-end modeling mechanism of semantic–geometric consistency. In summary, under a unified mechanism, UASFNet achieves coordinated optimization of global modeling, structural constraint, and adaptive fusion, significantly enhancing boundary delineation accuracy and semantic representation in high-resolution urban remote sensing scenes, thereby providing a stable feature foundation for subsequent change detection and spatiotemporal disturbance analysis.

2.4. Evaluation metrics

This study conducts a comprehensive evaluation of the land classification model’s performance using multiple metrics to ensure the objectivity and reliability of the results. The calculation method is shown in Table 1 [49]. The mean Intersection over Union (mIoU) was used as the primary metric to quantify overall pixel-level classification accuracy across land-cover classes. Precision, recall, and F1-score were employed to further characterize class-wise discrimination performance, particularly for imbalanced or structurally complex categories. In addition, Kappa coefficient was included to evaluate the agreement between predicted and reference maps beyond chance, providing a robust measure of classification reliability. Together, these metrics capture not only pixel-wise accuracy but also classification consistency and robustness, which are essential for ensuring that the resulting land-cover maps can serve as reliable inputs for subsequent urban change and disturbance analysis.

Table 1. Evaluation index.

Effect Evaluation index and formula
Overall accuracy evaluation Accuracy=TP+TNTP+TN+FP+FN (4)
Classification ability assessment Precision=TPTP+FP,Recall=TPTP+FN (5)
MF1=1Ci=1C2×Precisioni×RecalliPrecisioni+Recalli (6)
Prediction region matching degree evaluation IoU=TPTP+FP+FN,MIoU=1Ci=1CIoUi (7)
Consistency evaluation K=P0Pe1Pe (8)

Here, TP, TN, FP, and FN represent true positives, true negatives, false positives, and false negatives, respectively. C is the total number of categories, and F1i is the F1 Score for the i-th category. Precision represents the proportion of actual positive samples among those predicted as positive. Recall represents the proportion of actual positive samples correctly identified as positive.

It should be noted that, as in most large-area urban land cover mapping studies, reference labels are subject to unavoidable uncertainties, particularly in transitional zones, complex building boundaries, narrow road segments, and shadow-affected areas. Since all models were trained and evaluated using the same reference data and partitioning strategy, such uncertainties affect all methods consistently and do not compromise the validity of the relative performance comparison.

2.5. Methods for urban land cover change assessment

This study proposes an urban land cover (LC) change assessment framework that integrates the advantages of image-based recognition and indicator system analysis. The specific steps are as follows: (1) The urban land cover recognition module is used to construct hierarchical layers for building disturbance, road disturbance, and greenbelt disturbance, upon which a comprehensive LC change evaluation indicator system is established. (2) AHP is then applied to calculate the weights of each indicator, enabling quantitative evaluation and hierarchical representation of urban land use value changes. Direct aggregation of these disturbance indicators would implicitly assume equal importance across different land-cover components, which may mask dominant drivers of urban land cover change and reduce interpretability in dense urban environments.

2.5.1. Selection and quantification of indicator factors.

The selection and quantification of index factors are mainly achieved through the following steps:

  • (1) Framework of index system

The structure of the indicator system in this study is mainly divided into a target layer and an indicator layer. The target layer represents the urban LC change assessment system. The indicator layer consists of the urban building disturbance layer, urban road disturbance layer, and urban greenbelt disturbance layer.

  • (2) Large-scale disturbance layer preparation

First, the satellite images are segmented using a sliding window method, dividing high-resolution remote sensing images from different time periods into multiple pixel-based image patches, each representing an independent spatiotemporal unit. Next, the pre-trained urban land cover classification model is applied to perform pixel-level multi-class classification on each image patch, and the classification results are stored in raster file format. Finally, the image patches are reassembled according to the original mapping scheme to reconstruct a complete regional land cover classification raster map, as well as individual raster layers for specific land-use types, for subsequent analysis and evaluation.

  • (3) Quantization of index factors

This study uses raster cells as the basic unit to quantify the disturbance areas of buildings, roads, and greenbelts. First, a raster layer with a 10 × 10 m resolution is generated within the study area, and the disturbance areas of different land types are assigned to the corresponding raster cells. The disturbance area is derived by calculating the difference in land type coverage within the same raster cell at different time points, thereby quantifying the LULC changes in urban land use.

2.5.2. Urban LULC changes calculation method.

  • (1) Index weight calculation method

In this study, weighting is introduced to explicitly reflect the differentiated contributions of building, road, and greenbelt disturbances to the overall urban land cover change assessment. The Analytic Hierarchy Process (AHP) is a mathematical method used for multi-criteria decision analysis, suitable for decision problems with multiple objectives or complex characteristics [50]. This method analyzes complex problems in depth and makes mathematical decisions based on limited quantitative information. In this study, AHP is applied for the quantitative analysis of urban LULC changes factors. The judgment matrix was constructed based on aggregated expert pairwise comparisons, and its logical consistency was verified using the AHP consistency ratio (CR).The specific steps are as follows:

First, each factor is compared pairwise, and a pairwise matrix is established based on the comparison results.

M=[@ccccc@1a12a13a1na211a23a2na31a321an1an21],aij=weight of attribute iweight of attribute j (9)

Where, aij represents the comparison result of the i th factor with the j th factor, and the value ranges from 1 to 9.

The matrix obtained through calculation can be verified for consistency using the Consistency Ratio (CR), which is calculated using the following formula:

CR=λmaxnn1·1RI (10)

Here, λmax is the maximum eigenvalue of the matrix, and n is the matrix dimension, and RI is the Random Consistency Index. If CR is less than or equal to 0.1, the judgment matrix has reasonable consistency; if CR is greater than 0.1, the pairwise matrix needs to be adjusted.

In this study, three major factors—buildings, greenbelts, and greenbelt—were considered. After consistency testing, the weight values for each factor were calculated and used for further creation of the urban LULC changes change map.

  • (2) Disturbance calculation method

This study assesses the overall urban disturbance level by considering the disturbances from urban buildings, roads, and greenbelts. The disturbance calculation formula for each grid cell is as follows:

Disturbancei=ωδBi+ωλRi+ωμGi (11)

In the equation, Bi, Ri, and Gi represent the urban building disturbance area, urban road disturbance area, and urban greenbelt disturbance area within the i-th grid, respectively. ωb, ωr, and ωg denote the weights of the urban building disturbance index, urban road disturbance index, and urban greenbelt disturbance index, respectively.

2.6. Computational environment and data augmentation strategy

In this study, the bi-temporal remote sensing interpretation module was implemented using Python and the PyTorch framework. All model training and evaluation were conducted on a Windows 11 operating system equipped with an NVIDIA® GeForce RTX™ 4090 GPU, ensuring efficient computational resource support. For data augmentation, random horizontal and vertical flipping, as well as rotation within ±15°, were applied to increase the diversity of the training dataset, thereby enhancing the model’s ability to generalize to unseen data.

During the training phase, the batch size is set to 4 based on the GPU memory capacity and model complexity, with an initial learning rate of 1e-4. The Adam optimizer is used to optimize the training process by automatically adjusting the gradient mean and variance for each parameter. The total number of training epochs is set to 300 to ensure that the model converges after sufficient learning. In classification, the model is divided into four categories (building, road, greenbelt, and others). A composite loss function combining categorical cross-entropy and Dice loss was adopted, with equal weighting, to balance classification accuracy, improve the recognition of hard samples, and optimize the segmentation of small target regions. This design also helps mitigate edge cases caused by class imbalance and limited training samples for certain land-cover categories, particularly for thin structures such as roads and fragmented greenbelts.

For performance evaluation, UASFNet was compared with several representative deep learning–based semantic segmentation models commonly used in urban land-cover mapping, including ResNet [36], UNetformer [37], CMTFNet [38], CM-Unet [35], and MFNet [39].

3. Results

3.1. Model performance

This section evaluates the classification performance of the proposed UASFNet in comparison with five models introduced in the Materials and Methods section across different datasets. By reporting both category-level and overall performance, this analysis aims to provide an objective assessment of the relative effectiveness of UASFNet for fine-grained urban land cover classification.

To validate the overall performance of the proposed UASFNet model, comparative experiments were conducted on three representative urban remote sensing datasets—Langfang, Potsdam, and Guiyang—against five representative models. The classification results of each model on the the Potsdam dataset are illustrated in Fig 4. As shown, the spatial recognition performance across different land-cover types varies considerably among models. For the building and road classes, UASFNet accurately preserves the integrity and continuity of object boundaries, producing sharp building edges and coherent road structures. In contrast, models such as CM-Unet and ResNet tend to exhibit blurred boundaries, road discontinuities, or local adhesions in dense urban areas. Regarding greenbelt identification, UNetformer and CMTFNet demonstrate satisfactory overall recognition of greenbelt areas but still produce scattered noise in regions affected by high reflectance or shadow interference. In contrast, UASFNet effectively suppresses such noise through its multi-scale shared-feature attention mechanism, achieving smoother boundary transitions and more coherent greenbelt patches. Overall, across various urban scenes, UASFNet exhibits superior spatial consistency and finer detail restoration, producing segmentation results that more closely approximate real-world land-cover distributions.

Fig 4. Visual comparison of urban land cover classification results produced by different models on the the Potsdam dataset.

Fig 4

All classification results were generated by the authors.

Table 2 summarizes the detailed quantitative results of all models across the three datasets. In the Langfang dataset, UASFNet achieved an average mIoU of 91.52%, outperforming the second-best model, CMTFNet, by 1.2%. The IoU values for buildings, greenbelts, and roads reached 96.50%, 95.61%, and 95.12%, respectively—the highest among all models—while the overall Accuracy and Kappa coefficient were 97.29% and 0.9580. On the Potsdam dataset, UASFNet also demonstrated superior performance, achieving an mIoU of 93.31%, an F1-score of 96.37%, and a Kappa coefficient of 0.9783—significantly outperforming MFNet (mIoU 91.96%) and UNetformer (90.55%).On the Guiyang dataset, UASFNet achieved the best overall performance, with an mIoU of 88.90%, an F1-score of 93.65%, and a Kappa value as high as 0.9493. Among the four land-cover categories, the IoU values for buildings, roads, and greenbelts reached 96.58%, 96.44%, and 94.62%, respectively. These results demonstrate consistent performance advantages over multiple baseline models and further support the robust generalization capability of UASFNet across different resolutions and domains.

Table 2. Quantitative performance comparison of different semantic segmentation models on three urban remote sensing datasets.

Dataset Method IoU(%) per Category Metrics
Building Greenbelt Road Others Accuracy MF1 MIoU Kappa
Langfang Dataset ResNet 95.16 91.16 87.83 63.50 94.49 91.02 84.41 0.9153
UNetformer 92.78 92.49 87.56 76.54 94.70 93.11 87.34 0.9179
CMTFNet 96.71 92.66 92.77 79.07 96.71 94.77 90.30 0.9277
CM-Unet 95.11 91.16 90.89 79.90 95.93 94.23 89.27 0.9370
MFNet 94.99 92.56 91.92 70.32 95.78 92.98 87.45 0.9346
UASFNet (Ours) 96.50 95.61 95.12 78.84 97.29 95.41 91.52 0.9580
Potsdam Dataset ResNet 95.92 94.24 94.48 60.41 95.34 91.6 86.26 0.9573
UNetformer 96.46 95.19 94.39 61.04 95.51 91.71 86.77 0.9602
CMTFNet 96.42 95.07 94.56 76.14 95.65 94.63 90.55 0.9623
CM-Unet 89.76 94.74 92.8 68.84 96.11 92.43 86.54 0.9381
MFNet 96.32 95.68 94.91 80.94 96.91 95.58 91.96 0.9665
UASFNet (Ours) 96.77 97.64 94.99 83.84 97.43 96.37 93.31 0.9783
Guiyang Dataset ResNet 96.35 96.2 94.46 66.32 96.27 93.28 88.33 0.9468
UNetformer 96.32 96.22 94.56 67.86 96.32 93.56 88.74 0.9476
CMTFNet 95.69 96.07 92.71 64.94 95.73 92.69 87.35 0.9392
CM-Unet 95.65 96.09 93.76 65.71 96.00 92.97 87.80 0.9429
MFNet 95.07 96.24 93.79 66.05 95.91 92.98 87.79 0.9419
UASFNet (Ours) 96.58 96.44 94.62 67.95 96.45 93.65 88.90 0.9493

Across the results from the three datasets, UASFNet consistently maintains leading performance on most evaluation metrics, with performance improvements exhibiting strong consistency. Overall, UASFNet exhibited superior performance in boundary preservation, inter-class separability, and overall accuracy compared to benchmark models, confirming that the proposed shared-feature and adaptive attention fusion mechanism effectively enhances multi-scale representation and spatial consistency.

3.2. Multi-Temporal urban land cover classification results

This subsection presents the multi-temporal urban land cover classification results derived from UASFNet for different observation periods. By analyzing classification maps from multiple years, this section examines the temporal consistency of the model outputs and highlights major patterns of land cover change associated with urban development.

Fig 5 presents the land cover classification results of the Langfang Economic Development Zone for the years 2017 and 2023. In the 2017 classification results, large areas of greenbelt are observed, while built-up areas are relatively scattered and primarily concentrated in the southeastern region. By 2023, built-up land had expanded substantially, with building density increasing notably in the western, northern, and eastern parts of the city. The spatial pattern of urban development exhibited a “center-to-outskirts” expansion trend, indicating a clear acceleration of the urbanization process. Greenbelts became increasingly fragmented, particularly in the northwestern and south-central zones, where large portions of greenbelt were replaced by buildings. The 2023 classification results also reveal a clearer and more continuous road network, reflecting significant improvements in urban infrastructure connectivity.

Fig 5. Multi-temporal urban land cover classification results for the Langfang Economic Development Zone.

Fig 5

The background remote sensing imagery was obtained from the Copernicus Data Space Ecosystem (Copernicus Sentinel data) and processed by the authors. The imagery represents original satellite observations rather than an online basemap. All classification maps, boundaries, annotations, and map elements were generated by the authors.

Fig 6 illustrates the changes in the proportions of roads, buildings, greenbelts, and impervious surfaces across 34 subregions of the Langfang Economic Development Zone between 2017 and 2023. Impervious surface and greenbelt changes were derived by aggregating class-wise pixel areas from the multi-temporal classification maps. Overall, the study area experienced substantial land surface transformation, characterized by intensified imperviousness, a reduction in greenbelt, and a general expansion of built-up areas.

Fig 6. Comparative charts of multiple indicators across the 34 subregions of the Langfang Economic Development Zone (2017–2023).

Fig 6

All map elements were generated by the authors.

In terms of road coverage, most regions showed significant growth, with an average increase of approximately 37%. The most prominent increases were observed in regions 4 (+163.5%), 20 (+160.3%), and 27 (+109.6%). Building coverage exhibited a distinct polarization pattern: core development zones (e.g., regions 12, 8, 6, and 16) experienced substantial growth in built-up area, with the largest increase reaching +864.6% in region 16. In contrast, several old urban or functionally adjusted zones (e.g., regions 24, 25, and 23) showed dramatic declines, with maximum decreases exceeding −90%. Greenbelt coverage declined across all regions, with an average reduction of approximately 40%. The most pronounced decreases occurred in regions 16 (−75.9%) and 21 (−61.5%), indicating that urban expansion has exerted significant pressure on ecological spaces.

Consequently, the impervious surface ratio increased across all subregions. In 2017, imperviousness ranged from 8.5% to 47%, with an average of 25.37%; by 2023, it had risen to 13%–58%, averaging 42.23%, representing an overall increase of approximately 16.86 percentage points. Spatially, the most pronounced increases in imperviousness were concentrated in the western and northern portions of the study area, reflecting a clear trend of urban development expanding outward from the city center.

3.3. Spatial distribution of urban land cover change evaluation indicators

Based on the multi-temporal classification results, this subsection analyzes the spatial distribution of urban land cover change evaluation indicators. The analysis focuses on the spatial patterns and heterogeneity of building, road, and greenbelt changes at the pixel and local scales.

The spatial distribution of urban land cover change evaluation indicators shown in Fig 7 reveals pronounced spatial differentiation characteristics of the Langfang Economic Development Zone during the urbanization process from 2017 to 2023. Overall, changes in buildings, greenbelts, and roads all exhibit distinct spatial clustering and gradient distribution patterns. Building changes are mainly concentrated in the central–southern, southeastern, and western subregions, forming a “high-in-center, low-at-periphery” spatial pattern that reflects the outward expansion of urban construction activities from the core areas. Greenbelt changes show a high degree of spatial coupling with building changes. High-value zones are primarily distributed within the core development areas and southeastern belt regions, indicating that the greenbelt system has been strongly disturbed and fragmented by urban expansion, while the ecological functions in peripheral areas remain relatively stable. The spatial pattern of road changes reflects the outward expansion of the urban transportation network. High-change areas are mainly located in the northwestern, southern, and southeastern corridors, consistent with the direction of new urban development, forming a “multi-core belt-shaped expansion” spatial configuration. Overall, the three indicators demonstrate a synergistic spatial evolution pattern, where urban construction, road expansion, and greenbelt transformation interact to jointly shape an urban surface evolution pattern characterized by intensive development in the core areas and belt-like expansion toward the periphery. These spatially heterogeneous distribution patterns observed in Fig 7 indicate that urban land cover change is not spatially uniform but tends to concentrate in areas with intensive development activities. The co-occurrence and spatial coupling of building, road, and greenbelt disturbances suggest that urban expansion processes exhibit structured spatial organization rather than random dispersion. From an observational perspective, such spatial patterns provide useful spatial references for identifying development hotspots and relatively stable zones, which may inform future urban planning and land management decisions in terms of spatial prioritization and differentiated control.

Fig 7. Spatial distribution of individual urban land cover disturbance indicators.

Fig 7

All thematic maps were produced by the authors.

3.4. Analysis of urban land cover change evaluation results

This subsection provides a comprehensive analysis of urban land cover change evaluation results by integrating multiple indicators into an overall disturbance assessment. The analysis is conducted at both pixel and regional levels to examine how localized changes aggregate into broader urban transformation patterns.

To determine the weights of urban Land Cover changes indicators, the AHP was employed to establish the relative importance among indicators. The judgment matrix in Table 3 was constructed using expert scoring and subjected to a consistency test, yielding a consistency index (CI) of 0.0193, a random consistency index (RI) of 0.52, and a consistency ratio (CR) of 0.037, which is less than 0.1. This result indicates that the constructed judgment matrix meets the consistency requirements. Consequently, the importance ranking of the indicators is as follows: building disturbance (0.637), road disturbance (0.258), and greenbelt disturbance (0.105). The pairwise comparison values were obtained through independent expert scoring following the standard AHP nine-point scale. The final judgment matrix represents the aggregated expert consensus and was verified using the consistency ratio (CR), ensuring logical consistency of the weighting scheme.

Table 3. Expert-based AHP judgment matrix for urban land-cover disturbance indicators.

Evaluation index Building disturbance Road disturbance Greenbelt disturbance
Building disturbance 1 5 3
Road disturbance 1/5 1 1/3
Greenbelt disturbance 1/3 3 1

The overall disturbance distribution of the study area, calculated using Equation (10), is illustrated in Fig 8. The study area exhibits pronounced spatial heterogeneity and a distinct hierarchical pattern. High-disturbance zones are sporadically clustered in the southern, western, and southeastern construction hotspots, indicating a significant intensification of human activities at the local scale. Medium-disturbance zones are primarily located in transitional belts between the urban core and peripheral areas, corresponding to regions of active functional adjustment and spatial expansion. In contrast, low-disturbance zones are widely distributed across the central-northern and peripheral regions of the city. Overall, the disturbance intensity displays a spatial gradient from peripheral stability to core intensification, reflecting a typical urbanization development pattern. At the pixel scale, high-disturbance pixels form discrete patches within densely built-up areas, medium-disturbance pixels are distributed mainly along urban boundaries, and low-disturbance pixels constitute a continuous and stable background—collectively revealing a decreasing gradient from the urban core to the periphery. At the regional scale, the disturbance levels exhibit spatial clustering patterns: low-disturbance zones are concentrated in ecologically favorable northern and northwestern areas, while high-disturbance zones align with urban expansion corridors, forming several disturbance cores that highlight the directional nature of urbanization.

Fig 8. Spatial distribution of comprehensive urban land cover change intensity.

Fig 8

All visual elements were generated by the authors.

Table 4 summarizes the degree of land cover change within the study area at both the pixel level and the regional scale. Low-disturbance pixels cover an area of 4,800.75 ha, accounting for 70.15% of the total area, while medium- and high-disturbance pixels account for 22.78% and 7.07%, respectively. This indicates that, although the overall degree of change is relatively low, it is spatially concentrated in specific localities. At the regional scale, high-disturbance zones constitute the largest proportion (38.24%) of all regions, suggesting that although high-disturbance pixels are spatially limited, they produce a pronounced spatial aggregation effect at the regional level.

Table 4. Land Cover Change Intensity at the Pixel and Regional Levels within the Study Area.

Disturbance level Low Medium High
Pixel layer Area(ha) 4800.75 1559.03 483.50
Scale(%) 70.15 22.78 7.07
Regional Layer Quantity 11 10 13
Scale(%) 32.35 29.41 38.24

4. Discussion

4.1. Advantage analysis

This study develops a data-driven fine-scale urban land cover change assessment framework based on UASFNet, aiming to improve both recognition accuracy and interpretability in complex urban environments. Overall, the proposed framework demonstrates clear advantages in balancing segmentation performance, computational efficiency, and spatial coherence, which are critical for large-area and multi-temporal urban monitoring tasks.

From the perspective of model efficiency and performance balance, UASFNet demonstrates clear advantages over existing deep learning–based segmentation approaches. As summarized in Table 5, the proposed model achieves superior segmentation accuracy without relying on excessively large parameter sizes or computational costs. This balanced performance can be attributed to the Urban Adaptive Shared-feature Attention (UASF) module, which promotes effective semantic information sharing across multiple land cover categories while suppressing redundant feature propagation. As a result, UASFNet is able to extract more compact and discriminative representations, improving recognition performance under comparable computational conditions. All models were trained and evaluated on the same datasets and data partitions to ensure a controlled and fair comparison across different urban scenarios. The moderate increase in computational cost is justified by the consistent and non-trivial accuracy gains achieved across all datasets, which are critical for reliable large-area and multi-temporal urban change analysis.

Table 5. Model efficiency and performance comparative analysis.

Method FLOPs (G) Param.(M) mIoU (%)
ResNet 23.5 47.43 84.41
UNetformer 30.17 61.59 87.34
CMTFNet 76.41 96.14 90.30
CM-Unet 33.26 64.02 89.27
MFNet 54.39 67.72 87.45
UASFNet (Ours) 30.925 79.88 91.52

The performance advantages of UASFNet can be attributed to the Urban Adaptive Shared-feature Attention (UASF) mechanism. This mechanism is designed to emphasize semantic and structural patterns that are consistently informative across different urban contexts, such as building boundaries, road connectivity, and greenbelt textures. Because the attention weights are learned in a data-driven manner rather than being manually specified, the mechanism can adapt to variations in urban morphology, density, and spatial configuration across different study areas. Nevertheless, its adaptability may be constrained under conditions involving pronounced domain shifts, such as cities with markedly different architectural styles, severe shadow effects, or highly heterogeneous surface materials. In such cases, incorporating additional domain-specific supervision or multi-source auxiliary information may further enhance robustness, which is considered a direction for future work.

Beyond efficiency considerations, the spatial behavior of the proposed framework further highlights its practical advantages. The large-area recognition results from the Langfang dataset (Fig 9) illustrate that UASFNet maintains stable and coherent classification performance across extensive urban regions. Typical land cover transitions between 2017 and 2023, including urban expansion, road network evolution, and greenbelt degradation, are consistently captured. In particular, the clear delineation of impervious surfaces and linear structures indicates that the model is effective not only at the pixel level but also in preserving meaningful spatial patterns, which is essential for reliable urban change analysis.

Fig 9. Large range of recognition effect comparison diagram in Langfang.

Fig 9

Background imagery was obtained from the Copernicus Data Space Ecosystem(Copernicus Sentinel data) and processed by the authors for visualization and comparison. The imagery corresponds to original satellite data and is not derived from any third-party online basemap or proprietary map service. All classification results, overlays, and map elements were generated by the authors.

An additional strength of the proposed framework lies in the integration of fine-grained land cover classification with an urban land cover change evaluation scheme. By coupling high-precision pixel-level outputs with an AHP-based disturbance assessment that incorporates weighted indicators of buildings, roads, and greenbelts, the framework establishes a systematic link between classification performance and change interpretation. In this context, Table 5 and Fig 9 serve as supporting evidence demonstrating how improvements in model efficiency and spatial recognition stability contribute to more reliable inputs for quantitative urban land cover change assessment.

Overall, the advantages of the proposed framework extend beyond incremental accuracy gains. The combination of balanced computational efficiency, spatially coherent recognition, and assessment-oriented design suggests its potential applicability in scenarios such as smart city platforms, automated urban monitoring, and evidence-based spatial planning.

4.2. Implication and further application

The proposed framework provides meaningful implications for urban land cover change analysis and spatial governance. By enabling fine-grained land cover classification and multi-scale change assessment, it facilitates a detailed characterization of urban expansion dynamics, infrastructure development, and greenbelt transformation. Such spatially explicit information supports a more refined evaluation of land use intensity and ecological pressure during rapid urbanization processes. The integration of weighted disturbance indicators further enhances the interpretability of urban land cover change assessment. By synthesizing building, road, and greenbelt disturbances into a unified evaluation framework, the proposed approach enables consistent comparison across regions and time periods. This integrated perspective is particularly suitable for complex urban environments, where multiple land cover transitions often occur simultaneously and interactively rather than in isolation. Nevertheless, the practical application of the proposed framework may be influenced by limitations in the underlying data, such as inconsistencies in image acquisition time and variations in spatial resolution across different datasets, which could introduce uncertainty in long-term change interpretation. Addressing these data-related constraints through improved data consistency and preprocessing remains an important direction for future research. Overall, the proposed framework demonstrates strong potential for long-term urban land cover monitoring and automated change assessment. Its data-driven and modular design allows flexible adaptation to different urban contexts and data sources, providing a scalable methodological foundation for future studies on urban dynamics and spatial governance.

5. Conclusion

This study developed a data-driven fine-scale assessment framework for urban land cover change based on UASFNet, integrating deep learning–based multi-object recognition with Analytic Hierarchy Process (AHP) weighting to systematically characterize the spatiotemporal evolution of urban surface systems. Supported by multi-source high-resolution remote sensing data, the proposed framework combines high-precision classification with quantitative disturbance evaluation, providing a novel technical pathway for urban spatial governance and ecological optimization. The main conclusions are as follows:

  • (1) The proposed UASFNet model introduces an Urban Adaptive Shared-feature Attention module that enables collaborative modeling of semantic and structural features, maintaining boundary continuity and spatial consistency in the recognition of buildings, roads, and greenbelts.

  • (2) Across three representative datasets, UASFNet achieved mean mIoU values of 91.52%, 93.31%, and 88.90%, respectively—significantly outperforming existing representative models in the recognition accuracy of buildings, greenbelts, and roads—demonstrating strong cross-domain generalization capability.

  • (3) Spatial analysis indicates that during 2017–2023, high-disturbance areas in the Langfang Economic Development Zone were primarily concentrated in the southern, western, and southeastern regions, forming an urbanization gradient pattern characterized by intensive core development and peripheral belt-like expansion.

  • (4) The framework achieves high recognition accuracy with low computational complexity while maintaining strong interpretability and scalability, making it suitable for applications in urban impervious surface monitoring, land-use planning evaluation, and ecological resilience analysis.

Supporting information

S1 Table. Urban Adaptive Shared-feature Attention (UASF) Module.

(DOCX)

pone.0342350.s001.docx (17.3KB, docx)

Data Availability

Due to data use agreements and institutional restrictions, the datasets generated and/or analyzed during the current study are not publicly available. Data access requests may be directed to the Data Management Office of Tianjin Huashui Engineering Consulting Co., Ltd. (contact email: tjhuashui_data@163.com), which is independent of the authors and responsible for coordinating data access and ensuring long-term data stewardship on behalf of the institution. Access will be considered upon reasonable request and subject to institutional approval.

Funding Statement

The research was supported by Tianjin major science and technology projects (24ZXKJGX00070) awarded to Q.L., National Key R&D Program of China (2022YFB4200700) awarded to Q.L., and Research and Application of Key Technologies for Intelligent Mechanical Manufacturing Production Lines (23ZGZNGX00020) awarded to Q.L.

References

  • 1.Risser MD, Ombadi M, Wehner MF. Granger causal inference for climate change attribution. Environ Res: Clim. 2025;4(2):22001. [Google Scholar]
  • 2.Quilcaille Y, Gudmundsson L, Schumacher DL, Gasser T, Heede R, Heri C, et al. Systematic attribution of heatwaves to the emissions of carbon majors. Nature. 2025;645(8080):392–8. doi: 10.1038/s41586-025-09450-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Li C, Liu J, Du F, Zwiers FW, Feng G. Increasing certainty in projected local extreme precipitation change. Nat Commun. 2025;16(1):850. doi: 10.1038/s41467-025-56235-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Faranda D, Messori G, Coppola E, Alberti T, Vrac M, Pons F, et al. ClimaMeter: Contextualizing extreme weather in a changing climate. Weather Clim Dynam. 2024;5(3):959–83. doi: 10.5194/wcd-5-959-2024 [DOI] [Google Scholar]
  • 5.Lee H, Calvin K, Dasgupta D, Krinner G, Mukherji A, Thorne P, et al. Climate change 2023 synthesis report summary for policymakers. Synth. Rep. IPCC. 2024. [Google Scholar]
  • 6.Zhu D, Pfahl S, Knutti R, Fischer EM. Future extreme precipitation may shift to colder seasons in northern mid- and high latitudes. Commun Earth Environ. 2025;6(1). doi: 10.1038/s43247-025-02651-0 [DOI] [Google Scholar]
  • 7.Fu Y, Wu Q. Recent emerging shifts in precipitation intensity and frequency in the global tropics observed by satellite precipitation data sets. Geophys Res Lett. 2024;51(15):e2023GL107916. [Google Scholar]
  • 8.Najah FT, Abdullah SFK, Abdulkareem TA. Urban land use changes: effect of green urban spaces transformation on urban heat islands in Baghdad. Alex Eng J. 2023;66:555–71. [Google Scholar]
  • 9.Li Y, Yang T, Zhao G, Ma C, Yan Y, Xu Y. A systematic review of studies involving canopy layer urban heat island: monitoring and associated factors. Ecol Indic. 2024;158:111424. [Google Scholar]
  • 10.Zheng C, Yang W, Jiang X, Lian J, Hu D, Yan X, et al. A novel integrated Urban flood risk assessment approach coupling GeoDetector-Dematel and clustering method. J Environ Manage. 2024;354:120308. doi: 10.1016/j.jenvman.2024.120308 [DOI] [PubMed] [Google Scholar]
  • 11.Yang W, Zheng C, Jiang X, Wang H, Lian J, Hu D. Study on urban flood simulation based on a novel model of swtm coupling d8 flow direction and backflow effect. J Hydrol (Amst). 2023;621:129608. [Google Scholar]
  • 12.Li D, Hou J, Zhou Q, Lyu J, Pan Z, Wang T, et al. Urban rainfall-runoff flooding response for development activities in new urbanized areas based on a novel distributed coupled model. Urban Climate. 2023;51:101628. doi: 10.1016/j.uclim.2023.101628 [DOI] [Google Scholar]
  • 13.Un-Habitat. World cities report 2022: envisaging the future of cities. UN. 2022.
  • 14.Qiao W, Huang X. Assessment the urbanization sustainability and its driving factors in chinese urban agglomerations: an urban land expansion-urban population dynamics perspective. J Clean Prod. 2024;449:141562. [Google Scholar]
  • 15.Guan S, Chen Y, Wang T, Hu H. Mitigating urban heat island through urban-rural transition zone landscape configuration: Evaluation based on an interpretable ensemble machine learning framework. Sustainable Cities and Society. 2025;123:106272. doi: 10.1016/j.scs.2025.106272 [DOI] [Google Scholar]
  • 16.Yang H, Wu Z, Dawson RJ, Barr S, Ford A, Li Y. Quantifying surface urban heat island variations and patterns: Comparison of two cities in three-stage dynamic rural–urban transition. Sustainable Cities and Society. 2024;109:105538. doi: 10.1016/j.scs.2024.105538 [DOI] [Google Scholar]
  • 17.Gao J, Qu L, Wu W. Spatial-temporal variation of surface temperature and cold island network construction in the Yangtze River delta urban agglomeration: Perspectives from current and future scenarios. Sustainable Cities and Society. 2025;126:106396. doi: 10.1016/j.scs.2025.106396 [DOI] [Google Scholar]
  • 18.Xiao W, Ruan L, Wang K, Xu S, Yue W, He T, et al. Measuring three-dimension urban expansion using multi-source data and change detection algorithm: A case study of Shanghai. Cities. 2025;158:105682. doi: 10.1016/j.cities.2024.105682 [DOI] [Google Scholar]
  • 19.Chakraborty TC, Venter ZS, Demuzere M, Zhan W, Gao J, Zhao L, et al. Large disagreements in estimates of urban land across scales and their implications. Nat Commun. 2024;15(1):9165. doi: 10.1038/s41467-024-52241-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Shang S, Cui T, Wang Y, Gao Q, Liu Y. Dynamic variation and driving mechanisms of land use change from 1980 to 2020 in the lower reaches of the yangtze river, china. Front Environ Sci. 2024;11:1335624. [Google Scholar]
  • 21.Tang X, Qu W, Zhang J, Li G, Zhang X, Yang S, et al. Driving mechanism of urban expansion in the Bohai Rim urban agglomeration from the perspective of spatiotemporal dynamic analysis. Sci Rep. 2024;14(1):31191. doi: 10.1038/s41598-024-82436-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Wang Q, Zhang P, Chang Y, Li G, Chen Z, Zhang X, et al. Landscape pattern evolution and ecological risk assessment of the yellow river basin based on optimal scale. Ecol Indic. 2024;158:111381. [Google Scholar]
  • 23.Ding N, Zhang Y, Wang Y, Chen L, Qin K, Yang X. Effect of landscape pattern of urban surface evapotranspiration on land surface temperature. Urban Climate. 2023;49:101540. doi: 10.1016/j.uclim.2023.101540 [DOI] [Google Scholar]
  • 24.Woodman TL, Alexander P, Burslem DFRP, Travis JMJ, Winkler K, Eigenbrod F. Global assessment of landscape pattern changes from 1992 to 2020. Landsc Ecol. 2025;40(11):196. doi: 10.1007/s10980-025-02210-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Zhou Y, Cao W, Zhou J. Land-Use Transfer and Its Ecological Effects in Rapidly Urbanizing Areas: A Case Study of Nanjing, China. Sustainability. 2024;16(23):10615. doi: 10.3390/su162310615 [DOI] [Google Scholar]
  • 26.Qin R, Liu T. A Review of landcover classification with very-high resolution remotely sensed optical images—analysis unit, model scalability and transferability. Remote Sensing. 2022;14(3):646. doi: 10.3390/rs14030646 [DOI] [Google Scholar]
  • 27.Ienco D, Interdonato R, Gaetano R, Ho Tong Minh D. Combining Sentinel-1 and Sentinel-2 Satellite Image Time Series for land cover mapping via a multi-source deep learning architecture. ISPRS Journal of Photogrammetry and Remote Sensing. 2019;158:11–22. doi: 10.1016/j.isprsjprs.2019.09.016 [DOI] [Google Scholar]
  • 28.Saidi S, Idbraim S, Karmoude Y, Masse A, Arbelo M. Deep-learning for change detection using multi-modal fusion of remote sensing images: A review. Remote Sensing. 2024;16(20):3852. doi: 10.3390/rs16203852 [DOI] [Google Scholar]
  • 29.Tong X-Y, Xia G-S, Lu Q, Shen H, Li S, You S, et al. Land-cover classification with high-resolution remote sensing images using transferable deep models. Remote Sensing of Environment. 2020;237:111322. doi: 10.1016/j.rse.2019.111322 [DOI] [Google Scholar]
  • 30.Li Z, Chen B, Wu S, Su M, Chen JM, Xu B. Deep learning for urban land use category classification: A review and experimental assessment. Remote Sensing of Environment. 2024;311:114290. doi: 10.1016/j.rse.2024.114290 [DOI] [Google Scholar]
  • 31.Irfan A, Li Y, E X, Sun G. Land use and land cover classification with deep learning-based fusion of SAR and optical data. Remote Sensing. 2025;17(7):1298. doi: 10.3390/rs17071298 [DOI] [Google Scholar]
  • 32.Ronneberger O, Fischer P, Brox T. U-net: convolutional networks for biomedical image segmentation. Med. Image Comput. Comput. Assist. Interv. (MICCAI), 2015. 234–41. [Google Scholar]
  • 33.Chen L-C, Papandreou G, Kokkinos I, Murphy K, Yuille AL. DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans Pattern Anal Mach Intell. 2018;40(4):834–48. doi: 10.1109/TPAMI.2017.2699184 [DOI] [PubMed] [Google Scholar]
  • 34.Xie E, Wang W, Yu Z, Anandkumar A, Alvarez JM, Luo P. Segformer: Simple and efficient design for semantic segmentation with transformers. Adv Neural Inf Process Syst. 2021;34:12077–90. [Google Scholar]
  • 35.Liu M, Dan J, Lu Z, Yu Y, Li Y, Li X. Cm-unet: hybrid cnn-mamba unet for remote sensing image semantic segmentation. Arxiv Preprint. 2024. doi: 2405.10530 [Google Scholar]
  • 36.Liu X, Goh KM. Resnet: enabling deep convolutional neural networks through residual learning. Arxiv Preprint. 2025. doi: 2510.24036 [Google Scholar]
  • 37.Wang L, Li R, Zhang C, Fang S, Duan C, Meng X, et al. UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery. ISPRS Journal of Photogrammetry and Remote Sensing. 2022;190:196–214. doi: 10.1016/j.isprsjprs.2022.06.008 [DOI] [Google Scholar]
  • 38.Wu H, Huang P, Zhang M, Tang W, Yu X. CMTFNet: CNN and multiscale transformer fusion network for remote-sensing image semantic segmentation. IEEE Trans Geosci Remote Sensing. 2023;61:1–12. doi: 10.1109/tgrs.2023.3314641 [DOI] [Google Scholar]
  • 39.Ma X, Zhang X, Pun M, Huang B. A unified framework with multimodal fine-tuning for remote sensing semantic segmentation. IEEE Trans Geosci Remote Sens. 2025. [Google Scholar]
  • 40.Zhong J, Hao L, Sajinkumar KS, Yan D. Changes of ecological vulnerability in areas with different urban expansion patterns- A case study in the Yanhe river basin, China. J Environ Manage. 2024;370:122607. doi: 10.1016/j.jenvman.2024.122607 [DOI] [PubMed] [Google Scholar]
  • 41.Luo M, Wang J, Li J, Sha J, He S, Liu L, et al. The response of ecological security to land use change in east and west subtropical China. PLoS One. 2023;18(11):e0294462. doi: 10.1371/journal.pone.0294462 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Liu Y, Zhao C, Liu X, Chang Y, Wang H, Yang J, et al. The multi-dimensional perspective of ecological security evaluation and drive mechanism for baishuijiang national nature reserve, china. Ecol Indic. 2021;132:108295. [Google Scholar]
  • 43.Karimpour MH, Najafi A, Ghaderi M. Application of fuzzy ahp method to iocg prospectivity mapping: A case study in taherabad prospecting area, eastern iran. Int J Appl Earth Obs Geoinf. 2014. [Google Scholar]
  • 44.Baemez M. Strategic multi-criteria framework for nuclear plant siting: integrating ahp, ewm, and game theory with gis. Prog Nucl Energy. 2025;188. [Google Scholar]
  • 45.Wang C, Qiu X, Shen H, Rao C. Evaluation mechanism of urban green competitiveness via a gray fuzzy comprehensive evaluation model. Ecol Indic. 2025;175. [Google Scholar]
  • 46.Wang X, Zhou L, Lopez-Carr D, Song Y, Gao H, Che T. Urban grey-green scales: A new perspective for assessing dynamic urban spatial trade-offs. Int J Appl Earth Obs Geoinf. 2024. [Google Scholar]
  • 47.Wang Z, Yi J, Chen A, Chen L, Lin H, Xu K. Accurate semantic segmentation of very high-resolution remote sensing images considering feature state sequences: From benchmark datasets to urban applications. ISPRS Journal of Photogrammetry and Remote Sensing. 2025;220:824–40. doi: 10.1016/j.isprsjprs.2025.01.017 [DOI] [Google Scholar]
  • 48.Li M, Long J, Stein A, Wang X. Using a semantic edge-aware multi-task neural network to delineate agricultural parcels from remote sensing images. ISPRS Journal of Photogrammetry and Remote Sensing. 2023;200:24–40. doi: 10.1016/j.isprsjprs.2023.04.019 [DOI] [Google Scholar]
  • 49.Wu B, Gu Z, Zhang W, Fu Q, Zeng M, Li A. Investigator accuracy: A center-weighted metric for evaluating the location accuracy of image segments in land cover classification. International Journal of Applied Earth Observation and Geoinformation. 2023;122:103402. doi: 10.1016/j.jag.2023.103402 [DOI] [Google Scholar]
  • 50.Liu Q, Zhao F, Zheng C, Lian J, Da Z, Duan M. Optimization framework for urban flood mitigation strategies considering collaborative drainage mechanisms. Water Research X. 2026;30:100474. doi: 10.1016/j.wroa.2025.100474 [DOI] [Google Scholar]

Decision Letter 0

Chong Xu

11 Dec 2025

Dear Dr. Zheng,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jan 25 2026 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols .

We look forward to receiving your revised manuscript.

Kind regards,

Chong Xu

Academic Editor

PLOS One

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified (1) whether consent was informed and (2) what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

If you are reporting a retrospective study of medical records or archived samples, please ensure that you have discussed whether all data were fully anonymized before you accessed them and/or whether the IRB or ethics committee waived the requirement for informed consent. If patients provided informed written consent to have data from their medical records used in research, please include this information.

3. Please note that PLOS One has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

4. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match.

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section.

5. Thank you for stating the following in the Competing Interests/Financial Disclosure section:

The authors have declared that no competing interests exist.

We note that one or more of the authors are employed by a commercial company: Tianjin Huashui Engineering Consulting Co., Ltd

a. Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form.

Please also include the following statement within your amended Funding Statement.

“The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.”

If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement.

b. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc.

Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests) . If this adherence statement is not accurate and there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf.

6. In this instance it seems there may be acceptable restrictions in place that prevent the public sharing of your minimal data. However, in line with our goal of ensuring long-term data availability to all interested researchers, PLOS’ Data Policy states that authors cannot be the sole named individuals responsible for ensuring data access (http://journals.plos.org/plosone/s/data-availability#loc-acceptable-data-sharing-methods).

Data requests to a non-author institutional point of contact, such as a data access or ethics committee, helps guarantee long term stability and availability of data. Providing interested researchers with a durable point of contact ensures data will be accessible even if an author changes email addresses, institutions, or becomes unavailable to answer requests.

Before we proceed with your manuscript, please also provide non-author contact information (phone/email/hyperlink) for a data access committee, ethics committee, or other institutional body to which data requests may be sent. If no institutional body is available to respond to requests for your minimal data, please consider if there any institutional representatives who did not collaborate in the study, and are not listed as authors on the manuscript, who would be able to hold the data and respond to external requests for data access? If so, please provide their contact information (i.e., email address). Please also provide details on how you will ensure persistent or long-term data storage and availability.

7. We note that Figures 2, 3, 4, 5, 6, 7, 8, and 9 in your submission contain map/satellite images which may be copyrighted. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (a) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (b) remove the figures from your submission:

a. You may seek permission from the original copyright holder of Figures 2, 3, 4, 5, 6, 7, 8, and 9 to publish the content specifically under the CC BY 4.0 license.

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an ""Other"" file with your submission.

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

b. If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

The following resources for replacing copyrighted map figures may be helpful:

USGS National Map Viewer (public domain): http://viewer.nationalmap.gov/viewer/

The Gateway to Astronaut Photography of Earth (public domain): http://eol.jsc.nasa.gov/sseop/clickmap/

Maps at the CIA (public domain): https://www.cia.gov/library/publications/the-world-factbook/index.html and https://www.cia.gov/library/publications/cia-maps-publications/index.html

NASA Earth Observatory (public domain): http://earthobservatory.nasa.gov/

Landsat: http://landsat.visibleearth.nasa.gov/

USGS EROS (Earth Resources Observatory and Science (EROS) Center) (public domain): http://eros.usgs.gov/#

Natural Earth (public domain): http://www.naturalearthdata.com/

8. If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Partly

Reviewer #2: Yes

Reviewer #3: Partly

Reviewer #4: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: I Don't Know

Reviewer #2: I Don't Know

Reviewer #3: No

Reviewer #4: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy

Reviewer #1: No

Reviewer #2: Yes

Reviewer #3: No

Reviewer #4: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: Yes

**********

Reviewer #1: 1. Expand the study to include cities with different topographies, such as mountainous cities like Guiyang.

2. On line 256, the ratio of training to validation data is not specified. Please clarify this key experimental detail.

3. For large-area identification, even with high-resolution imagery, errors are inevitable. However, the paper does not address the error characteristics of the reference data, which is critical for evaluating the reliability of the proposed method.

4. The discussion section requires revision. Since five algorithms are mentioned, it would greatly enhance the persuasiveness of your work to compare your algorithm with these existing methods using the same study areas reported in their respective papers.

Reviewer #2: Please see the attached file. Address all of my comments and gicve response to each comment. Implementing these revisions can enhance the paper's clarity, engagement, and overall contribution to the field of urban land cover change assessment.

Reviewer #3: Manuscript Title: A Fine-grained Evaluation Framework for Urban Land Cover Change Based on UASFNet Data-driven Approach

he manuscript aims to create a data-driven framework for fine-scale urban land cover change assessment using an Urban Adaptive Shared-feature Attention Network (UASFNet). The intended contribution is to support urban spatial governance and ecological optimization by revealing the dynamic evolution of surface systems during urbanization, and this framework has potential value for scientists, planners, and environmental managers who rely on accurate change detection to inform policy and restoration decisions. However, in its current form, the manuscript does not sufficiently articulate testable scientific questions, nor does it clarify how the proposed framework is evaluated relative to existing methods or how each methodological component contributes to measurable improvements. In addition, while the abstract presents what it calls “experimental results,” the manuscript contains no dedicated Results section—only a Case Study—which makes it difficult to understand how findings were derived or how performance was objectively assessed. In my opinion, the Case Study should be removed, with its material reused so that its computational details are incorporated into the Methods while its model comparisons and land-cover disturbance findings are placed into a properly structured Results section. Strengthening these elements collectively would improve clarity, reproducibility, and the interpretability and impact of the work.

Major Comments by Section

Title:

The acronym UASFNet does not convey meaning to most readers and may obscure the paper’s intent. A more reader-focused alternative would be: “A Fine-grained Evaluation Framework for Urban Land Cover Change Based on Feature Monitoring with Remotely Sensed Imagery.” This revision communicates the achievement rather than emphasizing a technical acronym unfamiliar to the journal’s broad readership.

Abstract:

The abstract outlines the general workflow but omits crucial information about the type of remote sensing imagery required. Given the diversity of sensors, platforms, pixel sizes, and spectral characteristics, this omission limits interpretability and reproducibility.

The following components from the Case Study section should be incorporated into the abstract to clarify imagery requirements: Google Earth RGB imagery, 0.6 m resolution (Langfang dataset); Ultra–high-resolution TOP imagery, 5 cm GSD, multiple spectral combinations (Potsdam dataset); Standardized NAIP imagery for additional regional evaluation. Adding this information makes clear that the approach requires sub-meter to centimeter-scale RGB (and optionally IR) imagery characteristic of urban remote sensing segmentation tasks.

Introduction:

The final paragraph presents the study’s intended academic contributions, but these are not framed as testable scientific questions. For example, the introduction could explicitly pose questions such as:

Does UASFNet significantly outperform established semantic segmentation methods (e.g., ResNet, UNetformer, MFNet) in boundary preservation and inter-class separability for urban land cover types?

Does integrating AHP-derived indicator weights meaningfully improve the spatial coherence or interpretability of disturbance-level assessments?

Can combined pixel-level and region-level analyses provide a measurably more accurate or actionable representation of urban land cover change than existing pixel-only approaches?

Explicit questions would help readers understand what hypotheses the study is designed to test and how success is measured.

Methods:

sub-heading UASFNet Mathematical Formulation- The core equations for the UASF module—semantic–structural consistency kernel, gradient-domain formulation, and global gating operator—are insufficiently explained for a general audience. The manuscript introduces functions (ϕ(X), ϕ(∇X), Θ(X), A(X)) without an illustrative example.

The authors should include:

A worked example: Starting from a raw image tile, show how features, gradients, embeddings, and fused representations are extracted and transformed through each stage.

A schematic diagram showing how semantic and structural pathways merge and how gating weights are computed.

Code availability: Since PLOS ONE emphasizes reproducibility, the authors should provide code or pseudocode to clarify the transformations embodied in Equations (2) and (3).

Without such explanation, the method is not reproducible or interpretable by most readers.

sub-heading Evaluation Framework - The manuscript states that “multiple metrics” are used—mIoU, accuracy, F1, Kappa, etc.—but does not clearly tie these metrics to specific stated objectives (boundary preservation, inter-class separability, semantic consistency). The connection between objectives and metrics must be explicit. Similarly, the land cover change evaluation framework (using AHP to weight building, road, and greenbelt disturbance indices) would benefit from: A clear workflow diagram; A rationale for indicator selection; An explanation of how errors in land cover classification propagate into disturbance-level scoring.

Case Study Section

1. Section 3.2: “Experimental Detail”. This subsection belongs more naturally in Methods under a heading such as: “Computational Environment and Data Augmentation Strategy.”

The description of the GPU, Python environment, and augmentation techniques is overly brief and should be expanded with parameter values and more details for helping others evaluate, test, replicate, improve your work. There are standard components of deep learning methodology and are necessary for reproducibility.

2. Model Performance (currently part of Case Study)

This section actually constitutes Results, not a case study. The models used for comparison—ResNet, UNetformer, CMTFNet, CM-Unet, MFNet—should be introduced earlier in the Methods section under: “Benchmark Models and Comparative Evaluation Design.” Currently, the paper reads as though the evaluation details appear only after results, which makes the structure confusing.

3. Case Study Results: Land cover change interpretation

The manuscript presents tables of pixel-level and region-level disturbance statistics and interprets the spatial implications, but this should be more clearly labeled as Results, not embedded inside the case study narrative. The interpretation itself is appropriate and meaningful, showing both fine-scale and aggregated change patterns.

Objectivity of Performance Evaluation

Strengths: The manuscript compares UASFNet to five baseline models across three datasets (Langfang, Potsdam, NAIP). Metrics (mIoU, F1, Kappa) are standard for segmentation research. Improvements reported are consistent (≈1–3% advantage across categories).

Weaknesses: The evaluation is not explicitly tied to the stated objectives (e.g., boundary preservation, structural consistency). No statistical tests (e.g., paired t-tests on pixel accuracy, variance analyses across tiles) are reported to support claims of significance. The study does not examine failure cases, computational complexity trade-offs, or robustness to noise, which would strengthen claims of generalizability. The evaluation does not address whether the AHP-based disturbance metrics produce more accurate or actionable land cover change assessments than alternative weighting methods.

Conclusion on Objectivity: The manuscript does provide a comparative performance assessment, but it is only partially objective and not fully aligned with the evaluation goals described in the Methods section. Strengthening the study would require tighter linkage between model objectives and metrics, inclusion of statistical validation, and clearer articulation of how the proposed framework improves land cover change interpretation beyond classification accuracy alone.

Reviewer #4: Overall Comments

This paper presents a data mining-based method for identifying urban area changes from remote sensing images. The manuscript is generally well-written. The proposed method has been implemented on several areas and its performance has been compared with several common methods. However, revisions are necessary before publication.

Major Comments

1. Introduction/Literature Review: While a general categorization of research in the field is provided, a more focused review of studies directly related to the presented methodology is needed. This would establish a clearer baseline and context, allowing readers to better assess the novelty and contribution of the proposed method relative to existing work.

2. Research Objectives: The objectives stated at the end of the introduction should more explicitly highlight and emphasize the novel aspects of the proposed approach compared to previous studies.

3. Methodology Description (Clarity & Flow):

o Lines 111-113 / Overall Procedure: The description of the proposed method is somewhat vague. The specific steps of the pipeline should be clearly delineated. For each stage, it should be explicitly stated what processing is applied to the input, what the output is, and why this output is fed into the next stage. This lack of clarity is also reflected in the provided flowchart, making it difficult for the reader to follow the logical flow of the method.

o Justification of Weighting: The rationale and importance of using a weighting scheme (Lines 111-113, Line 180) are not sufficiently explained. A clear justification is required: Why is weighting crucial for identifying the target changes in this specific context? What problem does it solve that non-weighted approaches do not?

4. Methodology Subsections & Citations: The subsections under the methodology present various formulas and relationships without any citations to prior work. It appears that all elements were developed by the authors, which is unusual for a standard data mining/remote sensing workflow. Relevant sources for established techniques should be cited.

5. Subsection Structure (Lines 234-236): The inclusion of several very brief subsections with minimal content is not effective. Each subsection should contain substantive material related to its title.

Specific Comments on Figures and Tables

• Figure 1: The caption should be more detailed and specific, describing what the figure illustrates. This applies to all figure and table captions throughout the manuscript.

• Figure 3: The defined legend for this figure appears to be incorrect. Please verify and correct it.

• Table 2: It is unclear why all values for the proposed method are highlighted with a different color. Typically, only the best-performing values (e.g., highest accuracy) should be bolded or highlighted for clear comparison.

• Table 3: The methodology for generating the rankings is not described. How were these rankings obtained? Was a single expert used, or multiple experts? If multiple, how was consensus reached? This information is essential for assessing the validity of the comparison.

Recommendation

The paper addresses an interesting topic but requires significant revisions, primarily to improve the clarity, justification, and context of the proposed methodology, and to ensure rigorous presentation of results and comparisons. Addressing these points will strengthen the manuscript considerably.

**********

what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy

Reviewer #1: No

Reviewer #2: Yes: Muhammad Nasar AHMAD

Reviewer #3: No

Reviewer #4: Yes: Milad Janalipour

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

To ensure your figures meet our technical requirements, please review our figure guidelines: https://journals.plos.org/plosone/s/figures

You may also use PLOS’s free figure tool, NAAS, to help you prepare publication quality figures: https://journals.plos.org/plosone/s/figures#loc-tools-for-figure-preparation.

NAAS will assess whether your figures meet our technical requirements by comparing each figure against our figure specifications.

Attachment

Submitted filename: PONE-D-25-61989_Report.docx

pone.0342350.s002.docx (885KB, docx)
PLoS One. 2026 Feb 23;21(2):e0342350. doi: 10.1371/journal.pone.0342350.r002

Author response to Decision Letter 1


12 Jan 2026

Thanks for all comments and suggestions of the reviewers. They are all significant for our research work and paper writing. We have a detailed revision of this paper and changes are marked in yellow. The presentation in the revised paper has been improved. Now we will response to the comments as follows.

Response to reviewer #1:

(1)Q: Expand the study to include cities with different topographies, such as mountainous cities like Guiyang.

A: We sincerely appreciate this valuable suggestion. We agree that including cities with diverse topographies is crucial for validating the robustness and generalization ability of our method. Following your advice, we have expanded our study to include the Guiyang Dataset, which represents a typical mountainous urban environment. This addition ensures our study covers a broader range of geographical scenarios.

The revisions have been made in Section 2.2, 3.1 (Lines 147-150, 348-353) of the revised manuscript. The added description is as follows:.

“(c) Guiyang Dataset: This dataset covers the urban core area of Guiyang City, China, representing a typical mountainous urban environment. It was constructed following the same data preparation, annotation protocol, and data partition strategy as the Langfang dataset, ensuring consistency and comparability across datasets.

On the Guiyang dataset, UASFNet achieved the best overall performance, with an mIoU of 88.90%, an F1-score of 93.65%, and a Kappa value as high as 0.9493. Among the four land-cover categories, the IoU values for buildings, roads, and greenbelts reached 96.58%, 96.44%, and 94.62%, respectively.”

(2)Q: On line 256, the ratio of training to validation data is not specified. Please clarify this key experimental detail.

A: We thank the reviewer for pointing out this oversight. We apologize for not specifying the ratio of training to validation data in the original manuscript. To address this, we have explicitly clarified the data partition strategy and the specific quantity of data for both the Langfang and Potsdam datasets in the revised version.

Specifically, we specified a ratio of 70% / 10% / 20% (training/validation/test) for the Langfang dataset and provided the exact number of image patches for the Potsdam dataset. The revisions can be found in Section 2.2.2 (Lines 136-146) of the revised manuscript:

“(a) Langfang Dataset: This dataset was constructed using RGB three-band remote sensing images of the Langfang Economic Development Zone with a spatial resolution of 0.6 m. Four land-use classes were annotated using GIS software, and the dataset was partitioned into training, validation, and test sets using a spatial block segmentation method at a ratio of 70% / 10% / 20%, respectively.

(b) Potsdam Dataset: This dataset is derived from ultra–high-resolution TOP imagery with a ground sampling distance (GSD) of 5 cm. The Potsdam region is known for its complex building layouts and dense urban structures. The dataset covers an area of 3.42 km² and includes pixel-level annotations for six semantic categories. It has become a standard benchmark for semantic segmentation research. In this study, an improved four-class semantic labeling scheme—including buildings, greenbelts, roads, and others—was adopted to better suit urban analysis tasks, resulting in 2299 training, 605 validation, and 1694 test image patches of size 1024 × 1024.”

(3)Q: For large-area identification, even with high-resolution imagery, errors are inevitable. However, the paper does not address the error characteristics of the reference data, which is critical for evaluating the reliability of the proposed method.

A: This is a highly insightful and critical comment. We fully agree with the reviewer that addressing the error characteristics of the reference data is essential for properly evaluating the reliability of the proposed method. To address this concern, we have added a discussion acknowledging that, as with most large-area urban mapping studies, reference labels are subject to unavoidable uncertainties, particularly in transitional zones, complex building boundaries, and shadow-affected areas. Furthermore, we clarified that since all comparative models were trained and evaluated using the same reference data and partitioning strategy, these uncertainties affect all methods consistently and do not compromise the validity of the relative performance comparison.

The revisions can be found in Section 2.4 (Lines 230-234) of the revised manuscript:

“It should be noted that, as in most large-area urban land cover mapping studies, reference labels are subject to unavoidable uncertainties, particularly in transitional zones, complex building boundaries, narrow road segments, and shadow-affected areas. Since all models were trained and evaluated using the same reference data and partitioning strategy, such uncertainties affect all methods consistently and do not compromise the validity of the relative performance comparison.”

(4)Q: The discussion section requires revision. Since five algorithms are mentioned, it would greatly enhance the persuasiveness of your work to compare your algorithm with these existing methods using the same study areas reported in their respective papers.

A: We greatly appreciate the reviewer’s suggestion regarding the comparative analysis. We understand that evaluating algorithms on their original study areas can provide context. However, direct comparisons across different original datasets can introduce significant bias due to variations in image sources, resolutions, annotation standards, and partition strategies.

To ensure the most rigorous and fair comparison, our strategy was to evaluate all five algorithms under identical experimental settings (using the same datasets and data partition protocols). This approach eliminates the interference of data heterogeneity and allows for a more accurate assessment of the intrinsic performance differences between the models. We have clarified this rationale in the revised manuscript to emphasize the validity of our controlled comparison. The revisions can be found in Section 4.1 (Lines 488-489) of the revised manuscript:.

“All models were trained and evaluated on the same datasets and data partitions to ensure a controlled and fair comparison across different urban scenarios.”

Response to reviewer #2:

(1)Q: All figures should be improved in term of map elements delivery and clarity. For each figure, include clear captions contextualizing the data presented, methodologies used for analysis, and key takeaways. e.g. Fig. 3 it is difficult to identify difference between others and road class. I suggest mark road as red/grey and others class as sand color. Figure 1 is difficult to understand. Only there are two steps highlighted, flow is not symmetrical.

A: Thank the reviewers for their detailed suggestions regarding the quality of the charts, we have redrawn the relevant diagrams in the text, with a particular focus on enhancing the expression of map elements and improving the overall clarity. The specific modifications are as follows:

Regarding Figure 1: We have redesigned the structure of the flowchart, optimized the highlighting of steps, and ensured that the overall process is more symmetrical and the logic is more coherent, so that readers can understand our technical route more easily.

Regarding Figure 3: We have adopted your suggestion and adjusted the color scheme, enhancing the contrast between the categories (especially between the roads and other categories) to ensure a clear visual distinction.

The updated figures and captions can be found in Section 2 of the revised manuscript. The revised captions are as follows:.

Fig. 1 Overall framework of the proposed fine-scale urban land cover change assessment method.

Fig. 3 Example image patches and corresponding ground truth labels from the Potsdam dataset.”

(2)Q: How does the proposed UASFNet model compare to traditional methods in terms of accuracy and computational efficiency?

A: We thank the reviewer for this important question. we have provided a detailed comparative analysis in the revised manuscript (see Table 5). The results indicate that, from the perspective of model efficiency and performance balance, UASFNet demonstrates clear advantages over existing deep learning–based segmentation approaches. Specifically, our model achieves superior segmentation accuracy without relying on excessively large parameter sizes or computational costs. This balanced performance is primarily attributed to the Urban Adaptive Shared-feature Attention (UASF) module, which promotes effective semantic information sharing across multiple land cover categories while suppressing redundant feature propagation. As a result, UASFNet is able to extract more compact and discriminative representations, improving recognition performance under comparable computational conditions.

The revisions and detailed discussions can be found in Section 4.1 (Lines 481-488) and Table 5 of the revised manuscript..

“From the perspective of model efficiency and performance balance, UASFNet demonstrates clear advantages over existing deep learning–based segmentation approaches. As summarized in Table 5, the proposed model achieves superior segmentation accuracy without relying on excessively large parameter sizes or computational costs. This balanced performance can be attributed to the Urban Adaptive Shared-feature Attention (UASF) module, which promotes effective semantic information sharing across multiple land cover categories while suppressing redundant feature propagation. As a result, UASFNet is able to extract more compact and discriminative representations, improving recognition performance under comparable computational conditions.

Table 5 Model Efficiency and Performance Comparative Analysis

Method FLOPs (G) Param.(M) mIoU (%)

ResNet 23.5 47.43 84.41

UNetformer 30.17 61.59 87.34

CMTFNet 76.41 96.14 90.30

CM-Unet 33.26 64.02 89.27

MFNet 54.39 67.72 87.45

UASFNet (Ours) 30.925 79.88 91.52

(3)Q: In 2.1 Fine-Scale Assessment Framework (Lines 106-113), what specific preprocessing techniques were applied to the bi-temporal remote sensing images? Could this impact the overall accuracy of the analysis?

A: We thank the reviewer for the inquiry regarding preprocessing details. We agree that specifying these steps is essential for reproducibility and reliability.

In the revised manuscript, we have explicitly detailed the preprocessing techniques applied. Specifically, to ensure consistency across multi-source and multi-temporal imagery, we performed geometric co-registration, spatial resampling to a unified resolution, and RGB value normalization. Regarding the potential impact on accuracy, we clarified that we deliberately avoided aggressive radiometric correction to preserve original spatial patterns. Furthermore, by applying this identical preprocessing pipeline to all datasets and baseline models, we minimize bias and ensure a fair and unbiased comparison.

The revisions can be found in Section 2.2.2 (Lines 151-157) of the revised manuscript:

“For all datasets, standard preprocessing procedures were applied prior to model training and inference to ensure consistency across multi-source and multi-temporal imagery. These procedures include geometric co-registration between different acquisition periods, spatial resampling to a unified spatial resolution, and normalization of RGB values. No aggressive radiometric correction or handcrafted feature extraction was introduced, in order to preserve original spatial patterns. The same preprocessing pipeline was applied to all datasets and baseline models to ensure a fair and unbiased comparison.”

(4)Q: For 2.2 UASFNet Model (Lines 117-164), how does the shared-feature attention mechanism operate in varying urban contexts? Are there limitations to its adaptability?

A: We thank the reviewer for this insightful question regarding the core mechanism of our model. Discussing the adaptability and boundary conditions of the attention mechanism is crucial for understanding the model's reliability.

In the revised manuscript, we have elaborated on the operational principle of the UASF mechanism: it operates in a data-driven manner to learn attention weights, thereby automatically focusing on semantic and structural patterns that remain consistent across varying urban contexts (such as building boundaries, road connectivity, and greenbelt textures).

In the discussion section, relevant explanations were also provided. When encountering significant "domain shifts" (such as huge differences in architectural styles, severe shadow effects, or highly heterogeneous surface materials), the adaptability of the model may be limited.

The revisions and discussion can be found in Section 4.1 (Lines 493-502) of the revised manuscript:

“The semantic–structural consistency kernel encourages the alignment of semantic representations with geometric cues, which is beneficial for preserving object boundaries and reducing confusion between visually similar land cover classes. The gradient-domain formulation highlights local structural variations, such as edges and shape transitions, that are particularly relevant for fine-scale urban patterns. The global gating operator adaptively modulates the relative contributions of semantic and structural information across spatial locations, thereby promoting spatial coherence while limiting redundant feature propagation.

The performance advantages of UASFNet can be attributed to the Urban Adaptive Shared-feature Attention (UASF) mechanism. This mechanism is designed to emphasize semantic and structural patterns that are consistently informative across different urban contexts, such as building boundaries, road connectivity, and greenbelt textures. Because the attention weights are learned in a data-driven manner rather than being manually specified, the mechanism can adapt to variations in urban morphology, density, and spatial configuration across different study areas. Nevertheless, its adaptability may be constrained under conditions involving pronounced domain shifts, such as cities with markedly different architectural styles, severe shadow effects, or highly heterogeneous surface materials. In such cases, incorporating additional domain-specific supervision or multi-source auxiliary information may further enhance robustness, which is considered a direction for future work.”

(5)Q: (Lines 165-171) Why were the specific evaluation metrics chosen, and do you believe they comprehensively reflect the model's performance?

A: We thank the reviewer for the question regarding the selection of evaluation metrics. We fully agree that a well-justified combination of metrics is essential for a comprehensive and objective assessment of model performance.

To address this, we have elaborated on the scientific rationale behind our choices in the revised manuscript. Specifically, we employed mIoU as the primary metric for quantifying overall pixel-level accuracy. This is complemented by Precision, Recall, and F1-score to characterize class-wise discrimination, which is particularly critical for handling imbalanced or structurally complex categories common in urban scenes. Furthermore, we included the Kappa coefficient to evaluate the agreement between predicted and reference maps beyond random chance.

We believe that this suite of metrics comprehensively reflects the model's performance by capturing pixel-wise accuracy, class-wise consistency, and statistical robustness. This ensures that the resulting land-cover maps serve as reliable inputs for the subsequent urban disturbance analysis. The revisions can be found in Section 2.4 (Lines 217-224) of the revised manuscript:

“The mean Intersection over Union (mIoU) was used as the primary metric to quantify overall pixel-level classification accuracy across land-cover classes. Precision, recall, and F1-score were employed to further characterize class-wise discrimination performance, particularly for imbalanced or structurally complex categories. In addition, Kappa coefficient was included to evaluate the agreement between predicted and reference maps beyond chance, providing a robust measure of classification reliability. Together, these metrics capture not only pix

Attachment

Submitted filename: Response to Reviewers.docx

pone.0342350.s004.docx (4.8MB, docx)

Decision Letter 1

Chong Xu

22 Jan 2026

A fine-grained evaluation framework for urban land cover change based on feature monitoring with remotely sensed imagery

PONE-D-25-61989R1

Dear Dr. Zheng,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager®  and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact billing support .

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Chong Xu

Academic Editor

PLOS One

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #1: All comments have been addressed

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions??>

Reviewer #1: (No Response)

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: (No Response)

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy

Reviewer #1: (No Response)

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: (No Response)

Reviewer #3: Yes

**********

Reviewer #1: (No Response)

Reviewer #3: I would like to thank the authors for their careful responses to my prior comments. The revisions have strengthened the manuscript. The goal was to make the contribution easier to interpret, reproduce, and evaluate relative to existing approaches.

**********

what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy

Reviewer #1: No

Reviewer #3: No

**********

Acceptance letter

Chong Xu

PONE-D-25-61989R1

PLOS One

Dear Dr. Zheng,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS One. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

You will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Chong Xu

Academic Editor

PLOS One

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Urban Adaptive Shared-feature Attention (UASF) Module.

    (DOCX)

    pone.0342350.s001.docx (17.3KB, docx)
    Attachment

    Submitted filename: PONE-D-25-61989_Report.docx

    pone.0342350.s002.docx (885KB, docx)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0342350.s004.docx (4.8MB, docx)

    Data Availability Statement

    Due to data use agreements and institutional restrictions, the datasets generated and/or analyzed during the current study are not publicly available. Data access requests may be directed to the Data Management Office of Tianjin Huashui Engineering Consulting Co., Ltd. (contact email: tjhuashui_data@163.com), which is independent of the authors and responsible for coordinating data access and ensuring long-term data stewardship on behalf of the institution. Access will be considered upon reasonable request and subject to institutional approval.


    Articles from PLOS One are provided here courtesy of PLOS

    RESOURCES