Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2021 May 11;11:9979. doi: 10.1038/s41598-021-89034-5

First comprehensive quantification of annual land use/cover from 1990 to 2020 across mainland Vietnam

Duong Cao Phan 1,2,, Ta Hoang Trung 3, Van Thinh Truong 4, Taiga Sasagawa 1, Thuy Phuong Thi Vu 5, Dieu Tien Bui 6, Masato Hayashi 7, Takeo Tadono 7, Kenlo Nishida Nasahara 8
PMCID: PMC8113344  PMID: 33976255

Abstract

Extensive studies have highlighted a need for frequently consistent land cover information for interdisciplinary studies. This paper proposes a comprehensive framework for the automatic production of the first Vietnam-wide annual land use/land cover (LULC) data sets (VLUCDs) from 1990 to 2020, using available remotely sensed and inventory data. Classification accuracies ranged from 85.7 ± 1.3 to 92.0 ± 1.2% with the primary dominant LULC and 77.6 ± 1.2% to 84.7 ± 1.1% with the secondary dominant LULC. This confirmed the potential of the proposed framework for systematically long-term monitoring LULC in Vietnam. Results reveal that despite slight recoveries in 2000 and 2010, the net loss of forests (19,940 km2) mainly transformed to croplands over 30 years. Meanwhile, productive croplands were converted to urban areas, which increased approximately ten times. A threefold increase in aquaculture was a major driver of the wetland loss (1914 km2). The spatial–temporal changes varied, but the most dynamic regions were the western north, the southern centre, and the south. These findings can provide evidence-based information on formulating and implementing coherent land management policies. The explicitly spatio-temporal VLUCDs can be benchmarks for global LULC validation, and utilized for a variety of applications in the research of environmental changes towards the Sustainable Development Goals.

Subject terms: Carbon cycle, Environmental sciences, Hydrology, Natural hazards, Planetary science, Ecology, Climate sciences

Introduction

Information about land use/land cover (LULC) and its dynamic changes are fundamental to a variety of studies on environmental issues1 such as climate change2, drought and flood3, and carbon emissions4. That is, frequently updated accurate LULC products provide policymakers with a profound understanding of the complex interplay between land use/cover change (LUCC) and its risk, which helps to inform coherent policies for the sustainable management of land resources57.

The ready availability of remote sensing data and computing technologies opens a great era in cost-effective mapping LULC at a broad scale. Numerous algorithms have been developed to improve LULC classification, e.g. Spatial Temporal Adaptive Algorithm8, Automatic Land Cover Classification Method9, and Apply Change-vector Analysis in Posterior Probability Space10. Together with the development of these complex algorithms, special projects have been designed for large-scale land cover assessment. For example, at the 10-m spatial resolution, several attempts have been made to publish 13-category LULC maps of Europe11,12 and global LULC maps13 using a great set of Sentinel MSI images. For a coarser spatial resolution (30 m), there are quality multi-category LULC products, including the National Databases of the United States1417, and the GlobeLand30 global product of 10-category LULC18. Nonetheless, owing to the computational restriction and the limitations of representative reference data to train and test classifiers, these products have not reflected consistently and frequently the detailed patterns and characteristics of LULC at local or national scales19,20. Also, due to the predefined research periods and differences in the land cover classification systems (LCCSs), these products seldom meet the prime requirement of projects’ specific objectives.

Recently, a remarkable performance in cloud computing has advanced LULC observation sciences. For example, the National Aeronautics and Space Administration Earth Exchange (NASA-NEX) and Amazon Web Service (AWS) allow analysts to access and process the NASA Earth Observation (EO) data on the cloud21. More importantly, Google Earth Engine (GEE) provides an outstanding cloud computing platform with open access to a variety of EO data. Thanks to the potential of big data processing of these platforms, researchers have completed extensive studies at greater extent, for example, on urban change monitoring22, cultivated land mapping23, and forest disturbance detection24. Multi-category land cover products were also produced such as a 13-category land cover map of South-east Asia covering 11 nations25. Although the overall accuracy of such products reaches up to 86%, the authors identify limitations regarding the insufficiency of high-quality reference data for time-series analyses26. Therefore, very few studies have conducted for multi-temporal LULC mapping at a broad scale.

The quantity and quality of training data play an essential role in the production of LULC maps. Yet, collecting sufficient and precise training data requires considerable efforts, especially at large scales and multiple periods27. Several attempts have been proposed methods that allow to collect cost-effectively high-quality training data. Bagan et al. (2019) extracted training data from a previous land cover map and utilized them for mapping new LULC products28. While the authors applied a bi-temporal spectral measurement to decrease the bias of extracted training data, the accuracy of these data may not be ensured due to the inherent classification errors of the previous maps29. To enhance the effectiveness of training data collection, Huang et al. (2020) used spectral similarity and distance indicators to detect the changed and unchanged training sites, and thus kept the unchanged ones as migrated training data27. The measurement was applied for the availability of Landsat TM images. Results showed that the accuracies of the migrated training data obtained over 92.98% and the classification map which used the migrated training data had a similar overall accuracy of 71.42% to that used ground-truth data in 2010. Nevertheless, these results were validated by outdated maps, namely the ESA global CCI land cover data sets, which may contain inherent classification errors. In addition, using the Sentinel MSI images, Ghorbanian et al. (2020) employed the same approach to migrate Iran-wide training data from 2017 to 201926. The classified map that utilized the migrated training data obtained a great accuracy of 91.35%. Despite the potential of the automatic training migration method, it is still not known whether this method can be applied for multi-sensor data sources such as the different Landsat sensors or the harmonized Landsat and Sentinel images27. Therefore, an exploration of the potential training migration method for multi-sensor remote sensing data is integral for a time-series assessment of multi-category LULC dynamics at a large scale.

Given the ideas, this research aim is to explore the potential training migration method for multi-sensor remote sensing data and then produce the first Vietnam-wide annual land use/cover data sets (VLUCDs) from 1990 to 2020 as a case study. In Vietnam, remotely sensed data have been utilized to produced quality LULC products, but most products cover a small area of the country or a few predefined periods3032. The previous inter-provincial LULC data sets were seven-category LULC maps for the central and southern Vietnam in 2007 and 2017, and the northern Vietnam in 2007 and 201520. More recently, Vietnam-wide maps were produced to map annual forest cover from 2015 to 201919,33. Despite the potential of these products, due to the primary focus on forest monitoring, the classification accuracy of non-forest LULC categories may be insufficient for other applications. Meanwhile, there has been a highly dynamic LUCC which varies among different regions in Vietnam. Despite the report of continuous net forest gain by the Ministry of Agricultural and Rural Development (MARD)34, a systematically comprehensive review has reported forest loss in Vietnam35,36. We do not know exactly the rates and patterns of changes at the nation scale. Hence, timely, accurate, and comprehensive LULC products can provide a profound understanding of LUCC patterns and processes. This information can supports policymakers in forming crucial decisions on sustainable development and resource management. The maps may be benchmarks for quantifying regional and global land cover products.

The central novelty of this paper is to propose a new framework for the automatic nationwide annual LULC monitoring and provide the results of the first VLUCDs and LUCC over the recent three decades. There are major tasks: (1) Data preparation; (2) Design a proper LCCS and reference data; (2) Proposing a consistent framework for the automatic production of the VLUCDs; (3) Creating and validating the VLUCDs, and; (4) Detecting profound changes in LULC since 1990. We developed a new random-forest-based classification approach to classify the wide availability of Landsat Thematic Mapper (TM), Enhanced Thematic Mapper Plus (ETM+) and Operational Land Imager (OLI), and Sentinel C-band Synthetic Aperture Radar Ground Range Detected (SAR GRD) and MultiSpectral Instrument (MSI) time-series images over the study period. We describe fully our implementation in the method section.

Results

The accuracy of the first VLUCDs

Utilizing ground-based data and all the freely available remotely sensed images, we have provided a coherent method and the results of the first VLUCDs. The proposed method generated consistently spatio-temporal LULC maps, using a definitive LCCS designed with reference to end-users’ recommendations and a standard LCCS37. For a visual presentation, level-1 VLUCDs of the year 1990 and 2020 are presented in Fig. 1. The 5-year-interval maps (1990, 1995, 2000, 2005, 2010, 2015, and 2020) of level-2 VLUCDs are presented in Fig. 2. The level-1 and level-2 VLUCDs included ten categories of primary dominant land use/cover (PDLC) and eighteen categories of secondary dominant land use/cover (SDLC), respectively (see detail in Supplementary Table S1).

Figure 1.

Figure 1

(a, b) show the level-1 Vietnam-wide LULC maps in 1990 and 2020 produced from a fusion of Landsat TM, ETM + and OLI, and Sentinel SAR GRD and MSI images with the random-forest-based algorithm. (c) presents a spatial–temporal dynamic change in LULC from 1990 to 2020 in Vietnam. This figure is generated using QGIS 3.18.0-Zurich (https://qgis.org/en/site/) while the country boundary is extracted from the GADM (https://gadm.org/about.html).

Figure 2.

Figure 2

The level-2 Vietnam-wide LULC maps in 1990, 1995, 2000, 2005, 2010, 2015 and 2020 produced from a fusion of Landsat TM, ETM + and OLI, and Sentinel SAR GRD and MSI images with the random-forest-based algorithm. This figure is generated using QGIS 3.18.0-Zurich (https://qgis.org/en/site/) while the country boundary is extracted from the GADM (https://gadm.org/about.html).

The reliability of the VLUCDs was evaluated by using both visual interpretations and statistical approaches. Based on high-resolution satellite images in Google Earth, we found that the VLUCDs were clear and noise-free. A confusion matrix method with stratified random sampling (1050 points/LULC category) was utilized to independently validate classification accuracy. Statistical metrics were measured, namely producer accuracy (PA), user accuracy (UA), F1 score, overall accuracy (OA), standard error (SE), and kappa coefficient (KC). These metrics of the level-1 and level-2 LULC maps of the year 2020 are fully described in Tables 1 and 2. Meanwhile, overall obtained OA, KC, and uncertainty of the level-1 and level-2 VLUCDs are presented briefly in Figs. 3 and 4. Uncertainty of measurement was estimated with a 95% confidence interval. Specifically, the OA of the level-1 and level-2 LULC maps ranged from 77.6 ± 1.2% to 84.7 ± 1.1%, and 85.7 ± 1.3% to 92.0 ± 1.2% over the study period, respectively.

Table 1.

Confusion matrix of the 2020 Vietnam-wide land use/cover map (Level 1) created from the integration of Landsat OLI, Sentinel SAR GRD and MSI satellite images with the random-forest-based algorithm. PA: Producer accuracy (%); PA: Producer accuracy (%); UA: User accuracy (%); SEM: Standard error of the mean for UA; F1: F1 score; Overall accuracy: 91.6%, and Kappa coefficient: 90.7 (%). RL: Residence; RP: Rice paddies; CL: Cropland; GL: Grassland; BL: Barren land; SL: Scrubland; FL: Forest land; WL: Wetland; OW: Open water; AC: Aquaculture.

Land cover map
RL RP CL GL BL SL FL WL OW AC
Reference data RL 988 3 21 1 6 4 0 2 0 2
RP 1 988 26 1 4 1 8 0 0 1
CL 27 18 772 1 28 38 17 1 0 0
GL 1 15 57 990 15 8 64 0 0 0
BL 25 8 14 44 980 11 11 0 0 1
SL 0 4 23 13 17 986 44 1 0 3
FL 1 2 14 0 0 2 853 0 0 0
WL 3 2 119 0 0 0 53 1040 0 44
OW 0 3 0 0 0 0 0 1 1031 4
AC 4 7 4 0 0 0 0 5 19 995
PA 96.2 95.9 85.6 86.1 89.6 90.4 97.8 82.5 99.2 96.2
UA 94.1 94.1 73.5 94.3 93.3 93.9 81.2 99.0 98.2 94.8
SEM 0.7 0.7 1.4 0.7 0.8 0.7 1.2 0.3 0.4 0.7
F1 0.95 0.95 0.79 0.90 0.91 0.92 0.89 0.90 0.99 0.96

Table 2.

Confusion matrix of the 2020 Vietnam-wide LULC map (Level 2) created from the integration of Landsat OLI, and Sentinel SAR GRD and MSI satellite images with the random-forest-based algorithm. PA: Producer accuracy (%); UA: User accuracy (%); SEM: Standard error of the mean for UA; F1: F1 score; Overall accuracy: 84.7%, and Kappa coefficient: 83.8 (%). R1: Residence 1; R2: Residence 2; RP: Rice paddies; WC: Woody crops; OC: Other crops; IC: In-house crops; GL: Grassland; BL: Barren land; SL: Scrubland; DBF: Deciduous broadleaf forest; EBR: Evergreen broadleaf forest; ENF: Evergreen needleleaf forest; PL: Plantation land; MF: Mangrove forest; IW: Inland wetland; OW: Open water; AC: Aquaculture; BA: Bamboo areas.

Land cover map
R1 R2 RP WC OC IC GL BL SL
Reference data R1 804 137 3 0 10 0 2 6 1
R2 212 848 4 17 12 3 1 7 8
RP 0 1 932 5 27 1 3 5 0
WC 0 2 12 674 45 0 6 3 6
OC 0 2 27 26 801 2 1 5 11
IC 15 34 9 15 26 1044 0 26 84
GL 0 0 14 39 45 0 975 17 9
BL 16 14 11 8 12 0 40 960 10
SL 0 1 3 13 8 0 17 16 898
DBF 1 0 1 21 5 0 0 2 17
EBF 0 0 1 8 1 0 2 1 1
ENF 0 0 0 37 0 0 0 0 5
PL 0 0 12 12 16 0 1 1 0
MF 0 6 2 7 2 0 0 1 0
IW 0 3 7 160 35 0 0 0 0
OW 1 0 2 0 0 0 0 0 0
AC 1 2 10 0 3 0 0 0 0
BA 0 0 0 8 2 0 2 0 0
PA 83.2 76.1 95.0 84.6 89.4 82.9 83.4 88.2 88.4
UA 76.6 80.8 88.8 64.2 76.3 99.4 92.9 91.4 85.5
SEM 1.3 1.2 1.0 1.5 1.3 0.2 0.8 0.9 1.1
F1 0.80 0.78 0.92 0.73 0.82 0.90 0.88 0.90 0.87
Land cover map
DBF EBF ENF PL MF IW OW AC BA
Reference data R1 0 0 0 0 0 0 0 3 0
R2 0 0 0 0 0 2 0 1 0
RP 2 0 0 5 0 0 0 0 0
WC 5 7 0 35 2 0 0 0 0
OC 3 2 0 15 0 1 0 0 0
IC 6 0 0 0 0 0 0 0 0
GL 2 24 0 43 0 0 0 0 1
BL 6 1 0 5 0 0 1 4 0
SL 33 12 0 13 0 0 0 2 0
DBF 980 30 4 35 0 0 0 0 0
EBF 2 355 1 41 0 0 0 0 6
ENF 9 202 1045 19 0 0 0 0 0
PL 2 39 0 603 2 3 0 0 0
MF 0 2 0 17 1036 1 1 3 0
IW 0 0 0 100 2 1042 0 95 0
OW 0 0 0 0 1 0 1028 3 0
AC 0 0 0 0 6 1 20 939 0
BA 0 376 0 119 1 0 0 0 1043
PA 89.4 84.7 79.3 87.3 96.1 72.2 99.3 95.6 67.2
UA 93.3 33.8 99.5 57.4 98.7 99.2 97.9 89.4 99.3
SEM 0.8 1.5 0.2 1.5 0.4 0.3 0.4 0.9 0.3
F1 0.91 0.48 0.88 0.69 0.97 0.84 0.99 0.92 0.80

Figure 3.

Figure 3

The overall accuracy (OA) and kappa coefficient (KC) of the level-1 Vietnam-wide annual LULC maps produced from the all freely available Landsat TM, ETM + and OLI, and Sentinel SAR GRD and MSI images with the random-forest-based algorithm. The OA and KC are obtained by using a confusion matrix and a stratified validation method with independent samples (1050 points/LULC category). The bars indicate uncertainties of OA measured with a 95% confidence interval.

Figure 4.

Figure 4

The overall accuracy (OA) and kappa coefficient (KC) of the level-2 Vietnam-wide annual land use/cover maps produced from all the freely available Landsat TM, ETM + and OLI, and Sentinel SAR GRD and MSI images with the random-forest-based algorithm. The OA and KC are obtained by using a confusion matrix and a stratified validation method with independent samples (1050 points/LULC category). The bars indicate uncertainties of OA measured with a 95% confidence interval.

With the numerous LULC types and long-term observation, these results constituted an outstanding achievement38. For the detailed LULC products (Level 2), open water and mangrove had the highest levels of accuracy, accounting for over 96% in both the PA and UA. This successful classification may be explained by the benefits of spectral indices (Table 4) such as the NDWI, WRI, and NDPI, which could distinguish open water from land, aquaculture ponds, and others, whereas mangrove can be accurately identified with the MVI39. This was followed by rice paddies, which had an accuracy of above 90%. It seems possibly that rice is frequently cultivated in flat terrain, where is not be affected by topographic problems such as the shadows of mountainsides. The spectral reflectance of rice is also stable40. Although the model could separate forests from others, it tended to misclassifying different forests. Another limitation is to classify plantation forests from woody crops, which is also found by numerous studies41,42. Likewise, the model could not entirely divide the different types of residential areas, but it showed a clear separation of the residential areas from others. To increase the accuracy of the maps for further analyses, we combined these mixed categories. This combination obtained an increase in accuracy of approximately 6% with a few losses of detail in LULC types.

Table 4.

Spectral indices derived from Landsat TM, ETM + and OLI, and Sentinel MSI satellite images to enhance the accurate performance of Vietnam-wide annual LULC mapping from 1990 to 2020.

Name Equation No. Ref
Atmospherically Resistant Vegetation Index ARVI=Bnir-2Bred+BblueBnir+2Bred+Bblue (1) 72
Difference Vegetation Index DVI=Bnir-Bred (2) 99
Enhanced Built-Up and Bareness Index EBBI=Bswir-Bnir10Bswir+Bnir (3) 100
Enhanced Vegetation Index EVI=2.5Bnir-BredBnir+6Bred-7.5Bblue+1 (4) 101
Green Chlorophyll Index GCI=BnirBgreen-1 (5) 102
Mangrove Vegetation Index MVI=Bnir-BgreenBswir-Bgreen (6) 80
Normalized Burn Ratio NBR=Bnir-BswirBnir+Bswir (7) 81
Normalized Different Bareness Index NDBaI=Bswir-BtirBswir+Btir (8) 79
Normalized Difference Built-Up Index NDBI=Bswir-BnirBswir+Bnir (9) 91
Normalised Difference Pond Index NDPI=Bgreen-BswirBgreen+Bswir (10) 92
Normalized Difference Turbidity Index NDTI=Bred-BgreenBred+Bgreen (11)  92
Normalized Difference Vegetation Index NDVI=Bnir-BredBnir+Bred (12) 93
Normalized Difference Water Index NDWI=Bgreen-BnirBgreen+Bnir (13) 94
Soil Adjusted Vegetation Index SAVI=1.5Bnir-BredBnir+Bred+0.5 (14) 95
Structure Insensitive Pigment Index SIPI=Bnir-BblueBnir-Bred (15) 96
Urban Index UI=Bswir-BnirBswir+Bnir (16) 97
Water Ratio Index WRI=Bgreen+Bred+BnirBblue (17) 98

Distribution and trend of LUCC

Change detection was conducted to comprehend LUCC patterns and processes. To this end, the level-1 (PDLC) Vietnam-wide annual LULC data sets (L1-VLUCDs) was utilized for further analysis in this study. Although the annual maps are integral to obtain the process of LULC dynamic changes in Vietnam, the five-year-interval land cover products in 1990, 1995, 2000, 2005, 2010, 2015, and 2020 were utilized to acquire a more profound change visualization. A post-classification analysis method was employed to measure the spatio-temporal LUCC and the percentage of changes.

The spatial distributions and the patterns of Vietnam LULC are shown in Fig. 1. The temporal distribution of the net changes in LULC from 1990 to 2020 is presented in Fig. 5. The most dominant LULC was forests, accounting for approximately half area of the entire country. This was followed by croplands (16.3%), rice fields (14.2%), and open water (including parts of saltwater, 8.1%). Grassland and scrubland occupied a relatively similar proportion (2.8%) while the smallest LULC was residential areas (1.3%).

Figure 5.

Figure 5

Temporal distribution of LULC across Vietnam extracted from the level-1 Vietnam-wide annual LULC data sets. The data labels represent the area of each LULC category (km2) in the year 1990, 1995, 2000, 2010, 2015, and 2020.

What can be clearly seen in Fig. 5 is the steady decline of forest area from 170,458 km2 in 1990 to 150,517 km2 in 2020. The area of wetlands experienced a slight increase from 4404 km2 in 1900 to 5138 km2 in 1995, followed by a continual decrease to 2490 km2 in 2020. In contrast, there was a sharp increase in the area of aquaculture and residential land, accounting for approximately three and ten times over the three decades. The area of open water showed a slight rise while there was a small fluctuation in the area of the other land types.

Meanwhile, Fig. 6 reveals the highly dynamic change in LULC in Vietnam. The most considerable change was the area of urban land, which increased by about 50% over each five-year interval. Interestingly, the graph shows substantially opposite trends between aquaculture and wetlands. The area of aquaculture decreased 8% whereas the wetlands expanded 17% from 1990 to 1995, followed by a 50% increase in aquaculture but a 22% decrease in wetlands by 2000. Forest cover had undergone an up-and-down variation by 2015, but it has presented a remarkable drop until now. It is noticed that the percentage of change in the forest cover was insignificant, but its dynamic areas were remarkable (Fig. 7).

Figure 6.

Figure 6

Temporal dynamics of net changes in LULC across Vietnam, extracted from the level-1 Vietnam-wide annual LULC data sets in the years 1990, 1995, 2000, 2010, 2015, and 2020. The data labels represent the percentage of changes (%) within five-year intervals. The positive and negative values indicate an increase and a decrease, respectively.

Figure 7.

Figure 7

LULC gain/loss and conversions between 1990 and 2020; “ + ” means gain and “-” means loss in area (km2).

Major spatio-temporal LULC dynamics

Vietnam’s LULC has experienced a considerable change over the past 30 years. Figure 1c shows the spatio-temporal dynamic changes. The north and south were the most dynamic areas, especially the western north and the most south. The dynamic conversions among different LULC can be seen in Fig. 7. There was a fundamentally dynamic conversion between forests and croplands. Forest areas remarkably reduced while residential and aquaculture land significantly increased. To easily visualize the LULC transformation, we created additional data in Fig. 8. It is noticed that a considerable proportion of forests was converted into croplands while a major driver of wetland loss resulted in the expansion of aquaculture. Residential lands mainly expanded on the areas of rice, croplands, and barren lands, which are located nearby coastlines. To acquire a more detailed visualization of change patterns, few hotspot regions were extracted throughout the country to discuss the change pattern and processes.

Figure 8.

Figure 8

Temporal gross land use/cover conversions in Vietnam. (a, b) represent transitions among different land types from 1990 to 2010, and from 2010 to 2020, respectively. The numbers indicate the areas of forests (km2).

Discussion

Large-scale annual LULC information is integral for understanding the land dynamic process, thus supporting the strategies of land management. In reviewing the literature, little consistent multi-spatio-temporal LULC data was found on a national or regional scale. In this study, a comprehensive framework is developed to produce consistently Vietnam-wide annual LULC data sets, using remote sensing and ground-based data. Results show that surface reflectance images can provide a coherent time-series data set as long as they are atmospherically corrected. Surprisingly, although all available Landsat images of the entire year are utilized, there are data gaps due to cloud and shadow masking areas. These gaps might affect the accuracy of classified maps even though gaps have been filled by ancillary data such as terrain indices. This issue may be explained by the fact that Vietnam is one of the cloudiest countries in the world43. However, the harmonious blend of Landsat OLI, Sentinel SAR GRD and MSI has filled such missing-data gaps since 2015, which can also improve the accuracy of mapping (Figs. 3, 4). Besides, the training migration model significantly reduced the cost and efforts in collecting training data.

Regarding change patterns, the rapid development of urbanization is considered an essential interest in Vietnam. The expansion of urban areas has frequently occurred in the capital and regional capitals, namely Hanoi, Hai Phong, Da Nang, Ho Chi Minh (HCM), and Can Tho cities. Herein, we analyse the process of change in HCM as a typical example. As shown in Fig. 9a, the growth of urban land has remarkably increased since the 1990s. This may be explained by the fact that the introduction of new policies known as “Renovation” (1986), which has promoted the development of socio-economic factors, followed by a massive population migration to cities44. The urbanization has primarily taken place on croplands, which agrees with the findings of previous studies30,4446. The development is predicted to accelerate over developing regions, which causes the loss of croplands, and thus may threaten sustainability and livelihoods45.

Figure 9.

Figure 9

Spatial–temporal dynamics (left) and change pattern (right) of LULC in (a) residential land, (b) aquaculture land, and (c) forests land in Vietnam. This figure is generated using QGIS 3.18.0-Zurich (https://qgis.org/en/site/) while the country boundary is extracted from the GADM (https://gadm.org/about.html).

Another considerable change is the uncontrolled development of agricultural and aquaculture land, especially in the Vietnamese Mekong Delta (VMD). As part of the third-largest basin in the world, the VMD plays an integral role in the contribution of agricultural products, due to its favourable natural condition of 700 km coastline and a dense network of rivers. In fact, rice and shrimp have contributed to a vital position in Vietnam’s economic development for decades47,48. However, the intensification of uncontrolled aquatic farming has caused significant changes in LULC across the region, especially along coastal zones since 2000 (Fig. 9b). There are several possible explanations for this finding. In 2000 and 2001, the Vietnamese government proposed resolutions 09/ND-CP and 1116, which replaced low-value (e.g. rice) to high-value (e.g. shrimp and fruit) agricultural production, encouraging farmers to transform certain coastal areas into aquatic production. Also, due to the high profits of shrimp (200,000 VND/kg) in comparison to traditional crops such as rice (5000 VND/kg)30, numerous inland areas were converted into aquatic farming. Local people illegally cut mangrove forests to expand aquaculture in several coastal regions49. Since 2010, there was not only an increase in aquaculture, but also a significant conversion of other croplands into rice paddies (Fig. 9b). These changes resulted in the formulation of another policy aiming at the increase of intensive rice and fish farming in 201250,51. These findings indicate that the development of socio-economic policies is considered as the primary reasons driving LUCC. Land policies, therefore, should be formed and implemented in serious consideration of regional socio-economic and environmental development.

Contrary to the expectations, this study indicates a net area loss of forests instead of the constant increase reported by the Vietnamese Ministry of Agriculture and Rural Development (MARD). Despite the net forest regrowth in 2000 and 2010, the forest cover has undergone a decrease in recent years. Also, the area of forests in this study is greater than the data reported by the MARD. These inconsistencies are due to several reasons, especially the difference of forest definitions. The MARD excluded agricultural (e.g., rubber), aqua-cultural ecosystems, scattered trees, bamboos, and palms, etc. from forests34. These non-forest lands, covering a relatively large area of the country (e.g., 10 km2 of rubber only; 2017), were highly dynamic19 but not fully reported by the MARD. In 2008, the revised definition of forests set a minimum of 10% tree cover as forests, instead of 30% tree cover in the previous definition34. Since 2016, they started to include certain agricultural lands managed by the Vietnam Administration of Forestry in the forest lands but without forest cover51. These revisions likely resulted in an increase in the reporting data of forests. Furthermore, our results show that deforestation occurred in numerous regions. Figure 9c presents a representative example of forest loss in the central highlands. There has been a constant decrease in forest cover due to the expansion of rice paddies, barren lands, and croplands. This finding of forest loss corroborates the discoveries of a great deal of previous work in LULC observation covering Vietnam52,53.

Regarding limitations of this study, we could not estimate the benefit of the individual sensor’s characteristics although the harmonious blend of the Landsat OLI, Sentinel SAR GRD and MSI images can fill data gaps and improve the accuracy of mapping classification. Moreover, instead of using ground-truth data, we validated the annual maps of the year 1990 to 2014 with the data collected by visual interpretation. Although great and careful efforts were applied in the collection procedure, errors might not be inevitable due to the restricted high-resolution images in Google Earth, especially before 2000. In addition, we have utilized the random forest algorithm to eliminate the less important input features, but there are relatively numerous remaining features, resulting in a high computational cost. For future work, the nationwide annual multi-category LULC maps and overall change detection were successfully developed, but further research should be undertaken to investigate the drivers of LULC changes in more detail for individual land cover. Finally, deep learning neural networks are expected to be applied for large-scale LULC mapping.

Conclusion

Regularly updated and accurate LULC information is fundamental to interdisciplinary studies. The recent advancement of remote sensing and computational science has improved the mapping capacity of LULC. This study set out to develop a new framework for automatically monitoring nationwide annual LULC and provide the first VLUCDs over the past 30 years. To this end, we utilized ground-based data, the informative Landsat TM, ETM + and OLI, and Sentinel SAR GRD and MSI images, after comprehensively assembled and preprocessed on the GEE platform. Then, we developed a new random-forest-based method and an automatic training migration model (ATMM) to map Vietnam LULC. The obtained VLUCDs had overall accuracies ranging from 85.7 ± 1.3% to 92.0 ± 1.2% with the ten primary dominant land use/cover and 77.6 ± 1.2% to 84.7 ± 1.1% with the eighteen secondary dominant land use/cover. This confirms the potential of the proposed framework for systematically long-term monitoring of LULC in Vietnam. Results reveal that there was a decrease in the area of forests (19,940 km2) and wetlands (1914 km2) whereas the area of aquaculture and urban increased approximately three and ten times over the three decades, respectively. The deforestation was mainly due to the expansion of croplands, which were in return replaced by numerous built-up areas. The rapid growth of aquaculture was considered as a main driver of wetland loss. The explicit spatio-temporal benchmark of the VLUCDs can be utilized for a tremendous variety of applications in the research of environmental changes towards the Sustainable Development Goals. In addition, the ATMM allows analysts to remarkably save time, cost, and labour for collecting sufficient and representative training data. This proposed method is possible to apply for a multi-temporal LULC assessment at a broader scale.

Materials and methods

The overall method is presented in Fig. 10 with major steps: (1) Data preparation; (2) Defining a proper LULC classification system and reference data; (3) Proposing a consistent framework for the automatic production of Vietnam-wide annual LULC data sets from 1990 to 2020; (4) Creating and validating the VLUCDs, and; (5) Generating major change pattern and processes of LULC over the past three decades.

Figure 10.

Figure 10

The overall workflow for automatic Vietnam-wide annual land use/cover mapping and monitoring, using Landsat TM, ETM + and OLI, and Sentinel SAR GRD and MSI images with the random-forest-based algorithm. This figure is generated using yEd Graph Editor (https://www.yworks.com/products/yed). The logos of the Google Earth Engine, pyQGIS, Google Earth, and Machine Learning is taken from https://earthengine.google.com/https://automating-gis-processes.github.io/site/develop/lessons/L7/overview.htmlhttps://logos.fandom.com/wiki/Google_Earth, and https://www.pngitem.com/middle/hRJJRRJ_machine-learning-course-near-me-machine-learning-logo/, respectively.

Study area

The study area is mainland Vietnam with a population of 97 million people (2018; Fig. 11). The country covers an area of over 300,000 km2 including the Red River Delta, and the Mekong River Delta which is the third-largest delta in the world. The topography of Vietnam is diverse (up to 3300 m altitude) with over 75% of the total area being hills and mountains. These areas are covered by mainly tropical rainforests. Climate is changeable but dominated by a tropical monsoon type with mean annual humidity of 84%, mean annual rainfall from 1200 to 3000 mm, and mean annual temperature from 5 to 37 °C54. The complex patterns of climate and topography create the rich biodiversity and landscape heterogeneity of Vietnam’s LULC. Nonetheless, there are the identifying characteristics of LULC in different regions. While the southern region is principally occupied with rice, orchards and aquaculture lands, the northern region is primarily covered by forests and plantations, except for the Red River Delta. Dominant LULC types in the northern centre are evergreen broadleaf forests and annual croplands whereas woody crops, deciduous broadleaf forests, and evergreen needle-leaf forests dominate the southern centre. In this study, to reduce the complexity of the landscape information, we divided the whole country into five main regions and separately classified for each region. These regions are presented in Fig. 11.

Figure 11.

Figure 11

Location of mainland Vietnam in the world: major division zones (bold lines), distribution of validation data points across the country. These points are independent from the training data. This figure is generated using QGIS 3.18.0-Zurich (https://qgis.org/en/site/) while the country boundary is extracted from the GADM (https://gadm.org/about.html).

Land cover classification system

Defining a standard land cover classification system (LCCS) is a crucial step in the practical land cover assessment. It should be delineated precisely depending on the objectives of users and the availability of mapping resources. Most LULC maps employ the theory and framework of the International Geosphere-Biosphere Programme (IGBP)55, the Land Cover Classification System (LCCS)37, and the Coastal Change Analysis Program (C-CAP) Land Cover Classifications56. Meanwhile, the most updated LCCS of previous LULC products, covering entire Vietnam, includes 18 land cover categories57. However, some categories are inappropriate for Vietnam’s LULC. For example, snow and ice do not exist, while one cropland category does not represent the diverse croplands in Vietnam. Although detailed classifications of high and low developed built-up areas play a fundamental role in urban planning and management for the rapid urbanization of Vietnam, they are not in the previous LULC products. In this study, therefore, a new LULC classification system or topology was developed by remaining the appropriate categories of the Food and Agriculture Organization (FAO) LCCS and adding new proper categories based on the local biophysical environment and end-users’ recommendations in Vietnam. First, we classified a 10-category system of primary dominant land use/cover (PDLC/Level-1). The PDLC was then separated into more detailed land types to generate an 18-category system of secondary dominant land use/cover (SDLC/Level-2). We found that this system is appropriate for practically mapping and applications. The categories and descriptions of the system are presented in Supplementary Table S1.

Remote sensing data

Multi-sensor remote sensing data were used in this study. The data were pre-processed and derived from the GEE. The data were re-projected to Universal Transverse Mercator (UTM) projection (Zone 47-49 N and WGS-84 datum) and then resembled into a 30-m spatial resolution using a bicubic interpolation method58. The Geospatial Data Abstraction Library (GDAL), the Geographic Resources Analysis Support System, and Python were utilized for these processing tasks. Specifically, the data included the United States Geological Survey (USGS) Landsat TM, ETM+, and OLI Surface Reflectance Tier 1 with a 30-m spatial resolution, Sentinel MSI Level-2A and SAR GRD with a 10-m spatial resolution. The Landsat and Sentinel MSI have been atmospherically corrected while each scene of Sentinel SAR GRD was preprocessed using Sentinel-1 Toolbox for thermal noise removal, radiometric calibration, terrain correction using the Shuttle Radar Topography Mission (SRTM)59, and then converting to decibels60. Landsat ETM + images, after the Scan Line Corrector failure in 2003, were removed from this study since the failure may result in inconsistently time-series comparison. Over 99% of the datasets from the GEE archive are reported to have high geometric accuracy with the error being less than half a pixel27. Otherwise, the images were eliminated from our image collection to reduce the obvious bias of further analysis.

For reliable and consistent time-series analysis, further processing is integral. For the optical data, to reduce illumination impacts from elevation, aspect and slope, the topographic correction was performed using the Modified Sun-Canopy-Sensor Topographic Correction algorithm61. While the Landsat Ecosystem Disturbance Adaptive Processing System (LEDAPS) 62 was applied to perform atmospheric correction for Landsat TM and ETM+, the Land Surface Reflectance Code (LaSRC)63 was adopted for Landsat OLI. All Landsat images were masked and removed clouds, cloud shadows and saturation pixels utilizing the Function of Mask (CFMASK)64. Sen2Cor was adopted to correct atmospheric issues and mask clouds for Sentinel MSI65. Finally, because of the different solar and view angles of Landsat OLI and Sentinel MSI, normalizing the bidirectional reflectance distribution function (BRDF) was applied for the data. Although numerous approaches have developed for BRDF correction, the recent technique generated by Roy et al.66 is frequently utilized due to its reliability and effective implementation67. This method, therefore, was employed for the BRDF correction of all selected optical images in this study. For the Sentinel SAR GRD data, a further process was speckle filtering. The filtering was done using Lee filter, which is superior due to its capacity of maintaining point targets, edge, linear spaces and texture information68.

To increase the availability of cloud-free composite data, the harmonization of different Landsat satellite sensor images, and the Landsat OLI with Sentinel MSI into a congruent time-series was desirable for a cloudy region such as Vietnam. The harmonization allows to accurately compare across all years and to measure the spectral similarity and spectral distance between different years. The measurement of spectral similarity and spectral distance was applied for an automatic training migration model, which was described in the following sections. A linear transformation with band-respective coefficients was applied for the harmonization of Landsat TM and ETM + spectral feature to OLI spectral feature69. In the meantime, the harmonized Landsat OLI and Sentinel MSI images were processed by employing a method developed by Claverie et al.70. The band-respective coefficients with slope and intercept image constants are presented in Table 3. After that, we generated composites of seven bands including blue, green, red, nir, swir 1, swir 2 and thermal bands for two seasons, the dry season from April to September and the wet season from October to March of the following year. These composites were adopted to measure a variety of covariates, which is represented in the following paragraphs. In addition, seasonal composites of VV and VH polarization in ascending and descending orbits of Sentinel SAR GRD were handled in this research.

Table 3.

Band-respective coefficients are defined with slope and intercept image constants and used for the harmonized Landsat OLI and Sentinel MSI images.

Band-respective Blue Green Red NIR SWIR 1 SWIR 2
Harmonizing Landsat TM, ETM + and OLI Intercept 0.0003 0.0088 0.0061 0.0412 0.0254 0.0172
Slope 0.8474 0.8483 0.9047 0.8462 0.8937 0.9071
Harmonizing Landsat OLI and Sentinel MSI Intercept - 0.0107 0.0026 -0.0015 0.0033 0.0065 0.0046
Slope 1.0946 1.0043 1.0524 0.8954 1.0049 1.0002

Satellite-based covariate calculation

In this section, a series of covariates was calculated from the band composites. For the optical data, we calculated the medoid71 and the standard deviation for the six bands (Table 3). Following the successful application of numerous features extracted from original satellite image bands, this study also added the medoid of the 20th and 80th percentile57 of the six bands into the seasonal composites to detect the seasonal changes in the biophysical environment.

The ratios between spectral bands were calculated; they are blue/green, red/blue, red/green, red/nir and nir/(red*swir 1). Besides, a great number of spectral indices were also measured from Landsat TM, ETM + and OLI, and Sentinel MSI images (Eqs. (1–17); Table 4). In addition, we calculated the seasonal mean of VH, VV, and the normalized difference between VH and VV polarizations from the Sentinel SAR GRD images. Finally, we generated seasonal composite collections of covariates.

Ancillary data sets

Extensive research has shown that ancillary information can improve the accurate performance of LULC classification20,82,83. In this study, we first added terrain indices including slope, aspect and elevation. These indices were computed from ALOS Global Digital Surface Model or “ALOS World 3D-30 m (AW2D30)”84. Also, distance to rivers, coastlines, transport systems and buildings, and soil types were included in the covariate collections. The buildings and transport systems were generated from the OpenStreetMap, while soil types and river networks were extracted from the OpenDevelopmentMekong86.

Reference data

Reference data of 18 LULC categories (Supplementary Table S1) was created from field surveys, provincial LULC statistics, and visual interpretations. We conducted nationwide comprehensive surveys in 2015, 2016, 2018, 2019, and 2020 to collect 3078, 2659, 10,550, 41,986 and 32,853 reference samples, respectively. Along with these ground-observed data, previous outdated LULC maps19,20,87, provincial LULC statistics and high-resolution satellite images available in the Google Earth were also considered. Herein, we generated approximately 9360 polygons of single homogeneous LULC types (Fig. 11) throughout the country for each year from 2015 to 2020. From these polygons, we extracted up to 120,000 reference pixels (points) for each of the years. Meanwhile, due to the non-availability of ground-truth data, reference data from 1990 to 2014 were collected using provincial LULC statistics, the natural-color images of Landsat TM, ETM + and OLI, Sentinel MSI, and high-resolution satellite images available in the Google Earth. For each year, we randomly extracted 1050 points per LULC category for validating the classification models and the others were used for training the classification model.

The reference data of the year 2020 were utilized not only for creating and validating the LULC map of the year 2020 but also for implementing the automatic training migration model from this reference year to any target years. The migration model is described in more detail in the following section.

Automatic training migration model

Training data is tremendously essential in mapping LULC; however, collecting sufficiently accurate training samples is challenging, especially for large-scale areas, long-term history analyses, and data-scarce environments such as Vietnam73. If training data are not collected consistently, it can result in misclassification or low accuracies74. Thus, it is paramount to propose a practical approach for training data collections.

In this study⁠, we utilized an automatic model to migrate from the reference data of a reference year to target years. The method had three essential steps. First, we created a set of training data from a reference year (2020). Then, for each pixel, we computed its surface reflectance values from its corresponding Landsat TM, ETM +, and OLI images of the reference year and target years. The surface reflectance values of six bands (Table 3) were utilized for the measurement of Euclidean distance (ED)75 and spectral angle distance (SAD)76. Finally, with the ED (Eq. 18) and SAD (Eq. 19), we distinguished changed pixels and unchanged pixels by running a trial and error model to determine thresholds. Although the thresholds can be estimated by analysts, the experimented thresholds of ED and SAD in this study was 0.05 and 0.95, respectively. These thresholds were successfully applied to migrate the training data of the year 2020 to target years. The unchanged pixels were preserved and utilized as training data for the target years.

ED=i=1NXi-Yi2 18
SAD=arccosi=1NXiYii=1NXi2i=1NYi2 19

where X is spectral signature vector of an image pixel in the reference year; Y is spectral signature vector of an image pixel in the target year; N is the number of image bands (N = 6).

Machine learning modelling

After completing the data preprocessing task, we generated covariate collections, including (1) optical-image-based covariates in dry seasons and (2) in wet seasons, and (3) SAR-based covariates in dry seasons and (4) in wet seasons. The ancillary information was also added to these covariate collections. It is worth noting that some of these covariates or features may not significantly contribute to the enhancement of classification performance while overabundant features can affect the performance speed or run out of computing capacity of the classification model. Hence, random forest algorithm77 was employed to estimate important features. We removed some less important features and kept essential features which were represented in Supplementary Table S2.

For classification, we applied a random forest algorithm for several reasons. First, it has previously been observed that the random forest algorithm can handle principal drawbacks that a single-tree-based method may face such as an over-fitting and non-optimal solution78. Also, the random forest shows the outperformance of its rivals such as fuzzy adaptive resonance theory-supervised predictive mapping (Fuzzy ARTMAP), support vector machine (SVM), artificial neural network (ANN), Mahalanobis distance (MD), and spectral angle mapper (SAM)39.

A new random-forest-based approach was developed in this study. Unlike the common use of single-time classification, for each pixel, we independently estimated prior probability values belonging to each of the specified land covers for each of the covariate collections. These prior probability values were then joined to create a set of posterior probability values. The largest value of the posterior probabilities corresponding to a specific land cover was utilized to label the predicted land cover. However, the predicted prior probability of a pixel, for example, p(Ck), might reach almost zero or zero because of ‘No data’ of that pixel at that covariate collection. If this occurs, the posterior probability of that pixel will be nearly zero or zero. That is, although the prior probability of most other collections equals 100% voting for a specified land cover, the probability product of this pixel might be almost zero, causing misclassification. Hence, the prior probability of a pixel corresponding to a particular land cover must not be extremely tiny. To this end, Eq. (20) was developed to adjust the prior probability values while posterior probability values were calculated as Eq. (21).

p,Ck=cpCk+1-cN 20
pcCk=i=1Ipi,Ck 21

where p’(Ck) is the adjustment of prior probability value of a land cover Ck; c is a constant value (c = 0.7) which was evaluated by trial and error experiments in this study; N is the number of land cover categories (N = 18); pc(Ck) is the posterior probability value of category Ck; and I is the number of covariate collections (I = 4).

The random-forest-based model was performed using Scikit-Learn 0.22 and Python 3.8.5. Since the input data of multi-sensor image bands and covariates dramatically varied over the 30-year period, we could not optimize all the parameters of the random forest algorithm. The number of trees (n_estimators) in the forest and the size of the random subsets of features (max_features), however, are highly recommended to be adjusted77. Using RandomizedSearchCV in the Scikit-Learn, we found that n_estimators = 200 (trees) and max_features = 8 were optimal in this work. The other parameters were set as the default values.

Accuracy assessment

Following the wide-ranging recommendations of instruction manuals88,89, a statistic-based testing data set (Reference data section) was independently generated to estimate the accuracy of final LULC products. We utilized a stratified sampling (1050 points/LULC category) method and a confusion matrix to assess Vietnam-wide annual LULC products from 1990 to 2020. The matrix produced profound accuracy metrics, namely overall accuracy (OA), user accuracy (UA), standard error (SE), and kappa coefficient (KC). The uncertainty of accuracy was measured with a 95% confidence interval. These metrics are fully described in Tables 1 and 2.

Change analysis

The analysis of changes in LULC is to measure the differences including spatio-temporal dynamic patterns, the magnitude, and rate of variations observed over the study period. First, we estimated the diversity of LULC within each 30-m pixel width by counting the number of times that LULC changes over 30 years (Fig. 1c). We then estimated the area of each LULC within a five-year interval from 1990 to 2020 to observe the trend of LULC change (Fig. 5). We also computed the percentage of net change (Eq. 22) and then rescaled the percentage to a rank between 0 and 100% to monitor the most dynamic LULC (Fig. 6). Finally, we employed a Sankey diagram to emphasize the major transfers of LULC90.

p=At2-At1At1t2-t1100 22

where p is the percentage of net change; and At1 and At2 (km2) are the area of the LULC type in the observation years t1 and t2, respectively (t1 < t2).

Supplementary Information

Acknowledgements

The authors would like to thank the developers of the GEE for providing satellite images and a high computational platform for the big data processing. This work was supported from the Ecosystem Group of Earth Observation Research Centre (EORC) of the JAXA and the Project for Human Resource Development Scholarship by Japanese Grant Aid (JDS), Japan International Cooperation Agency (JICA) 10.13039/501100004532.

Author contributions

D.C.P. the conception, design of the work, the acquisition, analysis, and interpretation of data, the creation of new software used in this work, and the original draft; T.H.T. software, investigation, and visualization; V.T.T. validation, investigation, and the original draft; T.S. and T.P.T.V. the original draft and investigation; D.T.B., M.H., T.T. and K.N.N. writing – editing and substantively revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-021-89034-5.

References

  • 1.Gómez C, White JC, Wulder MA. Optical remotely sensed time series data for land cover classification: A review. ISPRS J. Photogramm. Remote. Sens. 2016;116:55–72. doi: 10.1016/j.isprsjprs.2016.03.008. [DOI] [Google Scholar]
  • 2.Dale VH. The relationship between land-use change and climate change. Ecol. Appl. 1997;7:753–769. doi: 10.1890/1051-0761(1997)007[0753:TRBLUC]2.0.CO;2. [DOI] [Google Scholar]
  • 3.Dai A. Increasing drought under global warming in observations and models. Nat. Clim. Chang. 2013;3:52–58. doi: 10.1038/nclimate1633. [DOI] [Google Scholar]
  • 4.Houghton RA, et al. Carbon emissions from land use and land-cover change. Biogeosciences. 2012;9:5125–5142. doi: 10.5194/bg-9-5125-2012. [DOI] [Google Scholar]
  • 5.Lambin EF, et al. The causes of land-use and land-cover change: Moving beyond the myths. Glob. Environ. Chang. 2001;11:261–269. doi: 10.1016/S0959-3780(01)00007-3. [DOI] [Google Scholar]
  • 6.Song XP, et al. Global land change from 1982 to 2016. Nature. 2018;560:639–643. doi: 10.1038/s41586-018-0411-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Grimm NB, et al. Global change and the ecology of cities. Science. 2008;319:756–760. doi: 10.1126/science.1150195. [DOI] [PubMed] [Google Scholar]
  • 8.Utkin LV, et al. A weighted random survival forest. Knowl. Based Syst. 2019;177:136–144. doi: 10.1016/j.knosys.2019.04.015. [DOI] [Google Scholar]
  • 9.Gašparović M, Zrinjski M, Gudelj M. Automatic cost-effective method for land cover classification (ALCC) Comput. Environ. Urban Syst. 2019;76:1–10. doi: 10.1016/j.compenvurbsys.2019.03.001. [DOI] [Google Scholar]
  • 10.Hu Y, Dong Y, Batunacun. An automatic approach for land-change detection and land updates based on integrated NDVI timing analysis and the CVAPS method with GEE support. ISPRS J. Photogramm. Remote Sens. 2018;146:347–359. doi: 10.1016/j.isprsjprs.2018.10.008. [DOI] [Google Scholar]
  • 11.ESA. Sentinel-2 Mission Requirements Document. Earth (2007).
  • 12.Main-Knorn, M. et al. Sen2Cor for Sentinel-2. In 3 (2017). 10.1117/12.2278218.
  • 13.Gong P, et al. Stable classification with limited sample: Transferring a 30-m resolution sample set collected in 2015 to mapping 10-m resolution global land cover in 2017. Sci. Bull. 2019;64:370–373. doi: 10.1016/j.scib.2019.03.002. [DOI] [PubMed] [Google Scholar]
  • 14.Homer C, Huang C, Yang L, Wylie B, Coan M. Development of a 2001 National Land-Cover Database for the United States. Photogramm. Eng. Remote. Sens. 2004;70:829–840. doi: 10.14358/PERS.70.7.829. [DOI] [Google Scholar]
  • 15.Fry JA, et al. Completion of the 2006 national land cover database for the conterminous united states. Photogramm. Eng. Remote Sens. 2011;77:858–864. [Google Scholar]
  • 16.Homer C, et al. Completion of the 2011 national land cover database for the conterminous United States—Representing a decade of land cover change information. Photogramm. Eng. Remote Sens. 2015;81:345–354. [Google Scholar]
  • 17.Yang L, et al. A new generation of the United States National Land Cover Database: Requirements, research priorities, design, and implementation strategies. ISPRS J. Photogramm. Remote Sens. 2018;146:108–123. doi: 10.1016/j.isprsjprs.2018.09.006. [DOI] [Google Scholar]
  • 18.Chen J, et al. Global land cover mapping at 30 m resolution: A POK-based operational approach. ISPRS J. Photogramm. Remote Sens. 2015;103:7–27. doi: 10.1016/j.isprsjprs.2014.09.002. [DOI] [Google Scholar]
  • 19.Hoang, T. T., Truong, V. T., Hayashi, M., Tadono, T. & Nasahara, K. N. New JAXA high-resolution land use/land cover map for Vietnam aiming for natural forest and plantation forest monitoring. Remote Sens.12, 2707 (2020).
  • 20.Duong PC, Trung TH, Nasahara KN, Tadono T. JAXA high-resolution land use/land cover map for Central Vietnam in 2007 and 2017. Remote Sens. 2018;10:1406. doi: 10.3390/rs10091406. [DOI] [Google Scholar]
  • 21.Nemani, R. Nasa earth exchange: Next generation earth science collaborative. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci.XXXVIII-8/, 17–17 (2012).
  • 22.Mugiraneza, T., Nascetti, A. & Ban, Y. Continuous monitoring of urban land cover change trajectories with landsat time series and landtrendr-google earth engine cloud computing. Remote Sens.12, 2883 (2020).
  • 23.Jin Y, Liu X, Yao J, Zhang X, Zhang H. Mapping the annual dynamics of cultivated land in typical area of the Middle-lower Yangtze plain using long time-series of Landsat images based on Google Earth Engine. Int. J. Remote Sens. 2020;41:1625–1644. doi: 10.1080/01431161.2019.1673917. [DOI] [Google Scholar]
  • 24.Hu Y, Hu Y. Detecting forest disturbance and recovery in Primorsky Krai, Russia, using annual landsat time series and multi-source land cover products. Remote Sens. 2020;12:129. doi: 10.3390/rs12010129. [DOI] [Google Scholar]
  • 25.Miettinen J, Shi C, Liew SC. 2015 Land cover map of Southeast Asia at 250 m spatial resolution. Remote Sens. Lett. 2016;7:701–710. doi: 10.1080/2150704X.2016.1182659. [DOI] [Google Scholar]
  • 26.Ghorbanian A, et al. Improved land cover map of Iran using Sentinel imagery within Google Earth Engine and a novel automatic workflow for land cover classification using migrated training samples. ISPRS J. Photogramm. Remote Sens. 2020;167:276–288. doi: 10.1016/j.isprsjprs.2020.07.013. [DOI] [Google Scholar]
  • 27.Huang H, et al. The migration of training samples towards dynamic global land cover mapping. ISPRS J. Photogramm. Remote Sens. 2020;161:27–36. doi: 10.1016/j.isprsjprs.2020.01.010. [DOI] [Google Scholar]
  • 28.Bagan, H. & Yamagata, Y. Land-cover change analysis in 50 global cities by using a combination of Landsat data and analysis of grid cells. Environ. Res. Lett.9, 064015 (2014).
  • 29.Radoux J, et al. Automated training sample extraction for global land cover mapping. Remote Sens. 2014;6:3965–3987. doi: 10.3390/rs6053965. [DOI] [Google Scholar]
  • 30.Tran H, Tran T, Kervyn M. Dynamics of land cover/land use changes in the Mekong Delta, 1973–2011: A Remote sensing analysis of the Tran Van Thoi District, Ca Mau Province, Vietnam. Remote Sens. 2015;7:2899–2925. doi: 10.3390/rs70302899. [DOI] [Google Scholar]
  • 31.Chi VK, et al. Land transitions in Northwest Vietnam: An integrated analysis of biophysical and socio-cultural factors. Hum. Ecol. 2013;41:37–50. doi: 10.1007/s10745-013-9569-9. [DOI] [Google Scholar]
  • 32.Villamor GB, Catacutan DC, Truong VAT, Thi LD. Tree-cover transition in Northern Vietnam from a gender-specific land-use preferences perspective. Land Use Policy. 2017;61:53–62. doi: 10.1016/j.landusepol.2016.11.002. [DOI] [Google Scholar]
  • 33.Truong VT, et al. JAXA annual forest cover maps for Vietnam during 2015–2018 Using ALOS-2/PALSAR-2 and auxiliary data. Remote Sens. 2019;11:2412. doi: 10.3390/rs11202412. [DOI] [Google Scholar]
  • 34.And, R. D. of A. Vietnam’S Modified Submission on Refreence Levels for Redd+ Results Based Payments Under Unfccc. https://redd.unfccc.int/files/vietnam_frl_modified__submission_final_for_posting.pdf (2016).
  • 35.Xu X, Jain AK, Calvin KV. Quantifying the biophysical and socioeconomic drivers of changes in forest and agricultural land in South and Southeast Asia. Glob. Chang. Biol. 2019;25:2137–2151. doi: 10.1111/gcb.14611. [DOI] [PubMed] [Google Scholar]
  • 36.Hansen MC, et al. High-resolution global maps of 21st-century forest cover change. Science (80- ) 2013;342:850–853. doi: 10.1126/science.1244693. [DOI] [PubMed] [Google Scholar]
  • 37.Di Gregorio, A., and Jansen, L. J. M. Land Cover Classification System (LCCS): Classification Concepts and User Manual. Faohttp://www.fao.org/3/x0596e/x0596e00.htm (2000).
  • 38.Van Thinh T, Duong PC, Nasahara KN, Tadono T. How does land use/land cover map’s accuracy depend on number of classification classes? Sci. Online Lett. Atmos. 2019;15:28–31. [Google Scholar]
  • 39.Talukdar S, et al. Land-use land-cover classification by machine learning classifiers for satellite observations—a review. Remote Sens. 2020;12:1135. doi: 10.3390/rs12071135. [DOI] [Google Scholar]
  • 40.Song S, Gong W, Zhu B, Huang X. Wavelength selection and spectral discrimination for paddy rice, with laboratory measurements of hyperspectral leaf reflectance. ISPRS J. Photogramm. Remote Sens. 2011;66:672–682. doi: 10.1016/j.isprsjprs.2011.05.002. [DOI] [Google Scholar]
  • 41.Szantoi Z, Smith SE, Strona G, Koh LP, Wich SA. Mapping orangutan habitat and agricultural areas using Landsat OLI imagery augmented with unmanned aircraft system aerial photography. Int. J. Remote Sens. 2017;38:2231–2245. doi: 10.1080/01431161.2017.1280638. [DOI] [Google Scholar]
  • 42.Hill RA, Thomson AG. Mapping woodland species composition and structure using airborne spectral and LiDAR data. Int. J. Remote Sens. 2005;26:3763–3779. doi: 10.1080/01431160500114706. [DOI] [Google Scholar]
  • 43.Kontgis C, Schneider A, Ozdogan M. Mapping rice paddy extent and intensification in the Vietnamese Mekong River Delta with dense time stacks of Landsat data. Remote Sens. Environ. 2015;169:255–269. doi: 10.1016/j.rse.2015.08.004. [DOI] [Google Scholar]
  • 44.Kontgis C, et al. Monitoring peri-urbanization in the greater Ho Chi Minh City metropolitan area. Appl. Geogr. 2014;53:377–388. doi: 10.1016/j.apgeog.2014.06.029. [DOI] [Google Scholar]
  • 45.D’Amour CB, et al. Future urban land expansion and implications for global croplands. Proc. Natl. Acad. Sci. USA. 2017;114:8939–8944. doi: 10.1073/pnas.1606036114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Ha TV, Tuohy M, Irwin M, Tuan PV. Monitoring and mapping rural urbanization and land use changes using Landsat data in the northeast subtropical region of Vietnam. Egypt. J. Remote Sens. Sp. Sci. 2020;23:11–19. [Google Scholar]
  • 47.Nhan TQ, Van Ly LT, Tan LV. How much do rice farmers earn from their crops? Evidence from a rice-exporting country. J. Agric. Stud. 2020;8:302. [Google Scholar]
  • 48.Truong TD, Do LH. Mangrove forests and aquaculture in the Mekong river delta. Land Use Policy. 2018;73:20–28. doi: 10.1016/j.landusepol.2018.01.029. [DOI] [Google Scholar]
  • 49.Lam-Dao N, Pham-Bach V, Nguyen-Thanh M, Pham-Thi M-T, Hoang-Phi P. Change detection of land use and riverbank in Mekong Delta, Vietnam using time series remotely sensed data. J. Resour. Ecol. 2011;2:370–374. [Google Scholar]
  • 50.Ha TTP, van Dijk H, Visser L. Impacts of changes in mangrove forest management practices on forest accessibility and livelihood: A case study in mangrove-shrimp farming system in Ca Mau Province, Mekong Delta, Vietnam. Land Use Policy. 2014;36:89–101. doi: 10.1016/j.landusepol.2013.07.002. [DOI] [Google Scholar]
  • 51.Le TN, Bregt AK, van Halsema GE, Hellegers PJGJ, Nguyen LD. Interplay between land-use dynamics and changes in hydrological regime in the Vietnamese Mekong Delta. Land Use Policy. 2018;73:269–280. doi: 10.1016/j.landusepol.2018.01.030. [DOI] [Google Scholar]
  • 52.Khuc QV, Tran BQ, Meyfroidt P, Paschke MW. Drivers of deforestation and forest degradation in Vietnam: An exploratory analysis at the national level. For. Policy Econ. 2018;90:128–141. doi: 10.1016/j.forpol.2018.02.004. [DOI] [Google Scholar]
  • 53.Meyfroidt P, Vu TP, Hoang VA. Trajectories of deforestation, coffee expansion and displacement of shifting cultivation in the Central Highlands of Vietnam. Glob. Environ. Chang. 2013;23:1187–1198. doi: 10.1016/j.gloenvcha.2013.04.005. [DOI] [Google Scholar]
  • 54.Ngo-Duc T, Kieu C, Thatcher M, Nguyen-Le D, Phan-Van T. Climate projections for Vietnam based on regional climate models. Clim. Res. 2014;60:199–213. doi: 10.3354/cr01234. [DOI] [Google Scholar]
  • 55.Lindesay JA, et al. International geosphere-biosphere programme/international global atmospheric chemistry SAFARI-92 field experiment: Background and overview. J. Geophys. Res. Atmos. 1996;101:23521–23530. doi: 10.1029/96JD01512. [DOI] [Google Scholar]
  • 56.Klemas VV, Dobson JE, Ferguson RL, Haddad KD. A coastal land cover classification system for the NOAA Coastwatch Change Analysis Project. J. Coast. Res. 1993;9:862–872. [Google Scholar]
  • 57.Saah D, et al. Primitives as building blocks for constructing land cover maps. Int. J. Appl. Earth Obs. Geoinf. 2020;85:101979. doi: 10.1016/j.jag.2019.101979. [DOI] [Google Scholar]
  • 58.Keys RG. Cubic convolution interpolation for digital image processing. IEEE Trans. Acoust. 1981;29:1153–1160. doi: 10.1109/TASSP.1981.1163711. [DOI] [Google Scholar]
  • 59.Farr, T. G. et al. The shuttle radar topography mission. Rev. Geophys.45, RG2004 (2007).
  • 60.Filipponi F. Sentinel-1 GRD preprocessing workflow. Proceedings. 2019;18:11. doi: 10.3390/ECRS-3-06201. [DOI] [Google Scholar]
  • 61.Soenen SA, Peddle DR, Coburn CA. SCS+C: A modified sun-canopy-sensor topographic correction in forested terrain. IEEE Trans. Geosci. Remote Sens. 2005;43:2148–2159. doi: 10.1109/TGRS.2005.852480. [DOI] [Google Scholar]
  • 62.Saleous, N. & Kutler, J. LEDAPS Calibration, Reflectance, Atmospheric Correction Preprocessing Code, Version 2 Summary: Data Citation: Model Product Description: Oak Ridge National Laboratory Distributed Active Archive Center http://daac.ornl.gov/cgi-bin/dsviewer.pl?ds_id=1146 (2012) 10.3334/ORNLDAAC/1146.
  • 63.Vermote E, Justice C, Claverie M, Franch B. Preliminary analysis of the performance of the Landsat 8/OLI land surface reflectance product. Remote Sens. Environ. 2016;185:46–56. doi: 10.1016/j.rse.2016.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Foga S, et al. Cloud detection algorithm comparison and validation for operational Landsat data products. Remote Sens. Environ. 2017;194:379–390. doi: 10.1016/j.rse.2017.03.026. [DOI] [Google Scholar]
  • 65.Louis, J. et al. Sentinel-2 Global Surface Reflectance Level-2a Product Generated with Sen2Cor. In International Geoscience and Remote Sensing Symposium (IGARSS) 8522–8525 (2019). 10.1109/IGARSS.2019.8898540.
  • 66.Roy DP, et al. A general method to normalize Landsat reflectance data to nadir BRDF adjusted reflectance. Remote Sens. Environ. 2016;176:255–271. doi: 10.1016/j.rse.2016.01.023. [DOI] [Google Scholar]
  • 67.Roy DP, et al. Examination of Sentinel-2A multi-spectral instrument (MSI) reflectance anisotropy and the suitability of a general method to normalize MSI reflectance to nadir BRDF adjusted reflectance. Remote Sens. Environ. 2017;199:25–38. doi: 10.1016/j.rse.2017.06.019. [DOI] [Google Scholar]
  • 68.Lee JS, Ainsworth TL, Wang Y, Chen KS. Polarimetric SAR speckle filtering and the extended sigma filter. IEEE Trans. Geosci. Remote Sens. 2015;53:1150–1160. doi: 10.1109/TGRS.2014.2335114. [DOI] [Google Scholar]
  • 69.Roy DP, et al. Characterization of Landsat-7 to Landsat-8 reflective wavelength and normalized difference vegetation index continuity. Remote Sens. Environ. 2016;185:57–70. doi: 10.1016/j.rse.2015.12.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Claverie M, et al. The Harmonized Landsat and Sentinel-2 surface reflectance data set. Remote Sens. Environ. 2018;219:145–161. doi: 10.1016/j.rse.2018.09.002. [DOI] [Google Scholar]
  • 71.Flood N. Seasonal composite landsat TM/ETM+ Images using the medoid (a multi-dimensional median) Remote Sens. 2013;5:6481–6500. doi: 10.3390/rs5126481. [DOI] [Google Scholar]
  • 72.Kaufman YJ, Tanré D. Atmospherically Resistant Vegetation Index (ARVI) for EOS-MODIS. IEEE Trans. Geosci. Remote Sens. 1992;30:261–270. doi: 10.1109/36.134076. [DOI] [Google Scholar]
  • 73.Saah D, et al. Land cover mapping in data scarce environments: Challenges and opportunities. Front. Environ. Sci. 2019;7:150. doi: 10.3389/fenvs.2019.00150. [DOI] [Google Scholar]
  • 74.Li C, Wang J, Wang L, Hu L, Gong P. Comparison of classification algorithms and training sample sizes in urban land classification with landsat thematic mapper imagery. Remote Sens. 2014;6:964–983. doi: 10.3390/rs6020964. [DOI] [Google Scholar]
  • 75.Richards, J. A. Remote sensing digital image analysis: An introduction. Remote Sensing Digital Image Analysis: An Introduction vol. 9783642300 (2013).
  • 76.Kruse FA, et al. The spectral image processing system (SIPS)-interactive visualization and analysis of imaging spectrometer data. Remote Sens. Environ. 1993;44:145–163. doi: 10.1016/0034-4257(93)90013-N. [DOI] [Google Scholar]
  • 77.Breiman L. Random forests. Mach. Learn. 2001;45:5–32. doi: 10.1023/A:1010933404324. [DOI] [Google Scholar]
  • 78.He Y, Lee E, Warner TA. A time series of annual land use and land cover maps of China from 1982 to 2013 generated using AVHRR GIMMS NDVI3g data. Remote Sens. Environ. 2017;199:201–217. doi: 10.1016/j.rse.2017.07.010. [DOI] [Google Scholar]
  • 79.Zhao, H. & Chen, X. Use of normalized difference bareness index in quickly mapping bare areas from TM/ETM+. In International Geoscience and Remote Sensing Symposium (IGARSS) vol. 3 1666–1668 (IEEE, 2005).
  • 80.Baloloy AB, Blanco AC, Raymund Rhommel RRC, Nadaoka K. Development and application of a new mangrove vegetation index (MVI) for rapid and accurate mangrove mapping. ISPRS J. Photogramm. Remote Sens. 2020;166:95–117. doi: 10.1016/j.isprsjprs.2020.06.001. [DOI] [Google Scholar]
  • 81.García MJL, Caselles V. Mapping burns and natural reforestation using thematic mapper data. Geocarto Int. 1991;6:31–37. doi: 10.1080/10106049109354290. [DOI] [Google Scholar]
  • 82.Wright C, Gallant A. Improved wetland remote sensing in Yellowstone National Park using classification trees to combine TM imagery and ancillary environmental data. Remote Sens. Environ. 2007;107:582–605. doi: 10.1016/j.rse.2006.10.019. [DOI] [Google Scholar]
  • 83.Hutchinson CF. Techniques for combining Landsat and ancillary data for digital classification improvement. Photogramm. Eng. Remote Sens. 1982;48:123–130. [Google Scholar]
  • 84.Tadono T, et al. Precise global DEM generation by ALOS PRISM. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2014;II–4:71–76. doi: 10.5194/isprsannals-II-4-71-2014. [DOI] [Google Scholar]
  • 85.Jokar Arsanjani, J., Zipf, A., Mooney, P. & Helbich, M. OpenStreetMap in GIScience. OpenStreetMap in GIScience: Experiences, Research, Applications (2015). 10.1007/978-3-319-14280-7.
  • 86.Open Development Mekong. OpenDevelopmentMekong. East-West Management Institutehttps://data.opendevelopmentmekong.net/organization/6f37a27d-2790-4b9a-8570-a36cb1d8108f?res_format=KML (2015).
  • 87.Truong, V. T. et al. JAXA annual forest cover maps for Vietnam during 2015–2018 Using ALOS-2/PALSAR-2 and auxiliary data. Remote Sens.11 (2019).
  • 88.Olofsson P, et al. Good practices for estimating area and assessing accuracy of land change. Remote Sens. Environ. 2014;148:42–57. doi: 10.1016/j.rse.2014.02.015. [DOI] [Google Scholar]
  • 89.Congalton RG. A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens. Environ. 1991;37:35–46. doi: 10.1016/0034-4257(91)90048-B. [DOI] [Google Scholar]
  • 90.Schmidt M. The Sankey diagram in energy and material flow management—part II: Methodology and current applications. J. Ind. Ecol. 2008;12:173–185. doi: 10.1111/j.1530-9290.2008.00015.x. [DOI] [Google Scholar]
  • 91.Zha Y, Gao J, Ni S. Use of normalized difference built-up index in automatically mapping urban areas from TM imagery. Int. J. Remote Sens. 2003;24:583–594. doi: 10.1080/01431160304987. [DOI] [Google Scholar]
  • 92.Lacaux JP, Tourre YM, Vignolles C, Ndione JA, Lafaye M. Classification of ponds from high-spatial resolution remote sensing: Application to Rift Valley Fever epidemics in Senegal. Remote Sens. Environ. 2007;106:66–74. doi: 10.1016/j.rse.2006.07.012. [DOI] [Google Scholar]
  • 93.Tucker CJ. Red and photographic infrared linear combinations for monitoring vegetation. Remote Sens. Environ. 1979;8:127–150. doi: 10.1016/0034-4257(79)90013-0. [DOI] [Google Scholar]
  • 94.McFeeters SK. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. Int. J. Remote Sens. 1996;17:1425–1432. doi: 10.1080/01431169608948714. [DOI] [Google Scholar]
  • 95.Huete AR. A soil-adjusted vegetation index (SAVI) Remote Sens. Environ. 1988;25:295–309. doi: 10.1016/0034-4257(88)90106-X. [DOI] [Google Scholar]
  • 96.Penuelas J, Baret F, Filella I. Semi-empirical indices to assess carotenoids/chlorophyll a ratio from leaf spectral reflectance. Photosynthetica. 1995;31:221–230. [Google Scholar]
  • 97.Jayamanna S, Kawamura M, Tsujiko Y. Relation between social and environmental conditions in colombo, sri lanka and the urban index estimated by satellite remote sensing data. Int. Arch. Photogram. Remote Sens. 1996;31:321–326. [Google Scholar]
  • 98.Shen, L. & Li, C. Water body extraction from Landsat ETM+ imagery using adaboost algorithm. In 2010 18th International Conference on Geoinformatics, Geoinformatics 2010 (2010). 10.1109/GEOINFORMATICS.2010.5567762.
  • 99.Perry CR, Lautenschlager LF. Functional equivalence of spectral vegetation indices. Remote Sens. Environ. 1984;14:169–182. doi: 10.1016/0034-4257(84)90013-0. [DOI] [Google Scholar]
  • 100.As-syakur AR, Adnyana IWS, Arthana IW, Nuarsa IW. Enhanced built-UP and bareness index (EBBI) for mapping built-UP and bare land in an urban area. Remote Sens. 2012;4:2957–2970. doi: 10.3390/rs4102957. [DOI] [Google Scholar]
  • 101.Liu HQ, Huete A. Feedback based modification of the NDVI to minimize canopy background and atmospheric noise. IEEE Trans. Geosci. Remote Sens. 1995;33:457–465. doi: 10.1109/TGRS.1995.8746027. [DOI] [Google Scholar]
  • 102.Gitelson AA, Viña A, Ciganda V, Rundquist DC, Arkebauer TJ. Remote estimation of canopy chlorophyll content in crops. Geophys. Res. Lett. 2005;32:1–4. doi: 10.1029/2005GL022688. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES