Skip to main content
PLOS One logoLink to PLOS One
. 2020 Feb 28;15(2):e0229448. doi: 10.1371/journal.pone.0229448

Mapping Atlantic rainforest degradation and regeneration history with indicator species using convolutional network

Fabien H Wagner 1,2,*, Alber Sanchez 1, Marcos P M Aidar 3, André L C Rochelle 4, Yuliya Tarabalka 5,6, Marisa G Fonseca 1, Oliver L Phillips 7, Emanuel Gloor 7, Luiz E O C Aragão 1,8
Editor: Judi Hewitt9
PMCID: PMC7048271  PMID: 32109946

Abstract

The Atlantic rainforest of Brazil is one of the global terrestrial hotspots of biodiversity. Despite having undergone large scale deforestation, forest cover has shown signs of increases in the last decades. Here, to understand the degradation and regeneration history of Atlantic rainforest remnants near São Paulo, we combine a unique dataset of very high resolution images from Worldview-2 and Worldview-3 (0.5 and 0.3m spatial resolution, respectively), georeferenced aerial photographs from 1962 and use a deep learning method called U-net to map (i) the forest cover and changes and (ii) two pioneer tree species, Cecropia hololeuca and Tibouchina pulchra. For Tibouchina pulchra, all the individuals were mapped in February, when the trees undergo mass-flowering with purple and pink blossoms. Additionally, elevation data at 30m spatial resolution from NASA Shuttle Radar Topography Mission (SRTM) and annual mean climate variables (Terraclimate datasets at ∼ 4km of spatial resolution) were used to analyse the forest and species distributions. We found that natural forests are currently more frequently found on south-facing slopes, likely because of geomorphology and past land use, and that Tibouchina is restricted to the wetter part of the region (southern part), which annually receives at least 1600 mm of precipitation. Tibouchina pulchra was found to clearly indicate forest regeneration as almost all individuals were found within or adjacent to forests regrown after 1962. By contrast, Cecropia hololeuca was found to indicate older disturbed forests, with all individuals almost exclusively found in forest fragments already present in 1962. At the regional scale, using the dominance maps of both species, we show that at least 4.3% of the current region’s natural forests have regrown after 1962 (Tibouchina dominated, ∼ 4757 ha) and that ∼ 9% of the old natural forests have experienced significant disturbance (Cecropia dominated).

Introduction

Brazil holds 20% of Earth’s biodiversity, and its third largest biome, the Atlantic rainforest, is a biodiversity hotspot and a global priority for conservation [14]. Over the two last centuries, this biome has been drastically reduced to less than 15% of its original area [5, 6]. Currently, the Atlantic forest is extremely fragmented with over 80% of the fragments < 50 ha and is poorly protected as reserves protect only 9% of the remaining forest, which represents ∼ 1–2% of the original area covered [7]. On the other hand, large areas of this biome are currently recovering from deforestation, as seen by an increase in tree cover since the year 2000 [8]. This increase in tree cover is driven mainly by eucalyptus plantations and natural regeneration [9], and also by forest restoration initiatives [10]. The natural regeneration, which occurs mainly in abandoned pasture lands, increases the provision of ecosystem services and habitat availability [11]. However, regional scale indicators to assess recovery stage, diversity or disturbance levels of the current natural forests are still underdeveloped, thereby adding uncertainty to the estimation of the ecosystem services and their value for conservation [1214]. In this context, remote sensing is a key tool to monitor biodiversity, resources, and ecosystem services, as well as the human impact on natural ecosystems at a regional/biome scale [1518].

Recently, a deep learning method called U-net was used with very high resolution images (WorldView-3, 30 cm spatial resolution, Digital Globe) to map natural and planted forests and to produce the first regional map of all individual adults of a natural tree species, Cecropia hololeuca, in a region of 1600 km2 of the Atlantic forest in the state of São Paulo [14]. The classification accuracies above 95% for vegetation type and over 97% for C. hololeuca show that U-net outperforms other image segmentation methods and could support species mapping at regional scale [14]. Since C. hololeuca is a pioneer species, the spatial distribution of the individuals was then used to estimate anthropogenic disturbances inside the natural forest fragments, which are not accessible by others means. Over this same region, we have now gathered several other available high or very-high resolution remote sensed datasets for use in association with deep learning results or models to support conservation efforts of the Atlantic forest, to evaluate humans impacts and to understand forest landscape history.

In 1962, ten years before the launch of the first Landsat satellite, panchromatic aerial photographs were taken over São Paulo State by the company Aerofoto Natividade Ltda, including the region studied by [14], for the Brazilian Institute of Agronomy (Instituto Agronômico, IAC) at a spatial scale of approximately 1/25000. This benchmark of the forest cover will be used in this study to estimate the change in natural forest cover between 1962 and today (obtained from U-net), to assess the relative age of the forest (before or after 1962), and also to understand the currently growing planted forests cover, as the plantation of eucalyptus at large scale in this region only began after 1966, when the Brazilian federal law 5.106 came into force with the goal to encourage afforestation and reforestation in Brazil [19].

Additionally, very high resolution WorldView images will be used for the same region studied by [14], to map another pioneer tree, the species Tibouchina pulchra. This species is locally named Manacá da Serra and is mainly distributed in the Serra do Mar, a mountain system that follows the Atlantic coast where the largest remaining fragments of Atlantic forest are located. This tree species has ideal characteristics to support remote sensing methods of disturbance estimation in the Atlantic forest; it is found mainly in disturbed and secondary forests, sometimes being the dominant species [2023]. Its trees bloom synchronously in February, and these crowns covered with pink flowers can be visually detected in very high resolution satellite images. The combined map of Tibouchina pulchra and Cecropia hololeuca is useful to understand the forested landscape of this Atlantic forest region.

Furthermore, high resolution climate and elevation data are available for this region. These datasets, along with tree species maps could bring new insights to species distribution models based on remote sensing data [17]. For example, TerraClimate is a new dataset with a high spatial resolution (1/24°, ∼4km) of monthly climate and climatic water balance for global terrestrial surfaces from 1958-2015 [24]. Due to its high resolution, it reproduces in detail the strong climate gradient over the studied region. From the Atlantic coast following North through the Serra do Mar mountains and descending to the valley (“vale do Paraiba”), in less than 80 km, precipitation decreases from ∼ 2500–3000 mm.year−1 to less than 1400 mm.year−1 [24]. As Tibouchina pulchra is known to be sensitive to precipitation, its accurate location associated with climate data could help to assess its spatial distribution envelope. The region’s topography also varies strongly in elevation (from 0 to 1650 m), slope, orientation and relief. With the elevation data at 30 m spatial resolution from the Shuttle Radar Topography Mission [25], indices could be derived to help describe and interpret the distribution of Tibouchina pulchra and Cecropia hololeuca. As the U-net method maps individual trees at an unprecedented scale, combining this information with climatic and edaphic conditions should bring new information on the species’ preferences and distribution.

The following are the main objectives of this study. First, to assess the forest history in a sub-region of Atlantic forest (“Rio do Chapeu” region) based on the natural forest map established from a 1962 aerial image and the very high resolution natural/planted forest map of 2017 produced with the U-net [26]. Second, to produce the map of Tibouchina pulchra and to combine this map with the previously produced map of Cecropia hololeuca to analyse the relations of these two pioneer species with the history of the forest. Then, we use the obtained results to assess the anthropogenic disturbance history of the entire region. Finally, we analyse the association of Tibouchina pulchra and Cecropia hololeuca spatial distributions with climatic and edaphic conditions.

A more general objective is to show that with the new information on tree species locations brought by deep learning, new knowledge can be gained on the history of the forests and tree species distribution, even for the most visible and common tree species.

Materials

Study site

This study was undertaken in a region of the Atlantic forest biome located in the São Paulo State, Brazil, and centered at 23°11’43’S and 45°21’50W, Fig 1. The area was chosen because it contains several remnants of the Atlantic Forest biome as well as secondary forests at different stages of regeneration and planted eucalyptus forests, Fig 1b. Most of the forest plots from the BIOTA project [2], established to study the effect of fragmentation, are also located inside this region.

Fig 1.

Fig 1

Geographical locations of the Atlantic Forest biome in green and of our region of interest in red (a); region of interest with 2017 land use/cover classes from the MapBiomas project, the WorldView-2 and WorldView-3 tiles used in this study, and the area covered by the aerial photography of 1962 called further “Rio do chapeu” region (b).

WorldView-2 images and pre-processing

The two WorldView-2 images (DigitalGlobe, Inc., USA) were acquired over the region on February 17, 2017, at an average off nadir view angle of 18 and 20.7°, respectively. DigitalGlobe catalog IDs of the images were A01001032124B900 and A01001032124BC00. These two images were distributed in tiles of 16384 × 16384 pixels which represents 30 tiles for each image. Only 36 tiles covering land were kept in the analysis covering a region of ∼ 2410 km2, Fig 1. The spatial resolution was 0.5 m for the panchromatic band (464–801 nm) and 2 m for the selected multispectral bands: Red (629–689 nm), Green (511–581 nm) and Blue (447–508 nm). The three WorldView-3 images (DigitalGlobe, Inc., USA) over the region were acquired on August 13, 2017, at an average off nadir view angle of 20.1°, further details are given in [14].

All bands were scaled from raw image digital numbers (11 bits) to 0–255 (8 bits). The Red-Green-Blue (RGB) bands were pan-sharpened with the panchromatic band using the method Simple RCS of the Orfeo toolbox addon otbcli_BundleToPerfectSensor [27] to create a single high-resolution RGB image at 0.5 m spatial resolution. No atmospheric correction was performed.

Forest cover in 1962

To assess the recent history of the current forest stands and to test how the pioneer tree species Tibouchina pulchra and Cecropia hololeuca were associated with the regeneration of abandoned pasturelands or disturbed forest, we used an independent dataset of aerial photographs taken in 1962 over the São Paulo State region. The aerial photographs were realised in 1962 by the company Aerofoto Natividade Ltda for the Brazilian Institue of Agronomy (Instituto Agronômico, IAC) at a scale of approximatelly 1/25000 and made available by the Brazilian Institute of Geography and Cartography (Instituto Geográfico e Cartográfico do Estado de São Paulo, IGC). The IGC data are not freely available but low resolution samples can be browsed at http://datageo.ambiente.sp.gov.br/app/# in the directory /Base Imagem/Imagens IGC/IGC—Fotoíndice Vôo 1962. The sample contained 40 aerial photographs covering the region of the “Rio do Chapeu” near São Luiz do Paraitinga (Fig 1b), with an overlap of approximately 40% to 50% between each adjacent photography. The images were mounted in two panoramas using the software AutoStitch© [28, 29]. Then, these two panoramas were registered using the 2017 WorldView-2 images in ArcGis 10.4 [30] with 290 and 343 control points, respectively, using a spline function for interpolation.

Climate data

To test if the T. pulchra and C. hololeuca spatial distributions were related to local climate, we computed the mean annual values of 12 variables from the high-resolution global dataset Terraclimate at ∼ 4 km spatial resolution (2.5 arc-minute, 1/24th degree) [24]. The climate variables used were actual evapotranspiration (mm), climatic water deficit (mm), Palmer drought severity index, reference evapotranspiration (mm), precipitation (mm), soil moisture (mm), downward shortwave radiation (Wm2), maximum temperature (°C), minimum temperature (°C), vapor pressure (KPa), vapor pressure deficit (kPa) and windspeed (m.s−1) [24].

Elevation data

To test if the T. pulchra and C. hololeuca spatial distributions were related to elevation (or elevation-related variables), elevation data from the shuttle radar topography mission at 30 m spatial resolution were used [25]. From this dataset, we used the variable ‘elevation’ and computed five additional elevation related indices that consider the eight neighbor pixels: slope (°); aspect, which indicates the compass direction that the slope faces (in °, where 0 is the North); topographic position index, which compares the elevation of the pixel to the mean elevation of the neighboring pixels (positive value indicates ridge, negative value indicates valley and near zero value indicates flat areas or areas of constant slope); terrain ruggedness index, an index of topographic heterogeneity computed as the sum of the change in elevation between the pixels and its eight neighbors cells (zero indicates no ruggedness); and, finally, roughness, computed as the difference between the maximum and the minimum value of a cell and its eight surrounding cells [31, 32].

Natural and planted forests mask

Natural and planted forests are identifiable in the images by their color and structure, see Fig 2. All the forests were mapped in two tiles of the Worldview-2 image using the U-net algorithm and the weights previously trained to segment forests from Worldview-3 images [14]. After all forests were segmented in both tiles, planted forests were then manually delineated resulting in 2154 polygons (716 + 1438). Cropping the mask of the two tiles in 256 × 256 pixels images resulted in a sample of 7780 images to train the model of natural and planted forests detection and segmentation. Additionally, outside the extent of the Worldview-2 image, we used previous natural and planted forest maps produced with Worldview-3 images and the same U-net methodology used here [14]. For consistency with the map produced with the WorldView-2 image, these maps were resampled from 0.3 to 0.5 m spatial resolution with the function average and coded as 1 if the pixel value was above 0 (natural or planted forest in the 0.5 m pixel) or 0 otherwise.

Fig 2. Example of images (256 × 256 pixels) with natural (a-b) forest and planted eucalyptus forest (c-d).

Fig 2

Satellite image courtesy of the DigitalGlobe Foundation.

Tibouchina pulchra mask

All individuals of T. pulchra bloom synchronously in February, rendering the species identifiable in the images due to its unique rose/purple color, see Fig 3. All occurrences of Tibouchina pulchra trees were manually delineated in one tile of the Worldview-2 images. The delineated sample was composed of 10813 polygons where each polygon can represent more than one individual. With the delineated polygons, a raster mask coded in RGB was produced with these values: Background [0, 0, 0] and Tibouchina pulchra [127, 127, 127]. Cropping the mask in 128 × 128 pixel images resulted in 1002 images containing at least one T. pulchra tree for the training sample. Additionally, 1046 randomly sampled images of the same tile containing only background were added to the training set to provide a wider range of images to the network with features such as constructions or pastureland.

Fig 3. Example of images (256 × 256 pixels) with the species of interest, Tibouchina pulchra (a-d).

Fig 3

Satellite image courtesy of the DigitalGlobe Foundation.

Cecropia map

For the Cecropia trees in the region, we used a map previously produced in [14] with Worldview-3 images and the same U-net methodology used here for Tibouchina pulchra. For consistency with the map produced with the WorldView-2 image, the C. hololeuca map was resampled from 0.3 to 0.5 m spatial resolution with the function average and coded as 1 if the pixel value was above 0 (presence of Cecropia in the 0.5 m pixel) or 0 otherwise.

Methods

U-net model

Architecture

In this study, we used a convolutional network for multi-class image segmentation known as U-net [14, 26]. This network performs a per-pixel classification, predicting the probability of each pixel to belong to a particular class. This U-net model has recently proven to become a new standard in image dense labeling [33]. We adapted the U-net architecture [26] with twice less filters, since our training set is limited and a smaller number of filters helps in preventing overfitting. Furthermore, we used a three-band RGB image as the input and have adapted the network architecture accordingly, see [14] and Fig 4. Sigmoid activation functions were used to ensure that output pixel values range between 0 and 1. For the training, we used an input size of 256 × 256 pixels for forest type segmentation and 128 × 128 pixels for the Tibouchina pulchra trees segmentation.

Fig 4. U-net architecture for the forest types segmentation, adapted from [26] and reproduced from [14].

Fig 4

The number of channels is indicated above the cuboids and the vertical numbers indicate the row and column size in pixels.

Network training

The training samples comprised 2048 images of 128 × 128 pixels for T. pulchra trees and 7780 images of 256 × 256 pixels for forest types. The size of 128 × 128 pixels was selected because T. pulchra crowns are smaller than 128 pixels in diameter (128 pixels = 64m). An image size of 256 × 256 pixels was used for natural forest/plantations to include better contextual; plantation trees are generally planted in lines, and this information is more noticeable with this image size. The images were extracted from uniform grids of 128 × 128 pixels and 256 × 256 pixels, without any overlap between neighboring images. 80% of these images were used for training, and 20% used for validation. During network training, we used a standard stochastic gradient descent optimisation. The loss function was designed as a sum of two terms: mutual cross-entropy and Dice coefficient-related loss [3436]. We used the optimiser RMSprop (unpublished, adaptive learning rate method proposed by Geoff Hinton here http://www.cs.toronto.edu/~tijmen/csc321/slides/lecture_slides_lec6.pdf) with an initial learning rate of 1e-4. We trained our network for 100 epochs, where each epoch comprised 78 batches with 16 images per batch. The optimisation was stopped when the loss function improvement did not exceed 1e-4.

Data augmentation was applied randomly to the input images, including 0/90/180/270° rotations and changes in the brightness, saturation and hue by converting RGB to Brightness-Saturation-Hue space (BSH) and modulating the current values between 95%–110% for brightness, 95%–105% for saturation and 99%–101% for hue (as changes in the plant hues are not expected).

Segmentation accuracy assessment

Three performance metrics were computed. First, the overall accuracy was computed as the percentage of correctly classified pixels. Second, the intersection over union (IoU) of the object class, which is the number of pixels labelled as object in both the prediction and the reference, divided by the number of pixels labelled as object in the prediction and in the reference. Third, the F1 score was computed for each class i as the harmonic average of the precision and recall, Eq 1, where precision was the ratio of the number of segments classified correctly as i and the number of all segments (true and false positive) and recall was the ratio of the number of segments classified correctly as i and the total number of segments belonging to class i (true positive and false negative). This score varies between 0 (lowest value) and 1 (best value).

F1i=2×precisioni×recalli(precisioni+recalli) (1)

Prediction

For prediction, each Worldview-2 tile of 16384 × 16384 pixels was cropped based a regular grid of 512 × 512 pixels, and 64 neighbour pixels were added on each side to create an overlap between the patches. If there was a remaining blank portion (for example, due to the tile border) it was filled by the symmetrical image of the non blank portion. The prediction of both models (forest types and T. pulchra tree segmentations) were made on these images of 640 × 640 pixels, and the resulting images were cropped to 512 × 512 pixels and merged to reconstitute the 16384 × 16384 pixels WV-2 tile. This overlapping method was used to avoid the artifact of prediction on the border, a known problem for the U-net algorithm [26]. To belong to a given class, the pixel prediction value must be greater than or equal to 0.5. From the raster image of the predictions, we generated the spatial polygons of each classes.

Algorithm

The model was coded in R language [37] with Rstudio interface to Keras [35, 36] and Tensorflow backend [38]. R code is available upon request. The training of the models took ∼ 2 to 20 hours using GPU on a Nvidia Quadro K6000 with a 12 GB dedicated memory. Prediction using GPU of a single tile of 16384 × 16384 pixels (∼ 67 km2) took approximately 35 minutes. The code of the U-net model was adapted from the original U-net code developed for Keras and Rstudio, and is available here http://doi.org/10.5281/zenodo.3601503.

Forest and species distribution analysis

Past history of the 2017 forest cover

First, the mask of natural/planted forest in 2017 was produced with the U-net at 0.5 m. Second, the past history of the 2017 forest cover, that is, if they were present or not in 1962, was mapped. To do this, only for the forests that were mapped in 2017, the mask of the forests in 1962 was produced with the registered aerial images of 1962. This map was produced in QGIS and resampled in a raster tile at 0.5 m spatial resolution [39]. The history of the 2017 forest cover was then described in the “Rio do Chapeu” region. With these two maps, we were able to verify if a forested pixel present in 2017 was already present in 1962 (old forest) or if it was a recent regrowth, posterior to 1962 (new forest). We also determined how much pasture and forest were converted to eucalyptus plantation. Finally, a raster mask at 0.5 m was created with three classes, eucalyptus plantation, old natural forest and new natural forest in 2017.

Association of pioneer species with forest history

The “Rio do Chapeu” region was covered by the 1962 images, and here we determined the association between the pioneer species and the age of the forest. We describe for each Tibouchina pulchra and Cecropia hololeuca pixels if they were inside an old or new natural forest (regrowth after 1962) and their minimum distance to new and old natural forest, within a maximum distance threshold of 25 m and of 50 m, respectively. To assess the dominance of both tree species in the natural forest, we computed the percent of pixel of the species in the forest in a circular area of 100 m radius around each pixel of the species; that is, the number of pixels of the species was divided by the number of pixels of the forest in the area. This index ranges from 0 to 100% and, for example, if it is equal to 50% this means that 50% of the canopy is occupied by the species. These indices were computed for both pioneer species in new and old forests. This index was computed for the “Rio do Chapeu region”, then we used the mean of the index in new and old forest to determine, where both species were dominant, here defined as pixels with a higher compaction that the mean of the compaction of the species in the new or in old forest.

Distribution of Tibouchina pulchra and Cecropia hololeuca in relation to elevation and local climate

Both species’ distribution associations with local climate (see section for more details) and with altitude and related topographic indices (see section) were described over the full extent of the Worldview-3 image for C. hololeuca and over the full extent of the WV2 image for T. pulchra, Fig 1. First, for a given species, the centroid coordinates of the delineated polygon were computed. Then, the same amount of points was sampled randomly over the map. The values for climate, elevation and related topographic indices were extracted for the coordinates corresponding to the centroids (real species distribution) and to the random points (random distribution). Then the distributions of the variable at random coordinates and at the real species coordinates were compared using histograms. In the text, the values extracted from the random coordinates are referred to as mean landscape values.

Results

Detection accuracy

The time for convergence was ∼ 20hrs for the forest-types model and < 3hrs for the T. pulchra trees model. The best models were obtained after 28 epochs with 8 images per batch for the forest-type model and 44 epochs with 16 images per batch for the T. pulchra trees model, Table 1. For the forest type segmentation, the overall accuracies and IoU coefficient were 95.71% and 0.95, respectively. The overall accuracy of the T. pulchra trees segmentation was 98.92% and the IoU coefficient was 0.87. Accuracies of the C. hololeuca tree segmentation previously presented in [14] are similar to the accuracies of T. pulchra segmentation with an overall accuracy of 97.09% and an IoU coefficient of 0.861.

Table 1. Numerical evaluation of the models, training and validation sample size and convergence details.

model epoch batch training sample validation sample IoU Overall accuracy
Forest types 28 8 6224 1556 0.950 95.71
Tibouchina trees 44 16 1638 410 0.867 98.92
Cecropia treesa 28 16 1946 486 0.861 97.09

a from [14]

The natural forest class showed the best F1-score, followed by the eucalyptus plantation class, Table 2. Recall was higher in natural forests than in plantations, thereby indicating a lower rate of false negatives. Precision was higher in natural forests than in plantations, thereby indicating a lower rate of false positives. The F1-score of the Tibouchina trees segmentation was lower than forest-types F1-score, with a value of 0.87. For the Tibouchina class, precision was slightly higher than recall, with values of 0.88 and 0.85 respectively, Table 2. The F1-score of the Cecropia trees segmentation obtained previously was lower than forest-types F1-score, with a value of 0.80 [14]. For the Cecropia class, F1-score, precision and recall values were similar, Table 2.

Table 2. F1-scores of the segmentation for natural and eucalyptus forests as well as for T. pulchra and C. hololeuca trees.

model classes precision recall F1-score
Forest types Natural forest 0.956 0.956 0.956
Eucalyptus plantation 0.947 0.939 0.943
background 0.970 0.965 0.968
Tibouchina trees Tibouchina trees 0.884 0.851 0.868
background 0.993 0.995 0.994
Cecropia treesa Cecropia trees 0.808 0.801 0.804
background 0.985 0.983 0.984

a from [14]

Examples of Tibouchina segmentation result with F1-score and manual segmentation are presented in Fig 5. The crowns of T. pulchra trees are small, mainly < 10 m of diameter. The border of the T. pulchra crowns is not sharp in the image, and there is often a small variation between the manual and automatic segmentation, Fig 5. Some small T. pulchra trees were missed by the algorithm, Fig 5b, or missed by the producer, Fig 5c. When T. pulchra trees present larger crowns and/or concentrated patches, there is a very good agreement between manual and U-net segmentation Fig 5d.

Fig 5. Example of validation images (128 × 128 pixels) for T. pulchra trees with manual delineation in orange and the U-net delineation in green (a-d).

Fig 5

Satellite image courtesy of the DigitalGlobe Foundation.

Besides eucalyptus unnoticeable plantation structure and natural forests’ homogeneous canopy [14], the most common source of error here was the detection of old planted forests. Old plantations that were well delineated with WV-3 data (0.3 m spatial resolution) presented more error and were more difficult to detect by the algorithm in WV-2 images (0.5 m spatial resolution). In these plantations, the plantation structure (regular lines of planted trees) has disappeared; the crowns were large and did not show regular size and shape.

Forest cover and studied species distribution in the “Rio do Chapeu” region

Forest cover in the “Rio do Chapeu” region

In 2017, the forests covered 55.3% (142 km2) of the “Rio do Chapeu” region, Fig 6. 43% of these forests were already present in 1962 (hereafter referred as old forests) while 57% of forests were pasture (hereafter referred as new forests). Among the new forests, 76.9% were natural regeneration while 23.1% were eucalyptus plantation. The eucalyptus were mainly planted where there was no forest in 1962; only 21.1% of the plantation were natural forests in 1962. As only two time periods have been assessed, 1962 and 2017, intermediate land uses that could have existed between transition from natural forests and to planted forests are unknown.

Fig 6. Distribution and relative age of the “Rio do Chapeu” forests in 2017.

Fig 6

Two types of forests, planted or natural, were identified with the U-net method. And, among the natural forests, new forests indicate forests that were pasture in 1962 (from the registered 1962 aerial photos) while old forests indicate forests that were already there in 1962. Note that all eucalyptus forest have been planted after 1962. All the pixels which are not planted or natural forests are in white.

Cecropia hololeuca distribution pattern

Cecropia covered 0.25% of the current natural forest, Fig 7. An interesting pattern is observed in their distribution: 75% of these trees are located in forests that were already established in 1962, 86% inside or at a distance less than 25 m from an old forest and 91% inside or at a maximal distance of 50 m from an old forest. Considering only the old forest, in the neighboring area of 25 m around each C. hololeuca pixel, there is a mean of 2.84% of C. hololeuca pixels (55.8 m2); and 1.2% (94.25 m2) in a neighboring area of 50 m. These low percentages indicate that C. hololeuca trees did not form homogeneous forest cover but were mixed with other trees species. We further use this threshold of 2.84% of C. hololeuca pixels to define area where there is C. hololeuca dominance (pixel with more than 2.84% of neighbors C. hololeuca pixels within a radius of 25 m).

Fig 7. Distribution of Tibouchina pulchra (transparent magenta) and Cecropia hololeuca (transparent blue) dominances in the natural forests of the “Rio do Chapeu” region.

Fig 7

Dominance was defined as pixels with more than 2.84% of C. hololeuca or more than 17.5% of T. pulchra within a radius of 25 m, for C. hololeuca and T. pulchra, respectively. New forests indicate forests that were pasture in 1962 (from the registered 1962 aerial photos) while old forests indicate forests that were already there in 1962. All the pixels which are not natural forests are in white.

Tibouchina pulchra distribution pattern

Tibouchina pulchra with flowers covered 3.4% of the current forest, Fig 7. For this species, an opposite behaviour of the C. hololeuca is observed. 66.8% of these trees were located in new forests that were pasture in 1962, 81.4% inside or at a distance below 25 m of a new forest and 89.7% inside or at a distance below 50 m of a new forest. T. pulchra is also over-represented close to the “Serra do Mar”, which is located in the bottom of “Rio do Chapeu” region. Considering only the new forests which contain the majority of the Tibouchina trees, in the neighboring area of 25 m around each Tibouchina pixels, there is a mean of 17.5% of Tibouchina pixels (343.5 m2) and 12.9% (1011.3 m2) in a neighboring area of 50 m. Tibouchina, therefore, is distributed in more homogeneous and dominant stands than the C. hololeuca.

Species regional distibution

At the regional scale, forests with T. pulchra dominance are distributed almost exclusively in the South where the forest is fragmented and before the edge of the ‘Serra dor Mar’, Fig 8. Overall, forested pixels with T. pulchra dominance cover 4.28% of the natural forests in the WorldView-2 image (∼ 4757 ha), however it can reach 21.89% in some tiles of the image. This species forms patches with a mean area of 0.27 ha and the maximum observed size is of 72.2 ha. Few small patches can be observed in the largest fragment of natural forest in the South, which is the largest remaining fragment of Atlantic forest. C. hololeuca is distributed all over the region, Fig 8, and forested pixels with C. hololeuca dominance covered 8.8% of the natural forests in the WorldView-3 image. In the ‘Serra dor Mar’, the area dominated by C. hololeuca and T. pulchra appeared systematically separated in space, Fig 8.

Fig 8. Map of the natural forests in 2017 with additional information on Tibouchina pulchra and Cecropia hololeuca dominances.

Fig 8

Dominance was defined as pixels with more than 2.84% of C. hololeuca or more than 17.5% of T. pulchra within a radius of 25 m, for C. hololeuca and T. pulchra, respectively. Natural forest, T. pulchra and C. hololeuca were mapped on the extends cover by the Worldview-3 and -2 image.

Forest and species distribution associations with elevation and related variables

Natural forests are present at all elevations, while non-forested area are limited to the sea level or to altitude above 500 m, Fig 9a. Areas with altitudes between 50 and 500 m are only present in the steep, forested slopes of the Serra do Mar (the first uphill). Forests are found more often on steeper slopes than non-forest areas, Fig 9b. Furthermore, the distribution of forest aspect indicates that they occur predominantly on south-oriented slopes (south is 180°), Fig 9c. On the other hand, non-forested areas are more frequently oriented to the North (0°). There is no apparent difference of TPI between the two classes, Fig 9d. Finally, forests are distributed on slightly rougher terrains than the non-forested area, Fig 9e and 9f. All these characteristics of the forest distribution remain even after removing the largest remaining fragment at the south, which could have biased the results, S1 Fig.

Fig 9. Distribution of the natural forests and non forest (drawn from a random distribution) in relation to elevation and five elevation-related variables at the regional scale.

Fig 9

The five elevation-related indices consider the 8 neighbor pixels and are slope (°); aspect (compass direction facing slope in °, where 0 is the North); topographic position index (elevation of the pixel in relation to the mean elevation of the neighbor pixels); terrain ruggedness index (sum of the change in elevation between the pixels and its 8 neighbors cells); and finally roughness (difference between the maximum and the minimum value of a cell and its 8 surrounding cells) [31, 32].

As forests have a particular association with elevation and related variables, to avoid misinterpretation, in the following we compare the distribution of the T. pulchra and C. hololeuca to the distribution of forested pixels without their dominance. The distributions of T. pulchra and C. hololeuca presented an important overlap with the distribution of natural forests without their dominance’s, S2 and S3 Figs. They were both mainly found at elevations above 600 m with a peak of their distribution at 800-900 m S2a and S3a Figs. They are found similar slopes, even if the distribution of the slope for T. pulchra is more skewed, b and S3b Fig. As for the forest, they both occur predominantly on the south-oriented slopes, S2c and S3c Figs. For the topographic position there is a slight tendency of T. pulchra to appear on TPI more negative than the rest of the forest d, while the contrary is observed for the C. hololeuca S3d Fig. Finally, both species are distributed on terrains with similar roughness as other natural forested area, even if the distribution of roughness for T. pulchra is a little more skewed, S2e, S2f, S3e and S3f Figs.

Species distribution associations with climate variables

For most of the climate variable, T. pulchra distribution in relation to climate variables is different from a random distribution association with climate variable, that is, the mean climatic landscape, Fig 10. For the climate variables associated with water, precipitation and soil moisture, T. pulchra distribution was narrower than the distribution of the mean climatic landscape, peak at a median annual precipitation of 1775 mm, Fig 10a, and occurred on soil with a higher soil moisture (peak at 670 mm versus 550mm), Fig 10b. Its reference evapotranspiration and actual evapotranspiration present a skewed distribution around 905 and 965, respectively, Fig 10c and 10d. Its distribution encompasses locations with lowest climatic water deficit and the distibution of Palmer drought index peak at a value below the value of the mean climatic landscape, Fig 10e and 10f. For climate variables related to radiation, T. pulchra occurred in locations with less downward shortwave radiation, and maximum temperature lower that the mean climatic landscape, Fig 10g and 10h, while there is not so much difference for minimum temperature, Fig 10i. As for minimum temperature, not much differences are observed for the vapor pressure, Fig 10j, while T. pulchra occurred in location with lowest vapor pressure deficit than mean climatic landscape, Fig 10k. Finally, T. pulchra is mainly distributed where the highest wind speeds of the region are observed, Fig 10l.

Fig 10. Comparison of the distribution of the forests with or without T. pulchra dominance in relation with annual mean of climate variables over the Worldview2 image extent.

Fig 10

Climate variables were extracted for T. pulchra dominance polygon centroids (n = 461355) and from randomly distributed natural forests points without T. pulchra dominance (n = 50000).

C. hololeuca was distributed over the whole region, Fig 8. As a consequence, there are no noticeable differences between the distributions of the climate variables extracted for C. hololeuca coordinates and extracted with random coordinates across the image, S1S4 Figs.

Discussion

Mapping blooming trees

Here, a regional distribution map of all individuals of a natural tree species was produced based on its reproductive phenology using a U-net network and multispectral remote sensing images. The overall accuracy of 98.92% and an IoU of 0.867 could be explained by the unique spectral values (pink/purple) of the T. pulchra flowers, Fig 5. Other tropical tree species present intensely colored flowering blooms, such as the genus Jacaranda or Tabebuia, and could be mapped with U-net methods for ecological studies or to limit the damage of selective logging. Using flowering trees for mapping natural ecosystems has two main limitations. First, the number of adult trees is likely underestimated if the trees did not all flourish synchronously or every year such as already observed with the species Tabebuia guayacan at Barro Colorado Island [40]. Second, some other species can flower at the same time and have the same reflectance in the satellite image. In the Atlantic forest, two other species of the genus Tibouchina, T. mutabilis and T. sellowiana, show a similar behaviour as T. pulchra. They flower the same time, show similar spectral values, and occur in the same secondary succession, independently of the species. They are all know under the same vernaculary name “Manacá da Serra”. Due to this, we acknowledge that some errors in labelling can occur however all the mapped trees show similar characteristics that are to make pink/purple flowers synchronously and to dominate in new forests regenerating from pasture. Another problem with mapping flowering trees, specific with T. pulchra, is that they are also used as ornamental trees, and, consequently, they can be present outside of their natural or preferential range. There are some planted specimens at the National institute of Space Research of Brazil (INPE) and they are also sold on other continents. The method based on U-net and very high resolution images remains difficult to apply on a biome scale for the flowering blooms as it would require an enormous amount of storage capacity and images without clouds at the blooming time. However, the signal of T. pulchra flowers might be found in the time series from high resolution image such as Landsat or Sentinel-2 data, and further work will be made to map the T. pulchra in all the Atlantic forest biome. Tibouchina pulchra flower at different moments of the year, earlier north of the Atlantic forest than in the south, and deciphering the date of flowering of the same species on gradients of precipitation, temperature and during several years and for different latitudes could help to understand the mechanisms and climate variables inducing flowering in tropical trees.

Rio do Chapeu landscape history and its species indicators

The difference in forest cover between 1962 from the aerial image and 2017 from the U-net forest model over the Rio do Chapeu region showed that forest cover has largely increased, with more than half of the current forest cover (57%) grown after 1962, Fig 6. Among these new forests, almost one quarter were eucalyptus plantations, and only 20% of these plantations were made after the cut of an existing forest in 1962. After more than two centuries of intensive deforestation [6], this forest expansion is likely due to the federal law 5.106 of 1966 which has encouraged reforestation with fiscal incentive [19] by allowing farmers to apply 50% of their income tax to reforestation. It was also likely combined with the economical success of eucalyptus plantation in the early 1970s; in 1973, Brazil was already the largest eucalyptus producer in the world [41], and 80% of Brazil’s plantations were located in the São Paulo state, where our region of interest is located [42].

Here, we find more than 90% of T. pulchra were inside or less than 50 m away from a new forest, Fig 8. They are only dominating in stands of new forests, after forest recovery from a pastureland, as already observed locally in a field study in this region [23]. It is not only its presence but rather its dominance that indicates forest regeneration on an abandoned pasture. Consequently, we can use this dominance information to assess the history of the landscape. While occurring mainly in new forests, T. pulchra does not occur in all the new forests. As we now know the recent history in this region, other indicators from remote sensing that can inform on the forest succession stage might be tested here, such as α and β diversity computed directly from the image spectra [43, 44], canopy structure index based on textural information [45] or leaf area index, and carbon and nitrogen estimates from WorldView images [46]. T. pulchra population distribution is clearly related to regrowth after pasture; however, in a natural ecosystem, this dominant behaviour in open areas could be linked to landslides, which is the only natural process that can produce large opening in the forest that are comparable in size to pastureland, and which is the main process of landform development in this region [47].

While Cecropia hololeuca is one of the most common and recognisable tree in the Atlantic forest, our study shows a behaviour that was unknown before; that is, they occur mainly inside or close to old forest stands and rarely in new regeneration from pastureland. So they indicate canopy degradation of an old forest and not secondary succession from pasture. This was unexpected and not described before, likely because the individual needs to be mapped on a large scale to enable this observation, highlighting the importance of deep learning methods to improve our understanding of the tree species’ biology. This distribution might have three main reasons: the first is that the Cecropia hololeuca germinated better in a forest gap or close to a forest border than in an open area. An alternative explanation could be that large, old individuals that are well detected by the u-net are only found inside or on the border of forest because this genus is sensitive to wind, with low wood density and low survival rate after hurricanes such as observed in Puerto Rico by [48]. Finally, and more likely, this could be because of the previous land use. In the Amazon, it has been shown that Cecropia sp. occured in secondary forests after a clear cut only if there was no subsequent use of fire and conversion to pasture, and that another pioneer species, Vismia sp., dominated after fire and pasture conversion [49, 50]. In the Atlantic forest, as our result shows, C. hololeuca trees occurred predominantly in old forests, and we might be in a similar scenario as in the Amazon; that is, the C. hololeuca are absent of the secondary forest succession after use of pasture and fire. In this Atlantic forest region, all pastures originated from the traditional slash and burn methods, made initially for coffee plantations that were further converted to pasture [6].

Insights of large scale mapping for conservation and species distribution studies

The forest distribution of natural forest seems also to translate the human impacts. Remaining natural forest are present on steeper slopes and rougher terrain, which are obviously more difficult to exploit, and interestingly more oriented to the South, Fig 9. As reported by [6], in a manual for the coffee plantation management first published in 1847 by Francisco Peixoto de Lacerda Werneck and largely followed at the time, the plantation orientation on the South slopes was strongly discouraged [51]. This could explain why the natural forests with south orientation have been preserved until now. Based only on the species distribution, our results show that the forests of the region have undergone an important human influence, Fig 8. The forest degradation, described by the C. hololeuca dominance, represents 8.8% of the forest landscape. This dominance represents forests where the C. hololeuca concentration is above its mean concentration, and as our index of degradation is conservative, the degradation could be underestimated. Fortunately, our results also seem to indicate that these forests are old (> 60 years), and that they could have never been a pasture as C. hololeuca is present. This extends our previous findings [14] where, for the same WorldView-2 image, an index of disturbance based on C. hololeuca spatial distribution was produced, so now we know that these forests are old and which are the level of disturbance of these fragments. The regeneration of the forest, based on the T. pulchra dominance, represent 5% of the regional forest cover. Our results from the Rio do Chapeu, and because the 1966 law to encourage reforestation [19], seems to indicate that these new forests had grown after 1966. As not all new forests are dominated by T. pulchra, Fig 7, here we also underestimated the real regeneration from pasture. In Amazonia, abandoned clearcuts characterized by Cecropia sp. showed a richer mix of other arboreal genera characteristic of forest succession than arrested succession on abandoned pastures with history of fire dominated by the genus Vismia, another pioneer species [49]. Similarly, we observed C. hololeuca dominance mainly in old forests that have likely less or older history of fires than abandoned pasture where T. pulchra dominates. This could indicate that the forests characterized by the C. hololeuca dominance have a richer mix of species that the Tibouchina dominated forests but further studies combining biodiversity measured from the field, for example from the Biota Project [4], and our results are needed to confirm this. For the biodiversity studies and conservation, the dominance of C. hololeuca and T. pulchra measured from satellites could be important indices to report for the Atlantic forest.

The distribution of both species in relation to elevation and elevation-related variables show no remarkable differences with the distribution of the forest. This shows that analysis of forest distribution is necessary before studying individual species distribution to avoid biases. For example, in our case, the studies species are located on the south-oriented slopes, not because of their habitat preference, but because it is where the remaining/regrowing forests are. C. hololeuca is distributed over the region and did not show significant relation with climate (Fig 1), as expected since its the large climate envelope covering all the Atlantic forest biome [52]. This is not the case for T. pulchra even at the regional scale. Its distribution relation with climate variable shows that this species seems to prefer wet climate with annual precipitation above ∼ 1600 mm and maximum annual temperature below 24° while its distribution seems unaffected by minimum temperature, Fig 10. However, we cannot conclude on the wettest part of the gradient because higher annual precipitation are observed in the south fragment, which has not undergone deforestation and agriculture, so even if there are natural occurrences of T. pulchra the natural conditions for dominance are limited. Further works are needed to develop methods to study tree species spatial distribution based on the map of all individuals.

Conclusion

We have assessed the forest history in the ‘Rio do Chapeu’ region based on the difference of 1962 forest cover obtained from aerial images and of 2017 obtained through deep learning. The history of the forest was then related to the presence of two pioneer species: Tibouchina pulchra, which has been found to be dominant only in new forests regenerating from pasture, and Cecropia hololeuca, which is dominant only in old forests; that is, forests already existing in 1962 and with likely no preceding use of fire. The map of both species for the entire region shows that at least 5% of the current natural forests are recent regeneration from pasture, and that at least 10% of the natural forest show signs of important disturbance as indicated by the C. hololeuca dominance. Overall, we observed that the landscape has been highly impacted by humans. The positive point is that forest in this region has increased during the studied period, and that a lot of fragments are old forests (present before 1960 as deduced by the C. hololeuca dominance) likely containing more biodiversity that regeneration from pasture. The analysis of forest and species distribution show mainly: (i) that forests are oriented to the South and on steeper slopes that non forest, likely due to expansion of coffee plantation favoured on North slopes, (ii) that Cecropia hololeuca has a larger distribution range than the observed in studied region, and (iii) that Tibouchina pulchra seems to be restricted to wet environments with annual precipitation above 1600 mm. Finally, here we show how to use multiple sources of remote sensing data and new deep learning techniques to retrieve information on biology, such as tree species distribution and their behaviour, and how it can be used to interpret the landscape history.

Supporting information

S1 Fig. Distribution of the natural forests, excluding the largest fragment at the south, and non forest (drawn from a random distribution) in relation to elevation and five elevation-related variables at the regional scale.

The five elevation related indices consider the eight neighbor pixels and are slope (°); aspect (compass direction facing slope in °, where 0 is the North); topographic position index (elevation of the pixel in relation to the mean elevation of the neighbour pixels); terrain ruggedness index (sum of the change in elevation between the pixels and its 8 neighbors cells); and, finally roughness (difference between the maximum and the minimum value of a cell and its 8 surrounding cells) [31, 32].

(PDF)

S2 Fig. Comparison of the distribution of the forests with or without T. pulchra dominance in relation with elevation-related variables over the WorldView-2 image extent.

Elevation-related variables were extracted for T. pulchra dominance polygon centroids (n = 461355) and from randomly distributed natural forests points without T. pulchra dominance (n = 50000).

(PDF)

S3 Fig. Comparison of the distribution of the forests with or without Cecropia hololeuca dominance in relation with elevation and elevation-related variables over the WorldView-3 image extent.

Elevation-related variables were extracted for C. hololeuca dominance polygon centroids (n = 169542) and from randomly distributed natural forests points without C. hololeuca dominance (n = 11717).

(PDF)

S4 Fig. Comparison of the distribution of the forests with or without Cecropia hololeuca dominance in relation with annual mean of climate variables over the WorldView-3 image extent.

Climate variables were extracted for C. hololeuca dominance polygon centroids (n = 169542) and from randomly distributed natural forests points without C. hololeuca dominance (n = 11717).

(PDF)

Acknowledgments

We thank the two referees for their review and positive comments to improve the text. We thank DigitalGlobe for the provision of WorldView-2 and WorldView-3 satellite images.

Data Availability

The Worldview images and the 1962 aerial images are owned by DigitalGlobe and Emplasa, respectively, and cannot be shared publicly. However, a code example of the U-net model with a simulated Worldview image for reproducibility is available (https://doi.org/10.5281/zenodo.3601503) and the data of the maps presented in the Fig 8 are available on a public repository (https://doi.org/10.5281/zenodo.3601487). The authors confirm they had no special access or privileges to the Worldview images that other researchers would not have.

Funding Statement

The research leading to these results received funding from the project BIO-RED ‘Biomes of Brazil -- Resilience, Recovery, and Diversity’, which is supported by the São Paulo Research Foundation (FAPESP, 2015/50484-0) and the U.K. Natural Environment Research Council (NERC, NE/N012542/1). F.H.W. has been funded by FAPESP (grant 2016/17652-9). A.S. acknowledges the support of the FAPESP (grant 2016/03397-7). M.P.M.A. has been funded by ECOFOR Project, BIOTA-FAPESP (grant number 2012/51872-5). A.L.C.R. has been funded by supported by the State of São Paulo Research Foundation/FAPESP as part of the project ECOFOR (Process number 2012/51872-5) within the BIOTA/FAPESP Program - The Biodiversity Virtual Institute (www.biota.org.br) and co-supported by the British Natural Environment Research Council/NERC (NE/K016431/1). M.G.F acknowledges the support of Capes through a postdoctoral fellowship. Y.T. has been funded by the project EPITOME ANR-17-CE23-0009 of the French National Research Agency (ANR). L.E.O.C.A. thank the support of FAPESP (grant 2013/50533-5) and CNPq (grant 305054/2016-3). We also thank the Amazon Fund through the financial collaboration of the Brazilian Development Bank (BNDES) and the Foundation for Science, Technology and Space Applications (FUNCATE) no. 17.2.0536.1 (Environmental Monitoring of Brazilian Biomes). The funders (FAPESP, NERC, Capes, ANR and Luxcarta Technology) provided support in the form of salaries for authors F.W., A.S., Y.T. and M.G.F., but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.

References

  • 1.SSC. Secretariat for Social Communication—Presidency of the Federative Republic of Brazil, Biodiversity in Brazil—Fact Sheet. In: United Nations Conference on Biological Diversity (COP11)- Hyderabad, India; 2012.
  • 2. Joly CA, Rodrigues RR, Metzger JP, Haddad CFB, Verdade LM, Oliveira MC, et al. Biodiversity Conservation Research, Training, and Policy in São Paulo. Science. 2010;328(5984):1358–1359. 10.1126/science.1188639 [DOI] [PubMed] [Google Scholar]
  • 3. Laurance WF. Conserving the hottest of the hotspots. Biological Conservation. 2009;142(6):1137 10.1016/j.biocon.2008.10.011. [DOI] [Google Scholar]
  • 4. Joly CA, Metzger JP, Tabarelli M. Experiences from the Brazilian Atlantic Forest: ecological findings and conservation initiatives. New Phytologist. 2014;204(3):459–473. 10.1111/nph.12989 [DOI] [PubMed] [Google Scholar]
  • 5.INPE. Atlas dos remanescentes florestais da Mata atlântica—período 2015-2016—Relatório técnico, Fundação SOS Mata Atlântica—Instituto Nacional de Pesquisas Espaciais; 2017.
  • 6. Dean W. With broadax and firebrand: the destruction of the Brazilian Atlantic Forest. Univ of California Press; 1997. [Google Scholar]
  • 7. Ribeiro MC, Metzger JP, Martensen AC, Ponzoni FJ, Hirota MM. The Brazilian Atlantic Forest: How much is left, and how is the remaining forest distributed? Implications for conservation. Biological Conservation. 2009;142(6):1141–1153. 10.1016/j.biocon.2009.02.021. [DOI] [Google Scholar]
  • 8. Hansen MC, Potapov PV, Moore R, Hancher M, Turubanova SA, Tyukavina A, et al. High-Resolution Global Maps of 21st-Century Forest Cover Change. Science. 2013;342(6160):850–853. 10.1126/science.1244693 [DOI] [PubMed] [Google Scholar]
  • 9. Silva AL, Alves DS, Ferreira MP. Landsat-Based Land Use Change Assessment in the Brazilian Atlantic Forest: Forest Transition and Sugarcane Expansion. Remote Sensing. 2018;10(7):996–999. 10.3390/rs10070996 [DOI] [Google Scholar]
  • 10. Rodrigues RR, Gandolfi S, Nave AG, Aronson J, Barreto TE, Vidal CY, et al. Large-scale ecological restoration of high-diversity tropical forests in SE Brazil. Forest Ecology and Management. 2011;261(10):1605–1613. 10.1016/j.foreco.2010.07.005. [DOI] [Google Scholar]
  • 11. Strassburg BB, Barros FS, Crouzeilles R, Iribarrem A, Santos JSd, Silva D, et al. The role of natural regeneration to ecosystem services provision and habitat availability: a case study in the Brazilian Atlantic Forest. Biotropica. 2016;48(6):890–899. 10.1111/btp.12393 [DOI] [Google Scholar]
  • 12. Diaz S, Fargione J, Chapin IFS, Tilman D. Biodiversity Loss Threatens Human Well-Being. PLoS Biol. 2006;4(8):e277 10.1371/journal.pbio.0040277 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.World Resources Institute. Millennium Ecosystem Assessment—Ecosystems and human well-being: Biodiversity synthesis. Washington (D. C.); 2005.
  • 14. Wagner FH, Sanchez A, Tarabalka Y, Lotte RG, Ferreira MP, Aidar MPM, et al. Using the U-net convolutional network to map forest types and disturbance in the Atlantic rainforest with very high resolution images. Remote Sensing in Ecology and Conservation. 2019;0(0). [Google Scholar]
  • 15. Kwok R. Ecology’s remote-sensing revolution. Nature. 2018;556 (7699):137–138. 10.1038/d41586-018-03924-9 [DOI] [PubMed] [Google Scholar]
  • 16. Turner W. Sensing biodiversity. Science. 2014;346(6207):301–302. 10.1126/science.1256014 [DOI] [PubMed] [Google Scholar]
  • 17. He KS, Bradley BA, Cord AF, Rocchini D, Tuanmu MN, Schmidtlein S, et al. Will remote sensing shape the next generation of species distribution models? Remote Sensing in Ecology and Conservation. 2015;1(1):4–18. 10.1002/rse2.7 [DOI] [Google Scholar]
  • 18. Pettorelli N, Safi K, Turner W. Satellite remote sensing, biodiversity research and conservation of the future. Philosophical Transactions of the Royal Society B-Biological Sciences. 2014;369(1643, SI). 10.1098/rstb.2013.0190 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Sampaio A. 2 Sampaio, Anavarro. Os eucaliptos no Brasil. In:, pp.5-10. In: O eucalipto e as ecologias. Aracruz Celulose S.A. (Aracruz, ES); 1975. p. 5–10.
  • 20. Lorenzi H. Árvores brasileiras: manual de identificação e cultivo de plantas arbóreas nativas do Brasil. Nova Odessa: Editora Plantarum 352p-col illus Por Geog. 1992;4. [Google Scholar]
  • 21.Leitão Filho HdF, Pagano S, Cesar O, Timoni J, Rueda J. Ecologia da mata atlântica em Cubatão. Campinas, Editora Unesp da Fundação para o Desenvolvimento da Universidade Estadual Paulista, Editora da Universidade de Campinas. 1993;.
  • 22. Brito VL, Sazima M. Tibouchina pulchra (Melastomataceae): reproductive biology of a tree species at two sites of an elevational gradient in the Atlantic rainforest in Brazil. Plant Systematics and Evolution. 2012;298(7):1271–1279. 10.1007/s00606-012-0633-5 [DOI] [Google Scholar]
  • 23. Tabarelli M, Mantovani W. A regeneração de uma floresta tropical montana após corte e queima (São Paulo-Brasil). Revista Brasileira de Biologia. 1999;59(2):239–250. 10.1590/S0034-71081999000200008 [DOI] [Google Scholar]
  • 24. Abatzoglou JT, Dobrowski SZ, Parks SA, Hegewisch KC. TerraClimate, a high-resolution global dataset of monthly climate and climatic water balance from 1958–2015. Scientific data. 2018;5:170191 10.1038/sdata.2017.191 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Farr TG, Rosen PA, Caro E, Crippen R, Duren R, Hensley S, et al. The shuttle radar topography mission. Reviews of geophysics. 2007;45(2). 10.1029/2005RG000183 [DOI] [Google Scholar]
  • 26. Ronneberger O, Fischer P, Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation. CoRR. 2015;abs/1505.04597. [Google Scholar]
  • 27. Grizonnet M, Michel J, Poughon V, Inglada J, Savinaud M, Cresson R. Orfeo ToolBox: open source processing of remote sensing images. Open Geospatial Data, Software and Standards. 2017;2(1):15 10.1186/s40965-017-0031-6 [DOI] [Google Scholar]
  • 28.Brown M, Lowe D. Recognising Panoramas. In: Proceedings of the 9th International Conference on Computer Vision. vol. 2. Nice; 2003. p. 1218–1225.
  • 29. Brown M, Lowe DG. Automatic Panoramic Image Stitching using Invariant Features. International Journal of Computer Vision. 2007;74(1):59–73. 10.1007/s11263-006-0002-3 [DOI] [Google Scholar]
  • 30.ESRI. ArcGIS Desktop 10.4 Geostatistical Analyst; 2018. Environmental Systems Research Institute.
  • 31. Horn BK. Hill shading and the reflectance map. Proceedings of the IEEE. 1981;69(1):14–47. 10.1109/PROC.1981.11918 [DOI] [Google Scholar]
  • 32. Wilson MF, O’Connell B, Brown C, Guinan JC, Grehan AJ. Multiscale terrain analysis of multibeam bathymetry data for habitat mapping on the continental slope. Marine Geodesy. 2007;30(1-2):3–35. 10.1080/01490410701295962 [DOI] [Google Scholar]
  • 33.Huang B, Lu K, Audebert N, Khalel A, Tarabalka Y, Malof J, et al. Large-scale semantic classification: outcome of the first year of Inria aerial image labeling benchmark. In: IEEE International Geoscience and Remote Sensing Symposium—IGARSS 2018. Valencia, Spain; 2018. Available from: https://hal.inria.fr/hal-01767807.
  • 34. Dice LR. Measures of the amount of ecologic association between species. Ecology. 1945;26(3):297–302. 10.2307/1932409 [DOI] [Google Scholar]
  • 35.Allaire J, Chollet F. keras: R Interface to’Keras’; 2016. Available from: https://keras.rstudio.com.
  • 36.Chollet F, et al. Keras; 2015. https://keras.io.
  • 37.R Core Team. R: A Language and Environment for Statistical Computing; 2016. Available from: https://www.R-project.org/.
  • 38.Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, et al.. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems; 2015. Available from: https://www.tensorflow.org/.
  • 39.QGIS Development Team. QGIS Geographic Information System; 2009. Available from: http://qgis.osgeo.org.
  • 40. Sánchez-Azofeifa A, Rivard B, Wright J, Feng JL, Li P, Chong MM, et al. Estimation of the distribution of Tabebuia guayacan (Bignoniaceae) using high-resolution remote sensing imagery. Sensors. 2011;11(4):3831–3851. 10.3390/s110403831 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. da Silva MC, Scarano FR, de Souza Cardel F. Regeneration of an Atlantic forest formation in the understorey of a Eucalyptus grandis plantation in south-eastern Brazil. Journal of Tropical Ecology. 1995;11(1):147–152. 10.1017/S0266467400008518 [DOI] [Google Scholar]
  • 42.Viana MB. O eucalipto e os efeitos ambientais do seu plantio em escala. 2004;.
  • 43. Feret JB, Asner GP. Mapping tropical forest canopy diversity using high-fidelity imaging spectroscopy. Ecological Applications. 2014;24(6):1289–1296. 10.1890/13-1824.1 [DOI] [PubMed] [Google Scholar]
  • 44. Draper FC, Asner GP, Honorio Coronado EN, Baker TR, García-Villacorta R, Pitman NCA, et al. Dominant tree species drive beta diversity patterns in Western Amazonia. Ecology. 2019;0(ja):e02636 10.1002/ecy.2636 [DOI] [PubMed] [Google Scholar]
  • 45. Couteron P, Barbier N, Gautier D. Textural ordination based on Fourier spectral decomposition: a method to analyze and compare landscape patterns. Landscape Ecology. 2006;21(4):555–567. 10.1007/s10980-005-2166-6 [DOI] [Google Scholar]
  • 46. Mutanga O, Dube T, Omer G. Assessing edge effect on the spatial distribution of selected forest biochemical properties derived using the Worldview data in Dukuduku forests, South Africa. African Journal of Ecology. 2019; p. aje.12604. 10.1111/aje.12604 [DOI] [Google Scholar]
  • 47. Furian S, Barbiéro L, Boulet R. Organisation of the soil mantle in tropical southeastern Brazil (Serra do Mar) in relation to landslides processes. Catena. 1999;38(1):65–83. 10.1016/S0341-8162(99)00015-6 [DOI] [Google Scholar]
  • 48. Zimmerman JK, Everham EM III, Waide RB, Lodge DJ, Taylor CM, Brokaw NV. Responses of tree species to hurricane winds in subtropical wet forest in Puerto Rico: implications for tropical tree life histories. Journal of Ecology. 1994; p. 911–922. 10.2307/2261454 [DOI] [Google Scholar]
  • 49. Mesquita RC, Ickes K, Ganade G, Williamson GB. Alternative successional pathways in the Amazon Basin. Journal of Ecology. 2001;89(4):528–537. 10.1046/j.1365-2745.2001.00583.x [DOI] [Google Scholar]
  • 50. Wieland LM, Mesquita RC, Bobrowiec PED, Bentos TV, Williamson GB. Seed rain and advance regeneration in secondary succession in the Brazilian Amazon. Tropical Conservation Science. 2011;4(3):300–316. 10.1177/194008291100400308 [DOI] [Google Scholar]
  • 51. Werneck FPdL. Memória sobre a fundação de uma fazenda na província do Rio de Janeiro. Brasília/Rio de Janeiro: Senado Federal, Casa de Rui Barbosa. 1985; p. 57–58. [Google Scholar]
  • 52. Franco-Rosselli P, Berg CC. Distributional patterns of Cecropia (Cecropiaceae): a panbiogeographic analysis. Caldasia. 1997; p. 285–296. [Google Scholar]

Decision Letter 0

Judi Hewitt

29 Nov 2019

PONE-D-19-23068

Mapping Atlantic rainforest degradation and regeneration history with indicator species using convolutional network

PLOS ONE

Dear Dr. Wagner,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by Jan 13 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Judi Hewitt

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

1. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This is a highly technical paper and I am not an expert in remote sensing techniques, so I cannot really judge all of the methods. That said, the approach seems very rigorous to me and offers new insights into species distributions and how they can be detected using high-resolution remote sensing data. The approach is focused on classifying two tree species that are "indicator" species of young and old secondary forest in the region. The results on regrowth forest and Eucalyptus plantations is very interested and relevant. The limitations of the methods are clearly presented, but it is not clear how errors due to including other flowering species might change the results of the analysis.

page 12, line 11: As there are only two time periods assessed, how do you know that forests directly converted to plantations? There could have been intermediate agricultural land use.

Section 5.1: how would results be affected if some of these other flowering trees were labelled as Tibouchina pulchra? Are they also good indicators of regenerating forests?

page 21, line 3: Most pastures in São Paulo state were derived from coffee plantations and not from shifting cultivation.

page 21, line 24: Where are these data presented? I don't think that evidence is presented in this paper to support this claim. Results from Amazonia in Cecropia vs. Vismia dominated secondary forests are not relevant here.

page 22, line 11: these still could be old second-growth (more than 60 yr old). Deforestation happened centuries ago in this region. Old forest is a vague term and for some it means old-growth forest (not cleared historically).

Reviewer #2: General Comments

The authors present in this work a method to evaluate landscape forest degradation and regeneration history using very high resolution satellite data and aerial photographs to map the occurrence of two indicator species. The method is based on machine learning and identifies the presence of species based on the image’s spectral characteristics. They have also crossed this species occurrence data to local climate and geomorphological data in order to understand the species’ habitat preferences.

The workflow of the geoprocessing is well described and includes a good error estimation section, which increases the robustness of the results. This is important because in my view the framework has good potential for expansion to larger areas and use as a tool for other studies, especially in areas in which Cecropia hololeuca and Tibouchina pulchra occur in the netropics, but also by using other local species. However, in order to increase the usage of this method by the scientific community, some aspects especially regarding scientific reproducibility should be included.

First, scientific research with remote sensing is heavily based on freely available satellite data, which is not the case with the very high resolution Worldview catalogue. It would be invaluable to analyze how free datasets (e.g. Landsat, CBERS, Sentinel-2) fare in comparison to the Worldview data for detecting the target species within the framework (even if they perform poorly, strengthening the case for free very high resolution data). Second, it is important that the algorithm code becomes available through GitHub or another code sharing platform from the time of the publication, together with an image sample from 1962 and another from Worldview, providing a demo of the analysis. This would strengthen the citation and usage frequency by this study.

Finally, I noticed in this study a general lack of usage of forest fragmentation as an analysis and discussion base within tropical ecology (Laurance et al. 2011; Chaplin-Kramer et al. 2015; Lôbo et al. 2011) and remote sensing (Dantas De Paula, Groeneveld, and Huth 2016; Pettorelli 2015; Melito, Metzger, and de Oliveira 2018). This I feel would be invaluable for understanding the occurrence of the indicator species within a forest edge-core gradient.

Specific comments:

Page 1, Line 7 – elevation data instead of only elevation.

P2L4 – Also high diversity forest restoration initiatives in the Atlantic Forest, see (Melo et al. 2013; Rodrigues et al. 2011)

P2L24- would eucalyptus plantations be considered as reforestation?

P2L29 – Maybe not relevant to include common names.

P3L29 - I know that in Brazil the forest domains are called "biomes" but this has another connotation in the outside scientific community, maybe use another term?

P5L4 – The digitalized images are panchromatic or have also RGB bands?

P5L27 - Since SRTM is a surface model and not a terrain model, ruggedness could be influenced by canopy characteristics.

P8L2 - How does this compare with conventional supervised/unsupervised image classification methods?

P10L5 – I suggest that a sample analysis demo with the complete code be prepared and shared for this study.

P10L9 - Why not also use other low resolution forest change maps in addition (e.g. Hansen et al. 2013)

P12L8 - The data should be better presented, in order to show how forest cover (natural and plantations) has changed (i.e. loss or gained) from 1962 to 2017.

P12L14 - Probably the fact that these remaining forests are now edge-dominated fragments?

Fig.6 - This could be better presented with a "forest change" color scheme similar to Hansen (unchanged, gain, loss in relation to 1962).

Fig.7 – Same as Fig. 6.

P17L7 - Would be interesting to see how the species are located in terms of edge distance.

P20L25 - This may be related to fragmental collapse.

References

Chaplin-Kramer, Rebecca, Ivan Ramler, Richard Sharp, Nick M Haddad, James S Gerber, Paul C West, Lisa Mandle, et al. 2015. “Degradation in Carbon Stocks near Tropical Forest Edges.” Nature Communications 6: 10158. https://doi.org/10.1038/ncomms10158.

Dantas De Paula, Mateus, Jürgen Groeneveld, and Andreas Huth. 2016. “The Extent of Edge Effects in Fragmented Landscapes: Insights from Satellite Measurements of Tree Cover.” Ecological Indicators 69: 196–204. https://doi.org/10.1016/j.ecolind.2016.04.018.

Hansen, M. C., P. V. Potapov, R. Moore, M. Hancher, S. A. Turubanova, A. Tyukavina, D. Thau, et al. 2013. “High-Resolution Global Maps of 21st-Century Forest Cover Change.” Science 342 (6160): 850–53. https://doi.org/10.1126/science.1244693.

Laurance, William F., José L.C. Camargo, Regina C.C. Luizão, Susan G Laurance, Stuart L Pimm, Emilio M Bruna, Philip C Stouffer, et al. 2011. “The Fate of Amazonian Forest Fragments: A 32-Year Investigation.” Biological Conservation. https://doi.org/10.1016/j.biocon.2010.09.021.

Lôbo, Diele, Tarciso Leão, Felipe P L Melo, André M M Santos, and Marcelo Tabarelli. 2011. “Forest Fragmentation Drives Atlantic Forest of Northeastern Brazil to Biotic Homogenization.” Diversity and Distributions 17 (2): 287–96. https://doi.org/10.1111/j.1472-4642.2010.00739.x.

Melito, Melina, Jean Paul Metzger, and Alexandre A. de Oliveira. 2018. “Landscape-Level Effects on Aboveground Biomass of Tropical Forests: A Conceptual Framework.” Global Change Biology 24 (2): 597–607. https://doi.org/10.1111/gcb.13970.

Melo, Felipe P.L., Víctor Arroyo-Rodríguez, Lenore Fahrig, Miguel Martínez-Ramos, and Marcelo Tabarelli. 2013. “On the Hope for Biodiversity-Friendly Tropical Landscapes.” Trends in Ecology & Evolution 28 (8): 462–68. https://doi.org/10.1016/j.tree.2013.01.001.

Pettorelli, Nathalie. 2015. “Agree on Biodiversity Metrics to Track from Space.” Nature 523: 5–7. https://doi.org/10.1038/523403a.

Rodrigues, Ricardo Ribeiro, Sergius Gandolfi, André Gustavo Nave, James Aronson, Tiago Egydio Barreto, Cristina Yuri Vidal, and Pedro H.S. Brancalion. 2011. “Large-Scale Ecological Restoration of High-Diversity Tropical Forests in SE Brazil.” Forest Ecology and Management 261 (10): 1605–13. https://doi.org/10.1016/j.foreco.2010.07.005.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Mateus Dantas de Paula

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Feb 28;15(2):e0229448. doi: 10.1371/journal.pone.0229448.r002

Author response to Decision Letter 0


20 Jan 2020

Response to Reviewers

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This is a highly technical paper and I am not an expert in remote sensing techniques, so I cannot really judge all of the methods. That said, the approach seems very rigorous to me and offers new insights into species distributions and how they can be detected using high-resolution remote sensing data. The approach is focused on classifying two tree species that are "indicator" species of young and old secondary forest in the region. The results on regrowth forest and Eucalyptus plantations is very interested and relevant. The limitations of the methods are clearly presented, but it is not clear how errors due to including other flowering species might change the results of the analysis.

FW : Dear Reviewer #1, thank you very much for you positive review that I believe have improved the paper. In the following text you will find your comments and our answers in blue.

page 12, line 11: As there are only two time periods assessed, how do you know that forests directly converted to plantations? There could have been intermediate agricultural land use.

FW : You are right it could have been intermediate use. To clarify the sentence was changed to: “The eucalyptus were mainly planted where there was no forest in 1962; only 21.1 % of the plantation were natural forests in 1962. As only two time periods have been assessed, 1962 and 2017, intermediate land uses that could have existed between transition from natural forests and to planted forests are unknown.”

Section 5.1: how would results be affected if some of these other flowering trees were labelled as Tibouchina pulchra? Are they also good indicators of regenerating forests?

FW: You are right some other flowering trees can be labelled Tibouchina pulchra, likely other Tibouchina species. However, from the results, we find that ”66.8% of these trees were located in new forests that were pasture in 1962, 81.4% inside or at a distance below 25 m of a new forest and 89.7% inside or at a distance below 50 m of a new forest”. So even considering errors in the labelling, we find a similar preference for the habitat, that is inside or nearby a regenerating forest, for all the mapped trees. Furthermore, there is a clear dominance behaviour of the mapped species: “in the neighbouring area of 25 m around each Tibouchina pixels, there is a mean of 17.5 % of Tibouchina pixels (343.5 m²)”; so even considering labelling errors this dominance behaviour seems also shared. And, we mapped regeneration only based on the dominance of the trees, which means that our method does not account for isolated individuals. Finally, the massive and time synchronous flowering with a similar spectral value is shared among all this trees. To conclude, even some errors in labels can occurs, all the mapped trees show similar characteristics that are (i) to make flowers synchronously and (ii) to dominate in new forests regenerating from pasture.

In the text, the following sentence was added: “ … are all know under the same vernaculary name "Manacá da Serra". Due to this, we acknowledge that some errors in labelling can occur however all the mapped trees show similar characteristics that are to make pink/purple flowers synchronously and to dominate in new forests regenerating from pasture.”

page 21, line 3: Most pastures in São Paulo state were derived from coffee plantations and not from shifting cultivation.

FW: I totally agreed with that, the main transition was likely: forest -> coffee plantations -> pastures. I think it is exactly the meaning of the sentence “In this Atlantic forest region, all pastures originated from the traditional slash and burn methods, made initially for coffee plantations that were further converted to pasture (Dean, 1997).”. However, here Dean does no refer to slash and burn cultivation but only to slash and burn technics to prepare the land for coffee plantation, but I understand it can be a bit confusing for the reader. To clarify the sentence was changed to “In this Atlantic forest region, all pastures originated from forest cutting and burning, made initially for coffee plantations that were further converted to pasture (Dean, 1997).”.

page 21, line 24: Where are these data presented? I don't think that evidence is presented in this paper to support this claim. Results from Amazonia in Cecropia vs. Vismia dominated secondary forests are not relevant here.

FW : The paper of Cecropia vs. Vismia show that the forest succession is different depending on the past use of fire, that is, if only clearcut, Cecropia sp. dominated, while in case of clearcut + fire, Vismia sp. dominated. I really believe that it could be linked at some point to what we are observing in the Atlantic forest, that is C. hololeuca are almost absent in the succession from pasture regeneration. This is the closest study have found to discuss my results. I have changed the sentence to “Similarly, we observed C. hololeuca dominance mainly in old forests that have likely less or older history of fires than abandoned pasture where T. pulchra dominates. This could indicate that the forests characterized by the C. hololeuca dominance have a richer mix of species that the Tibouchina dominated forests but further studies combining biodiversity measured from the field, for example from the BiotaProject (Joly et al., 2014), and our results are needed to confirm this.”

page 22, line 11: these still could be old second-growth (more than 60 yr old). Deforestation happened centuries ago in this region. Old forest is a vague term and for some it means old-growth forest (not cleared historically).

FW: Yes, you are right, and, in the sentence, I have removed “and with likely no preceding use of fire.” which was more an assumption that have emerged from the discussion than a result. The term old forest in the paper refer only to forest present in 1962 as defined in the section 3.2.1 of the Methods. And to clarify, I sometimes repeat the paper definition of old forest. I didn’t find a better term, and for the graphics it is better as it is short and very discriminative, but if you have a suggestion, I can include it.

Just as a remark, in my opinion these forests are very old and have at least more than a century. In the aerial photographs, I have saw that there was already Cecropias in almost all the fragments with Cecropias today, that how I had the idea of this study, that is that this species only occurs inside old forest fragments and that’s what our results shows (and was different of what botanist were thinking of this species). I didn’t analyse this because it will be too qualitative: the quality of the aerial photography is not so good as satellite image and I only have images from the Rio do Chapeu region, however, you can clearly see some white spots in the fragments which are the Cecropias, see Fig. 1. So, following our results, this means that these fragments were forest at least 60 years before the aerial photographs of 1962.

Figure 1 : Example of forest fragments with visible Cecropias in the 1962 aerial image from the rio do chapeu region (these fragments can be found in the figure 7 of the paper in the top left part)

Reviewer #2: General Comments

The authors present in this work a method to evaluate landscape forest degradation and regeneration history using very high resolution satellite data and aerial photographs to map the occurrence of two indicator species. The method is based on machine learning and identifies the presence of species based on the image’s spectral characteristics. They have also crossed this species occurrence data to local climate and geomorphological data in order to understand the species’ habitat preferences.

The workflow of the geoprocessing is well described and includes a good error estimation section, which increases the robustness of the results. This is important because in my view the framework has good potential for expansion to larger areas and use as a tool for other studies, especially in areas in which Cecropia hololeuca and Tibouchina pulchra occur in the netropics, but also by using other local species. However, in order to increase the usage of this method by the scientific community, some aspects especially regarding scientific reproducibility should be included.

First, scientific research with remote sensing is heavily based on freely available satellite data, which is not the case with the very high resolution Worldview catalogue. It would be invaluable to analyze how free datasets (e.g. Landsat, CBERS, Sentinel-2) fare in comparison to the Worldview data for detecting the target species within the framework (even if they perform poorly, strengthening the case for free very high resolution data). Second, it is important that the algorithm code becomes available through GitHub or another code sharing platform from the time of the publication, together with an image sample from 1962 and another from Worldview, providing a demo of the analysis. This would strengthen the citation and usage frequency by this study.

Finally, I noticed in this study a general lack of usage of forest fragmentation as an analysis and discussion base within tropical ecology (Laurance et al. 2011; Chaplin-Kramer et al. 2015; Lôbo et al. 2011) and remote sensing (Dantas De Paula, Groeneveld, and Huth 2016; Pettorelli 2015; Melito, Metzger, and de Oliveira 2018). This I feel would be invaluable for understanding the occurrence of the indicator species within a forest edge-core gradient.

FW : Dear Reviewer #2, thank you very much for you positive review, I have try to account of most of you comment and believe it has improved the paper.

I have now added the data of Cecropia and Tibouchina dominances and natural forest cover (Figure 8) in shapefile as well as the U-net model code example with a simulated WordView image on Zenodo public repository(map data: https://doi.org/10.5281/zenodo.3601487 and the Unet model example : https://doi.org/10.5281/zenodo.3601503 ). Unfortunately, I can’t share the original Worldview and the aerial images as I don’t have the rights to do this.

I understand your point of using other dataset, but it goes beyond the scope of this paper, that is why this appears in perspectives in the discussion. I am currently working on this with sentinel data, it works at some point, but you definitively need high resolution image to be 100% sure to identify the objects you are mapping. And working with low resolution goes with tons of other problem such as registration, shade, clouds, no data etc… I have schedule one entire year to work on this, it is not something that can be done quickly and added in this paper. This will be the next work and I am convinced there are some very promising perspectives for large scale species mapping.

The fragmentation analysis you are asking to me, based on the same Cecropia map used in this work, has been published in Remote Sensing in Ecology and Conservation with Nathalie Pettorelli as Editor in 2019 (see here: https://zslpublications.onlinelibrary.wiley.com/doi/full/10.1002/rse2.111). It includes a more detailed methodology section as well as the complete analysis of the Cecropia distance to the edge in all the fragments, plus a discussion with full section focused on fragmentation. That why I didn’t much explore this in this paper and focus more on the history of the fragment. For the Tibouchina this analysis is not so relevant because they are dominant, so they are distributed all over the new forests, from the edge to the center. I didn’t make this distribution analysis because I know that it will bring more work than valuable information. In my opinion, the best information that Tibouchina trees can show us is that a fragment dominated by Tibouchina is a recent regeneration of a pasture, which is, I believe a new and very valuable information for conservation.

Furthermore, in the other paper I have computed an index for each fragment but what I have saw that there is a problem to deal with large fragment and you lost important information, that is why I have adopted a more pixel based approach. For example, considering a large fragment of 1000 ha with 8 hectares of disturbed forest with Cecropia dominance, if I compute one disturbance index, let’s say the percentage of Cecropia dominance, the fragment will be classified as undisturbed because Cecropia dominance only represent 0.8 %. Now if we consider a fragment of 10 with the same 8 hectares of disturbed forest with Cecropia dominance, the fragment will be now classified as disturbed because Cecropia dominance represent 80% of the fragment. In both case we have 8 ha dominated by Cecropia and in the case of large fragment we totally lost this information. With the method developed in this paper, independently of the fragment size, we can map exactly where the disturbance is and its size, it goes beyond fragmentation analysis, we are looking at indicator species distribution inside the fragment to identify forest degradation history, something that nobody have done before at this scale with high resolution image.

In the following text you will find your comments and our answers in blue.

Specific comments:

Page 1, Line 7 – elevation data instead of only elevation.

FW : agree and corrected

P2L4 – Also high diversity forest restoration initiatives in the Atlantic Forest, see (Melo et al. 2013; Rodrigues et al. 2011)

FW : agree and added. I have only added the reference of Rodrigues et al because in contains real data while the Melo et al paper is more about concepts and potential implications of successional trajectories for conservation planning. The sentence now reads :” This increase in tree cover is driven mainly by eucalyptus plantations and natural regeneration (Silva et al., 2018), and also by forest restoration initiatives (Rodrigues et al., 2011).”

P2L24- would eucalyptus plantations be considered as reforestation ?

FW : you are right, plantation are not considered as reforestation but as afforestation, the sentence now reads “… after 1966, when the Brazilian federal law 5.106 came into force with the goal to encourage afforestation and reforestation in Brazil (Sampaio, 1975).”

P2L29 – Maybe not relevant to include common names.

FW : I still think that this is a nice information, particularly for the Brazilian readers who know only this tree by the common name. For example, when I present this work to Brazilians and people working on Atlantic forest, a large part of the audience know this species with the vernaculary name “Manaca da serra” as this species is really common in the Atlantic forest, but if I say only Tibouchina pulchra, nobody knows which plant it is.

P3L29 - I know that in Brazil the forest domains are called "biomes" but this has another connotation in the outside scientific community, maybe use another term?

FW : This is the term in use in our scientific community, and also used as a synonym of ‘domain’ (which I believe is term you are suggesting). For example, the term ‘biome’ is used in all the publications about Atlantic forest you cite in references and used as a synonym of ‘domain’ in previous Plos One article on Atlantic forest (for example Bogoni et al, 2018, https://doi.org/10.1371/journal.pone.0204515). So, as there is no clear consensus on using biome or domains I have kept biome in the text for consistency.

Bogoni JA, Pires JSR, Graipel ME, Peroni N, Peres CA (2018) Wish you were here: How defaunated is the Atlantic Forest biome of its medium- to large-bodied mammal fauna? PLoS ONE 13(9): e0204515. https://doi.org/10.1371/journal.pone.0204515

P5L4 – The digitalized images are panchromatic or have also RGB bands?

FW : The aerial images are panchromatic, so they look as a gray image (see Figure 1 above, in the responses to Reviewer#1). The digital images of the aerial images are *.png, so they have RGB bands, however the image still looks as a gray image. In the text I have now included this information: “In 1962, ten years before the launch of the first Landsat satellite, panchromatic aerial photographs were taken over São Paulo…”

P5L27 - Since SRTM is a surface model and not a terrain model, ruggedness could be influenced by canopy characteristics.

FW : As the spatial resolution of SRTM is of 30 m and the ruggedness computed on a moving windows of 3x3 pixel (90m), I don’t expect so much influence of the canopy characteristics and the forest region is really not flat, more like a succession of hills, so on 90 m, I expect more variation of elevation from the terrain that from the canopy.

P8L2 - How does this compare with conventional supervised/unsupervised image classification methods?

FW : It is important to point out that the result of the U-net model already gives the segmentation of the trees. In traditional non-deep learning approaches, we can separate two methods that could be used for the species identification, object-based image analysis or pixel-based analysis. Both methods need more than one step to produce the species map.

First, pixels-based analysis will likely fail because in the crown of Cecropia or Tibouchina there is a large variability of reflectance, see Figure 5 of the paper for Tibouchina (or Figure 3 of of the paper https://zslpublications.onlinelibrary.wiley.com/doi/full/10.1002/rse2.111 for the Cecropia). For example, in the Cecropia crowns there are bright pixels and other are darks. This is due to the architecture of the tree. Even if the method could find the bright pixels that we could attribute to the Cecropia, then we would have to find a method to group them as a crown, which mean a human based threshold, which is likely more subjective that the U-net approach, and could never reach the accuracy of Unet.

Second, for object-based image analysis, we would also have to make two steps, the first is the segmentation of the individual tree crowns (ITCs) and the second is the identification of the species of these crowns. There is no non-deep learning individual tree crown segmentation method that can reach level of accuracy above 90%, even with hyperspectral and lidar data combined [Tochon et al., 2015; Dalponte et al., 2014; Singh et al., 2015]. Furthermore, Dalponte et al. (2014) reported that large and small crowns density tend to be underestimated when optical images are used to delineate ITCs. And after this uncertain tree crown segmentation, it remains to predict the species. There are absolutely no chances for a non-deep learning algorithm to reach accuracies obtained here with the U-net model for image segmentation, and this has already been demonstrated by the work of [Huang et al., 2018], paper which is cited in introduction.

References :

M. Dalponte, H. O. Orka, L. T. Ene, T. Gobakken, E. Naesset, Tree crown delineation and tree species classi_cation in boreal forests using hyperspectral and ALS data, Remote Sensing of Environment 140 (2014) 306-317.

G. Tochon, J. Fret, S. Valero, R. Martin, D. Knapp, P. Salembier, J. Chanussot, G. Asner, On the use of binary partition trees for the tree crown segmentation of tropical rainforest hyperspectral images, Remote Sensing of Environment 159 (2015) 318-331.

M. Singh, D. Evans, B. S. Tan, C. S. Nin, Mapping and characterizing selected canopy tree species at the angkor world heritage site in Cambodia using aerial data, PLOS ONE 10 (2015) 1-26.

Huang, B.; Lu, K.; Audebert, N.; Khalel, A.; Tarabalka, Y.; Malof, J.; Boulch, A.; Le Saux, B.; Collins, L.; Bradbury, K.; Lefèvre, S.; El-Saban, M. Large-scale semantic classification: outcome of the first year of Inria aerial image labeling benchmark. IEEE International Geoscience and Remote Sensing Symposium – IGARSS 2018.

P10L5 – I suggest that a sample analysis demo with the complete code be prepared and shared for this study.

FW : agreed and done. The U-net code for training and prediction with a simulated WorldView image have been made available on a public repository here: https://doi.org/10.5281/zenodo.3601503. I also have made available the maps of Cecropia and Tibouchina dominances and natural forest cover here : https://doi.org/10.5281/zenodo.3601487

P10L9 - Why not also use other low resolution forest change maps in addition (e.g. Hansen et al. 2013)

FW : Mainly because (i) the WorldView image is not orthorectified so depending on the elevation and the satellite view angle the low and high resolution does not match spatially very well, I will have to orthorectified the WorldView images first to match the low resolution data which is a lot of work and in comparison of making my own map and (ii) I can compute the accuracy on my own map, while we don’t know so much about the Hansen data accuracy for this region and finally (iii) low resolution with 30 m will lead to more error on estimation of the forest fragment size, with high resolution I can do far better.

P12L8 - The data should be better presented, in order to show how forest cover (natural and plantations) has changed (i.e. loss or gained) from 1962 to 2017.

FW : It is not tree cover change (such as previously written in the title of section 3.2.1 but have been corrected now) but only the past history of the forest mapped in 2017. It is the difference between the polygons of forests in 2017 and the polygons of forests in 1962 that were still forests in 2017. To map the past forests, I have made a first mask of the forest in 2017 and then only delineate forests in 1962 inside this mask. So there is only 3 classes, which are the classes of the Figure 6: old forests, new forests, and eucalyptus forest. And, all Eucalyptus plantation of the region were all made after 1962. That was already a huge work of manually drawing the past forest segments and I didn’t do that for loss because it was out of the scope of this paper, which is more about determining the degradation history of exiting forest fragments. I have now rewritten the method and change the name of the section 3.2.1 to clarify this. The paragraph is now:

“3.2.1 Past history of the 2017 forest cover

First, the mask of natural/planted forest in 2017 was produced with the U-net at 0.5 m. Second, the past history of the 2017 forest cover, that is, if they were present or not in 1962, was mapped. To do this, only for the forests that were mapped in 2017, the mask of the forests in 1962 was produced with the registered aerial images of 1962. This map was produced in QGIS and resampled in a raster tile at 0.5 m spatial resolution (QGIS Development Team, 2009). The history of the 2017 forest cover was then described in the "Rio do Chapeu" region. With these two maps, we were ……”.

P12L14 - Probably the fact that these remaining forests are now edge-dominated fragments?

FW : sorry but I am not sure to understand this comment. The result P12L14 is to compare to the Tibouchina results: ~ 90% of the Cecropia are located inside or a at maximum of 50 m of an old forest and ~90% of the Tibouchina are located inside or at a maximum of 50 m of a new forest.

Fig.6 - This could be better presented with a "forest change" color scheme similar to Hansen (unchanged, gain, loss in relation to 1962).

FW : As in the response of one of your previous comment, the classes are not forest change but only past history of the forests existing in 2017. I have corrected the paragraph 3.2.1 to clarify this.

Fig.7 – Same as Fig. 6.

FW : Same as previous comment.

P17L7 - Would be interesting to see how the species are located in terms of edge distance.

FW : The complete analysis of the distance to the edge with the same Cecropia map has been published in 2019 in Remote Sensing in Ecology in Conservation, see here: https://zslpublications.onlinelibrary.wiley.com/doi/full/10.1002/rse2.111

P20L25 - This may be related to fragmental collapse.

FW: Yes, it might but unfortunately, I don’t have the results to support this assumption. Could be great to have LiDAR data of the fragment to test for this.

References

Chaplin-Kramer, Rebecca, Ivan Ramler, Richard Sharp, Nick M Haddad, James S Gerber, Paul C West, Lisa Mandle, et al. 2015. “Degradation in Carbon Stocks near Tropical Forest Edges.” Nature Communications 6: 10158. https://doi.org/10.1038/ncomms10158.

Dantas De Paula, Mateus, Jürgen Groeneveld, and Andreas Huth. 2016. “The Extent of Edge Effects in Fragmented Landscapes: Insights from Satellite Measurements of Tree Cover.” Ecological Indicators 69: 196–204. https://doi.org/10.1016/j.ecolind.2016.04.018.

Hansen, M. C., P. V. Potapov, R. Moore, M. Hancher, S. A. Turubanova, A. Tyukavina, D. Thau, et al. 2013. “High-Resolution Global Maps of 21st-Century Forest Cover Change.” Science 342 (6160): 850–53. https://doi.org/10.1126/science.1244693.

Laurance, William F., José L.C. Camargo, Regina C.C. Luizão, Susan G Laurance, Stuart L Pimm, Emilio M Bruna, Philip C Stouffer, et al. 2011. “The Fate of Amazonian Forest Fragments: A 32-Year Investigation.” Biological Conservation. https://doi.org/10.1016/j.biocon.2010.09.021.

Lôbo, Diele, Tarciso Leão, Felipe P L Melo, André M M Santos, and Marcelo Tabarelli. 2011. “Forest Fragmentation Drives Atlantic Forest of Northeastern Brazil to Biotic Homogenization.” Diversity and Distributions 17 (2): 287–96. https://doi.org/10.1111/j.1472-4642.2010.00739.x.

Melito, Melina, Jean Paul Metzger, and Alexandre A. de Oliveira. 2018. “Landscape-Level Effects on Aboveground Biomass of Tropical Forests: A Conceptual Framework.” Global Change Biology 24 (2): 597–607. https://doi.org/10.1111/gcb.13970.

Melo, Felipe P.L., Víctor Arroyo-Rodríguez, Lenore Fahrig, Miguel Martínez-Ramos, and Marcelo Tabarelli. 2013. “On the Hope for Biodiversity-Friendly Tropical Landscapes.” Trends in Ecology & Evolution 28 (8): 462–68. https://doi.org/10.1016/j.tree.2013.01.001.

Pettorelli, Nathalie. 2015. “Agree on Biodiversity Metrics to Track from Space.” Nature 523: 5–7. https://doi.org/10.1038/523403a.

Rodrigues, Ricardo Ribeiro, Sergius Gandolfi, André Gustavo Nave, James Aronson, Tiago Egydio Barreto, Cristina Yuri Vidal, and Pedro H.S. Brancalion. 2011. “Large-Scale Ecological Restoration of High-Diversity Tropical Forests in SE Brazil.” Forest Ecology and Management 261 (10): 1605–13. https://doi.org/10.1016/j.foreco.2010.07.005.

Decision Letter 1

Judi Hewitt

7 Feb 2020

Mapping Atlantic rainforest degradation and regeneration history with indicator species using convolutional network

PONE-D-19-23068R1

Dear Dr. Wagner,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Judi Hewitt

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Judi Hewitt

10 Feb 2020

PONE-D-19-23068R1

Mapping Atlantic rainforest degradation and regeneration history with indicator species using convolutional network

Dear Dr. Wagner:

I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

For any other questions or concerns, please email plosone@plos.org.

Thank you for submitting your work to PLOS ONE.

With kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Judi Hewitt

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Distribution of the natural forests, excluding the largest fragment at the south, and non forest (drawn from a random distribution) in relation to elevation and five elevation-related variables at the regional scale.

    The five elevation related indices consider the eight neighbor pixels and are slope (°); aspect (compass direction facing slope in °, where 0 is the North); topographic position index (elevation of the pixel in relation to the mean elevation of the neighbour pixels); terrain ruggedness index (sum of the change in elevation between the pixels and its 8 neighbors cells); and, finally roughness (difference between the maximum and the minimum value of a cell and its 8 surrounding cells) [31, 32].

    (PDF)

    S2 Fig. Comparison of the distribution of the forests with or without T. pulchra dominance in relation with elevation-related variables over the WorldView-2 image extent.

    Elevation-related variables were extracted for T. pulchra dominance polygon centroids (n = 461355) and from randomly distributed natural forests points without T. pulchra dominance (n = 50000).

    (PDF)

    S3 Fig. Comparison of the distribution of the forests with or without Cecropia hololeuca dominance in relation with elevation and elevation-related variables over the WorldView-3 image extent.

    Elevation-related variables were extracted for C. hololeuca dominance polygon centroids (n = 169542) and from randomly distributed natural forests points without C. hololeuca dominance (n = 11717).

    (PDF)

    S4 Fig. Comparison of the distribution of the forests with or without Cecropia hololeuca dominance in relation with annual mean of climate variables over the WorldView-3 image extent.

    Climate variables were extracted for C. hololeuca dominance polygon centroids (n = 169542) and from randomly distributed natural forests points without C. hololeuca dominance (n = 11717).

    (PDF)

    Data Availability Statement

    The Worldview images and the 1962 aerial images are owned by DigitalGlobe and Emplasa, respectively, and cannot be shared publicly. However, a code example of the U-net model with a simulated Worldview image for reproducibility is available (https://doi.org/10.5281/zenodo.3601503) and the data of the maps presented in the Fig 8 are available on a public repository (https://doi.org/10.5281/zenodo.3601487). The authors confirm they had no special access or privileges to the Worldview images that other researchers would not have.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES