Skip to main content
Ecology and Evolution logoLink to Ecology and Evolution
. 2014 Dec 2;4(24):4658–4668. doi: 10.1002/ece3.1273

MODISTools – downloading and processing MODIS remotely sensed data in R

Sean L Tuck 1, Helen RP Phillips 2,3, Rogier E Hintzen 2,3, Jörn PW Scharlemann 4, Andy Purvis 2,3, Lawrence N Hudson 3
PMCID: PMC4278818  PMID: 25558360

Abstract

Remotely sensed data – available at medium to high resolution across global spatial and temporal scales – are a valuable resource for ecologists. In particular, products from NASA's MODerate-resolution Imaging Spectroradiometer (MODIS), providing twice-daily global coverage, have been widely used for ecological applications. We present MODISTools, an R package designed to improve the accessing, downloading, and processing of remotely sensed MODIS data. MODISTools automates the process of data downloading and processing from any number of locations, time periods, and MODIS products. This automation reduces the risk of human error, and the researcher effort required compared to manual per-location downloads. The package will be particularly useful for ecological studies that include multiple sites, such as meta-analyses, observation networks, and globally distributed experiments. We give examples of the simple, reproducible workflow that MODISTools provides and of the checks that are carried out in the process. The end product is in a format that is amenable to statistical modeling. We analyzed the relationship between species richness across multiple higher taxa observed at 526 sites in temperate forests and vegetation indices, measures of aboveground net primary productivity. We downloaded MODIS derived vegetation index time series for each location where the species richness had been sampled, and summarized the data into three measures: maximum time-series value, temporal mean, and temporal variability. On average, species richness covaried positively with our vegetation index measures. Different higher taxa show different positive relationships with vegetation indices. Models had high R2 values, suggesting higher taxon identity and a gradient of vegetation index together explain most of the variation in species richness in our data. MODISTools can be used on Windows, Mac, and Linux platforms, and is available from CRAN and GitHub (https://github.com/seantuck12/MODISTools).

Keywords: Conservation biology, earth observation, global change, land processes, macroecology, PREDICTS, remote-sensing, satellite imagery

Introduction

Remote-sensing and ecology

Remotely sensed data – available at global scales at a fine to medium resolution, and often for free – are increasingly being used in ecology (Donoghue 2002; Kerr and Ostrovsky 2003; Bai et al. 2008; Chawla et al. 2012; Naeem et al. 2012; Sutherland et al. 2013). In particular, products derived from NASA's MODerate-resolution Imaging Spectroradiometer (MODIS) instruments (Justice et al. 1998) are a unique resource for research in many areas of ecology, conservation biology, and global change research (Table 1). Remote-sensing instruments measure physical, geochemical, and biological processes that allow us to evaluate the environment in a scalable way: from characterizing the distribution of a species or phenology of a plant community, to recording land cover change, natural or human-caused, from the ecosystem up to the global level. Several MODIS data products have been used to investigate land surface temperatures, monitor intensification of croplands, estimate volume of timber stocks, and assess change in the magnitude and frequency of fires (Table 1). Whilst it is beyond our scope to review in detail all ecological questions to which MODIS data have been applied, we discuss the relevance of MODIS data to monitoring global biodiversity change as this will be pertinent to our example analysis.

Table 1.

Moderate-resolution Imaging Spectroradiometer products available for download using MODISTools, and examples of their use. Potential summary measures are techniques that are currently used in the literature and may be incorporated into MODISTools in future releases. Several other methods have been proposed to summarize and thereby reduce the serial correlation in time series of remotely sensed data, including principal components analysis (PCA; (Eastman and Fulk 1993), temporal Fourier processing (e.g., Scharlemann et al. 2008), and simple metrics such as bioclimatic variables (e.g., BIOCLIM variables; Xu and Hutchinson 2011)

Code Product Examples of product use Potential summary measures
MOD09/MYD09 Surface reflectance Spatiotemporal distribution of rice phenology (Sakamoto et al. 2006) NA – reflectance data from which many measures can be derived
Monitoring intensification of croplands (Galford et al. 2008)
Land cover mapping of Germany (Colditz et al. 2011)
MOD11/MYD11 Surface temperature and emissivity Investigating the relationship between land surface temperature and vapor pressure deficit (Hashimoto et al. 2008) Degree days; length of period above temperature threshold
Calculating air surface temperature using remotely sensed data and meteorological data (Benali et al. 2012)
MOD43/MCD43 Nadir BRDF-adjusted Reflectance Land cover mapping of South Africa (Colditz et al. 2011) NA – reflectance data from which many measures can be derived
Studying vegetation phenology in the United States (Zhang et al. 2003)
MOD13/MYD13 Vegetation Indices Crop-related LULC classification in the U.S. Central Great Plains (Wardlow et al. 2007) Phenological measures (eg. season shift index); change vector magnitude; integrated vegetation indices
GPP estimates for biomes across the conterminous US (Xiao et al. 2010)
Quantifying tree cover in an African grass savanna (Gaughan et al. 2013)
MOD15 LAI/FPAR Comparison of products suitable for vegetation phenology (Ahl et al. 2006) Phenological metrics; annually integrated LAI/FPAR
GPP estimates for biomes across the conterminous US (Xiao et al. 2010)
MOD17 Gross primary productivity (GPP), Net photosynthesis Calculating global terrestrial net primary production (Running et al. 2004) Total productivity; peak NPP; seasonality of GPP
Validation of NPP/GPP across multiple biomes (Turner et al. 2006)
MCD12 Land cover and change Estimation of timber volume (Nelson et al. 2009) Already functional in MODISTools
Presentation and validation of global land cover types (Friedl et al. 2002)

Global biodiversity is declining (Balmford et al. 2003; Gaston et al. 2003; Collen et al. 2009; Tittensor et al. in press) and is predicted to continue to decline at unprecedented rates (Pereira et al. 2010; Tittensor et al. in press). Tackling this problem requires global biodiversity monitoring, such as (Collen et al. 2009; Pereira et al. 2010) which have analyzed indicators of global biodiversity change over recent decades. At global spatial scales, the time and cost involved make field-based monitoring near impossible. Remotely sensed data with global coverage and continuous, frequent measurements over periods exceeding the lifespan of most ecological studies can help bridge this gap in biodiversity monitoring; consequently, there has been an increase in the use of remotely sensed data to project biodiversity over large extents (Fuller et al. 1998; Kerr and Ostrovsky 2003; Turner et al. 2003; Pettorelli et al. 2005; Lassau and Hochuli 2008; St-Louis et al. 2014). Combining remotely sensed data with biodiversity observations using meta-analysis (Gibson et al. 2011) or synthetic analysis (Newbold et al. 2014) has become more fruitful with advances in remote-sensing instruments, data archives, and error correction methods such as the bidirectional reflectance distribution function (BRDF) that removes directional effects of view angle and illumination (Nicodemus et al. 1977). Importantly, these data can now be accessed at spatial and temporal scales similar to ecological field data (Kerr and Ostrovsky 2003).

MODIS remotely sensed data

MODIS provides data with global coverage at 250 m2, 500 m2, and 1 km2 spatial resolutions collected twice daily by NASA's Terra and Aqua satellites since the year 2000. Raw data from the MODIS sensors are composited to daily, 8 day, 16 day, and yearly imagery and preprocessed into discipline-specific MODIS products for atmospheric, oceanic, or land process applications. The Land Processes Distributed Active Archive Center (LP DAAC – https://lpdaac.usgs.gov) holds MODIS data on metrics of land processes such as surface reflectance, land cover/land cover change, land surface temperature and emissivity, vegetation indices (Normalized Difference Vegetation Index, NDVI, and Enhanced Vegetation Index, EVI), leaf area index and fPAR (fraction of photosynthetically active radiation), evapotranspiration, net photosynthesis, and primary productivity (https://lpdaac.usgs.gov/products/modis_products_table; examples of their usage are shown in Table 1).

Many ecological studies, such as meta-analyses, observation networks or globally distributed networks of experiments, combine information from large numbers of locations. Such studies allow comparison of patterns across space and time and estimation of global responses. They can help to identify and understand generalities (Borer et al. 2014), and synthesize the literature to predict the responses of ecological communities to global change (Newbold et al. 2013). The global coverage of remotely sensed data presents a valuable resource for global ecological studies but the existing MODIS access mechanisms make it hard to obtain the relevant data.

Despite the increasingly common use of MODIS data, there is a burden on the investigator to access, download, and store them. The Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC) provides online access to spatially and temporally delimited subsets (ORNL DAAC 2014). The user can download one subset at a time, via email, after manually inputting subset definitions each time. Manual input of spatial and temporal limits can be both error-prone and time-consuming. A web service is available (http://daac.ornl.gov/MODIS/MODIS-menu/modis_webservice.html) that can facilitate automation but this requires familiarity with protocols and languages such as SOAP (Simple Object Access Protocol; Gudgin et al. 2007) and XML (Extensible Markup Language), which ecologists are typically not familiar with, are time-consuming to learn and shift attention away from science.

Utilizing MODIS data with MODISTools

MODISTools – a package for the R Statistical Computing Language (R Core Team 2014) – allows MODIS data for multiple locations, time periods and products to be downloaded, and stored using a single line of R code. Downloaded data are stored in a simple, memory-efficient way that can be easily retrieved and manipulated. MODISTools provides functions for processing downloaded data and merging these data with the user's ecological response data, making it possible to apply MODIS data to research questions with minimal effort. By avoiding the time-consuming and often error-prone manual steps, the package simplifies access to MODIS data, thereby increasing research efficiency. MODISTools completes all functioning within the R environment without requiring external software.

We are aware of one other existing R package, MODIS, that downloads MODIS data (Mattiuzzi et al. 2013). This is geared toward use of data in a geographic information system (GIS), which often requires the user to download and interact with additional software, such as the MODIS Reprojection Tools (LP DAAC 2014). MODISTools completes all functioning within the R environment without requiring external software. The MODIS R package provides useful functions to download MODIS data at one or a few locations as raster files for use in GIS. Many users of MODIS data will require some data processing steps in a GIS environment, for example, extracting a complex shape, such as the boundary of a country or national park, from the downloaded rectangular MODIS tile. In our experience of ecological modeling, an ideal format for downloaded MODIS data has been in ASCII plain text format, which can be readily downloaded from the ORNL DAAC MODIS web service. Although MODISTools does not itself contain functions for GIS processing steps, it connects to a GIS environment efficiently by providing a function for creating ASCII raster grids from the downloaded files. These new files are provided with the correct MODIS projection datum (PRJ file) that allows them to be easily imported into standard GIS software.

Example analysis

We demonstrate the utility of MODISTools applying MODIS data to ecological questions, particularly studies involving many sites, by conducting an illustrative analysis of our own. We analyzed species richness data from the literature, investigating the relationship between local species richness in temperate regions and spatiotemporally matched vegetation indices. We show every step from downloading data, through data processing, and analysis; this demonstrates the value that MODISTools provides in each step of the method and how it can aid researchers to apply MODIS data to ecological questions in general.

Many studies have reported a positive correlation between species richness and aboveground net primary productivity, although the relationship might not be linear depending on the spatial scale (Gaston 2000). Previous studies have established a relationship between aboveground net primary productivity and vegetation indices (Reed et al. 1994; Waring et al. 2006): vegetation indices describe the greenness of the vegetation and the Normalized Difference Vegetation Index (NDVI) is one of the most commonly used (Pettorelli et al. 2005). NDVI is calculated using red and near-infrared wavelengths of light that are reflected and captured by the satellite. Chlorophyll will absorb red light, while the mesophyll structure of a leaf will scatter reflected near-infrared light. Therefore, if the proportion of reflected near-infrared light captured is greater than red light captured this represents a signal of vegetation. NDVI values increase from 0 to 1 as the amount of vegetation increases, whilst negative values indicate an absence of vegetation (Myneni et al. 1995). An extension of NDVI, the Enhanced Vegetation Index (EVI), adjusts for atmospheric aerosol interference and improves sensitivity so values do not saturate in areas of high biomass (Huete et al. 2002). Because of their relationship with aboveground net primary productivity, NDVI and EVI are often used as predictors for species richness, with good results (Hurlbert and Haskell 2003; Seto et al. 2004; Waring et al. 2006). Time series, rather than one off measurements, of vegetation indices are required to capture vegetation dynamics relevant to the species richness data, and to minimize the contamination of this signal of vegetation dynamics due to noise.

Normalized Difference Vegetation Index is publicly available from various data sets; the earliest was collected from the National Oceanic and Atmospheric Administration's Advanced Very High Resolution Radiometer (NOAA-AVHRR), with data extending back to 1981 (James and Kalluri 1994). EVI has been proposed comparatively recently, but MODIS products provide global data for both indices. As with the multitude of other products available through the LP DAAC MODIS archive, these vegetation indices can be summarized in many ways to produce measures relevant to ecological study (see Pettorelli et al. 2005 for a review of NDVI measures).

Here, we focus on three measures of NDVI and EVI time series that are calculated by MODISTools: the maximum value in a time series, the temporal mean, and the temporal variability (Fig. 1). The time series were smoothed with temporal interpolation, using MODISSummaries, prior to calculating these summary statistics. Smoothing reduces noise in the time series due to cloud cover, or high solar or scan angles (Pettorelli et al. 2005). The temporal variability can be characterized as the average temporal variation of vegetation index within a site, factoring in the minimum value: it is the time-averaged difference between the total area under the time series and the area under the minimum value in the time series. Temporal variability is of interest when comparing among sites as the average variation in a vegetation index can be equal when the mean is not, and vice versa. In this context, temporal variability can be seen as a measure of seasonality; higher temporal variability indicates increased change in vegetation cover from one season to another. It could alternatively represent offtake, with a higher temporal variability indicating more harvesting at a site.

Figure 1.

Figure 1

Time series of NDVI following a QualityScreen, as produced by an optional argument DiagnosticPlots in MODISSummaries, that illustrates the relationship between the three measures used: maximum time-series value, temporal mean, and temporal variability. On the y-axis is NDVI at 250 m2 resolution (the axis label is the data band name for this Science Data Set). On the x-axis is time, with 16-day regular intervals. This time series produces a temporal variability value of 0.4. Variability is defined in the introduction of the main text. Upper black dashed line indicates the maximum value in the time series and the lower black dashed line indicates the minimum value in the time series. The solid red line indicates the interpolated values of the NDVI. Red dashed line indicates the mean of these values.

These summary measures of NDVI and EVI were the explanatory variables that we related to local species richness across temperate forests globally. The diversity data were sampled from the PREDICTS database (http://www.predicts.org.uk – Newbold et al. 2014; Hudson et al. unpublished data), a compilation of data from studies worldwide that have measured diversity (of one or more species) over multiple habitat types. Data were collected from published papers and, when necessary, subdivided further into studies with distinct methodologies. Within each study, there were multiple sites where diversity (in this case species richness) was measured. Our methods detail the utility that MODISTools provided at each stage. We then discuss our results and reflect on the benefits of using MODISTools for such research and more broadly. The code used to analyze the data and a dummy data set, generated from the real data we have analyzed, are available for reproducing the entire workflow presented (see Table S1, Appendices S1S4).

Method

Data preparation

A subset of diversity data from the PREDICTS database was extracted using the following criteria: (1) studies were within the Temperate Broadleaf and Mixed Forests Biome (as determined by the WWF biome layer, Olson et al. 2001); (2) the studies sampling period ended post 2000 (to ensure MODIS data were available); (3) studies contained more than 20 sites; and (4) studies captured a diversity metric that could be converted to species richness. The resulting data set contained 526 sites from eight different studies (Brunet et al. 2011; Buczkowski 2010; Chapman and Reich 2007; Lantschner et al. 2008; Lentini et al. 2012; Su et al. 2011; Weller and Ganzhorn 2004; Winfree et al. 2007 – Magura et al. 2010 was also considered but removed from analysis due to small sample size).

Normalized Difference Vegetation Index and EVI were downloaded for each of the 526 sites using the MODISSubsets function. MODISSubsets fetches data from the ORNL DAAC MODIS Land Product Subsets web services. The input to the function is an R data.frame or a text file specifying all sites where diversity data were collected (ST1). For each site, this data frame must specify lat/long coordinates and dates for the time series requested. End dates and optionally start dates for each time series can be specified; or, just end dates can be given with an optional time-series length that MODISSubsets will consistently download for when the timespan of preceding data permits. Specific column names are necessary for MODISSubsets to find the information it needs: “lat,” “long,” “end.date,” and optionally “start.date”. The input coordinates must be in decimal degrees format with WGS84 datum. As with any spatial data, ensuring the coordinates consistently conform to the intended datum is imperative for correctly specifying locations. MODISTools provides the ConvertToDD function to reformat coordinates into decimal degrees from either degrees minutes seconds or degrees decimal minutes. The dates can be specified in two different formats: (1) years, in which case data will be downloaded from the beginning of the year, or (2) POSIX formatted dates (YYYY-MM-DD) that allow more precise requests.

Once the time series coordinates and dates have been specified, the user must delimit the area of interest around each location, defined as the distance in kilometers around each location. Two directions must be specified, firstly in the north/south direction from the focal pixel (pixel identified by the coordinate supplied), and secondly the east/west direction. Using the values (0,0) would result in data for the focal pixel only. The values (1,1) identify an area of interest that, in addition to the focal pixel, includes surrounding pixels for 1 km in each direction. The maximum area of interest surrounding a location that can be retrieved using the ORNL DAAC MODIS web service is (100,100). This two-dimensional input means rectangles of any size smaller than the maximum area of interest can be specified. The number of pixels included in a request depends on the resolution of the data requested: for NDVI data at 250 m2 resolution, a 2.25 × 2.25 km area, downloaded using values (1,1), will contain 81 pixels.

Users must also state what MODIS data product is required. Each data “product” released contains many variables that are stored in separate “Science Data Sets,” also known as data “bands.” For example, the EVI and NDVI bands that we require can be found in the vegetation indices product (MOD13Q1), alongside other variables and indicators of pixel reliability. Functions GetProducts and GetBands help specify the desired data type.

Download the data

We downloaded NDVI and EVI at 250 m2 resolution as well as pixel reliability for these data to allow quality control before analysis. The size of area of interest used to relate vegetation indices to our diversity data may have an impact on the result. It may be that the most local conditions sufficiently explain species richness, or that the surrounding area must also be captured. We analyzed the data at both the focal pixel (Size = c(0,0)) and 6.25 × 6.25 km tile (Size = c(3,3)) scales and compared which best explained variation in species richness. The function ExtractTile can extract subsets from larger areas of interest to avoid duplicated downloads. To ensure all estimates are based on the same length of time series, we requested 3 years prior to the species richness sampling date at each site (year of sampling date plus TimeSeriesLength argument). The following line of code completes the download for the request described:

MODISSubsets(LoadDat = PREDICTS, Products =

“MOD13Q1”, Bands = c("250m_16_days_NDVI",

“250m_16_days_EVI”,

“250m_16_days_pixel_reliability”), Size = c(3,3),

StartDate = FALSE, TimeSeriesLength = 2)

For each site downloaded MODISSubsets saves a text file (ASCII format, comma separated, no header), with a log file listing all the unique downloads and their download status (see Table 2 for an explanation of downloaded data structure). The data saved to these files can be easily read back into R using MODISTimeSeries. If the input data set contains unique identifiers (IDs) for each site, these will be used as file names; if not, the function will generate unique IDs itself and use these. All files will be stored in a user-specified directory (working directory by default), and this directory path will be printed in R prior to downloading. The download speed primarily depends on server traffic at the ORNL DAAC MODIS web service, and to a lesser extent on internet connection and size of data set being downloaded. Speed can be highly variable for even the same data set. We replicated the download of our 526 sites and on one occasion it took 2–3 h and another approximately 12 h (see Table 3 for performance metrics).

Table 2.

An explanation of the sections, for example, text string (in text), which shows the format of data subsets written in the ASCII text file outputs from MODISSubsets. These ASCII files can be read back into an R workspace using read.csv(“filename.asc”, header = FALSE, as.is = TRUE). The resulting data.frame would contain columns for each section described below and rows for each date in the time series

Section description Example
Number of tile rows 1
Number of tile columns 1
x-coordinate (MODIS datum longitude) for lower left corner of tile 13702705
y-coordinate (MODIS datum latitude) for lower left corner of tile –3709977
Pixel size (meters) 231.6564
Unique Identifier MOD13Q1.A2009001.h30v12.005.2009020003129.250m_16_days_EVI
Shortname code for the MODIS product requested MOD13Q1
Date code for this string of data, year and Julian day (A[YYYYDDD]) A2009001
Input coordinates and the width (Samp) and height (Line) in number of pixels of the tile surrounding the input coordinate Lat-33.3636449991Lon147.548402Samp1Line1
Date–time that MODIS data product was processed (YYYYDDDHHMMSS) 2009020003129
All values following are data for each pixel in the tile (n = Samp × Line), which are ordered by row. The number of columns should equal the number of pixels in the tile – in this case 1. See ?ExtractTile to rearrange the data back into spatially ordered tiles. 1567

Table 3.

Performance metrics for the downloading function, MODISSubsets, all times reported in seconds. The times reported here are for a simple subset request (one site, focal pixel only, for 1 year) using an example data.frame provided with MODISTools called SubsetExample. The effect of time-series length (3 years), tile size (2.25 × 2.25 km tile size – 81 pixels), and number of sites (four sites, using the MODISTools data.frame EndCoordinatesExample) on time taken to download is shown for multiple computers. The largest source of variation in download times will be traffic at the ORNL DAAC MODIS server, and internet connection

System Simple request Time-series length Tile size Number of sites
MacBook Air (2013) 51.962 119.687 32.177 173.033
 Processor: 1.3 GHz Intel Core i5
 Memory: 8 GB 1600 MHz DDR3
 Software: OSX 10.9.4
 Internet: up to 30 MB wireless
MacBook Pro 36.887 99.253 37.093 148.616
 Processor: 2.4 GHz Intel Core 2 Duo
 Memory: 4 GB 1067 MHz DDR3
 Software: OSX 10.6.8
 Internet: up to 90 MB wireless
MacBook Mini 47.387 112.134 39.210 158.606
 Processor: 2.6 GHz Intel Core i7
 Memory: 16 GB 1600 MHz DDR3
 Software: OSX 10.8.5
 Internet: Network cable

No internet service can guarantee 100% availability and MODISSubsets employs a strategy to work around temporary loss of availability of the ORNL DAAC MODIS server and to download as many of the requested subsets as possible. If MODISSubsets encounters a problem, such as a loss of connection, it produces a warning message and retries that subset until either it has been downloaded or 15 min have elapsed, after which time it attempts to download the next of the requested subsets. Any error and warning messages encountered during downloading are stored in the log file so they can be traced to the problematic time series. MODISSubsets then attempts a second download of any subsets that could not be fetched in the first pass, minimizing the amount of missing data in the final output. MODISSubsets writes each unique subset (i.e., combination of lat/long location and time-series start and end dates) to a single text file, minimizing download and storage load. The text files can easily be combined into a single text file (CSV format), along with the input ecological data, facilitating modeling (see Statistical analysis). UpdateSubsets can also be used to complete an unfinished download, or download for new sites added to a data set.

Process the data

The MODISSummaries function collates the downloaded data and computes the per-pixel time-series summary statistics. Prior to summarizing the data, it uses the pixel reliability indicator to filter out poor-quality and missing data (data with pixel reliability values > 0 were omitted). MODISSummaries then assembles the pixel level summary statistics with the user's data (PREDICTS in our example) that contain response variables and returns a data frame that can be used for statistical modeling with existing R modeling tools. Hence using two function calls, one to download data (MODISSubsets function call above) and the other to process data (MODISSummaries function call below), we generated new columns in our ecological data set that provided the explanatory variables for our analysis:

MODISSummaries(LoadDat = PREDICTS, Product =

“MOD13Q1", Bands = c(“250m_16_days_NDVI”,

“250m_16_days_EVI”), ValidRange = c(-2000,10000),

NoDataFill = -3000, ScaleFactor = 0.0001,

StartDate = FALSE, QualityScreen = TRUE,

QualityBand = “250m_16_days_pixel_reliability”,

QualityThreshold = 0, Max = TRUE, Mean = TRUE,

Interpolate = TRUE, Yield = TRUE)

Statistical analysis

We fitted generalized linear mixed effects models to the species richness data, using a poisson error distribution with a log link function as is appropriate for count data. To avoid collinearity among explanatory variables that are based on the same data (including EVI and NDVI variables, as EVI is an extension of NDVI), each variable was fitted in a different model. To account for homogeneity among sites within a study, we fitted the study grouping variable as a random effect. The fixed effects were the remotely sensed vegetation index measures, a higher taxon factor (Aves, Coleoptera, Hymenoptera, Pinopsida), and the interaction between these effects. Models were fitted using lme4 v1.1-6 in R (Bates et al. 2014). The R2 for the models was calculated using the R.squaredGLMM function from the MuMIn R package (Barton 2014). R2 is a good indication of the goodness of fit of the data to the model and a simple indicator of how well the model could be used for prediction (Nakagawa and Schielzeth 2013).

Results and Discussion

On average, species richness increased with increasing vegetation indices, as has been reported in previous studies (Waring et al. 2006). Our findings also corroborate previous results that have shown these responses to be taxa-specific (Bailey et al. 2004; Gibson et al. 2011; Newbold et al. 2013, 2014) and, although less frequent in our findings, scale-specific (Hurlbert and Haskell 2003). Overall, models at all scales produced similar R2: values range from 0.57 to 0.83, which suggests the models explain a large amount of variation in species richness and hold some predictive power. However, rather than being definitive results, these findings are preliminary and presented here primarily to highlight the use of MODISTools. Thus, only a subsection of the results are discussed in the text, but additional results and figures can be found in the Supporting Information (Appendices S3, S4, and Figs. S1, S2, S3).

Species richness among all higher taxa except Aves covaried positively with maximum NDVI at the focal pixel scale, but these responses were variable so the interaction term was retained (Fig. 2). The strongest relationships estimated were for Hymenoptera and Coleoptera. Although these slope estimates should be interpreted with caution, as our data set only included 22 sites for Coleoptera, previous studies have also shown strong positive correlations of beetle species to NDVI (Lassau and Hochuli 2008). Mean NDVI at both the small and large spatial scale showed a positive correlation with species richness for all taxa except Aves (Figs. S1 and S2). All higher taxa showed similar positive correlations with NDVI variability at the focal pixel scale (Fig. S1), but with varying intercepts; the interaction was retained, however, for NDVI variability at the larger scale.

Figure 2.

Figure 2

Responses of four taxa to changes in maximum NDVI. Aves (black line) showed no significant response to changes in maximum NDVI, while Coleoptera (purple line), Hymenoptera (yellow line) and Pinopsida (green line) showed a positive response to increasing maximum NDVI. Shaded areas indicate 95% confidence intervals.

In most cases, EVI and NDVI variables produced similar patterns, but the response of Pinopsida species richness to maximum value in a time series was dependent on the vegetation index used (Figs. S2 and S3). Unlike NDVI, EVI considers the nonlinear differences between the radiative transfer of red and near-infrared light through a canopy (Pettorelli et al. 2005). This may underlie the discrepancy between vegetation indices, as conifers produce a different tree architecture and canopy structure from deciduous trees. Leaf color may also be important: The bluish color found in adult trees of some species within Pinopsida would be incorporated into the EVI, which is calculated using blue reflectance, but not NDVI.

Despite previous work showing the contrary (see Koh et al. 2006; using 1 × 1 km NDVI derived from the SPOT-VGT imaging sensor), bird species richness was not significantly affected by changes in any of the vegetation index variables, at neither the small or large spatial scales. Seto et al. (2004) found that the relationship between bird species richness and vegetation index variables strengthened as the spatial extent considered increased, possibly due to birds having larger home ranges. Therefore, the lack of relationship we found within Aves may in part be caused by analyzing our remotely sensed variables at an inappropriate spatial extent.

Conclusions

The benefits of the automated approach that MODISTools takes are valuable. All 526 subsets of data, for the entire timespan of interest, can be retrieved using one line of R code. Using any other method that cannot automate the process over subsets, such as the email service using the online tool, would require 526 independent subset requests to be input manually by the user: researcher time and opportunity for error would greatly increase. MODISTools provides a more scalable approach, so the benefit of using MODISTools increases with the number of subsets requested. Our tool was equally beneficial for the data processing steps (without using a GIS environment) by producing our MODIS derived variables and matching them to the species richness data, ready for ecological modeling.

The package is being actively developed, and future releases will include more product-specific post-download processing. Summaries suitable for vegetation indices, such as time-integrating vegetation indices, are already available but these will be expanded to include a range of summary functions for a wider set of MODIS products, for example, temporal Fourier processing (Scharlemann et al. 2008). Processing will become more flexible by allowing the user to write custom summary functions. Additional methods for quality control of time-series data will also be included, allowing interpolation of missing values using adjacent time-series data (Rowhani et al. 2008). The interaction between MODISTools and spatial processing tools, in R and GIS environments, can also be further expanded.

The current stable release of MODISTools is available on CRAN (http://cran.r-project.org/web/packages/) – the usual mechanism for making R packages readily available – and can be installed by running: install.packages(“MODISTools”). The latest “in-development” version of MODISTools can be accessed at https://github.com/seantuck12/MODISTools. For a more in-depth walkthrough to downloading and using MODIS data with MODISTools, see the associated vignette by installing the package and then entering vignette(“UsingMODISTools”) into R.

Acknowledgments

We thank NASA LP DAAC for making MODIS data freely available and ORNL DAAC for providing the MODIS Land Product Subsets web service. SLT was supported by UK Natural Environment Research Council (NERC) grant NE/K500811/1. HRPP was supported by the Hans Rausing Scholarship. JPWS, AP, and LNH were supported by UK Natural Environment Research Council (NERC) grant NE/J011193/1. This study is a contribution from the Imperial College Grand Challenges in Ecosystems and the Environment initiative. Thanks to Tim Newbold for supplying code for plotting the generalized mixed effects models, Stewart Jennings for testing and providing feedback on the package, and two anonymous reviewers for constructive feedback on the manuscript.

Conflict of Interest

None declared.

Supporting Information

Additional Supporting Information may be found in the online version of this article:

Appendix S1. R script for downloading MODIS vegetation indices for analysis.

ece30004-4658-sd1.R (3.6KB, R)

Appendix S2. R function to create a vector of weighted means the same order as a vector.

ece30004-4658-sd2.R (1.6KB, R)

Appendix S3. R script for analysing EVI and NDVI data for a single pixel.

ece30004-4658-sd3.R (14.4KB, R)

Appendix S4. R script for analysing EVI and NDVI data for a 6×6 km area of interest.

ece30004-4658-sd4.R (22.9KB, R)

Figure S1. Response of species richness to vegetation indices at the focal pixel scale.

ece30004-4658-sd5.pdf (48KB, pdf)

Figure S2. Response of species richness to NDVI at the 6.25×6.25 km scale using spatially-weighted and unweighted data.

ece30004-4658-sd6.pdf (54.4KB, pdf)

Figure S3. Response of species richness to EVI at the 6.25×6.25 km scale using spatially-weighted and unweighted data.

ece30004-4658-sd7.pdf (50.5KB, pdf)

Table S1. Dummy ecological data set for conducting analysis.

ece30004-4658-sd8.csv (33.5KB, csv)

References

  1. Ahl DE, Gower ST, Burrows SN, Shabanov NV, Myneni RB, Knyazikhin Y. Monitoring spring canopy phenology of a deciduous broadleaf forest using MODIS. Remote Sens. Environ. 2006;104:88–95. [Google Scholar]
  2. Bai ZG, Dent DL, Olsson L, Schaepman ME. Proxy global assessment of land degradation. Soil Use Manag. 2008;24:223–234. [Google Scholar]
  3. Bailey SA, Horner Devine MC, Luck G, Moore LA, Carney KM, Anderson S, et al. Primary productivity and species richness: relationships among functional guilds, residency groups and vagility classes at multiple spatial scales. Ecography. 2004;27:207–217. [Google Scholar]
  4. Balmford A, Green RE, Jenkins M. Measuring the changing state of nature. Trends Ecol. Evol. 2003;18:326–330. [Google Scholar]
  5. Barton K. 2014. MuMIn: Multi-model inference. R package version 1.10.0. Retrieved from http://CRAN.R-project.org/package=MuMIn.
  6. Bates D, Maechler M, Bolker BM, Walker S. 2014. lme4: Linear mixed-effects models using Eigen and S4. R package version 1.1-6. –89. Retrieved from http://CRAN.R-project.org/package=lme4.
  7. Benali A, Carvalho AC, Nunes JP, Carvalhais N, Santos A. Estimating air surface temperature in Portugal using MODIS LST data. Remote Sens. Environ. 2012;124:108–121. [Google Scholar]
  8. Borer ET, Harpole WS, Adler PB, Lind EM, Orrock JL, Seabloom EW, et al. Finding generality in ecology: a model for globally distributed experiments (R. Freckleton, Ed.) Methods Ecol. Evol. 2014;5:65–73. [Google Scholar]
  9. Brunet J, Valtinat K, Mayr ML, Felton A, Lindbladh M, Bruun HH. Understory succession in post-agricultural oak forests: habitat fragmentation affects forest specialists and generalists differently. For. Ecol. Manage. 2011;262:1863–1871. [Google Scholar]
  10. Buczkowski G. Extreme life history plasticity and the evolution of invasive characteristics in a native ant. Biol. Invasions. 2010;12:3343–3349. [Google Scholar]
  11. Chapman KA, Reich PB. Land use and habitat gradients determine bird community diversity and abundance in suburban, rural and reserve landscapes of Minnesota, USA. Biol. Conserv. 2007;135:527–541. [Google Scholar]
  12. Chawla A, Yadav PK, Uniyal SK, Kumar A, Vats SK, Kumar S, et al. Long-term ecological and biodiversity monitoring in the western Himalaya using satellite remote sensing. Curr. Sci. 2012;102:1143–1156. [Google Scholar]
  13. Colditz RR, Schmidt M, Conrad C, Hansen MC. Land cover classification with coarse spatial resolution data to derive continuous and discrete maps for complex regions. Remote Sens. Environ. 2011;115:3264–3275. [Google Scholar]
  14. Collen B, Loh J, Whitmee S, McRae L, Amin R, Baillie JEM. Monitoring change in vertebrate abundance: The Living Planet Index. Conserv. Biol. 2009;23:317–327. doi: 10.1111/j.1523-1739.2008.01117.x. [DOI] [PubMed] [Google Scholar]
  15. Donoghue D. Remote sensing: environmental change. Prog. Phys. Geogr. 2002;26:144–151. [Google Scholar]
  16. Eastman JR, Fulk M. Long sequence time-series evaluation using standardized principal components. Photogrammetric Engineering and Remote Sensing. 1993;59:991–996. [Google Scholar]
  17. Friedl M, McIver D, Hodges J, Zhang X, Muchoney D, Strahler A, et al. Global land cover mapping from MODIS: algorithms and early results. Remote Sens. Environ. 2002;83:287–302. [Google Scholar]
  18. Fuller RM, Groom GB, Mugisha S, Ipulet P, Pomeroy D, Katende A, et al. The integration of field survey and remote sensing for biodiversity assessment: a case study in the tropical forests and wetlands of Sango Bay, Uganda. Biol. Conserv. 1998;86:379–391. [Google Scholar]
  19. Galford GL, Mustard JF, Melillo J, Gendrin A, Cerri CC, Cerri CEP. Wavelet analysis of MODIS time series to detect expansion and intensification of row-crop agriculture in Brazil. Remote Sens. Environ. 2008;112:576–587. [Google Scholar]
  20. Gaston KJ. Global patterns in biodiversity. Nature. 2000;405:220–227. doi: 10.1038/35012228. [DOI] [PubMed] [Google Scholar]
  21. Gaston KJ, Blackburn TM, Goldewijk KK. Habitat conversion and global avian biodiversity loss. Proceedings of the Royal Society B: Biological Sciences. 2003;270:1293–1300. doi: 10.1098/rspb.2002.2303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Gaughan AE, Holdo RM, Anderson TM. Using short-term MODIS time-series to quantify tree cover in a highly heterogeneous African savanna. Int. J. Remote Sens. 2013;34:6865–6882. [Google Scholar]
  23. Gibson L, Lee TM, Koh LP, Brook BW, Gardner TA, Barlow J, et al. Primary forests are irreplaceable for sustaining tropical biodiversity. Nature. 2011;478:378–381. doi: 10.1038/nature10425. [DOI] [PubMed] [Google Scholar]
  24. Gudgin M, Hadley M, Mendelsohn N, Moreau J-J, Nielsen HF, et al. 2007. SOAP version 1.2 part 1: messaging framework (second edition). W3C. Retrieved July 24, 2013, from http://www.w3.org/TR/2007/REC-soap12-part1-20070427/
  25. Hashimoto H, Dungan J, White M, Yang F, Michaelis A, Running S, et al. Satellite-based estimation of surface vapor pressure deficits using MODIS land surface temperature data. Remote Sens. Environ. 2008;112:142–155. [Google Scholar]
  26. Huete A, Didan K, Miura T, Rodriguez EP, Gao X, Ferreira LG. Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sens. Environ. 2002;83:195–213. [Google Scholar]
  27. Hurlbert AH, Haskell JP. The effect of energy and seasonality on avian species richness and community composition. Am. Nat. 2003;161:83–97. doi: 10.1086/345459. [DOI] [PubMed] [Google Scholar]
  28. James ME, Kalluri S. The Pathfinder AVHRR land data set: an improved coarse resolution data set for terrestrial monitoring. Int. J. Remote Sens. 1994;15:3347–3363. [Google Scholar]
  29. Justice CO, Vermote E, Townshend J, DeFries R, Roy DP, Hall DK, et al. The Moderate Resolution Imaging Spectroradiometer (MODIS): land remote sensing for global change research. IEEE Trans. Geosci. Remote Sens. 1998;36:1228–1249. [Google Scholar]
  30. Kerr JT, Ostrovsky M. From space to species: ecological applications for remote sensing. Trends Ecol. Evol. 2003;18:299–305. [Google Scholar]
  31. Koh C-N, Lee P-F, Lin R-S. Bird species richness patterns of northern Taiwan: primary productivity, human population density, and habitat heterogeneity. Divers. Distrib. 2006;12:546–554. [Google Scholar]
  32. Lantschner MV, Rusch V, Peyrou C. Bird assemblages in Pine Plantations replacing native ecosystems in NW Patagonia. Biodivers. Conserv. 2008;17:969–989. [Google Scholar]
  33. Lassau SA, Hochuli DF. Testing predictions of beetle community patterns derived empirically using remote sensing. Divers. Distrib. 2008;14:138–147. [Google Scholar]
  34. Lentini PE, Martin TG, Gibbons P, Fischer J, Cunningham SA. Supporting wild pollinators in a temperate agricultural landscape: maintaining Mosaics of natural features and production. Biol. Conserv. 2012;149:84–92. [Google Scholar]
  35. Land Processes Distributed Active Archive Center (LP DAAC) 2014. MODIS Reprojection Tool (LP DAAC, L.P.D.A.A.C, Ed.)., Published online: 01 September 1994; | doi:10.1038/371065a0, 371, 65–66. Retrieved July 24, 2014, from https://lpdaac.usgs.gov/tools/modis_reprojection_tool.
  36. Magura T, Horvath R, Tothmeresz B. Effects of urbanization on ground-dwelling spiders in forest patches, in Hungary. Landscape Ecol. 2010;25:621–629. [Google Scholar]
  37. Mattiuzzi M, Verbesselt J, Stevens F, Mosher S, Hengl T, Kilsch A, et al. 2013. MODIS: MODIS acquisition and processing package. R package version 0.9-14/r439. Retrieved November 25, 2013, from http://R-Forge.R-project.org/projects/modis/
  38. Myneni RB, Hall FG, Sellers PJ, Marshak AL. The interpretation of Spectral Vegetation Indexes. IEEE Trans. Geosci. Remote Sens. 1995;33:481–486. [Google Scholar]
  39. Naeem S, Duffy JE, Zavaleta E. The functions of biological diversity in an age of extinction. Science. 2012;336:1401–1406. doi: 10.1126/science.1215855. [DOI] [PubMed] [Google Scholar]
  40. Nakagawa S, Schielzeth H. A general and simple method for obtaining R2 from generalized linear mixed-effects models (R.B. O'Hara, Ed.) Methods Ecol. Evol. 2013;4:133–142. [Google Scholar]
  41. Nelson R, Ranson KJ, Sun G, Kimes DS, Kharuk V, Montesano P. Estimating Siberian timber volume using MODIS and ICESat/GLAS. Remote Sens. Environ. 2009;113:691–701. [Google Scholar]
  42. Newbold T, Scharlemann JPW, Butchart SHM, Sekercioglu CH, Alkemade R, Booth H, et al. Ecological traits affect the response of tropical forest bird species to land-use intensity. Proceedings of the Royal Society B: Biological Sciences. 2013;280:20122131. doi: 10.1098/rspb.2012.2131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Newbold T, Hudson LN, Phillips HRP, Hill SLL, Contu S, Lysenko I, et al. A global model of the response of tropical and sub-tropical forest biodiversity to anthropogenic pressures. Proceedings of the Royal Society B: Biological Sciences. 2014;281:20141371. doi: 10.1098/rspb.2014.1371. doi: 10.1098/rspb.2014.1371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Nicodemus FE, Richmond JC, Hsim JJ, Ginsberg IW, Limperis T. Geometrical considerations and nomenclature for reflectance. Washington, D.C: National Bureau of Standards; 1977. [Google Scholar]
  45. Olson DM, Dinerstein E, Wikramanayake ED, Burgess ND, Powell GVN, Underwood EC, et al. Terrestrial ecoregions of the World: a New Map of Life on Earth. Bioscience. 2001;51:933–938. [Google Scholar]
  46. Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC) 2014. MODIS subsetted land products, Collection 5. Available on-line [ http://daac.ornl.gov/MODIS/modis.html] from ORNL DAAC, Oak Ridge, Tennessee, U.S.A. Accessed July 24, 2014.
  47. Pereira HM, Leadley PW, Proença V, Alkemade R, Scharlemann JPW, Fernandez-Manjarres JF, et al. Scenarios for global biodiversity in the 21st century. Science. 2010;330:1496–1501. doi: 10.1126/science.1196624. [DOI] [PubMed] [Google Scholar]
  48. Pettorelli N, Vik JO, Mysterud A, Gaillard JM, Tucker CJ, Stenseth NC. Using the satellite-derived NDVI to assess ecological responses to environmental change. Trends Ecol. Evol. 2005;20:503–510. doi: 10.1016/j.tree.2005.05.011. [DOI] [PubMed] [Google Scholar]
  49. R Core Team. 2014. R: A language and environment for statistical computing. Retrieved from http://www.R-project.org/
  50. Reed BC, Brown JF, VanderZee D, Loveland TR, Merchant JW, Ohlen DO. Measuring phenological variability from satellite imagery. J. Veg. Sci. 1994;5:703–714. [Google Scholar]
  51. Rowhani P, Lepczyk CA, Linderman MA, Pidgeon AM, Radeloff VC, Culbert PD, et al. Variability in energy influences avian distribution patterns across the USA. Ecosystems. 2008;11:854–867. [Google Scholar]
  52. Running SW, Nemani RR, Heinsch FA, Zhao MS, Reeves M, Hashimoto H. A continuous satellite-derived measure of global terrestrial primary production. Bioscience. 2004;54:547–560. [Google Scholar]
  53. Sakamoto T, Van Nguyen N, Ohno H, Ishitsuka N, Yokozawa M. Spatio-temporal distribution of rice phenology and cropping systems in the Mekong Delta with special reference to the seasonal water flow of the Mekong and Bassac rivers. Remote Sens. Environ. 2006;100:1–16. [Google Scholar]
  54. Scharlemann J, Benz D, Hay SI, Purse BV, Tatem AJ, Wint GRW, et al. Global data for ecology and epidemiology: a novel algorithm for temporal Fourier processing MODIS data. PLoS ONE. 2008;3:e1408. doi: 10.1371/journal.pone.0001408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Seto KC, Fleishman E, Fay JP, Betrus CJ. Linking spatial patterns of bird and butterfly species richness with Landsat TM derived NDVI. Int. J. Remote Sens. 2004;25:4309–4324. [Google Scholar]
  56. St-Louis V, Pidgeon AM, Kuemmerle T, Sonnenschein R, Radeloff VC, Clayton MK, et al. Modelling avian biodiversity using raw, unclassified satellite imagery. Philosophical Transactions of the Royal Society B: Biological Sciences. 2014;369:20130197. doi: 10.1098/rstb.2013.0197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Su ZM, Zhang RZ, Qiu JX. Decline in the diversity of willow trunk-dwelling Weevils (Coleoptera: Curculionoidea) as a result of urban expansion in Beijing, China. J. Insect Conserv. 2011;15:367–377. [Google Scholar]
  58. Sutherland WJ, Freckleton RP, Godfray HCJ, Beissinger SR, Benton T, Cameron DD, et al. Identification of 100 fundamental ecological questions. J. Ecol. 2013;101:58–67. [Google Scholar]
  59. Tittensor DP, Walpole M, Hill SLL, Boyce DG, Britten GL, Burgess ND, et al. A mid-term analysis of progress towards international biodiversity targets. Science. 2014 doi: 10.1126/science.1257484. [DOI] [PubMed] [Google Scholar]
  60. Turner W, Spector S, Gardiner N, Fladeland M, Sterling E, Steininger M. Remote sensing for biodiversity science and conservation. Trends Ecol. Evol. 2003;18:306–314. [Google Scholar]
  61. Turner DP, Ritts WD, Cohen WB, Gower ST, Running SW, Zhao M, et al. Evaluation of MODIS NPP and GPP products across multiple biomes. Remote Sens. Environ. 2006;102:282–292. [Google Scholar]
  62. Wardlow BD, Egbert SL, Kastens JH. Analysis of time-series MODIS 250 m vegetation index data for crop classification in the US Central Great Plains. Remote Sens. Environ. 2007;108:290–310. [Google Scholar]
  63. Waring RH, Coops NC, Fan W, Nightingale JM. MODIS enhanced vegetation index predicts tree species richness across forested ecoregions in the contiguous USA. Remote Sens. Environ. 2006;103:218–226. [Google Scholar]
  64. Weller B, Ganzhorn JU. Carabid beetle community composition, body size, and fluctuating asymmetry along an urban-rural gradient. Basic Appl. Ecol. 2004;5:193–201. [Google Scholar]
  65. Winfree R, Griswold T, Kremen C. Effect of human disturbance on bee communities in a forested ecosystem. Conserv. Biol. 2007;21:213–223. doi: 10.1111/j.1523-1739.2006.00574.x. [DOI] [PubMed] [Google Scholar]
  66. Xiao J, Zhuang Q, Law BE, Chen J, Baldocchi DD, Cook DR, et al. A continuous measure of gross primary production for the conterminous United States derived from MODIS and AmeriFlux data. Remote Sens. Environ. 2010;114:576–591. [Google Scholar]
  67. Xu T, Hutchinson M. ANUCLIM version 6.1 user guide. Canberra: The Australian National University; 2011. [Google Scholar]
  68. Zhang X, Friedl MA, Schaaf CB, Strahler AH, Hodges JC, Gao F, et al. Monitoring vegetation phenology using MODIS. Remote Sens. Environ. 2003;84:471–475. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix S1. R script for downloading MODIS vegetation indices for analysis.

ece30004-4658-sd1.R (3.6KB, R)

Appendix S2. R function to create a vector of weighted means the same order as a vector.

ece30004-4658-sd2.R (1.6KB, R)

Appendix S3. R script for analysing EVI and NDVI data for a single pixel.

ece30004-4658-sd3.R (14.4KB, R)

Appendix S4. R script for analysing EVI and NDVI data for a 6×6 km area of interest.

ece30004-4658-sd4.R (22.9KB, R)

Figure S1. Response of species richness to vegetation indices at the focal pixel scale.

ece30004-4658-sd5.pdf (48KB, pdf)

Figure S2. Response of species richness to NDVI at the 6.25×6.25 km scale using spatially-weighted and unweighted data.

ece30004-4658-sd6.pdf (54.4KB, pdf)

Figure S3. Response of species richness to EVI at the 6.25×6.25 km scale using spatially-weighted and unweighted data.

ece30004-4658-sd7.pdf (50.5KB, pdf)

Table S1. Dummy ecological data set for conducting analysis.

ece30004-4658-sd8.csv (33.5KB, csv)

Articles from Ecology and Evolution are provided here courtesy of Wiley

RESOURCES