Skip to main content
NASA Author Manuscripts logoLink to NASA Author Manuscripts
. Author manuscript; available in PMC: 2020 Jul 7.
Published in final edited form as: Remote Sens Environ. 2017 Oct 21;204:392–400. doi: 10.1016/j.rse.2017.10.016

Information theoretic evaluation of satellite soil moisture retrievals

Sujay V Kumar 1, Paul A Dirmeyer 2, Christa D Peters-Lidard 3, Rajat Bindlish 4, John Bolten 5
PMCID: PMC7340154  NIHMSID: NIHMS1505558  PMID: 32636571

Abstract

Microwave radiometry has a long legacy of providing estimates of remotely sensed near surface soil moisture measurements over continental and global scales. A consistent assessment of the errors and uncertainties associated with these retrievals is important for their effective utilization in modeling, data assimilation and end-use application environments. This article presents an evaluation of soil moisture retrieval products from AMSR-E, ASCAT, SMOS, AMSR2 and SMAP instruments using information theory-based metrics. These metrics rely on time series analysis of soil moisture retrievals for estimating the measurement error, level of randomness (entropy) and regularity (complexity) of the data. The results of the study indicate that the measurement errors in the remote sensing retrievals are significantly larger than that of the ground soil moisture measurements. The SMAP retrievals, on the other hand, were found to have reduced errors (comparable to those of in-situ datasets), particularly over areas with moderate vegetation. The SMAP retrievals also demonstrate high information content relative to other retrieval products, with higher levels of complexity and reduced entropy. Finally, a joint evaluation of the entropy and complexity of remotely sensed soil moisture products indicates that the information content of the AMSR-E, ASCAT, SMOS and AMSR2 retrievals is low, whereas SMAP retrievals show better performance. The use of information theoretic assessments is effective in quantifying the required levels of improvements needed in the remote sensing soil moisture retrievals to enhance their utility and information content.

Keywords: soil moisture, remote sensing, information theory

1. Introduction

Soil moisture plays an important role in modulating the exchanges of water and energy at the land atmosphere interface and profoundly influences the spatial and temporal variability of weather and climatic conditions (Koster et al. (2004); Seneviratne et al. (2010)). Accurate characterization of soil moisture is, therefore, important for applications such as flood/drought forecasting, weather and climate modeling, agricultural and water resources management. Observations of soil moisture from ground measurements tend to be sparse and are often not sufficient to capture the spatial heterogeneity and variability of soil moisture at larger spatial scales, required for such applications. Space-borne measurements of soil moisture, primarily from microwave (MW) remote sensing, provide an alternative for developing observations of soil moisture over larger spatial extents (Jackson (1993); Njoku and Entekhabi (1995)). In the past several decades, near surface soil moisture retrievals have become available from a number of low-frequency (C, X, Ku- and L-band) passive and active microwave sensors (Wagner et al. (2003); Njoku et al. (2003); Wen et al. (2003); Owe et al. (2008); Kerr et al. (2010); Entekhabi et al. (2010)).

Microwave soil moisture sensors exploit the fact that the emission of the land surface is affected by variables such as surface temperature, roughness, vegetation and soil moisture. The influence of soil moisture is most prominent at low frequencies (~10 – 1 GHz, making it the ideal range of satellite remote sensing (Njoku and Kong (1977); Jackson et al. (1982); Ulaby et al. (1986)). Unlike the visible and infrared sensors, the microwave sensors are not limited by cloud cover and nighttime conditions. The observations can be made at any time of the day and are not dependent on solar illumination (Jackson et al. (1996)). Longer wavelengths (L-band; 1 –2 GHz) also allow for deeper penetration into the soil and reduce the influence of vegetation in attenuating the soil moisture signal (Jackson et al. (1982)). The active instruments can provide measurements at higher spatial resolutions than the passive microwave instruments, though radar systems are more strongly affected by the local topography, roughness and vegetation than passive radiometer systems (Entekhabi et al (2010); Lakshmi (2013)). However, studies such as Brocca et al (2011) have suggested that ASCAT can outperform passive microwave based retrievals over areas with moderate vegetation. Passive observations on the other hand, are more impacted by spatial heterogeneity and scaling effects because of poor spatial resolution. The spatial resolution of the passive microwave soil moisture observations is typically coarse (~25 to 50 km), with the satellite foot-print size increasing with wavelength and altitude. The presence of snow cover, frozen soil and precipitation events also limits the skill of the soil moisture retrievals (Parinussa et al. (2011)).

Due to the differences in the spatial and temporal span of different MW instruments and due to the limited availability of reliable ground measurements, a consistent evaluation of soil moisture remote sensing datasets is difficult. Land surface model climatology has often been used the reference to address the climatological differences between different retrievals when developing multi-sensor products (Liu et al. (2011b)) and for consistent evaluations of multiple products. In a recent study, Kumar et al. (2015) has shown that such approaches lead to the loss of valuable signals and cause the statistical properties of the retrieval products to be similar to that of the reference datasets. Therefore, performance measures not reliant on the availability of ancillary soil moisture data can be useful for characterizing and assessing the quality of the soil moisture retrieval datasets. As a result, studies have used indirect approaches such as triple collocation (TC; Stoffelen (1998); Dorigo et al. (2010)) and spectral fitting (SF; Su et al. (2014)) to assess the relative quality of global soil moisture retrievals. TC comparisons involve three different soil moisture products (often a mix of satellite soil moisture retrievals and land surface model estimates), with assumptions of linearity (between the true soil moisture and observations), signal and error stationarity, error orthogonality and independence of errors in the constituent datasets (Gruber et al. (2016b)). Recent studies have examined the applicability of these assumptions for soil moisture datasets (Yilmaz and Crow (2014)) and have proposed enhancements to address the limitations imposed by these assumptions, making it a powerful method for global soil moisture evaluation (Zwieback et al. (2013); Gruber et al. (2016b,a). The SF error estimator, based on the method developed by Su et al. (2013) for de-noising satellite soil moisture datasets, estimates the stochastic random errors by comparing the spectral properties of a given soil moisture time series and a linearized water balance model. This method also does not require ancillary datasets and was shown to provide error estimates comparable to those from TC.

Similar to these stand-alone assessment methods, here we present the use of information theoretic and autoregressive analysis of time series data for quantifying errors and information content of remote sensing retrieval datasets from a number of recent soil moisture missions. Information theory measures, originally proposed by Shannon (1948), consider the stochasticity in time series data as sources of information. A key information theoretic measure is entropy, which quantifies the information content or randomness associated with the probability distribution of the data. Similarly, temporal measures of complexity rooted in information theory can be used to discriminate datasets based on time series complexity. Entropy and complexity provide separate measures of information by characterizing the randomness and state changes within a given time series of the data. Entropy is a measure of uncertainty, which is low for periodic sequences and high for random processes. On the other hand, complexity is a measure that is low for both periodic and random sequences, but high for sequences that are not easy to describe with a minimal set of parameters (Lange (1999)). Such measures have been employed for comparing model outputs of soil moisture (Pachepsky et al. (2006)), space-borne soil moisture retrievals (Nearing et al. (2017)), runoff and precipitation measurements from different catchment systems (Lange (1999); Hauhs and Lange (2008)) and ecological systems (Parrott (2010)). A key advantage of information theoretic methods is that they enable the quantification of hidden patterns and structures of the data without requiring ancillary or independent data.

In addition to the use of information theoretic measures, we also employ time series red noise spectrum analysis to develop estimates of accuracy. Vinnikov et al. (1996) employed a first-order Markov process model framework to evaluate observational soil moisture data, which was extended by Dirmeyer et al. (2016) in a recent study to compare measurement errors from different in-situ soil moisture observational networks. Here we apply this method for comparing measurement errors associated with remote sensing soil moisture retrievals. Similar to the information theoretic measures, a key advantage of this approach is that it does not require specific validation or independent reference data. The simultaneous development of information theoretic and measurement error estimates allows the comparison of associated tradeoffs in accuracy, uncertainty and complexity.

The article is organized as follows: Section 2 presents the details of the datasets and the evaluation approaches. The application of the information theory methods to the remote sensing soil moisture retrievals is described in Section 3. Section 4 provides a summary and discussion of the major conclusions of this study.

2. Approach

2.1. Methods

The information theoretic measures are developed by treating the time series data as a symbol sequence with a finite number of states. The standard approach is to categorize the time series data into a binary string (“symbols”) (Lange (1999); Pachepsky et al. (2006)), by encoding values above and below the median (for time series at each grid point), as 1 and 0, respectively. The entropy and complexity measures are then computed based on the probabilities of observing patterns of states/words (a group of L consecutive symbols) within the sequence. In this article, we use three symbol states (L=3), consistent with prior studies (Pachepsky et al. (2006); Pan et al. (2011)). These include the probability of occurrence of a given state i (pL,i) as well as the second order probability (pL,ij) of observing state i next to j. For binary symbol sequences, there are 2L possible words of length L. (For example, if an encoded symbol string starts as ‘0011’, then the first word is ‘001’, which transitions to the second word ‘011’ and so on.)

Shannon entropy is the expected value of the information contained in a symbol sequence. The metric entropy is specified as the normalized measure of Shannon entropy for states of size L and is defined as:

H(L)=i=12LpL,ilog2pL,iL (1)

H(L) ranges between 0 (for constant sequences) and 1 (for uniformly distributed random sequences).

The fluctuation complexity (Bates and Shephard (1993)), which measures the spread between information within a symbol string between consecutive states is expressed as:

C(L)=i,j2LpL,ij(log2pL,ipL,j)2 (2)

C(L) can be thought of as a measure of the ordering of states within a symbol sequence, with high and low values associated with complex and simple orderings, respectively. The fluctuation complexity, therefore, is a measure of the extent of the changes in information gain or loss in a time series and it approaches zero for signals with limited probable states (Pan et al. (2011)).

Note that both the choice of the classification and the length of the words have an impact on the metrics that are computed. The use of a finer classifications (rather than wet and dry) and the use of larger number of words enables a more granular detection of the entropy and complexity measures, but requires longer and consistent time series. Though the use of the three-symbol states in this study limits the granularity of the soil moisture changes detected by the information theory measures, they are helpful in examining the general trends across various remote sensing datasets.

The analysis of measurement errors used in this study is based on the fact that soil moisture, due to its memory, can be described as a first order Markov process (Delworth and Manabe (1988)). The lagged autocorrelation of soil moisture (r(τ)) reduces exponentially with time:

r(τ)=eλτ (3)

where λ is decay frequency and τ is the time lag. Due to the presence of measurement errors, a linear regression of ln(r) vs τ does not pass through τ = 0, r=1. Therefore, the displacement term a of the correlation at τ = 0 can be used to compute estimates of measurement error (Vinnikov et al. (1996)). The relative measurement error (ϵ) can be expressed as the square root of the fraction of the random error variance and the variance of soil moisture, as follows:

ϵ=(a1+a) (4)

In other words, ϵ is the root mean square (RMS) of the measurement error normalized by the standard deviation of soil moisture. This statistical model assumes that soil moisture evolution can be represented by a first-order ordinary differential equation (ODE) driven by white-noise precipitation forcing (Delworth and Manabe (1988)). Essentially the model assumes that noise quantified here is that which does not fit the first order ODE. In the analysis below, the error estimates are generated using autocorrelations at lags of 1, 2 and 3 days.

2.2. Data

Retrievals from five recent satellite soil moisture microwave instruments are used in this study. They include: (1) the Advanced Microwave Scanning Radiometer-Earth Observing System (AMSR-E) aboard the Aqua satellite, (2) the Advanced Scatterometer (ASCAT), a C-band active microwave remote sensing instrument aboard the Meteorological Operational (METOP) satellites, (3) the Soil Moisture Ocean Salinity (SMOS) mission, (4) the Advanced Microwave Scanning Radiometer 2 (AMSR2) onboard the Global Change Observation Mission-Water (GCOM-W) satellite, and (5) the Soil Moisture Active Passive (SMAP) mission. Except for AMSR-E, which stopped functioning in October 2011, all these instruments are currently providing measurements of surface soil moisture. Soil moisture retrievals are generated from the raw measurements using different retrieval algorithms and systems. The AMSR-E retrievals with the Land Parameter Retrieval Model (LPRM) algorithm (Owe et al., 2008) is used here as prior studies have quantified better performance of AMSR-E LPRM data relative to other available AMSR-E retrieval products (Rudiger et al. (2009); Champagne et al. (2010); Liu et al. (2011a)). The Soil Moisture Operational Products System (SMOPS; Liu et al., 2012) of NOAA/NESDIS is used for obtaining soil moisture retrievals from the backscatter measurements acquired by ASCAT and the L-band radiometer measurements of SMOS. Note that the ASCAT retrievals available through SMOPS are the same as the Near Real Time (NRT) retrievals from EUMETSAT, designed to meet the latency requirements of the operational Numerical Weather Prediction (NWP) community. The SMOS retrievals in SMOPS are produced through a single channel retrieval algorithm based on Jackson (1993). The SMOPS product is used for operational soil moisture data assimilation at several agencies around the world due to its NRT availability. The AMSR2 retrievals (Level 3 products) from the Japan Aerospace Exploration Agency (JAXA; Fujii et al., 2009; Koike, 2013) are used in this study as they have been shown to perform better compared to other available retrieval products (Bindlish et al. (2017)).

The SMAP mission consists of two instruments, a L-band high resolution radar (1 km) and a coarse-resolution radiometer (40 km). The SMAP radar encountered an anomaly a few months after launch and is currently inoperable. As a result, in this study we use the level 3, coarse resolution (36 km) passive microwave measurements (L3_SM_P; O’Neill et al. (2012); Chan et al. (2016)) available through the National Snow and Ice Data Center (NSIDC). The temporal extents of the data sets used in this study are as follows: AMSR-E data from June 2002 to October 2011, ASCAT from January 2007 to December 2016, AMSR2 from July 2012 to December 2016, SMOS from April 2012 to December 2016 and SMAP from April 2015 to December 2016. To ensure a reasonable temporal continuity in these datasets, gaps of less than 3 days are filled using a 1-d discrete cosine transform Wang et al. (2012) method, consistent with the strategy used in Su et al. (2013). Unlike Dirmeyer et al. (2016), where interpolation was used to fill gaps of less than 10 days, we used a shorter time window to ensure that the temporal interpolation itself does not significantly impact the computation of the metrics. As the temporal gaps and irregular sampling of remote sensing datasets are intrinsic to these product, we omit analyses that reconciles these differences to a common repeat period.

3. Results

Figure 1 shows the maps of relative measurement error and its distribution for soil moisture retrievals from each sensor. The data quality flags provided with each sensor are employed in screening the data values used in the comparisons. For example, a subset of data locations that conform to the recommended Quality Assessment (QA) classifications (‘good retrievals’) of the SMOPS system is employed in the comparisons. The spatial patterns in Figure 1 show a strong signal of vegetation density with larger errors over areas with thick vegetation (e.g., Amazon, Congo, Eastern U.S.) and smaller errors over Savannas and Arid regions (e.g., India, Western U.S.). Compared to the SMOS retrievals, the ASCAT retrievals show larger errors over arid regions of the world (Sahara, Western U.S., deserts of Australia). This is consistent with prior studies (Wagner et al. (2007); Gruhier et al. (2010)) that also reported that the scatterometer retrievals are less accurate than the radiometer retrievals over dry regions. This is due to the fact that over dry environments when the soil dries out completely, the scattering contributions from surface inhomogeneities impact the soil moisture retrievals more than the soil moisture content itself (Wagner et al. (2012)). The relative measurement error computations in Figure 1 confirm these previous findings.

Figure 1:

Figure 1:

Relative measurement error (ϵ) for soil moisture retrievals from AMSR-E, ASCAT, SMOS, AMSR2 and SMAP. The lower right figure shows the distribution of ϵ for each sensor.

The relative measurement error of in-situ soil moisture datasets reported in Dirmeyer et al. (2016) showed a range of 0.1–0.3 for most measurement systems with larger errors for systems employing sensors just above the land surface. From Figure 1, it can be seen that the errors associated with the satellite-based retrievals are generally larger, in the 0.4–0.6 range. The domain averaged relative measurement errors are 0.46, 0.44, 0.54, 0.47, and 0.42 for AMSR-E, ASCAT, SMOS, AMSR2 and SMAP, respectively. Across different sensors, SMAP based retrievals show better performance over different climatic zones and biomes, with relative measurement errors significantly reduced over areas with moderate vegetation. Some areas with notably low skill for SMAP are the Sahara and Western Australia deserts, which are likely due to factors such as the surface temperature biases used in the SMAP retrievals (SMAP science team, pers. comm.) and the deeper contributing depth of the microwave signal over arid areas. In addition, the limited dynamic range of soil moisture over deserts and forested areas also contributes to higher relative errors over these areas. Generally, the soil moisture dynamic ranges are higher over non-forested areas with moderate vegetation and SMAP retrievals show high skills over such regions. Note that such issues are also observed in retrievals from ASCAT, SMOS and AMSR-E. The comparison of the distribution of measurement errors also confirms the fact that overall, SMAP retrievals are improved relative to the skill of the retrievals from other MW sensors. The ASCAT retrievals show reduced error levels in the high latitudes, which contribute to the increased span in the medium error range (0.2–0.4) in the distribution comparisons.

Figures 2 and 3 show comparisons of the soil moisture retrievals from the 5 sensors based on metric entropy and fluctuation complexity, respectively. The maps of metric entropy show discrimination of areas with different levels of randomness in the retrievals. For example, areas of high vegetation density show up as areas with high randomness in the retrievals, as larger H values are seen over the Amazon, Eastern U.S. and Congo. Larger uncertainty is also seen over arid regions in the Western U.S., Sahara and Western Australia, especially in the ASCAT and AMSR2 retrievals. Conversely, the fluctuation complexity maps show reduced values over these regions with larger randomness, which are indicative of low information content in the time series at these locations. Similar to the trends seen in Figure 1, SMAP shows a distinctly different behavior in these comparisons. Generally, the metric entropy values are significantly lower (reduced randomness in the SMAP time series) and fluctuation complexity values are higher (larger information content compared to a periodic or random noise signal). SMAP retrievals particularly show high information content (less noise) in the midlatitude regions in the comparisons in Figures 2 and 3. The plots of the distribution of the metric entropy and fluctuation complexity values across the whole domain also confirm these trends. The metric entropy and fluctuation complexity distributions for all sensors except SMAP are skewed to the high and low values, respectively, indicating that overall, the information content of the retrievals from these sensors have large amount of noise. The SMAP distribution spans an intermediate range, suggesting reduced levels of randomness and increased levels of complexity in the time series.

Figure 2:

Figure 2:

Similar to Figure 1, but for metric entropy (H)

Figure 3:

Figure 3:

Similar to 2, but for fluctuation complexity (C)

Note that the AMSR-E and AMSR2 retrieval algorithms are based on X-band passive microwave observations, whereas ASCAT uses C-band radar observations. The observations based on these channels have lower sensitivity to soil moisture and are more influenced by the presence of moderate to dense vegetation compared to the retrievals using lower frequency (L-band) channels. Nevertheless, the comparison of ASCAT versus SMOS/AMSR-E/AMSR2 presented in Figures 1 to 3 indicates that in many parts of the world, the active and passive retrievals have comparable skills. It is interesting, however, that the SMAP retrievals show higher skill and increased information content compared to SMOS, though both are L-band based retrievals. Though the SMOS and SMAP instruments are similar, they use different technologies. The SMAP instrument is a real aperture radiometer whereas SMOS uses a synthetic aperture radiometer. Previous studies (Oliva et al. (2013)) have documented that the unique SMOS brightness temperature (Tb) observations have a higher Noise Equivalent Delta Temperature (NEDT), which represents the temperature difference that would produce a signal equivalent to the internal noise of the instrument. The SMOS retrieval algorithm attempts to reduce the impact of NEDT by using Tb from all incidence angles. The error in the soil moisture retrieval is then minimized by the relationship between Tb and the incidence angles. The quality and the number of Tb samples, however, reduce as the distance from the center of the swath decreases. SMAP, on the other hand, provides observations of a particular location at a fixed incidence angle, which likely contributes to the reduced noise in the measurements, as confirmed in our analysis. Note also that though SMOS and SMAP both operate L-band radiometers, the SMOS retrievals suffer more from the man-made radio frequency interference (RFI) contamination, which were unknown before the SMOS launch. The SMAP mission, on the other hand, developed measures to mitigate the effect of RFI prior to launch, which has likely contributed to the improved performance of the SMAP retrievals relative to SMOS.

A comparison of the average values of the three metrics stratified by vegetation type is shown in Figure 4. The seven vegetation categories are derived from the modified International Geosphere-Biosphere Programme (IGBP) Moderate Resolution Imaging Spectroradiometer (MODIS) data (Friedl et al. (2010)). Similar to the patterns seen in the spatial maps, smaller errors are seen for moderate vegetation types and larger errors for bare ground and thick vegetation types. SMAP shows the smallest errors among different sensors across most vegetation types. In particular, SMAP retrievals show lowest errors over the Cropland and Grassland types. In the information theory comparisons, SMAP retrievals show reduced levels of randomness and high fluctuation complexity among the 5 sensors across all vegetation types. Generally, the stratification also indicates higher information content over moderate vegetation types compared to thick vegetation types. For other sensors, however, the obvious contrasts in the metrics between vegetation types are not always observed. For example, AMSR2 shows similar metric entropy values across all vegetation types. The performance of SMOS and ASCAT are comparable for different vegetation types, except for the low metric entropy values over bare ground areas.

Figure 4:

Figure 4:

Stratification of metrics by vegetation type

Metric entropy is a measure of the amount of uncertainty inherent in a Markov process (Gray (2011)), but it does not characterize the state changes in a time series, which can be captured by complexity measures. As a result, joint consideration of the two measures is necessary to quantify the information content of a time series in terms of its randomness and state changes within the sequences. Previous studies have shown that the functional relationship between entropy and complexity generally follows an inverse parabolic relationship (Lange (1999)), as complexity is low for periodic (low entropy) and random noise (high entropy) signals, but high for time series that are different from random or trivial sequences (intermediate entropy). Figure 5 shows “heatmaps”/density of grid points as a function of these two variables, for the 5 remote sensing retrievals. In addition, Figure 5 also includes joint evaluations of the entropy and complexity from ground soil moisture measurements and outputs from two land surface model simulations. The ground soil moisture measurements are obtained from the U.S. Department of Agriculture Soil Climate Analysis Network (SCAN; Schaefer et al. (2007)), whereas the Noah (Ek et al. (2003)) and Mosaic (Koster and Suarez (1996)) model soil moisture estimates from the Global Land Data Assimilation System (GLDAS; Rodell et al. (2004)) are used as the land surface model outputs.

Figure 5:

Figure 5:

Density of grid points mapped as a function of metric entropy (x-axis) and fluctuation complexity (y-axis).

The comparisons shown in Figure 5 indicate the different regions of the Entropy-Complexity (E-C) space spanned by each soil moisture dataset. The remote sensing measurements AMSR-E, ASCAT, SMOS and AMSR2 show high density of grid points in the lower right part of the E-C space, the area dominated by high randomness and low complexity. This suggests that the information content of these retrievals is low. Comparatively, SMAP shows improved performance, where the density of grid points is shifted to the area with high complexity and intermediate randomness. The in-situ measurements from SCAN show high density in the E-C space in regions with high complexity, but with marginally reduced entropy (compared to SMAP). The heatmaps from GLDAS-Noah and GLDAS-Mosaic also indicate high complexity and intermediate randomness in their soil moisture time series. It can be observed that the land models, ground measurements and remote sensing datasets span different parts of the E-C space and together, they encompass the inverse parabolic relationship between entropy and complexity. Generally, entropy is lower in the land model estimates, increases marginally for the ground soil moisture measurements, and is highest for remote sensing datasets. On the other hand, complexity is comparable across land surface model and ground soil moisture estimates, but significantly lower for remote sensing measurements (except those from SMAP). If ground measurements are considered as reference, the comparison in Figure 5 shows that significant improvements to the remote sensing retrievals are required for improving their information content, to improve their utility in modeling and data assimilation environments.

As the metric entropy and fluctuation complexity measures quantify the information of the signal and are not necessarily direct assessments of the skill of the measurement itself, they should be viewed as a complementary analysis to standard validation metrics. For example, in an arid region, the soil moisture signal may not have significant variability and as a result, the complexity and entropy of the natural signal may be low. Arguably, the utility of remote sensing measurements is higher over areas where soil moisture dynamics are inherently more variable and capturing them accurately is difficult. Over such areas, the information theory metrics are useful for providing both assessments of signal quality as well as for intercomparing model, satellite and ground reference data products. The information theory based discrimination can also be used for developing merged products with improved information content.

4. Summary

Remote sensing based observations of soil moisture, primarily from passive and active microwave remote sensing, are of great value as they provide measurements across a range of spatial and temporal scales and extents. A consistent evaluation of the accuracy and information content of these products, however, is difficult since reliable, spatially coherent ground measurements of soil moisture are lacking in many parts of the world. In this article, we present a time series based information theoretic analysis for an intercomparison of recent satellite-based soil moisture products.

Soil moisture retrievals from five recent microwave remote sensing instruments, including AMSR-E, ASCAT, SMOS, AMSR2 and SMAP are used in this study. Three measures that quantify the accuracy, randomness, and the complexity of the data are used to intercompare these retrieval products. An autoregressive analysis that models soil moisture as a first order Markov process is used to develop estimates of measurement errors. Information theory measures of metric entropy and fluctuation complexity that quantify the stochasticity in time series data are used to provide comparisons of information content in these retrievals. Metric entropy measures the amount of randomness inherent in a Markov process whereas fluctuation complexity provides a measure that evaluates the level of regularity and randomness in the time series data.

The information theory measures are developed by translating the soil moisture time series to binary symbol strings and by examining the probabilities of patterns of states defined by a sequence of consecutive symbols. The article uses three symbol states, consistent with previous literature and similar applications of the information theory measures for hydrological model evaluations.

The results of the red noise spectrum analysis provide an assessment of the strengths and limitations of the soil moisture retrieval products. Generally these products have reduced measurement errors over areas with moderate vegetation density and large errors over areas with thick vegetation. In many instances, large measurement errors are also observed over bare soil areas. The estimates of measurement error also indicate that generally remote sensing retrievals have larger errors compared to that of in-situ measurements. Among the remote sensing retrieval datasets, the SMAP-based products were found to have lower errors over different climatic regimes in the world. In particular, the SMAP retrieval errors were comparable to that of the in-situ measurements over areas with moderate vegetation density (relative errors in the range of 0.2–0.3).

Comparison of the metric entropy and fluctuation complexity measures from these retrieval products also indicates similar trends. The signature of vegetation density is prominent in these information theory evaluations as the evaluations indicate larger uncertainty and lower complexity over areas of the world with thick vegetation. Comparatively, the SMAP retrievals show improved information content relative to other retrievals. The level of randomness was generally lower in the SMAP retrievals, whereas the complexity of the SMAP time series data was generally higher, compared to the AMSR-E, ASCAT, SMOS and AMSR2 products. SMAP soil moisture product is based on L-band passive microwave observations (which are most sensitive to soil moisture). Other satellites use different frequencies, which are less sensitive to soil moisture (AMSR-E and AMSR2 use X-band radiometers, ASCAT uses a C-band radar). SMOS L-band observations are affected by the presence of RFI.

A joint comparison of the metric entropy and fluctuation complexity of the remote sensing retrieval products is also presented in this study. Generally, it can be argued that a time series signal is of high information content, if it possesses intermediate entropy and high complexity. Combinations of high entropy and low complexity are symptomatic of random noise signals whereas low entropy and low complexity are indicative of periodic/trivial signals. The simultaneous assessment of entropy and complexity indicates that the majority of retrievals from AMSR-E, ASCAT, SMOS and AMSR2 have low information content. Comparatively, the performance of the SMAP retrievals is better, with higher density of grid points with increased complexity and reduced entropy. A similar evaluation of in-situ soil moisture and land surface model output data is also presented in the article. The in-situ measurements encapsulate the region of high information content in the entropy-complexity space. The land surface models also indicate marginally lower randomness with high levels of complexity in their estimates. Together, the three sets of soil moisture estimates (remote sensing, in-situ and model) span the majority of the inverse parabolic space expected in the entropy complexity comparisons. Generally, the land surface model and remote sensing datasets span mutually exclusive regions of the E-C space. This suggests that improvements in the remote sensing retrievals are necessary before including them in data assimilation environments that rely on observational information to constrain model simulations and forecasts. The results also indicate that SMAP retrievals with low entropy and increased complexity can provide valuable information for hydrologic modeling studies.

5. Acknowledgments

This research was supported by NASA Applied Sciences Grant entitled Enhancing the Information Content and Utilization of SMAP products for Agricultural Applications (John D. Bolten Principal Investigator). Computing was supported by the resources at the NASA Center for Climate Simulation. Dr. Dirmeyer is supported by NASA grant NNX13AQ21G. We are grateful to Dr. Xiwu Zhan and Dr. Jicheng Liu of NOAA NESDIS for providing SMOPS soil moisture datasets.

Contributor Information

Sujay V. Kumar, Hydrological Sciences Laboratory, NASA GSFC, Greenbelt, MD

Paul A. Dirmeyer, George Mason University, Fairfax, VA

Christa D. Peters-Lidard, Earth Sciences Division, NASA GSFC, Greenbelt, MD

Rajat Bindlish, Hydrological Sciences Laboratory, NASA GSFC, Greenbelt, MD.

John Bolten, Hydrological Sciences Laboratory, NASA GSFC, Greenbelt, MD.

6. References

  1. Bates J, Shephard H, 1993. Measuring complexity using information fluctuation. Physics Letters 172, 416–425. [Google Scholar]
  2. Bindlish R, Cosh M, Jackson T, Koike T, Fujii H, Chan S, Asanuma J, Berg A, Bosch D, Caldwell T, Holifield Collins C, McNairn H, Martinez-Fernandez J, Prueger J, Rowlandson T, Seyfried M, Starks P, Thibeault M, Van Der Velde R, Walker J, Cooper-smith E, 2017. GCOM-W AMSR2 soil moisture product validation using core validation sites. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (J-STARS) doi: 10.1109/JSTARS.2017.2754293. in print. [DOI] [Google Scholar]
  3. Brocca L, Hasenauer S, Lacava T, Melone F, Moramarco T, Wagner W, Dorigo W, Matgen P, Martnez-Fernndez J, Llorens P, Latron J, Martin C, Bittelli M, 2011. Soil moisture estimation through ASCAT and AMSR-E sensors: An intercomparison and validation study across Europe. Remote Sensing of Environment 115, 3390–3408. doi: 10.1016/j.rse.2011.08.003. [DOI] [Google Scholar]
  4. Champagne C, Berg A, Belanger J, McNairn H, Jeu RD, 2010. Evaluation of soil moisture derived from passive microwave remote sensing over agricultural sites in Canada using ground-based soil moisture monitoring networks. International Journal of Remote Sensing 31, 3669–3690. doi: 10.1080/01431161.2010.483485. [DOI] [Google Scholar]
  5. Chan SK, Bindlish R, O’Neill PE, Njoku E, Jackson T, Colliander A, Chen F, Burgin M, Dunbar S, Piepmeier J, Yueh S, Entekhabi D, Cosh MH, Caldwell T, Walker J, Wu X,Berg A, Rowlandson T, Pacheco A, McNairn H, Thibeault M, Martnez-Femndez J, Gonzlez-Zamora,., Seyfried M, Bosch D, Starks P, Goodrich D, Prueger J, Palecki M, Small EE, Zreda M, Calvet JC, Crow WT, Kerr Y, 2016. Assessment of the SMAP Passive Soil Moisture Product. IEEE Transactions on Geoscience and Remote Sensing 54, 4994–5007. doi: 10.1109/TGRS.2016.2561938. [DOI] [Google Scholar]
  6. Delworth T, Manabe S, 1988. The influence of potential evaporation on the variabilities of simulated soil wetness and climate. J. Climate 1, 523–547. [Google Scholar]
  7. Dirmeyer P, Wu J, Norton H, Dorigo W, Quiring S, Ford T, Santanello J, Bosilovich M, Ek M, Koster R, Balsamo G, Lawrence D, 2016. Confronting weather and climate models with observational data from soil mositure networks over the United States. Journal of Hydrometeorology 17. doi: 10.1175/JHM-D-15-0196.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Dorigo W, Scipal K, Parinussa R, Liu Y, Wagner W, de Jeu R, Naeimi V, 2010. Error characterization of global active and passive microwave soil moisture datasets. Hydrol. Earth Syst. Sci 14, 2605–2616. doi: 10.5194/hess-14-2605-2010. [DOI] [Google Scholar]
  9. Ek M, Mitchell K, Yin L, Rogers P, Grunmann P, Koren V, Gayno G, Tarpley J, 2003. Implementation of Noah land-surface model advances in the NCEP operational mesoscale Eta model. Journal of Geophysical Research 108, doi: 10.1029/2002JD003296. [DOI] [Google Scholar]
  10. Entekhabi D, Njoku E, O’Neill P, Kellogg K, Crow WT, Edelstein W, Entin J, Goodman S, Jackson T, Johnson J, Kimball J, Piepmeier J, Koster R, Martin N, McDonald K, Moghaddam M, Moran S, Reichle R, Shi J, Spencer M, Thurman S, Tsang L, Van Zyl J, 2010. The Soil Moisture Active Passive (SMAP) Mission. Proceedings of the IEEE 98, 704–716. doi: 10.1109/jproc.2010.2043918. [DOI] [Google Scholar]
  11. Friedl M, Sulla-Menashe D, Tan B, Schneider A, Ramankutty N, Sibley A, Huang X, 2010. MODIS collection 5 global land cover: Algorithm refinements and characterization of new datasets. Remote Sensing of Environment 114, 168–182. [Google Scholar]
  12. Fujii H, Koike T, Imaoka K, 2009. Improvement of the amsr-e algorithm for soil moisture estimation by introducing a fractional vegetation coverage dataset derived from modis data. Journal of The Remote Sensing Society of Japan 29, 282–292. doi: 10.11440/rssj.29.282. [DOI] [Google Scholar]
  13. Gray R, 2011. Entropy and Information Theory. Springer. [Google Scholar]
  14. Gruber A, Su CH, Crow WT, Zwieback S, Dorigo WA, Wagner W, 2016a. Estimating error cross-correlations in soil moisture data sets using extended collocation analysis. Journal of Geophysical Research: Atmospheres 121, 1208–1219. doi: 10.1002/2015JD024027.2015JD024027. [DOI] [Google Scholar]
  15. Gruber A, Su CH, Zwieback S, Crow W, Dorigo W, Wagner W, 2016b. Recent advances in (soil moisture) triple collocation analysis. International Journal of Applied Earth Observation and Geoinformation 45, 200–211. doi: 10.1016/j.jag.2015.09.002. [DOI] [Google Scholar]
  16. Gruhier C, de Rosnay P, Hasenauer S, Holmes T, de Jeu R, Kerr Y, Mougin E, Njoku E, Timouk F, Wagner W, Zribi M, 2010. Soil moisture active and passive microwave products: intercomparison and evaluation over a Sahelian site. Hydrology and Earth System Sciences 14, 141–156. doi: 10.5194/hess-14-141-2010. [DOI] [Google Scholar]
  17. Hauhs M, Lange H, 2008. Classification of runoff in headwater catchments: a physical problem? Geography Compass 2, 235–254. [Google Scholar]
  18. Jackson T, Schmugge J, Engman E, 1996. Remote sensing applications to hydrology: soil moisture. Hydrological Sciences 41, 517–529. [Google Scholar]
  19. Jackson T, Schmugge T, Wang J, 1982. Passive microwave sensing of soil moisture under vegetation canopies. Water Resources Research 18, 1137–1142. 10.1029/WR018i004p01137. [DOI] [Google Scholar]
  20. Jackson TJ, 1993. Measuring surface soil moisture using passive microwave remote sensing. Hydrological Processes 7, 139–152. [Google Scholar]
  21. Kerr Y, Waldteufel P, Wigneron JP, Martinuzzi JM, Font J, Berger M, 2010. The SMOS Mission: New Tool for Monitoring Key Elements of the Global Water Cycle. Proceedings of the IEEE 98, 666–687. [Google Scholar]
  22. Koike T, 2013. Soil moisture algorithm descriptions of GCOM-W1 AMSR2 (Rev A). Technical Report Earth Observation Research Center, Japan Aerospace Exploration Agency. [Google Scholar]
  23. Koster R, Suarez M, 1996. Energy and Water Balance Calculations in the Mosaic LSM Technical Report 104606. NASA. [Google Scholar]
  24. Koster R, Suarez M, Liu P, Jambor U, Berg AA, Kistler M, Reichle RH, Rodell MR, Famiglietti J, 2004. Realistic initialization of land surface states: Impacts on subseasonal forecast skill. Journal of Hydrometeorology 5, 1049–1063. Doi: 10.1175/JHM-387.1. [DOI] [Google Scholar]
  25. Kumar S, Peters-Lidard C, Santanello J, Reichle R, Draper C, Koster R, Nearing G, Jasinski M, 2015. Evaluating the utility of satellite soil moisture over irrigated areas and the ability of land data assimilation methods to correct for unmodeled processes. Hydrol. Earth Syst. Sci 19, 4463–4478. doi: 10.5149/hess-19-4483-2015. [DOI] [Google Scholar]
  26. Lakshmi V, 2013. Remote Sensing of Soil Moisture. ISRN Soil Science 2013, 33. doi: 10.1155/2013/424178. [DOI] [Google Scholar]
  27. Lange H, 1999. Are ecosystems dynamical systems? International Journal of Computing Anticipatory Systems 3, 169–186. [Google Scholar]
  28. Liu J, Zhan X, Zhao L, Jensen K, Wang X, 2012. NOAA Soil Moisture Operational Product System (SMOPS) and its soil moisture retrieval from AMSR-E and WindSat, in: 92nd American Meteorological Society Annual Meeting, New Orleans, LA. [Google Scholar]
  29. Liu Q, Reichle RH, Bindlish R, Cosh MH, Crow WT, de Jeu R, Lannoy GJMD, Huffman GJ, Jackson TJ, 2011a. The Contributions of Precipitation and Soil Moisture Observations to the Skill of Soil Moisture Estimates in a Land Data Assimilation System. Journal of Hydrometeorology 12, 750–765. doi: 10.1175/JHM-D-10-05000.1. [DOI] [Google Scholar]
  30. Liu Y, Parinussa R, Dorigo W, de Jeu R, Wagner W, Van Dijk A, McCabe M, Evans J, 2011b. Developing an improved soil moisture dataset by blending passive and active microwave satellite-based retrievals. Hydrol. Earth Syst. Sci, 425–436. [Google Scholar]
  31. Nearing GS, Yatheendradas S, Crow WT, Bosch DD, Cosh MH, Goodrich DC, Seyfried MS, Starks PJ, 2017. Nonparametric triple collocation. Water Resources Research 53, 5516–5530. doi: 10.1002/2017WR020359. [DOI] [Google Scholar]
  32. Njoku E, Jackson T, Lakshmi V, Chan T, Nghiem S, 2003. Soil moisture retrieval from AMSR-E. IEEE T. Geosci. Remote, 215–229 doi: 10.1109/TGRS.2002.808243. [DOI] [Google Scholar]
  33. Njoku E, Kong J, 1977. Theory for passive microwave remote sensing of near-surface soil moisture. J. Geophys. Res 82, 3108–3118. [Google Scholar]
  34. Njoku EG, Entekhabi D, 1995. Passive microwave remote sensing of soil moisture. Journal of Hydrology 184, 101–130. [Google Scholar]
  35. Oliva R, Martin-Neira M, Corbella I, Torres F, Kainulainen J, Tenerelli JE, Cabot F, Martin-Porqueras F, 2013. SMOS Calibration and Instrument Performance After One Year in Orbit. IEEE Transactions on Geoscience and Remote Sensing 51, 654–670. doi: 10.1109/TGRS.2012.2198827. [DOI] [Google Scholar]
  36. O’Neill P, Chan S, Njoku E, Jackson T, Bindlish R, 2012. Soil Moisture Active Passive (SMAP) Algorithm Theoretic Basis Document (ATBD); SMAP Level 2 & 3 soil moisture (passive) (L2_SM_P, L3_SM_P). Technical Report. Jet Propulsion Laboratory. [Google Scholar]
  37. Owe M, de Jeu R, Holmes T, 2008. Multisensor historical climatology of satellite-derived global land surface moisture. J. Geophys. Res 113. doi: 10.1029/2007JF000769. [DOI] [Google Scholar]
  38. Pachepsky Y, Guber A, Jacques D, Simunek J, van Genuchten M, Nicholson T, Cady R, 2006. Information content and complexity of simulated soil water fluxes. Geoderma, 253–266. [Google Scholar]
  39. Pan F, Pachepsky Y, Andrey G, Hill R, 2011. Information and Complexity Measures Applied to Observed and Simulated Soil Moisture Time Series. Hydrological Sciences Journal 56,1027–1039. [Google Scholar]
  40. Parinussa R, Meesters AGCA, Liu Y, Dorigo W, Wagner W, de Jeu RAM, 2011. Error Estimates for Near-Real-Time Satellite Soil Moisture as Derived from the Land Parameter Retrieval Model. IEEE Geoscience and Remote Sensing Letters 8, 779–783. [Google Scholar]
  41. Parrott L, 2010. Measuring ecological complexity. Ecological Indicators 10, 1069–1076. doi: 10.1016/j.ecolind.2010.03.014. [DOI] [Google Scholar]
  42. Rodell M, Houser PR, Jambor U, Gottschalck J, Mitchell K, Meng CJ, Arsenault K, Cosgrove B, Radakovich J, Bosilovich M, Entin JK, Walker JP, Lohmann D, Toll D, 2004. The Global Land Data Assimilation System. Bulletin of the American Meteorological Society 85, 381–394. [Google Scholar]
  43. Rudiger C, Calvet JC, Gruhier C, Holmes TRH, de Jeu RAM, Wagner W, 2009. An Intercomparison of ERS-Scat and AMSR-E Soil Moisture Observations with Model Simulations over France. Journal of Hydrometeorology 10,431–447. doi: 10.1175/2008JHM997.1. [DOI] [Google Scholar]
  44. Schaefer G, Cosh M, Jackson T, 2007. The USDA natural resources conservation service soil climate analysis network (SCAN). J. Atmos. Oceanic Technol 24, 2073–2077. [Google Scholar]
  45. Seneviratne S, Corti T, Davin E, Hirschi M, Jaeger E, Lehner I, Orlowsky B, Teuling A, 2010. Investigating soil moisture-climate interactions in a changing climate: A review. Earth-Sci. Rev 99, 125–161. [Google Scholar]
  46. Shannon C, 1948. A Mathematical Theory of Communication. Bell System Technical Journal 27, 623–666. doi: 10.1002/j.1538-7305.1948.tb00917.x. [DOI] [Google Scholar]
  47. Stoffelen A, 1998. Towards the true near-surface wind speed: Error modelling and calibration using triple collocation. Journal of Geophysical Research: Atmosphere 103, 7755–7766. [Google Scholar]
  48. Su CH, Ryu D, Crow WT, Western AW, 2014. Stand-alone error characterisation of microwave satellite soil moisture using a Fourier method. Remote Sensing of Environment 154, 115–126. doi: 10.1016/j.rse.2014.08.014. [DOI] [Google Scholar]
  49. Su CH, Ryu D, Western A, Wagner W, 2013. De-noising of passive and active microwave satellite soil moisture time series. Geophysical Research Letters, 3624–3630. [Google Scholar]
  50. Ulaby F, Moore R, Fung A, 1986. Microwave Remote Sensing, Active and Passive, vol. III: From Theory to Applications; Artech House, Boston, MA. [Google Scholar]
  51. Vinnikov KY, Robock A, Speranskaya N, Schlosser C, 1996. Scales of temporal and spatial variability of midlatitude soil moisture at different levels. J. Geophys. Res 101, 7163–7174. doi: 10.1029/95JD02753. [DOI] [Google Scholar]
  52. Wagner W, Hahn S, Gruber A, Dorigo W, 2012. Identification of soil moisture retrieval errors: Learning from the comparison of SMOS and ASCAT, in: 2012 IEEE International Geoscience and Remote Sensing Symposium, pp. 3795–3798. doi: 10.1109/IGARSS.2012.6350586. [DOI] [Google Scholar]
  53. Wagner W, Naeimi V, Scipal K, de Jeu R, Martinez-Fernandez J, 2007. Soil moisture from operational meteorological satellites. Hydrogeology Journal 15, 121–131. [Google Scholar]
  54. Wagner W, Scipal K, Pathe C, Gerten D, Lucht W, Rudolf B, 2003. Evaluation of the agreement between the first global remotely sensed sol moisture data with model and precipitation data. J. Geophys. Res 108. doi: 10.1029/2003jd003663. [DOI] [Google Scholar]
  55. Wang G, Garcia D, Liu Y, de Jeu R, Dolman AJ, 2012. A three-dimensional gap filling method for large geophysical datasets: Application to global satellite soil moisture observations. Environmental Modelling & Software 30, 139–142. doi: 10.1016/j.envsoft.2011.10.015. [DOI] [Google Scholar]
  56. Wen J, Su Z, Ma Y, 2003. Determination of land surface temperature and soil moisture from Tropical Rainfall Measurement Mission/Microwave Imager remote sensing data. J. Geophys. Res 108. doi: 10.1029/2002JD002176. [DOI] [Google Scholar]
  57. Yilmaz MT, Crow W, 2014. Evaluation of Assumptions in Soil Moisture Triple Collocation Analysis. Journal of Hydrometeorology 15,1293–1302. doi: 10.1175/JHM-D-13-0158.1. [DOI] [Google Scholar]
  58. Zwieback S, Dorigo W, Wagner W, 2013. Estimation of the temporal autocorrelation structure by the collocation technique with an emphasis on soil moisture studies. Hydrological Sciences Journal 58, 1729–1747. [Google Scholar]

RESOURCES