Skip to main content
NASA Author Manuscripts logoLink to NASA Author Manuscripts
. Author manuscript; available in PMC: 2018 Apr 27.
Published in final edited form as: Conf Rec Asilomar Conf Signals Syst Comput. 2017 Oct 29;2017:885–889. doi: 10.1109/ACSSC.2017.8335476

EXPLORATION AND DATA REFINEMENT VIA MULTIPLE MOBILE SENSORS BASED ON GAUSSIAN PROCESSES

Mohammad Shekaramiz 1, Todd K Moon 1, Jacob H Gunther 1
PMCID: PMC5918342  NIHMSID: NIHMS913791  PMID: 29706751

Abstract

We consider configuration of multiple mobile sensors to explore and refine knowledge in an unknown field. After some initial discovery, it is desired to collect data from the regions that are far away from the current sensor trajectories in order to favor the exploration purposes, while simultaneously, exploring the vicinity of known interesting phenomena to refine the measurements. Since the collected data only provide us with local information, there is no optimal solution to be sought for the next trajectory of sensors. Using Gaussian process regression, we provide a simple framework that accounts for both the conflicting data refinement and exploration goals, and to make reasonable decisions for the trajectories of mobile sensors.

1. INTRODUCTION

We consider the exploration problem using multiple mobile sensors. The initial problem is to locate regions where interesting phenomena occur then, having found interesting regions, to expend sensor capability on refining data in interesting regions while also continuing to search for other possibly existing interesting regions. There is thus after an initial discovery an intrinsic tradeoff between increasing knowledge by taking more measurements in regions known to be of interest, and increasing knowledge by exploring in regions where phenomena of interest (PoI) might possibly exist, but are not known to be present. Due to the uncertainties of exploration, the problem is not posed as one of optimal path planning, but a solution which balances the competing imperatives of refining measurements while exploring new territory.

The paradigm developed in this paper is applicable to general exploration problems, but to illustrate how the principles are used we consider a problem of space exploration in the vicinity of a planet. One of the main design steps in any space mission is to specify the trajectory of a satellite or a constellation of satellites [16]. The recent advances in small satellites have made it possible to perform missions with a launch of group of small satellites (swarm of satellites) but with an adaptive constellation [7]. For example, consider an exploration problem in which a swarm of low-cost small satellites are deployed and then the goal is to change their orbital planes in order to find, capture, and track the most interesting features of the PoI. The exploration problem with multiple mobile sensors that we investigate in this paper has application in such area. We assume here that measurements from the PoI are obtained directly from the current trajectories of mobile sensors. This differs from other sensor array configuration methods in remote sensing. For example, in [8] the configuration of sensors array in remote image formation was sought.

In this paper we devise a general exploratory framework for specifying the trajectory of sensor-bearing mobile sensors (e.g., satellites) in order to find and characterize the important features of phenomena in a region of interest. The approach adaptively makes decisions on the trajectories of mobile sensors based on balancing between refinement of the measurements and the desire of exploration. Our proposed framework consists of two main stages. The first stage is the prediction stage, where a Gaussian process regression (GPR) model is applied. Once some initial measurements are collected, the GPR model predicts the behavior of the PoI at the other unseen locations. GPs have been used as a powerful supervised learning tool for the regression and prediction problems [9]. (This is also referred to as kriging in the geo-statistics literature [10]). GPR models are applicable to a wide variety of problems such as the prediction and estimation of temperature, precipitation, missing pixel and un-mixing of pixels in hyper-spectral imaging (HSI), human head pose estimation, concentration of carbon dioxide in the atmosphere etc [9,1114]. The second stage of the proposed framework is the decision stage. Suppose that some measurements from the region have already been collected. Also, suppose the GPR model (first stage) has already predicted the PoI at the unseen locations. Now the question is how to decide on the next set of trajectories based on the information obtained from the previous stage. There is a fundamental tradeoff between repeating measurements in the same (or nearby) locations to obtain more informative measurements where it is known that interesting things are happening vs. making measurements in new locations (exploration) with the possibility of discovering additional interesting information. There is not sufficient information to obtain a solution a priori, since the locality of the available information precludes information about the unexplored areas. Here, we set up an optimization problem, using the estimates obtained from GPR, to make reasonable decisions for the trajectory of mobile sensors. Our approach provides a parameter to be tuned in either greater emphasis on refining data in known-interesting regions, or exploring new regions. It is also possible to adjust this parameter in accordance with the cost of maneuvering between different trajectories.

2. REVIEW OF GAUSSIAN PROCESSES

Suppose that there exist an unknown (and probably nonlinear) function representing the behavior of PoI. A common method for estimating such a function is to construct a parametric model that provides a good match with the observations. In contrast, GP is a non-parametric model where a probability distribution function can be defined as a prior over the set of unknown functions. In other words, GP defines distributions over functions in the function space and the inference is performed directly in the space of functions [9]. In GP it is assumed that the behavior of phenomena is governed by a stochastic process, in which each observation is an outcome of jointly distributed Gaussian random variables. GP provides a posterior distribution over the unknown function f(·) once some data are observed. The arguments of the function are collected into a variable set u, referred to as the input data. For a set of N observations with the input data set {u1, . . . , uN}, GP assumes that the distribution p(f(u1), . . . , f(uN)) is jointly Gaussian with mean μ(U) and covariance matrix K(U) given by [K]ij =κ(ui, uj),i, j =1, . . . , N, where κ, ·) is a kernel function and U :=[u1, . . . , uN]. The kernel function specifies the covariance between pairs of random variables at the corresponding data points. A GP model is denoted as f (U) ~ 𝒢℘(μ(U),K(U)), where f (U) :=[f(u1), . . . , f(uN)]T and μ(U) :=[μ(f(u1)), . . . , μ(f(uN))]T.

GP can be used as a regression tool, referred to as GPR. GPR treats the available data as the training data and predicts the behavior of the phenomena at the unseen data points. Let y = f (U)+ε be the observation model, where ε~N(0,σn2I) and yn = f(un)+εn,n = 1, . . . , N. We refer the pair (un, f(un)) to as the nth training data. The goal is to predict the underlying function f evaluated at other input data U i.e., inferring f (U), where U := [u,1, . . . , u,M ]T and f (U) := [f(u,1), . . . , f(u,M )]T. We refer to U as the input test data set. Based on GP modeling, the prior joint distribution between the training and test data can be expressed as [15]

[yf]=N([μyμf],[KKTKK]), (1)

where f denotes f (U). AlsoK:=K(U,U),K :=K(U,U), and K★★ :=K(U, U). Therefore, the predictive distribution over the test data can be expressed as follows

fU,y,U~N(μf,f), (2)

where

μf=μy+KT(K+σn2I)-1(y-μy)f=K-KT(K+σn2I)-1K. (3)

3. TRAJECTORY DETERMINATION OF MOBILE SENSORS

In the prediction setting, the collected data are treated as the training set, where the locations at which the data are collected are used as the input-training data and the corresponding measurements are referred to as the output-training data. The test data are then the other unseen locations over the region of under study and the output-test data are unknown and are required to be predicted using GPR. We assume that the locations of the training data (previously observed data) are collected into the set Us = {u1, . . . , uN1}. The output-training data are accumulated in y = [y1, . . . , yN1 ]T, where yn is the output corresponding to the input un. Furthermore, we collect the locations of the test data into the set U = {u,1, . . . , u,N2}. The unknown outputs evaluated at the input-test data are defined as f = [f(u,1), . . . , f(u,N2 )]T. As prior knowledge, we assume that the joint density function between the training and test data is zero-mean Gaussian, meaning that on the average we expect interesting phenomena occur very rarely. Suppose that the measurement noise for location on the field of interest is N(0,σn2). We further assume that the PoI has a smooth behavior, as often occurs in nature for distributed phenomena. We define the squared exponential covariance function to promote smoothness over the nearby regions as

K(ui,uj)=exp[-ui-uj222l], (4)

where l is a scale factor, and ui and uj are the coordinates of any two arbitrary locations in the region under investigation. As the available prior knowledge changes, one may define a different kernel function than (4). Let 𝒮 = {s(1), s(2), . . . , s(K)} denote the set of all feasible trajectories of the sensors over the PoI. We denote the set of locations along the trajectory s(k) at which measurements are collected by the set Us(k)={us(k)1,,us(k)P}. The locations in the set Us(k) belong to the set of test data i.e., Us(k)U. Using the GPR model (2) and the the kernel function (4), we obtain two set of information over the region of under study. One is the estimate of the interesting stuff and the second is the amount of uncertainty of our predictions. The uncertainty can be evaluated via the kernel function in (4). We denote (Us(k) ) and Σ̂(Us(k) ) as the estimate of the PoI and the measure of uncertainty (variance) along the trajectory s(k), respectively. This process is performed in the first stage of the proposed framework, the prediction-refinement stage.

The goal is to decide on the next trajectory of the sensors based on the available and estimated data and also our measure of uncertainty over the region in order to explore the interesting phenomena. The further away the estimates of the PoI are from the past measurements, the higher the corresponding variance becomes, resulting in less confidence in the estimates at those locations. Therefore, if we put emphasis on only the measurements, there is little impetus to continue exploration. In contrast, if we put more emphasis on the evaluated amount of uncertainty then the sensors are encouraged to choose the trajectories which are far away from the previous trajectories to fulfill only the exploration objective of the mission. In this case, even when interesting phenomena are found on previous trajectories, the sensors have preference to return, in the interest of exploring more. In order to incorporate this tradeoff, we set up an optimization problem. We define the functions

f^n(Us(k))=1Pp=1Pf^(us(k)p) (5)

as a measure of the data-refinement, and

^n(Us(k))=1Pp=1P^(us(k)p). (6)

as a measure of the desire for data exploration. These are combined in the decision functional

k=argmaxkf^n(Us(k))+λ(t)^n(Us(k)). (7)

Here, k denotes the best trajectory of the sensor from the set of all possible practical trajectories: The factor λ balances between data-refinement and exploration. Selecting large values for λ places more emphasis on the desire for exploration, while smaller values for λ emphasize refinement of knowledge. λ may depend on time. For example, with few measurements, the usual desire may be to put more emphasis on the exploration to get some sense of the PoI, while there is still available energy for unfocused exploration. That is, λ may be set to a large value depending on the affordable cost. Once more measurements over the PoI are collected, the value of λ may be decreased to focus more on data refinement compared to the exploration. We refer to the decided trajectory as Us(k). Finally, the data obtained from Us(k) is added to the training set U, and the whole process starts again.

4. SIMULATION RESULTS

In order to demonstrate how the framework works, we consider a simplified version of a space mission consisting of a constellation of two small steerable satellites for exploring a phenomena in some region of under study. Based on some prior knowledge the initial orbital planes for the constellation over the region of interest has been predetermined. The PoI remains essentially unchanged for the amount of time of the study. The sensors of the satellites take in situ measurement from the phenomena along the current trajectory of the satellites. Fig. 1(a) illustrates an example including the orbital planes of such satellites, where the rectangular shape shows the region of under study, and Fig. 1(b) shows a phenomena of interest (from [8]). In order to emphasize the PoI, we illustrate the PoI as shown in Fig. 1(c). For the simulation purposes the most and the least interesting stuff corresponding to the PoI are shown with yellow and blue, respectively. This image can be thought of as a discretized version of the region of interest defined by the pixel values. Here, without loss of generality and for the simplification purposes, we assume that the PoI has the same profile as shown in Fig. 1(c) along the z-axis over the rectangular region shown in Fig. 1(a). Based on the trajectories determined via the initial constellation, shown in Fig. 1, the corresponding set of obtained measurements is illustrated in Fig. 4. Once the measurements are obtained, we apply the GPR model defined in (2) and (4) to estimate the behavior of the PoI and the measure of uncertainty at the un-sampled locations. In the simulations, we set l=10 in the kernel function (4). Fig. 4 also illustrates the estimation of the PoI and the measure of variance over the region of under study, using the GPR interpolation. The variance along the initial trajectories is set to zero, as shown by the dark blue color in the middle plot of Fig. 4. The variance increases with increasing distance away from the measured trajectories.

Figure 1.

Figure 1

An example showing the satellites orbits and the region of interest.

Figure 4.

Figure 4

From left to right: Initial measurement, measure of variance, and reconstruction of PoI.

In order to make comparison, we define four different cases and the trajectories are determined using (2) and (7). In case (1), we set λ = 0.25, which places less emphasis on exploring the whole field. In Fig. 5, from left to right, we illustrate the measurements corresponding to the previous and current trajectories, the measure of amount of uncertainty based on the measurements, and the reconstruction of the field based on the estimation obtained from the available data. Fig. 6 illustrates the results obtained after making 13 successive decisions for the trajectory of satellites for the case where λ = 0.25. The legend Tri in our figures denotes the ith determined trajectory. We observe in Fig. 6 that all the selected trajectories are in the vicinity of the initial trajectories. The reason for these selected trajectories is because the initial trajectories had found some interesting phenomena and also λ was set to a small value.

Figure 5.

Figure 5

Measurements, measure of uncertainty, and reconstruction using the initial and the first two determined trajectories for case (1) i.e., when λ = 0.25.

Figure 6.

Figure 6

Measurements, measure of uncertainty, and reconstruction after 13 successive trajectories for case (1) i.e., when λ = 0.25.

In case (2), we set λ = 2in order to put more emphasis on the exploration rather than data refinement. Fig. 7 illustrates some of the results obtained. As we expected, we observe in Fig. 7 that since λ is set to a large value, the first specified trajectories are further away from the initial trajectories to satisfy the impetus to exploration. We also see that the trajectories provide a kind of quasi-uniform sampling over the region of under study — exploration occurs while at the same time reducing uncertainty over the region.

Figure 7.

Figure 7

Measurements, measure of uncertainty, and reconstruction after 13 successive trajectories for case (2) i.e., when λ = 2.

In case (3), we vary λ starting from 2 and the decrease rate of 0.7 (λ[t+1] = 0.7λ[t]), meaning that first the exploration desire is emphasized and then once we obtain some sense about the phenomena, λ decreases to emphasize refining information where information is already roughly known. Fig. 8 illustrates some of the results obtained.

Figure 8.

Figure 8

Measurements, measure of uncertainty, and reconstruction based on the 13 successive trajectories for case (3).

In case (4), we make λ dependent on the cost to do the maneuvering and orbit change. The more maneuvering we can afford, the higher the desire to explore will be. This means that if the total remaining available thrust is high, we set λ to a large value and vice versa. In practice, changing the constellation and maneuvering depends on the delta-V budget, which is provided by the thrust of the rocket engine of the satellites. For simulation purposes, we create a scenario with synthetic data such that initially we set λ(0) = 2 corresponding to the maximum affordable cost for the maneuvering. As the trajectories are determined successively and the maneuvering is performed, the maximum affordable is reduced and as a result λ also decreases. Fig. 9 illustrates some of the results.

Figure 9.

Figure 9

Measurements, measure of uncertainty, and reconstruction based on the 13 determined trajectories for case (4).

In Fig. 2, the first two plots show the value chosen for λ for solving (7) and the total affordable cost for changing the trajectories, respectively. Initially, since the affordable cost is large, we also have a large value for λ, meaning that we can afford to select the trajectories which satisfy the exploration desire over the region. The bottom plot in Fig. 2 shows the cost to perform a trajectory change. Comparing all the four cases, it turns out that for the PoI, the best performance belongs to case (1), where λ=2. Notice that such case neglected the constraint on the cost for performing the maneuvering. The reason for having the best performance in case (1) compared to the other cases is because of having some sort of uniform sampling all over the region of under study, the contiguity behavior of the true PoI, and our modeling of the contiguity in the kernel function of the GPR. Finally, Fig. 3 provides a more detailed results for all the four cases. According to the scenarios we considered, it turns out that emphasis on exploration provides more information about the PoI. For example, in the left plot of Fig. 3 the most reduction on the measure of uncertainty over the region belongs to case (2), where λ=2 in case of having no constraint on the cost. But, once we constraint the decision making on the cost, the best performance belongs to case (4). This is due to the fact that for case (4) was initialized to λ[0] =2 and then it was reduced more slowly than case (3). As a conclusion, in cases where the PoI has a very smooth behavior, solving the sensor trajectory problem for satisfying both the exploration and data refinement desires via the proposed framework seems to provide encouraging results.

Figure 2.

Figure 2

From top to bottom, showing the change of emphasizing factor on exploration, affordable cost, and cost of change in trajectories.

Figure 3.

Figure 3

Comparison of all the four case scenarios (1–4) in terms of variance, reconstruction error, and peak signal-to-noise ratio.

5. CONCLUSION

The general problem of exploration using multiple mobile sensors was considered. We focused on the case where desire is to get as much information as possible from the phenomena of interest based on the local information in a sub-optimal fashion. Using GPR, we provided a framework that is able to balance between the desire of exploration and data refinement.

Acknowledgments

This work is supported in part by NASA Grant NNX13 AD 39 A.

References

  • 1.Han L, Li C, Zeng Y. Study on satellite orbit design for boundary environment monitoring. Int. Conf. on Remote Sensing, Environment and Transportation Engineering (RSETE); Nanjing. 2011. pp. 783–786. [Google Scholar]
  • 2.Píriz R, Peiró BM, Merino MR. The Galileo costellation design: A systematic approach. 18th International Technical Meeting of the Satellite Division; Sep, 2005. pp. 1296–1306. [Google Scholar]
  • 3.Beste DC. Design of satellite constellations for optimal continuous coverage. IEEE Trans on Aerospace and Elec Syst. 1978 May;14:466–473. [Google Scholar]
  • 4.Walker JG. Continuous whole-Earth coverage by circular-orbit satellites pattern. Royal Aircraft Establishment, Technical Report 77044. 1977 [Google Scholar]
  • 5.de Weck OL, de Neufville R, Chaize M. Staged deployment of communications satellite constellations in low Earth orbit. Journal of Aerospace Computing, Information and Communication (JACIC) 2004 Mar;1:119–136. [Google Scholar]
  • 6.de Weck OL, Scialom U, Siddiqi A. Optimal reconfiguration of satellite constellations with the auction algorithm. ACATA Astro. 2008;62:112–130. [Google Scholar]
  • 7.Schilling K. Earth observation by distributed networks of small satellites. Int. Conf. on Instrumentation, Communications, Information Technology, and Biomedical Eng. (ICICI-BME); 1–3, Nov. 2009. [Google Scholar]
  • 8.Sharif B, Kamalabadi F. Optimal sensor array configuration in remote image formation. IEEE Trans Image Proc. 2008 Feb;17:155–166. doi: 10.1109/TIP.2007.914225. [DOI] [PubMed] [Google Scholar]
  • 9.Rasmussen C, Williams C. Gaussian Processes in Machine Learning. MIT Press; 2006. [Google Scholar]
  • 10.Isaak EH, Srivastava RM. An Introduction to Applied Statistics. Oxford University Press; 1989. [Google Scholar]
  • 11.Wu T, Li Y. Spatial interpolation of temperature in the United States using residual kriging. Applied Geography. 2013;44:112–120. [Google Scholar]
  • 12.Ranganathan A, Yang MH, Ho J. Online sparse Gaussian process regression and its applications. IEEE Trans on Image Proc. 2011 Feb;20:391–404. doi: 10.1109/TIP.2010.2066984. [DOI] [PubMed] [Google Scholar]
  • 13.Xing Z, Zhou M, Castrodad A, Sapiro G, Carin L. Dictionary learning for noisy and incomplete hyper-spectral images. SIAM J Imag Sci. 2012;5(1):33–56. doi: 10.1109/TIP.2011.2160072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Holdaway MR. Spatial modeling and interpolation of monthly temperature using kriging. Climate Research. 1996;6:215–225. [Google Scholar]
  • 15.Murphy KP. Machine Learning: A Probabilistic Perspective. MIT Press; 2012. [Google Scholar]

RESOURCES