Box 1.
As an example, consider a project in which volunteer recorders report detections and non-detections of an alien species at a large number of locations. Rather than inferring whether a location is occupied (i.e., the alien species is present) for each location individually, such data may be modelled using hierarchical parameters that govern the distribution of occupied locations. For instance, one might introduce the hierarchical parameter that reflects the fraction of locations that are occupied. To illustrate this, consider a project in which volunteer recorders visit locations times each. Let reflect the number of visits at location that resulted in a detection, and the remaining in a non-detection. Let us further denote by wheter location is occupied ( or not ( and by and the false negative and false positive detection rates, respectively. Under this model,
As an example, we consider the case with visits per location, and . As shown in Fig. 4A, accurately identifying occupied locations is difficult under these parameters: the most likely data at occupied locations is , which is almost equally likely to get at non-occupied locations as well. To infer the hierarchical parameters , and , we integrate out to obtain the relevant likelihood
In Fig. 4B,C, we show Bayesian estimates of the parameters , and from data simulated at , or locations, confirming that these hierarchical parameters can be inferred rather accurately if sufficient locations were surveyed. Importantly, however, error rates can only be accurately learned if there are enough sites with multiple detections and hence sufficiently many visits. For a fixed number of visits, estimation errors are therefore minimized for intermediate number of visits per location, for the error rates chosen here at about (Fig. 4D). |