Skip to main content
PLOS One logoLink to PLOS One
. 2022 Aug 30;17(8):e0271896. doi: 10.1371/journal.pone.0271896

Bias in product availability estimates from contraceptive outlet surveys: Evidence from the Consumer’s Market for Family Planning (CM4FP) study

Brett Keller 1, Dale Rhoda 2, Caitlin Clary 2, Claire Rothschild 3, Mark Conlon 4, Paul Bouanchaud 5,*; CM4FP Group
Editor: Manoj Kumar6
PMCID: PMC9426883  PMID: 36040979

Abstract

Area-based sampling approaches designed to capture pharmacies, drug shops, and other non-facility service delivery outlets are critical for accurately measuring the contraceptive service environment in contexts of increasing de-medicalization of contraceptive commodities and services. Evidence from other disciplines has demonstrated area-based estimates may be biased if there is spatial heterogeneity in product distribution, but this bias has not yet been assessed in the context of contraceptive supply estimates. The Consumer’s Marker for Family Planning (CM4FP) study conducted censuses and product audits of contraceptive outlets across 12 study sites and 2–3 rounds of quarterly data collection in Kenya, Nigeria, and Uganda. We assessed bias in estimates of contraceptive product availability by comparing estimates from simulations of area-based sampling approaches with census counts among all audited facilities for each study site and round of data collection. We found evidence of bias in estimates of contraceptive availability generated from simulated area-based sampling. Within specific study sites and rounds, we observed biased sampling estimates for several but not all contraceptive method types, with bias more likely to occur in sites with heterogeneity in both spatial distribution of outlets and product availability within outlets. In simulations varying size of enumeration areas (EA) and number of outlets sampled per EA, we demonstrated that the likelihood of substantial bias decreases as EA size decreases and as the number of outlets sampled per EA increases. Straightforward approaches such as increasing sample size per EA or applying statistical weights may be used to reduce area-based sampling bias, indicating a pragmatic way forward to improve estimates where design-based sampling is infeasible. Such approaches should be considered in development of improved methods for area-based estimates of contraceptive supply-side environments.

Introduction

Outlet surveys of contraceptive availability that use area-based sampling are a common source of estimates for family planning indicators. When availability of family planning commodities at outlets is spatially clustered, and this is not accounted for in area-based sampling strategies, there is potential for biased estimates. Using only data from a survey of outlets that employed a two-stage area-based sampling approach, it is not possible to assess such bias within the survey itself. Outlet census data from the Consumer’s Market for Family Planning (CM4FP) study provides an opportunity to explore such bias through simulation approaches, and to test avenues for mitigating it.

Large-scale health service assessments, such as the Demographic and Health Survey’s (DHS) Service Provision Assessment and the World Health Organization’s Service Availability and Readiness Assessment surveys, are widely used for estimating health systems’ readiness to provide contraceptive services [13]. Such assessments rely on a master facility list that enumerates public and private health facilities in the country (or geographic region of interest) as a sampling frame, thereby restricting findings to “brick-and-mortar” health facilities [4] and often only to facilities that are formally registered. As a result, supply-side estimates may lack external validity to non-facility outlets; internal validity may also be threatened due to selection bias if unregistered private facilities are less likely to be captured on master facility lists.

With rapid regulatory changes and delivery channel innovation in the field of sexual and reproductive health (SRH), there is growing interest in measuring service delivery outside of health facilities, such as in pharmacies, drug shops, and by informal vendors [5]. This presents challenges to traditional health facility assessment methodologies [4, 6] since these contraceptive outlets are rarely–if ever–captured in a comprehensive list that can be used as a design-based sampling frame.

While previous studies have measured the total contraceptive market in specific localities using censuses [7], thereby precluding the need for area-based sampling, identifying and surveying every service outlet is not a feasible approach for national, multi-year survey programs. The Performance Monitoring for Accountability (PMA) project has addressed this challenge by using a novel, area-based sampling approach to capture information on contraceptive service provision. PMA selects a sample of enumeration areas (EAs) for a household survey and collects data on contraceptive outlets related to those EAs. PMA first conducts a census of all contraceptive outlets located within each EA. This census is used as a sampling frame to select and survey a fixed quota of up to three private and non-facility outlets within each EA. In addition, all public facilities whose catchment area includes the selected EA are included in the sample of public facilities even if the facility itself is outside the boundaries of the EA [810]. The PMA “[service delivery point (SDP)] sample thus reflects the services available to a representative population, rather than being representative of all SDPs in the country” [9].

Simple or stratified random sampling of outlets is comprehensive, but requires continuously updated, comprehensive master facility lists (MFLs) of all public and private outlets, which often do not exist [11]. MFLs may not document private outlets as comprehensively as other types of outlets [12]. Such designs are therefore often restricted to public facilities or do not include outlets that are not formal health facilities. Area-based sampling designs such as the approach developed by PMA are thus necessary for measuring the complete contraceptive service environment, particularly in settings with rapid de-medicalization of contraceptive products and care. However, there is relatively little evidence of the validity and precision of estimates of contraceptive availability that are generated from area-based–or spatial–sampling approaches [13]. Area-based sampling approaches, like simple random sampling, are based on the key assumption that sampling units–for example, facilities included in a master facility list–are independent and identically distributed [14]. When sampling within spatially defined EAs, heterogeneity in the spatial distribution of sampling units within the EA or in the spatial distribution of product availability within those outlets could introduce bias into sampling estimates even if there is no bias in the selection of the EAs. Sampling bias due to spatial heterogeneity has been detected in other disciplines such as environmental health and conservation science [15, 16].

Previous studies have documented spatial heterogeneity in SRH service provision. For example, mapping of facility-based SRH services in four districts in rural Mozambique revealed strong spatial heterogeneity in clinic placement within districts [17, 18], and analysis of health facilities in Nigeria found spatial inequalities between facility placement and population size [19]. However, to our knowledge, spatial-sampling bias has not been described in the context of family planning (FP) provision. Understanding bias in area-based sampling approaches is critical for advancing methodologies that measure service availability and readiness in mixed health systems. Using data from the CM4FP study, a census and longitudinal cohort of all public and private outlets offering contraceptive services within specific geographies in Kenya, Nigeria, and Uganda, we compared estimates of contraceptive product availability generated from area-based sampling simulations to the study’s census counts to explore the magnitude of spatial sampling bias and to describe spatial characteristics associated with bias.

Methods

CM4FP study design

The Consumer’s Market for Family Planning (CM4FP) project was a multi-round longitudinal family planning (FP) outlet census with an accompanying repeated cross-sectional household survey of women aged 18 to 49. The present analysis uses only the FP outlet (supply-side) data. The CM4FP methodology and data are outlined here but have been described in detail elsewhere [20]. The study aimed to test the feasibility and utility of a range of novel and modified approaches to data collection for understanding the supply and the demand sides of the FP markets in predominantly urban and semi-urban sites in Nigeria, Uganda, and Kenya. This study design allowed for directly linking FP users to the outlets where they obtained their most recent FP method. The study also aimed to represent the full FP supply environments to which the sampled consumers had access. CM4FP collected outlet data on a quarterly basis from four sites in each country between 2019 and 2020. The study focused on urban and semi-urban areas to better understand the total FP market, particularly the private sector, in these zones. In each country, four sites were selected from within an urban area of different size (large, medium, small, and semi-urban). In Uganda, there was one site in a rural area instead of in a semi-urban one.

In each site, CM4FP delineated an outer ring, consisting of contiguous administrative wards or parishes to measure the FP supply-side total market. The geographical boundaries for the outer ring at each site encompassed one or more contiguous wards (Kenya and Nigeria) or parishes (Uganda), that were completely censused to measure the total market for FP products and services within each ring-fenced area. To determine the geographic boundary of the outer rings, an initial target area was selected to capture a total of 600 outlets across all sites in each country. The number of outlets included in the study was not statistically predetermined, but instead based on pragmatic considerations allowing for a deep dive into localized family planning markets within budget and time constraints. CM4FP’s supply-side outlet census dataset includes outlet and provider characteristics as well as longitudinal data on FP service provision and FP products (including product type, brands, price, availability and current and past stockouts) collected from quarterly product audits in all FP outlets in each study geography.

The CM4FP outlet survey builds on the FPwatch study design, which also conducted an FP outlet census by enumerating, mapping, and surveying all outlets (of all sectors and levels) that offered FP methods and/or services of all types (but excluding outlets that offer only male condoms) [7, 21]. As in FPwatch, CM4FP FP outlets in the study included hospitals, medical centers, clinics, health centers, pharmacies, and drug shops/chemists/Patent and Proprietary Medicine Vendors (PPMVs), but in an extension to FPwatch, CM4FP collected longitudinal data from outlets, returning to censused outlets quarterly. Outlets were eligible for inclusion in the census of FP product and service providers if they had stocked at least one modern FP method (aside from male condoms) or offered any FP services during the past three months. Public and private health provider and health retail outlets of all types within the outer ring, including hospitals, health facilities, pharmacies, patent and proprietary medicine vendors (PPMVs), and drug shops, were screened for inclusion. Outlets that served the military but not the general public were excluded, as were general retailers, bars, hotels, and brothels where only condoms are typically available. In the Lagos and Abia sites in Nigeria, a small number of general retailers/supermarkets offered oral contraceptive pills and/or emergency contraceptive pills, so these outlet types were screened and included if eligible. In the overall CM4FP study, some CHWs were included in the outlet census, but in our analyses, all CHWs are excluded as they did not have a specific geographical location. We also focus on the private sector, and so exclude public facilities from our analyses.

Product audits were conducted for all available contraceptive method and brand/formulation combinations offered and in-stock on the date of the survey. Audited contraceptive products included all modern contraceptive methods, such as male and female condoms, oral contraceptive pills (OCP], emergency contraceptive pills (EC), contraceptive injectables, implants, and hormonal and copper intrauterine devices (IUDs).

While CM4FP also included cross-sectional survey data on women from households in a smaller designated survey area (the inner ring), this paper draws only on the CM4FP supply side outlets (collected across the outer ring). In this paper, “CM4FP study site” is used to describe the outer ring. The study collected GPS coordinates for all outlets and households included in the study. To ensure anonymity and protect the privacy of household respondents, all geographic information below county, state, or district level has been removed from the publicly available data. This paper includes graphs that depict the real spatial relationship between outlets, but to keep outlet identities confidential, map directions (north/south and east/west) have been flipped in some cases, and the actual boundary of the CM4FP study site, which combines contiguous administrative units, has been replaced with a pseudo-boundary to prevent visual identification. By obscuring study site boundaries, spatial relationships between outlets and of the availability of FP products at those outlets have been preserved while minimizing the risk of identifying specific outlets, which could identify specific matched households. More information about the methodology used and data available may be found on the project website (www.cm4fp.org).

Research ethics and participant consent

Ethical approval was provided by the PSI Research Ethics Board (01.2019 and 04.2019), the AMREF Ethics & Scientific Review Committee in Kenya (P615-2019), the National Health Research Ethics Committee of Nigeria (NHREC/01/01/2007-27/05/2019), the Uganda National Council for Science and Technology review board (SS 5041 and SS 5104), and the Mildmay Uganda Research Ethics Committee (1105–2019). Informed consent was obtained from all household and outlet/CHW survey respondents prior to conducting study procedures. To protect the identify of participants, consent was obtained verbally, except in Uganda where consent was written as mandated by the in-country review board. Verbal consent was witnessed and recorded by fieldworkers using electronic data collection devices. The following protocol deviations occurred, and were reported to relevant ethics boards during the study: modifications to study incentive strategy and timing; outlet and household mapping and sampling modified during implementation; an extension of period of retention of GPS coordinate data beyond planned timeframe in original protocol; changes to study staffing.

Inclusivity in global research

Additional information regarding the ethical, cultural, and scientific considerations specific to inclusivity in global research is included in the S1 File.

Creation of pseudo-EAs

Within each study site we created pseudo-EAs for sampling simulations. The study sites were selected from contiguous administrative units and varied substantially in area and estimated population size. If digitized EA boundaries from the national population census had been available to the study team for each study site, they could have been used in the simulation. Because the true EA boundaries were not available for all sites in a digital format, we generated pseudo-EAs of six sizes, each of which was relative to the overall size of the CM4FP study site. The size of the square pseudo-EAs generated is depicted in Fig 1 with the overall study site pseudo-boundary in blue (Fig 1).

Fig 1. Pseudo-EA sizes in medium urban site in Kenya.

Fig 1

Pseudo-EAs are depicted as black squares, while the CM4FP study site pseudo-boundary is depicted in blue. In each study site, six pseudo-EA sizes were created, ranging from huge (three entire EAs fit across the study site) to tiny (24 EAs fit across the study site), with the same ratios being used to define these categories of pseudo-EA for each study site. Because the study sites vary in area and population, the geographic footprint and estimated average population of these pseudo-EA sizes also vary across study sites.

Simulation approach

To test the potential for bias in different sampling approaches, a simulation was employed (Fig 2). The simulation was conducted using data from four sites each in Kenya, Nigeria, and Uganda, three rounds of data collection, six sizes of pseudo-EAs (termed huge, large, medium, small, smaller, and tiny), four sampling schemes (selecting either 3, 6, 9, or 12 outlets per pseudo-EA), and for eight common FP product availability indicators (male condoms, OCP, EC, injectables, implants, and copper IUDs, plus composite indicators of 3 or more and 5 or more methods of any type, including those not among the 8 common products evaluated).

Fig 2. Structure of simulation to create pseudo-EAs, sample outlets, and estimate product availability.

Fig 2

This schematic describes the structure of the simulation used to create pseudo-EAs and repeatedly draw samples within them to calculate indicators within each study site, study round, and size of pseudo-EA.

Product availability estimates from surveys were calculated using repeated draws within each site from a frame of geolocated private outlets where product availability was known from each round of CM4FP data collection. The simulation was restricted to private outlets to best approximate the private sector sampling methods of FP outlet surveys such as PMA. Pseudo-EA boundaries were simulated ten times for each combination of study site, study round, and pseudo-EA size, with boundaries jittered in latitude and longitude for each simulation. For each of the ten sets of pseudo-EAs of a given size, four different sampling schemes were simulated (3, 6, 9, or 12 outlets selected per pseudo-EA), and product availability estimates were produced for each contraceptive method. This approach resulted in 240 estimates per product per round per site (6 pseudo-EA sizes x 10 iterations x 4 sampling schemes).

The estimates were compared with the true or census count of private outlet product availability from all CM4FP data for that site and round. Bias estimates were calculated for each study site, round, and contraceptive method by comparing the census count with the mean availability estimate for each of six pseudo-EA sizes and four sampling schemes. The simulation and analysis were conducted with Stata v17.

Results

Study site characteristics

Characteristics of the four CM4FP study sites per country are described in Table 1. Each site was located within a portion of a different urban area and in a different county, state, or district (indicated in Table 1). In Kenya, the number of outlets included in the census ranged from 66 to 239 in the semi-urban site and the medium urban site, respectively. In Nigeria the number of outlets surveyed ranged from 86 to 165. In Uganda the number of outlets surveyed ranged from 147 to 157 in three urban study sites, while the rural site had 14 outlets surveyed.

Table 1. Characteristics of CM4FP study sites and estimated population size of pseudo-EAs.

Country Study site CM4FP study site (outer ring) characteristics Median population size estimate for different size pseudo-EAsc
Number of outlets surveyed a Area (km2) Outlets per km2 Population size (estimate)b Population density (pop per km2) Outlets per 10,000 population (estimate) Huge Large Medium Small Smaller Tiny
Kenya Large urban (Nairobi County) 223 14.3  15.6 281,927 19,715 7.9 47,379 11,515 6,890 3,884 1,714 919
Medium urban (Nakuru County) 239 35.6 6.7 269,572 7,572 8.9 50,635 10,263 4,393 1,888 781 422
Small urban (Kilifi County) 81 307.3 0.3 163,495 532 5.0 21,164 4,334 1,869 967 424 226
Semi-urban (Migori County) 66 84.3 0.8 56,822 674 11.6 4,368 1,070 460 263 115 64
Nigeria Large urban (Lagos State) 150 7.3 20.5 195,204 26,740 7.7 34,880 9,395 4,155 2,327 988 517
Medium urban (Kaduna State) 165 22.3 7.4 205,626 9,221 8.0 24,554 7,037 3,016 1,566 719 370
Small urban (Abia State) 136 5.0 27.2 82,090 16,418 16.6 10,163 2,496 1,161 634 323 185
Semi-urban (Niger State) 86 29.4 2.9 169,241 5,756 5.1 21,438 5,862 2,336 1,237 478 246
Uganda Large urban (Kampala District) 151 6.2 24.4 106,408 17,163 14.2 13,518 3,464 1,471 858 334 171
Medium urban (Mbarara District) 147 11.8 12.5 69,042 5,851 21.3 12,702 3,237 1,396 762 325 171
Small urban (Gulu District) 157 33.3 4.7 153,150 4,599 10.3 32,447 7,109 2,917 1,558 644 354
Rural (Soroti District) 14 54.5 0.3 11,678 214 12.0 2,088 480 219 125 53 28

a. Outlet numbers reported in this table include only static outlets and exclude Community Health Workers, who were included in the CM4FP data collection but who were excluded from the spatial analysis reported in this paper.

b. Population was not directly measured for the CM4FP study site outer ring. Figures were estimated by summing the estimated population in the study site using data from worldpop.org. Estimates are constrained to match 2020 United Nations population estimates at the country level and further constrained to only assign population to settled portions of the countryside.

c. Pseudo-EA sizes are relative to the size of the CM4FP study site and thus not equivalent across sites. Population estimates for pseudo-EAs were generated using the same technique as in footnote b, but applied to the pseudo-EA boundaries, to provide an approximate measure of EA population size for comparison with other sampling approaches.

Areas and population estimates for the study sites and corresponding pseudo-EAs are presented in Table 1. Because of differences in study site scale and population density, different sizes of pseudo-EA likely correspond most closely to the true scale and population of census EAs that would be used for sampling of other outlet surveys, such as PMA. We estimated CM4FP study site and pseudo-EA population sizes (Table 1) using data from worldpop.org, with a spatial resolution of 100x100m. [2224] Further detail on the approach taken is provided in the S2 File.

Census of FP product availability for comparison with simulation

Fig 3 illustrates the variability of FP product availability within the CM4FP census data (Fig 3). The top panel shows availability for each of four different FP products at the same outlets during the same round of data collection, in the Nigeria semi-urban site during round 2. Oral pills were available in 57% of outlets, while implants were available in 6%. The bottom panel of Fig 3 illustrates a different kind of availability, showing that availability of a single product (injectables) varied from 15% of outlets in the small urban site to 33% in the large urban site during round 1 data collection in Nigeria. The spatial pattern of availability varies by product within the same site and round and varies by site within the same product and round.

Fig 3. Variability in contraceptive product availability, by product type and by study site.

Fig 3

These maps show which outlets within the CM4FP study site have specific FP products, from a census of all outlets with contraceptive products available beyond just male condoms. Grey circles represent outlets without a product, red pluses represent outlets with a product, and the grey boundary denotes the CM4FP study site pseudo-boundary. The top panel illustrates spatial heterogeneity of availability varying across products. The bottom panel illustrates spatial heterogeneity of availability varying across the same product in different study sites.

Simulation results

Simulation results were summarized using scatter plots called cone plots where the y-axis represents number of outlets in the estimate and the x-axis represents product availability, which can range from 0–100% (Fig 4). The census count sits at the apex of the cone, indicated with a red triangle. Each of the four sets of scattered points beneath the apex of the cone represents a different sample size of outlets sampled per pseudo-EA (either 3, 6, 9, or 12).

Fig 4. Simulated sampling bias cone plots.

Fig 4

Each scatter plot depicts the census (true) availability measure as a red triangle, along with point estimates of availability from simulated samples that drew 3, 6, 9, and 12 outlets per EA. Simulations with varying sample size per EA are arranged from smallest sample (n = 3 per EA) in blue at bottom of cone to largest sample (n = 12 per EA) in red at top of cone. In scenarios where simulated samples show systematic bias, the points form an asymmetric shape. When bias of the simulated metric exceeds 5%, a downward-facing arrow denotes the simulated average at the top of the plot. This illustrates a general finding from the simulations: bias often occurred for some, but not all, indicators.

In cases where the four sets of scatter points are unbiased, they should form a wide-based symmetrical cone with a narrow tip at or near the census count. The decreasing width of the cone as the number of outlets sampled per pseudo-EA increases represents decreasing variance with increased sample size. Scenarios with systematic bias form an asymmetric shape, still wider at the base than at the top, but shaped more like a right triangle where most, or possibly all the mass of the cone falls to the left or right of the census value.

Each Fig 4 cone plot within the top panel represents samples drawn using different sizes of pseudo-EAs. Each cone plot in the bottom panel represents a different product within the same pseudo-EA. If outlets were spaced homogeneously and product availability were also spatially homogeneous, then the cone plots would all be symmetric. When outlets and the products are distributed with heterogeneous density, the sample estimates can exhibit systematic bias–either biased too high or too low. When bias in the simulated estimate is greater than 5% (a threshold chosen arbitrarily to represent a non-negligible level of bias), it is denoted by a downward-facing arrow (V) at the top of the cone plot where the point of the triangle aligns with the mean simulated proportion.

The top panel of Fig 4 plots 240 simulated availability estimates for male condoms in the Kenya small urban site, round 2. All six cone plots exhibit notable bias, where the sample-based estimates are more likely to be below the census figure than above it. When the pseudo-EAs are huge, the number of outlets in each sample is relatively small, so the variance is large. As pseudo-EA size decreases, the number of pseudo-EAs increases so the number of outlets per sample increases and the variance diminishes but the sample bias remains. Even when the pseudo-EA size is tiny, most estimates fall below the census figure, but the average bias is still greater than 5% when only 3 outlets are sampled per EA.

The bottom panel of Fig 4 plots 40 availability estimates for each of six different products in the Kenya semi-urban site, round 1, holding the pseudo-EA size fixed at large (meaning that about six entire pseudo-EAs would fit across the CM4FP study site). In this panel, the simulated estimates exhibit bias for the three products in the top row (stocking three or more methods, male condoms, and OCPs), and those in the bottom row (stocking five or more methods, injectable, and implant) do not. For some product availability estimates to be biased and others to not be biased means that the bias is not only a function of the density or distribution of outlets but must also be related to spatial heterogeneity of the products across the outlets.

We explored the nature of the heterogeneities of outlet distribution and product availability that may result in bias in simulated samples by plotting a histogram, line graph, and cone plot for a specific CM4FP study site (Fig 5); each are explained in turn below.

Fig 5. Bias as a function of outlet concentration and heterogeneity in product availability.

Fig 5

In the left panel, outlet density is more constant than in many distributions (A), but the availability of OCP does not vary with density (B), and sampling bias is therefore low (C). In the right panel, outlet density varies more starkly, producing a clearly bimodal distribution (D). When product availability is heterogeneous (E), bias is seen in the simulation cone plot (F). When product availability is homogenous (G), bias is not seen in the simulation cone plot (H). Bias in the simulated results requires both outlet density heterogeneity and product availability heterogeneity.

Spatial concentration of outlets at each site was examined using a histogram of the number of other outlets located within 2km of each outlet. Perfect homogeneity would be represented by a single tall bar in the histogram (e.g., every outlet has 10 other outlets within 2km). Heterogeneity is characterized by spread in the histogram and by heavy right or left-side tails in the distribution (i.e., where many outlets have many other outlets within 2 km and many outlets have very few within 2 km) (Fig 5).

Even with widely varying densities of outlets, it would be possible to have a homogenously available product (e.g., for every outlet, 70% of the outlets within 2km carry the product). To characterize heterogeneity of product availability overlaid on outlet concentration, we plotted the percentage of outlets within 2 km of each outlet that have the product available versus outlet concentration in line graphs (Fig 5). Blue dots represent observed values for each outlet with a blue line of best fit. The horizontal red line indicates the availability of the FP product at the census proportion count, assuming a homogenous spatial distribution. If product availability is perfectly homogeneous, the blue best-fit line will be horizontal and lie atop the red line at the y-values of the census measure of product availability. But if product concentration varies with outlet concentration, then the blue best-fit line will have a non-zero slope and cross the red census availability line. A non-zero slope in a blue line is a reliable predictor of bias and tends to correspond with a skewed cone plot even though the concentration heterogeneity plot is calculated from a census of outlets and the bias is demonstrated to be a sampling phenomenon. The non-zero slope in the census-based plot is an indicator of an environment that may yield biased samples.

Fig 6 demonstrates how the mean bias of groups of ten simulated samples varies by pseudo-EA size, by sample size per pseudo-EA, and by heterogeneity of outlet density (Fig 6). Red points in the figure indicate that a batch of ten samples had a mean bias with absolute value ≥ 5%. Note that bias is most common when pseudo-EA size is large and when few outlets are selected per pseudo-EA–represented in the sub-plots near the upper left corner of the figure–both factors that keep the simulated sample size small. Within sub-plots, bias is more likely to be positive at the left side of the plots, where distributions of outlet concentration do not have heavy tails. Conversely, bias is more likely to be negative at the far-right side of the sub-plots, in particular in locations where outlet concentration varies substantially (meaning that a large portion of outlets appear in the outer two-thirds of the concentration distribution).

Fig 6. Sampling bias by EA size, sample size, and heterogeneity of outlet density.

Fig 6

When outlet concentration is more heterogeneous across the study site, estimates tend to be biased negatively; estimates are more likely to be positive biased when outlet concentration heterogeneity is low. In the CM4FP data, product availability tends to be lower in areas where outlets are less dense, producing this pattern. Notable bias is more often observed when EAs are larger, and the number of outlets sampled per pseudo-EA is smaller. This figure pools data from simulations of all product availability indicators and data from all 12 CM4FP study sites and all rounds of data collection.

Our simulation demonstrated that bias is more likely to occur with Eas that are larger; bias greater than 5% (positive or negative) occurred in 30% of simulations in huge pseudo-Eas, and just 2% of simulations in tiny pseudo-Eas.

In our simulations, bias of greater than 5% occurred in 20% of simulations with 3 outlets sampled per pseudo-EA, 10% of simulations with n = 6, 6% of simulations with n = 9, and 4% of simulations with n = 12. Fig 7 summarizes the proportion of simulations where bias greater than 5% from the census count was observed, by country, method, and outlets sampled per pseudo-EA (Fig 7). Bias was most common in estimates of male condoms and OCP, and generally more common in the Kenya data. Estimates were more likely to be biased downward than upward. Across all three countries and all availability indicators, bias decreased with larger numbers of outlets sampled per pseudo-EA.

Fig 7. Proportion of round 1 simulated product availability estimates with bias of greater than 5%.

Fig 7

The proportion of simulated estimates that were biased negatively by 5% or more and positively by 5%, as well as the mean absolute bias, is presented by country for each of the product availability indicators and for four different sampling scenarios (3,6, 9, or 12 outlets per pseudo-EA). Bias greater than 5% in either direction, and mean absolute bias, tend to decrease as sample size per pseudo-EA is increased, with the largest reduction in bias occuring when sample size increased from 3 to 6 outlets per pseudo-EA.

We tested a simple statistical weighting approach using the same simulation approach. For each simulated sample, the outlets selected from a pseudo-EA were assigned a survey weight equal to the total number of outlets in the pseudo-EA, such that outlets selected from pseudo-Eas with a larger total number of outlets are more influential for summary estimates than outlets selected from pseudo-Eas with a smaller total number of outlets. Fig 8 illustrates a reduction in bias when this weighting technique was applied to one of the most biased estimates, that for male condom availability in the small urban site in Kenya, in CM4FP data collection round 2 (Fig 8).

Fig 8. Effect of applying statistical weights in reducing bias in simulated estimates of product availability.

Fig 8

As in Fig 4, each scatter cone plot depicts the census (true) availability measure as a red triangle, along with point estimates of availability from simulated samples that drew 3, 6, 9, and 12 outlets per EA. Simulations vary by sample size per EA and by relative size of pseudo-EA (huge, large, medium, small, smaller, and tiny). In scenarios where simulated samples show systematic bias, the points form an asymmetric shape. When bias of the simulated metric exceeds 5%, a downward-facing arrow denotes the simulated average at the top of the plot. The top panel shows simulation results without statistical weighting, indicating substantial bias is present in most scenarios. The bottom panel shows simulation results in which selected outlets are weighted based on the number of total number of outlets in the pseudo-EA, indicating the weighting approach alleviates bias.

Discussion

We found evidence of potential bias in FP outlet surveys by simulating different sampling schemas and comparing the resulting estimates of FP product availability with the census means. In some cases, we found substantial levels of bias in product availability estimates. We found that heterogeneity in outlet spatial distribution and in product availability were associated with greater likelihood of bias. Differential bias by FP product type may be explained by the differential levels of availability of FP products within each outlet, and differential distributions of FP products across outlets in a geographic area. The following section explores potential mechanisms for the bias seen in the results, before discussing the implications of these findings.

Understanding how bias might arise

We found evidence that bias in indicator estimates based on samples may arise when there is heterogeneity in outlet distribution in the wider areas from which the EA is drawn. This bias may under- or over-estimate the indicator of interest (as compared to its census value).

Notionally, consider two types of Eas within a single study site: outlet-sparse Eas with a few outlets that are all selected into every sample and outlet-dense Eas with so many outlets that different samples include different subsets of outlets. If the average availability of products is substantially higher in outlet-dense Eas than in sparse Eas, then every sample of the entire study site will underestimate the true proportion of outlets that carry the product; the cone plots will be skewed with all sample points falling below (to the left of) the census value. This bias will persist in estimates for a larger region. Conversely, if the product were more available in outlet-sparse Eas than outlet-dense ones, then the samples would systematically overestimate availability. The principle holds even with a continuum of outlet density if there is a correlation between outlet density and product availability.

Fig 9 shows a simplified illustration of how this type of bias may arise, depicting a scenario in which product availability varies at outlets across three hypothetical Eas (Fig 9). When outlets are sampled using area-based sampling under these conditions, estimates are usually biased in comparison to the census results.

Fig 9. Stylized illustration of how bias may arise.

Fig 9

In this stylized scenario, there are 20 outlets spread unequally across three enumeration areas. Product availability is 50% across all 20 outlets, but varies from 0% to 75% by EA. If samples of outlets are drawn from the universe of outlets in the EA, mean availability varies but can be consistently biased.

Implications for outlet surveys that use sampling approaches

Identifying bias in estimates generated from area-based sampling approaches is critical for several reasons. First, if estimates are used for situational analyses or assessment of trends in FP delivery and availability, understanding the magnitude of potential bias and whether bias is differential with respect to characteristics of the sampled areas is critical for interpretation of findings. Differential bias is particularly problematic if it results inappropriate resource allocation or planning due, for example, to systematic underestimation or overestimation of contraceptive availability in certain areas. Because the bias described in this paper can differ for different products within the same CM4FP study site, it can also lead to incorrect conclusions about the relative availability of one product compared to others.

Second, biased area-based sampling estimates would be expected to bias estimated associations between contraceptive supply and demand. Estimating relationships between contraceptive services and individual contraceptive behaviors is an active area of research that attempts to link household-based health surveys with the nearest or nearby health facility assessments [13, 2527]. Existing evidence is mixed, and difficulties accurately measuring the total FP market–particularly in urban settings with active private and non-facility FP markets–have been cited as a potential explanation for these inconsistent associations [25]. Improved approaches to sampling from total FP markets that provide unbiased estimates of the local FP market (including private sector providers and non-facility outlets) will be critical for better understanding how supply-side factors influence contraceptive intentions and behaviors, including method uptake, switching, continuation, and satisfaction.

Limitations

The CM4FP study on which our simulation is based had an urban focus, meaning that conclusions may not be readily generalized to rural contexts, (though the study does include one rural site, in Soroti District, Uganda), where private sector facilities and outlets are less prevalent. In theory, such biases may be present in rural outlet surveys, but we were unable to directly test this hypothesis. Additionally, the CM4FP data came from a limited number of study sites and were not intended to constitute nationally or sub-nationally representative samples. Thus, results evaluated for bias in this paper, such as product availability, should be interpreted as being from 12 distinct study sites spread across three countries, rather than being representative of higher geographical units or levels. Moreover, study sites accounted for a limited geographic area within each setting and results evaluated are not statistically representative of the wider geographical area or urban setting. While not geographically representative, we consider the findings presented in the manuscript to be nevertheless generalizable to other studies that employ sampling approaches similar to those simulated here.

The exclusive focus on the private sector in this analysis is also a limitation. This simulation approach focused on private outlets due to their larger number within CM4FP study sites and especially because of the close parallel with existing area-based sampling of outlets, such as in PMA. Because the PMA sampling approach for public facilities differs, the findings of our simulated approach cannot be applied directly to public sector PMA results, and do not describe the localized total market. For outlet surveys where public sector sampling parallels PMA’s private sector outlet sampling approach, the same biases are possible if public sector spatial heterogeneity and product availability display similar patterns.

Last, the inability to access census Eas across all sites increased the required simulation volume. The sheer volume of simulation output was multiplied six-fold because we did not have census EA boundaries for the study sites and wanted to cover a range of possible EA sizes. It is likely that the census Eas would match different simulation EA sizes at different sites. Fig 6 shows some notable examples of both positive and negative bias in even the smallest Eas when only three outlets are sampled per EA, so although we do not know precisely which EA sizes are most realistic at each site, the potential for bias exists in some of these sites and products at externally valid scales (Fig 6).

Recommendations for outlet surveys

Other studies that employ sampling approaches like those simulated in our analysis might wish to consider the risk of bias that may potentially result. Possible remedies include varying sampling approaches when spatial heterogeneity is known to exist, increasing sample size, and weighting data. In the common case where such a priori data do not exist, study designers will need to choose between using limited resources to increase sample size within Eas and tolerating potential bias in estimates. In addition to increasing sample size per EA where possible, survey designers should consider planning for statistical weighting approaches that may reduce such bias regardless of sample size. We explore potential options for limiting bias in more detail below.

The literature on spatial sampling suggests several techniques that may remedy bias, though some of these techniques may not be operationally efficient. If product availability variance were known a priori, then the number of outlets to be sampled per EA could be “assigned to each subarea according to the area and/or variance proportion,” but this is unlikely to be practical [14]. Another solution suggested by the same authors is that spatial statistics may be used to apply corrections to existing datasets if the factors responsible for bias (such as heterogeneity of outlet spacing combined with heterogeneity of product availability) have been identified and are available for a broader dataset of outlets. Finally, sampling bias can be mitigated by creating a sampling frame of outlets within an enumeration area and applying a spatial sampling technique wherein the geographic spacing of outlets is considered rather than simple random sampling. [14]. Notably, these proposed approaches to spatial sampling require specialized sampling techniques based on spatial data from an outlet census, a process that is both cost- and time-intensive.

We found that the likelihood of substantial bias decreases as EA size decreases and as the number of outlets sampled increases. These findings suggest that decreasing EA size may be one approach to minimizing area-based sampling bias. However, relying on existing population census-derived Eas is common for operational reasons, so varying their size in area-based sampling designs may not be feasible. Another simple method for mitigating against this bias would be to sample more outlets per selected EA. The simulation approach we employed indicates that bias originating from spatial characteristics of outlet location and product availability may be reduced–though not eliminated–in the design phase by drawing a sample of more outlets per EA. For example, in simulations of the availability of 3 or more methods at an outlet, increasing the per-EA outlet sample from 3 to 6 outlets per pseudo-EA decreased the occurrence of bias greater than 5% from 28% of simulations to 13% of simulations. A larger sample size is more important in the presence of spatially heterogeneous outlet distribution and product availability.

It may be possible to mitigate the bias by applying survey weights to availability estimates, by weighting outlets based on the total number of outlets present in the EA or by other criteria. We illustrated this weighting approach in Fig 8. More systematic work to develop optimal weighting techniques for outlet surveys may be required, including weighting approaches that may be relevant for surveys already collected, as well as for planned outlet surveys.

Common area-based outlet sampling approaches may be prone to bias in product availability estimates. When certain conditions of outlet spatial heterogeneity and product availability coincide, for private outlets in urban settings. It is difficult to assess the presence of such factors without a comprehensive census of all outlets in an area, so we cannot directly estimate the prevalence of such bias in published surveys that do not use a census approach. Instead, surveys of healthcare outlets, including FP outlet surveys, should where possible consider implementing measures to mitigate bias, either at the stage of study design, or retrospectively through weighting.

Conclusions

Area-based sampling designs are increasingly being used to estimate complex healthcare markets. We used a simulation approach to compare estimates derived from area-based sampling approaches with estimates from a full census of outlets providing FP products and services. We demonstrated that bias in area-based sampling estimates of contraceptive product availability is common and is more likely to occur when spatial heterogeneity in outlet distribution and product availability are both present. In some areas this bias can be large enough to affect conclusions about the relative availability of different products and could potentially lead to poor policy or programmatic decision-making. Other researchers designing studies to measure FP supply-side indicators through outlet surveys should consider the tradeoff between risk of bias, and the increased resources needed to increase sample sizes or–more likely to be cost-effective–to make other sampling design and analysis changes to limit that risk. We demonstrate that simple techniques employed in survey design and data analysis through weighting may mitigate substantial bias arising from area-based sampling in the presence of spatial heterogeneity. These approaches may present practical ways forward for improving estimates of service availability where it is impractical or infeasible to use design-based sampling approaches.

Supporting information

S1 File. Inclusivity in global research.

(PDF)

S2 File. Pseudo-EA population estimation.

A brief description of population estimates for pseudo-enumeration areas.

(PDF)

Acknowledgments

We would like to thank the study participants for generously sharing their time and information with this project. We are also grateful to the CM4FP Group and data collection teams, supervisors, and field staff for their contributions to data collection and analysis between 2019 and 2021.

CM4FP Group 2019–2021: Dr Jennifer Anyanti; Justin Archer; Dr. Kimberly Ashburn; Henry Bakira; Dr Paul Bouanchaud (Technical Director 2021) pbouanchaud@psi.org; Peter Buyungo; Dr Caitlin Clary; Mark Conlon; Eden Demise; Kevin Duff; Dr Uche Ekhator-Mobayode; Hoda Elmasry; Dr Hildah Essendi; Jordan Freeman; Dr Susannah Gibbs; Risa Griffin; Dr Nathan Heard; Dr Bo Hu; Ashley Jackson; Dr Amanda Kalamar (Technical Director 2018–21); Brett Keller; Irene Kyomuhangi; Baker Lukwago; Dr Peter Macharia; Alison Malmqvist; Harmon Momanyi; Micheal Mugerwa; Doreen Nakimuli; Julius Njogu; Dr Anthony Nwala; Noah Nyende; Daniel Olemo; Jacob Olila; Alyssa Om’Iniabohs; Chinedu Onyezobi; Dale Rhoda; Dr Claire Rothschild; Raymond Songo; Raymond Sudoi; Ekerette Emmanuel Udoh; Dr Nkemdiri Wheatley (Principal Investigator 2018–21); Dr Wei Yang

Data Availability

CM4FP public datasets exclude all latitude/longitude geographic coordinates because in the full data sets, outlets can be linked directly to households, meaning that GPS coordinates for outlets would potentially allow identification of households. CM4FP has instead disclosed time/distance matrices to allow partial replication of geographic analyses. See https://www.psi.org/wp-content/uploads/2021/09/CM4FP-GIS-Methodology.pdf for more details. Data from the CM4FP study are available through the Dataverse repository, using the following links: Kenya: Outlet: https://doi.org/10.7910/DVN/DJXKVA; Household: https://doi.org/10.7910/DVN/TQ1MSE; Time/distance matrices: https://doi.org/10.7910/DVN/AP9XGL Nigeria: Outlet: https://doi.org/10.7910/DVN/G31FHL; Household: https://doi.org/10.7910/DVN/IPXDML; Time/distance matrices: https://doi.org/10.7910/DVN/KADZGA Uganda: Outlet: https://doi.org/10.7910/DVN/1NRFSD; Household: https://doi.org/10.7910/DVN/1NRFSD; Time/distance matrices: https://doi.org/10.7910/DVN/OHQ2IZ Further data requests may be sent to the PSI Research Ethics Board, contact: Kelly O’Keefe, Senior Technical Advisor PSI REB kokeefe@psi.org.

Funding Statement

Supported by Bill & Melinda Gates Foundation (https://www.gatesfoundation.org/) grant number INV-006705 to Population Services International. BMGF provided input into the study design and provided feedback on a draft of the manuscript.

References

  • 1.Ali M, Farron M, Ramachandran Dilip T, Folz R. Assessment of Family Planning Service Availability and Readiness in 10 African Countries. Glob Health Sci Pract. 2018. Oct 3;6(3):473–83. doi: 10.9745/GHSP-D-18-00041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bintabara D, Ntwenya J, Maro II, Kibusi S, Gunda DW, Mpondo BCT. Client satisfaction with family planning services in the area of high unmet need: evidence from Tanzania Service Provision Assessment Survey, 2014–2015. Reprod Health. 2018. Dec;15(1): 127. doi: 10.1186/s12978-018-0566-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kanyangarara M, Sakyi K, Laar A. Availability of integrated family planning services in HIV care and support sites in sub-Saharan Africa: a secondary analysis of national health facility surveys. Reprod Health. 2019. May;16(S1): 60. doi: 10.1186/s12978-019-0713-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.O’Neill K, Takane M, Sheffel A, Abou-Zahr C, Boerma T. Monitoring service delivery for universal health coverage: the Service Availability and Readiness Assessment. Bull World Health Organ. 2013. Dec 1;91(12):923–31. doi: 10.2471/BLT.12.116798 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Chin-Quee DS, Stanback J, Orr T. Family planning provision in pharmacies and drug shops: an urgent prescription. Contraception. 2018. Nov;98(5):379–82. doi: 10.1016/j.contraception.2018.08.013 [DOI] [PubMed] [Google Scholar]
  • 6.ICF. “SPA Methodology”. The DHS Program website. Funded by USAID. https://dhsprogram.com/Methodology/Survey-Types/SPA-Methodology.cfm. [Accessed October 29, 2021]. [Google Scholar]
  • 7.Babazadeh S, Thanel K, Garfinkel D, Riley C, Bertrand J, Shaw B. FPwatch: Facility-based Survey Data for Family Planning Market Analysis in Five FP2020-focus Countries. Studies in Family Planning. 2018. Dec;49(4):385–95. doi: 10.1111/sifp.12077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zimmerman L, Desta S, Yihdego M, Rogers A, Amogne A, Karp C, et al. Protocol for PMA-Ethiopia: A new data source for cross-sectional and longitudinal data of reproductive, maternal, and newborn health. Gates Open Res. 2020. Sep 9; 4:126. doi: 10.12688/gatesopenres.13161.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Zimmerman L, Olson H, Tsui A, Radloff S. PMA2020: Rapid Turn‐Around Survey Data to Monitor Family Planning Service and Practice in Ten Countries. Studies in Family Planning. 2017. Sep;48(3):293–303. doi: 10.1111/sifp.12031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.PMA. PMA Survey Methodology [Internet]. PMA. [cited 2021Oct14]. Available from: https://www.pmadata.org/data/survey-methodology.
  • 11.Makinde OA, Azeez A, Adebayo W. Potential use cases for the development of an electronic health facility registry in Nigeria: Key informant’s perspectives. OJPHI. 2016. Aug 15;8(2): e191. doi: 10.5210/ojphi.v8i2.6350 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Rose-Wood A, Heard N, Thermidor R, Chan J, Joseph F, Lerebours G, et al. Development and use of a master health facility list: Haiti’s experience during the 2010 earthquake response. Glob Health Sci Pract. 2014. Aug;2(3):357–65. doi: 10.9745/GHSP-D-14-00029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Skiles MP, Burgert CR, Curtis SL, Spencer J. Geographically linking population and facility surveys: methodological considerations. Popul Health Metrics. 2013. Dec;11(1): 14. doi: 10.1186/1478-7954-11-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Wang J, Stein A, Gao B, Ge Y. A review of spatial sampling. Spatial Statistics. 2012. Dec; 2:1–14. doi: 10.1016/j.spasta.2012.08.001 [DOI] [Google Scholar]
  • 15.McNew LB, Handel CM. Evaluating species richness: Biased ecological inference results from spatial heterogeneity in detection probabilities. Ecological Applications. 2015. Sep;25(6):1669–80. doi: 10.1890/14-1248.1 [DOI] [PubMed] [Google Scholar]
  • 16.Paciorek CJ. The Importance of Scale for Spatial-Confounding Bias and Precision of Spatial Regression Estimators. Statist Sci. 2010. Feb 1;25(1): 107–125. doi: 10.1214/10-STS326 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Yao J, Agadjanian V. Bypassing health facilities in rural Mozambique: spatial, institutional, and individual determinants. BMC Health Serv Res. 2018. Dec;18(1) 1006. doi: 10.1186/s12913-018-3834-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Yao J, Murray AT, Agadjanian V, Hayford SR. Geographic influences on sexual and reproductive health service utilization in rural Mozambique. Applied Geography. 2012. Mar;32(2):601–7. doi: 10.1016/j.apgeog.2011.07.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Nwakeze NM, Kandala N. The spatial distribution of health establishments in Nigeria. APS. 2011. Dec 30;25(2): 680–696. doi: 10.11564/25-2-251 [DOI] [Google Scholar]
  • 20.Conlon M, Buyungo P, Njogu J, Nwala A, Gibbs S, Wheatley N, et al. Linking Family Planning Market Census Data with Consumer Experiences in Three Countries: The Consumer’s Market for Family Planning Study Protocol and Data. 2021. Gates Open Research, 5. doi: 10.12688/gatesopenres.13441.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Babazadeh S, Lea S, Kayembe P, Akilimali P, Eitmann L, Anglewicz P, et al. Assessing the contraceptive supply environment in Kinshasa, DRC: trend data from PMA2020. Health Policy and Planning. 2018. Mar 1;33(2):155–62. doi: 10.1093/heapol/czx134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Bondarenko M., Kerr D., Sorichetta A., and Tatem A.J. Census/projection-disaggregated gridded population datasets, adjusted to match the corresponding UNPD 2020 estimates, for 51 countries across sub-Saharan Africa using building footprints (Kenya). WorldPop, University of Southampton, UK. 2020. Sept. doi: 10.5258/SOTON/WP00683 [DOI] [Google Scholar]
  • 23.Bondarenko M., Kerr D., Sorichetta A., and Tatem A.J. Census/projection-disaggregated gridded population datasets, adjusted to match the corresponding UNPD 2020 estimates, for 51 countries across sub-Saharan Africa using building footprints (Nigeria). WorldPop, University of Southampton, UK. 2020. Sept. doi: 10.5258/SOTON/WP00683 [DOI] [Google Scholar]
  • 24.Bondarenko M., Kerr D., Sorichetta A., and Tatem A.J. Census/projection-disaggregated gridded population datasets, adjusted to match the corresponding UNPD 2020 estimates, for 51 countries across sub-Saharan Africa using building footprints (Uganda). WorldPop, University of Southampton, UK. 2020. Sept. doi: 10.5258/SOTON/WP00683 [DOI] [Google Scholar]
  • 25.Fruhauf T, Zimmerman L, Kibira SPS, Makumbi F, Gichangi P, Shiferaw S, et al. Measuring family planning quality and its link with contraceptive use in public facilities in Burkina Faso, Ethiopia, Kenya and Uganda. Health Policy and Planning. 2018. Sep 1;33(7):828–39. doi: 10.1093/heapol/czy058 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Babazadeh S, Anglewicz P, Wisniewski JM, Kayembe PK, Hernandez J, Bertrand JT. The influence of health facility-level access measures on modern contraceptive use in Kinshasa, DRC. PLoS ONE. 2020. Jul 23;15(7): e0236018. doi: 10.1371/journal.pone.0236018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Kalamar A, Wheatley N, Conlon M, Essendi H, Njogu J, Nwala A, et al. Linking Contraceptive Users to Outlets: Innovations and Challenges from the Consumer’s Market for Family Planning Study in Kenya, Nigeria, and Uganda. Forthcoming. [Google Scholar]

Decision Letter 0

Manoj Kumar

7 Apr 2022

PONE-D-21-34612Bias in product availability estimates from contraceptive outlet surveys: evidence from the Consumer’s Market for Family Planning (CM4FP) studyPLOS ONE

Dear Dr. Bouanchaud,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Authors are advised to elaborate on causes and the impact of different types of bias. In the discussion section, please mention about why only selective contraceptive methods were chosen for biased sampling estimates. Also please elaborate clearly about why likelihood of substantial bias decreases as EA size decreases and when the number of outlets sampled per EA increases. 

Please submit your revised manuscript by May 22 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Manoj Kumar

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

3. Please include a complete copy of PLOS’ questionnaire on inclusivity in global research in your revised manuscript. Our policy for research in this area aims to improve transparency in the reporting of research performed outside of researchers’ own country or community. The policy applies to researchers who have travelled to a different country to conduct research, research with Indigenous populations or their lands, and research on cultural artefacts. The questionnaire can also be requested at the journal’s discretion for any other submissions, even if these conditions are not met.  Please find more information on the policy and a link to download a blank copy of the questionnaire here: https://journals.plos.org/plosone/s/best-practices-in-research-reporting. Please upload a completed version of your questionnaire as Supporting Information when you resubmit your manuscript.

4. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

5. One of the noted authors is a group or consortium [insert name of group or team]. In addition to naming the author group, please list the individual authors and affiliations within this group in the acknowledgments section of your manuscript. Please also indicate clearly a lead author for this group along with a contact email address.

6. Please include your full ethics statement in the ‘Methods’ section of your manuscript file. In your statement, please include the full name of the IRB or ethics committee who approved or waived your study, as well as whether or not you obtained informed written or verbal consent. If consent was waived for your study, please include this information in your statement as well. 

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Review Comments to the Author

The study examined the bias in product availability estimates from contraceptive outlet surveys in Kenya, Nigeria, and Uganda. They found evidence of bias in estimates of contraceptive availability generated from simulated area-based sampling. Within specific study sites and rounds, they observed biased sampling estimates for several but not all contraceptive method types, with bias more likely to occur in sites with heterogeneity in both spatial distributions of outlets and product availability within outlets. I commend the authors for the work done. Authors are encouraged to consider the following issues.

1. The introduction does not highlight the reason for urban focus. Is there a special reason for limiting it urban? Authors could explain the reasons for the selection of geographics in the introduction section.

Can authors also explain bias in product availability estimates from contraceptive outlet in public sectors? It would be good to understand how bias in product availability estimates from contraceptive outlet has been across the countries in the introduction.

There are different causes for bias including area-based bias. Authors should also dedicate a paragraph stating those causes and the impact of different types of bias.

2. The authors stated that for the purpose of the study, they visited hospitals, medical centers, clinics, health centers, pharmacies, and drug shops/chemists/Patent and Proprietary Medicine Vendors (PPMVs), in the study sites. Authors should be clear whether they visited all facilities for four sites in each country. If it is the case that not all facilities were visited, authors should provide justification for the selected facilities. It is unclear why those sites from Kenya, Nigeria, and Uganda were selected.

3. The authors observed biased sampling estimates for several but not all contraceptive method types. In the discussion, authors should provide better explanation on this finding.

4. The authors demonstrated that the likelihood of substantial bias decreases as EA size decreases and as the number of outlets sampled per EA increases. Author are requested to focus more on this result.

5. The household survey will not allow for sampling from all households within the outlet study area. This may thus introduce sampling bias which would cause the household survey data to not be representative of the full study area. The authors should address this issue.

6. The household survey might not designed to follow the same respondent over the 2-3 survey rounds, the study is limited in its ability to explain individual-level changes in FP behavior and any correlation between these changes in behavior and changes in the FP supply market.

7. The survey does not capture the full breadth of patterns of FP use and outlet choices.

8. Most importantly, the study is failed to produce data or findings that are representative at the regional or national levels.

9. Recall issues of family planning may come into play, which is missing in the manuscript.

10. The 2-3 rounds of data collection implemented quarterly may present a burden for outlet respondents, especially those which stock many FP products and brands. Firstly, it may increase the non-response or attrition rate in subsequent rounds. Secondly, it may cause respondents to provide data that are not valid (e.g., junk data in order to finish an interview more quickly). Both of these issues would affect the validity and reliability of the data. Authors should work more on it.

11. There is no statistically predetermined number of outlets that is sought based on a given level of precision. This is not explained in the manuscript.

12. Inclusion and exclusion criteria of the outlet are missing in the manuscript.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Aug 30;17(8):e0271896. doi: 10.1371/journal.pone.0271896.r002

Author response to Decision Letter 0


2 Jun 2022

REVIEWER’S COMMENTS

Responses to reviewer’s comments

Comments to the Author

General comment:

Authors are advised to elaborate on causes and the impact of different types of bias. In the discussion section, please mention about why only selective contraceptive methods were chosen for biased sampling estimates. Also please elaborate clearly about why likelihood of substantial bias decreases as EA size decreases and when the number of outlets sampled per EA increases.

RESPONSE: Thank you for these thoughtful comments and questions. We have responded to them in turn below, in blue.

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

RESPONSE: Having reviewed the PLOS One guidance on patient privacy and exceptions to sharing materials, our project’s policy around potentially identifying geocoordinates, and our pre-existing project data in the public domain, we would like to request an exemption from making the data used for the analysis presented in this manuscript public. These data consist of GPS coordinates for outlets, which when combined with other project data that we have available through our project website (and stored in a public repository), would permit the identification of study participants.

The decision in CM4FP to make public the HH-outlet distance matrices was a compromise that we took to maximize the availability and usefulness of the project’s geographic data while ensuring that the human subjects involved (in the household surveys, not included in this manuscript) could not be identified from these data. Part of that anonymization included the removal of site location information and attempts to mask outer/inner ring shapes in documentation to avoid the chance of someone being able to superimpose real geographic maps onto regenerated network maps made from the household-outlet distance matrices. The figures presented in our manuscript for PLOS One were a part of that masking effort, as we aimed to render the images unconvertable to real maps of study sites.

The household-outlet distances are already available in our public data (through www.cm4fp.org), and so our primary concern now is to ensure that any further data release would not increase the likelihood of those data becoming identifying for our female respondents.

We believe that releasing the geocoordinates used in this manuscript’s analysis, even with jitters (something we have been exploring in response to the PLOS request), when combined with product availability data and quite precise (+/- 10m) outlet-to-household distances, increases the risk of participant identification unacceptably. We therefore hope that you would consider granting an exception to this data sharing requirement in this case.

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

RESPONSE: Thank you for these responses. We further address each of the specific comments below.

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Review Comments to the Author

The study examined the bias in product availability estimates from contraceptive outlet surveys in Kenya, Nigeria, and Uganda. They found evidence of bias in estimates of contraceptive availability generated from simulated area-based sampling. Within specific study sites and rounds, they observed biased sampling estimates for several but not all contraceptive method types, with bias more likely to occur in sites with heterogeneity in both spatial distributions of outlets and product availability within outlets. I commend the authors for the work done. Authors are encouraged to consider the following issues.

1. The introduction does not highlight the reason for urban focus. Is there a special reason for limiting it urban? Authors could explain the reasons for the selection of geographics in the introduction section.

Can authors also explain bias in product availability estimates from contraceptive outlet in public sectors? It would be good to understand how bias in product availability estimates from contraceptive outlet has been across the countries in the introduction.

RESPONSE: Thank you for this comment. The urban focus of the study reflected the available data from the CM4FP project (which concentrated only on urban/ semi-urban areas with the exception of one site in Uganda). This urban focus was due to the original project objectives of generating supply and demand side data for selected urban and semi-urban sites. More information about the study design has been published elsewhere (https://gatesopenresearch.org/articles/5-176/v1). We have added text to the manuscript (location: page 7) to better reflect the design.

The private sector is the focus of this paper for a few reasons. First, it is relatively under-researched in supply-side market dynamics studies, and yet is playing an increasingly important role in the provision of FP products and services in the countries included here. Second, we wanted to make a contribution to the literature in relation to the design of similar studies, notably the PMA SDP sampling approach. PMA samples up to three private sector outlets per EA which we were able to simulate using the CM4FP datasets. However, PMA’s public sector sampling approach involves visiting the public sector outlets either within, or near to each EA included, to capture facilities at primary, secondary and tertiary levels of the public sector. The CM4FP census included only those public facilities found within the study areas (not those in neighboring areas), and therefore does not allow for a simulation of the PMA approach. We agree that bias in public sector outlets would certainly be interesting and important to examine, but this was outside of the bounds of the present study.

There are different causes for bias including area-based bias. Authors should also dedicate a paragraph stating those causes and the impact of different types of bias.

Thanks for this suggestion. We agree that multiple different causes of bias are possible. As the focus of this manuscript is on sampling bias in contraceptive service delivery point estimates, we believe that the key alternative bias (other than area-based sampling bias) is bias that arises due to incomplete ascertainment of facilities on master facility lists (MFLs). (MFLs are used as the sampling frame for traditional, large-scale supply-side environment surveys, including the Demographic and Health Survey’s Service Provision Assessments and the World Health Organizations’ Service Availability and Readiness Assessments.) MFL-based sampling may be subject to sampling bias if specific facilities (such as unregistered private facilities) are less likely to be captured on MFLs. In addition, MFL-based samples may lack generalizability to non-facility outlets that are not captured at all in the sampling frame, such as pharmacies and drug shops. We have added additional text on page 4 to emphasize the impact of this type of bias.

2. The authors stated that for the purpose of the study, they visited hospitals, medical centers, clinics, health centers, pharmacies, and drug shops/chemists/Patent and Proprietary Medicine Vendors (PPMVs), in the study sites. Authors should be clear whether they visited all facilities for four sites in each country. If it is the case that not all facilities were visited, authors should provide justification for the selected facilities. It is unclear why those sites from Kenya, Nigeria, and Uganda were selected.

RESPONSE: Thanks for this comment. All outlets/facilities/vendors within each study site that stocked FP products/ services (more than just male condoms) were included in the study (pages 7 and 9).

3. The authors observed biased sampling estimates for several but not all contraceptive method types. In the discussion, authors should provide better explanation on this finding.

RESPONSE: Thanks for this question, we have added FP product specific text to the discussion section on page 22.

4. The authors demonstrated that the likelihood of substantial bias decreases as EA size decreases and as the number of outlets sampled per EA increases. Author are requested to focus more on this result.

RESPONSE: Thanks for this request for clarification. To some extent this result is expected, as increasing sample size per EA, from smaller pools of outlets per EA (i.e. smaller EAs) would result in a closer approximation to a full census of the study area. However, we agree that this finding could be emphasized and have added additional text on page 27 to reiterate this finding to contextualize our recommendations for improving area-based sampling approaches.

5. The household survey will not allow for sampling from all households within the outlet study area. This may thus introduce sampling bias which would cause the household survey data to not be representative of the full study area. The authors should address this issue.

RESPONSE: The present manuscript does not include any data from the Household surveys. These data were simply discussed in the methods section of the manuscript for completeness (as they are a part of the overall CM4FP study). We acknowledge that this has the potential to cause confusion and so have made their exclusion from this analysis clearer in the methods section (page 6).

6. The household survey might not designed to follow the same respondent over the 2-3 survey rounds, the study is limited in its ability to explain individual-level changes in FP behavior and any correlation between these changes in behavior and changes in the FP supply market.

RESPONSE: As above, the manuscript focusses only on the CM4FP FP outlet data, in which the same outlets were repeatedly surveyed. The analysis does not consider the household data, nor any interactions between supply and demand sides of the FP market.

7. The survey does not capture the full breadth of patterns of FP use and outlet choices.

RESPONSE: This manuscript focuses exclusively on the FP outlet data, aiming to present novel information about potential bias from samples of outlets. Individual FP user behaviors and preferences are certainly important for our understandings of the FP market, but are beyond the scope of this present analysis.

8. Most importantly, the study is failed to produce data or findings that are representative at the regional or national levels.

RESPONSE: The CM4FP study as originally conceived was a methodological pilot, aiming to test novel approaches to gathering and analyzing FP supply and demand data. The study’s limitations include not providing representativeness at any geographical level beyond the specific study sites. We have expanded the limitations section to more clearly express this issue (page 23). However, while not representative, we hope that the findings presented in the manuscript are nevertheless generalizable to other studies that employ sampling approaches similar to those simulated here.

9. Recall issues of family planning may come into play, which is missing in the manuscript.

RESPONSE: Thank you for this comment. We agree that recall issues are likely to play a role in FP user datasets. The data that this manuscript relies on is from the outlet censuses however, in which FP products were audited by data collectors. As such, we do not believe that recall would be a significant source of bias in this particular case.

10. The 2-3 rounds of data collection implemented quarterly may present a burden for outlet respondents, especially those which stock many FP products and brands. Firstly, it may increase the non-response or attrition rate in subsequent rounds. Secondly, it may cause respondents to provide data that are not valid (e.g., junk data in order to finish an interview more quickly). Both of these issues would affect the validity and reliability of the data. Authors should work more on it.

RESPONSE: Thanks for this comment. The FP product data were collected by data collectors (rather than relying on providers to report these data) in order to avoid the burden that the reviewer rightly identifies. We found outlet attrition to be relatively rare (for example, in the Kenya data, of the total 549 unique non-CHW outlets, 478 appeared in all three rounds, and 523 appeared in at least two of the three rounds. Outlets could be lost from the study, or added to the study between rounds.

11. There is no statistically predetermined number of outlets that is sought based on a given level of precision. This is not explained in the manuscript.

RESPONSE: The CM4FP data on which this manuscript is based consisted of a census of all outlets in a set of geographical areas. The geographical boundaries for the outer ring at each site encompassed one or more contiguous wards (Kenya and Nigeria) or parishes (Uganda), that were completely censused to measure the total market for FP products and services within each ring-fenced area. To determine the geographic boundary of the outer rings, an initial target area was selected to capture a total of 600 outlets across all sites in each country. Because of the exploratory nature of the study and the census approach, the number of outlets included in the study was not statistically predetermined. Rather, the target number of outlets was determined pragmatically to allow for a deep dive into localized family planning markets within the constraints of the available budget and timeframe. Fieldwork was concurrent across study sites within each country, and complete wards or parishes were censused and added to the outer ring sequentially until the target sample size for each country was achieved or exceeded. Once the overall target number of outlets for the country had been achieved, the outlet census was completed in the current ward/parish in each study site, and no further wards/parishes were added. The final number of outlets counted in the census varied across sites, largely due to differences in outlet density within wards or parishes; larger urban area outer rings generally had a greater number of outlets in each ward or parish, and small urban, semi-urban, and rural areas had fewer outlets in each ward or parish.

We have added more detail to the manuscript explaining this approach on page 7

12. Inclusion and exclusion criteria of the outlet are missing in the manuscript.

RESPONSE: Apologies for this omission. Outlets were eligible for inclusion in the census of FP product and service providers if they had stocked at least one modern FP method (aside from male condoms) or offered any FP services during the past three months. Public and private health provider and health retail outlets of all types within the outer ring, including hospitals, health facilities, pharmacies, patent and proprietary medicine vendors (PPMVs), and drug shops, were screened for inclusion. Outlets that served the military but not the general public were excluded, as were general retailers, bars, hotels, and brothels where only condoms are typically available. In the Lagos and Abia sites in Nigeria, a small number of general retailers/supermarkets offered oral contraceptives and/or emergency contraceptives, so these outlet types were screened and included if eligible. In the overall CM4FP study, CHWs were included in the outlet census if they operated in the community as a mobile provider of FP products and not only within brick-and-mortar facilities. Any outlet or CHW from the initial census no longer meeting the inclusion criteria in a subsequent round was excluded from that point forward. However in our analyses, all CHWs were excluded as they did not have a specific geographical location.

We have added further information to the manuscript on page 9.

Attachment

Submitted filename: Responses to reviewers final 220602.docx

Decision Letter 1

Manoj Kumar

11 Jul 2022

Bias in product availability estimates from contraceptive outlet surveys: evidence from the Consumer’s Market for Family Planning (CM4FP) study

PONE-D-21-34612R1

Dear Dr. Bouanchaud,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Manoj Kumar

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: All of the comments were addressed. The manuscript looks scientifically sound. No additional comments from the reviewer.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Dr. Nitai Roy

**********

Acceptance letter

Manoj Kumar

19 Aug 2022

PONE-D-21-34612R1

Bias in product availability estimates from contraceptive outlet surveys: evidence from the Consumer’s Market for Family Planning (CM4FP) study

Dear Dr. Bouanchaud:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Manoj Kumar

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. Inclusivity in global research.

    (PDF)

    S2 File. Pseudo-EA population estimation.

    A brief description of population estimates for pseudo-enumeration areas.

    (PDF)

    Attachment

    Submitted filename: Responses to reviewers final 220602.docx

    Data Availability Statement

    CM4FP public datasets exclude all latitude/longitude geographic coordinates because in the full data sets, outlets can be linked directly to households, meaning that GPS coordinates for outlets would potentially allow identification of households. CM4FP has instead disclosed time/distance matrices to allow partial replication of geographic analyses. See https://www.psi.org/wp-content/uploads/2021/09/CM4FP-GIS-Methodology.pdf for more details. Data from the CM4FP study are available through the Dataverse repository, using the following links: Kenya: Outlet: https://doi.org/10.7910/DVN/DJXKVA; Household: https://doi.org/10.7910/DVN/TQ1MSE; Time/distance matrices: https://doi.org/10.7910/DVN/AP9XGL Nigeria: Outlet: https://doi.org/10.7910/DVN/G31FHL; Household: https://doi.org/10.7910/DVN/IPXDML; Time/distance matrices: https://doi.org/10.7910/DVN/KADZGA Uganda: Outlet: https://doi.org/10.7910/DVN/1NRFSD; Household: https://doi.org/10.7910/DVN/1NRFSD; Time/distance matrices: https://doi.org/10.7910/DVN/OHQ2IZ Further data requests may be sent to the PSI Research Ethics Board, contact: Kelly O’Keefe, Senior Technical Advisor PSI REB kokeefe@psi.org.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES