Skip to main content
Journal of Urban Health : Bulletin of the New York Academy of Medicine logoLink to Journal of Urban Health : Bulletin of the New York Academy of Medicine
. 2024 Sep 10;101(6):1221–1234. doi: 10.1007/s11524-024-00914-3

Virtual Assessment of Physical Activity–Related Built Environment in Soweto, South Africa: What Is the Role of Contextual Familiarity?

Motlatso Godongwana 1,11,, Khulu Gama 1, Vongani Maluleke 1, Lisa K Micklesfield 1, Damilola Odekunle 2,3, Yves Florent Wasnyo 4,5, Boris Elouna 6, Edwin Ngwa 4,5, Adalberto Lopes 7, Muhammad Rabiu Balarabe 8, Eva Coringrato 9, Alice McGushin 10, Tolu Oni 10, Louise Foley 10, Tiago Canelas 10
PMCID: PMC11652452  PMID: 39256287

Abstract

Understanding how urban environments shape physical activity is critical in rapidly urbanizing countries such as South Africa. We assessed the reliability of virtual audits for characterizing urban features related to physical activity in Soweto, South Africa. We used the Microscale Audit of Pedestrian Streetscapes Global tool to characterize pedestrian-related features from Google Street View images in four neighborhoods of Soweto. Neighborhoods were selected to represent different levels of deprivation. Inter-rater reliability was analyzed according to the rater’s familiarity with the local area. The results show a higher inter-rater reliability was observed among auditors with greater contextual familiarity. Many measurements however generated inconclusive results due to either low variability in the raters’ responses or the absence of the features in the streets. It is evident from our findings that virtual audits are efficient tools that can be used to assess the built environment. However, to ensure meaningful use of these tools in diverse settings, we recommend that auditors comprise of people with contextual familiarity.

Supplementary Information

The online version contains supplementary material available at 10.1007/s11524-024-00914-3.

Keywords: Built environment, Physical activity, Contextual familiarity, MAPS-Global, Virtual assessment

Introduction

By 2050, the majority of urban growth is expected to occur in low-income and middle-income countries (LMICs) [1]. South Africa exemplifies this trend, experiencing dramatic urban expansion in recent decades [2]. This pivotal moment of expansion, transformation in urban planning, and informal urban development directly influences the built environment, which is closely linked to residents’ physical activity levels [3, 4]. Understanding these dynamics is crucial for developing cities that promote healthier lifestyles [5].

Globally, one in four adults and four in five adolescents are insufficiently physically active despite the substantial health benefits of physical activity [6, 7]. It was estimated that only 60% of South Africans are meeting the World Health Organization (WHO) recommended standards for PA [8]. Despite the WHO targets to reduce physical inactivity by 15% by 2030 [9] and the ambitious policies to create healthy cities that will increase physical activity, several studies have found that there is a gap between what has been implemented and what is needed to achieve these targets [4, 5]. However, lack of data regarding the built environment in LMICs cities is a barrier for creating healthy environments that support physical activity [10].

Traditionally, the collection of the built environment data has been carried out through field audits whereby assessors walk a predetermined route through a specific area and use an observational form to assess predefined environmental characteristics [10, 11]. The Microscale Audit of Pedestrian Streetscapes Global version (MAPS-Global) is one of the tools used to measure and characterize features of the built environment such as street characteristics, sidewalks, intersections, streets aesthetics, and other design features which may help to explain physical activity variation within a population [12]. However, where these assessments are most relevant and needed, such as highly urbanized African cities, very little evidence exists on the nature of the built environment. Assessing the built environment as a first step is important for measuring the association between features of the built environment and physical activity.

The use of virtual assessment tools has been advocated to reduce the time and resources required for conducting in-person audits [11, 13, 14]. In high-income countries, these tools have been found to be reliable assessments of the built environment [11, 13, 15, 16]. Few researchers have explored the concept of contextual familiarity (living or working in the study area vs outside the study area) when assessing the reliability of the tools [13, 16, 17]. This concept of familiarity becomes paramount when conclusions about the reliability of virtual tools are drawn from studies that do not contemplate African settings [10, 18]. The African setting is unique because of its informality and factors including rapid urbanization. Thus, the aim of this study was to measure the reliability among raters with different levels of familiarity to a highly urbanized African city using the MAPS-Global tool.

Methods

We conducted virtual audits of the built environment in Soweto, South Africa. Soweto is located in Johannesburg and was established in 1931 as a result of spatial segregation laws during the apartheid regime in South Africa. The region is now an urban settlement characterized by varying levels of socioeconomic deprivation with a population of approximately 1.9 million people living in a 200 km2 area [19].

We collected data from four small areas within Soweto: Chiawelo, Diepmeadow, Orlando East, and Protea Glen (Fig. 1). We purposively selected the areas to provide variation in socioeconomic deprivation (two areas of higher deprivation, two of lower deprivation). We determined deprivation level based on a methodology previously outlined by The Healthy Life Trajectories Initiative (HeLTi) enumeration study that collected data in Soweto and local knowledge from the study team. In this study, Protea Glen and Diepkloof were assigned as areas of higher SES (low deprivation) and Chiawelo and Orlando East as lower SES (high deprivation) [20].

Fig. 1.

Fig. 1

Reference map of the small areas audited and their deprivation level

To assess features of the built environment, we used MAPS-Global [12, 13]. The MAPS-Global tool was developed by researchers from the University of California San Diego and validated across countries with varying built environmental characteristics such as Australia, Belgium, Brazil, China, and Spain [21]. The instrument comprises four sections, collected along a predefined route: (1) segment (measures block faces between intersections); (2) crossings (collects information on street intersections); (3) route (evaluates destinations and use, streetscape characteristics and aesthetic and social characteristics from a defined origin to a defined destination); (4) cul-de-sac (assesses amenities in dead ends).

For this study, the routes were chosen using a geographically stratified sampling design. Specifically, the selection process involved three key features: firstly, random households (extracted from the OpenBuildings dataset [22]) were utilized as starting points; secondly, local points of interest (POIs) identified by the local team served as endpoints; and thirdly, the street network (from OpenStreetMaps [23]) functioned as the connecting routes between these starting and ending points. Routes had a length between 400 and 700 m. For each small area, sampled routes covered 25% of the total street network, which was considered to give an adequate representation of the built environment in that small area [24]. Data collection was conducted using the Google Street View (GSV) functionality within Google Earth Pro, where the designated routes were uploaded.

The virtual audits were conducted between April and May 2023, and data collection took place in two phases. The auditing team consisted of 10 researchers collaborating with the Global Diet and Physical Activity (GDAR) network from five different countries (South Africa, Nigeria, Cameroon, the United States, and the UK). There were three categories of auditors, seven with no experience of the Soweto context (none), three who worked in the Soweto area (context), and two auditors from Soweto who had conducted field audits on the same streets 9 months prior to the virtual assessment (field). The two field auditors are also within the context group.

All auditors participated in an online training session to standardize the data collection methodology using the MAPS-Global material [25]. Subsequently, each auditor was tasked to assess seven routes in phase one, and three routes in phase two. All auditors assessed the same set of routes.

Data entry was completed in REDCap, both via its online platform and mobile application. REDCap’s functionality also enabled the upload of precise counts of segments and crossings (which varied by route), which were required for conducting the intraclass correlation coefficient (ICC) analysis (using R version 3.x). This study expands on the preliminary GDAR research assessing the built and food environments in four African cities (unpublished data), for which five items from MAPS-Global were incorporated into a different assessment of the food environment. Therefore, these five items were not scored in the current study. The list of items used in the current study can be found in Supplementary Table 1.

The inter-rater reliability of MAPS-Global was measured on several single-item indicators, subscales, valence scores (composite of positive or negative), and overall scores as described in Millstein et al. [26]. Numerical data was assessed with the ICC measurement and Cohen’s kappa coefficients for categorical data using the package “pysch” in R version 3.x. For this study, the ICC and Cohen’s kappa were classified to indicate inter-rater reliability that was “excellent” (ICC ≥ 0.75), “good” (0.60–0.74), “fair” (0.40–0.59), and “poor” (< 0.40) [27]. If the absence of features in the subscales, valence, and overall scores creation was higher than 80% (i.e., places of worship, private recreational facilities, etc.), we excluded it from the analysis as there would not be enough variability for a correct interpretation of the ICC.

Results

The Feasibility and Operational Practicality of Virtual Assessments in Soweto

We encountered two significant challenges in adhering to the MAPS-Global auditing procedures in phase 1 of data collection. Firstly, the coverage of several areas by GSV was incomplete, with GSV coverage for routes as low as 14.3%, limiting our ability to complete the audits in some streets. Secondly, inconsistencies in the number of segments and crossings recorded by different auditors hindered our ability to make comparisons. These challenges contributed to fluctuations in assessment times and inter-rater reliability.

To address these issues in phase 2 of the data collection, we excluded routes with less than 75% GSV coverage. The challenge of discrepancies in the number of segments and crossings was partly derived from Soweto’s street layouts, which differ from those assumed in MAPS-Global procedures. For example, some streets lack sidewalks, complicating the determination of safe pedestrian passages and crossing points. Acknowledging this limitation, the trainers standardized the number of segments and crossings for each route prior to audit, enabling consistent comparisons between auditors.

In addition to the initial challenges, we also encountered significant network issues, as the auditors’ limited broadband access delayed the auditing process. During the debriefing following the second phase of data collection, some auditors revealed they used Google searches to confirm or identify elements that were difficult to discern in the GSV images, particularly concerning land use, such as the presence of amenities along the route or the type of business present.

In the second phase, a total of three routes, 19 segments, and 16 crossings were analyzed virtually by all auditors. Most of the images used were approximately 1 year old, although some were as old as 10 years. The second phase of data collection showed marked improvement from the first phase, with GSV coverage rates reaching almost full coverage (96% vs. 76% in the first phase) for all routes. The data collection process was significantly more efficient, with the mean assessment time for routes at 7.7 ± 6 min (compared with 12.3 ± 11.6 min in the first phase). Furthermore, the average time to assess segments and crossings was reduced to 4.1 ± 2.2 and 1.1 ± 0.1 min, respectively (compared with 4.9 ± 6.3 min and 2 ± 2.6 min in the first phase) indicating a more consistent and streamlined auditing procedure. Data entries where image date coincided with the collection day were excluded from the analysis as they were considered a methodological error by auditors. Cul-de-sacs were not included in the analysis as there was none on the routes audited. Detailed descriptions of each route are delineated in Supplementary Table 2.

The Influence of Familiarity in the IRR

Overall, we found that contextual familiarity was associated with greater inter-rater reliability of virtual audits in Soweto. Figure 2 shows that for almost all the subscales, valence, and overall scores, inter-rater reliability was higher when the online auditors were familiar with the context. We calculated measurements for only 30 out of 41 items or subscales, adhering to the criterion that required more than 80% presence for calculation.

Fig. 2.

Fig. 2

Inter-rater reliability for virtual MAPS-Global assessments in Soweto

Routes

Detailed results for route reliability subscales and valence scores are presented in Table 1. In overall, for the route section, reliability was markedly lower compared with segment and crossing, with no clear pattern by familiarity.

Table 1.

MAPS-Global—route section item-level and subscale inter-rater reliability and descriptive statistics

Route section—variable description # items (range of scores) Null count (%) Mean (S.D) ICC/Kappa CI (95%) Sample items and overall subscale description
Positive destinations and land use
 Institutional-service 3 (0–15) 12.5% f: 4.33 (2.25) 0.53 (− 0.85 to 0.98) Bank, health-related professional, other service
c: 4 (1.87) 0.31 (− 0.35 to 0.97)
n: 2.73 (2.05) 0.73 (0.21–0.99)
 Private recreation 2 (0–10) 100% f: 0 NA Private indoor, private outdoor facility
c: 0
n: 0
 Public recreation 4 (0–20) 83.3% f: 0.17 (0.41) NA Public indoor, public outdoor facility, park, trail
c: 0.22 (0.44)
n: 0.13 (0.35)
 Residential mix 4 (0–3) 0.0% f: 1.5 (0.55) 0* Single family, multi-family, mixed, apartment over retail
c: 1.33 (0.5)  − 0.5*
n: 1 (0) 1.00*
 Restaurant-entertainment 4 (0–20) 16.7% f: 2.67 (1.63) 0.75 (− 0.70 to 0.99) Fast food, sit-down, café, entertainment
c: 2.67 (2.65) 0.17 (− 0.39 to 0.95)
n: 2.27 (1.75) 0.83 (0.39–1.00)
 School 1 (0–5) 75% f: 0.33 (0.52) 1.00 (1.00–1.00) School land use
c: 0.22 (0.44) 0.50 (− 0.18 to 0.98)
n: 0.27 (046) 0.33 (− 0.09 to 0.96)
 Shops 8 (0–28) 0.0% f: 4.83 (2.04) 0.97 (0.22–1.00) Grocery, convenience store, bakery, drugstore, other retail, shopping mall, strip mall, open-air market
c: 5.11 (1.83) 0.54 (− 0.24 to 0.98)
n: 4.27 (2.09) 0.35 (− 0.09 to 0.97)
 Worship 1 (0–5) 100% f: 0 NA Place of worship
c: 0
n: 0
Negative destinations and land use
 Age-restricted bar or nightclub 1 (0–5) 100% f: 0 NA Age-restricted bar or nightclub
c: 0
n: 0
 Liquor or alcohol store 1 (0–5) 33.3% f: 0.67 (0.52) 1.00 (1.00–1.00) Liquor or alcohol store
c: 0.67 (0.50) 1.00 (1.00–1.00)
n: 0.73 (0.59) 0.86 (0.50–1.00)
Valence and overall scores
 Positive DLU 28 (0–111) 0.0% f: 11.9 (5.12) 0.90 (− 0.33 to 1.00) Sum of the positive DLU subscales
c: 15.2 (3.93) 0.85 (0.20–1.00)
n: 15.7 (3.20) 0.72 (0.21 – 0.99)
 Negative DLU 2 (0–10) 33.3% f: 0.67 (0.52) 1.00 (1.00–1.00) Sum of the negative DLU subscales
c: 0.67 (0.50) 1.00 (1.00–1.00)
n: 0.73 (0.59) 0.86 (0.50–1.00)
 Overall DLU 30 0.0% f: 15 (2.97) 0.88 (− 0.42 to 1.00) Positive DLU—negative DLU
c: 14.6 (3.75) 0.83 (0.33–1.00)
n: 11.2 (4.74) 0.64 (0.11–0.99)
Streetscape characteristics
 Positive streetscape 25 (0–22) 20.8% f: 2.83 (2.86) 0.40 (− 0.89–0.98) Transit, traffic calming, trash bins, benches, bike racks, bike lockers, bike docking stations, kiosks, hawkers
c: 2.67 (2.55)  − 0.43 (− 0.49 to 0.04)
n: 2.07 (1.87) 0.15 (− 0.16 to 0.94)
Aesthetics and social characteristics
 Positive aesthetics / social 4 (0–4) 54.2% f: 1 (1.26) 0 (− 0.95 to 0.95) Hardscape, water, softscape, landscaping
c: 0.67 (1.12) 0 (− 0.43 to 0.93)
n: 0.60 (0.63) 0 (− 0.2 to 0.88)
 Negative aesthetics / social 6 (0–5) 8.3% f: 0.67 (0.52) 0 (− 0.95 to 0.95) Buildings not maintained, graffiti, litter, dog fouling, physical disorder, highway near
c: 0.89 (0.60)  − 0.25 (− 0.47 to 0.83)
n: 1.33 (0.49)  − 0.17 (− 0.21 to 0.64)
 Overall aesthetics / social 10 25% f: 0.33 (1.75) 0 (− 0.95 to 0.95) Positive aesthetics/social—negative aesthetics/social
c: − 0.22 (1.64)  − 0.06 (− 0.44 to 0.91)
n: − 0.73 (0.88)  − 0.08 (− 0.22 to 0.83)

Familiarity: f: field, c: context, n: none. Agreement: “excellent” (ICC ≥ 0.75), “good” (0.60–0.74), “fair” (0.40–0.59), and “poor” (< 0.40). DLU, destinations and land use; ICC, intraclass correlation coefficient; CI, confidence interval; SD, standard deviation; NA, not applicable. *Indicates the analysis was Cohen’s Kappa

Destinations and Land Use

We evaluated five out of eight positive subscales and single items, because three (place of worship, public and private recreation) had over 80% zeros. Interestingly, in the destinations and land use section, unfamiliarity with the local context was associated with higher reliability, in contrast with the valence and overall scores. Notably high agreement between auditors who were not familiar with the context was observed in the assessment of residential mix (ICC = 1.00, 95% CI [1.00, 1.00]), and restaurants and entertainment (ICC = 0.83, 95% CI [0.39, 1.00]), and institutional services (ICC = 0.73, 95% CI [0.21, 0.99]). Conversely, agreement on the numbers of schools (ICC = 1.00, 95% CI [1.00, 1.00]) and shops (ICC = 0.97, 95% CI [0.22, 1.00]) was higher in field-experienced auditors compared to the other familiarity groups. Among negative subscales, none of the raters identified any age-restricted bar or nightclub. The presence of liquor or alcohol stores yielded near-perfect agreement between all familiarity groups, with the field and context team achieving a perfect score. Upon measuring positive and negative valences along with the overall scores, the influence of familiarity became evident, as the field group exhibited higher ICC values than the other groups.

Streetscape Characteristics

The streetscape’s positive subscale revealed no cycling infrastructure across audited routes. Notably, the mean count for all familiarity groups was below three streetscape features per route (out of a maximum of 22) noting a very low presence of amenities in the surveyed routes. The ICC for all familiarity groups was poor, or with negative values, implying that any agreement among raters was lower than what would be expected by chance alone.

Aesthetics and Social

Both the positive and negative subscales, as well as the overall aesthetics and social scale, demonstrated a lack of consensus among auditors, unaffected by their familiarity with the area. Notably, no measurements exceeded 0, suggesting either random variations in ratings or lower agreement than by chance.

Segments

Detailed results for segment reliability subscales, valence, and overall scores are presented in Table 2. Two categories, cycling infrastructure and informal path or shortcut, had over 80% zeros, indicating a lack of these features in the audited areas. Familiarity with the local context variably influenced agreement levels for the different segment subscales. The field group showed higher agreement between the auditors in both the positive (ICC = 0.85, 95% CI [0.64, 0.94]) and negative (ICC = 0.85, 95% CI [0.65, 0.94]) valence scores, as well as the overall score. All the subscales with exception of the buffer had the field or context group achieving the highest reliability.

Table 2.

MAPS-Global—route section item-level and subscale inter-rater reliability and descriptive statistics

Segments section—variable description # items (range of scores) Null count (%) Mean (S.D) ICC/Kappa CI (95%) Sample items and overall subscale description
Positive segment subscales
 Bicycle infrastructure 3 (0–15) 100% f: 0 NA Bank, health-related professional, other service
c: 0
n: 0
 Buffer 2 (0–5) 46.10% f: 1.95 (0.32) 0 (− 0.44 to 0.44) Parking along street, buffer
c: 1.95 (0.32) 0.05 (− 0.18 to 0.37)
n: 0.96 (1.41) 0.35 (0.16–0.60)
 Building aesthetics and design 1 (0–2) 41.30% f: 1.63 (0.75) 0.52 (0.10–0.78) Street windows
c: 1.39 (0.86) 0.43 (0.15–0.70)
n: 0.63 (0.73) 0.40 (0.19–0.64)
 Building height-road width ratio 5 (0–3) 4.70% f: 2.81 (0.7) 1.00 (1.00–1.00) Building height, setback and road width
c: 2.87 (0.58) 0.50 (0.23–0.74)
n: 2.45 (0.86) 0.26 (0.08–0.52)
 Building height-setback 4 (0–10) 4.61% f: 3.34 (1.36) 0.47 (0.03–0.75) Building height, smallest and largest setback
c: 3.74 (1.41) 0.46 (0.18–0.72)
n: 3.41 (1.82) 0.31 (0.18–0.72)
 Hawkers/shops 1 (0–2) 71.60% f: 0.27 (0.45) 0.23 (− 0.24 to 0.61)* Hawkers/shops on sidewalk/pedestrian zone
c: 0.43 (0.57) 0.31 (0.03–0.61)*
n: 0.22 (0.42) 0.10 (− 0.04 to 0.34)*
 Informal path or shortcut 1 (0–1) 85.20% f: 0.13 (0.34) NA Informal path connecting to something else
c: 0.11 (0.31)
n: 0.17 (0.38)
 Pedestrian infrastructure 5 (0–5) 44.70% f: 0.42 (0.55) 0.48 (0.05–0.76) Mid-segment crossing, pedestrian bridge, covered place to walk, street lights
c: 0.42 (0.53) 0.63 (0.38–0.82)
n: 0.67 (0.53) 0.40 (0.19–0.64)
 Shade 3 (0–6) 36.80% f: 0.61 (0.64) 0.56 (0.16–0.80) Number of trees, sidewalk coverage, shade
c: 0.63 (0.62) 0.59 (0.34–0.80)
n: 0.74 (0.64) 0.58 (0.38–0.78)
 Sidewalk 2 (0–6) 2.63% f: 4.18 (0.98) 0.46 (0.03–0.75) Sidewalk presence and width
c: 3.98 (1.17) 0.29 (0.02–0.60)
n: 3.88 (0.94) 0.42 (0.22–0.66)
Valence and overall scores
 Positive segment 27 (0–45) 0.00% f: 15.3 (2.97) 0.85 (0.64–0.94) Sum of the positive segment subscales
c: 14.9 (2.99) 0.75 (0.54–0.88)
n: 13.1 (4.14) 0.51 (0.31–0.73)
 Negative segment 7 (0–13) 0.00% f: 5.24 (1.82) 0.85 (0.65–0.94) Sum of the negative segment single items (non-continuous sidewalk, trip hazards, obstructions, cars blocking walkway, slope, gates, driveways)
c: 5.09 (1.71) 0.65 (0.41–0.83)
n: 4.2 (1.43) 0.48 (0.27–0.66)
 Overall segment 34 0.00% f: 10 (3.05) 0.76 (0.48–0.90) Positive segment—negative segment
c: 9.84 (3.17) 0.67 (0.44–0.84)
n: 8.89 (4.3) 0.59 (0.39–0.78)

Familiarity: f: field, c: context, n: none. Agreement: “excellent” (ICC ≥ 0.75), “good” (0.60–0.74), “fair” (0.40–0.59), and “poor”’ (< 0.40). DLU, destinations and land use; ICC, intraclass correlation coefficient; CI, confidence interval; SD, standard deviation; NA, not applicable. *Indicates the analysis was Cohen’s Kappa

Crossings

Detailed results for crossing reliability subscales, valence, and overall scores are presented in Table 3. The only positive crossing subscale that did not have more than 80% zeros was the intersection control and signage subscale and this still only reported the mean number of features as < 1/ crossing. Notably, crosswalk amenities, which are crucial for safe road crossing, had 90% zeros. In the negative subscale assessing road width, the group unfamiliar with the context showed no agreement, while those with context knowledge scored fair to excellent agreement. In the overall score, only the field auditors reached an excellent score and the other groups a fair reliability.

Table 3.

MAPS-Global—route section item-level and subscale inter-rater reliability and descriptive statistics

Crossing section—variable description # items (range of scores) Null count (%) Mean (S.D) ICC/Kappa CI (95%) Sample items and overall subscale description
Positive crossing subscales
 Bicycle feature 3 (0–3) 99.2% f: 0 NA Waiting area, bike lane crossing the crossing, bike signal
c: 0.01
n: 0
 Crosswalk amenities 7 (0–7) 90.6% f: 0 NA Crossing aids, marked crosswalk, high visibility striping, different material, curb extension, raised crosswalk, refuge islands
c: 0.09 (0.33)
n: 0.13 (0.33)
 Curb quality and presence 3 (0–5) 88.30% f: 0.13 (0.55) NA Curb presence, curb ramps lined up, tactile paving
c: 0.34 (1.11)
n: 0.42 (1.11)
 Intersection control and signage 7 (0–8) 39.80% f: 0.63 (0.49) 1.00 (1.00–1.00) Yield signs, stop signs, traffic signal, traffic circle, pedestrian walk signals, push buttons, countdown signal
c: 0.61 (0.53) 0.64 (0.37–0.84)
n: 0.63 (0.49) 0.57 (0.35–0.79)
 Overpass 1 (0–1) 100% f: 0 NA Crossing on pedestrian overpass, bridge
c: 0
n: 0
Negative crossing subscales
 Road width 1 (0–2) 60.90% f: 0.06 (0.25) 1.00 (1.00–1.00) Distance of crossing leg
c: 0.61 (0.52) 0.50 (0.20–0.76)
n: 0.04 (0.20) 0.00 (− 0.12 to 0.23)
Valence and overall scores
 Positive crossing 21 (0–24) 39.80% f: 0.63 (0.49) 1.00 (1.00–1.00) Sum of the positive crossing subscales
c: 0.61 (0.53) 0.64 (0.37–0.84)
n: 0.63 (0.49) 0.57 (0.35–0.79)
 Negative crossing 1 (0–2) 60.90% f: 0.06 (0.25) 1.00 (1.00–1.00) Sum of the negative crossing subscales
c: 0.61 (0.52) 0.50 (0.20–0.76)
n: 0.04 (0.20) 0 (− 0.12 to 0.23)
 Overall 22 39.80% f: 0.56 (0.5) 1.00 (1.00–1.00) Positive crossing—negative crossing
c: 0.58 (0.5) 0.58 (0.28–0.81)
n: 0.0 (0.68) 0.55 (0.32–0.77)

Familiarity: f: field, c: context, n: none. Agreement: “excellent” (ICC ≥ 0.75), “good” (0.60–0.74), “fair” (0.40–0.59), and “poor” (< 0.40). DLU, destinations and land use; ICC, intraclass correlation coefficient; CI, confidence interval; SD, standard deviation; NA, not applicable. *Indicates the analysis was Cohen’s Kappa

Grand Score Reliability

Detailed results for the positive, negative, and final overall score are presented in Table 4. Across all three scales, we observe a familiarity gradient, with the field group having higher reliability than the other groups. However, the mean scores among rater familiarity groups are notably similar, indicating that the presence or degree of context familiarity does not markedly distinguish these groups.

Table 4.

MAPS-Global—route section item-level and subscale inter-rater reliability and descriptive statistics

Overall section—variable description # items (range of scores) Null count (%) Mean (S.D) ICC/Kappa CI (95%) Sample items and overall subscale description
 Overall positive 101 (0–205) 0.0% f: 35 (3.63) 0.95 (0.02–1.00) Positive DLU, positive streetscape, positive aesthetics/social, positive segment (mean of all segments), positive crossing (mean of all segments)
c: 33.6 (4.41) 0.67 (− 0.12 to 0.99)
n: 27.8 (7.85) 0.83 (0.39–1.00)
 Overall negative 16 (0–22) 0.0% f: 7.01 (2.09) 0.98 (0.36–1.00) Negative DLU, negative aesthetics/social, negative segment (mean of all segments), negative crossing (mean of all crossings)
c: 6.96 (1.66) 0.52 (− 0.25 to 0.98)
n: 7.03 (0.95) 0.76 (0.26–0.99)
 Overall 113 0.0% f: 28 (4.28) 0.95 (0.00–1.00) Overall Positive – Overall Negative
c: 26.7 (4.71) 0.89 (0.31–1.00)
n: 20.8 (7.96) 0.84 (0.41–1.00)

Familiarity: f: field, c: context, n: none. Agreement: “excellent” (ICC ≥ 0.75), “good” (0.60–0.74), “fair” (0.40–0.59), and “poor” (< 0.40). DLU, destinations and land use; ICC, intraclass correlation coefficient; CI, confidence interval; SD, standard deviation; NA, not applicable. *Indicates the analysis was Cohen’s Kappa

Discussion

To our knowledge, this is the first study assessing the inter-rater reliability of MAPS-Global in an African urban context, and our findings highlight the importance of local knowledge in applying research tools effectively. Our findings suggest that auditors with local familiarity yielded more reliable audits compared to their international peers. Despite the global accessibility of virtual platforms like Google Street View and Google Earth for environmental assessment, our results underscore the value of contextual familiarity in enhancing the meaningful application and rigor of research tools.

Incorporating contextual familiarity in global health research is crucial and at the same time an ethical responsibility when the tools we use have not been validated in the contexts where we work [28]. This practice risks oversimplifying complex realities and may lead to wrong or misleading conclusions. Rzotkiewicz et al. (2018) highlighted this gap, noting the absence of studies using Google Street View in Africa and limited research in Latin America and Asia, challenging the assumption of universal applicability for virtual audits.

This study contributes from an African setting to the limited and inconclusive research on rater familiarity in evaluating the built environment. Two studies, one utilizing the MAPS-Global in Belgium and the other applying S-VAT tool in Norway, demonstrated that auditors with greater contextual familiarity or those conducting audits in-person reported higher inter-rater reliability [15, 17]. On the other hand, Fox et al. (2021) using MAPS-Global in five HICs countries and Zhu et al. (2017) using MAPS-Global in the US suggested that familiarity does not significantly affect virtual audit outcomes. Most studies that have used virtual tools to characterize the built environment have been carried out in HICs with different environmental characteristics compared to LMICs [13, 1517, 29, 30]. Their studies present findings from well-planned cities making it difficult to draw similar conclusions to a highly urbanized and dynamic environment of the Soweto township.

Not only are LMICs underrepresented in virtual auditing, but there are also acute spatial inequalities in the amount of GSV coverage within cities depending on deprivation levels [31]. Several authors have stated that virtual audits are a reliable alternative to in-person street audits but with a caveat that there is the need for high coverage and updated images [13, 17]. Our study tackled the variability in coverage by selecting routes with at least 75% visibility. However, assessing the recency of GSV images posed a challenge, as image dates can vary widely even within the same location, depending on the viewing angle. Incorporating insights from a local team regarding acceptable image year ranges can significantly enhance the relevance of urban assessments, especially in LMICs where rapid urbanization is prevalent (Ritchie et al., 2024; UN-Habitat, 2022). This dialogue with the local auditors is crucial due to the continuous and fast-paced urban changes, underscoring the necessity for up-to-date imagery in environmental audits of the built environment.

Our auditors faced several challenges, including issues with image quality, outdated images, blurriness, and obstructions, similar to studies elsewhere [13, 15, 18]. An unexpected challenge emerged from feedback sessions: some resorted to using search engines to identify unclear elements in images. This practice potentially introduced inconsistencies in the auditing process, emphasizing the need for clearer guidelines in the training to ensure uniformity in virtual environmental assessments [13, 32, 33]. Although it is not unusual to find a high percentage of absence in some features of the microscale [11, 13], we found our study to lack many of the features of MAPS-Global. This highlights the lack of many essential amenities in these low-resourced settings but also raises the concern whether MAPS-Global was indeed the correct tool for our study site. The choice of MAPS-Global was made collectively by the GDAR Network members [34]. To enhance representativity, future studies using global audit tools should consider the differences within and between neighborhoods, regions, and countries. It is important to note that developmental patterns, urbanization levels, land uses, and socioeconomic statuses of residents have different definitions, interpretations, and representations across the world.

Similar to other virtual audits of the built environment, the subjective features of the built environment such as the streets or building aesthetics had the lowest IRR across all the MAPS-Global measurements [13, 15, 16, 33]. Our findings indicated inter-rater reliability was highest in the land use section of the routes, similar to results from Zhu et al. (2017). The high frequency of null responses in the crossing section (indicating a lack of infrastructure to facilitate road crossing) is notable in a country where pedestrians constitute almost 40% of road traffic fatalities [35]. Additionally, a study in a low-income community in South Africa showed that half of the children walking to school alone report experiences with pedestrian collisions [36].

We acknowledge the limitations of our study posed by a small sample size, due to a combination of challenges, such as image coverage and limited time resources. While certain measurements, such as the pedestrian buffers, showed high percentage of agreement across all auditors, the statistical measures of reliability, such as ICC or Kappa, indicated lower values. This discrepancy stems from limited variability in exposure, where we obtained low ICC or Kappa despite a high percent agreement [16, 37]. However, the adoption of virtual auditing markedly decreased the costs associated with conducting audits, offering a more economical alternative to traditional in-field methods. The auditors categorized as field familiarity for this study conducted in-field audits in the same area nine months prior, and the virtual audits were completed faster (unpublished data). Additionally, while in-field audits necessitated pairs of auditors for safety reasons, virtual audits allowed individuals to work solo, providing flexibility and the comfort of conducting audits from any location.

Conclusion

We advocate for the thoughtful application of virtual audits in highly urbanized African cities. Utilizing raters familiar with the local context will help to ensure the benefits of virtual audits, including efficiency, resource allocation, and safety, are realized. However, key factors need to be considered including image coverage, the recency of images, dynamic, and the suitability of global tools to capture local environments. Careful evaluation of these aspects would ensure that auditors are well placed to conduct accurate and effective virtual audits.

Supplementary Information

Below is the link to the electronic supplementary material.

Acknowledgements

This study was funded by the National Institute for Health Research (NIHR) (NIHR133205) using the UK aid from the UK Government to support global health research. The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care.

Author Contribution

MG and TC wrote the manuscript. TC analyzed the data and interpreted the results. LM and LF reviewed the manuscript. KG, VM, DO, YF, BE, EN, AL, MRB, EC, AM, and TO contributed by reading and approved the final manuscript.

Funding

Open access funding provided by University of the Witwatersrand.

Data Availability

The datasets during and/or analyzed during the current study are available from the corresponding author on reasonable request.

Declarations

Ethical Approval

An ethics waiver was received from the University of the Witwatersrand Institutional Ethics Review Board (HRECNMW22/06/08) as the research study did not include any participants.

Consent for Publication

Not applicable.

Footnotes

Key Messages:

1. Familiarity with the local context improves the reliability of virtual audits of the built environment.

2. Current “global” auditing tools might not be entirely suitable for African cities; hence, contextual knowledge, local expertise, and tailored training are necessary for effective virtual auditing of built environments.

3. Thoughtful and transparent application of global tools with locally specific adaptations is essential.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.UN-Habitat. Envisaging the future of cities. World Cities Report 2022. Nairobi, Kenya: UN Habitat; 2022. p. 422. Available from: https://unhabitat.org/sites/default/files/2022/06/wcr_2022.pdf. Accessed 4 Mar 2024.
  • 2.Ritchie H, Samborska V, Roser M. Urbanization. Our world data. 2024. Available from: https://ourworldindata.org/urbanization.  Accessed 4 Mar 2024.
  • 3.Bauman AE, Reis RS, Sallis JF, Wells JC, Loos RJ, Martin BW. Correlates of physical activity: why are some people physically active and others not? The Lancet. 2012;380(9838):258–71. [DOI] [PubMed] [Google Scholar]
  • 4.Cerin E, Sallis JF, Salvo D, Hinckson E, Conway TL, Owen N, Van Dyck D, Lowe M, Higgs C, Moudon AV, Adams MA, Cain KL, Christiansen LB, Davey R, Dygrýn J, Frank LD, Reis R, Sarmiento OL, Adlakha D, Boeing G, Liu S, Giles-Corti B. Determining thresholds for spatial urban design and transport features that support walking to create healthy and sustainable cities: findings from the IPEN Adult study. Lancet Glob Health. 2022;10(6):e895-906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lowe M, Adlakha D, Sallis JF, Salvo D, Cerin E, Moudon AV, Higgs C, Hinckson E, Arundel J, Boeing G, Liu S, Mansour P, Gebel K, Puig-Ribera A, Mishra PB, Bozovic T, Carson J, Dygrýn J, Florindo AA, Ho TP, Hook H, Hunter RF, Lai PC, Molina-García J, Nitvimol K, Oyeyemi AL, Ramos CDG, Resendiz E, Troelsen J, Witlox F, Giles-Corti B. City planning policies to support health and sustainability: an international comparison of policy indicators for 25 cities. Lancet Glob Health. 2022;10(6):e882–94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Guthold R, Stevens GA, Riley LM, Bull FC. Worldwide trends in insufficient physical activity from 2001 to 2016: a pooled analysis of 358 population-based surveys with 1·9 million participants. Lancet Glob Health. 2018;6(10):e1077–86. [DOI] [PubMed] [Google Scholar]
  • 7.Guthold R, Stevens GA, Riley LM, Bull FC. Global trends in insufficient physical activity among adolescents: a pooled analysis of 298 population-based surveys with 1·6 million participants. Lancet Child Adolesc Health. 2020;4(1):23–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Basu D, Kibachio Mwangi J, Mahura L, Nemkula B, Ngako K, Ntsie R, Pinkney-Atkinson V, Sayed B, Setlhare I, Singh S, Tucker J. National strategic plan for the prevention and control of non-communicable diseases, 2022 – 2027. South Africa: Departmen of Health; 2022 p. 72. Available from: https://www.sancda.org.za/wp-content/uploads/2022/05/NCDs-NSP-SA-2022-2027.pdf. Accessed 9 Aug 2024.
  • 9.Bull FC, Al-Ansari SS, Biddle S, Borodulin K, Buman MP, Cardon G, Carty C, Chaput JP, Chastin S, Chou R, Dempsey PC, DiPietro L, Ekelund U, Firth J, Friedenreich CM, Garcia L, Gichu M, Jago R, Katzmarzyk PT, Lambert E, Leitzmann M, Milton K, Ortega FB, Ranasinghe C, Stamatakis E, Tiedemann A, Troiano RP, Van Der Ploeg HP, Wari V, Willumsen JF. World Health Organization 2020 guidelines on physical activity and sedentary behaviour. Br J Sports Med. 2020;54(24):1451–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Dixon BN, Ugwoaba UA, Brockmann AN, Ross KM. Associations between the built environment and dietary intake, physical activity, and obesity: a scoping review of reviews. Obes Rev. 2021;22(4): e13171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Phillips CB, Engelberg JK, Geremia CM, Zhu W, Kurka JM, Cain KL, Sallis JF, Conway TL, Adams MA. Online versus in-person comparison of Microscale Audit of Pedestrian Streetscapes (MAPS) assessments: reliability of alternate methods. Int J Health Geogr. 2017;16(1):27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Cain KL, Geremia CM, Conway TL, Frank LD, Chapman JE, Fox EH, Timperio A, Veitch J, Van Dyck D, Verhoeven H, Reis R, Augusto A, Cerin E, Mellecker RR, Queralt A, Molina-García J, Sallis JF. Development and reliability of a streetscape observation instrument for international use: mAPS-global. Int J Behav Nutr Phys Act. 2018;15(1):19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Fox EH, Chapman JE, Moland AM, Alfonsin NE, Frank LD, Sallis JF, Conway TL, Cain KL, Geremia C, Cerin E, Vanwolleghem G, Van Dyck D, Queralt A, Molina-García J, Hino AAF, Lopes AADS, Salmon J, Timperio A, Kershaw SE. International evaluation of the Microscale Audit of Pedestrian Streetscapes Global instrument: comparative assessment between local and remote online observers. Int J Behav Nutr Phys. 2021;18(1):84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Mooney SJ, Bader MDM, Lovasi GS, Teitler JO, Koenen KC, Aiello AE, Galea S, Goldmann E, Sheehan DM, Rundle AG. Street audits to measure neighborhood disorder: virtual or in-person? Am J Epidemiol. 2017;186(3):265–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Andersen OK, O’Halloran SA, Kolle E, Lien N, Lakerveld J, Arah OA, Gebremariam MK. Adapting the SPOTLIGHT Virtual Audit Tool to assess food and activity environments relevant for adolescents: a validity and reliability study. Int J Health Geogr. 2021;20(1):4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Zhu W, Sun Y, Kurka J, Geremia C, Engelberg JK, Cain K, Conway T, Sallis JF, Hooker SP, Adams MA. Reliability between online raters with varying familiarities of a region: microscale Audit of Pedestrian Streetscapes (MAPS). Landsc Urban Plan. 2017;167:240–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Vanwolleghem G, Ghekiere A, Cardon G, De Bourdeaudhuij I, D’Haese S, Geremia CM, Lenoir M, Sallis JF, Verhoeven H, Van Dyck D. Using an audit tool (MAPS Global) to assess the characteristics of the physical environment related to walking for transport in youth: reliability of Belgian data. Int J Health Geogr. 2016;15(1):41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Rzotkiewicz A, Pearson AL, Dougherty BV, Shortridge A, Wilson N. Systematic review of the use of Google Street View in health research: major themes, strengths, weaknesses and possibilities for future research. Health Place. 2018;52:240–6. [DOI] [PubMed] [Google Scholar]
  • 19.SAHO. South African History Online, towards a people’s history. Available from: https://www.sahistory.org.za/place/soweto-johannesburg#:~:text=In%202023%2C%20the%20population%20of,aged%207%20to%2017%20years. Accessed 6 Feb 2024.
  • 20.Prioreschi A, Ware LJ, Draper CE, Lye S, Norris SA. Contextualising individual, household and community level factors associated with sugar-sweetened beverage intake and screen time in Soweto. South Africa J Hunger Environ Nutr. 2022;8:1–17. [Google Scholar]
  • 21.Queralt A, Molina-García J, Terrón-Pérez M, Cerin E, Barnett A, Timperio A, Veitch J, Reis R, Silva AAP, Ghekiere A, Van Dyck D, Conway TL, Cain KL, Geremia CM, Sallis JF. Reliability of streetscape audits comparing on-street and online observations: mAPS-Global in 5 countries. Int J Health Geogr. 2021;20(1):6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Sirko W, Kashubin S, Ritter M, Annkah A, Bouchareb YSE, Dauphin Y, Keysers D, Neumann M, Cisse M, Quinn J. Continental-scale building detection from high resolution satellite imagery. arXiv; 2021. Available from: http://arxiv.org/abs/2107.12283. Accessed 12 Jun 2023.
  • 23.OpenStreetMap contributors. OpenStreetMap database. 2023. Available from: openstreetmap.org. Accessed 4 Mar 2024.
  • 24.McMillan TE, Cubbin C, Parmenter B, Medina AV, Lee RE. Neighborhood sampling: how many streets must an auditor walk? Int J Behav Nutr Phys Act. 2010;7(1):20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.MAPS. Dr. James Sallis. Available from: https://www.drjimsallis.com/maps. Accessed 7 Feb 2024.
  • 26.Millstein RA, Cain KL, Sallis JF, Conway TL, Geremia C, Frank LD, Chapman J, Van Dyck D, Dipzinski LR, Kerr J, Glanz K, Saelens BE. Development, scoring, and reliability of the Microscale Audit of Pedestrian Streetscapes (MAPS). BMC Public Health. 2013;13(1):403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Cicchetti DV. Methodological commentary the precision of reliability and validity estimates re-visited: distinguishing between clinical and statistical significance of sample size requirements. J Clin Exp Neuropsychol. 2001;23(5):695–700. [DOI] [PubMed] [Google Scholar]
  • 28.Canelas T, Godongwana M, Wayas FA, Lambert EV, Wasnyo Y, Foley L. Charting a course: navigating rigor and meaning in global health research. J Phys Act Health. 2024;21(2):113–4. [DOI] [PubMed] [Google Scholar]
  • 29.Curtis JW, Curtis A, Mapes J, Szell AB, Cinderich A. Using google street view for systematic observation of the built environment: analysis of spatio-temporal instability of imagery dates. Int J Health Geogr. 2013;12(1):53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kelly CM, Wilson JS, Baker EA, Miller DK, Schootman M. Using Google Street View to audit the built environment: inter-rater reliability results. Ann Behav Med. 2013;45(S1):108–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Fry D, Mooney SJ, Rodríguez DA, Caiaffa WT, Lovasi GS. Assessing Google Street view image availability in Latin American cities. J Urban Health. 2020;97(4):552–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Griew P, Hillsdon M, Foster C, Coombes E, Jones A, Wilkinson P. Developing and testing a street audit tool using Google Street View to measure environmental supportiveness for physical activity. Int J Behav Nutr Phys Act. 2013;10(1):103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Gullón P, Badland HM, Alfayate S, Bilal U, Escobar F, Cebrecos A, Diez J, Franco M. Assessing walking and cycling environments in the streets of Madrid: comparing on-field and virtual audits. J Urban Health. 2015;92(5):923–39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Global Diet and Activity Research Network. Global Diet and Activity Research Network. 2023. Available from: https://gdarnet.org/. Accessed 19 Mar 2024.
  • 35.International Transport Forum. Road Safety Annual Report 2019. OECD; 2019. (Road Safety Annual Report). Available from: https://www.oecd-ilibrary.org/transport/road-safety-annual-report-2019_2f0e33fe-en. Accessed 19 Mar 2024.
  • 36.Koekemoer K, Van Gesselleen M, Van Niekerk A, Govender R, Van As AB. Child pedestrian safety knowledge, behaviour and road injury in Cape Town. South Africa Accid Anal Prev. 2017;99:202–9. [DOI] [PubMed] [Google Scholar]
  • 37.McHugh ML. Interrater reliability: the kappa statistic. Biochem Medica. 2012;22:276–82. [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The datasets during and/or analyzed during the current study are available from the corresponding author on reasonable request.


Articles from Journal of Urban Health : Bulletin of the New York Academy of Medicine are provided here courtesy of New York Academy of Medicine

RESOURCES