Health Sentinel: A mobile crowdsourcing platform for self-reported surveys provides early detection of COVID-19 clusters in San Luis Potosí, Mexico

Salvador Ruiz-Correa; Rubén López-Revilla; Fernando Díaz-Barriga; Francisco Marmolejo-Cossío; Viridiana del Carmen Robledo-Valero; Emilio Ernesto Hernández-Huérfano; Leonardo Álvarez-Rivera; Mónica Liliana Rangel-Martínez; Miguel Ángel Lutzow-Steiner; Luis Alfredo Ortiz-Vázquez; Andrea Rebeca Mendoza-Lara; Montserrat Olivo-Rodríguez; Marco Sebastián Galván-Ramírez; Ángel Emanuel Morales-Neri; Víctor Uriel Martínez-Donjuan; Massiel Isabella Cervantes-Irurzo; Andreu Comas-García; Fernando Hernández-Maldonado; Carlos Aguilar-Acosta

doi:10.1016/j.ijmedinf.2021.104508

. 2021 May 28;153:104508. doi: 10.1016/j.ijmedinf.2021.104508

Health Sentinel: A mobile crowdsourcing platform for self-reported surveys provides early detection of COVID-19 clusters in San Luis Potosí, Mexico

Salvador Ruiz-Correa ^a,^*,¹, Rubén López-Revilla ^b,¹, Fernando Díaz-Barriga ^c, Francisco Marmolejo-Cossío ^d, Viridiana del Carmen Robledo-Valero ^a, Emilio Ernesto Hernández-Huérfano ^a, Leonardo Álvarez-Rivera ^a, Mónica Liliana Rangel-Martínez ^e, Miguel Ángel Lutzow-Steiner ^e, Luis Alfredo Ortiz-Vázquez ^a, Andrea Rebeca Mendoza-Lara ^a, Montserrat Olivo-Rodríguez ^a, Marco Sebastián Galván-Ramírez ^a, Ángel Emanuel Morales-Neri ^a, Víctor Uriel Martínez-Donjuan ^a, Massiel Isabella Cervantes-Irurzo ^a, Andreu Comas-García ^c, Fernando Hernández-Maldonado ^e, Carlos Aguilar-Acosta ^f

PMCID: PMC8159707 PMID: 34098316

Abstract

Background

The Health Sentinel (Centinela de la Salud, CDS), a mobile crowdsourcing platform that includes the CDS app, was deployed to assess its utility as a tool for COVID-19 surveillance in San Luis Potosí, Mexico.

Methods

The CDS app allowed anonymized individual surveys of demographic features and COVID-19 risk of transmission and exacerbation factors from users of the San Luis Potosí Metropolitan Area (SLPMA). The platform’s data processing pipeline computed and geolocalized the risk index of each user and enabled the analysis of the variables and their association. Point process analysis identified geographic clustering patterns of users at risk and these were compared with the patterns of COVID-19 cases confirmed by the State Health Services.

Results

A total of 1554 COVID-19 surveys were administered through the CDS app. Among the respondents, 50.4 % were men and 49.6 % women, with an average age of 33.5 years. Overall risk index frequencies were, in descending order: no-risk 77.8 %, low risk 10.6 %, respiratory symptoms 6.7 %, medium risk 1.4 %, high risk 2.0 %, very high risk 1.5 %. Comorbidity was the most frequent vulnerability category (32.4 %), followed by the inability to keep home lockdown (19.2 %). Statistically significant risk clusters identified at a spatial scale between 5 and 730 m coincided with those in neighborhoods containing substantial numbers of confirmed COVID-19 cases.

Conclusions

The CDS platform enables the analysis of the sociodemographic features and spatial distribution of individual risk indexes of COVID-19 transmission and exacerbation. It is a useful epidemiological surveillance and early detection tool because it identifies statistically significant and consistent risk clusters in neighborhoods with a substantial number of confirmed COVID-19 cases.

Keywords: COVID-19, SARS-CoV-2, Personal risk index, Vulnerability, Mobile crowdsourcing platform, Geolocalized survey, Point pattern distribution, Risk cluster, Infection hotspot

1. Introduction

The coronavirus disease 2019 (COVID-19) epidemic in Mexico is part of the pandemic caused by the Severe Acute Respiratory Syndrome Coronavirus Virus 2, SARS-CoV-2 [1]. Mexican health authorities confirmed the first COVID-19 case on February 27, 2020 [2] and declared a health emergency on April 9, 2020. Mitigation measures implemented in early March 2020 to avoid healthcare saturation, included voluntary home confinement, maintaining a safe distance between people, suspension of non-critical school and work activities, and frequent hand hygiene [3]. At the time of writing (January 27, 2021), there had been 1,788,905 confirmed COVID-19 cases, 152,016 deaths, and a case fatality ratio (CFR) of 8.5 % in Mexico, the country with the largest CFR [4]. COVID-19 features in Mexico are summarized in Table 1 . The disease quickly spread across Mexican cities and on March 13 the first case was confirmed in the San Luis Potosí Metropolitan Area (SLPMA); at the time of writing, the state of San Luis Potosí had 48,486 confirmed cases, and 3632 deaths.

Table 1.

Main COVID-19 features in Mexico [2,31,32].

1
Around 80 % of patients infected by SARS-CoV-2 recover without needing hospital admission.
2
Around one out of five people who get COVID-19 become seriously ill and develop difficulty breathing.
3
Obesity is the strongest predictor for COVID-19, followed by diabetes and hypertension in both sexes, and chronic renal failure in women only.
4
Active smoking is associated with an increased risk of COVID-19.
5
Men are more likely to be hospitalized and to die than women.
6
Individuals more than 50 years old are more likely to be hospitalized and die.
7
Individuals with hypertension, obesity, and diabetes are more likely to be hospitalized and die than those without these morbidities.
8
Individuals infected by SARS-CoV-2 have a higher chance of dying if they are simultaneously affected by two or more chronic conditions.
9
There is a high risk of fatal complications from COVID-19 due to the high prevalence of chronic non-degenerative diseases in Mexicans.

Open in a new tab

COVID-19 spreads in clusters [5], as individuals pass the virus to their contacts, resulting in clusters of geographically localized infections [[6], [7], [8]]. Cluster mapping has been useful to explore transmission characterized by spatial features distinctive of early infection scattering, community spread, and full-scale outbreak [9,10]. Epidemiological surveillance of COVID-19 cases in Mexico is carried out through sentinel surveillance for early detection of suspected cases through their most frequent signs and symptoms and epidemiological antecedents [11]. Only 0.4 qRT-PCR COVID-19 tests per 1000 inhabitants have been administered since only symptomatic subjects are eligible for testing [11].

COVID-19 apps are informational, self-assessment/medical-reporting, contact tracing, multipurpose, and others; multipurpose apps combine the functionality of at least two of the first three categories [12]. The limited resources to conduct diagnostic tests for early detection of SARS-CoV-2 in San Luis Potosí prompted us to develop the Health Sentinel (CDS), a mobile crowdsourcing platform with a multipurpose app (Table S 1) for collecting real-time surveys of self-reported demographic data, signs and symptoms, and vulnerabilities related to COVID-19. The CDS app also serves as a public surveillance tool, to educate users about their risk, and as a community triage to support the Epidemiological Intelligence task force of the UASLP School of Medicine to address the COVID-19 pandemic. CDS data collected from April 1 to July 31, 2020, presented here address the research questions listed in Table 2 .

Table 2.

Research questions addressed in this study.

RQ1: What are the sociodemographic features of the app users?

RQ2: How are the risk indexes of the app users geographically distributed?

RQ3: Does the point-pattern distribution of risk cases and confirmed COVID-19 cases show statistically significant and consistent clustering features?

RQ4: At what spatial scale are the clustering features of risk cases statistically significant; do they change with time and correlate with the clustering features of COVID-19 confirmed cases?

Open in a new tab

2. Methods

2.1. Study design

This study stemmed from the local VIRUS Program, inspired by the United Nations Human Security methodology [13], focused on collective actions to address community threats and comorbidities associated with COVID-19. It allowed our research team to evaluate the potential contribution of the CDS app for epidemiological surveillance of COVID-19 in the SLPMA, through a descriptive analysis of the users (demographics, risk categories, geographic location, and clusters), and by statistically comparing their spatial and temporal distribution with confirmed COVID-19 cases as a way of validating the CDS app (Table 2).

2.2. Setting and participants

The SLPMA includes the San Luis Potosi City and Soledad de Graciano Sánchez municipalities, subdivided into primary geostatistical areas (AGEBs) [15]. Data collected in the SLPMA consists of two distinct data sets: one originating from survey responses of volunteer app users, and another one of confirmed COVID-19 cases provided by the local health department. Out of the 9971 anonymized COVID-19 cases confirmed through qRT-PCR tests between March 12 and August 3 in the SLPMA, only 9453 (94.8 %) with known postal addresses were included. Privacy laws did not allow app users to indicate if they had been tested and included in the COVID-19 confirmed cases database.

The Mexican Health Ministry established that COVID-19 testing was required for people with suspected COVID-19 symptoms [11]. The overall spectrum of symptom severity is included in the confirmed cases database.

Federal regulations require subjects manifesting COVID-19 symptoms to provide consent for the use of their data. SSSLP records were stored in the National Supercomputing Center data center at IPICYT, a tier 3 facility designed to assure data integrity and privacy according to Mexican law. Only authorized personnel had access to the SSSLP data.

The CDS requires 11.76 MB of data space and uses state-of-the-art mobile crowdsourcing [14] to collect self-assessment survey data; it was deployed on April 1, 2020, in the SLPMA, and became available through Google Play Store on May 1. CDS app users do not require a personal account and their data are sent to the cloud through an open Wi-Fi connection to prevent the use of their data; users can participate multiple times. The local health department registered the CDS app after checking its compliance with data privacy, security, and ethical requirements. To activate the app, users had to read and agree with the terms of use and consent form. CDS architecture is described in Appendix A. The CDS app was available for Android OS only; 68.3 % of Mexico’s population has access to a smartphone, with 82.5 % Android OS users, and 17.1 % iOS users.

2.3. Personal COVID-19 risk index assessment

A group of medical and public health experts from UASLP School of Medicine, SSSLP, and IPICYT developed the algorithm to compute the individual risk assessment. Risk calculation combines information about the risk of infection and risk of disease severity into one category. This category is most useful at the individual level to help users decide on an informed course of action by following official recommendations from local health authorities, based on the COVID-19 operational definition of the World Health Organization [16]. This category is also helpful to understand the risk of infection across the SLPMA area, but its use to investigate the risk of severity is limited.

The personal profile includes the user’s gender, age, and occupation. Risk index calculation depends on three categories of variables: contact, signs-and-symptoms, and comorbidity. The CDS app enables the administration of the eight questions on risk items (Table 3 ), conforming to the COVID-19 guidelines of the World Health Organization [17] and the Mexican System for Epidemiological Surveillance [18].

Table 3.

Survey questions of the three risk categories.

Risk category	Questions
Contact factors	1 During the last 14 days have you traveled, had close contact with people who have traveled outside the country or who have come from abroad?
	2 Have you had close contact with someone with signs of COVID-19 (cough, fever, shortness of breath), even though they have not had the confirmation test?
	3 Have you had close contact with someone who tested positive for COVID-19 or was with the disease by a doctor?
Signs and symptoms	4 During the last 7 days, have you had these two symptoms, cough and fever higher than 37.5 °C?
	5 During the last 7 days, have you had any of these symptoms: headache, muscle aches, extreme tiredness, sore throat, joint pain, diarrhea?
	6 During the last 7 days, have you had any other respiratory additional symptoms such as shortness of breath, and chest pain?
High risk and comorbidity factors	7 Do you have one or more of these conditions: pregnancy, 65 years or older, work in a place where you serve people (gas station, food store, banks, informal work)?
High risk and comorbidity factors	8 Do you have one or more of these diseases: diabetes, hypertension, heart problems, liver problems, asthma, COPD, HIV/AIDS, cancer, chronic kidney failure, obesity, smoking?

Open in a new tab

Contact considers whether the user recently traveled to a country with a COVID-19 epidemic, had close contact with someone recently returned from abroad, or has suspected or confirmed infection. Signs and symptoms include cough and fever above 37.5 °C, six general symptoms, and two respiratory symptoms. Comorbidity includes risk factors that can exacerbate the disease, and whether the user belongs to one or more risk groups or has one or more concomitant illnesses that can worsen COVID-19. Risk groups are pregnant women and people aged 65 years or more. Concomitant diseases include diabetes, hypertension, heart problems, liver disease, asthma, chronic pulmonary obstructive disease (COPD), HIV/AIDS, cancer, chronic kidney failure, obesity, and smoking which increase the risk of fatal outcomes associated with COVID-19 [19]. Two questions address social vulnerabilities: 1) Can you stay home regardless of developing COVID-19 symptoms? and 2) Do you have someone (relative, friend, neighbor) who can help you in case you get ill?; these questions are motivated by the fact that a significant proportion of Mexicans have informal employment and cannot stay at home [20].

Personal risk indexes are, in order of increasing severity: risk-free, low-risk, moderate-risk, high-risk, severe-risk. Based on their risk index, users were subdivided into risk-free cases, risk cases (low- to severe-risk), and respiratory-risk cases (Table 4 ). Since the risk index is a triage tool, respiratory-risk cases without evidence of COVID-19 are treated separately, atypical pneumonia being one of the possible diagnoses [21].

Table 4.

Calculation of the users’ risk index. (For interpretation of the references to color in this table, the reader is referred to the web version of this article).

Open in a new tab

^aBased on the answers to the survey questions of the three risk categories enlisted in Table 3.

2.4. Georeferenced database

Data collected through CDS includes the georeferenced users’ contributions, and a web application allows data visualization through a dashboard (Fig. 1 ) as soon as the user completes the survey.

Fig. 1 — *CDS* platform front-end dashboard. The web application allows epidemiologists and public health specialists to visualize real-time data through a dashboard. The geographic information system displays several data layers, including population density, poverty index, overcrowding, medical center locations, business locations, and contributing users’ locations. The dashboard also shows descriptive statistics of all the variables of the *CDS* survey. Color codes: green (risk-free), low-risk (yellow), medium-risk (orange), high-risk (red), very high risk (maroon), respiratory signs only (purple). The image on the top right shows the location of the SLPMA (population 1.4 million) within Mexico (population 135 million). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article).

During the survey collection period, health authorities issued a home confinement recommendation. Visual inspection of the GPS data shows that most users completed and sent their surveys from residential areas, probably from their place of residence. These areas also encompass social and recreational activities.

2.5. Spatial data analysis

Coronavirus infection spreads in clusters, whose early identification is essential for slowing down SARS-CoV-2 transmission [9,22]. The spatial analysis allowed us to determine whether the geographical features of the data collected with the CDS app suggest the presence of COVID-19 clusters across geographical space and whether they coincide with those of confirmed COVID-19 case clusters. Two complementary statistical tools enabled quantification of spatial features: computation of point process descriptive statistics, and spatial correlation modeling. We emphasize that the risk index calculation combines information about risk of infection and risk of disease severity into one category. Therefore, clustering information derived from the index is useful to understand risk of infection, but not about risk of disease severity. This is important to interpret the data presented in the results section.

2.5.1. Point process descriptive statistics

We used kernel density estimation tools to estimate point process intensity functions [23] and visually explore risk-free and risk cases. We used Ripley’s K-function and Monte Carlo simulations to describe how point patterns occur over a given area and to determine whether they are dispersed, clustered, or randomly distributed. Point process statistics computation is described in Appendix B.

2.5.2. Spatial correlation modeling

Point process statistics allowed us to establish clustering features in CDS geographic data from observed point pattern correlations. An approach to account for these correlations is the log-Gaussian Cox process model [24], which describes the spatial distribution of symptoms as a function of explanatory variables and estimates a residual process independent of these variables (Appendix C).

3. Results

3.1. Socio-demographic users’ features (RQ1)

Contributors to CDS surveys resided in Mexico (n = 2125) and other countries (United States, Bolivia, Chile, Spain, Argentina, France, and Bangladesh; n = 64). Only SLPMA users (n = 1554) were included in this study, whose overall mean age ± SD was 33.5 ± 11.5 years, with a median of 30.0 years (Table 5 ). The predominant age group was that of 20−29 years (44.0 %). The three largest groups, spanning 20–49 years, comprised 85.1 % of the total. Female and male contributors represented 49.6 % and 50.4 %, respectively.

Table 5.

Age and gender distribution of CDS data^a.

Age group (years)	Male (M)		Female (F)		All		M/F ratio
10−19	33	2.1%	22	1.4 %	55	3.5%	1.50
20−29	346	22.3%	338	21.8%	684	44.0 %	1.02
30−39	202	13.0%	217	14.0%	419	27.0%	0.93
40−49	115	7.4%	104	6.7 %	219	14.1%	1.11
50−59	63	4.1%	61	3.9%	124	8.0%	1.03
60−69	21	1.4 %	25	1.6%	46	3.0%	0.84
70−79	3	0.2%	4	0.3%	7	0.5%	0.75
Total	783	50.4 %	771	49.6 %	1554	100.0%

Open in a new tab

In the SLPMA the predominant age groups are, in descending order, those of 10−19 years (18.65 %), 20–29 years (17.54 %), 0−9 years (16.65 %), and 40–49-years (13.29 %). Our study did not consider children under 10 years of age.

3.2. Spatial analysis of the risk cases

3.2.1. Geographic distribution of risk indexes (RQ2)

The SLPMA has 1,094,177 inhabitants distributed over 1,787.72 km². This area is subdivided into AGEBs with a mean ± SD size of 3.80 ± 5.88 km², and a mean ± SD population of 2348 ± 3098 inhabitants.

A population density map is shown in Fig. 2 , and the AGEBs heat-map of risk cases shown in Fig. 3 depicts the normalized number of risk reports per AGEB, conditioned to a given risk index, to provide information on the conditional probability of the number of reports. This figure intends to depict the normalized number of risk reports per AGEB, conditioned to a given risk category (low-risk, medium-risk, etc.); namely the heat maps provide information a conditional probability on the number of reports. The following data are presented as supplementary material: personal risk index distribution by gender and age in Table S 2, occupation distribution of risk cases by gender in Table S 3, personal risk index distribution by gender in Table S 4, and overall distribution of risk factors in Table S 5.

Point process analysis of the risk cases generated darker green areas indicating higher intensity. Fig. 4 depicts the spatial distribution of all survey contributions, no-risk cases, and risk (low- to severe-risk) cases. This figure aims to provide an alternative visualization of raw data collected and how they segregate as a function of all users’ risk and no-risk contributions. These functions consider kernel density estimation techniques with a bandwidth h = 1200 m. Lower bandwidth values are more likely to overfit data. Note that Fig. 4c summarizes raw data from Fig. 3a–d.

Most CDS contributions corresponded to non-risk cases (Fig. 4b); risk cases amounted to 15.77 %, and their maximum intensity was located in an area of high mobility and social interaction (Fig. 4c) which is also a populated residential area.

3.2.2. Point pattern clustering features (RQ3)

3.2.2.1. Point process descriptive statistics

Ripley’s L function was computed using Procedure I of Appendix B to test whether the point patterns of risk cases fulfill the nonhomogeneous CSR hypothesis. The reference probability measure was given by the AGEBs population (Fig. 2). L functions of Periods I (April-May) and II (April-July) are shown in Fig. 5 .

Fig. 5 — Spatial distribution patterns of risk cases. Period I: (a) L-function (solid line) and upper and lower envelopes (dashed lines) from Monte Carlo simulations for risk cases. The L function is above the upper envelope at scales between 0 and 600 m, suggesting significant clustering features (*L > L_up >* 0, *p <* 0.001) at these scales. (b) The corresponding p-values for each h in the analysis range. The red dashed line represents a p-value of 0.0. Period II: (c) L-function (solid line) and upper and lower envelopes (dashed lines) resulting from MC simulations for the risk cases. (d) The L function is above the upper envelope at scales between 0 and 730 m, suggesting significant clustering features (*L > L_up >* 0, *p <* 0.001) at these scales. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article).

The L-function (solid line) and the corresponding upper and lower envelopes (dashed lines) for risk cases computed with Period I and Period II data are shown in Fig. 5a and c, respectively. The L function is above the upper envelope at scales between 0 and 600 m (0 and 730 m), suggesting that accumulated risk cases do not fulfill the CRS hypothesis and exhibit clustering features in this spatial range (L > L_up > 0, p < 0.001). Fig. 5b and d show the corresponding p-values, where the red dashed line represents p = 0.01.

Procedure 2 of Appendix B enabled local tests of homogeneous CSR hypotheses. Fig. 6 shows color-labeled AGEBs of the SLPMA with p-values < 0.05 calculated at scales of 300, 450, and 600 m for Period I and Period II. Several AGEBs exhibited statistically significant clusters (Fig. 6 d–f) at different scales in the 0−730 m range. Risk-cases clusters from Period II are those statistically significant at a 730 m scale. Period I has similar results at different scales in the 0−600 m range (Fig. 6 a–c) with risk case clusters as those statistically significant at a 600 m scale. Some risk clusters computed in Period I intersect those computed in Period II (compare Fig. 6 a–c and d-f plots).

3.2.2.2. Spatial correlation model

We conducted Bayesian inference of a log-Gaussian Cox model of the risk cases data [25] considering seven covariates: population density, male proportion, age, travel abroad or contact with someone who traveled abroad, contact with a suspect COVID-19 case, contact with a confirmed COVID-19 case, risk conditions, and comorbidities. Calculations were conducted over a 128 × 128 grid. The Markov chain Monte Carlo inference algorithm was run with 2,200,000 iterations, an initial burn-in of 200,000 iterations followed by 2,000,000 further iterations from which every 1000th sample was stored. Markov chain mixing and convergence were conducted through standard techniques [24]. To reveal SLPMA areas with a high incidence of risk symptoms (Fig. 7 ) we also computed exceedance probabilities (Appendix C) with the variance σ², scale φ, and other latent field parameters enlisted in Table 6 .

Fig. 7 — Areas of a high incidence of risk symptoms for Period II data. The posterior exceedance probabilities *P(exp(Y) > k|X)* computed for several values of k identified areas of the SLPMA with a particularly high incidence of risk symptoms. The figure shows the exceedence probability for *k = 1*.

Table 6.

Summary of the latent field parameters.

Parameter	Significant effect	Relative risk	Median (95 % Confidence Interval)
σ	--	--	1.66 (1.39–1.90)
φ	--	--	153.20 (99.51–240.60)
Population density	Yes	Increase	2.32 (1.90–2.90)
Travel abroad	Yes	Increase	9.50 (4.92–19.59)
Contact with suspect COVID-19 patient	Yes	Increase	7.84 (4.27–14.35)
Contact with confirmed COVID-19 patient	Yes	Increase	2.55 (1.38–4.84)
Risk conditions	Yes	Increase	2.05 (1.13–3.83)
Age	No	Reduction	0.98 (0.97–2.46)
Male proportion	No	Increase	1.46 (0.83–2.46)
Comorbidities	No	Increase	1.23 (0.75–2.19)

Open in a new tab

3.2.3. Distribution of risk-clusters and confirmed COVID-19-clusters (RQ4)

We compared the distribution of clusters identified from the 9971 SLPMA COVID-19 cases confirmed between March 12 and August 3, with the distribution of risk case clusters identified from the April-July surveys (Period II). Only 9453 confirmed COVID-19 patients (94.8 %) with known postal addresses were included in the comparison. Individual records included demographics, date when symptoms started, and date when the diagnosis was confirmed. Several neighborhoods had clusters of COVID-19 cases confirmed during Periods I and II (Table 7 ). A summary of the findings retaled to RQ4 are listed in Table 10.

Table 7.

Top five SLPMA neighborhoods with clusters of confirmed COVID-19 cases during March 12 and August 3, 2020.

Month	Accumulated neighborhoods	Accumulated cases	The top five COVID-19 case clusters per neighborhood
			Neighborhood	Cases
			Neighborhood	Number	%
March (M1)	27	42	La Forestal	5	11.9
			La Loma	5	11.9
			Colonia 107	3	7.1
			Tangamanga	2	4.8
			San Pedro	2	4.8
April (M2)	85	147	Industrial Aviación	7	4.8
			Maya Mil	6	4.1
			21 de Marzo	6	4.1
			La Forestal	5	3.4
			La Loma	5	3.4
May (M3)	287	616	Tequisquiapan area	13	2.1
			Jardines del Sur	12	2.0
			21 de Marzo	12	2.0
			San Ángel	11	1.8
			San Ángel Inn	9	1.5
June (M4)	587	1827	Centro	29	1.6
			Tequisquiapan area	26	1.4
			San Ángel	26	1.4
			Jardines Sur	24	1.3
			Las Mercedes	22	1.2
July (M5)	1076	6180	Centro	133	2.2
			Tequisquiapan area	106	1.7
			Simón Díaz	72	1.7
			Las Mercedes	69	1.1
			Satélite	66	1.1
August (M6)	1315	8592	Centro	161	1.9
			Tequisquiapan area	138	1.6
			Progreso	107	1.2
			Simón Díaz	97	1.1
			Capricornio	97	1.1

Open in a new tab

Table 10.

Key findings on RQ4.

Most CDS contributions corresponded to non-risk cases (Fig. 4b); risk cases amounted to 15.77%, and their maximum intensity was located in a densely populated residential area of high mobility and social interaction (Fig. 4c).

Risk cases from Period II (April-July) had statistically significant spatial clustering independent of population density across geographical scales from 5 to 730 m (Fig. 6 d–f) and risk clusters were computed at a scale of 730 m. Similar results were obtained for clusters of COVID-19 cases confirmed in Period I (April-May) at a scale of 600 m (Fig. 6 a–c). Many Period II clusters have significant overlap, are more prominent and expand several Period I clusters (Fig. 8).

Period II clusters also overlap city areas where the exceedance probabilities are close to one (Fig. 7). Risk case clusters in areas with a high incidence of risk of symptoms in Period I evolved to become the larger confirmed COVID-19 clusters of Period II (Fig. 8).

Risk case clusters A–E (Fig. 8) were compared with the quantitative cluster analysis of COVID-19 confirmed cases. Cluster A (294 cases) had several cases comparable to the top-five confirmed neighborhoods. Cluster B (138 cases) and Cluster C (37 cases) intersect an area ranking second among confirmed SARS-CoV-2 hotspots. Cluster D (56 cases) includes the largest city park surrounded by a trail for strolling/jogging/biking. Cluster E (68 cases), on the northwest side of the park, intersects five neighborhoods. Cluster F (77 cases) intersects the Graciano Sanchez municipality, which includes seven neighborhoods.

Comparing the curve of cluster A showing the cumulative number of risk cases per week surveyed in Period II with that of confirmed cases in March 12 and August 3 (Fig. 9) we observed that risk cases started in Period I, before the transition to the exponential phase of confirmed COVID-19 cases.

Open in a new tab

4. Discussion

Early detection of COVID-19 hotspots has been achieved in Israel through one-minute population-wide surveys [9]. In the United Kingdom, a mobile app for real-time tracking of self-reported symptoms predicted potential COVID-19 [26]. CDS is a multi-use platform allowing early detection of COVID-19 hotspots that may be of use to prevent spreading, especially at the beginning of the epidemic, when data of confirmed cases are scarce.

The sociodemographic features of CDS users result in a sampling bias whose sources, listed in Table 9, restrict the assessment to the surveyed population [27].

Table 9.

Sources of data bias [14,27].

Source	Features
Age of the sampled population	The highest proportion of CDS users were those of 20−29 years (44.02%) and 30−39 years (26.96%). The predominant age groups were, in descending order, those of 10−19 years (18.65%), 20−29 years (17.54%), 0−9 years (16.65%), and 40–49-years (13.29%). Children under 10 years were not considered in this study.
Age of the sampled population	The predominant CDS groups are adults with a smartphone and internet connection. The elderly population is less likely to have access to and master the mobile technology deployed in the field; the analysis was directed to risk cases younger than 65 years.
Availability of technological resources	The CDS app was available for Android OS only; 68.3% of Mexico’s population has access to a smartphone, with 82.5% Android OS users, and 17.1% iOS users; the rest use Samsung, Windows, and PlayStation devices.
Availability of PCR COVID-19 tests	Asymptomatic individuals infected with SARS-CoV-2 are not eligible for COVID-19 testing due to the limited number of tests available and therefore are not considered in the SSSLP data.

Open in a new tab

Most CDS contributions corresponded to non-risk cases (Fig. 4b); risk cases amounted to 15.77 % and their maximum intensity was located in an area of high mobility and social interaction (Fig. 4c). This urban space is also a populated residential area.

Risk cases from Period II (April-July) had statistically significant spatial clustering independent of population density across geographical scales from 0 to 730 m (Fig. 6 d-f), and risk clusters were computed at a scale of 730 m. Independence of clustering from population density is further supported by the fact that the Pearson’s correlation coefficient between density and number of risk cases per AGEB is low ( $ρ = 0.17$ ) as show in Fig. 2. Similar results were obtained for clusters of COVID-19 cases confirmed in Period I (April-May) at a scale of 600 m (Fig. 6 a–c). Many Period II clusters have significant overlap, are more prominent, and expand several Period I clusters (Fig. 8 ).

Fig. 8 — Socio-urban analysis of statistically significant point-pattern clusters. Risk-case clusters are shown by the shaded areas labeled A–E. Cluster A (294 confirmed COVID-19 cases), intersects several of the neighborhoods with the highest incidence of COVID-19 cases listed in Table 6, Table 7. Cluster B (138 confirmed COVID-19 cases) intersects the Tequisquiapan area that ranks second among the confirmed SARS-CoV-2 clusters. Cluster C (37 confirmed COVID-19 cases) intersects neighborhoods adjacent to the Tequisquiapan area. Cluster D (56 confirmed COVID-19 cases) includes the largest city park surrounded by a trail for strolling/jogging/biking. Cluster E (68 COVID-19 cases), on the northwest side of the park, intersect five neighborhoods. Cluster F (77 confirmed COVID-19 cases) intersects Graciano Sanchez municipality, which includes seven neighborhoods. H and J are groups of risk cases that also intersect areas with a high incidence of COVID-19 confirmed cases. (a) April-May data, (b) April-July data, (c) Exceedance probabilities *P(exp(Y) > k = 1 |X)* associated with the covariate-adjusted latent field Y (April-July data).

Interestingly, Period II clusters also overlap city areas where the exceedance probabilities are close to one (Fig. 7). Risk case clusters in areas with a high incidence of risk of symptoms in Period I evolved to become the larger confirmed COVID-19 clusters of Period II (Fig. 7, Fig. 8).

Risk clusters computed with Period I and Period II data have significant spatial overlapping of neighborhoods with a high incidence of SARS-CoV-2 hotspots (Fig. 8, Fig. 9 ) and a substantial number of COVID-19 cases confirmed from March 12 to August 3.

Fig. 9 — Cumulative number of risk cases per week in risk cluster A from April to July (solid red line), and epidemiological curve of confirmed cases from March 12 to August 3 (solid black line). Both curves indicate risk cases with symptoms (April-May) preceded confirmed COVID-19 cases occurring during the exponential phase of the pandemic. Cluster A intersects 10 relevant neighborhoods: Bolivar, Torres del Santuario, Barrio de San Sebastián, Barrio de San Miguelito, Himno Nacional, Julián Carrillo, San Luis Rey, San Juan de Guadalupe, Independencia y Barrio de Guadalupe. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article).

The exponential growth of COVID-19 cases in the SLPMA began in June when public activities partially reopened, and SARS-CoV-2 hotspots increased significantly between May and August (Table 7, Table 8 ). The top infected neighborhood reached 161 confirmed COVID-19 cases with an 82 % increase between June and August (Table 8).

Table 8.

Persistence and growth of clusters of confirmed COVID-19 cases in the SLPMA between March 12 and August 31, 2020.

Neighborhoods with five top clusters	Cases accumulated	Cluster growth across time^a	Persistence (months)
Centro	161	82.0 % (M4-M6)	3
Tequisquiapan area	138	81.1% (M3-M6)	4
Progreso	107	0.0% (M6)	1
Simón Díaz	97	25.8% (M5-M6)	2
Capricornio	97	0.0% (M6)	1
Las Mercedes	69	68.1% (M4-M5)	2
Satélite	66	0.0% (M5)	1
San Ángel	26	57.7% (M3-M4)	2
Jardines del Sur	24	50.0 % (M3-M4)	2
21 de Marzo	12	50.0 % (M2-M3)	2
San Angel Inn	9	0.0% (M3)	1
Industrial Aviación	7	0.0% (M2)	1
Maya Mil	6	0.0% (M2)	1
Forestal	5	0.0% (M1-M2)	2
La Loma	5	0.0% (M1-M2)	2
Tangamanga	2	0.0% (M1)	1
San Pedro	2	0.0% (M1)	1

Open in a new tab

M1, month 1; M2, month 2; M3, month 3; M4, month 4; M5, month 5; M6, month 6.

Analysis of the epidemiological curve and the trend line of risk-clusters by neighborhood shed light on the associations between risk clusters, symptoms, and actual infection hotspots (Fig. 9). Analysis of cluster A (Fig. 8b and c) showed that risk cases had symptoms between April and May (red line of Fig. 9) before confirmed COVID-19 cases started to appear during the transition to the exponential phase of the pandemic (black line of Fig. 9).

Despite the biases of this study (Table 9), our results support the hypothesis that the risk clusters identified early coincide with the infection hotspots confirmed later, and that early risk-clusters predict the areas of further SARS-CoV-2 spreading.

The key points for future work are listed in Table 11 .

Table 11.

Future work key points.

An adequate response to the COVID-19 pandemic demands a fine-grained understanding of the population under study, since interventions have disparate effects on target populations, depending on multiple factors [28]. Since variation in the outcomes of pandemic interventions involves allocating extremely limited resources to different population sectors elicit different outcomes, the CDS platform may be used as a valuable tool to optimize resource allocation.

Having a limited supply of COVID-19 tests is not a constraint unique to San Luis Potosí. Whatever few tests available used to diagnose those already with symptoms is not as informative when 40%–45% of those infected with SARS-CoV-2 will remain asymptomatic and the potential of the virus to spread silently may be significant [29]. Whereas extensive contact tracing via testing has been successful in South Korea and other countries, such a scale of testing is infeasible in Mexico. One possible solution involves using heterogeneous population factors to decide how to allocate tests differentially [30]. This approach may profit from using a tool as CDS to quantify risk and different citizen profiles in diverse geographies.

Information on the spatial distribution of cases during the spread of transmissible diseases is crucial to design and execute effective interventions. The CDS platform allows documentation and exploration of COVID-19 risk self-assessment data to enable epidemiological intelligence tasks, and may be extended to address epidemiological events concurrent with COVID-19, such as the next expected influenza A and B outbreaks. Survey data could also provide useful information to allocate vaccination campaigns across the state of San Luis Potosí and elsewhere.

Open in a new tab

Authors’ contributions

Conception and design of the study: ACG, ARML, FDB, MLRM, MOR, RLR, SRC, VCRV. Data acquisition: CAA, FHM, MALS, MICI, EEHH. Analysis and interpretation of data: ACG, AEMN, FDB, MSGR, RLR, SRC, VCRV, VUMD. Drafting the manuscript: FMC, LAR, SRC, VCRV, RLR. All authors revised the manuscript and read and approved the version submitted.

Summary table

•
Mobile phone data providing services such as contact tracing, mobility, and proximity assessment have proven to be of use for COVID-19 epidemiologic surveillance and support of appropriate public health interventions in several countries.
•
We developed and successfully deployed CDS, a mobile crowdsourcing platform for COVID-19 surveillance that includes the CDS app, to survey self-reported signs, symptoms, and vulnerabilities in a city located in central Mexico.
•
Real-time assessment of the users’ self-reported risk factors by the CDS platform provides early detection, prediction, and evolution of actual COVID-19 case clusters.

Declaration of Competing Interest

The authors report no declarations of interest.

Acknowledgments

The study was partially funded by Consejo Estatal Electoral y de Participación Ciudadana (CEEPAC) de San Luis Potosí, Consejo Potosino de Ciencia y Tecnología (COPOCYT), IPICYT’s Youth Innovation Laboratory, and Servicios de Salud del Estado de San Luis Potosí (SSSLP). We are also grateful to the ACM special interest group on economics and computation (SIGecom, https://www.sigecom.org/) for its generous support. We thank the generous technical support form Daniel López Rodríguez, Jesús López Amaro, Rami Safadi and Eric Wasserman (from Facebook).

References

1.Velavan T.P., Meyer C.G. The COVID-19 epidemic. Trop. Med. Int. Health. 2020;25:278–280. doi: 10.1111/tmi.13383. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Suárez V., Suarez Quezada M., Oros Ruiz S., Ronquillo De Jesús E. Epidemiología de COVID-19 en México: del 27 de febrero al 30 de abril de 2020. Rev. Clínica Española. 2020;220(8):463–471. doi: 10.1016/j.rce.2020.05.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Secretaria de Salud, Gobierno de México . 2020. Inicia la fase 3 por COVID-19 - Coronavirus, Inicia La Fase 3 Por COVID-19.https://coronavirus.gob.mx/2020/04/21/inicia-la-fase-3-por-covid-19-2/ (Accessed 24 June 2020) [Google Scholar]
4.Coronavirus Resource Center . 2020. Mortality Analyses - Johns Hopkins Coronavirus Resource Center.https://coronavirus.jhu.edu/data/mortality (Accessed 24 June 2020) [Google Scholar]
5.Kissler S.M. University of Cambridge; 2017. Geographic and Demographic Transmission Patterns of the 2009 A/H1N1 Influenza Pandemic in the United States.https://www.repository.cam.ac.uk/handle/1810/273772 [Google Scholar]
6.Cai J., Sun W., Huang J., Gamber M., Wu J., He G. Indirect Virus Transmission in Cluster of COVID-19 Cases, Wenzhou, China, 2020. Emerg. Infect. Dis. 2020;26:1343–1345. doi: 10.3201/eid2606.200412. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Koo J.R., Cook A.R., Park M., Sun Y., Sun H., Lim J.T., Tam C., Dickens B.L. Interventions to mitigate early spread of SARS-CoV-2 in Singapore: a modelling study. Lancet Infect. Dis. 2020;20:678–688. doi: 10.1016/S1473-3099(20)30162-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Hodcroft E.B. Preliminary case report on the SARS-CoV-2 cluster in the UK, France, and Spain. Swiss Med. 2020;150 doi: 10.4414/smw.2020.20212. [DOI] [PubMed] [Google Scholar]
9.Rossman H., Keshet A., Shilo S., Gavrieli A., Bauman T., Cohen O., Shelly E., Balicer R., Geiger B., Dor Y., Segal E. A framework for identifying regional outbreak and spread of COVID-19 from one-minute population-wide surveys. Nat. Med. 2020:1–4. doi: 10.1038/s41591-020-0857-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Peng Z., Wang R., Liu L., Wu H. Exploring urban spatial features of COVID-19 transmission in Wuhan based on social media data. ISPRS Int. J. Geo-Inf. 2020;9:402. doi: 10.3390/ijgi9060402. [DOI] [Google Scholar]
11.Secretaría de Salud . 2020. Comunicado emergente sobre el uso de pruebas para la detección de antígeno SARS-CoV-2 en México.https://coronavirus.gob.mx/wp-content/uploads/2020/11/Prueba_antigenica_COVID_11Nov2020.pdf [Google Scholar]
12.Azevedo Silva M. 2020. COVID-19 Apps.https://eena.org/document/covid-19-apps [Google Scholar]
13.Human Security Unit of the United Nations . 2016. Human Security Handbook. An Integrated Approach for the Realization of the SDG’s.https://www.un.org/humansecurity/wp-content/uploads/2017/10/h2.pdf [Google Scholar]
14.Phuttharak J., Loke S.W. A review of mobile crowdsourcing architectures and challenges: toward crowd-empowered internet-of-things. IEEE Access. 2019;7:304–324. doi: 10.1109/ACCESS.2018.2885353. [DOI] [Google Scholar]
15.INEGI . 2020. Catálogo Único de Claves de Áreas Geoestadísticas Estatales, Municipales y Localidades.https://www.inegi.org.mx/app/ageeml/# [Google Scholar]
16.WHO . 2020. Coronavirus Disease (COVID-2019) Situation Reports.https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports/ [Google Scholar]
17.World Health Organization Digital technology for COVID-19 response. Newsletter. 2020:1. https://www.who.int/news/item/03-04-2020-digital-technology-for-covid-19-response (Accessed 13 October 2020) [Google Scholar]
18.Sistema Nacional de Vigilancia Epidemiológica . 2020. Sitio oficial COVID-19 México.https://covid19.sinave.gob.mx/ (Accessed 13 October 2020) [Google Scholar]
19.Zhou Y., Yang Q., Chi J., Dong B., Lv W., Shen L., Wang Y. Comorbidities and the risk of severe or fatal outcomes associated with coronavirus disease 2019: a systematic review and meta-analysis. Int. J. Infect. Dis. 2020;99:47–56. doi: 10.1016/j.ijid.2020.07.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.International Labor Organization . 2014. Informal Employment in Mexico: Current Situation, Policies and Challenges.http://www.ilo.org/wcmsp5/groups/public/---americas/---ro-lima/documents/publication/wcms_245889.pdf [Google Scholar]
21.Secretaría de Salud . 2020. Flujograma diagnóstico de la enfermedad respiratoria aguda en el contexto de la sindemia COVID19-influenza.https://coronavirus.gob.mx/wp-content/uploads/2020/10/Algoritmo_Dx_COVID_Flu_05Oct2020.pdf [Google Scholar]
22.Kissler S.M. University of Cambridge; 2017. Geographic and Demographic Transmission Patterns of the 2009 A/H1N1 Influenza Pandemic in the United States. [DOI] [Google Scholar]
23.Cucala L. Intensity estimation for spatial point processes observed with noise. Scand. Stat. Theory Appl. 2008;35:322–334. doi: 10.1111/j.1467-9469.2007.00583.x. [DOI] [Google Scholar]
24.Taylor B.M., Davies T.M., Rowlingson B.S., Diggle P.J. Bayesian inference and data augmentation schemes for spatial, spatiotemporal and multivariate log-gaussian cox processes in R. J. Stat. Softw. 2015;63:1–48. doi: 10.18637/jss.v063.i07. [DOI] [Google Scholar]
25.Waller L.A., Gotway C.A. John Wiley & Sons; 2004. Applied Spatial Statistics for Public Health Data.https://www.wiley.com/en-mx/Applied+Spatial+Statistics+for+Public+Health+Data-p-9780471387718 [Google Scholar]
26.Menni C., Valdes A.M., Freidin M.B., Sudre C.H., Nguyen L.H., Drew D.A., Ganesh S., Varsavsky T., Cardoso M.J., El-Sayed Moustafa J.S., Visconti A., Hysi P., Bowyer R.C.E., Mangino M., Falchi M., Wolf J., Ourselin S., Chan A.T., Steves C.J., Spector T.D. Real-time tracking of self-reported symptoms to predict potential COVID-19. Nat. Med. 2020;26:1037–1040. doi: 10.1038/s41591-020-0916-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Wang C., Han L., Stein G., Day S., Bien-Gund C., Mathews A., Ong J.J., Zhao P.Z., Wei S.F., Walker J., Chou R., Lee A., Chen A., Bayus B., Tucker J.D. Crowdsourcing in health and medical research: a systematic review. Infect. Dis. Poverty. 2020;9:1–9. doi: 10.1186/s40249-020-0622-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Akbarpour M., Cook C., Marzuoli A., Mongey S., Nagaraj A., Saccarola M., Tebaldi P., Vasserman S., Yang H. Socioeconomic network heterogeneity and pandemic policy response. SSRN Electron. J. 2020 doi: 10.2139/ssrn.3623593. (June 15, 2020). University of Chicago, Becker Friedman Institute for Economics Working Paper No. 2020-75. [DOI] [Google Scholar]
29.Oran D.P., Topol E.J. Prevalence of asymptomatic SARS-CoV-2 infection: a narrative review. Ann. Intern. Med. 2020;173:362–367. doi: 10.7326/M20-3012. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Biswas A., Bannur S., Jain P., Merugu S. 2020. COVID-19: Strategies for Allocation of Test Kits, ArXiv Prepr.http://arxiv.org/abs/2004.01740 [Google Scholar]
31.Pulido D., Basurto D., Cándido M., Salas J. 2020. Geospatial Spread of the COVID-19 Pandemic in Mexico, ArXiv Prepr.https://arxiv.org/abs/2006.07784 (Accessed 29 June 2020) [Google Scholar]
32.Hernández-Garduño E. Obesity is the comorbidity more strongly associated for Covid-19 in Mexico. A case-control study. Obes. Res. Clin. Pract. 2020;14:375–379. doi: 10.1016/j.orcp.2020.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0005] 1.Velavan T.P., Meyer C.G. The COVID-19 epidemic. Trop. Med. Int. Health. 2020;25:278–280. doi: 10.1111/tmi.13383. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0010] 2.Suárez V., Suarez Quezada M., Oros Ruiz S., Ronquillo De Jesús E. Epidemiología de COVID-19 en México: del 27 de febrero al 30 de abril de 2020. Rev. Clínica Española. 2020;220(8):463–471. doi: 10.1016/j.rce.2020.05.007. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0015] 3.Secretaria de Salud, Gobierno de México . 2020. Inicia la fase 3 por COVID-19 - Coronavirus, Inicia La Fase 3 Por COVID-19.https://coronavirus.gob.mx/2020/04/21/inicia-la-fase-3-por-covid-19-2/ (Accessed 24 June 2020) [Google Scholar]

[bib0020] 4.Coronavirus Resource Center . 2020. Mortality Analyses - Johns Hopkins Coronavirus Resource Center.https://coronavirus.jhu.edu/data/mortality (Accessed 24 June 2020) [Google Scholar]

[bib0025] 5.Kissler S.M. University of Cambridge; 2017. Geographic and Demographic Transmission Patterns of the 2009 A/H1N1 Influenza Pandemic in the United States.https://www.repository.cam.ac.uk/handle/1810/273772 [Google Scholar]

[bib0030] 6.Cai J., Sun W., Huang J., Gamber M., Wu J., He G. Indirect Virus Transmission in Cluster of COVID-19 Cases, Wenzhou, China, 2020. Emerg. Infect. Dis. 2020;26:1343–1345. doi: 10.3201/eid2606.200412. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0035] 7.Koo J.R., Cook A.R., Park M., Sun Y., Sun H., Lim J.T., Tam C., Dickens B.L. Interventions to mitigate early spread of SARS-CoV-2 in Singapore: a modelling study. Lancet Infect. Dis. 2020;20:678–688. doi: 10.1016/S1473-3099(20)30162-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0040] 8.Hodcroft E.B. Preliminary case report on the SARS-CoV-2 cluster in the UK, France, and Spain. Swiss Med. 2020;150 doi: 10.4414/smw.2020.20212. [DOI] [PubMed] [Google Scholar]

[bib0045] 9.Rossman H., Keshet A., Shilo S., Gavrieli A., Bauman T., Cohen O., Shelly E., Balicer R., Geiger B., Dor Y., Segal E. A framework for identifying regional outbreak and spread of COVID-19 from one-minute population-wide surveys. Nat. Med. 2020:1–4. doi: 10.1038/s41591-020-0857-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0050] 10.Peng Z., Wang R., Liu L., Wu H. Exploring urban spatial features of COVID-19 transmission in Wuhan based on social media data. ISPRS Int. J. Geo-Inf. 2020;9:402. doi: 10.3390/ijgi9060402. [DOI] [Google Scholar]

[bib0055] 11.Secretaría de Salud . 2020. Comunicado emergente sobre el uso de pruebas para la detección de antígeno SARS-CoV-2 en México.https://coronavirus.gob.mx/wp-content/uploads/2020/11/Prueba_antigenica_COVID_11Nov2020.pdf [Google Scholar]

[bib0060] 12.Azevedo Silva M. 2020. COVID-19 Apps.https://eena.org/document/covid-19-apps [Google Scholar]

[bib0065] 13.Human Security Unit of the United Nations . 2016. Human Security Handbook. An Integrated Approach for the Realization of the SDG’s.https://www.un.org/humansecurity/wp-content/uploads/2017/10/h2.pdf [Google Scholar]

[bib0070] 14.Phuttharak J., Loke S.W. A review of mobile crowdsourcing architectures and challenges: toward crowd-empowered internet-of-things. IEEE Access. 2019;7:304–324. doi: 10.1109/ACCESS.2018.2885353. [DOI] [Google Scholar]

[bib0075] 15.INEGI . 2020. Catálogo Único de Claves de Áreas Geoestadísticas Estatales, Municipales y Localidades.https://www.inegi.org.mx/app/ageeml/# [Google Scholar]

[bib0080] 16.WHO . 2020. Coronavirus Disease (COVID-2019) Situation Reports.https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports/ [Google Scholar]

[bib0085] 17.World Health Organization Digital technology for COVID-19 response. Newsletter. 2020:1. https://www.who.int/news/item/03-04-2020-digital-technology-for-covid-19-response (Accessed 13 October 2020) [Google Scholar]

[bib0090] 18.Sistema Nacional de Vigilancia Epidemiológica . 2020. Sitio oficial COVID-19 México.https://covid19.sinave.gob.mx/ (Accessed 13 October 2020) [Google Scholar]

[bib0095] 19.Zhou Y., Yang Q., Chi J., Dong B., Lv W., Shen L., Wang Y. Comorbidities and the risk of severe or fatal outcomes associated with coronavirus disease 2019: a systematic review and meta-analysis. Int. J. Infect. Dis. 2020;99:47–56. doi: 10.1016/j.ijid.2020.07.029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0100] 20.International Labor Organization . 2014. Informal Employment in Mexico: Current Situation, Policies and Challenges.http://www.ilo.org/wcmsp5/groups/public/---americas/---ro-lima/documents/publication/wcms_245889.pdf [Google Scholar]

[bib0105] 21.Secretaría de Salud . 2020. Flujograma diagnóstico de la enfermedad respiratoria aguda en el contexto de la sindemia COVID19-influenza.https://coronavirus.gob.mx/wp-content/uploads/2020/10/Algoritmo_Dx_COVID_Flu_05Oct2020.pdf [Google Scholar]

[bib0110] 22.Kissler S.M. University of Cambridge; 2017. Geographic and Demographic Transmission Patterns of the 2009 A/H1N1 Influenza Pandemic in the United States. [DOI] [Google Scholar]

[bib0115] 23.Cucala L. Intensity estimation for spatial point processes observed with noise. Scand. Stat. Theory Appl. 2008;35:322–334. doi: 10.1111/j.1467-9469.2007.00583.x. [DOI] [Google Scholar]

[bib0120] 24.Taylor B.M., Davies T.M., Rowlingson B.S., Diggle P.J. Bayesian inference and data augmentation schemes for spatial, spatiotemporal and multivariate log-gaussian cox processes in R. J. Stat. Softw. 2015;63:1–48. doi: 10.18637/jss.v063.i07. [DOI] [Google Scholar]

[bib0125] 25.Waller L.A., Gotway C.A. John Wiley & Sons; 2004. Applied Spatial Statistics for Public Health Data.https://www.wiley.com/en-mx/Applied+Spatial+Statistics+for+Public+Health+Data-p-9780471387718 [Google Scholar]

[bib0130] 26.Menni C., Valdes A.M., Freidin M.B., Sudre C.H., Nguyen L.H., Drew D.A., Ganesh S., Varsavsky T., Cardoso M.J., El-Sayed Moustafa J.S., Visconti A., Hysi P., Bowyer R.C.E., Mangino M., Falchi M., Wolf J., Ourselin S., Chan A.T., Steves C.J., Spector T.D. Real-time tracking of self-reported symptoms to predict potential COVID-19. Nat. Med. 2020;26:1037–1040. doi: 10.1038/s41591-020-0916-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0135] 27.Wang C., Han L., Stein G., Day S., Bien-Gund C., Mathews A., Ong J.J., Zhao P.Z., Wei S.F., Walker J., Chou R., Lee A., Chen A., Bayus B., Tucker J.D. Crowdsourcing in health and medical research: a systematic review. Infect. Dis. Poverty. 2020;9:1–9. doi: 10.1186/s40249-020-0622-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0140] 28.Akbarpour M., Cook C., Marzuoli A., Mongey S., Nagaraj A., Saccarola M., Tebaldi P., Vasserman S., Yang H. Socioeconomic network heterogeneity and pandemic policy response. SSRN Electron. J. 2020 doi: 10.2139/ssrn.3623593. (June 15, 2020). University of Chicago, Becker Friedman Institute for Economics Working Paper No. 2020-75. [DOI] [Google Scholar]

[bib0145] 29.Oran D.P., Topol E.J. Prevalence of asymptomatic SARS-CoV-2 infection: a narrative review. Ann. Intern. Med. 2020;173:362–367. doi: 10.7326/M20-3012. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0150] 30.Biswas A., Bannur S., Jain P., Merugu S. 2020. COVID-19: Strategies for Allocation of Test Kits, ArXiv Prepr.http://arxiv.org/abs/2004.01740 [Google Scholar]

[bib0155] 31.Pulido D., Basurto D., Cándido M., Salas J. 2020. Geospatial Spread of the COVID-19 Pandemic in Mexico, ArXiv Prepr.https://arxiv.org/abs/2006.07784 (Accessed 29 June 2020) [Google Scholar]

[bib0160] 32.Hernández-Garduño E. Obesity is the comorbidity more strongly associated for Covid-19 in Mexico. A case-control study. Obes. Res. Clin. Pract. 2020;14:375–379. doi: 10.1016/j.orcp.2020.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Health Sentinel: A mobile crowdsourcing platform for self-reported surveys provides early detection of COVID-19 clusters in San Luis Potosí, Mexico

Salvador Ruiz-Correa

Rubén López-Revilla

Fernando Díaz-Barriga

Francisco Marmolejo-Cossío

Viridiana del Carmen Robledo-Valero

Emilio Ernesto Hernández-Huérfano

Leonardo Álvarez-Rivera

Mónica Liliana Rangel-Martínez

Miguel Ángel Lutzow-Steiner

Luis Alfredo Ortiz-Vázquez

Andrea Rebeca Mendoza-Lara

Montserrat Olivo-Rodríguez

Marco Sebastián Galván-Ramírez

Ángel Emanuel Morales-Neri

Víctor Uriel Martínez-Donjuan

Massiel Isabella Cervantes-Irurzo

Andreu Comas-García

Fernando Hernández-Maldonado

Carlos Aguilar-Acosta

Abstract

Background

Methods

Results

Conclusions

1. Introduction

Table 1.

Table 2.

2. Methods

2.1. Study design

2.2. Setting and participants

2.3. Personal COVID-19 risk index assessment

Table 3.

Table 4.

2.4. Georeferenced database

Fig. 1.

2.5. Spatial data analysis

2.5.1. Point process descriptive statistics

2.5.2. Spatial correlation modeling

3. Results

3.1. Socio-demographic users’ features (RQ1)

Table 5.

3.2. Spatial analysis of the risk cases

3.2.1. Geographic distribution of risk indexes (RQ2)

Fig. 2.

Fig. 3.

Fig. 4.

3.2.2. Point pattern clustering features (RQ3)

3.2.2.1. Point process descriptive statistics

Fig. 5.

Fig. 6.

3.2.2.2. Spatial correlation model

Fig. 7.

Table 6.

3.2.3. Distribution of risk-clusters and confirmed COVID-19-clusters (RQ4)

Table 7.

Table 10.

4. Discussion

Table 9.

Fig. 8.

Fig. 9.

Table 8.

Table 11.

Authors’ contributions

Declaration of Competing Interest

Acknowledgments

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases