Skip to main content
PLOS One logoLink to PLOS One
. 2021 Mar 11;16(3):e0248285. doi: 10.1371/journal.pone.0248285

Open data and injuries in urban areas—A spatial analytical framework of Toronto using machine learning and spatial regressions

Eric Vaz 1, Michael D Cusimano 2, Fernando Bação 3, Bruno Damásio 3,*, Elissa Penfound 4
Editor: Yanyong Guo5
PMCID: PMC7951915  PMID: 33705490

Abstract

Injuries have become devastating and often under-recognized public health concerns. In Canada, injuries are the leading cause of potential years of life lost before the age of 65. The geographical patterns of injury, however, are evident both over space and time, suggesting the possibility of spatial optimization of policies at the neighborhood scale to mitigate injury risk, foster prevention, and control within metropolitan regions. In this paper, Canada’s National Ambulatory Care Reporting System is used to assess unintentional and intentional injuries for Toronto between 2004 and 2010, exploring the spatial relations of injury throughout the city, together with Wellbeing Toronto data. Corroborating with these findings, spatial autocorrelations at global and local levels are performed for the reported over 1.7 million injuries. The sub-categorization for Toronto’s neighborhood further distills the most vulnerable communities throughout the city, registering a robust spatial profile throughout. Individual neighborhoods pave the need for distinct policy profiles for injury prevention. This brings one of the main novelties of this contribution. A comparison of the three regression models is carried out. The findings suggest that the performance of spatial regression models is significantly stronger, showing evidence that spatial regressions should be used for injury research. Wellbeing Toronto data performs reasonably well in assessing unintentional injuries, morbidity, and falls. Less so to understand the dynamics of intentional injuries. The results enable a framework to allow tailor-made injury prevention initiatives at the neighborhood level as a vital source for planning and participatory decision making in the medical field in developed cities such as Toronto.

1. Introduction

1.1. The injury landscape

Injury is one of the leading causes of death and disability in the United States of America [1]. In Canada alone, an estimated 4.27 million Canadians aged 12 or older, suffered a debilitating injury between 2009–2010 [2]. The growing number of traumas in urban areas has brought a significant public health concern [3] and fostered a negative perception of health and subjective wellbeing [4]. It is projected that by 2020, injuries will be the third foremost cause of death and disability worldwide [5]. Additionally, the repercussion of injuries from traumatic events has a temporal lag on the psychological and social adjustment of victims, jeopardizing wellbeing in general, and leading to depression [6]. Injuries can be divided into two significant groups generating distinct demographic profiles with leading causes and complex characteristics of epidemiological concern [7]. On one side, unintentional injuries [8] form a leading cause of death in the population between the ages of 1 to 39. Intentional injuries, on the other hand, including assaults and suicides, rank as the second leading cause of death in people ranged 15 to 39. Injuries, therefore, have direct consequences on the active population of Canadians, where three individuals die from injury-related causes every day.

Further to these deaths, fifty Canadians are hospitalized due to injuries [9], which poses a severe strain on the Canadian economy and workplace [10]. Injuries currently represent over seven percent of all hospitalizations [11]. Non-fatal injuries accrue an additional burden to society, as many of these injuries affect the brain or spinal cord, leaving a substantial incidence over permanent disability. Costs on the health-care system in terms of waiting times is evident given the encumbrance over the carrying capacity of hospital systems. Geographical and temporal knowledge of injury events may help in optimizing adequate strategies that convey prevention, control, and efficient monitoring. While until recently, the focus was predominantly on the individual characteristics of the injured person, advances in spatial computation and data science promote new and integrative roles of the spatial aspects of what may lay within the injury landscape at regional level [1214]. The injury landscape resonates with the concept of regional intelligence [15], where cities may have a proactive role through ubiquitous data integration in mitigating injury risk. By injury landscape, we define the geographical topology of spatially-explicit interactions of injury, where different types of injury occur with particular spatial attributes throughout a given geographic territory.

This paper has the following structure. The next section, Section 1, offers a literature review of the paradigm of injury, and the importance of novel approaches for injury prevention. Section 2 brings the Methodology presenting a systematic framework of the different tools and techniques and exploring the necessary steps of data that allow the statistical and geostatistical analytics. Section 3 discusses the results of the implemented approach for the three regions, and Section 4 offers some concluding remarks and summarizes potential future works.

1.2. Literature review

Spatial understanding of the geography of metropolitan areas is of emerging importance in regions that have witnessed rapid urbanization [16], and where the incidence of injuries are positively correlated [17, 18]. Geographic Information Systems (GIS), spatial analysis, and geostatistics allow addressing regional phenomena of health-related concerns in a spatially-explicit context [19]. Several studies analyzed the integration of geographical aspects of public health. For example, Kivell and Mason (1999) used geographic information systems (GIS) to place thirty trauma centers across the United Kingdom [20]. Several authors have also used GIS to predict pedestrian injuries [2123]. Research on traffic-accident information systems has optimized the capacity to assess the risk of different types of traffic collisions [2426]. Specifically, in the City of Toronto, researchers have explored the spatial patterns of motor vehicle collisions leading to pedestrian injury based on the pedestrian injury type, age and location within the city [27]. Other studies have examined the relationship between crime and geographic location [28, 29], child maltreatment and geographic location [30], frequency and type of drug use, which influenced the location of drug and HIV-prevention activities [31], and the likelihood of increased risk of violent injury based on racial segregation [32].

An additional aspect of the spatial patterns of injury that has been explored is the comparison of injury by type in rural versus urban areas. These studies discuss how physical space and subsequent infrastructure (i.e., access and distance to hospitals) links to injury severity and morbidity. Additionally, these studies highlight the importance of understanding the spatial nature of injury by type so that injury prevention strategies may be more accurately targeted [3335].

Like this study, there are other studies that have explored the spatial nature of injuries with ambulance datasets, including a 2010 study that, with ambulance data from the City of Toronto, explored the spatial and temporal patterns of violent injury [36] and a 2012 study which, with ambulance data, conducted and analysis of outdoor falls based on temporal, spatial and demographic distribution in Laval and Montréal, Canada [37].

The relationship between the spatial distribution of injuries and demographic composition of injured individuals has also been explored. For example, a 2016 study explored the cultural, social and geographic components leading to higher injury risk for Aboriginal peoples in British Columbia, Canada [38]. Another 2016 study explored injury burden caused by accidental venomous bites based on national geography and demographics in Australia [39] and a 2017 study explored the socio- and geo-demographics linked to firearm injuries in Miami-Dade County, Florida [40].

Analysis’ of the outcomes of injuries and how they are linked to geographic location and demographics have also been conducted in several studies including a 2019 study which examined the association between injury mortality, geography and sex as it related to youth suicide, senior falls and transport injuries [41]. Furthermore, a study conducted by Keeves and others (2019) used electronic databases of various studies to investigate the outcomes of traumatic injury and their geographic variations, globally. This study found that urban pre-hospital patients have a lower risk of mortality compared to rural patients. This research concludes that there are currently gaps in the literature in regard to determining the link between injury outcome and geography and recommends the use of geographic information systems in future studies related to the spatial distribution of injuries [42].

Despite the many contributions, computational power and data availability have, in recent decades, hindered the opportunity of examining large geographical extents or comparing multiple regions simultaneously. Such studies are particularly important to support regional decision-making for injury prevention proactively and determine key characteristics of injury distributions within urban cores [4345]. Concise multi-temporal datasets for extensive studies on the injury landscape are rarely available. This study approaches this gap by assessing the complete injury landscape of Toronto. A spatial-analytical framework allows the critical characteristics of different injury types leading to an integrative vision of the consequences and the underlying patterns of injuries in Toronto while benefiting from open data initiatives the city has available. The integration of open data such as Wellbeing Toronto (WT) is addressed at the neighborhood level, offering insights on the potential participatory role of public health initiatives for injury prevention.

2. Methodology

2.1. Data

2.1.1. Injury data

The National Ambulatory Care Reporting System (NACRS) is a comprehensive database that contains demographic, diagnostic, and procedural information on all injury-related occurrences where an ambulance has been dispatched. ICD-10 codes were selected for unintentional injuries: (i) resulting from external causes (ICD-10 codes S00 to T14), (ii) external morbidity and mortality (ICD-10 codes V01 to V99), and (iii) fall (ICD-10 codes W00 to W19). For intentional self-harm, the ICD-10 codes from X60 to X84 were used. Data cleaning was carried out further to importing the data from its original format in SAS. The presence of a count with less than five events was discarded and considered as 0. A total of 1714512 injuries (intentional and unintentional) were registered and georeferenced by postal code conversion to latitude and longitude coordinates between 2004 to 2010 (Table 1).

Table 1. Distribution of injury events per main categories.
Causes Total Percentage
Injuries from external causes 1602996 93.50%
External Morbidity and Mortality 22888 1.33%
Intentional Injuries 1877 0.11%
Falls 86751 5.06%

The majority of injuries resulted from external causes of which: (i) injuries to the wrist and hand, (ii) injuries to the head, and (iii) injuries to the knee and lower leg were the most significant cause of ambulance dispatch.

2.1.2. Socio-economic data

Wellbeing Toronto (WT) data was used to assess critical variables at the neighborhood level for Toronto (Fig 1). WT corresponds to an integrative and open approach for visualization of Toronto’s 140 neighborhoods [46]. As an open data concept, it hosts a significant amount of data over three reference periods (2008, 2011, and 2014), that include crucial variables encouraging citizen participation, government accountability, and data transparency.

Fig 1. Toronto neighborhoods ([1]).

Fig 1

For health analytics, these are vital requisites for successful policy implementation. The Table below shows the variables that were selected from the WT portal (Table 2).

Table 2. Selected variables from Wellbeing Toronto (N = 140).
Variable Acronym min max mean sd Year
Green Spaces GreeSp 0 14.271 0.58 1.29 2011
Pollutants Released to Air PollRel 0 1585690 58944.02 184007.30 2011
Traffic Collisions TrafCol 15 778 173.99 123.76 2011
Total Population TotPop 6577 65913 19511.22 10033.59 2014
Low Income Families LowIncFam 260 10050 2184.64 1572.53 2014
Visible Minority Category VisMin 6370 65620 19226.57 9942.22 2014
Seniors 65 and over Sen 730 8990 3048.29 1579.02 2014
Recent Immigrants RecIm 95 7405 1342.75 1183.76 2014
Low Income Population LowIncPop 470 15430 4164.79 3045.62 2014
Social Assistance Recipients SocAssRec 28 5576 1385.42 1196.94 2014
Social Housing Units SocHous 0 3399 641.09 653.99 2014
Seniors Living Alone SenLivAl 40 630 221.43 128.84 2014
Rented Dwellings RentDwell 200 13640 3400.68 2396.06 2014
Drug Arrests DrugArr 0 174 20.76 26.47 2014
Assaults Assaults 9 712 108.42 102.19 2014
Robberies Robberies 0 112 20.94 20.13 2014

2.2. Methods

2.2.1. Preliminary data organization

The data was georeferenced utilizing the existing postal code attribute and projected as point features for every single incident onto WGS84. Due to privacy reasons, the data was handled in a secured server and a count selection by location to the nearest census tract performed. This resulted in a generalized geometry dataset. The generalized point count polygons per category of injury were then further simplified onto the neighborhood level and projected into NAD83 17N. The compiled data from WT were added to the data set for further exploration of geostatistical analysis.

2.2.2. Global spatial autocorrelation

Global spatial autocorrelation was tested employing a Moran’s I index per injury category. This statistic was conducted to test the null hypothesis (Ho) relating to the absence of spatial clustering of injuries in Toronto (α = 0.05) (Eq 1):

I=ji=1i=ni=1i=nwiji=1i=nj=1j=nwij(xix_)(xjx_)i=1i=n(xix_)2 (1)

Where wij corresponds to a binary weight matrix defined with the weight of one, given a contiguity of adjacency for any value that holds wij = 1 and any value without adjacency as wij = 0. The product of the distance is defined as xi for any location i in the distance to relation of its mean. This holds as a statistic for assessing the entire spatial distribution of adjacency formed for the city of Toronto. The null hypothesis was rejected in all categories, suggesting a high spatial autocorrelation for all the injury categories in Toronto.

2.2.3. Local spatial autocorrelation

The Local Gi* statistic was calculated by first determining the injury density. While several approaches allow for spatial density estimation, we considered that the importance of neighborhood demographics should hold. Thus, the neighborhood injury density results from a ratio where density corresponded to the injuries found in a neighborhood by the population count of the neighborhood. While greater spatial detail could have helped the accuracy of the assessment, one should note that the objective is related to the potential of participatory interaction of injury with available open data. In this sense, neighborhoods are the ideal geographic boundary for governance and city planning.

This approach allowed for the seamless definition of injury density at a spatial level and computation of the statistic, determining the locational aggregation of injury hotspots and cold spots [47]. The calculation of the local Gi* statistic is as follows (Eq 2):

Gi*(d)=j=1nwi,jxi,jx_j=1nwi,js[nj=1nwi,j2(j=1nwi,j)2]n1 (2)

Where wij is the spatial weight matrix following a 1 km distance (d), and wij(d) is assumed as 1. The maps show densities of injury patient residences as hot spots and cold spots, with red representing the highest concentrations of injury and blue the lowest. The selection of regional socio-demographic characteristics for this analysis was guided by previous research and availability of Wellbeing Toronto data.

2.2.4. Regression framework

Screening of key demographic variables available at Wellbeing Toronto was carried out by means of a stepwise regression through backward elimination. This allowed for a successful preliminary selection of variables that were applied to three distinct regressions frameworks: (i) spatial lag model, (ii) spatial error model, as well as a non-spatial model to compare performance, and (iii) ordinary least squares model. The spatial lag model (SL) (Eq 3) understands spatial dependency by the addition of a dependent variable that defines the spatial attribute.

Y=ρWy+Xβ+ϵ,ϵN(0,σ2I) (3)

Where I represents an identity matrix, and the N(0,σ2I) indicates that the errors follow a normal distribution with mean equal to zero and constant variance. When ρ is zero, the lag-dependent term is canceled out, leaving the model under the Ordinary Least Squares (OLS) form. Though when ρ is not zero, it means that spatial dependency exists, and that non-random spatial observable interactions are present [48]. As for the spatial error model (Eq 4), the spatial dependency ξ is accounted within the error term ϵ, assuming the errors of the model as spatially correlated [49].

Y=Xβ+λWξ+ϵ,ϵN(0,σ2I) (4)

3. Results

3.1. Exploratory data analysis

The Figure below exemplifies the categorization of injuries based on external causes (Fig 2).

Fig 2. Percentage of all injury types between 2004 and 2010.

Fig 2

*Acronyms: Itth—Injuries to the head; Ittn—Injuries to the neck; Ittt—Injuries to the thorax; Ittalblsap—Injuries to the abdomen, lower back lumbar spine and pelvis; Ittsaua—Injuries to the shoulder and upper arm; Itteaf—Injuries to the elbow and forearm; Ittwah—Injuries to the wrist and hand; Itthat—Injuries to the hip and thigh; Ittkall—Injuries to the knee and lower leg; Ittaaf—Injuries to the ankle and foot; Iimbr—Injuries involving multiple body regions; Itupotlobr—Injuries to unspecified parts of trunk, limb or body region; EMaM—External Morbidity and Mortality; Flls–Falls; IntI—Intentional Injuries.

Concerning unintentional injuries in Toronto between 2004 and 2010, for the category of external causes, a total of 1602996 were obtained. For external morbidity and mortality, a total number of 22888 were registered, and for falls, a total of 86751 lead to ambulance dispatch. This constituted the larger set of the data used as intentional injuries corresponded only to a fraction of 1877 events, short of 0.12 percent of the total data set.

3.2. Spatial autocorrelation

3.2.1. Global spatial autocorrelation

Testing for spatial autocorrelation through Moran’s I statistic for each event brought evidence that there is significant spatial autocorrelation for all injury categories at the global level (Table 3). Despite regional differences in the rates of unintentional and intentional injuries, the spatial patterns of the residences of those injured by unintentional or intentional mechanisms were found to be highly spatially autocorrelated (p < 0.01 for each injury type) indicating that the residence locations of those injured by each of these mechanisms were not randomly distributed in the city of Toronto. This suggests a high spatial clustering that justified further local exploration.

Table 3. Moran’s I indices for all categories per neighborhood count.
ICD-10 Injury type Count Moran’s I
Mi** St. Dev
S00-S09 Injuries to the head 340906 0.174 9.782
S10-S19 Injuries to the neck 41399 0.331 18.575
S20-S29 Injuries to the thorax 52003 0.272 15.305
S30-S39 Injuries to the abdomen, lower back, lumbar spine and pelvis 52273 0.292 16.414
S40-S49 Injuries to the shoulder and upper arm 102450 0.275 15.420
S50-S59 Injuries to the elbow and forearm 148408 0.245 13.751
S60-S69 Injuries to the wrist and hand 378980 0.294 16.519
S70-S79 Injuries to the hip and thigh 39945 0.170 9.573
S80-S89 Injuries to the knee and lower leg 167493 0.265 14.898
S90-S99 Injuries to the ankle and foot 219222 0.307 17.230
T00-T07 Injuries involving multiple body regions 12469 0.366 20.659
T08-T14 Injuries to unspecified parts of trunk, limb or body region 24560 0.446 25.722
V01-V99 External Morbidity and Mortality 22888 0.226 4.779
W00-W19 Falls 86751 0.241 5.070
X60-X84 Intentional Injuries 1877 0.158 3.465

** Significant at the 0.01 confidence level.

Highest Moran’s I values were registered for (a) Injuries to unspecified parts of the trunk, limb or body region, (b) Injuries involving multiple body regions, and (c) Injuries to the neck. While (a) and (b) suggest anatomically more extensive regions, injuries to the neck are quite specific and may become a cause for serious concern given the propensity for physical disability, recovery time, and additional cost to care. The spatial aspects of this injury analysis overall lead to a pressing conclusion that there are clearly geographical determinants that should be assessed to understand the landscape of injury (Table 4).

Table 4. Moran’s I indices for main categories per population distribution.
Category Moran’s I
Injuries from external causes 2.918
External Morbidity and Mortality 2.991
Falls 5.069
Intentional Injuries 2.069

As expected, all indices remained high, with falls showing very strong spatial-autocorrelation, followed by injuries and injuries leading to mortality. Intentional Injuries had the lowest Moran’s I, however, still corresponding to a very strong Moran’s I. Local spatial autocorrelation allows us to assess the neighborhoods at a local scale through the integration of hotspots.

3.2.2. Local spatial autocorrelation

The calculation of Local Gi* allowed for the exploration of the spatial distributions of hotspots and their significance levels for the categories of: (i) unintentional injury (external causes), (ii) unintentional injury (resulting in morbidity and mortality), (iii) unintentional injury (due to falls), (iv) intentional injury (self-harm). A weight matrix was generated of queen contiguity type of order 1, for the 140 neighborhoods, as a minimum number of neighbors 3 and a maximum number of neighbors of 11 (Fig 4). The mean and median neighbors corresponded to 5.96 and 6.00, respectively, and a total percentage of non-zero values of 4.26 percent was found.

Fig 4.

Fig 4

a. Unintentional Injury (external causes) Hotspots. b–Unintentional Injury (External Morbidity and Mortality) Hotspots. c–Unintentional Injury (Falls) Hotspots. d–Intentional Injury Hotspots.

Fig 3 depicts the Queen contiguity map for neighborhoods in Toronto, nevertheless the most intriguing aspect of these distributions, besides the clear evidence of hotspots and cold spots, was the unique spatial profile of injuries (Fig 4A–4D). Red represents "hotspots", or areas with high injury density, and blue represents cold spots or areas of low or no density of injury. All injury types depict a distinctive pattern.

Fig 3. Queen contiguity map for neighborhoods in Toronto.

Fig 3

3.3. Regression results

The table below (Table 5) compares the three distinct models. Three of the four injury categories showed moderate performance, suggesting that the data available at Wellbeing Toronto may support well decision making at neighborhood and community participation for injury analysis and integration. In all cases, the spatial regression outperformed the ordinary least squares, with significant improvements in the r2 statistic throughout. The intentional injury model, however, showed low r2, suggesting that demographic data does not explain sufficiently the reasons for self-harm. Finally, it is important to note that injury categories have different variables for each explanatory model, suggesting that there should be different policies and preparedness integration within the city’s public health decisions. The following variables were selected through the initial backward elimination as consistent for the models:

Table 5. Comparison of three different regression models (spatial regressions and OLS).

Variables Spatial Lag Model Spatial Error Model Ordinary Least Squares
z-score z-score t-statistic
Unintentional Injuries
    Social Housing 1.82714 1.49932 2.03605
    Seniors Living Alone -2.58747 -2.23972 -2.59883
    Total Population -3.04475 -3.49583 -2.94316
    Traffic Collisions -1.84367 -1.7844 -1.81751
    Population Density -2.87096 -2.57958 -3.12535
R2 0.457918 0.485750 0.421782
Morbidity
    Traffic Collisions -2.05927 -2.1162 -1.92051
    Total Population -2.09607 -2.52196 -2.26029
    Visible Minority 1.89049 2.30866 2.04684
    Social Housing 1.94944 1.65653 2.29476
    Seniors Living Alone -2.17949 -1.62644 -2.29047
    Area (Km^2) 1.90814 1.92133 1.94006
R2 0.501747 0.504612 0.480922
Falls
    Traffic Collisions -2.98132 -3.01468 -2.70176
    Total Population -2.19972 -2.42254 -2.39975
    Visible Minority 2.03281 2.26119 2.20993
    Social Assistant Recipients -2.27307 -2.30639 -1.96597
    Social Housing 2.43288 2.52839 2.32207
    Seniors Living Alone -2.32257 -2.24224 -1.96219
    Area (Km^2) 2.79554 2.6891 2.70456
R2 0.528434 0.527985 0.507562
Intentional Injuries
    Total Population -2.87985 -2.64276 -2.91692
    Low-Income Families -1.33643 -1.27004 -1.46417
    Low Income Population 1.68586 1.53451 1.72947
    Rented Dwellings -2.36279 -2.2732 -2.13923
    Assaults 3.11778 2.90505 3.06752
    Robberies -2.90614 -2.90247 -2.69822
    Population Density 3.50405 3.43355 3.3855
R2 0.296519 0.290263 0.269535
  1. Unintentional Injuries: Social Housing, Seniors Living Alone, Total Population, Traffic Collision, Population Density.

  2. Morbidity: Traffic Collision, Total Population, Visible Minority, Social Housing, Seniors Living Alone, Area (km2).

  3. Falls: Traffic Collisions, Total Population, Visible Minority, Social Assistant Recipients, Social Housing, Seniors Living Alone, Area (Km2).

  4. Intentional Injuries: Total Population, Low-Income Families, Low Income Population, Rented Dwellings, Assaults, Robberies, Population Density

3.4. SOM cluster results

Analysis of health geography is highly important as it aids in providing evidence of possibly unknown risk factors that may be quantified and better understood only if they are explored spatially [50]. In addition to the regression models (discussed in section 3.3), self-organizing maps (SOM) were built based on the regressors (variables) included in the regression models. In the evaluation of health geography, SOM is a highly useful tool that is used to identify outliers in a dataset [51]. In this analysis, SOM has been used to identify which variables (the attributes or characteristics) are most correlated to injury by type in the City of Toronto neighbourhoods. SOM clusters were generated for each type of injury in the regression model, including unintentional injuries, morbidity, falls and intentional injuries.

The SOM built for unintentional injuries included four clusters. In cluster 1, total population and traffic collisions were the variables that were most strongly correlated with unintentional injuries. This cluster was spatially located in the northeast and northwest peripheries as well as the south-central neighbourhoods in Toronto. In cluster 2, seniors living at home, social housing, and population density were the variables that were most strongly correlated with unintentional injuries. This cluster was spatially distributed throughout Toronto and was less prevalent in south-central Toronto. In cluster 3, traffic collisions, social housing, seniors living at home, total population, population density (ordered from most to least correlated) were the variables that were most strongly correlated with unintentional injuries. This cluster was also spatially distributed throughout Toronto but was more prevalent in south-central Toronto. In cluster 4, population density was most strongly correlated with unintentional injuries. This cluster was represented in a single neighbourhood, located in Toronto’s city center. The results of the heatmaps for the regressors (for unintentional injuries) have been summarized in Table 6, which shows a breakdown of the cluster that each variable is highly correlated with. These results show that clusters 1 and 2 have the highest number of variables correlated with unintentional injuries, whereas cluster 4 has fewer variables correlated with unintentional injuries and cluster 3 has variables that are only moderately correlated with unintentional injuries.

Table 6. Unintentional injuries heatmaps for the regressors summary.

Unintentional Injuries
Variable Cluster
Social Housing 2
Seniors Living Alone 2
Total Population 1
Traffic Collisions 1
Population Density 4

The SOM built for morbidity also included four clusters. In cluster 1, seniors living at home and social housing were the variables that were most strongly correlated with morbidity. This cluster was spatially distributed throughout Toronto but was less prevalent in the northeast part of the city. In cluster 2, seniors living at home, traffic collisions total population and visible minorities (ordered from most to least correlated) were the variables that were most strongly correlated with morbidity. This cluster was spatially distributed throughout Toronto but was more prevalent in the north, south, central and southwest. In cluster 3, total population and visible minorities were the variables most strongly correlated with morbidity, however, seniors living at home was also strongly correlated with morbidity. This cluster was represented in only two neighbourhoods, one is south-central and the other in east Toronto. In cluster 4, traffic collisions and area were most strongly correlated with morbidity. This cluster was also only represented in two neighbourhoods, one in northeast and the other in northwest Toronto. The results of the heatmaps for the regressors (for morbidity) have been summarized in Table 6, which shows a breakdown of the cluster that each variable is highly correlated with. These results show that clusters 3 and 4 have the highest number of variables correlated with morbidity, whereas clusters 1 and 2 have fewer variables correlated with morbidity.

The SOM built for falls, again, included 4 clusters. In cluster 1 seniors living alone, social assistance recipients, social housing, traffic collisions, total population, and visible minorities (ordered from most to least correlated) were the variables most strongly correlated with falls. This cluster was distributed throughout Toronto and was the dominating cluster, representing the majority of the city. In cluster 2, social housing was most strongly correlated with falls. This cluster was only represented in two neighbourhoods, both located in south-central Toronto. In cluster 3, total population and visible minorities were most strongly correlated with falls. This cluster was also only represented in two neighbourhoods, one located in south-central and the other located in east Toronto. In cluster 4, traffic collisions and area were the variables most strongly correlated with falls. Like clusters 2 and 3, this cluster was also only represented in two neighbourhoods, one in northeast and the other in northwest Toronto. The results of the heatmaps for the regressors (for falls) have been summarized in Table 7, which shows a breakdown of the cluster that each variable is highly correlated with. These results show that clusters 1, 3 and 4 have the highest number of variables correlated with falls, whereas cluster 2 has fewer variables correlated with falls.

Table 7. Falls heatmaps for the regressors summary.

Falls
Variable Cluster
Traffic Collisions 4
Total Population 3
Visible Minority 3
Social Assistance Programs 1
Social Housing 2
Seniors Living Alone 1
Area 4

The SOM built for intentional injuries only included 3 clusters (Table 8). In cluster 1, low-income population, low-income family, total population, rented dwelling, and population density (ordered from most to least correlated) were strongly correlated with intentional injuries. This cluster was distributed throughout Toronto and was the dominating cluster, representing most neighbourhoods in the city. In cluster 2, rented dwelling, robberies, assaults, low-income population, low-income family, total population (ordered from most to least correlated) were the variables strongly correlated with intentional injuries. This cluster was represented in several neighbourhoods, all spatially located in south central Toronto. In cluster 3, rented dwelling and population density were strongly correlated with intentional injuries. This cluster was only represented in two neighbourhoods, both located in central Toronto. The results of the heatmaps for the regressors (for intentional injuries) have been summarized in Table 9, which shows a breakdown of the cluster that each variable is highly correlated with. These results show that cluster 2 has the highest number of variables correlated with unintentional injuries, whereas clusters 1 and 3 have fewer variables correlated with intentional injuries.

Table 8. Intentional injuries heatmaps for the regressors summary.

Intentional Injuries
Variable Cluster
Total Population 1 and 2
Low Income Families 1 and 2
Low Income Population 1
Rented Dwellings 2 and 3
Assaults 2
Robberies 2
Population Density 3

Table 9. Morbidity heatmaps for the regressors summary.

Morbidity
Variable Cluster
Traffic Collisions 4
Total Population 3
Visible Minority 3
Social Housing 1
Seniors Living Alone 2
Area 4

Seniors living alone and traffic collisions were strongly correlated with the majority of clusters for unintentional injuries, morbidity, and falls. Indicating that these variables may be more likely to contribute to these types of injuries compared to the other variables included in this analysis. Rented dwelling and low-income population were strongly correlated with the clusters for intentional injuries, indicating that intentional injuries are more likely to occur in poorer (or low income) Toronto neighbourhoods. Overall, clusters that were represented by a larger number of neighbourhoods tended to have a higher number of variables correlated with each injury type, while smaller clusters tended to have fewer numbers of (or more specific) variables associated with injury type. Population density and rented dwellings were variables that tended to be associated with locations in central Toronto (i.e., the neighbourhoods that have higher population density compared to the city’s peripheries).

4. Conclusions

Recent advances in geocomputational methods, as well as spatial analysis, have brought new techniques that better enable the understanding of spatial characteristics of cities and regions [52]. It is of utmost importance to understand regional patterns of epidemiologic concern, to better optimize public health efficiency in rapidly changing regions [53]. In this sense, geocomputational methods, when combined with large spatially-explicit data, allow for significant contributions of regional understanding of injury dynamics. Supported by data availability, open data at the city level may have a profound impact on the assessment and resulting community and policy intervention strategies for neighborhoods. The application of geocomputational techniques to the National Ambulatory Care Reporting System has allowed perceiving that the pattern of the residence locations of injured persons is not spatially random, but clearly very spatially dependent.

There is some disagreement in the literature regarding the effects of immigration status on health and violence. A number of authors have shown that population health determinants such as income and social status, education, employment or working conditions, social and physical environments, personal health practices, healthy child development, biologic and genetic endowment, health services, sex, and culture have a relationship with injury patterns [5456]. Others have argued that the distinction between intentional and unintentional injury is arbitrary [57, 58] and that the risk factors associated with intentional and unintentional injury overlap [5961]. Based on these lines of previous work, we would have expected that the spatial distributions of the residences of those injured by these disparate mechanisms may have overlapped. However, ours is the first study to demonstrate that the spatial distributions of residence locations were similar regardless of whether the mechanism of injury was intentional or unintentional. This finding was consistently seen in the choice of selected variables, despite marked differences in size, economy, and cultural composition. The slightly larger areas of hotspots of home locations of those injured unintentionally may either reflect a difference of the aforementioned factors or simply be related to the larger number of persons injured unintentionally. Finally, the most resounding conclusion is that injury can greatly benefit from tailor-made injury prevention initiatives that address the specificities of neighborhoods and types of injury to guarantee a successful mechanism of injury prevention at the local level.

Data Availability

Data cannot be shared as they are owned by a third party and contain potentially identifiable personal health information. Data can be requested through the Canadian Institute for Health Information, under the Access Data and Reports: https://www.cihi.ca/en/access-data-and-reports.

Funding Statement

This research is supported by the Canadian Institutes of Health Research Strategic Team Grant in Applied Injury Research # TIR-103946.

References

  • 1.Rhee P., Bellal J., Pandit V. Aziz H., Vercruysse G., Kulvatunyou N., et al. (2014) Increasing Trauma Deaths in the United States. Annals of Surgery, 260(1): 13–21. 10.1097/SLA.0000000000000600 [DOI] [PubMed] [Google Scholar]
  • 2.Bilette J-M. & Janz T. (2011) Injuries in Canada: Insights from the Canadian Community Health Survey. Statistics Canada Catalogue, 82–624 (Statistics Canada: Ottawa). [Google Scholar]
  • 3.Kristiansen T., Lossius H.M., Rehn M., Kristensen P., Gravseth H.M., Røislien J., et al. (2014) Epidemiology of trauma: A population-based study of geographical risk factors for injury deaths in the working-age population of Norway. Injury, 45(1): 23–30. 10.1016/j.injury.2013.07.007 [DOI] [PubMed] [Google Scholar]
  • 4.Vaz E., Cusimano M., & Hernandez T. (2015) Land use perception of self-reported health: Exploratory analysis of anthropogenic land use phenotypes. Land Use Policy, 46: 232–240. [Google Scholar]
  • 5.NIH. (2007) Fogarty International Collaborative Trauma and Injury Research Training Program. Available from: http://grants.nih.gov/grants/guide/pa-files/PAR-04-083.html. [Google Scholar]
  • 6.Tracy M., Morgenstern H., Zivin K., Aiello A.E. & Galea S. (2014) Traumatic event exposure and depression severity over time: results from a prospective cohort study in an urban area. Social Psychiatry and Psychiatric Epidemiology, Article in Press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Rockett I. R. H., Regier M. D., Kapusta N. D., Coben J. H., Miller T. R. & Hanzlick R. L. (2012) Leading Causes of Unintentional and Intentional Injury Mortality: United States, 2000–2009. American Journal of Public Health, 102(11): 84–92. 10.2105/AJPH.2012.300960 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Smith G.S. & Barss P.G. (1991). Unintentional injuries in developing countries: the epidemiology of a neglected problem. Epidemiologic Review, 13: 228–266. 10.1093/oxfordjournals.epirev.a036070 [DOI] [PubMed] [Google Scholar]
  • 9.HC. Injury Surveillance in Canada: Current Realities and Challenges. (2003) Available from: http://www.injurypreventionstrategy.ca/downloads/InjurySurveillanceinCanada.pdf. [Google Scholar]
  • 10.Gilks J. & Logan R. (2010) Occupational injuries and diseases in Canada, 1996–2008: injury rates and cost to the economy, (Human Resources and Skills Development Canada, Ottawa, Canada). [Google Scholar]
  • 11.CIHI. National Trauma Registry 2003 Report. (2003) Available from: http://dsp-psd.pwgsc.gc.ca/Collection/H115-4-2002E.pdf. [Google Scholar]
  • 12.Falk H. & Briss P. (2011) Environmental-and injury-related epidemic-assistance investigations, 1946–2005. American Journal of Epidemiology, 174(11): 65–79. [DOI] [PubMed] [Google Scholar]
  • 13.Morassaei S., Breslin F. C., Ibrahim S. A., Smith P. M., Mustard C. A., Amick B. C. III, et al. (2013) Geographic variation in work injuries: a multilevel analysis of individual-level data and area-level factors within Canada. Annals of Epidemiology, 23(5): 260–266. 10.1016/j.annepidem.2013.03.008 [DOI] [PubMed] [Google Scholar]
  • 14.Bamzar R. & Ceccato V. (2014) The nature and the geography of elderly injuries in Sweden. GeoJournal, Article in Press. [Google Scholar]
  • 15.Vaz E., Zhao Y. & Cusimano M. (2016) Urban habitats and the injury landscape. Habitat international, 56: 52–62. [Google Scholar]
  • 16.Vaz E. & Bowman L. (2013) An Application for Regional Coastal Erosion Processes in Urban Areas: A Case Study of the Golden Horseshoe in Canada. Land 2013, 2: 595–608. [Google Scholar]
  • 17.Ardalan A., Sepehrvand N., Pourmalek F., Masoumi G., Sarvar M., Mahmoudabadi A., et al. (2014) Deadly rural road traffic injury: A rising public health concern in I.R. Iran, International Journal of Preventive Medicine, 5(2): 241–244. [PMC free article] [PubMed] [Google Scholar]
  • 18.Rothman L., Buliung R., Macarthur C., To T., Howard A. (2014) Walking and child pedestrian injury: A systematic review of built environment correlates of safe walking, Injury Prevention, 20(1):41–49. 10.1136/injuryprev-2012-040701 [DOI] [PubMed] [Google Scholar]
  • 19.Gattrell A. & Loytonen M. (1998) GIS and Health, GISDATA 6, ed. Ian Masser and Francois Salge, (Taylor and Francis: London, UK: ). [Google Scholar]
  • 20.Kivell P. & Mason K. (1999) Trauma systems and major injury centres for the 21st century: an option. Health Place, 5(1): 99–110. 10.1016/s1353-8292(98)00042-2 [DOI] [PubMed] [Google Scholar]
  • 21.Braddock M., Lapidus G., Cromley E., Cromley R., Burke G. & Banco L. (1994) Using a geographic information system to understand child pedestrian injury. Am J Public Health, 84(7): 1158–1161. 10.2105/ajph.84.7.1158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.LaScala E.A., Johnson F.W., & P.J. (2001) Gruenewald, Neighborhood characteristics of alcohol-related pedestrian injury collisions: a geostatistical analysis. Prev Sci, 2(2): 123–34. 10.1023/a:1011547831475 [DOI] [PubMed] [Google Scholar]
  • 23.Lightstone A.S., Dhillon P., Peek-Asa C. & Kraus J. (2001) A geographic analysis of motor vehicle collisions with child pedestrians in Long Beach, California: comparing intersection and midblock incident locations. Inj Prev, 7(2): 155–60. 10.1136/ip.7.2.155 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Faghri A. & Ramn N. (1995) A GIS-Based Traffic Accident Information System. J Advance Transportation, 29: 321–334. [Google Scholar]
  • 25.Ng K.S., Hung W.T & Wong W.G. (2002) An algorithm for assessing the risk of traffic accident. J Safety Res, 33(3): 387–410. 10.1016/s0022-4375(02)00033-6 [DOI] [PubMed] [Google Scholar]
  • 26.Vaz E., Tehranchi S. & Cusimano M. (2017) Spatial Assessment of Road Traffic Injuries in the Greater Toronto Area (GTA): Spatial Analysis Framework. Journal of Spatial and Organizational Dynamics, 5(1), 37–55. [Google Scholar]
  • 27.Grisé E., Buliung R., Rothman L. & Howard A. (2018) A geography of child and elderly pedestrian injury in the City of Toronto. Canada, Journal of Transport Geography, 66: 321–329. [Google Scholar]
  • 28.Midford R., Masters L., Phillips M., Daly A., Stockwell T., Gahegan M., et al. (1998) Alcohol consumption and injury in Western Australia: a spatial correlation analysis using geographic information systems. Aust N Z J Public Health, 22(1): 80–5. 10.1111/j.1467-842x.1998.tb01149.x [DOI] [PubMed] [Google Scholar]
  • 29.Zavoski R.W., Lapidus G.D., Lerer T.J., Burke G. & Banco L.I. (1999) Evaluating the impact of a street barrier on urban crime. Injury Prevention, 5(1): 65–8. 10.1136/ip.5.1.65 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ernst J.S. (2000) Mapping child maltreatment: looking at neighborhoods in a suburban county. Child Welfare, 79(5): 555–72. [PubMed] [Google Scholar]
  • 31.Latkin C., Glass G.E., & Duncan T. (1998) Using geographic information systems to assess spatial patterns of drug use, selection bias and attrition among a sample of injection drug users. Drug Alcohol Depend, 50(2): 167–75. 10.1016/s0376-8716(98)00025-8 [DOI] [PubMed] [Google Scholar]
  • 32.Fabio A., Sauber-Schatz E.K., Barbour K.E. & Le W. (2009) The Association Between County-Level Injury Rates and Racial Segregation Revisited: A Multilevel Analysis, American Journal of Public Health, 99(4): 748–753. 10.2105/AJPH.2008.139576 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Sherriff B., MacKenzie S., Swart L-A., Amine Seedat M., Bangdiwala S.I. & Ngude R. (2013) A comparison of urban-rural injury mortality rates across two South African Provinces. International Journal of Injury Control and Safety Promotion, 22(1): 75–85. [DOI] [PubMed] [Google Scholar]
  • 34.Raatiniemi L., Steinvik T., Liisanantti J., Ohtonen P., Martikainen M., Alahuhta S., et al. (2016) Fatal injuries in rural and urban areas in northern Finland: a 5-year retrospective study, Acta anaesthesiologica Scandinavica, 60(5): 668–676. 10.1111/aas.12682 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Mitchell R.J. & Lower T. (2018) Rural-urban variation in injury related hospitalization, health outcomes and treatment cost in New South Wales. The Australian Journal of Rural Health, 26: 165–172. 10.1111/ajr.12408 [DOI] [PubMed] [Google Scholar]
  • 36.Cusimano M., Marshall S., Rinner C., Jiang S. & Chipman M. (2010) Patterns of Urban Violent Injury: A Spatio-Temporal Analysis. PLoS One, 5(1). 10.1371/journal.pone.0008669 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Morency P., Voyer C., Burrows S. & Goudreau S. (2012) Outdoor falls in an Urban Context: Winter weather impacts and geographical variations. Canadian Journal of Public Health, 103(3), 218–222. 10.1007/BF03403816 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.George M.A., Brussoni M., Jin A., Lalonde C.E. & McCormick R. (2016) Ecological analysis of the associations between injury risk and socioeconomic status, geography and Aboriginal ethnicity in British Colombia, Canada, Springer Plus, 5(1), 1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Welton R.E., Williams D.J. & Liew D. (2016) Injury trends from envenoming in Australia, 2000–2013. Internal Medicine Journal, 47(2), 170–176. [DOI] [PubMed] [Google Scholar]
  • 40.Zebib L., Stoler J. & Zakrison T.L., Geo-demographics of gunshot wound injuries in Miama-Dade County, 2002–2012. BMC Public Health, 17: 174–184. 10.1186/s12889-017-4086-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Zandy M., Zhang L.R., Kao D., Rajabali F., Turcotte K., Zheng A., et al. Area-based socioeconomic disparities in mortality due to unintentional injury and youth suicide in British Columbia, 2009–2013. Health Promotion and Chronic Disease Prevention in Canada, 39(2), 35–44. 10.24095/hpcdp.39.2.01 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Keeves J. Ekegren C.L., Beck B. & Gabbe B.J. (2019) The relationship between geographic location and outcomes following injury: A scoping review. Injury, 50, 1826–1838. 10.1016/j.injury.2019.07.013 [DOI] [PubMed] [Google Scholar]
  • 43.LaScala E.A., Johnson F.W., & P.J. (2001) Gruenewald, Neighborhood characteristics of alcohol-related pedestrian injury collisions: a geostatistical analysis. Prev Sci, 2(2): 123–34. 10.1023/a:1011547831475 [DOI] [PubMed] [Google Scholar]
  • 44.Link B.G. & Phelan J.C. (1996) Understanding sociodemographic differences in health—the role of fundamental social causes. Am J Public Health, 86(4): 471–3. 10.2105/ajph.86.4.471 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Pickett W., Garner M., Boyce W. & King M.A. (2002) Gradients in risk for youth injury associated with multiple-risk behaviours: a study of 11,329 Canadian adolescents. Soc Sci Med, 55(6): 1055–68. 10.1016/s0277-9536(01)00224-6 [DOI] [PubMed] [Google Scholar]
  • 46.Barber K., & MacLellan D. (2019) Examining Open Data at the Urban Level: An Exploration of “Wellbeing Toronto”. Journal of Urban Technology, 26(1), 107–121. [Google Scholar]
  • 47.Getis A. & Ord J.K. (1992) The Analysis of Spatial Association by Use of Distance Statistics. Geographical Analysis, 24(3), 189–206. [Google Scholar]
  • 48.Ward M. & Gleditsch K. S. (2008) Spatial Regression Models. Series: Quantitative Applications in the Social Sciences. SAGE Publications, Thousand Oaks, California. [Google Scholar]
  • 49.Anselin L. (2002) Under the hood issues in the specification and interpretation of spatial regression models. Agricultural economics, 27(3), 247–267. [Google Scholar]
  • 50.Bamzar R. & Ceccato V. (2015) The nature and the geography of elderly injuries Sweden. GeoJournal, 80(2), 279–299. [Google Scholar]
  • 51.Vaz E., Zhao Y. & Cusimano M. (2016) Urban habitats and the injury landscape. Habitat international, 56, 52–62. [Google Scholar]
  • 52.Fischer M. M. & Getis A. (2009) Handbook of Applied Spatial Analysis: Software Tools, Methods and Applications. Springer-Verlag; Berlin Heidelberg, Berlin, Germany. [Google Scholar]
  • 53.Vaz E., & Arsanjani J. J. (2015) Predicting Urban Growth of the Greater Toronto Area-Coupling a Markov Cellular Automata with Document Meta-Analysis. Journal of Environmental Informatics, 25(2), 71–80. [Google Scholar]
  • 54.Faelker T., Pickett W. & Brison R.J. (2000) Socioeconomic differences in childhood injury: a population based epidemiologic study in Ontario, Canada. Injury Prevention, 6(3): 203–8. 10.1136/ip.6.3.203 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Runyan C.W., Casteel C., Perkis D., Black C., Marshall S.W., Johnson R.M., et al. (2005) Unintentional injuries in the home in the United States Part I: mortality. Am J Prev Med, 28(1): 73–9. 10.1016/j.amepre.2004.09.010 [DOI] [PubMed] [Google Scholar]
  • 56.Cubbin C., LeClere F.B. & Smith G.S. (2000) Socioeconomic status and injury mortality: individual and neighbourhood determinants. J Epidemiol Community Health, 54(7): 517–24. 10.1136/jech.54.7.517 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Garbarini C. (1988) The wrong approach. Nursing, (18): 5. [Google Scholar]
  • 58.Gable S. & Peterson L. (1998) School-age children’s attributions about their own naturally occurring minor injuries: a process analysis. J Pediatric Psychol, 23(5): 323–32. 10.1093/jpepsy/23.5.323 [DOI] [PubMed] [Google Scholar]
  • 59.Deal L.W., Gomby D.S., Zippiroli L. & Behrman R.E. (2000) Unintentional injuries in childhood: analysis and recommendations. Future Child, 10(1): 4–22. [PubMed] [Google Scholar]
  • 60.Ramsay L.J., Moreton G., Gorman D.R., Blake E., Goh D., Elton R.A., et al. (2003) Unintentional home injury in preschool-aged children: looking for the key—an exploration of the inter-relationship and relative importance of potential risk factors. Public Health, 117(6): 404–11. 10.1016/S0033-3506(03)00072-6 [DOI] [PubMed] [Google Scholar]
  • 61.Cohen D.A., Mason K., Bedimo A., Scribner R., Basolo V. & Farley T.A. (2003) Neighborhood physical conditions and health. Am J Public Health, 93(3): 467–71. 10.2105/ajph.93.3.467 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Yanyong Guo

4 Jan 2021

PONE-D-20-32866

Open Data and Injuries in the urban areas – A spatial analytical framework of Toronto using machine learning and spatial regressions

PLOS ONE

Dear Dr. Damásio,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by February 2nd, 2021. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Yanyong Guo, Ph.D

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2.We suggest you thoroughly copyedit your manuscript for language usage, spelling, and grammar. If you do not know anyone who can help you do this, you may wish to consider employing a professional scientific editing service.  

Whilst you may use any professional scientific editing service of your choice, PLOS has partnered with both American Journal Experts (AJE) and Editage to provide discounted services to PLOS authors. Both organizations have experience helping authors meet PLOS guidelines and can provide language editing, translation, manuscript formatting, and figure formatting to ensure your manuscript meets our submission guidelines. To take advantage of our partnership with AJE, visit the AJE website (http://learn.aje.com/plos/) for a 15% discount off AJE services. To take advantage of our partnership with Editage, visit the Editage website (www.editage.com) and enter referral code PLOSEDIT for a 15% discount off Editage services.  If the PLOS editorial team finds any language issues in text that either AJE or Editage has edited, the service provider will re-edit the text for free.

Upon resubmission, please provide the following:

  • The name of the colleague or the details of the professional service that edited your manuscript

  • A copy of your manuscript showing your changes by either highlighting them or using track changes (uploaded as a *supporting information* file)

  • A clean copy of the edited manuscript (uploaded as the new *manuscript* file)

3.In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

4.Thank you for stating the following in the Acknowledgments Section of your manuscript:

"This research is supported by the Canadian Institutes of Health Research Strategic Team Grant in

Applied Injury Research # TIR-103946."

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

 "NO - The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

5. Please ensure that you refer to Figure 3 in your text as, if accepted, production will need this reference to link the reader to the figure.

6.We note that the figures in your submission contain map images which may be copyrighted. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (1) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (2) remove the figures from your submission:

1.    You may seek permission from the original copyright holder of the figures to publish the content specifically under the CC BY 4.0 license. 

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission.

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

2.    If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

The following resources for replacing copyrighted map figures may be helpful:

USGS National Map Viewer (public domain): http://viewer.nationalmap.gov/viewer/

The Gateway to Astronaut Photography of Earth (public domain): http://eol.jsc.nasa.gov/sseop/clickmap/

Maps at the CIA (public domain): https://www.cia.gov/library/publications/the-world-factbook/index.html and https://www.cia.gov/library/publications/cia-maps-publications/index.html

NASA Earth Observatory (public domain): http://earthobservatory.nasa.gov/

Landsat: http://landsat.visibleearth.nasa.gov/

USGS EROS (Earth Resources Observatory and Science (EROS) Center) (public domain): http://eros.usgs.gov/#

Natural Earth (public domain): http://www.naturalearthdata.com/

7. We note that you currently have two different versions of tables 1 to 4 included in your manuscript, so that the tables can be differentiated can you please update the table titles (numbering) and in-text citations so that the second set of tables are numbered individually (and not the same as the previous numbers for Tables 1 to 4 already used).

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The topic of this paper is interesting and important. The methods sound. The results are meaningful and useful. There is one suggestion to improve this paper.

1. The quality of the figures are not high. And the figure name and figure number need to be with the figures.

Reviewer #2: It is not surpervise that the ML approach could achive a better results. The motivation should be strengthened. The contributions of the study should be clear. Some related references should be discussed. in addition, there are some typos.

Reviewer #3: The manuscript aims at exploring the spatial relations of injury throughout the city, together with Wellbeing Toronto data. Given the high attention of urban injuries, this work is of great significance. However, there are still some major problems in the current manuscript, and should be addressed.

(1) why use the 2004-2010 injuries data, why not use the recent, do you make sure that results of this study can be applied for the nowadays?

(2) Covariates used in this study are not consistent in the date, which may affect the reliability of findings.

(3) literature reviews are too simple, suggest to added more relevant researches.

(4) figures in this current manuscript are unclear.

(5) table 2 are too simple, some statistics should be done, such as mean, s.d. max and min.

(6) The label of the table is too messy, please check.

(7) please add the line number, it is very inconvenient for reviewers.

(8) Grammatical errors can be seen in several sentences. The whole manuscript needs to be thoroughly proofread.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Mar 11;16(3):e0248285. doi: 10.1371/journal.pone.0248285.r002

Author response to Decision Letter 0


11 Feb 2021

Reviewer 1

Reviewer #1: The topic of this paper is interesting and important. The methods sound. The results are meaningful and useful. There is one suggestion to improve this paper.

1. The quality of the figures are not high. And the figure name and figure number need to be with the figures.

We’ve changed Figure 1, and certified that all Figures are at 300 dpi for easy visualization upon the final version. Finally, we’ve added an acknowledgment section thanking the reviewer for the inspiring comments concerning our manuscript. Thank you very much for your review!

Reviewer 2

Reviewer #2: It is not surpervise that the ML approach could achive a better results. The motivation should be strengthened. The contributions of the study should be clear. Some related references should be discussed. in addition, there are some typos.

Response: Thank you. We’ve now strengthened the motivation, and proofread the manuscript. We would like to thank the reviewer for the inspiring comment, an acknowledgment section has been added accordingly.

Reviewer 3

Reviewer #3: The manuscript aims at exploring the spatial relations of injury throughout the city, together with Wellbeing Toronto data. Given the high attention of urban injuries, this work is of great significance. However, there are still some major problems in the current manuscript, and should be addressed.

Query 1: why use the 2004-2010 injuries data, why not use the recent, do you make sure that results of this study can be applied for the nowadays.

Response 1: NACRS data took circa ten years to get governmental and institutional approval. This is a long process itself. We are convinced that the fact that we have assessed a total of six years timeframe, renders this model current, given that no significant changes have occurred within the dynamics of the city that would radically change the predictive capacity of the response variables.

Query 2: Covariates used in this study are not consistent in the date, which may affect the reliability of findings.

Response 2: While we do understand the reviewers concern, we would like to remind the reviewer that we are using cumulative data between 2004 and 2010, approximate values within the same timeline for covariates are common practice in social sciences as both census, and demographic data are not necessarily yearly derived. Furthermore, they are used for an explanatory model of the phenomena, and not for a direct comparison of an output.

Query 3: literature reviews are too simple, suggest to added more relevant researches.

Response 3: Thanks. We’ve added additional relevant research. Please refer to discussion from line numbers 145 to 175. Thanks.

Query 4: figures in this current manuscript are unclear.

Response 4: We’ve changed Figure 1. Additionally, we’ve reformatted Figure 2 to show the relative percentage of injuries, reflecting better the distribution of injury. The other figures are now at 300 dpi as to guarantee higher resolution both for online and print visualization.

Query 5: table 2 are too simple, some statistics should be done, such as mean, s.d. max and min.

Response 5: Thanks. We’ve now added these. Please refer to the new Table 2.

Query 6: The label of the table is too messy, please check.

Response 6: We’ve now revised the labels of Table 2.

Query 7: please add the line number, it is very inconvenient for reviewers.

Response 7: Line numbers have been added.

Query 8: Grammatical errors can be seen in several sentences. The whole manuscript needs to be thoroughly proofread.

Response 8: Thanks. A thorough proofreading of the manuscript was now conducted.

Acknowledgment: We would like to thank the reviewer for the useful comments and critique that greatly improved the manuscript. We’ve added an acknowledgment section accordingly. Thank you very much!

Decision Letter 1

Yanyong Guo

24 Feb 2021

Open Data and Injuries in the urban areas – A spatial analytical framework of Toronto using machine learning and spatial regressions

PONE-D-20-32866R1

Dear Dr. Damásio,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Yanyong Guo, Ph.D

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: (No Response)

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: (No Response)

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: (No Response)

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: (No Response)

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #3: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #3: No

Acceptance letter

Yanyong Guo

26 Feb 2021

PONE-D-20-32866R1

Open Data and Injuries in urban areas – A spatial analytical framework of Toronto using machine learning and spatial regressions

Dear Dr. Damásio:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Yanyong Guo

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Availability Statement

    Data cannot be shared as they are owned by a third party and contain potentially identifiable personal health information. Data can be requested through the Canadian Institute for Health Information, under the Access Data and Reports: https://www.cihi.ca/en/access-data-and-reports.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES