Abstract
In response to the demand for advanced tools in environmental monitoring and policy formulation, this work leverages modern software and big data technologies to enhance novel road transport emissions research. This is achieved by making data and analysis tools more widely available and customisable so users can tailor outputs to their requirements. Through the novel combination of vehicle emissions remote sensing and cloud computing methodologies, these developments aim to reduce the barriers to understanding real-driving emissions (RDE) across urban environments. The platform demonstrates the practical application of modern cloud-computing resources in overcoming the complex demands of air quality management and policy monitoring. This paper shows the potential of modern technological solutions to improve the accessibility of environmental data for policy-making and the broader pursuit of sustainable urban development. The web-application is publicly and freely available at https://cares-public-app.azurewebsites.net.
Keywords: Vehicle emissions, Cloud computing, Remote sensing, Point sampling, Real driving emissions
Introduction
Outdoor and indoor air pollution are the two leading causes of economic disadvantage from environmental factors respectively (Lomborg, 2020; Stanaway et al., 2018). Transport, especially in urban environments, is a major contributor to outdoor air pollution (Colvile et al., 2001; O’Driscoll et al., 2018; Pastorello & Melios, 2016). The level of exposure to air pollution and its subsequent negative effects will only be exacerbated by the current global trend of urbanisation (Gu et al., 2021). As more people move to the urban environment there is an increased demand for transport services creating a vicious circle of poor public health. The increase in demand causes a non-linear increase in pollution because the capacity of road networks are largely fixed and the increased demand leads to more severe and frequent congestion, which produces more air pollution per vehicle than non-congested traffic (WHO, 2006). This is likely to lead to an increased burden on the people responsible for delivering the policies aimed at reducing the population’s exposure to air pollution. With the complexities of cost and expertise as well as constrained capacity for delivery, the challenge of effectively reducing air pollution in urban environments remains, despite advances in vehicle emissions controls and the ongoing electrification of the fleet. This study aims to demonstrate the potential of this technology to reduce the complexity and cost burden on air pollution policy makers by allowing them access to up-to-date data and data processing tools.
Air pollution is a significant contributing factor to many health problems (Breslow & Goldsmith, 1958) relating to both cardiovasular and respiratory system problems (Brunekreef & Holgate, 2002). Exposure to air pollution can result in a reduction in lung function amongst children (Gauderman et al., 2004), a reduction in cognitive ability and depression, affecting both children and adults (Fonken et al., 2011; Gatto et al., 2014; Power et al., 2011). The two major air pollutants associated with transport in urban environments are oxides of nitrogen (), and more pressingly nitrogen dioxide (), and fine particulate matter (). is generally regarded as a major contributor to poor health outcomes across a range of different domains (COMEAP, 2015; EEA, 2008; IARC, 2013; Kampa & Castanas, 2008; WHO, 2013; Zhang et al., 2011). Exposure to PM2.5 is also increased in transport environments (Adams et al., 2001; Kinney et al., 2011) and the negative health impacts are significant (Yang et al., 2019).
Air pollution is a negative external cost of transportation (Ward et al., 2021), making policy interventions the most effective way to reduce its impact. Policies that mitigate air pollution or reduce public exposure to it are highly desirable from both health and economic perspectives. The historical failure of type approval legislation to reduce ambient air pollution levels (Boogaard et al., 2012; Carslaw et al., 2011b; Carslaw, 2005; Carslaw et al., 2011a; Holman et al., 2015) indicates the need for new methods, which are increasingly critical for the economic and social well-being of city dwellers. Several factors limit the usefulness of laboratory measurements for developing effective policies, including the disparity between type approval limits and real driving emissions (RDE) values, and the growing number of vehicles with deteriorated or manipulated emissions reduction systems (Pöhler et al., 2023), making in-situ RDE measurements essential. Additionally, the impact of deterioration on emissions over a vehicle’s lifespan is relatively poorly understood. Combined with the differences between urban duty-cycles and test procedures, this may significantly affect overall emissions.
The significant health impacts associated with poor air quality and increasing political awareness of these issues have led to a rise in studies, legislation, and schemes aimed at improving air quality. However, communicating the results of these studies and assessing the effectiveness of new schemes and legislation remains challenging. Efforts have been made to make publicly funded air pollution data more accessible to the general public. Data from the Automatic Urban and Rural Network (AURN) in the UK are available online (https://uk-air.defra.gov.uk); however, accessing the data directly can be challenging and often requires specialised software and expertise. The openAir (Carslaw & Ropkins, 2012), worldMet (Carslaw, 2023), and AQEval (Ropkins et al., n.a.) R packages have lowered the barrier to entry for analysis, but they still require specialised technical expertise, likely beyond the knowledge of most citizens.
The National Center for Environmental Information, in partnership with the National Oceanic and Atmospheric Administration (NCEI, NOAA), offers an online platform where users can view various environmental visualisations within a Geographical Information Systems (GIS) environment such as https://www.ncei.noaa.gov/maps-and-geospatial-products. The Real Urban Emissions Initiative (TRUE Initiative, https://www.trueinitiative.org), a partnership between the FIA Foundation and the International Council on Clean Transportation (ICCT), has integrated vehicle lookup forms into its website, allowing users to check the air quality impact of individual vehicles. OpenAQ (https://openaq.org) provides an open data source for a global network of air pollution monitors that is publicly accessible through an API. The RTVEMVS project (Ding et al., 2021) uses real-time traffic flow data and vehicle metadata to model the vehicle emissions of vehicles on the Chongqing network, but does not measure them directly. The new approach presented in this paper is the first to make real driving emission data publicly available as well as using an innovative document style data structure to facilitate instruments and campaigns with different data outputs being displayed consistently and coherently. This improves the comparability of these data sets and ease of interpretation for users with less capacity or knowledge to use modern data science tools. This, in turn, allows policy decisions to be made using a joined-up data-driven approach rather than a more speculative or intuitive approach, and will lead to more successful policy development in the future.
Optical remote sensing device (RSD) measurement campaigns have historically shed light on the real driving emissions (RDE’s) of in-situ vehicles, giving operators a clearer picture of the contribution of different vehicle types to the overall emissions inventory. The method has been particularly successful for gaseous species such as carbon monoxide (CO), hydrocarbons (HC) and oxides of nitrogen (NO, and ) (Popp et al., 1999; Stedman & Bishop, 1990; Stedman et al., 1994; Zhang et al., 1995, 1996). Optical RSD measurements have been taken across the United Kingdom, continental Europe and worldwide (Carslaw, 2005; Franco et al., 2013; Rushton et al., 2018, 2021; Tate, 2013b). There are currently two leading commercial providers of RSD measurements, OPUS and HEAT. More detailed descriptions of the OPUS and HEAT remote sensing methodologies can be found elsewhere in the literature (Dallmann et al., 2019; Ghaffarpasand et al., 2020; Ropkins et al., 2017). A method for point sampling (PS), distinct from the optical remote sensing methodology has also been developed. The point sampling methodology may reduce the cost of RDE measurements by using less expensive instruments that could be deployed in greater number across cities (Knoll et al., 2023). This may be done with a potential five-fold improvement for particulate matter emission data quality and without significant loss of gaseous species data quality, as proven for NO and . (Giechaskiel et al., 2014; Horbanski et al., 2019; Knoll et al., 2024; Schriefl et al., 2020). The PS methodology and the RS methodology use light-gate technology to capture the individual vehicle pass-throughs and an automatic number plate recognition (ANPR) system was used to attribute tailpipe emissions to a specific vehicle. The number plates are used as a reference for vehicle meta-data and then pseudonymised to comply with data protection regulations. Speed and acceleration characteristics are also captured using the light-gate to give some indication of vehicle dynamics and power requirements at the measurement site (Carslaw et al., 2013).
This paper introduces the data infrastructure that was delivered as part of the CARES project (Davison et al., 2022; Farren et al., 2023; Knoll et al., 2023; Pechout et al., 2022; Pöhler et al., 2023; Qiu & Borken-Kleefeld, 2022; Yang et al., 2022) and a parallel Chinese research project running in parallel to CARES (Hao et al., 2022a, b) showing how it can be applied in the real world. A full list of CARES deliverables may be found at https://cordis.europa.eu/project/id/814966/results. This paper discusses and demonstrates the potential for modern data handling and data science techniques to be used in conjunction with the latest developments in vehicle RDE measurements to lower the barrier to entry for people wishing to engage with existing campaigns or conduct their own campaigns. The CARES project has provided the opportunity to research and develop modern data-science solutions to collate, quality check, process and analyse vehicle remote emission sensing data. The project has also merged more conventional vehicle remote emission sensing measurements made by commercial suppliers with the emerging point sampling solution. The novel use of PS devices and a modern cloud-based data management and processing infrastructure was intended to reduce the barrier of entry for policy makers and reduce their reliance on external contractors to analyse their data.
Data system development
Data Collection
The data collection for this project consisted of four separate campaigns. An instrument characterisation experiment was conducted at Lelystadt (NL), and three city experiments were respectively conducted in Milan (IT), Prague (CZ) and Krakow (PL). Each experiment consisted of a different set of sensors. The data collection and key results are detailed throughout the cited publications of the CARES project and at https://cares-project.eu. A visualisation of the database schematic is shown in Fig. 1 and a summary of the measurements captured across each campaign is presented in Table 1.
Fig. 1.

Schematic showing data infrastructure and the different data sets available for each city
Table 1.
Count of measurements from each city separated by instrument type: RS is optical remote sensing, PS is point sampling
| City | Instrument | Count of measurements |
|---|---|---|
| Milan | PS | 10852 |
| RS | 35568 | |
| Prague | RS | 120611 |
| PS | 10658 | |
| Krakow | RS | 128883 |
| PS | 6162 | |
| Total | All | 1646031 |
Cloud computing platform
Microsoft Azure (https://azure.microsoft.com) provides Infrastructure as a Service (IaaS) through a range of virtual machine, storage and network offerings that allow a user to build and manage their own virtual infrastructure in the cloud. Azure’s Platform as a Service (PaaS) allows a user to develop and deploy web-applications, mobile applications and APIs without worrying about underlying infrastructure. PaaS is useful for building new applications quickly and without the need to worry about managing infrastructure. The data storage platform chosen for this project was CosmosDB (Guay Paz & Guay Paz, 2018).
CosmosDB is a globally distributed database service that allows for the creation and management of document, key-value, graph and column-family data models on one service. It is designed to offer high-throughput, low latency data access to applications around the world. The flexibility offered by CosmosDB is critical to the operation of the data platform. Data collected as part of remote emission sensing campaigns can be inconsistent from location to location and across instruments, with no official standard for the data being designated. As such, a data platform has to be flexible enough to accommodate significantly different data sets and serve them consistently to the end user for analysis. Data security is also an issue that CosmosDB solves effectively. As different data providers are subject to different data sharing agreements, and are not necessarily allowed to see each other’s data, using containers within the CosmosDB structure allows storage on the same database but limiting the access to designated containers depending on the case. Role based access can then be designed into the interfaces, limiting the availability of data to authorised user accounts. There is also a General Data Protection Regulation (GDPR, European Parliament and Council of the European Union (2016)) concern related to serving raw data to unverified end-users. Directly identifiable information such as vehicle registration number are not stored in the database, and vehicle passage time is truncated to the hour (a vehicle passing through at 15:27:00 will be registered at 15:00:00, for example). The truncation of the timestamp means that even with direct access to the database it would currently be impossible to regressively identify a vehicle and hence a user. A hashed and salted (Sriramya & Karthika, 2015) vehicle identifier is on occasion recorded, with the salt being changed regularly to avoid long-term vehicle usage trends being identified by bad actors. To guard against unanticipated future counter-security developments, only visualisations of aggregated and subset data are presented to the user. Neither the individual measurement data, or the tabular aggregated data are ever presented to the end user. The user may not directly query the database themselves without prior permission being granted. This means that a user may get the benefits of the aggregated data but does not risk the consequences of a GDPR breach.
The data structure used on the database was the key-pair JSON style document. This method allowed for flexible storage and retrieval of a wide number of different data sources. The data structure was designed to be transferrable between instruments and measurement campaigns to allow for easy querying of the database by the web-app. An example showing the most important key pairs is shown below. 
Data is accessed by querying the database using the NoSQL format. The web-app uses a range of widgets and inputs to build a query which is then sent to the database. The query is processed by the database and a data frame in a standardised format is returned to the web-app. The data frame is then converted into a plot. An example NoSQL query is shown below and would create a data frame containing two columns, one being the measurement value and one being the site name. The data returned would be all vehicles from the selected container (c) that match the emission standard, fuel type, vehicle type and measurement type. The CARES public app does not grant the user the ability to create custom text strings that can be sent as queries to both protect the database from malicious behaviour such as NoSQL injection attacks, and to preserve the standardisation of the plots that are generated. The secure science app, introduced in Section 2.3, allows for more customised analysis to be performed, including gross emitter vehicle candidate identification, point sampling plume time series analysis, and other in-depth questions. Users with access keys may build fully customised queries and analyse the data in their usual workflows.
Measuring vehicle kinematics and emissions on the kerbside is a complex and often messy undertaking, with observations being influenced by external factors beyond the control of the measurer. Hence, quality control of the data is important especially when the end user may not be a technical expert. Various strategies are built into the data structure including the source file and data ingress time. This allows for rapid removal of the data as it is queried in the event of errors being discovered. Typically the measurements are left as delivered and adjustment factors are built into the post-process and data delivery stages of the infrastructure. A better conversion factor is more easily changed by adjusting a variable in the database query code than rebuilding the database from scratch. Additional technical quality control checks can also be applied as queries are built. For example, the vehicle specific power (VSP) in kW/T of a measurement is calculated as part of the RS measurement using speed and acceleration light gate measurements. Light gates have historically been shown to be very accurate (Rushton, 2016) however occasionally errors occur resulting in unphysical results. The observed VSP can be compared to the known power in kW and the vehicle mass in tons, to determine if the observed VSP was within the limits of the vehicle being measured. An example query is shown below:
Web-application
Web applications were developed to improve the accessibility of the data. Different users have different needs and so three web applications were developed to facilitate different levels of access to the data: the City App was developed for the core CARES team including government or city authorities of project partner cities and the Science App was developed for more experienced users of RS data to gain greater insight into the data. These two apps are protected by an authentication layer to further limit who may access the data, ensuring that all data sharing agreements remain intact. The third Public App was developed as a platform for dissemination of the project’s open data. This version is based on the City App and contains many of the same features but with access to data that is publicly available.
The web-application was developed in Python primarily using Streamlit (https://streamlit.io) as the front end as well as Altair (VanderPlas et al., 2018) and Pandas (Wes McKinney, 2010) to handle the data processing and visualisation. The web-app consists of a series of widgets designed to be user friendly that would build a set of queries to the database. The queries are then executed, returning an appropriate data set and the data set is then visualised in predetermined figures. This strategy allowed for the smoothest operation of the database and minimal cost due to data access. This also allows some quality control to be integrated by limiting exactly what analysis could be performed on the data.
The CARES Public App offers a wide range of high-level views on the data collected for each city within the CARES project. Table 2 shows the features available for each viewing mode.
Table 2.
Summary of categories and subcategories of data visualisation available in the CARES Public App
| Category | Subcategory |
|---|---|
| Fleet Composition | Euro Class |
| Euro Class and Fuel Type | |
| Manufacturer | |
| Vehicle Type | |
| Vehicle Dynamics | Vehicle Specific Power (VS) |
| Speed and Acceleration | |
| Vehicle Emissions | Emissions by Euro Class |
| Emissions by Year of Registration | |
| Emissions by Manufacturer | |
| Weather | Ambient Temperature |
| Atmospheric Pressure | |
| Relative Humidity |
Each of these modes may be viewed across the city as a whole or subset again by location. Each city has at least two different locations for data collection. Each measurement campaign has a different data specification and an objective of this project is to simplify the process of interacting with the data for the end user. To this end, the web app sorts through the user’s request and presents them with the appropriate data channels using natural language processing provided by the fuzzywuzzy python library (https://github.com/seatgeek/fuzzywuzzy). The full specification of each site is detailed in the CARES literature (Davison et al., 2022; Farren et al., 2023; Hao et al., 2022a, b; Knoll et al., 2023; Pechout et al., 2022; Pöhler et al., 2023; Qiu & Borken-Kleefeld, 2022; Yang et al., 2022).
Outputs
This section will give a sense of the data visualisation capabilities of the CARES Public App. This web-app has been designed to be an interactive interface and whilst it is acknowledged that much of the value is lost when presented in a static format, it remains useful to demonstrate the capability where possible.
Fleet composition
Visualising the fleet composition is useful to understand the differences in the local fleet rather than relying on nationwide or super-national estimates of vehicle sales. Local classes of vehicles with high emission characteristics can be identified more readily and with greater confidence. Figure 2 shows how a user might see the different fleet subsets across two different sites, Cilea and Madre Cabrini (Milan). Some differences in the fleet composition is readily determined from Fig. 2 as Cilea has significantly more buses. This is useful information to have when designing policy as buses can be affected by different policies than passenger cars and private hire vehicles. There is also a higher fraction of newer Euro 6d and Euro 6d-temp vehicles in Madre Cabrini, suggesting different land use patterns and/or different demographics. Whilst some local knowledge might be assumed, the data presented allows for a direct reference to be made, free of assumptions about the underlying situation.
Fig. 2.

Figure showing the distribution of different diesel fuelled vehicle types (colour), emission standards (bars) and observation site (Cilea and Madre Cabrini facets) from the Milan campaign. A measurement recorded as null signifies that no metadata about a vehicle was available
It may be of interest to know the most prevalent manufacturers of vehicle that are present in a given location and the app and this is shown in Fig. 3. This information can be linked to vehicle emissions subset by manufacturers to identify any potential problematic trends.
Fig. 3.

Web-app capture of figure showing the distribution of the vehicle fleet by manufacturer at Madre Cabrini, Milan, with a greater area indicating a greater number of records
Vehicle emissions
Clean air zones do not always guarantee clean air (Holman et al., 2015) so understanding the emissions profiles of different vehicle subsets in a fleet is of critical importance to developing policy to reduce it. Various figures can be drawn using the Public App to help a policy maker understand the differences in RDE between different fleet subsets. Figure 4 shows two histograms of emission ratios of for two different fleet subsets. The top panel shows the older Euro 5 ( 2009 - 2014) diesel passenger cars and the current Euro 6d (after date of first registration January 2021 - September 2024). The reduction in emission factors from Euro 5 to Euro 6d vehicles both in modal value and extreme values, as discussed in Yang et al. (2022) and Rushton et al. (2021) would indicate to the policy maker considering a low emission zone or clean air zone (CAZ) that allowing even the last generation of type approved vehicles in would reduce its impact quite significantly. A similar plot can be made using year of registration instead of Euro standard. This is useful when a longer time interval is being considered as it allows for the impact of deterioration to be visualised. Figure 5 shows the ratio of CO to as a function of vehicle registration date. The ratio of CO to for passenger vehicles is relatively flat after 2002, suggesting that passenger cars made after 2002 do not exhibit a significant change due to deterioration. Light 2/3/4 wheel vehicles remain stable after 2008, but show significant differences prior to 2008, suggesting that deterioration due to age is a significant factor in the overall emission ratio of these vehicles. The variance of light commercial vehicles is too high to identify a clear trend using this dataset.
Fig. 4.

Web app capture of histogram showing the different distribution of emission ratios of from older Euro 5 diesel passenger cars and the newest Euro 6d diesel passenger cars taken from the Milan campaign
Fig. 5.
Web app capture of time series showing the magnitude of the mean emission ratios for each year of registration between 1970 and 2024 for petrol vehicles during the Milan campaign
The policy maker may also be interested to know if there are any systematic differences between different manufacturers in their fleet. Figure 6 shows how this might be visualised for the six most common vehicle manufacturers in the Milan campaign. This visualisation is helpful for identifying potentially problematic fleet subset but also putting them in the context of their prevalence. A small number of vehicles may be responsible for a large amount of the emissions (Rushton et al., 2021) and a more granular view of the fleet may help policy makers identify the most polluting vehicles and design policy to more effectively target them.
Fig. 6.
Web app capture of bubble plot showing the magnitude of the mean fuel mass emission ratios and count of measurements for each of the six most prevalent vehicle manufacturers during the Milan campaign
Vehicle dynamics
Vehicle dynamics, usually presented as vehicle specific power (VSP) (Jimenez-Palacios, 1998) is an important metric for understanding vehicle emissions. emissions increase linearly with increased power because of the increased physical work done. Air pollution emissions are often non-linear due to the complicated chemical reactions that underpin the catalytic processes in the pollution mitigation systems (Carslaw et al., 2013). Visualising the speed and acceleration characteristics of vehicles at a site gives operators a better understanding of the real driving conditions at a given site and will allow more specific policy to be developed. The measurable distributions of speed and acceleration, alongside local knowledge possessed by the government or city authority officers in charge can lead to much more effective policy development than relying on national averages or similar crude aggregate values.
Weather
Ambient weather conditions (Fikeraddis & Endeshaw, 2020) are important for understanding the external influencing factors on air pollution concentration. Factors such as temperature, wind conditions and humidity influence the concentration and type of air pollution at a site and may be measured by some remote sensing instrument clusters. and volatile organic compounds (VOC) interact in the presence of sunlight to produce photochemical smog (Dimitriades, 1972). Additionally, emitted from tailpipes (primary ) reacts in the presence of sunlight () through photodissociation to create ozone () and the reaction of nitric oxide (NO) with creates secondary . Colder temperatures can also impact air pollution due to an increase in tailpipe emission due to cold starts and colder running (Dardiotis et al., 2013). It is important to understand the difference in context when determining the best approach for air pollution mitigation. Strategies that work in colder locations may not work in warmer ones, and policy makers need to be aware of these differences.
Summary and future work
Summary
The work presented in this paper is a timely and necessary advancement required for increasing the impact of direct measurements of vehicle emissions in-situ. The presented methodology for data organisation, analysis and dissemination represents a flexible and scalable solution to vehicle remote emission sensing data, which has so far been limited to paper documentation and static markdown documents. The potential of this approach has been recognised and the collective of researchers developing remote sensing technology, part of the ERMES group (https://ermesgroup.eu/web), have access to this platform and have been using it to inform their decision making and planning for future work.
Since the deployment of the CARES Public App there has been interest from additional authorities with their own data. These include data from campaigns conducted in Bosnia, Ireland and Switzerland. These data are yet to be added to the CARES Public App because they are covered by different data sharing agreements. In the future they may become available to the wider vehicle emissions and air pollution community, and the public more generally.
Remote sensing campaigns are still expensive to run and the data collected remains proprietary but with the expanded capabilities of the CARES Public App leading the way, it is probable that in the future there will be a shift in attitude towards a more open data approach. A more open data approach, coupled with the automation of standard remote sensing tasks within a secure environment and a more intuitive interface, would go a long way to standardising the processes and outputs from this research. The standardisation will facilitate further research including cross-site analysis and identification of synergies and differences across geographical locations. These new insights will ensure that the people responsible for making air quality policy decisions have the best possible tools and data available.
This work has successfully developed a data infrastructure that utilises cloud computing and data science for the analysis of vehicle emissions, demonstrating a pragmatic approach to environmental monitoring and policy support. Through the integration of traditional remote sensing and innovative point sampling methods, the project offers an enhanced framework for the detailed investigation of real-driving emissions (RDE) in urban settings. This infrastructure not only facilitates an improved evaluation of air pollution sources but also lowers the barrier of entry for policymakers and researchers to access and analyse real driving emissions data. Importantly, the data platform is publicly and freely available online, providing a valuable resource for ongoing and future environmental research and policy development, as well as citizen engagement. The collaborative nature of this project reflects a forward-thinking approach to addressing the complex challenges of urban air quality management. This work can be used as an example of leveraging technological advancements in the pursuit of creating sustainable urban environments and improving public health.
Future work
Data will continue to be collected and measurement methodologies will continue to improve and diversify. A flexible and scalable approach to handling and serving this data to those that require it will continue to be important to ensure that the maximum impact is achieved from these measurements. There are four areas identified for future work to enable this platform to continue to make meaningful contributions to the vehicle emissions and air pollution policy discussion.
Addition of new data from future remote sensing surveys
Integration of supplementary third-party data sets
Improve time-series data capacity
Integration of IoT for close to real-time analysis
As more surveys are conducted using remote sensing technology, and increasingly using point sampling there will be an ever-increasing data resource that needs to be managed. The data structure of this project has been designed with this in mind and is flexible enough to handle most different data sets. Increasing the breadth and depth of the data set will allow for more and more reliable conclusions to be drawn. It will also enable policy to be more finely tuned to the context it will be deployed in. Currently, data is added to the database on a case-by-case basis and no automatic procedure has been developed. Data to be added is manually quality controlled and judgements about standardisation are made before adding it to the database. This process currently takes a week of development time depending on the similarity, size and complexity of the new data. It is beyond the scope of the work to develop an automated process at this time and researchers interested in engaging with this project are encouraged to contact the paper’s authors.
Currently inter-city comparison is not appropriate as the three cities presented represent very different contexts. Differences in fleet composition, climate, and city policy must be taken into account so analysis resembling like-for-like comparison may be misleading and self-defeating. As the data resource increases the opportunity to do more comparative analysis will present itself though. As more data from cities with similar contexts are made available the contextual variables can be controlled and tested using a range of different methodologies and the web-app platform will be able to present this analysis to a wide range of stakeholders.
Raw data from point sampling and also from mobile instruments deployed in plume chasing vehicles take the form of time series data. Time series data may not be used in the same way as snapshot data in the presented data structure. The CosmosDB document format allows for extended measurements to be included in single documents. A potential new data structure for these specific types of measurements may be introduced to the database to facilitate deeper understanding and more diverse data sets than simple snapshots.
Supplementary data sets can currently be accessed in delayed time. Two data sets of interest to air pollution analysts might be the United Kingdom’s Automatic Urban and Rural Network (AURN) data set and the National Oceanic and Atmospheric Administration (NOAA) Integrated Surfaces Database (ISD) data set. These data sets are stored as a series of RDA data files compatible with the R programming language. Both of these can be accessed in R using the openair and worldmet packages respectively. They may also be accessed using Python using the pyreadr package (https://pypi.org/project/pyreadr/) and connecting directly to the data sources. The AURN dataset which can be accessed online is updated at around midnight each day and access to the previous day’s data is added. The NOAA ISD data set availability is delayed by two days. Neither of these data sets are therefore suitable for exactly real-time comparison but can be integrated into any longer-term analysis in an on-demand fashion.
The main RS technological developments moving forward are likely to be infrastructure based as well as scientific improvements. The current approach to data retrieval from the instruments remains slow and cumbersome despite efforts to improve speed in serving data to the final user. A further area for improvement is making the devices Internet of Things (IoT) capable. Once RS devices can connect to the cloud the devices could begin transferring emissions data to the database in real time. The Azure platform has full IoT functionality built in and connecting devices to the cloud would allow local authorities and policy developers the ability to see, in close to real-time, what is happening on their streets. GDPR regulations do not allow the storage of this data without significant modifications and removals of personally identifiable data. Whilst a public good argument might be made to improve access, the data collection infrastructure and processes would need to be significantly overhauled. This overhaul would include the addition of significant automated processes. Meta-data which is of interest to the scientific community as well as the regulatory community may not be as granular as historical remote sensing data, however demonstration of a secure IoT platform will increase confidence of data holders that RS data systems can be secure.
Acknowledgements
This project has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under Grant Agreement No 814966. The sole responsibility for the content of this document lies with the authors. It does not necessarily reflect the opinion of the funding authorities. The funding authorities are not responsible for any use that may be made of the information contained herein.
Authors’ contributions
Christopher Rushton is the main author of the paper and developer of the methodology presented within it. James Tate is the editor and supervisor of this work. Ake Sjodin is the principal investigator for this project and has provided valuable testing and feedback throughout.
Funding
This project has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under Grant Agreement No 814966. The sole responsibility for the content of this document lies with the authors. It does not necessarily reflect the opinion of the funding authorities. The funding authorities are not responsible for any use that may be made of the information contained herein.
Data availability
The raw data for this project is not publicly available and neither is the source code for the app presented. These are to ensure compliance with any data protection rules that may apply, specifically GDPR. The front facing app is publicly and freely available at https://cares-public-app.azurewebsites.net.
Declarations
Ethics approval and consent to participate
No ethics approval is required. No consent to participate is required. All data collected is GDPR compliant.
Consent for publication
Consent is given for publication.
Competing interests
No conflicts or competing interests are declared.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- Adams, H., Nieuwenhuijsen, M., Colvile, R., McMullen, M., & Khandelwal, P. (2001). Fine particle (pm2. 5) personal exposure levels in transport microenvironments, London, UK. Science of the Total Environment,279(1–3), 29–44. [DOI] [PubMed] [Google Scholar]
- Boogaard, H., Janssen, N. A., Fischer, P. H., Kos, G. P., Weijers, E. P., Cassee, F. R., van der Zee, S. C., de Hartog, J. J., Meliefste, K., Wang, M., Brunekreef, B., & Hoek, G. (2012). Impact of low emission zones and local traffic policies on ambient air pollution concentrations. Science of the Total Environment,435, 132–140. [DOI] [PubMed] [Google Scholar]
- Breslow, L., & Goldsmith, J. (1958). Health effects of air pollution. American Journal of Public Health and the Nations Health,48(7), 913–917. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brunekreef, B., & Holgate, S. T. (2002). Air pollution and health. The Lancet,360(9341), 1233–1242. [DOI] [PubMed] [Google Scholar]
- Carslaw, D. C. (2005). Evidence of an increasing no 2/nox emissions ratio from road traffic emissions. Atmospheric Environment,39(26), 4793–4802. [Google Scholar]
- Carslaw, D. (2023). worldmet: Import Surface Meteorological Data from NOAA Integrated Surface Database (ISD). https://davidcarslaw.github.io/worldmet/index.html. http://davidcarslaw.github.io/worldmet/
- Carslaw, D. C., Beevers, S. D., Tate, J. E., Westmoreland, E. J., & Williams, M. L. (2011a). Recent evidence concerning higher no x emissions from passenger cars and light duty vehicles. Atmospheric Environment,45(39), 7053–7063.
- Carslaw, D., Beevers, S., Westmoreland, E., Williams, M., Tate, J., Murrells, T., Stedman, J., Li, Y., Grice, S., Kent, A., & Tsagatakis, I. (2011b). Trends in nox and no2 emissions and ambient measurements in the UK. Defra.
- Carslaw, D. C., & Ropkins, K. (2012). openair – an r package for air quality data analysis. Environmental Modelling & Software,27–28, 52–61. [Google Scholar]
- Carslaw, D. C., Williams, M. L., Tate, J. E., & Beevers, S. D. (2013). The importance of high vehicle power for passenger car emissions. Atmospheric Environment,68, 8–16. [Google Scholar]
- Colvile, R., Hutchinson, E., Mindell, J., & Warren, R. (2001). The transport sector as a source of air pollution. Atmospheric Environment,35(9), 1537–1565. [Google Scholar]
- Committee on the medical effects of air pollutants. (2015). Statement on the evidence for the effects of nitrogen dioxide on health. https://assets.publishing.service.gov.uk/media/5a80c25340f0b6230269557c/COMEAP_The_evidence_for_the_effects_of_nitrogen_dioxide.pdf. Accessed 6 Jan 2025.
- Dallmann, T., Bernard, Y., Tietge, U., & Muncrief, R. (2019). Remote sensing of motor vehicle emissions in paris. The Real Urban Emissions Initiative.
- Dardiotis, C., Martini, G., Marotta, A., & Manfredi, U. (2013). Low-temperature cold-start gaseous emissions of late technology passenger cars. Applied Energy,111, 468–478. [Google Scholar]
- Davison, J., Rose, R. A., Farren, N. J., Wagner, R. L., Wilde, S. E., Wareham, J. V., & Carslaw, D. C. (2022). Gasoline and diesel passenger car emissions deterioration using on-road emission measurements and measured mileage. Atmospheric Environment: X,14, 100162. [Google Scholar]
- Dimitriades, B. (1972). Effects of hydrocarbon and nitrogen oxides on photochemical smog formation. Environmental Science & Technology,6(3), 253–260. [Google Scholar]
- Ding, H., Cai, M., Lin, X., Chen, T., Li, L., & Liu, Y. (2021). Rtvemvs: Real-time modeling and visualization system for vehicle emissions on an urban road network. Journal of Cleaner Production,309, 127166. [Google Scholar]
- European Environment Agency. (2008). Sources of air pollution. https://www.eea.europa.eu/publications/2599XXX/page010.html. Accessed 6 Jan 2025.
- European Parliament and Council of the European Union. (2016). Regulation (EU) 2016/679 of the European Parliament and of the Council on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation). https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX%3A32016R0679. Accessed 6 Jan 2025.
- Farren, N. J., Schmidt, C., Juchem, H., Pöhler, D., Wilde, S. E., Wagner, R. L., Wilson, S., Shaw, M. D., & Carslaw, D. C. (2023). Emission ratio determination from road vehicles using a range of remote emission sensing techniques. Science of the Total Environment,875, 162621. [DOI] [PubMed] [Google Scholar]
- Fikeraddis, M., & Endeshaw, L. (2020). Influence of temperature and relative humidity on air pollution in Addis Ababa, Ethiopia. Journal of Environmental & Earth Sciences,2(2), 19–25. [Google Scholar]
- Fonken, L., Xu, X., Weil, Z. M., Chen, G., Sun, Q., Rajagopalan, S., & Nelson, R. J. (2011). Air pollution impairs cognition, provokes depressive-like behaviors and alters hippocampal cytokine expression and morphology. Molecular Psychiatry,16(10), 987–995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Franco, V., Kousoulidou, M., Muntean, M., Ntziachristos, L., Hausberger, S., & Dilara, P. (2013). Road vehicle emission factors development: A review. Atmospheric Environment,70, 84–97. [Google Scholar]
- Gatto, N. M., Henderson, V. W., Hodis, H. N., John, J. A. S., Lurmann, F., Chen, J.-C., & Mack, W. J. (2014). Components of air pollution and cognitive function in middle-aged and older adults in Los Angeles. Neurotoxicology,40, 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gauderman, W. J., Avol, E., Gilliland, F., Vora, H., Thomas, D., Berhane, K., McConnell, R., Kuenzli, N., Lurmann, F., Rappaport, E., Margolis, H., Bates, D., & Peters, J. (2004). The effect of air pollution on lung development from 10 to 18 years of age. New England Journal of Medicine,351(11), 1057–1067. [DOI] [PubMed] [Google Scholar]
- Ghaffarpasand, O., Beddows, D. C., Ropkins, K., & Pope, F. D. (2020). Real-world assessment of vehicle air pollutant emissions subset by vehicle type, fuel and euro class: New findings from the recent UK edar field campaigns, and implications for emissions restricted zones. Science of The Total Environment,734, 139416. [DOI] [PubMed] [Google Scholar]
- Giechaskiel, B., Maricq, M., Ntziachristos, L., Dardiotis, C., Wang, X., Axmann, H., Bergmann, A., & Schindler, W. (2014). Review of motor vehicle particulate emissions sampling and measurement: From smoke and filter mass to particle number. Journal of Aerosol Science,67, 48–86. [Google Scholar]
- Gu, D., Andreev, K., & Dupre, M. E. (2021). Major trends in population growth around the world. China CDC Weekly,3(28), 604. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guay Paz, J. R., & Guay Paz, J. R. (2018). Introduction to azure cosmos db. Microsoft Azure Cosmos DB Revealed: A Multi-Model Database Designed for the Cloud. pp. 1–23.
- Hao, L., Yin, H., Wang, J., Tian, M., Wang, X., Ge, Y., Bernard, Y., & Sjödin, Å. (2022a). Research on analysis method of remote sensing results of no emission from diesel vehicles. Atmosphere,13(7), 1100.
- Hao, L., Zhao, Z., Yin, H., Wang, J., Li, L., Lu, W., Ge, Y., & Sjödin, Å. (2022b). Study of durability of diesel vehicle emissions performance based on real driving emission measurement. Chemosphere,297, 134171. [DOI] [PubMed]
- Holman, C., Harrison, R., & Querol, X. (2015). Review of the efficacy of low emission zones to improve urban air quality in European cities. Atmospheric Environment,111, 161–169. [Google Scholar]
- Horbanski, M., Pöhler, D., Lampel, J., & Platt, U. (2019). The ICAD (iterative cavity-enhanced DOAS) method. Atmospheric Measurement Techniques,12(6), 3365–3381. [Google Scholar]
- International agency for research on cancer. (2013). Outdoor air pollution a leading environmental cause of cancer deaths. https://www.iarc.who.int/wp-content/uploads/2018/07/pr221_E.pdf. Accessed 6 Jan 2025.
- Jimenez-Palacios, J. L. (1998). Understanding and quantifying motor vehicle emissions with vehicle specific power and TILDAS remote sensing. PhD thesis, Massachusetts Institute of Technology.
- Kampa, M., & Castanas, E. (2008). Human health effects of air pollution. Environmental Pollution,151(2), 362–367. [DOI] [PubMed] [Google Scholar]
- Kinney, P. L., Gichuru, M. G., Volavka-Close, N., Ngo, N., Ndiba, P. K., Law, A., Gachanja, A., Gaita, S. M., Chillrud, S. N., & Sclar, E. (2011). Traffic impacts on pm2. 5 air quality in Nairobi, Kenya. Environmental Science & Policy, 14(4), 369–378. [DOI] [PMC free article] [PubMed]
- Knoll, M., Penz, M., Juchem, H., Schmidt, C., Pöhler, D., & Bergmann, A. (2023). Large-scale automated emission measurement of individual vehicles with point sampling. Atmospheric Measurement Techniques.
- Knoll, M., Penz, M., Schmidt, C., Pöhler, D., Rossi, T., Casadei, S., Bernard, Y., Hallquist, Åsa, M., Åke, S., & Bergmann, A. (2024). Evaluation of the point sampling method and inter-comparison of remote emission sensing systems for screening real-world car emissions. Science of The Total Environment, 932 171710. [DOI] [PubMed]
- Lomborg, B. (2020). Welfare in the 21st century: Increasing development, reducing inequality, the impact of climate change, and the cost of climate policies. Technological Forecasting and Social Change,156, 119981. [Google Scholar]
- O’Driscoll, R., Stettler, M. E., Molden, N., Oxley, T., & ApSimon, H. M. (2018). Real world co2 and nox emissions from 149 euro 5 and 6 diesel, gasoline and hybrid passenger cars. Science of the Total Environment,621, 282–290. [DOI] [PubMed] [Google Scholar]
- Pastorello, C., & Melios, G. (2016). Explaining road transport emissions: a non-technical guide. European Environment Agency.
- Pechout, M., Jindra, P., Hart, J., & Vojtisek-Lom, M. (2022). Regulated and unregulated emissions and exhaust flow measurement of four in-use high performance motorcycles. Atmospheric environment: X,14, 100170. [Google Scholar]
- Pöhler, D., Schmidt, C., Horbanski, M., Schmitt, S., & Lampel, J. (2023). Identification of manipulated & defective truck nox emission reduction systems with plume chasing for authority inspections. Transportation Research Procedia,72, 1661–1668. [Google Scholar]
- Popp, P. J., Bishop, G. A., & Stedman, D. H. (1999). Development of a high-speed ultraviolet spectrometer for remote sensing of mobile source nitric oxide emissions. Journal of the Air & Waste Management Association,49(12), 1463–1468. [DOI] [PubMed] [Google Scholar]
- Power, M. C., Weisskopf, M. G., Alexeeff, S. E., Coull, B. A., Spiro, A., III., & Schwartz, J. (2011). Traffic-related air pollution and cognitive function in a cohort of older men. Environmental Health Perspectives,119(5), 682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qiu, M., & Borken-Kleefeld, J. (2022). Using snapshot measurements to identify high-emitting vehicles. Environmental Research Letters,17(4), 044045. [Google Scholar]
- Ropkins, K., DeFries, T. H., Pope, F., Green, D. C., Kemper, J., Kishan, S., Fuller, G. W., Li, H., Sidebottom, J., Crilley, L. R., Kramer, L., Bloss, W. J., & Hager, J. S. (2017). Evaluation of edar vehicle emissions remote sensing technology. Science of the Total Environment,609, 1464–1474. [DOI] [PubMed] [Google Scholar]
- Ropkins, K., Walker, A., Philips, I., Rushton, C., Clark, T., & Tate, J. (2022). Change detection of air quality time-series using the r package aqeval. Available at SSRN 4267722.
- Rushton, C. E. (2016). Measuring and modelling vehicle nox emissions using a remote sensing device. https://etheses.whiterose.ac.uk/18000/. Accessed 6 Jan 2025.
- Rushton, C. E., Tate, J. E., & Shepherd, S. P. (2021). A novel method for comparing passenger car fleets and identifying high-chance gross emitting vehicles using kerbside remote sensing data. Science of The Total Environment,750, 142088. [DOI] [PubMed] [Google Scholar]
- Rushton, C. E., Tate, J. E., Shepherd, S. P., & Carslaw, D. C. (2018). Interinstrument comparison of remote-sensing devices and a new method for calculating on-road nitrogen oxides emissions and validation of vehicle-specific power. Journal of the Air & Waste Management Association,68(2), 111–122. [DOI] [PubMed] [Google Scholar]
- Schriefl, M., Nishida, R., Knoll, M., Boies, A., & Bergmann, A. (2020). Characterization of particle number counters based on pulsed-mode diffusion charging. Aerosol Science and Technology,54(7), 772–789. [Google Scholar]
- Sriramya, P., & Karthika, R. (2015). Providing password security by salted password hashing using bcrypt algorithm. ARPN Journal of Engineering and Applied Sciences,10(13), 5551–5556. [Google Scholar]
- Stanaway, J. D., Afshin, A., Gakidou, E., Lim, S. S., Abate, D., Abate, K. H., Abbafati, C., Abbasi, N., Abbastabar, H., Abd-Allah, F., Abdela, J., Abdelalim, A., Abdollahpour, I., Abdulkader, R. S., Abebe, M., Abebe, Z., Abera, S. F., Abil, O. Z., Abraha, H. N., ... Murray, C. J. L. (2018). Global, regional, and national comparative risk assessment of 84 behavioural, environmental and occupational, and metabolic risks or clusters of risks for 195 countries and territories, 1990–2017: a systematic analysis for the global burden of disease study 2017. The Lancet,392(10159), 1923–1994. [DOI] [PMC free article] [PubMed]
- Stedman, D. H., & Bishop, G. A. (1990). An analysis of on-road remote sensing as a tool for automobile emissions control. Digital Comons @ DU. https://digitalcommons.du.edu/cgi/viewcontent.cgi?article=1007&context=feat_publications. Accessed 6 Jan 2025.
- Stedman, D. H., Bishop, G. A., Beaton, S. P., Peterson, J. E., Guenther, P. L., McVey, I. F., & Zhang, Y. (1994). On-road remote sensing of CO and HC emissions in California. Digital Commons @ DU. https://digitalcommons.du.edu/cgi/viewcontent.cgi?article=1069&context=feat_publications. Accessed 6 Jan 2025.
- Tate, J. E. (2013b). Vehicle emission measurement and analysis - Sheffield city council. https://www.sheffield.gov.uk/sites/default/files/docs/pollution-and-nuisance/air-pollution/air-quality-management/Vehicle%20Emission%20Measurement%20and%20Analysis%202013.pdf. Accessed 6 Jan 2025.
- VanderPlas, J., Granger, B., Heer, J., Moritz, D., Wongsuphasawat, K., Satyanarayan, A., Lees, E., Timofeev, I., Welsh, B., & Sievert, S. (2018). Altair: Interactive statistical visualizations for python. Journal of open source software,3(32), 1057. [Google Scholar]
- Ward, J. W., Michalek, J. J., & Samaras, C. (2021). Air pollution, greenhouse gas, and traffic externality benefits and costs of shifting private vehicle travel to ridesourcing services. Environmental Science & Technology,55(19), 13174–13185. [DOI] [PubMed] [Google Scholar]
- Wes McKinney (2010). Data Structures for Statistical Computing in Python. In S. van der Walt, & J. Millman (Eds.), Proceedings of the 9th Python in Science Conference (pp. 56 – 61).
- WHO. (2006). Who air quality guidelines for particulate matter, ozone, nitrogen dioxide and sulfur dioxide: global update 2005: summary of risk assessment. Technical report.
- WHO. (2013). Review of evidence on health aspects of air pollution: Revihaap project: final technical report. [PubMed]
- Yang, S., Fang, D., & Chen, B. (2019). Human health impact and economic effect for pm2. 5 exposure in typical cities. Applied Energy,249, 316–325. [Google Scholar]
- Yang, Z., Tate, J. E., Rushton, C. E., Morganti, E., & Shepherd, S. P. (2022). Detecting candidate high nox emitting light commercial vehicles using vehicle emission remote sensing. Science of The Total Environment,823, 153699. [DOI] [PubMed] [Google Scholar]
- Zhang, K., Batterman, S., & Dion, F. (2011). Vehicle emissions in congestion: Comparison of work zone, rush hour and free-flow conditions. Atmospheric Environment,45(11), 1929–1939. [Google Scholar]
- Zhang, Y., Bishop, G. A., Beaton, S. P., Guenther, P. L., & McVey, L. F. (1996). Enhancement of remote sensing for mobile source nitric oxide. Journal of the Air & Waste Management Association,46(1), 25–29. [DOI] [PubMed]
- Zhang, Y., Stedman, D. H., Bishop, G. A., Guenther, P. L., & Beaton, S. P. (1995). Worldwide on-road vehicle exhaust emissions study by remote sensing. Environmental Science & Technology,29(9), 2286–2294. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The raw data for this project is not publicly available and neither is the source code for the app presented. These are to ensure compliance with any data protection rules that may apply, specifically GDPR. The front facing app is publicly and freely available at https://cares-public-app.azurewebsites.net.


