2019 Aug 1;24(31):1800216. doi: 10.2807/1560-7917.ES.2019.24.31.1800216

Table 1. Systematic review on airline passenger data in infectious disease modelling, (A) fields recorded and (B) criteria used to determine reproducibility of articles and sources.

Field Description Variable
A. Data description
Article information
Authors At least the first three authors, as on article Text
Year of publication Date
Title Text
Publication name Text
Data source
Commercial data Commercial databases collecting information about flight routings, aircraft size, number of bookings or passengers, e.g. IATA, OAG, Diio Yes/no
Tourism surveys Any surveys done in the context of tourism, e.g. UNWTO Yes/no
National passenger surveys Surveys conducted at airports, e.g. passenger survey Yes/no
Airport published information Data collected and published by airports, may be groups of airports Yes/no
Government immigration data Data collected by governments on migration numbers, inbound passengers Yes/no
Other E.g. information published by airlines Yes/no
Unreported or unclear Yes/no
Data type
Seat capacity Number of seats available on a specific route Yes/no
Itinerary Data include connections, not just information on origin and destination Yes/no
Number of flights Number of flights between cities/airports/countries following a specific routing Yes/no
Number of passengers Data explicitly describe number of passengers travelling Yes/no
Tickets sold Number of tickets sold or booked per routing Yes/no
Origin–destination information Data include origin airport/city/country and destination airport/city/country Yes/no
Direct flight information only Data do not inform on number of passengers taking connecting flights Yes/no
Unreported or unclear Reported information not sufficient to determine data type Yes/no
Data time period
Date range of data is reported Yes/no
Date range Text
Reporting quality (for scoring criteria, see part B of this table)
Fully reproducible All handling and manipulation of the data are described in sufficient detail to enable reproducibility (reproducibility score = 4) Yes/no
Partially reproducible Important information on handling of the data is missing, or methodology is vague (reproducibility score = 3) Yes/no
Not reproducible Information on methods and/or data source is missing and methodology unclear (reproducibility score ≤ 2) Yes/no
Data validation
Data validation attempted A comparison was made with an independent and appropriate source of information Yes/no
Data usage
Transmission model Airline passenger information is used to parameterise a model of transmission Yes/no
Network analysis Airline passenger information is described using social network methodology Yes/no
Descriptive or illustrative Airline passenger information is used to illustrate a transmission risk, but no formal analysis or modelling is performed Yes/no
Other None of the above (specify or describe what was done) Yes/no
Unclear or unreported Insufficient information to determine data usage Yes/no
Pathogen modelled
Non-specific Generic model Yes/no
MERS coronavirus Yes/no
Seasonal influenza Yes/no
Pandemic influenza Yes/no
Other (specify) Text
B. Reproducibility^a
Data accessibility (mutually exclusive categories) Score contribution^b
Open source Publicly available, no restrictions on use, no access fees, and source (where online) still accessible as at January 2017 Yes = +1; No = 0
Closed source Publicly available but restricted access, access may be granted following registration and/or fee, e.g. proprietary data Yes = 0; No = 0
Not publicly available Private data, access at discretion of custodian, e.g. airport or airline company information Yes = 0; No = 0
Reporting clarity of data source (all Yes = +1)^c
Source identified The source of the original data is clearly stated Yes/no
Data set named The specific name of the data set or database in the source is reported Yes/no
Access date specified The date(s) on which data were accessed is reported Yes/no
Data type reported The type or unit represented by the data is reported, e.g. number of flights/seats/passengers Yes/no
Reporting clarity of data usage
Data handling reported Data manipulation before analysis, including data cleaning and/or aggregation, is reported Yes = +1; No = 0
Date range of data used
Data time range reported The time period covered by the data is reported Yes = +1; No = 0
Total reproducibility score Maximum score = 4. If multiple sources were used in an article, the average score was calculated.

Diio: data in, intelligence out; IATA: International Air Transport Association; MERS: Middle East respiratory syndrome; OAG: company providing air travel data; UNWTO: World Tourism Organization.

a If studies used a third party’s travel model and did not describe the model fully but provided a link or citation, we assessed the cited external documentation for reproducibility.

b Only material using open source data contributes +1 point to the reproducibility score.

c The material must receive a ‘yes’ for all subvariables for this variable to contribute +1 point to the reproducibility score.
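
The scoring rules in part B and footnotes b and c can be read as simple arithmetic. The following is a minimal sketch of one such reading, not the authors' implementation. The class and function names are hypothetical. The table gives categories only for scores of 4, 3 and ≤ 2, so the handling of non-integer averages here (treating any score of at least 3 but below 4 as partially reproducible, and anything below 3 as not reproducible) is an assumption.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SourceAssessment:
    """Part B answers for one data source (field names are hypothetical)."""
    open_source: bool                # data accessibility: open source?
    source_identified: bool          # reporting clarity of data source
    data_set_named: bool
    access_date_specified: bool
    data_type_reported: bool
    data_handling_reported: bool     # reporting clarity of data usage
    data_time_range_reported: bool   # date range of data used


def source_score(a: SourceAssessment) -> int:
    """Score one data source on the 0 to 4 scale of part B."""
    # Footnote c: all four subvariables must be 'yes' to earn the point.
    source_clarity = all([
        a.source_identified,
        a.data_set_named,
        a.access_date_specified,
        a.data_type_reported,
    ])
    # Footnote b: only open source data earns the accessibility point.
    return (
        int(a.open_source)
        + int(source_clarity)
        + int(a.data_handling_reported)
        + int(a.data_time_range_reported)
    )


def article_score(sources: List[SourceAssessment]) -> float:
    """Average the per-source scores when an article uses several sources."""
    if not sources:
        raise ValueError("at least one data source is required")
    return sum(source_score(s) for s in sources) / len(sources)


def reporting_quality(score: float) -> str:
    """Map a score to a part A category (fractional cut-offs are assumed)."""
    if score >= 4:
        return "fully reproducible"
    if score >= 3:
        return "partially reproducible"
    return "not reproducible"
```

Under this reading, an article that combines one source meeting every criterion with one closed source that is otherwise fully reported would average (4 + 3) / 2 = 3.5 and be classed as partially reproducible.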