Table 1. Systematic review on airline passenger data in infectious disease modelling, (A) fields recorded and (B) criteria used to determine reproducibility of articles and sources.
Field | Description | Variable | ||
---|---|---|---|---|
A. Data description | ||||
Article information | ||||
Authors | At least the first three authors, as on article | Text | ||
Year of publication | Date | |||
Title | Text | |||
Publication name | Text | |||
Data source | ||||
Commercial data | Commercial databases collecting information about flight routings, aircraft size, number of bookings or passengers, e.g. IATA, OAG, Diio | Yes/no | ||
Tourism surveys | Any surveys done in the context of tourism, e.g. UNWTO | Yes/no | ||
National passenger surveys | Surveys conducted at airports, e.g. passenger survey | Yes/no | ||
Airport published information | Data collected and published by airports, may be groups of airports | Yes/no | ||
Government immigration data | Data collected by governments on migration numbers, inbound passengers | Yes/no | ||
Other | E.g. information published by airlines | Yes/no | ||
Unreported or unclear | Yes/no | |||
Data type | ||||
Seat capacity | Number of seats available on a specific route | Yes/no | ||
Itinerary | Data include connections, not just information on origin and destination | Yes/no | ||
Number of flights | Number of flights between cities/airports/countries following a specific routing | Yes/no | ||
Number of passengers | Data explicitly describe number of passengers travelling | Yes/no | ||
Tickets sold | Number of tickets sold or booked per routing | Yes/no | ||
Origin–destination information | Data include origin airport/city/country and destination airport/city/country | Yes/no | ||
Direct flight information only | Data do not inform on number of passengers taking connecting flights | Yes/no | ||
Unreported or unclear | Reported information not sufficient to determine data type | Yes/no | ||
Data time period | ||||
Date range of data is reported | Yes/no | |||
Date range | Text | |||
Reporting quality (scoring criteria see Table part B) | ||||
Fully reproducible | All handling and manipulation of the data is described to a detail adequate to enable reproducibility (reproducibility score = 4) |
Yes/no | ||
Partially reproducible | Important information on handling of the data is missing, or methodology is vague (reproducibility score = 3) |
Yes/no | ||
Not reproducible | Information on methods and/or data source is missing and methodology unclear (reproducibility score ≤ 2) |
Yes/no | ||
Data validation | ||||
Data validation attempted | A comparison was made with an independent and appropriate source of information | Yes/no | ||
Data usage | ||||
Transmission model | Airline passenger information is used to parameterise a model of transmission | Yes/no | ||
Network analysis | Airline passenger information is described using social network methodology | Yes/no | ||
Descriptive or illustrative | Airline passenger information is used to illustrate a transmission risk, but no formal analysis or modelling is performed | Yes/no | ||
Other | None of the above (specify or describe what was done) | Yes/no | ||
Unclear or unreported | Insufficient information to determine data usage | Yes/no | ||
Pathogen modelled | ||||
Non-specific | Generic model | Yes/no | ||
MERS coronavirus | Yes/no | |||
Seasonal influenza | Yes/no | |||
Pandemic influenza | Yes/no | |||
Other (specify) | Text | |||
B. Reproducibilitya | ||||
Data accessibility (mutually exclusive categories) | Score contributionb | |||
Open source | Publicly available, no restrictions on use, no access fees, and source (where online) still accessible as at January 2017 | Yes = +1; No = 0 | ||
Closed source | Publicly available but restricted access, access may be granted following registration and/or fee, e.g. proprietary data | Yes = 0; No = 0 | ||
Not publicly available | Private data, access at discretion of custodian, e.g. airport or airline company information | Yes = 0; No = 0 | ||
Reporting clarity of data source | (All Yes = +1)c | |||
Source identified | The source of the original data is clearly stated | Yes/no | ||
Data set named | The specific name of the data set or database in the source is reported | Yes/no | ||
Access date specified | The date(s) on which data were accessed is reported | Yes/no | ||
Data type reported | The type or unit represented by the data is reported, e.g. number of flights/seats/passengers | Yes/no | ||
Reporting clarity of data usage | ||||
Data handling reported | Data manipulation before analysis, including data cleaning and/or aggregation, is reported | Yes = +1; No = 0 | ||
Date range of data used | ||||
Data time range reported | The time period covered by the data is reported | Yes = +1; No = 0 | ||
Total reproducibility score | Maximum score = 4. If multiple sources were used in an article, the average score was calculated. |
Diio: data in, intelligence out; IATA: International Air Transport Association; MERS: Middle East respiratory syndrome; OAG: company providing air travel data; UNWTO: World Tourism Organization.
a If studies used a third party’s travel model and if they did not describe the model fully but provide a link or citation, we assessed the cited external documentation for reproducibility.
b Only material using open source data contributes +1 point to the reproducibility score.
c The material must receive a ‘yes’ for all subvariables for this variable to contribute +1 point to the reproducibility score.