Skip to main content
PLOS One logoLink to PLOS One
. 2021 Oct 28;16(10):e0259108. doi: 10.1371/journal.pone.0259108

Modeling non-pharmaceutical interventions in the COVID-19 pandemic with survey-based simulations

Marius Kaffai 1,*, Raphael H Heiberger 1
Editor: Marco Cremonini2
PMCID: PMC8553158  PMID: 34710181

Abstract

Governments around the globe use non-pharmaceutical interventions (NPIs) to curb the spread of coronavirus disease 2019 (COVID-19) cases. Making decisions under uncertainty, they all face the same temporal paradox: estimating the impact of NPIs before they have been implemented. Due to the limited variance of empirical cases, researchers could so far not disentangle effects of individual NPIs or their impact on different demographic groups. In this paper, we utilize large-scale agent-based simulations in combination with Susceptible-Exposed-Infectious-Recovered (SEIR) models to investigate the spread of COVID-19 for some of the most affected federal states in Germany. In contrast to other studies, we sample agents from a representative survey. Including more realistic demographic attributes that influence agents’ behavior yields accurate predictions of COVID-19 transmissions and allows us to investigate counterfactual what-if scenarios. Results show that quarantining infected people and exploiting industry-specific home office capacities are the most effective NPIs. Disentangling education-related NPIs reveals that each considered institution (kindergarten, school, university) has rather small effects on its own, yet, that combined openings would result in large increases in COVID-19 cases. Representative survey-characteristics of agents also allow us to estimate NPIs’ effects on different age groups. For instance, re-opening schools would cause comparatively few infections among the risk-group of people older than 60 years.

Introduction

In 2020, governments tried to curb the spread of COVID-19 without having an appropriate medical response available. Most governments implemented non-pharmaceutical interventions (NPIs) which affect societal life in an unseen manner. Although there is initial evidence that NPIs, such as the closure of educational facilities [13], workplaces [3, 4] and certain businesses [1, 2], the cancellation of mass gatherings [13], case detection and contact tracing [4, 5], shielding of vulnerable people [6], mobility restrictions [7], wearing masks [8] or specific network intervention strategies [9] are effective in reducing the amount of new cases, it is also clear that NPIs come with severe side effects, e.g., on the economy [911], labour markets [12], people’s physical [13] and mental health [14], or concerns on increasing domestic violence [15]. Thus, decision-makers act under immense pressure to balance public health risks while sustaining economic activities. In so doing, policy-makers are facing a temporal paradox when it comes to the implementation of NPIs: they have to decide whether a particular NPI (or a set thereof) is curbing the spread of COVID-19 without having reliable knowledge about the NPIs’ actual effectiveness. The best way to evaluate the effectiveness of NPIs would be to measure the empirical correlation between the implementation of a NPI and the change of the infection rate. However, an empirical estimation is difficult in many areas due to the unavailability of data. Only few studies exist with a rather narrow set of scenarios [13] or coarse resolutions [16]. Empirical studies on the effect of NPIs are by-design limited in scope because they can only measure the impact of what governments implement, i.e., estimations rest on scenarios which have actually happened. Based on inter-temporal cross-country data, Brauner et al. [1], for instance, cannot disentangle the effect of school closure and the effect of university closure, because in most countries these two institutions were closed almost at the same time. To circumvent one’s reliance on detailed empirical data in order to evaluate a larger variety of scenarios, simulation methods can be used to create and investigate the effect of NPIs which did not occur in reality [17, 18].

In addition, there is great need for more detailed knowledge of how NPIs work in the specific context of a country’s or region’s social structure [17, 19]. Preliminary evidence exists that the spread of COVID-19 cases is starkly influenced by structural circumstances, such as age distributions [20, 21] or network densities of geographic areas [22]. To reflect social structures and relevant characteristics of people’s behavior in simulations, agent-based modeling (ABM) have proven to be good complements to epidemiological differential equation models [9, 17, 23, 24]. Once a model with predictive capability is established [25], it is possible to create scenarios that differ from empirical scenarios and to apply the simulation results to reality [17, 18].

Many researchers already exploit the benefits of data-driven agent-based models to investigate NPIs in the context of a countries’ specific circumstances [4, 6, 2629]). ABM exist for the U.S. [26, 30], Canada [4], Colombia [31], Australia [27], France [6], Singapore [28], Iran [29] and the UK [26]. Our model not only adds (parts of) Germany to this list but uses an innovative method to generate an artificial population of agents which resembles the real population of a federal state and, hence, important parts of its social structure. Thus, instead of corresponding to a few, rather coarse macro-characteristics of the target population, our study represents households of survey participants. The micro-level of the simulation—agents with specific attributes living together with other agents with specific attributes—therefore reflects important properties of the social structure of a society; much more than synthesizing the artificial population based on aggregated macro-data on the target populations’s household structure.

Materials and methods

The overall purpose of our simulation study is to investigate the effects of various non-pharmaceutical interventions (NPI) in curbing the spread of COVID-19. We predict case numbers of counter-factual scenarios in which certain NPIs are (not) implemented and compare the predicted case numbers to empirical case numbers. We evaluate our agent-based model by its ability to reproduce time-series data of observed case numbers in a baseline scenario that mimics the factual implementation of NPIs happened in spring 2020 in Germany. Hence, dropping certain measures or combinations thereof (i.e., deviations from the empirical baseline scenario) allows us to estimate each NPI’s efficacy. We model the infection on the level of federal states. We chose the four states in Germany (Baden-Wuerttemberg, Bavaria, Hamburg, Saarland) with the highest infection rate relative to its population size during the first wave of Covid-19 in Germany (March 2020). To describe the model, we roughly follow the ODD (Overview, Design concepts, Details) protocol for describing individual- and agent-based models [32, 33]. All code is made available to reproduce results or to add further scenarios and, respectively, NPIs.

Entities and scales

The four main entities driving the simulation’s dynamics are agents, locations, a virus and a global environment. The agents represent humans living their daily life in the pandemic. During the day, they execute various activities and spend their time either at home, kindergarten, school, university, supermarket or an occupation-specific work place. Each action depends on the day of week, the time of day and individual characteristics of each agent. The values of the agents’ characteristics, such as age, gender, occupation, daily working hours or the time usually spent in the supermarket, are taken from survey data, i.e., each agent represents a survey respondent in regard to his or her attributes.

Locations represent the different sites where agents can reside during the day and between which agents move. Space is modeled by locations, however, no geographical distance between different locations are represented. The types of locations implemented in the model are agents’ homes, workplaces, school classes, kindergartens, universities and supermarkets, in order to simulate the most important NPIs that had come into effect in each federal state in Germany in Spring 2020. Whenever possible, the number of each type of locations is determined by the composition of the agent population and empirical macro data (cf. subsection Initialization). Locations can only be accessed by assigned agents, e.g., only members of a specific household can enter their home or only employees of a specific company can enter the company’s office.

The virus represents COVID-19 infecting the agents. If an agent gets infected by the virus, a multistage disease process is triggered. The virus can be transmitted from an infectious agent to a susceptible agent, if these two are at the same location (cf. subsection Disease process and virus transmission).

The global environment represents laws and decrees influencing the daily schedule of the agents (cf. subsection Non-pharmaceutical interventions). This means especially the implementation of NPIs, but also the closing of schools due to state-wide holidays. For the base model we create a schedule of NPI implementations and relaxations that mimics the history of events during the first wave of COVID-19 in Germany. In the simulation experiments, we then design contra-factual scenarios using schedules that differ from the empirical baseline scenario (cf. subsection Scenarios and model execution).

Time is represented via discrete steps. 18 time steps represent one day, where each time step usually represents one hour. Only in the night time of each day 6 hours are skipped due to run-time optimization. We assume the night time when most people sleep is not relevant for the spread of the virus. Each run simulates 100 days (1800 time steps), starting with the day when the case numbers in the federal state are closest to 50 cases per 100,000 inhabitants. (Start dates and end dates: Hamburg 2020.03.19–2020.06.26; Baden-Wuerttemberg: 2020.03.21–2020.06.28; Bavaria: 2020.03.23 -2020.06.30; Saarland 2020.03.25–2020.07.02).

Initialization

The first step of model initialization is to create the population of empirical-grounded agents. Each agent-based model includes a population of at least 100,000 agents. The values of the agents’ characteristics, such as age, gender, occupation, daily working hours or the time usually spent in the supermarket, are taken from survey data, i.e., each agent represents a survey respondent in regard to his or her attributes. For that purpose, we use the 2017 German Socio-Economic Panel (SOEP) [34]. The SOEP is a representative panel survey that includes entire households of respondents. To set the agents’ daily routines in the simulation, we classify each agent either as kindergartner, school student, university student, working university student, working or non-working agent. All agents in the age from 0 to 5 are classified as kindergarten kids. Each agent in the age from 6 to 19 is classified as school kid. Older agents are either university students, working university students, working (with a certain occupation) or non-working agents. In order to distinguish the occupational fields of the agents we use the current version of the Statistical Classification of Economic Activities in the European Community (NACE Rev. 2) on level 1 (NACE-sections) and level 2 (NACE-divisions) provided in the SOEP. S1 Table lists all variables used to set agents’ attributes.

Like every survey, SOEP has to deal with missing values. For 3.53% of the sample we replace missing data on working hours by age- and gender-specific group means. For 3.46% of the sample we replace missing data on the occupation by a random occupation code in each simulation run. For cases with missing information about the time spent daily in supermarkets (26,92%) we impute the overall mean (approximately 1h). Finally, 1.62% of the households are excluded, because the case itself or a member of the corresponding household has missing data.

To create a population of agents that represents a given federal state, we first create a subset of SOEP-respondents who reside in the given state. In a second step, we take a random sample of households (with replacement) from the subset until a population size of at least 100,000 is reached. The households’ selection probabilities are not equal, but are weighted by the corresponding cross-sectional representativity weight provided by the SOEP for each household. Due to the random sampling of households of different sizes, the final population size is either exactly or slightly above 100,000.

After the creation of the population, the simulated world is built, which consists of different types of locations. In a first step, for each type of location the necessary frequency is calculated. In a second step, the agents are assigned to selected locations. For each household of agents a home is created in which all members of the household (taken from the SOEP) reside. The number of additional location types is adjusted to the characteristics of the agent population and the respective federal state. S1 Table shows the parameters we use to calculate the corresponding number of locations in the simulation model. After all locations are created, kindergarten kids are randomly assigned to one of the kindergartens, pupils are randomly assigned to one of the school classes, university students are assigned to the university and working agents are randomly assigned to one work place of the corresponding NACE-section. All agents above the age of 13 are assigned to two randomly selected super markets.

In order to start a pandemic when the simulation executes, at the beginning of each single simulation run the infection status of 50 randomly selected agents is set to infectious.

Main simulation loop

After the initialization, the main simulation loop starts and simulates one (hourly) time step every round until 100 days were simulated. In each time step, the current date and time are updated by incrementing the datetime by one hour. Because most of the night time is skipped, the clock is set forward to 7 a.m. if it is 1 o’clock.

Every agent starts each time step by an update of its infection status, i.e., an agent checks whether it has to set it’s infection status from exposed to infectious with symptoms or from infectious with symptoms to recovered (cf. subsection Disease process and virus transmission). In the next step, if the agent is infectious, it potentially infects (susceptible) agents present at the same location (cf. subsection Disease process and virus transmission). In the next two processes the agent first checks whether it stays at home that day due to feeling sick or due to an ongoing isolation of the whole household (cf. subsection Non-pharmaceutical interventions). If neither is the case, agents start, continue or finish an activity depending on the weekday, the clock time, the age, student status, daily work hours and the NPIs that are currently in effect (cf. subsection Activities).

Activities

Activities cause agents to leave their home during the day, to visit other locations, to encounter other agents, and as a consequence, to potentially spread the virus through physical proximity. An activity is implemented in the simulation as staying at a location for a certain period of time. Every agent has a daily schedule of activities that determines which locations an agent visits, and for how long. The default activity that every agent exhibits when there is no other activity on its schedule is being at home. From Monday to Friday at 8 o’clock, kindergarten kids go to kindergarten, school kids go to school, university students attend university, and workers go to work. School kids and kindergarten kids return to their homes after 5 hours, university students after 4. While the dwell time at kindergarten, school and university are based on assumptions, the average daily working hours represent those of the modeled survey respondents (we have tested the effect of differing assumptions on the case numbers in the baseline model—cf. S1S3 Figs). If university students have jobs, they first go to work and then go to university. When agents are at home and there is no other activity, there is a certain probability to go to the supermarket. The average time an agent spends at the supermarket is taken from the SOEP. To account for the possibility of home office in the simulation, working agents change their location to their workplace only with a certain probability and otherwise start working from home. The probabilities to work from home are derived from data on the frequency of home office use per NACE-division in Germany in 2018 [35] (cf. S4 Table).

Disease process and virus transmission

The biological aspect of the infection (i.e., what happens to an agent after the virus was transmitted) was implemented in strong orientation to the way the process of infection is implemented in Covasim [36]. It can be considered as an extended S-E-I-R (susceptible, exposed, infectious, recovered) infection model [37]. The process of infection by COVID-19 is modeled by an multi-stage infection process including up to six different phases of infection an agent can have: susceptible, exposed, infectious, infectious with symptoms, infectious without symptoms and recovered. Except for 50 randomly selected agents starting with the status infectious, every agent is initially susceptible to an infection. When an agent got infected, the agent goes through the stages of infection. For each agent and each stage of infection we draw individual time spans from a log-normal-distribution following Covasim [36] (cf. S1 Table). In addition, the decision whether an agent develops symptoms or not is determined by an age-dependent probability, which also resembles Covasim [36].

The first stage after the infection occurs is being exposed to the virus. At this stage, the virus has entered the agent’s body, but the agent is not yet infectious or symptomatic. The second stage of infection is being infectious. This is the time during the infection when the agent is already infectious but not yet symptomatic because the body has not had enough time to develop symptoms. In the next stage the agent can either develop symptoms or remain completely asymptomatic. Finally, an agent reaches the stage of being recovered from the infection. We assume permanent immunity after recovery, thus, an agent cannot be infected for a second time.

The agent-to-agent transmission of the virus is one of the most essential elements in the modeling of infectious diseases. In our model, a transmission of the virus can occur whenever an infectious agent and a susceptible agent are at the same time in the same place. At each time step, each infectious agent randomly selects one agent that is currently in the same location (if the infectious agent is currently not alone in its location). If the agent randomly selected by the infectious agent is susceptible to infection, its infection status is set to ‘exposed’ by a certain probability of infection. This infection probability per time step is an important, yet unknown parameter and is therefore determined by model calibration to empirical infection data of a given state (cf. section Calibration). In the model calibrated for this study, these infection probabilities range from 0.04727 to 0.0645 (cf. Table 3).

Non-pharmaceutical interventions

Non-pharmaceutical interventions (NPIs) affect the agents’ daily schedule by canceling certain activities and causing the agents to stay at home instead. The changes on the agents’ daily lives due to NPIs considered in the model are the following: less to no time at school, kindergarten or university, an increased amount of time working from home, quarantining households of infected agents and reduced or no work hours due a (partial) shutdown or economic downturn in certain occupational fields. For most of the NPIs a daily probability distribution determines for each agent individually whether it starts a certain activity and leaves its home or not. In response to active NPIs, the specific probability reflects whether a certain location is closed (or to what degree), or that work hours are reduced for a given NACE-section. For example, a (partially) closed school is implemented as a daily probability that determines for each school kid separately whether he or she goes to school that day. To model that only half of the pupils are allowed to go to school, the daily probability to go to school for each pupil is set to 0.5. If schools are completely open, the probability for school attendance is 1. Complete closure of schools are reflected in a daily probability of school attendance equal to 0. If, for instance, 80 percent of a specific NACE-division has to work from home, the daily probability to visit the work place is set to 0.2 for all agents belonging to that specific NACE-division. Besides the potential closure of work places and educational facilities also the isolation of households is implemented as an NPI. If an agent shows symptoms of an infection over a certain period of time, the agent and all household members are quarantined at home for 14 days. The period of time from the onset of symptoms of one of the household member to the isolation of the whole household is one of the two calibrated parameters and lies between 21 and 26 time steps (cf. Table 3).

Scenarios and model execution

The model allows us to simulate different scenarios where the duration and degree of implementation of specific NPIs can be varied by entering specific timetables during model initialization. Those timetables contain daily information about the state of each NPI. Because the initial goal of the model is to capture the course of the pandemic during its first wave in Germany, we create a baseline model with a timetable which mimics the factual implementation of NPIs as it took place in Germany in spring 2020. In the baseline model we mimic the course of the lockdown in Germany with its onset at around march 16 and the following 100 days of disease spread. Table 1 shows the varying degree of implementation of different NPIs by date in the baseline scenario.

Table 1. Timetable of NPIs in baseline scenario.

NPI March 16 April 20 May 6 May 18 June 2 June 15
Kindergartens 0.1 0.1 0.3 0.3 0.3 0.6
Schools 0 0.1 0.3 0.3 0 0.3
NACE-section G 0.25 0.5 1 1 1 1
NACE-section I 0.25 0.25 0.5 1 1 1
NACE-section P 0.25 0.5 1 1 1 1
NACE-section R 0.25 0.5 0.75 0.75 0.75 0.75
NACE-section S 0.25 0.5 1 1 1 1
Universities 0.1
Working from home maximum
Household isolation yes
Work hour reduction yes

Table 1 shows the timetable of NPI implementation in the baseline scenario mimicking the factual course of NPI implementation in the first wave of COVID19 in Germany in 2020. For kindergartens, schools and universities as well as the five NACE-sections the values reflect the degree of openness for each kind of location (cf. subsection Non-pharmaceutical interventions). For the NPI Working from Home the value maximum means the full utilization of industry-specific capacities to work from home. See S4 Table for a full list of home-office exploitation rates per NACE-division. For industry-specific values regarding the Work hour reduction see S3 Table.

To estimate the effects of specific NPIs, we simulate different contra-factual scenarios using timetables that differ in one or more aspects from the one used in the baseline model. It is important to note that each effect is the change in the number of cases if a given NPI had not been implemented while all other NPIs remain active (if not reported differently, e.g., the combined effects of education-related NPIs in the School_Kinder-scenario). Overall, the simulation experiment includes 9 counterfactual scenarios with differing timetables. The counterfactual scenarios and their deviation from the baseline scenario are listed in Table 2.

Table 2. Manipulation of NPIs in counter-factual scenarios.

Scenario Manipulation Value Baseline value
Open.Schools Schools (degree of openness) 1 0–0.3
Open.Kinder Kindergartens (degree of openness) 1 0.1–0.6
Open.Uni Universities (degree of openness) 1 0.1
Normal.HomeOffice Working from home (industry-specific use of hours worked f. home) normal maximum
No.Quarantine Household-isolation no yes
Normal.Work.Hours Work hour redu.; NACE-sec. G, I, P, R, S (degree of openness) no; 1 yes; 0.25–1
Open.AllEduc Scenario combines Open.Schools & Open.Kinder & Open.Uni
Open.Schools.Kinder Scenario combines Open.Schools & Open.Kinder

Table 2 shows the differences between the scenario-specific timetables and the baseline scenario timetables shown in column “Baseline value” and in detail in Table 1. The values in the baseline scenario represent the factual degree of NPI-implementation in the first wave of COVID19 in Germany 2020.

For each state and scenario, we simulate 100 days, starting with the day when a state had approximately 50 cases per 100,000 inhabitants. In each run of the simulation we collect the new infections per day, the cumulative cases on a day and the age of the infected agents. If the size of the agent population is exactly equal to 100000 cases, frequencies can be directly interpreted as cases per 100000 inhabitants. Otherwise, the infection frequencies are calculated by I*100000/P, where I is the number of infections and P is the actual population size in the simulation. Therefore, all results can be interpreted as cases per 100000. Due to the many stochastic elements used in the simulation, each single run of the model returns different results, even when the same set of parameters is used. Thus, to obtain reliable results for a given set of parameters we run 60 repetitions of model runs and average the results.

Calibration

Although the model includes relevant micro-level mechanisms of a pandemic and a representative agent population, we must perform some iterative fine-tuning as is often done in ABMs, i.e., calibrating the model to reproduce empirical data [38]. To calibrate the ABMs on infection frequencies in a given federal state, we determine the values of two parameters: the hourly probability an infected agent will infect another agent, and the time needed to quarantine a household after the first symptoms of a household member appear. Among the different simulated scenarios, the baseline scenario is the one whose goal is to represent the scenario that actually happened in reality and thus the one that should be able to reproduce the corresponding empirical data on infections. Therefore, we calibrated the model in the baseline scenario to the corresponding data on the cumulative infection frequencies per 100.000 inhabitants in a given state. We performed the calibration using the Latin Hypercube Sampling algorithm [38], which is implemented in the python package Spotpy [39]. For each model to be calibrated, we searched for the set of model parameters that generate simulated data with the smallest deviation to the corresponding daily empirical data on cumulative number of infections (data source: [40]). We stopped the calibration process when at least one set of parameters was found to generate simulated data with a fit of a root mean squared error less than 20. S6 Fig shows the parameter combinations sampled using Latin Hypercube Sampling and the resulting model fit using these parameter combinations in the baseline scenario. Table 3 shows the parameter combinations yielding the best model fit for each federal state.

Table 3. Calibrated parameters.

Parameter Baden-Wuerttemberg Bavaria Hamburg Saarland
Infection probability per time step 0.06213 0.0645 0.0502 0.04727
Time steps from symptoms to household quarantine 21 23 26 23

Because the calibration of the model for a single federal state requires at least 180 simulation runs (each consisting itself of 60 internal replications and one replication takes approx. 20 minutes) due to varying parameter sets, each process of calibration was serialized on 192 cores provided by BW-HPC, a statewide computing and data infrastructure.

Results

To derive the efficacy of NPIs in counterfactual scenarios, the first step is to ensure the predictive capability of the model by assessing its fit to the actual spread of COVID-19. The proposed model matches the empirical data of the daily number of cumulative cases for each of the four most affected federal states in Germany (Fig 1). Our model predicts the rapid rise of COVID-19 cases in the early stages of the pandemic as well as its subsequent decelerated growth in which the infection numbers stabilize. The model also simulates accurately the different levels of disease spread in the four states, which vary considerable by structure (cf. Supporting information).

Fig 1. Model fit by state.

Fig 1

The solid lines depict the cumulative number of reported COVID-19 cases and the average of cumulative infections across 60 simulation replications per state. Shaded areas represent the 95% confidence intervals of the averages. The root mean squared error of the model’s predictions ranges between 8.414 for Bavaria and 16.56 for Hamburg.

After establishing that the model actually predicts the empirical spread of the Corona virus, we can turn to implementing counterfactual scenarios. We do that by removing each NPI that was at work in reality and examine the effect of its omission from the baseline scenario in the simulation. Fig 2 reveals that decreasing the urge to quarantine infectious cases and reducing home office hours to regular percentages would cause the starkest increase across all implemented NPIs. While quarantine is, by a margin, the most important NPI in the two large territorial states (Baden-Wuerttemberg and Bavaria), it is a bit less important than increased home-office in the two smaller states. At the other end of the efficacy spectrum is the scenario in which there was no reduction of work hours. Across all states, it has almost no effect in our simulation if people would have worked normal hours. This small impact, e.g., compared to the impact of working from home, can be explained by the rather small empirical changes in working hours due to economic downturns. While working time reductions were close to zero in most industries, only hospitality industry (NACE-section I) and arts, entertainment and recreation (NACE-section R) experienced significant reductions of about 32.3% and 15.5%, respectively. For further details on the reduction of work hours per industry sector see S3 Table. In addition, the total closure of businesses and facilities is only selective and temporary in the baseline scenario, which can be seen in Tables 1 and 2.

Fig 2. Effect of counterfactual non-implementation of NPIs, by state.

Fig 2

Each line represents the average cumulative case numbers per scenario. While all NPIs considered are implemented in the baseline scenario to mimic the real scenario, the lines of the other scenarios show the estimated cumulative case numbers caused by the omission of one or more NPIs. For exact values of the average cumulative case numbers, standard deviations and confidence intervals on day 100, see also S5 and S6 Tables.

Which education facilities may be kept running is a controversial topic in Germany, since it affects the lives of millions of children and parents. Turning to education-related NPIs, our model predicts a large increase of COVID-19 cases if all three types of institutions (kindergartens, schools, and universities) would re-open. In contrast to previous studies [1], our model also allows us to inspect the effect of each education facility separately. In the counterfactual scenarios presented in Fig 2, opening schools would have increased COVID-19 cases by several hundred (per day). Fig 3 disentangles the numbers further. It shows that opening all education-related institutions at the same time has a considerable effect (third largest overall, Fig 2), which exceeds the sum of the effects of each institution in all four regions. One possible explanation for this large effect of combined opening of educational institutions is that children who go to kindergarten often have siblings who go to school, and vice versa, as one can see in S9 Table. Thus, households with children could link two of the largest social network hubs (schools and kindergartens) in Germany, making it easy for the virus to spread to large segments of society that would otherwise be unreachable. Therefore, contagious effects seem at work which can be mostly attributed to the interplay of kindergarten kids and pupils (Open.Schools_Kinder in Fig 3). If comparing education-related NPI individually, we see that the effects of all three types are rather similar. Yet, opening universities has slightly lower effects than the other two, and opening schools causes the highest average effect in all four states.

Fig 3. Cumulative infection numbers on day 100 for each counterfactual non-implementation of education-related NPIs, by state.

Fig 3

Boxplots depict the distribution of all replications. For exact values of the average cumulative case numbers, standard deviations and confidence intervals on day 100, see also S5 and S6 Tables.

In general, the opening of schools and kindergartens, both combined or each separately, seem to have notable effects on the case numbers due to two further points besides the linkage between pupils and kindergarten kids through households. Both pupils and kindergarten kids also typically live in relatively large households, which are breeding grounds for virus transmission (cf. S9 Table). In addition, those households not only seem to provide a link between pupils and kindergarten kids, but also provide a link between these two groups and the world of working adults, as S9 Table shows. On the other side, those household compositions can also explain the minor importance of universities regarding the number of infections. University students live in households of smaller average size compared to the households of other people. In addition, university students rarely live together with school kids or kindergarten kids, thus the contacts between those groups are limited (cf. S9 Table). Furthermore, in our sample the frequency of university students is much smaller than kindergarten kids or pupils (cf. S8 Table).

Besides considering NPIs separately, using an ABM with representative survey respondents as agents allows us to trace infections by age group. Age constitutes one of the main factors to identify risk groups [20]. In general and for each NPI, the mean age of infected agents decreases over time (Fig 4). This reflects recent findings using rare empirical data on the age of infected [41]. Furthermore, Fig 4 details how each NPI affects different age groups. It reveals that the counterfactual removal of NPIs drive COVID-19 numbers through the infection of agents younger than 60. Across all states, the scenarios in which schools re-open and pre-pandemic home office rates are attained lead to the lowest proportion among the elderly risk-group. In contrast, re-opening the universities yields a higher proportion of infections among people older than 60, although the vast majority of infections pertains younger people and the absolute numbers of COVID-19 caused by re-opening universities are rather low (Fig 2).

Fig 4. Share of infected agents belonging to the risk-group (age > = 60), by state.

Fig 4

Lines represent the share of cumulative cases in the risk group for each NPI over all simulation runs. To facilitate readability, a ribbon of confidence intervals is omitted. Figures including the CIs are available upon request.

Discussion

Overcoming the temporal paradox of deciding whether the implementation of a particular NPI is decreasing the spread of COVID-19 before knowing its actual effect is crucial for policy-makers and researchers alike. Testing such counterfactual what-if scenarios could improve the basis for decision-making considerably and help to decide which NPIs to implement. Our approach illustrates a way to estimate the effectiveness of otherwise unobservable scenarios by simulating agents with attributes taken from a large survey representing about 27 million people. In so doing, our study generates insights on the effectiveness of several widespread NPIs (quarantine, home-office, closing universities, schools, and kindergartens, reduced work hours) in four federal states in Germany. Going beyond previous studies, our approach enables us to disentangle the effect of individual NPIs and, hence, provides information for policy makers who need to balance the trade-off underlining each NPI between individual and economic freedom on the one side, and prevention of disease spread on the other.

It can be shown that focused quarantining of infected agents and their household members (i.e., close contacts) is the most effective NPI. Governments should therefore make every effort to increase their ability of case detection and contact tracing [4, 5]. The second largest effect is the home office scenario. It can be shown that an increased usage of home office capacities is a very effective measure to reduce the number of new infections, which is in line with observational studies [42]. This result is particularly important, because although there was a considerable increase of home office usage in the beginning of the pandemic [43], both employers and employees still refuse to comprehensively rely on home office in the long term [44].

The most controversial NPI is the closure of educational facilities. In contrast to many observational studies [1, 16, 45], we are able to estimate a separate effect for each education-related NPI. Our simulations suggest that the separate re-opening of each institution yields rather low infection numbers. However, combined openings lead to multiple infections, in particular, re-opening kindergartens and schools at the same time increase COVID-19 cases considerably. Therefore, alternating periods of open educational institutions could represent a promising way to maintain low infection numbers while sustaining institutional functions.

A key characteristic of COVID-19 is its elevated mortality for older people and the subsequent importance of demographics [20]. Taking into account social structures by using agents with survey-characteristics and in accordance with a very recent study for the USA [41], our findings reveal that younger age (less risk-prone) groups sustain the spread of COVID-19 when the fore-mentioned NPIs are omitted. Furthermore, simulations suggest that the removal of NPIs affect age groups differently. Considering age adds an important puzzle piece how to select NPIs, i.e., it allows to distinguish the impact of NPIs in regard to its overall effects and group-specific effects for the people who are most vulnerable to COVID-19.

Like other simulation studies, our analysis has several limitations. First, our model does not contain a representation of contacts between households or agents that do not take place in one of the implemented locations, e.g., contacts with relatives or friends. Second, our model does not contain locations were people meet during spare time. However, the influence of both points is mitigated because private contacts and leisure activities were reduced to a minimum in the observed period in Germany. Having said that, care should be taken when applying the results to other countries or other times. For instance, while in our model elderly and retired people are somewhat protected against COVID-19 outbreaks among younger people because they often live in separate households and do not have explicit contact with relatives, this might not be the case in countries with different household compositions. Finally, the SOEP survey does not contain information on respondents’ social network, hence, we could not consider this important transmission channel.

A further limitation is that we do not model changes of people’s behavior. For example, we do not take into account the change in caution about infection risks in daily life or the obligation to keep distance and wear masks inside buildings. Also beyond the scope of this paper is the inclusion of external influences. Instead, we simulate a closed system. After the initial infections at the beginning of a run no infections occur due to agents coming home from outside, e.g., commuters or tourists.

As many other simulation studies, we had to assume some parameter values due to a lack of empirical data. However, we conducted a series of robustness checks for the most important of those parameters (cf. S1S5 Figs). Sensitivity analyses suggest that the results are robust to changes in the parameters and, hence, to potential errors in assumed values.

Despite those limitations, we provide a reproducible model based on empirical agents which can be adopted and/or extended. For instance, the current model including above NPIs could be applied on any states or regions. Researchers would only need to calibrate the model with subsequent empirical data on COVID-19 cases and survey data. To represent ongoing vaccination efforts, the model could also be modified so that vaccinated agents have lower probabilities to transmit the virus. In addition, researchers could extend the existing model with alternative NPIs which may be relevant for a specific region. The simulated world can be amplified by many other meeting places in which certain persons come into contact.

Despite all efforts and increasing availability of vaccinations, it appears that COVID-19 will accompany our societies for some time to come [46]. Thus, the demand for data-driven epidemiological models including realistic social structures and demography will stay high. Our model is an instance how to incorporate survey-based populations and test counterfactual scenarios. The development and improvement of such models and the timely availability of more accurate data was maybe never more important than in the current global pandemic.

Supporting information

S1 Table. Values and data sources of simulation parameters.

* At the time of conducting our simulation experiments, we used the most recent values listed in the latest version of Covasim and the associated article. Unfortunately, these values have changed (slightly) in the final release of Covasim. Nevertheless, we do not expect these changes to have a significant impact on our model results [34, 36, 47, 48].

(PDF)

S2 Table. Descriptive statistics on federal states [34, 49, 50].

(PDF)

S3 Table. Reduction of work hours per NACE-section.

S3 Table gives the percentages to which the work hours were reduced in the second quarter (April, May, June) of 2020 as compared to 2019 per NACE-section. We assume that the differences between 2020 and 2019 are mainly caused by the COVID-19 pandemic. The values are taken from [51].

(PDF)

S4 Table. Amount of working from home per NACE-divisions.

The column “Normal” gives the share of people frequently working at home office per NACE-division. The column “Maximum” gives the share to which the work could be done in home office i.e. the maximum capacity of working from home per NACE-division. We interpret all values as share of work hours per NACE-division. We assume that in the total lock- and shutdown in the first wave of COVID-19 in Germany 2020 the full capacity of working from home was exploited in each NACE-division. All values are taken from [35].

(PDF)

S5 Table. Case numbers on day 100.

S5 Table shows the cumulative number of infected agents per 100,000 agents after 100 simulated days, averaged over 60 replications per state and scenario, along with the standard deviation and 95% confidence interval.

(PDF)

S6 Table. Case numbers on day 100 as the difference to baseline scenario.

S6 Table shows the cumulative number of infected agents per 100,000 agents after 100 simulated days, averaged over 60 replications per state and scenario, as the difference from the corresponding baseline scenario, along with the standard deviation and 95 percent confidence interval.

(PDF)

S7 Table. Infected agents by age-group.

S7 Table shows the proportion of each age group in the cumulative number of infected agents on simulation day 100.

(PDF)

S8 Table. Share of vocational statuses per federal state.

S8 Table shows the proportion of different vocational statuses per federal state in the population of agents. “None/other” are those agents who are neither working, studying, or of kindergarten or school age. “Working” means any agent who works more than zero hours per day and is not enrolled at a university.

(PDF)

S9 Table. Household composition by vocational status and state.

S9 Table shows for each vocational status the average number of agents living in the same household by their vocational status. For instance, on average, an agent classified as a kindergarten child in Baden-Wuerttemberg lives in a household that contains 1.53 kindergarten children, 0.63 schoolchildren, 0.03 students, 1.4 employed persons, and 0.58 other persons. “None/other” are those agents who are neither working, studying, or of kindergarten or school age. “Working” means any agent who works more than zero hours per day and is not enrolled at an university.

(PDF)

S1 Fig. Robustness check: Hours at kindergarten.

Compares the number of COVID-19 infections in the baseline scenario across different values for the daily hours at kindergarten. The value in the center is the one used in the main analysis.

(TIF)

S2 Fig. Robustness check: Hours at school.

Compares the number of COVID-19 infections in the baseline scenario across different values for hours at school. The value in the center is the one used in the main analysis.

(TIF)

S3 Fig. Robustness check: Hours at university.

Compares the number of COVID-19 infections in the baseline scenario across different values for hours at university. The value in the center is the one used in the main analysis.

(TIF)

S4 Fig. Robustness check: Number of colleagues.

Compares the number of COVID-19 infections in the baseline scenario across different values for number of colleagues. The value in the center is the one used in the main analysis.

(TIF)

S5 Fig. Robustness check: Maximum number of students per university.

Compares the number of COVID-19 infections in the baseline scenario across different values for the maximum number of students per university. We used 100,000 in the main analysis, which means that there is only one university per simulated world.

(TIF)

S6 Fig. Sampled parameter space.

S6 Fig shows the tested combinations of the two calibrated parameters and the respective model fit when using the combination in the baseline model. The parameter combination that gives the best model fit is indicated by the cross.

(TIF)

Acknowledgments

We are deeply grateful for helpful comments made by Lisa Schöllhammer, Uwe Remer and André Bächtiger, which greatly improved this manuscript. In addition, we thank the state of Baden-Wuerttemberg for providing us with computing capacity through bwHPC.

Data Availability

The simulation output data and the code to reproduce the plots and result tables are stored in this repository: DOI 10.17605/OSF.IO/B9YPG. Unfortunately, we are not allowed to share the Socio-Economic Panel (SOEP) data or parts of it due to a very restrictive data access policy. Therefore, we cannot share data on the agent population based on the SOEP. The SOEP is a representative panel study for Germany conducted annually by DIW Berlin. Researchers affiliated with an institution can request access to SOEP at this website: https://www.diw.de/en/diw_01.c.601584.en/data_access.html. All of the code used to run the simulation and the publicly available datasets used to calibrate the simulation model are available in this repository: https://github.com/mariuzka/covid19_sim.

Funding Statement

The authors received no specific funding for this work.

References

  • 1. Brauner JM, Mindermann S, Sharma M, Johnston D, Salvatier J, Gavenčiak T, et al. Inferring the effectiveness of government interventions against COVID-19. Science. 2020;. doi: 10.1126/science.abd9338 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Hunter PR, Colon-Gonzalez F, Brainard JS, Rushton S. Impact of non-pharmaceutical interventions against COVID-19 in Europe: a quasi-experimental study. medRxiv. 2020; p. 2020.05.01.20088260. doi: 10.1101/2020.05.01.20088260 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Islam N, Sharp SJ, Chowell G, Shabnam S, Kawachi I, Lacey B, et al. Physical distancing interventions and incidence of coronavirus disease 2019: natural experiment in 149 countries. BMJ. 2020;370. doi: 10.1136/bmj.m2743 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Ng V, Fazil A, Waddell LA, Bancej C, Turgeon P, Otten A, et al. Projected effects of nonpharmaceutical public health interventions to prevent resurgence of SARS-CoV-2 transmission in Canada. CMAJ. 2020;192(37):E1053–E1064. doi: 10.1503/cmaj.200990 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Giordano G, Blanchini F, Bruno R, Colaneri P, Di Filippo A, Di Matteo A, et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nature Medicine. 2020;26(6):855–860. doi: 10.1038/s41591-020-0883-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Hoertel N, Blachier M, Blanco C, Olfson M, Massetti M, Rico MS, et al. A stochastic agent-based model of the SARS-CoV-2 epidemic in France. Nature Medicine. 2020;26(9):1417–1421. doi: 10.1038/s41591-020-1001-6 [DOI] [PubMed] [Google Scholar]
  • 7. Lima LL, Atman APF. Impact of mobility restriction in COVID-19 superspreading events using agent-based model. PLOS ONE. 2021;16(3):e0248708. doi: 10.1371/journal.pone.0248708 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Mitze T, Kosfeld R, Rode J, Wälde K. Face masks considerably reduce COVID-19 cases in Germany. Proceedings of the National Academy of Sciences. 2020;117(51):32293–32301. doi: 10.1073/pnas.2015954117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Nishi A, Dewey G, Endo A, Neman S, Iwamoto SK, Ni MY, et al. Network interventions for managing the COVID-19 pandemic and sustaining economy. Proceedings of the National Academy of Sciences. 2020;. doi: 10.1073/pnas.2014297117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Polyakova M, Kocks G, Udalova V, Finkelstein A. Initial economic damage from the COVID-19 pandemic in the United States is more widespread across ages and geographies than initial mortality impacts. Proceedings of the National Academy of Sciences. 2020;117(45):27934–27939. doi: 10.1073/pnas.2014279117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Sharma D, Bouchaud JP, Gualdi S, Tarzia M, Zamponi F. V–, U–, L– or W–shaped economic recovery after Covid-19: Insights from an Agent Based Model. PLOS ONE. 2021;16(3):e0247823. doi: 10.1371/journal.pone.0247823 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Coibion O, Gorodnichenko Y, Weber M. Labor Markets During the COVID-19 Crisis: A Preliminary View. National Bureau of Economic Research; 2020. w27017. Available from: https://www.nber.org/papers/w27017.
  • 13. Lippi G, Henry BM, Bovo C, Sanchis-Gomar F. Health risks and potential remedies during prolonged lockdowns for coronavirus disease 2019 (COVID-19). Diagnosis. 2020;7(2):85–90. doi: 10.1515/dx-2020-0041 [DOI] [PubMed] [Google Scholar]
  • 14. Brooks SK, Webster RK, Smith LE, Woodland L, Wessely S, Greenberg N, et al. The psychological impact of quarantine and how to reduce it: rapid review of the evidence. The Lancet. 2020;395(10227):912–920. doi: 10.1016/S0140-6736(20)30460-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Bradbury-Jones C, Isham L. The pandemic paradox: The consequences of COVID-19 on domestic violence. Journal of Clinical Nursing. 2020;29(13-14):2047–2049. doi: 10.1111/jocn.15296 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Dehning J, Zierenberg J, Spitzner FP, Wibral M, Neto JP, Wilczek M, et al. Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions. Science. 2020;369 (6500). doi: 10.1126/science.abb9789 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Squazzoni F, Polhill JG, Edmonds B, Ahrweiler P, Antosz P, Scholz G, et al. Computational Models That Matter During a Global Pandemic Outbreak: A Call to Action. Journal of Artificial Societies and Social Simulation. 2020;23(2):10. doi: 10.18564/jasss.4298 [DOI] [Google Scholar]
  • 18. Gilbert N, Ahrweiler P, Barbrook-Johnson P, Narasimhan KP, Wilkinson H. Computational Modelling of Public Policy: Reflections on Practice. Journal of Artificial Societies and Social Simulation. 2018;21(1):14. doi: 10.18564/jasss.3669 [DOI] [Google Scholar]
  • 19. Colbourn T. COVID-19: extending or relaxing distancing control measures. The Lancet Public Health. 2020;5(5):e236–e237. doi: 10.1016/S2468-2667(20)30072-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Dowd JB, Andriano L, Brazel DM, Rotondi V, Block P, Ding X, et al. Demographic science aids in understanding the spread and fatality rates of COVID-19. Proceedings of the National Academy of Sciences. 2020;117(18):9696–9698. doi: 10.1073/pnas.2004911117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Davies NG, Klepac P, Liu Y, Prem K, Jit M, Eggo RM. Age-dependent effects in the transmission and control of COVID-19 epidemics. Nature Medicine. 2020;26(8):1205–1211. doi: 10.1038/s41591-020-0962-9 [DOI] [PubMed] [Google Scholar]
  • 22.Kuchler T, Russel D, Stroebel J. The Geographic Spread of COVID-19 Correlates with the Structure of Social Networks as Measured by Facebook. National Bureau of Economic Research; 2020. w26990. Available from: https://www.nber.org/papers/w26990. [DOI] [PMC free article] [PubMed]
  • 23. Epstein JM. Modelling to contain pandemics. Nature. 2009;460(7256):687–687. doi: 10.1038/460687a [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Epstein JM, Axtell R. Growing Artificial Societies: Social Science from the Bottom Up. Brookings Institution Press; 1996. [Google Scholar]
  • 25. Oberkampf WL. Simulation Accuracy, Uncertainty, and Predictive Capability: A Physical Sciences Perspective. In: Beisbart C, Saam NJ, editors. Computer Simulation Validation: Fundamental Concepts, Methodological Frameworks, and Philosophical Perspectives. Simulation Foundations, Methods and Applications. Cham: Springer International Publishing; 2019. p. 69–97. Available from: 10.1007/978-3-319-70766-2_3. [DOI] [Google Scholar]
  • 26.Ferguson N, Laydon D, Nedjati Gilani G, Imai N, Ainslie K, Baguelin M, et al. Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand; 2020. Available from: http://spiral.imperial.ac.uk/handle/10044/1/77482.
  • 27. Chang SL, Harding N, Zachreson C, Cliff OM, Prokopenko M. Modelling transmission and control of the COVID-19 pandemic in Australia. Nature Communications. 2020;11(1):5710. doi: 10.1038/s41467-020-19393-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Koo JR, Cook AR, Park M, Sun Y, Sun H, Lim JT, et al. Interventions to mitigate early spread of SARS-CoV-2 in Singapore: a modelling study. The Lancet Infectious Diseases. 2020;20(6):678–688. doi: 10.1016/S1473-3099(20)30162-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Mahdizadeh Gharakhanlou N, Hooshangi N. Spatio-temporal simulation of the novel coronavirus (COVID-19) outbreak using the agent-based modeling approach (case study: Urmia, Iran). Informatics in Medicine Unlocked. 2020;20:100403. doi: 10.1016/j.imu.2020.100403 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Kirpich A, Koniukhovskii V, Shvartc V, Skums P, Weppelmann TA, Imyanitov E, et al. Development of an interactive, agent-based local stochastic model of COVID-19 transmission and evaluation of mitigation strategies illustrated for the state of Massachusetts, USA. PLOS ONE. 2021;16(2):e0247182. doi: 10.1371/journal.pone.0247182 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Gomez J, Prieto J, Leon E, Rodríguez A. INFEKTA—An agent-based model for transmission of infectious diseases: The COVID-19 case in Bogotá, Colombia. PLOS ONE. 2021;16(2):e0245787. doi: 10.1371/journal.pone.0245787 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Grimm V, Berger U, Bastiansen F, Eliassen S, Ginot V, Giske J, et al. A standard protocol for describing individual-based and agent-based models. Ecological Modelling. 2006;198(1-2):115–126. doi: 10.1016/j.ecolmodel.2006.04.023 [DOI] [Google Scholar]
  • 33. Grimm V, Railsback SF, Vincenot CE, Berger U, Gallagher C, DeAngelis DL, et al. The ODD Protocol for Describing Agent-Based and Other Simulation Models: A Second Update to Improve Clarity, Replication, and Structural Realism. Journal of Artificial Societies and Social Simulation. 2020;23(2):7. doi: 10.18564/jasss.425933204215 [DOI] [Google Scholar]
  • 34.(SOEP) SEP. Socio-Economic Panel (SOEP), data for years 1984-2017, version 34, SOEP, 2017.; 2019. Available from: 10.5684/soep.v34. [DOI]
  • 35.Alipour JV, Falck O, Schüller S. Germany’s Capacities to Work from Home; 2020.
  • 36. Kerr CC, Stuart RM, Mistry D, Abeysuriya RG, Rosenfeld K, Hart GR, et al. Covasim: An agent-based model of COVID-19 dynamics and interventions. PLOS Computational Biology. 2021;17(7):e1009149. doi: 10.1371/journal.pcbi.1009149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Abou-Ismail A. Compartmental Models of the COVID-19 Pandemic for Physicians and Physician-Scientists. Sn Comprehensive Clinical Medicine. 2020; p. 1–7. doi: 10.1007/s42399-020-00330-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Thiele JC, Kurth W, Grimm V. Facilitating Parameter Estimation and Sensitivity Analysis of Agent-Based Models: A Cookbook Using NetLogo and R. Journal of Artificial Societies and Social Simulation. 2014;17(3):11. doi: 10.18564/jasss.2503 [DOI] [Google Scholar]
  • 39. Houska T, Kraft P, Chamorro-Chavez A, Breuer L. SPOTting Model Parameters Using a Ready-Made Python Package. PLOS ONE. 2015;10(12):e0145180. doi: 10.1371/journal.pone.0145180 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Esri-Deutschland-GmbH. NPGEO Corona—RKI COVID19; 2021. Available from: https://npgeo-corona-npgeo-de.hub.arcgis.com/datasets/dd4580c810204019a7b8eb3e0b329dd6_0/data.
  • 41. Monod M, Blenkinsop A, Xi X, Hebert D, Bershan S, Tietze S, et al. Age groups that sustain resurging COVID-19 epidemics in the United States. Science. 2021;. doi: 10.1126/science.abe8372 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Alipour JV, Fadinger H, Schymik J. My home is my castle—The benefits of working from home during a pandemic crisis. Journal of Public Economics. 2021;196:104373. doi: 10.1016/j.jpubeco.2021.104373 [DOI] [Google Scholar]
  • 43.Kunze F, Hampel K, Zimmermann S. Homeoffice in der Corona-Krise: eine nachhaltige Transformation der Arbeitswelt?; 2020. Available from: https://kops.uni-konstanz.de/handle/123456789/51524.
  • 44.KFrenken. Research on Home Office: A critical review; 2020. Available from: https://www.eipa.eu/research-on-home-office/.
  • 45. Flaxman S, Mishra S, Gandy A, Unwin HJT, Mellan TA, Coupland H, et al. Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature. 2020;584(7820):257–261. doi: 10.1038/s41586-020-2405-7 [DOI] [PubMed] [Google Scholar]
  • 46. Lerner AM, Folkers GK, Fauci AS. Preventing the Spread of SARS-CoV-2 With Masks and Other “Low-tech” Interventions. JAMA. 2020;324(19):1935. doi: 10.1001/jama.2020.21946 [DOI] [PubMed] [Google Scholar]
  • 47.Destatis. Bildung und Kultur. Allgemeinbildende Schulen. Schuljahr 2018/2019. Fachserie 11 Reihe 1.; 2020. Available from: https://www.statistischebibliothek.de/mir/receive/DEHeft_mods_00112288.
  • 48.Bertelsmann-Stiftung. Ländermonitor Frühkindliche Bildungssysteme. Gruppengröße 01.03.2019. Altersübergreifend Gruppen, ab 0 Jahren.; 2020. Available from: https://www.laendermonitor.de/de/vergleich-bundeslaender-daten/personal-und-einrichtungen/kita-strukturen.
  • 49.Bruttoinlandsprodukt (VGR) | Statistikportal.de; 2020. Available from: http://www.statistikportal.de/de/bruttoinlandsprodukt-vgr.
  • 50.Destatis. Verwaltungsgliederung am 31.03.2020 (1. Quartal); 2020. Available from: https://www.destatis.de/DE/Themen/Laender-Regionen/Regionales/Gemeindeverzeichnis/Administrativ/Archiv/Verwaltungsgliederung/Verwalt1QAktuell.html.
  • 51.Destatis. Arbeitnehmerverdienste. 2. Vierteljahr 2020. Fachserie 16 Reihe 2.1.; 2020. Available from: https://www.destatis.de/DE/Themen/Arbeit/Verdienste/Verdienste-Verdienstunterschiede/Publikationen/Downloads-Verdienste-und-Verdienstunterschiede/arbeitnehmerverdienste-vierteljahr-2160210203225.xlsx?__blob=publicationFile.

Decision Letter 0

Marco Cremonini

13 Aug 2021

PONE-D-21-14064

Modeling non-pharmaceutical interventions in the COVID-19 pandemic with survey-based simulations

PLOS ONE

Dear Dr. Kaffai,

Thank you for submitting your manuscript to PLOS ONE. Your paper has been carefully reviewed and was considered positively, overall. However, some aspects need further consideration and should be integrated with additional content. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

In particular, two are the aspects that need particular attention:

  1. Model description: Reviewer #3 has noted that the model should be described more precisely and with further details in order to be completely clear to the reader. We invite authors to expand the description.

  2. Figure explanation and discussion: All reviewers have noted that figures are not completely clear, either in the discussion or in the caption. We invite authors to revise all figures, their captions, and the discussion of their content according to the comments of all reviewers.

  3. References: Pay specific attention to the following Journal Requirements, pointing to a possible problem with references to retracted articles. References should be carefully checked before resubmission, each one should have a clear rationale for being cited. All changes in the references should be reported in the rebuttal letter to inform the reviewers. 

Minor comments have been also provided and require attention as well.

Overall, the paper has merits and is close to the quality needed for publication, which justifies the minor revision status. This however does not imply that the revision could be limited to few superficial fixes. All comments raised by the reviewers and the journal's editors should be specifically addressed, each one needs a comment explaining how the authors decided to proceed, and edits in the manuscript should be appropriate to the comment raised.  

Please submit your revised manuscript by Sep 26 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Marco Cremonini, Ph.D.

University of Milan

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf.

2. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

"Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

3. Please note that in order to use the direct billing option the corresponding author must be affiliated with the chosen institute. Please either amend your manuscript to change the affiliation or corresponding author, or email us at plosone@plos.org with a request to remove this option.

4. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ.

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments (if provided):

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: In this manuscript, the authors model the effects of non-pharmaceutical interventions to investigate the spread of COVID-19 for some states in Germany, using an agent-based model with a heterogeneous population based on survey data from the German Socio-Economic Panel. From the results of the simulations, they found that some scenarios, as quarantine and home-office, are the most effective measures to reduce COVID-19 cases. They also investigate and evaluate the effect of some NPIs on the number of COVID-19 cases in the risk group (age > 60 years). The research is interesting and presents a model that can be useful to assist in decision-making regarding COVID-19, a current health problem of great concern. The manuscript is well written, the methodology is sufficiently detailed and, the conclusions are supported by the results found. I consider that the article is scientifically sound and suitable for publication in PLOS ONE. I have a few comments nonetheless:

1. It is clear from the text that Figure 1 is the cumulative number of COVID-19 cases, but maybe the authors can make this clear in the caption or y-axis of the figure to avoid doubts. Does Figure 2 represent accumulated cases as well?

2. Perhaps the authors can explore the results of Figure 4 a little further. Are the differences in the behavior of the curves in the four states linked to demographic variation? Could the finding "the scenarios in which schools re-open and pre-pandemic home office rates are attained lead to the lowest proportion among the elderly risk-group" be influenced by the fact that the model does not consider contacts with relatives or friends (as children transmitting to their grandparents, for example)? (I understand that this would be reduced in the case of severe isolation measures, but it is an important point if the model is used in places where children live or stay with older relatives, as in some underdeveloped countries).

3. In the caption of Figure 4, the title says "Number of infected agents" while the legend says "share of cases". I suggest that authors standardize the name so that "number" is not confused with the absolute number. Also, perhaps the authors could consider presenting the percentages infected for each age group, not only for >=60 (maybe in a supplementary table with the final infected number or percentage), but this is just a suggestion.

4. Perhaps the authors might consider changing the yellow color in Figs 2 and 4, which can be difficult to see depending on the readers’ screen.

5. On page 6, line 302, if I am not mistaken, it seems like “Figures 5-9 in SI” means Figures S1-S5.

Reviewer #2: In this paper, “Modeling non-pharmaceutical interventions in the COVID-19 pandemic with survey-based simulations”, the authors aim to develop an agent-based model concerning the “what-if” fundamental question in implementing the NPIs’ during the COVID-19 pandemic.

The topic is interesting and the document is well presented. While the results trends are illustrative enough in the figures, I suggest to add a table which contains the standard deviation, mean and CI for each intervention removal's configuration. These tables, particularly for figure 3 and 4, make the comparison among baseline and other configurations more clear.

From my point of view the submitted paper is suitable for publication in PLOS ONE in the presented form.

Reviewer #3: The paper presents an application of the Agent-based modeling approach to study the efficacy of Non-pharmaceutical interventions (NPIs) following the outbreak of Covid-19. The authors consider a basic SEIR model and couple it with survey data to build a model where they can effectively track and modify the daily routines of the agents.

The approach adopted in the paper is novel and is a promising avenue of further research in agent-based models. While agent-based models have had success as diagnostic tools and have been able to provide great insight into many systems of interest, convincing applications that integrate real-world micro-level data have been few and far between. By basing their model on data from surveys on the behavior and routines of the people from 4 provinces of Germany, the paper is extremely important towards ABMs becoming more widespread as a modeling paradigm.

Having said that, the paper in question does suffer from a certain number of deficiencies, which undermine the message and the importance of the work. Let me begin by some general comments before providing specific comments:

1. A major shortcoming of the paper is that it isn't clear what the model actually is. Care should be taken to provide a detailed schematic of the various parts of the dynamical model and the different sections of the population considered along with a more detailed description of the NPIs.

2. Furthermore, reading the figures is a bit difficult since the baseline scenario is not the baseline "no-intervention" scenario (as one would intuitively expect) but with all NPIs in place. it could be better to relabel the individual curves so that "no-quarantine" becomes the baseline "no-intervention" scenario.

3. The authors provide a link to the repository where the code for the model is available. While laudable, it is difficult to know where and how to begin using the code since a complete Readme file has not been provided. Given the ever-growing importance of computer simulations, especially in ABMs, it is important that end-users be provided with all the necessary information to be able to run and explore the results of the model. Since the computer code is an integral part of the research underlying this paper, it is important that the authors take the time to address this point.

Moving on to specifics:

1. Lines 193-194: Are there any parameter configurations, other than those presented in the paper that lead to similar results? If so, how dissimilar are these parameter configurations as compared to those presented in the paper?

2. Line 218: What economic sectors are the authors referring to here?

3. Line 219: There seems to be an apparent contradiction here: While establishing home office worker proportions to pre-pandemic times leads to a marked increase in the number of cases, it is strange to see that people working "normal hours" (and thus presuambly going to work) doesn't increase case numbers. Why is this the case?

4. Fig 4: Why is it that the "no quarantine" scenario also leads to the share of cases in the "at-risk" population to reach levels similar to those of "open schools" and "normal home office"?

5. Fig 4: Caption in the body of the paper reads "Number of infected agents" while the figure itself plots the "share of the at-risk population".

Finally, at many points in the paper, the authors state the results of the simulations without providing any explanation as to what the probable causes could be. These are detailed below:

1. Line 216: Why is it the case that quarantine is a less important NPI in the two smaller states?

2. Lines 228-229: The authors write that the consequences of opening up all educational institutions are very different than if only one type of educational institution is opened? Why is this the case? Perhaps the survey data should be able to throw some light on the question.

3. Line 247: Why is it the case that the opening up of universities seems to play only a small role in the total number of cases?

A few typographical errors

1. In the abstract, line number 5 "researcher" should read "researchers".

2. Line 12: "decision-maker" should read "decision-makers".

3. Every instance when a quotation mark has been used must be corrected, for instance in lines 35 and 85.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Oct 28;16(10):e0259108. doi: 10.1371/journal.pone.0259108.r002

Author response to Decision Letter 0


25 Sep 2021

The authors would like to thank the editor and reviewers for their time, attention and those valuable comments.

The comments significantly helped improving the manuscript, especially regarding the model description and output analysis.

We have addressed all the issues raised in this rebuttal letter.

+++ Editor +++

COMMENT 1.1

In particular, two are the aspects that need particular attention:

Model description: Reviewer #3 has noted that the model should be described more precisely and with further details in order to be completely clear to the reader. We invite authors to expand the description.

Answer:

We have revised the section on Material and methods and added information on the design of the model and simulation experiments (cf. comment 4.2).

COMMENT 1.2

Figure explanation and discussion: All reviewers have noted that figures are not completely clear, either in the discussion or in the caption. We invite authors to revise all figures, their captions, and the discussion of their content according to the comments of all reviewers.

Answer:

We facilitated the readability of the figure captions and also followed reviewers advise on how to improve their appearance.

COMMENT 1.3

References: Pay specific attention to the following Journal Requirements, pointing to a possible problem with references to retracted articles. References should be carefully checked before resubmission, each one should have a clear rationale for being cited. All changes in the references should be reported in the rebuttal letter to inform the reviewers.

We have checked all our references and have found no article retracted from a journal in our reference list.

Nevertheless, we have revised our references in regard to preprints.

Accordingly, we updated preprints that have been published in journals.

Preprints that have not yet been published in a journal and are not essential to this study have been removed as references.

Furthermore, we have added three more references.

Removed references:

Chao DL, Oron AP, Srikrishna D, Famulare M. Modeling layered

non-pharmaceutical interventions against SARS-CoV-2 in the United States with

Corvid. medRxiv. 2020; p. 2020.04.08.20058487. doi:10.1101/2020.04.08.20058487.

Preprints replaced by the journal publication:

Kerr CC, Stuart RM, Mistry D, Abeysuriya RG, Hart G, Rosenfeld K, et al.

Covasim: an agent-based model of COVID-19 dynamics and interventions.

medRxiv. 2020; p. 2020.05.10.20097469. doi:10.1101/2020.05.10.20097469.

Alipour JV, Fadinger H, Schymik J. My Home is My Castle - The Benefits of

Working from Home During a Pandemic Crisis: Evidence from Germany.

University of Bonn and University of Mannheim, Germany; 2020.

crctr224 2020 178.

Hunter PR, Colon-Gonzalez F, Brainard JS, Rushton S. Impact of

non-pharmaceutical interventions against COVID-19 in Europe: a

quasi-experimental study. medRxiv. 2020; p. 2020.05.01.20088260.

doi:10.1101/2020.05.01.20088260.

Added references:

Grimm V, Berger U, Bastiansen F, Eliassen S, Ginot V, Giske J, et al. Astandard protocol for describing individual-based and agent-based models.Ecological Modelling. 2006;198(1-2):115–126. doi:10.1016/j.ecolmodel.2006.04.023.

Grimm V, Railsback SF, Vincenot CE, Berger U, Gallagher C, DeAngelis DL,et al. The ODD Protocol for Describing Agent-Based and Other SimulationModels: A Second Update to Improve Clarity, Replication, and StructuralRealism. Journal of Artificial Societies and Social Simulation. 2020;23(2):7.

Destatis. Arbeitnehmerverdienste. 2. Vierteljahr 2020. Fachserie 16 Reihe 2.1.; 2020.

+++Reviewer 1 +++

COMMENT 2.1

In this manuscript, the authors model the effects of non-pharmaceutical interventions to investigate the spread of COVID-19 for some states in Germany, using an agent-based model with a heterogeneous population based on survey data from the German Socio-Economic Panel. From the results of the simulations, they found that some scenarios, as quarantine and home-office, are the most effective measures to reduce COVID-19 cases. They also investigate and evaluate the effect of some NPIs on the number of COVID-19 cases in the risk group (age > 60 years). The research is interesting and presents a model that can be useful to assist in decision-making regarding COVID-19, a current health problem of great concern. The manuscript is well written, the methodology is sufficiently detailed and, the conclusions are supported by the results found. I consider that the article is scientifically sound and suitable for publication in PLOS ONE. I have a few comments nonetheless:

Answer:

We thank reviewer 1 for the overall positive assessment of our work and the helpful comments to further improve the manuscript.

COMMENT 2.2

It is clear from the text that Figure 1 is the cumulative number of COVID-19 cases, but maybe the authors can make this clear in the caption or y-axis of the figure to avoid doubts. Does Figure 2 represent accumulated cases as well?

Answer:

We have changed the y-axis of Figure 1 and 2 to "Cumulative COVID-19 cases / 100.000" to highlight that cumulative case counts are reported in both figures.

In addition, we have revised the caption of the figures.

COMMENT 2.3

Perhaps the authors can explore the results of Figure 4 a little further. Are the differences in the behavior of the curves in the four states linked to demographic variation? Could the finding "the scenarios in which schools re-open and pre-pandemic home office rates are attained lead to the lowest proportion among the elderly risk-group" be influenced by the fact that the model does not consider contacts with relatives or friends (as children transmitting to their grandparents, for example)? (I understand that this would be reduced in the case of severe isolation measures, but it is an important point if the model is used in places where children live or stay with older relatives, as in some underdeveloped countries).

Answer:

We thank the reviewer for these very important issues.

Also one of the motivations of this project has been to link case numbers to demographic variation between different states, it turned out to be not feasible in the context of this article with its many scenarios and few considered states.

We approach this issue in future research in an update of the model together with a more specific research design.

Nevertheless, we have added paragraphs on page 10 linking the results of Figure 2 and 3 to demographic properties of all states, particularly household composition.

Furthermore, we think the reviewer is correct that the low proportion of infections among the elderly risk-group is partly caused by not explicitly considering contacts between households of relatives and friends.

While we assume this model design is appropriate for the early pandemic phase in germany 2020, the reviewer is right that this might be inappropriate if applied to other countries (or other times).

Thus, we have added multiple phrases addressing this issue on page 11.

COMMENT 2.4

In the caption of Figure 4, the title says "Number of infected agents" while the legend says "share of cases". I suggest that authors standardize the name so that "number" is not confused with the absolute number. Also, perhaps the authors could consider presenting the percentages infected for each age group, not only for >=60 (maybe in a supplementary table with the final infected number or percentage), but this is just a suggestion.

Answer:

We have changed the title of Figure 4 to "Share of infected agents belonging to the risk-group (age $>=$ 60), by state.".

Based on the suggestion, we have added Table S7, listing each age-group's share in the infected population.

COMMENT 2.5

Perhaps the authors might consider changing the yellow color in Figs 2 and 4, which can be difficult to see depending on the readers’ screen.

Answer:

Thank you very much for this remark.

We changed the color scheme and avoid yellow in all figures.

COMMENT 2.6

On page 6, line 302, if I am not mistaken, it seems like “Figures 5-9 in SI” means Figures S1-S5.

\\end{mdframed}

Answer:

That is absolutely correct.

We have corrected the respective passage accordingly.

+++ Reviewer 2 +++

COMMENT 3.1

In this paper, “Modeling non-pharmaceutical interventions in the COVID-19 pandemic with survey-based simulations”, the authors aim to develop an agent-based model concerning the “what-if” fundamental question in implementing the NPIs’ during the COVID-19 pandemic.

The topic is interesting and the document is well presented. While the results trends are illustrative enough in the figures, I suggest to add a table which contains the standard deviation, mean and CI for each intervention removal's configuration. These tables, particularly for figure 3 and 4, make the comparison among baseline and other configurations more clear.

From my point of view the submitted paper is suitable for publication in PLOS ONE in the presented form.

Answer:

We thank reviewer 2 for this positive evaluation of our manuscript and the valuable suggestion to include additional tables on the results of the simulation experiments.

Based on this comment, we have added Table S5 and S6 to the appendix, which contain the requested numbers.

+++ Reviewer 3 +++

COMMENT 4.1

The approach adopted in the paper is novel and is a promising avenue of further research in agent-based models. While agent-based models have had success as diagnostic tools and have been able to provide great insight into many systems of interest, convincing applications that integrate real-world micro-level data have been few and far between. By basing their model on data from surveys on the behavior and routines of the people from 4 provinces of Germany, the paper is extremely important towards ABMs becoming more widespread as a modeling paradigm.

Having said that, the paper in question does suffer from a certain number of deficiencies, which undermine the message and the importance of the work. Let me begin by some general comments before providing specific comments:

Answer:

We thank reviewer 3 very much for their careful evaluation of our manuscript, the detailed, critical and helpful comments, as well as for the appreciation of our work.

COMMENT 4.2

1. A major shortcoming of the paper is that it isn't clear what the model actually is. Care should be taken to provide a detailed schematic of the various parts of the dynamical model and the different sections of the population considered along with a more detailed description of the NPIs.

Answer:

Following Reviewer 3's advise, we have completely revised the section on "Material and methods".

The model description now starts with a description of all entities (subsection "Entities and scales"), followed by a description of the initialization process (subsection "Initialization"), and the schedule of the main processes (subsection "Main simulation loop") including the content of these processes (subsections "Activities" and "Disease process and virus transmission").

In two subsections ("Non-pharmaceutical interventions" and "Scenarios and model execution"), we have also added a detailed description of the NPIs and how the scenarios were designed.

The newly added Table 1 and Table 2 give an overview of the implementation of NPIs in the baseline scenario and how the counterfactual scenarios differ from the baseline scenario.

COMMENT 4.3

2. Furthermore, reading the figures is a bit difficult since the baseline scenario is not the baseline "no-intervention" scenario (as one would intuitively expect) but with all NPIs in place. it could be better to relabel the individual curves so that "no-quarantine" becomes the baseline "no-intervention" scenario.

Answer:

We appreciate this comment, yet the "no-quarantine"-scenario is not a scenario without any interventions, but only without quarantining households.

The extended description of NPIs and the baseline scenario in response to comment 4.2 should make this more clear in the revised manuscript.

COMMENT 4.4

3. The authors provide a link to the repository where the code for the model is available. While laudable, it is difficult to know where and how to begin using the code since a complete Readme file has not been provided. Given the ever-growing importance of computer simulations, especially in ABMs, it is important that end-users be provided with all the necessary information to be able to run and explore the results of the model. Since the computer code is an integral part of the research underlying this paper, it is important that the authors take the time to address this point.

Answer:

We have added a README file containing a step-by-step guide for setup and execution of the model.

The README file can be found via this link: https://github.com/mariuzka/covid19_sim#readme.

COMMENT 4.5

Moving on to specifics:

1. Lines 193-194: Are there any parameter configurations, other than those presented in the paper that lead to similar results? If so, how dissimilar are these parameter configurations as compared to those presented in the paper?

Answer:

We infer two parameters ("time from symptoms to quarantine", "infection probability") by calibrating the model to empirical infection data (cf. subsection "Calibration", p. 8).

We tested a wide range of configurations of those two and take the combination with the best model fit.

To illustrate this process in greater detail, we added Fig S6 in the appendix.

It clearly shows that the chosen combinations are no outliers.

COMMENT 4.6

2. Line 218: What economic sectors are the authors referring to here?

Answer:

With 'certain economic sectors' we meant both the NACE-sectors that had to partially close their business as well as the overall work time reduction due to the economic down turn in the most affected branches.

The NACE-sectors affected by partial shutdown are I, R, G, P and S (as now listed in Table 1).

Which occupational fields are most affected by the overall reduction of working hours is now listed in Table S3.

The newly added description of the NPIs and the scenarios should now make this point clear as well.

Because we now find that the wording 'certain economic sectors' is to unspecific, we have changed the corresponding phrase to "At the other end of the efficacy spectrum is the scenario in which there was no reduction of work hours.", which can be found at page 9 of the updated version of the manuscript.

COMMENT 4.7

3. Line 219: There seems to be an apparent contradiction here: While establishing home office worker proportions to pre-pandemic times leads to a marked increase in the number of cases, it is strange to see that people working "normal hours" (and thus presuambly going to work) doesn't increase case numbers. Why is this the case?

Answer:

This contradiction can be explained by the fact that changes in working hours due to business closures and economic downturns tend to be small compared to changes in hours worked in the office due to the (industry-specific) exhaustion of the possibilities of working from home.

In the updated version of the manuscript one can see the changes in the hours worked at the work place due to business closure and economic downturn in Table 1 and Table S3, while changes due to an extended home-based work arrangement are shown in Table S4.

Furthermore, one has to consider that in the counterfactual scenario where no reduction of work takes place the instruction to work from home whenever possible is still in place.

Thus, the effect of the number of working hours is suppressed by the effect of the amount of hours worked from home.

Based on this comment, we have added a paragraph explaining this issue at page 9.

COMMENT 4.8

4. Fig 4: Why is it that the "no quarantine" scenario also leads to the share of cases in the "at-risk" population to reach levels similar to those of "open schools" and "normal home office"?

Answer:

We think this comment might be related to the ambiguous explanation of the baseline scenario in the prior version of the manuscript (see also comment 4.3).

It should now be clear that the "no quarantine" NPI mentioned by Reviewer 3 is not the baseline scenario.

COMMENT 4.9

5. Fig 4: Caption in the body of the paper reads "Number of infected agents" while the figure itself plots the "share of the at-risk population".

Answer:

We have corrected this error.

In the revised version, the caption of Figure 4 now reads "Share of infected agents belonging to the risk-group (age>=60), by state.".

COMMENT 4.10

Finally, at many points in the paper, the authors state the results of the simulations without providing any explanation as to what the probable causes could be. These are detailed below:

1. Line 216: Why is it the case that quarantine is a less important NPI in the two smaller states?

Answer:

Unfortunately, providing a detailed explanation for all mechanisms at work is beyond the scope of our paper.

We focus on the development of a model that allows the inspection of counterfactual NPIs, resting on simulations with survey-based agents.

While investigating each underlying effect and its driving factor is, in general, possible with this model, the current paper is, in our view, sufficiently complex by providing the model description, model fit and a "ranking" of NPIs' efficacy.

However, we added further explanations as suggested by Reviewer 3.

COMMENT 4.11

2. Lines 228-229: The authors write that the consequences of opening up all educational institutions are very different than if only one type of educational institution is opened? Why is this the case? Perhaps the survey data should be able to throw some light on the question.

Answer:

An analyses of the household compositions reveals that kindergarten kids and school kids often live together with siblings that also go to kindergarten or school.

Thus, if both institutions are open, those households provide an open path the virus can take reaching other societal areas.

In addition, those households are often relatively big and also include working parents.

In order to clarify this point in the manuscript, we have added paragraphs on page 9 and 10 as well as the tables S8 and S9.

COMMENT 4.12

3. Line 247: Why is it the case that the opening up of universities seems to play only a small role in the total number of cases?

Answer:

Universities seem to play a minor role in our model, because the share of students is small compared to other groups, students live in smaller households which rarely include kindergarten- or school-age kids.

The manuscript now contains a paragraph on page 10 explaining this issue.

COMMENT 4.13

A few typographical errors

1. In the abstract, line number 5 "researcher" should read "researchers".

2. Line 12: "decision-maker" should read "decision-makers".

3. Every instance when a quotation mark has been used must be corrected, for instance in lines 35 and 85.

Answer:

We have corrected the mentioned typographical errors.

Attachment

Submitted filename: Response to Reviewers.pdf

Decision Letter 1

Marco Cremonini

13 Oct 2021

Modeling non-pharmaceutical interventions in the COVID-19 pandemic with survey-based simulations

PONE-D-21-14064R1

Dear Dr. Kaffai,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Marco Cremonini, Ph.D.

University of Milan, Italy

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Marco Cremonini

20 Oct 2021

PONE-D-21-14064R1

Modeling non-pharmaceutical interventions in the COVID-19 pandemic with survey-based simulations

Dear Dr. Kaffai:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Marco Cremonini

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Values and data sources of simulation parameters.

    * At the time of conducting our simulation experiments, we used the most recent values listed in the latest version of Covasim and the associated article. Unfortunately, these values have changed (slightly) in the final release of Covasim. Nevertheless, we do not expect these changes to have a significant impact on our model results [34, 36, 47, 48].

    (PDF)

    S2 Table. Descriptive statistics on federal states [34, 49, 50].

    (PDF)

    S3 Table. Reduction of work hours per NACE-section.

    S3 Table gives the percentages to which the work hours were reduced in the second quarter (April, May, June) of 2020 as compared to 2019 per NACE-section. We assume that the differences between 2020 and 2019 are mainly caused by the COVID-19 pandemic. The values are taken from [51].

    (PDF)

    S4 Table. Amount of working from home per NACE-divisions.

    The column “Normal” gives the share of people frequently working at home office per NACE-division. The column “Maximum” gives the share to which the work could be done in home office i.e. the maximum capacity of working from home per NACE-division. We interpret all values as share of work hours per NACE-division. We assume that in the total lock- and shutdown in the first wave of COVID-19 in Germany 2020 the full capacity of working from home was exploited in each NACE-division. All values are taken from [35].

    (PDF)

    S5 Table. Case numbers on day 100.

    S5 Table shows the cumulative number of infected agents per 100,000 agents after 100 simulated days, averaged over 60 replications per state and scenario, along with the standard deviation and 95% confidence interval.

    (PDF)

    S6 Table. Case numbers on day 100 as the difference to baseline scenario.

    S6 Table shows the cumulative number of infected agents per 100,000 agents after 100 simulated days, averaged over 60 replications per state and scenario, as the difference from the corresponding baseline scenario, along with the standard deviation and 95 percent confidence interval.

    (PDF)

    S7 Table. Infected agents by age-group.

    S7 Table shows the proportion of each age group in the cumulative number of infected agents on simulation day 100.

    (PDF)

    S8 Table. Share of vocational statuses per federal state.

    S8 Table shows the proportion of different vocational statuses per federal state in the population of agents. “None/other” are those agents who are neither working, studying, or of kindergarten or school age. “Working” means any agent who works more than zero hours per day and is not enrolled at a university.

    (PDF)

    S9 Table. Household composition by vocational status and state.

    S9 Table shows for each vocational status the average number of agents living in the same household by their vocational status. For instance, on average, an agent classified as a kindergarten child in Baden-Wuerttemberg lives in a household that contains 1.53 kindergarten children, 0.63 schoolchildren, 0.03 students, 1.4 employed persons, and 0.58 other persons. “None/other” are those agents who are neither working, studying, or of kindergarten or school age. “Working” means any agent who works more than zero hours per day and is not enrolled at an university.

    (PDF)

    S1 Fig. Robustness check: Hours at kindergarten.

    Compares the number of COVID-19 infections in the baseline scenario across different values for the daily hours at kindergarten. The value in the center is the one used in the main analysis.

    (TIF)

    S2 Fig. Robustness check: Hours at school.

    Compares the number of COVID-19 infections in the baseline scenario across different values for hours at school. The value in the center is the one used in the main analysis.

    (TIF)

    S3 Fig. Robustness check: Hours at university.

    Compares the number of COVID-19 infections in the baseline scenario across different values for hours at university. The value in the center is the one used in the main analysis.

    (TIF)

    S4 Fig. Robustness check: Number of colleagues.

    Compares the number of COVID-19 infections in the baseline scenario across different values for number of colleagues. The value in the center is the one used in the main analysis.

    (TIF)

    S5 Fig. Robustness check: Maximum number of students per university.

    Compares the number of COVID-19 infections in the baseline scenario across different values for the maximum number of students per university. We used 100,000 in the main analysis, which means that there is only one university per simulated world.

    (TIF)

    S6 Fig. Sampled parameter space.

    S6 Fig shows the tested combinations of the two calibrated parameters and the respective model fit when using the combination in the baseline model. The parameter combination that gives the best model fit is indicated by the cross.

    (TIF)

    Attachment

    Submitted filename: Response to Reviewers.pdf

    Data Availability Statement

    The simulation output data and the code to reproduce the plots and result tables are stored in this repository: DOI 10.17605/OSF.IO/B9YPG. Unfortunately, we are not allowed to share the Socio-Economic Panel (SOEP) data or parts of it due to a very restrictive data access policy. Therefore, we cannot share data on the agent population based on the SOEP. The SOEP is a representative panel study for Germany conducted annually by DIW Berlin. Researchers affiliated with an institution can request access to SOEP at this website: https://www.diw.de/en/diw_01.c.601584.en/data_access.html. All of the code used to run the simulation and the publicly available datasets used to calibrate the simulation model are available in this repository: https://github.com/mariuzka/covid19_sim.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES