Skip to main content
PLOS One logoLink to PLOS One
. 2021 Oct 28;16(10):e0259037. doi: 10.1371/journal.pone.0259037

Predicting the effects of COVID-19 related interventions in urban settings by combining activity-based modelling, agent-based simulation, and mobile phone data

Sebastian A Müller 1, Michael Balmer 2, William Charlton 1, Ricardo Ewert 1, Andreas Neumann 3, Christian Rakow 1, Tilmann Schlenther 1, Kai Nagel 1,*
Editor: Itzhak Benenson4
PMCID: PMC8553173  PMID: 34710158

Abstract

Epidemiological simulations as a method are used to better understand and predict the spreading of infectious diseases, for example of COVID-19. This paper presents an approach that combines a well-established approach from transportation modelling that uses person-centric data-driven human mobility modelling with a mechanistic infection model and a person-centric disease progression model. The model includes the consequences of different room sizes, air exchange rates, disease import, changed activity participation rates over time (coming from mobility data), masks, indoors vs. outdoors leisure activities, and of contact tracing. It is validated against the infection dynamics in Berlin (Germany). The model can be used to understand the contributions of different activity types to the infection dynamics over time. It predicts the effects of contact reductions, school closures/vacations, masks, or the effect of moving leisure activities from outdoors to indoors in fall, and is thus able to quantitatively predict the consequences of interventions. It is shown that these effects are best given as additive changes of the reproduction number R. The model also explains why contact reductions have decreasing marginal returns, i.e. the first 50% of contact reductions have considerably more effect than the second 50%. Our work shows that is is possible to build detailed epidemiological simulations from microscopic mobility models relatively quickly. They can be used to investigate mechanical aspects of the dynamics, such as the transmission from political decisions via human behavior to infections, consequences of different lockdown measures, or consequences of wearing masks in certain situations. The results can be used to inform political decisions.

Introduction

When COVID-19 took hold in Germany in February 2020, there was an urgent need for a differentiated modelling capability to predict the consequences of interventions. We used our experience with person-centric modelling of traffic [1] to build a first prototype within two weeks [2]. An advantage of using this starting point is that the whereabouts of all simulated persons, including their overlapping time spent at facilities or in (public transport) vehicles, are already given by the model, which is derived in part from mobile phone data. Since the input data contains age as an attribute of each synthetic person, it was straightforward to include agent-dependent disease progression into the model from the start. A short description of the different model variants over time is provided in S2 Text.

The model is regularly used to advise the German federal government (e.g. [3, 4]). The main contribution of those reports was and is to provide differentiated predictions of the influence of various interventions, such as reductions of activity participation, masks, or vaccinations. For the present paper, we show the contributions of different activity types to the infection dynamics as predicted by the model. We show how most activity types generate over time fairly constant contributions to the reproduction number R, independent from the actual level of R. In consequence it is structurally more stable to report reductions of R caused by interventions as an additive term, rather than a term that is relative to the overall level of R as is usually done (e.g. [5]). The model also explains why there are decreasing marginal returns to stay-at-home interventions [6]. Finally, the model makes a prediction concerning the magnitude of the difference between summer and winter, caused by moving activities indoors during winter.

Related work

Compartmental models

The general dynamics of virus spreading are captured by compartmental models, most famously the so-called SIR model, with S = susceptible, I = infected/infectious, and R = recovered [7]. Every time a susceptible and an infectious person meet, there is a probability that the susceptible person becomes infected. Some time after the infection, the person typically recovers. Variants include, e.g., an exposed (but not yet infectious) compartment between S and I.

Instead of running these models with compartments, one can run them on a graph [8, 9]. Persons are represented as vertices, connections between persons are denoted as edges. The random interactions that are implied by the compartmental models are then replaced by interactions with graph neighbors.

In reality, these interactions change from day to day; in particular, possible superspreading events like weddings or other large gatherings cannot be encoded in a static graph. For this, temporal networks have been investigated ([9], section VIII).

An advantage of compartmental models is that their runtime is independent from system size; in that way, it is easily possible to run a model for a country or a continent. A disadvantage is that one needs a separate compartment for each attribute combination (e.g. age × activity pattern × disease state), and that mechanical aspects such as the reduction of virus intake by masks, are difficult to include into the model. A special case is [10]: It treats each census block as a subpopulation, computes how virus travels from one census block to another via points-of-interest with visitors from both census blocks, and also has internal virus dynamics in each census block. The differences to our work are discussed in more detail in S5 Text.

Person-centric epidemiological modelling

An alternative to compartmental models is to use synthetic persons as the starting point for modelling, and to “consider nodes as entities where multiple individuals or particles can be located and eventually wander by moving along the links connecting the nodes” [9]. Examples of such models can be found since approximately 2004 [1114]. A model of this type by Imperial College [15] had a large impact on policy in the UK. Other recent developments are [16, 17] on the global scale, or [18, 19] on the urban scale. These models typically follow individual synthetic persons. However, most of them, with the possible exception of the Virginia Biotechnology Institute model [12, 20], have explicit person movements only for commute patterns; all other infections are assumed to be in a local environment.

Aleta et al. [21] construct an agent-based model, similar to ours. Their data derives from persons specifically recruited to collect their long-term trajectories. They have long trajectories, with high spatial precision, but for only 2% of the population. This is still an impressive sample; however, with our work we aim for models where we have as many synthetic persons in the model as there are persons in reality.

A special case is by Kucharski et al. [22], who use a pre-existing dataset with recorded social contacts for 40 162 participants. This is close to our approach in that the persons who encounter each other for how long and in which context are microscopically specified. Differences include that it is not a model for the full population of a region, and the study does not trace behavioral changes throughout the pandemic.

Daily activity trajectories

Using daily activity chains as the basis for transport modelling is an established approach in the transport modelling community. An activity chain is a sequence of activities of a person, where activities have types such as home, work, shop, etc., starting and ending times, and locations. There are several ways to generate such activity chains, for example by using activity-based demand generation models (e.g. [23, 24]), by taking them from travel diaries (e.g. [25, 26]), by using mobile phone data (e.g. [27]), or by data fusion from open access data sources (e.g. [28]).

In the present situation, we needed a technology that was readily available, allowed uniform rollout at least in Germany, and that would allow updates along with changes in mobility behavior during the unfolding of the COVID epidemics. For that reason, we used an established process that generates activity chains mostly from mobile phone data [27]. The process is described in more detail in S1 Text. The outcome of the process are activity chains, encoded as events (cf. Fig 1), for as many synthetic persons as Germany has inhabitants. Since the activity chains stem from transport modelling, they also contain knowledge about trips between activities, importantly trips by public transport, and in consequence also contain, for each synthetic person, events when they enter or leave certain public transit vehicles.

Fig 1. Events for travel.

Fig 1

TOP: By individual vehicle. BOTTOM: By public transport. Source: [1].

Person-centric epidemiological models derived from transport simulations

From the section on person-centric epidemiological modelling above, one takes away that having person trajectories, and in particular where persons meet, would be useful for an epidemiological simulation. In consequence, the synthetic person trajectories from transport modelling explained above seem like a good starting point, since they are already available. Smieszek et al. [29, 30] and Hackl and Dubernet [31] construct epidemiological models on top of such pre-existing synthetic person trajectories; these are the main starting point for us. Najmi et al. [32] start from a person-centric transportation planning model for Sydney, and add a disease transmission model that computes possible infections based on co-locations during the simulated day. The approach is similar to ours, but does not use mobile phone data to track the actual mobility behavior. They also do not use an infection model that depends on the spatial situation of the activity type.

The model described in the present paper has always been open source, and earlier version have been described in preprints [2, 33, 34]. This has been picked up by by Manout and Ciari [35] for Montreal, and by Bossert et al. [36] for South Africa. It was also used as the “micro” part by [37].

From reductions of mobility behavior to reductions of infections

There are many data sets that track or analyze mobility changes “during Corona” [3843]. This is, however, not our primary focus; rather, we are interested in how the infection dynamics can be better understood and possibly predicted with the help of mobility and other data. A possible approach to achieve this is data mining [44, 45]. We are, however, interested in models with more detail.

Jia et al. [46] and Xiong et al. [47] look at how long distance travel influences the disease import; they find that a high inflow from areas with high incidences is positively correlated with high infection numbers. They do not, however, look at disease spread within the urban fabric, driven by daily movement patterns.

Fairly close to our work are Chang et al. [10], already mentioned earlier. They first construct, based on mobile phone data, a mobility network between census block groups and points of interest based on mobile phone data, and then use that model to investigate reopening strategies. They have a very detailed resolution of the facilities (they differentiate, e.g., between full-service restaurants, limited-service restaurants, and cafes/snack bars), but on the other hand they do not simulate individual synthetic persons. Similarities and differences are discussed in S5 Text.

Model details

Important sub-models of agent-based epidemics models are: contact model, infection model, and disease progression model. These are described in more detail in the following sections.

Mobility model and resulting contact model

As stated, we take the synthetic persons and their movements from transport modelling, cf. Fig 1. For the present study, the data is generated by a synthetic method developed by Senozon, see S1 Text. We have used and are using the same data for other projects [4851]. From these activity chains, we extract how much time people spend with other people at activities or in (public transport) vehicles. That is, infection opportunities are directly taken from the input data. Details, for example of how multiple days or weekends are modelled, are provided in Sec 1 of S1 Appendix.

Infection model

Once two persons are identified to have contact, and one of them is contagious and the other is susceptible, there is a probability of an infection. For this, we use the mechanical model by Smieszek [45]: infected persons generate a “viral load” that they exhale, cough or sneeze into the environment, and people close by are exposed. Overall, the probability for person n to become infected by this process in a time step t is described as

p(infect|contact)n,t=1-exp(-Θmshm,t·cinm,t·inn,t·τnm,t) (1)

where m goes over all other persons with which the person has contact at time t, sh is the shedding rate (∼ microbial load), ci the contact intensity, in the intake (reduced, e.g., by a mask), τ the duration of interaction between the two individuals, and Θ a calibration parameter. The model of Smieszek has the advantage that it was specifically developed with our transport simulation in mind, but there are many models of the same type (e.g. [52, 53]).

For small values of the exponent and just one contagious person in the room, one can approximate Eq (1) as

p(infect|contact)Θ·sh·ci·in·τ. (2)

We do not use this approximation in our computer implementation, but it helps understanding the following arguments. Fig 2 gives some intuition about when that approximation holds; evidently, the effect of Eq (1) is to saturate when the infection probability becomes large.

Fig 2. How duration (in hours) translates into the infection probability for two different contact intensities.

Fig 2

The linear approximation of Eq (2) is given for either curve by a dashed line.

All parameters can be given in arbitrary units as long as those units are always the same since the units are absorbed by Θ. If one wanted to use physical units, then one could decompose sh=s˜h·out with s˜h the, say, number of shedded virus particles per time. out (for a mask on the shedding side), ci, and in would be correction factors, i.e. 1 for a standard situation. τ would denote the time duration, so that the result would be the number V of virus particles that were inhaled during that time duration. Θ, or more precisely 1 − exp(−ΘV), would translate that number of virus particles into an infection probability. If that translation was known, one could attempt to calibrate the model from first principles. Practically, we use Θ as our main calibration parameter.

Contact intensities

For SARS-CoV-2, it is plausible to assume that a large share of the virus material is shed as aerosol [54]. In consequence, the first relevant term to compute the viral concentration in the air is the shedding rate, sh.

For such aerosols, it is plausible to assume that they mix quickly into the room, leading to the same uniform concentration everywhere [55]. Evidently, that concentration is inversely proportional to room size: if the room is twice as large, the resulting concentration is half as large.

Next, air exchange plays a role [55]. One could, for example, assume that the windows are opened once per hour, and all of the air is replaced with outside air. This would correspond to an air exchange rate of 1/h. If one assumes a constant rate of virus emission, there would be a linear increase of concentration up to the opening of the window, after which the virus concentration in the air would quickly go towards zero. The average virus concentration over this process would be half as much as the maximum concentration just before window opening. In consequence, the resulting average concentration is inversely proportional to the air exchange rate: If the air is exchanged twice as often, the resulting average virus concentration is half as large. This also holds for continuous air exchange, e.g. by mechanical means.

All of the above together replaces Eq 2 by

p(infect|contact)Θ·sh·inrs·ae·τ, (3)

where rs is the size of the room, and ae is the air exchange rate. That is, it sets the contact intensity ci from Eq (1) to

ci=1rs·ae. (4)

Again, the physical units are absorbed into Θ; note, however, that the air exchange rate ae is defined as the frequency of exchanging the air of the full room, and not of, say, cubic meters.

Aspects such as loudness of speech or if persons perform a physical activity are not taken into account in the present model although they are known to play important roles [56, 57], and they could, to some extent, be attached to the activity types. It is planned to include them in a future version of the model.

Estimation of room sizes

As stated above, our data resolves down to the level of “facilities”. These correspond roughly to buildings. In consequence, such a facility can be anything from a single family home to a large office building or a sports arena. Since these facilities are too large compared to typical rooms, we divide facilities into Nspaces rooms. Nspaces is set to 20; the argument for this number is given in Sec 1 of S1 Appendix.

Since our simulation tracks when persons are at facilities, we can, for each facility, obtain the maximum number of persons at that facility, NmaxpersonsAtFacility, over the day. In addition, one can obtain typical floor space per person, fs, from regulatory norms and other sources (see Table 1). This leads to

facilityFloorSpace=NmaxpersonsAtFacility·fs. (5)

Divided by NspacesPerFacility, this leads for the room size to

rs=facilityFloorSpaceNspacesPerFacility=NmaxpersonsAtFacilityNspacesPerFacility·fsroomCapacity·fs (6)

where roomCapacity is the maximum number of persons that are in the room during the day (thus its “capacity”); note that NspacesPerFacility = 1 for home activities (cf. Sec 1 of S1 Appendix).

Table 1. Normalized contact intensities ci′, relative to the contact intensity at home, cihome.
activity type area per person fs [m2] air exchange rate old bldg aeold [1/h] air exchange rate new bldg aenew [1/h] share old buildings resulting ci/cihome
home [58, 59] 22 0.5 0.5 1
schools and day care [60] 2 0.5 0.5 100% 11
universities 4 0.5 0.5 100% 5.5
public transport [61, 62] 0.33 2.0 10.0 50% 10
leisure [63] 1.25 0.5 10.0 50% 9.24
shop 10 0.5 1.5 10% 0.88
work [6466] 10 0.5 1.5 50% 1.47
errands 10 0.5 1.5 50% 1.47
business 10 0.5 1.5 50% 1.47

Both the floor area per person and the air exchange rate come from building manuals or similar standards; note the given references in the table. The share of old buildings/vehicles is an estimate. Universities are assumed to have twice as much space per student as schools. Shop, errands, and business are assumed to follow the same characteristics as work. The contact intensities are computed separately for old and new buildings, and then averaged according to the assumed share of old buildings.

Air exchange rate and normalized contact intensities

Inserting Eq (6) into (4) results in

ci=1roomCapacity·fs·ae1roomCapacity·ci (7)

with the “normalized” contact intensity

ci=1fs·ae. (8)

See Table 1 for values of ci′.

ci′ parameterizes the “closeness” of the interaction. This is, cf. Eq (7), divided by roomCapacity, which denotes the number of persons that fit into the room given typical usage. If we share a room with one infectious other person, then our probability to become infected is, all other things being equal, half as large if the room is twice as large. However, if the room is twice as large, then there will presumably also be twice as many persons in it, doubling our own risk, and thus in the average cancelling out the effect of the larger room size. This second effect is computed directly by our contact model (Sec. From reductions of mobility behavior to reductions of infections above), and thus does not have to be included into the conditional infection probability. This has the additional advantage that if a person is in large container outside its peak usage, the model will calculate a much reduced infection probability. Examples for this are public transport vehicles, premises for large events, or restaurants.

A side effect of this model is that the above division by NspacesPerFacility has no effect in first order. If n persons at the facility are all in one room, and one contagious person is added, the expected number of newly infected persons is np, where p is the individual probability to become infected according to Eq (1) with Eq (7). If the n persons are divided between NspacesPerFacility, and one contagious person is added into one room, the expected number of newly infected persons is (n/NspacesPerFacility) · p′, where p′ in first order is p · NspacesPerFacility, because roomCapacity in Eq (7) is divided by NspacesPerFacility. In consequence, in first order the expected number of newly infected persons in the divided facility is the same, np, as in the undivided facility.—Second-order corrections come from the fact that Eq (1) eventually saturates when infection probabilities become large—then the smaller room sizes reduce the number of infections.

Masks

The effectiveness of different mask types is taken from from [67], i.e. cloth masks reduce shedding and intake to 0.6 and 0.5 of their original values, surgical masks to 0.3 and 0.3, and N95 (FFP2) masks to 0.15 and 0.025. For some discussion of these values see Masks.

Children

Current research implies that the susceptibility and infectivity are reduced for children compared to adults. We model this by including the susceptibility and infectivity into Eq (1). For adults both parameters are set to one. For people below the age of twenty the infectivity is reduced to 0.85 and the susceptibility to 0.45 [68, 69]. Note that this does not mean that the infection probability for children is necessarily lower than for adults, because children are more likely to perform activities with a high contact intensity, as shown in Table 1.

Disease progression model

The disease progression model is taken from the literature [7075] (also see [76]). The model has states exposed, infectious, showing symptoms, seriously sick (= should be in hospital), critical (= needs intensive care), and recovered. The durations from one state to the next follow log-normal distributions; see Fig 3 (LEFT) for details. We use similar age-dependent transition probabilities as [15], shown in Fig 3 (RIGHT).

Fig 3. Disease progression model.

Fig 3

LEFT: State transitions [7075]. RIGHT: Age-dependent transition probabilities from infectious to symptomatic, from symptomatic to seriously sick (= requiring hospitalisation), and from seriously sick to critical (= requiring breathing support or intensive care). Source: [15], except that the numbers in the second column are divided by 2 (discussed in Under-reporting, and its variation over time).

Infecting another person is possible during infectious, and while showing symptoms, but no longer than 4 days after becoming infectious. This models that persons are mostly infectious relatively early through the disease [71], while in later stages the infection may move to the lung [72], which makes it worse for the infected person, but seems to make it less infectious to other persons.

Time-dependent inputs and calibration

Simulation runs

Although the approach was designed with uniform rollout throughout Germany in mind, the project, for reasons described in Sec 4 of S1 Appendix, mostly performed simulations for the metropolitan area of Berlin in Germany, with approx. 5 million people. A typical simulation run looks as follows:

  1. One or more exposed (i.e. recently infected) persons are introduced into the population.

  2. At some point, exposed persons become infectious. From then on, every time they spend time together with some other person in a vehicle or at some activity, Eq (1) is used to calculate the probability that the other person, if susceptible, can become infected (= exposed). If infection happens, the newly infected person will follow the same progression.

  3. Infectious persons eventually move on to other disease states, as described in Fig 3.

The model runs for many days, until no more infections occur and all persons have finished their paths through the disease progression.

Calibration

The calibration procedure undertaken for the present paper is described in the following sections. Calibration is performed by visual comparison, with first priority against the time series of the number of hospital patients in Berlin, and with second priority against the COVID case numbers in Berlin. The calibration procedure, as described in the following, is as much about which elements to include at all as about finding the right parameters. “Second priority” here means that if calibration against hospital numbers is undecided between two alternatives, then the case numbers are used in addition. The case numbers are only used with second priority since the screening procedure has been changed multiple times, which means that the resulting time series is not homogeneous and thus not useful for model calibration. In particular, under-reporting in the initial phases was much larger than later. More information about the COVID case numbers in Berlin can be found in Sec 2 of S1 Appendix. A formal calibration of Θ can be found in Sec. Out of sample prediction. The calibration includes the following elements:

  1. Calibration of the basic doubling time without reduction of activity participation

  2. Integration of spring disease import

  3. Calibration of the consequences of reduced activity participation

  4. Calibration of an indoors/outdoors effect for leisure activities depending on the temperature

  5. Integration of contact tracing, masks, and summer disease import

All calibrations concern Θ (cf. Eq 1); item 4 also involves defining threshold temperatures at which activities are moved outdoors at the end of the winter, and indoors at the end of the summer. All other aspects are data driven.

Unrestricted model

Most parameters of the model are taken from the literature, as explained earlier, in particular Fig 3. The remaining free parameters are, from Eq (1), Θ, sh, and in. We have set the base values of sh = in = 1. As mentioned before, we use these parameters to model the wearing of masks, meaning that they are reduced when masks are worn.

Fig 4 shows the unrestricted base case with four different values of Θ. One finds that the aggregated behavior at this level corresponds to that of typical S(E)IR models, i.e. exponential growth, followed by a maximum, followed by exponential decrease. Based on these plots, thetaFactor values of 1.0 or 1.2 seem plausible to be consistent with the initial growth. A thetaFactor of 1.0 corresponds to Θ = 0.000561.

Fig 4. Unrestricted base case.

Fig 4

LEFT: Case numbers. The green and red dots denote case numbers as reported by Robert Koch Institute [77]; the blue dots denote positive test fractions [78] multiplied by 200. RIGHT: Hospital numbers. Each simulation curve is averaged over 10 independent Monte Carlo runs with different random seeds; the shaded areas denote 5% and 95% percentiles of those 10 runs.

Spring disease import

We take the disease import from abroad from data published by RKI ([77], always on Tuesdays). Currently, for Germany this data is only available on a nationwide aggregated level. For this reason we scale it down to our Berlin model by using the population size. The data is dated on the reporting date and not on the actual date of becoming sick. Since the infection seeds are initiated into our model with the status exposed (cf. Fig 3) and it can be assumed that the reporting date is significantly after the exposure date, we date the data from RKI back by one week. The data provided by RKI is available as weekly values so we assign these values to the respective Monday and then interpolate between them. Since we assume underreporting in the RKI numbers, we multiply them by 4; this is discussed in Sec. Under-reporting, and its variation over time. The initially infected persons are drawn randomly from the population. The resulting disease import is shown in Fig 5. The description so far only concerns the spring disease import; for summer disease import see Sec 3 of S1 Appendix.

Fig 5. Disease import over time.

Fig 5

Based on data taken from [77] (always on Tuesdays), but multiplied by 4 in spring, and divided by 2 in summer (see text for Discussion).

An advantage about adding disease import is that the date of the first infection is no longer a free parameter: As shown in Fig 6, the disease import is sufficient to drive the first wave. The disease import data seems to lack some early cases, thus causing an initially nearly vertical increase in the simulation. The dynamics then settles onto the exponential increase shown in the previous section.

Fig 6. Unrestricted base case, but with initial disease import from data.

Fig 6

LEFT: new cases; RIGHT: hospital occupancies. One finds that the initial slope dynamics is rather independent from the thetaFactor.

In terms of calibration, the initial growth is, within limits, insensitive against changes of Θ, since it is dominated by the disease import. This can be explained by the fact that the exponential growth was running ahead in other areas, and in consequence the share of infected persons from those areas also grew exponentially. Only after travel was stopped did disease import also stop, and the dynamics in Berlin was dominated by internal processes.

Reductions of activity participation

During the unfolding of the epidemics, people decided or were ordered to no longer participate in certain activities. We model this by removing an activity from a person’s schedule, plus the travel to and from the activity. For example, if a person in their original plan goes from home to activity A and then back home, then the activity plus both trips are removed from the schedule. If a person in their original plan goes from home via activity A to activity B and then back home, and activity A is deleted, then the following elements are removed: (a) the trip from home to activity A; (b) activity A; (c) the trip from activity A to activity B. In the current model, the schedule is not repaired: neither is the home activity nor are other activities prolonged, and also the trip chain is not mended. See S6 Text for possible improvements here. The consequence of those activity and trip removals is that the person no longer interacts with people at that activity location, and in consequence neither can infect other persons nor can become infected during that activity, or while in public transport vehicles to and from that activity. Overall, this reduces contact options, and thus reduces epidemic spread.

A very important consequence of our modelling approach is that we can take that reduction in activity participation from data. That data comes from the same source as our original activity patterns. However, the activity type detection algorithm is not very good for these unusual activity patterns during the pandemics, as one can see in S1 Fig when knowing that all educational institutions were closed in Berlin after Mar/15. What is reliable, though, is the differentiation between at-home and out-of-home time, as displayed in Fig 7. One clearly notices that out-of-home activities are somewhat reduced after Mar/8, and dramatically reduced soon after. After some experimentation, it was decided to take weekly averages of the activity non-participation, and use that uniformly across all activity types in our model, except for educational activities, which were taken as ordered by the government.

Fig 7. Change in activity participation compared to the baseline for normal workdays.

Fig 7

All out-of-home activities are combined into one number. (*) denotes the first day of closures of schools, clubs, and bars; and (#) the first day of the so-called contact ban which came together with closures of all restaurants and non-essential stores.

To remove an activity with a certain probability, a random draw is made every time a synthetic person has that activity type in its plan. This means that the model assumes that, say for a 50% work reduction, there will be a different 50% subset of persons at work every day. This intervention, in consequence, does not sever infection networks, but just slows down the dynamics.

One takes from Fig 8 that the mobility reductions, as given by the mobility data, is by itself not sufficient to explain the decreasing case numbers during spring. Evidently, one could now reduce Θ, and this is what we have done in our early simulations. This, however, artificially reduces the infection dynamics, and means that the simulation will miss the second wave in fall.

Fig 8. Simulations with reductions of activity participation as obtained from mobility data.

Fig 8

LEFT: new cases; RIGHT: hospital occupancies.

Outdoors vs. indoors season

The probability of getting infected during an encounter depends on whether the encounter takes place indoors or outdoors. Outdoors, the probability of infection is significantly reduced compared to indoors. This is due to the fact that outdoors the air is constantly in motion and therefore aerosols cannot accumulate. We assume that an encounter outdoors decreases the infection probability by one magnitude [54, 79]. In countries like Germany, seasonality has a great influence on how much time people spend outdoors. In summer, people spend more time outdoors, while in winter they tend to spend more time indoors.

We include into our model that up to 100% of leisure activities are undertaken outdoors during summer, while that share reduces to 0% during winter. When an activity occurs outdoors, the otherwise identical computation of the infection probability is divided by 10. The model takes the actual temperatures as input; if the daily maximum temperature is larger than T* + 5C, then all leisure activities that can happen outdoors are outdoors; if the daily maximum temperature is smaller than T* − 5C, then all leisure activities happen indoors; in between, probabilities are linearly interpolated. We use T* = 17.5C in spring, linearly increasing to T* = 25C in fall; using a lower T* in spring is behaviorally plausible in Germany, and yields a far more plausible infection dynamics than keeping them the same.

The justification for this is as follows. A survey on physical activities [80] shows that, in summer, people in Germany perform about 80% of their physical activities outdoors, while this proportion shrinks to 10% in winter. We have assumed that other leisure activities (e.g. restaurants, visit friends) behave similarly. We also extend our range to 0 and 100% since the fluctuations of the temperature already lead to average values that are more than 0 and less than 100% (cf. Fig 9).

Fig 9. Outdoors fraction for activities of type leisure, depending on the temperature of each day.

Fig 9

Fig 10 shows an example of the infection dynamics where both T* in spring and T* in fall are 17.5C; as one can see, either the decrease of the first wave is not strong enough, or the second wave comes too late; note in particular the hospital numbers, which for all values of thetaFactor do not have enough slope in the second wave. The results with other T*, as long as they are the same in spring and fall, are the same. Fig 11 shows instead using 17.5C for spring and 25C for fall; the second wave now is triggered earlier, and it is steeper. Fig 9 shows the outdoors fractions for this model.

Fig 10. Simulations that now also include a symmetric indoors/outdoors model, with a threshold temperature of 17.5C both in spring and in fall.

Fig 10

LEFT: new cases; RIGHT: hospital occupancies. A thetaFactor between 0.6 and 0.8 is most plausible, but the second wave would come too late (starting after September) and would not be steep enough (compare slope of red dots in right plot after September) (cf. in particular the hospital numbers).

Fig 11. Simulations that now also include an asymmetric indoors/outdoors model, with a threshold temperature of 17.5C in spring, and 25C in fall.

Fig 11

LEFT: new cases; RIGHT: hospital occupancies. A thetaFactor between 0.6 and 0.8 is most plausible, which would well reproduce the second wave (cf. in particular the hospital numbers).

There were some restrictions concerning leisure activities in place in fall. They mostly concerned large events. We know from our mobility data that all activities were at their normal level in September 2020; in consequence, if anything, they were divided into smaller groups.

Masks, contact tracing, and summer disease import

From Fig 11 one takes away that a good calibration with the elements described so far would be possible, with a thetaFactor between 0.6 and 0.8. Nevertheless, we add masks (in public transport and shopping), contact tracing, and summer disease import, since they are plausible elements of the dynamics. In particular, they result in the prediction of reduced infection numbers for public transport and shopping, which both is plausible. This is described in more detail in Sec 3 of S1 Appendix.

Final model

The final model is shown in Fig 12, where the blue line traces the number of new cases with state showingSymptoms from our simulation. Fig 12 (right) shows the cases in need of hospital care and those in need of ICU care from our simulation compared to real data. As stated, we find fitting to the hospital numbers more important; fully fitting to the case numbers is not possible with just one Θ that is constant across the whole simulation. Note that this implies, as stated, a strong deviation of the model curve (in blue) from the reported numbers (in red and green) during the first months. Also see Under-reporting, and its variation over time in the Discussion.

Fig 12. Final model.

Fig 12

LEFT: new cases; RIGHT: hospital occupancies. All simulation results are averaged over 10 runs with different Monte Carlo seeds; the shaded areas denote 5% and 95% percentiles of those 10 runs. Evidently, the relative errors become larger with smaller case numbers. The simulation model can only be fitted against the hospital numbers (right) when significant under-reporting is assumed in the early phase (left).

Methods and results

Infections per activity type

Evidently, in our microscopic models we can track how many infections happen at which activity type. Fig 13 shows, on top, the absolute numbers of infections per activity type for the simulation, and below the share of infections per activity type over time. To obtain these numbers, we evaluate what activity the infected person is performing at the time of infection and date that to the date of infection.

Fig 13. Infections per activity type.

Fig 13

TOP: absolute numbers. Note logarithmic scale. BOTTOM: Share of infections per activity type. The values are averaged over the same 10 runs as for the other figures, and in addition aggregated into weekly bins. One can see, for example, the return to school near the beginning of August, and the fall vacations in October.

Initially, all activity types play a role. After the closure of the universities, schools, and day care in March, both their absolute numbers and their shares go to zero. At the same time, the infections share of work (gray) in April and May reflects that persons were drifting back to normal activity patterns (cf. Fig 7). Leisure (green) would have shown the same trend, but that was counter-acted by the increasing shift of activities to outdoors. In the bottom plot, the purple line shows how the share of infections in public transit decreases significantly near the end of April because of increased wearing of masks. (Recall that we use observed mask compliance.) In July we see how day care (blue) picks up, because it was re-opened. Schools re-open in the second week of August, and pick up accordingly (brown). Also, two weeks of school vacation in October are clearly reflected in the brown curve. From September on we then see a strong increase of the infections share of leisure activities—corresponding to moving leisure activities from outdoors to indoors as explained earlier.

Reproduction number

Since our method is person-centric, we can, for each infected person n, count the number of persons that that person infects, i.e. its reproduction number, Rn. When averaging over multiple persons, one needs to make a decision to which date Rn is assigned. We use the date when n turned contagious, and in consequence

R(t)=nC(t)Rn,

where C(t) refers to all persons who turned contagious on day t. An issue with this approach is that the consequences of interventions become visible in R(t) before the interventions actually start—since the infections that are suppressed happen later than t. This is also the reason why we use the date when turning contagious and not the date when they got infected, since that would increase that temporal gap even more. Fig 14 shows the resulting values, with R(t) much larger than one in the initial phase, then lower than one until the end of summer, and then increasing to above one in fall. We do not offer a comparison with the official R values since they have the same issues as the official case numbers.

Fig 14. Reproduction number R(t) for the duration of the simulation.

Fig 14

As explained in the text, we explicitly count the reproduction number per agent, and then average them over all agents that turned contagious on a given day.

Reproduction number per activity type

More insightful than the number or share of infections, as presented in Sec. Infections per activity type above, is the average reproduction number in each activity type. The method counts for each infected person the number of persons they infect at each activity context. As in Sec. Reproduction number above, the numbers are dated back to the date when the person became contagious, and then averaged over all those persons.

For example, an activity-specific R-value for school of Rschool(t) = 0.1 means that each person that becomes contagious on day t, in the average, infects 0.1 other persons at school. Evidently, if only 10% of persons turning contagious on day t have school anywhere in their activity pattern, then each such person would have to infect one other person in the school context in order to reach the population-average value of 0.1.

Adding up these activity-specific reproduction numbers leads to the overall reproduction number. This explains why, in first order, the overall reproduction number can be additively decomposed into the contributions of the different activity types.

One sees, in Fig 15, that the reproduction number at home remains roughly constant—a person who gets infected in any way reinfects on average about 0.35 persons at home. Work is related to the mobility data—if less time is spent out-of-home, then in the model less time is spent at work, leading to fewer infections. Schools were closed in the middle of March, and not reopened until the second week of August. Also, there is a school vacation during the second and third week of October. Day care according to the model has little effect. Day care was already re-opened partially in June, and fully in July. The reproduction number at leisure is strongly driven by the weather: If it is warm, the model assumes that most of the leisure activities take place outdoors, where they contribute little to the infection dynamics. In consequence, this effect plays an important role in spring, where the warmer temperature played as much a role as the reduction of the out-of-home activites. One also clearly sees the strong growth of the leisure reproduction number in fall, which according to these simulations is driving the second wave in Berlin. Public transport is strongly visible in March, until the obligation to wear masks was introduced. All other infection contexts, e.g. errands or business activities, are combined in the category “other”.

Fig 15. Reproduction number per activity type.

Fig 15

Reductions of R per intervention

Other papers, e.g. [5], report, for various interventions, corresponding percent reductions of R. Our model clarifies that it is structurally more robust to report the additive reduction of the reproduction number by a certain intervention. For example, according to our model closing schools removes the school reproduction number from the dynamics, and in consequence reduces R by about 0.15. If R is 1 when the intervention is introduced, this amounts to 15%; if R is 2, then this amounts to 7.5%.

Table 2 shows, based on simulations as explained in the previous section, the contributions to R of the different activity types. Adding up the boldface numbers leads to R = 2.26, i.e. a strongly super-critical situation. In contrast, the 2020 Germany summer regime corresponds to closed universities, schools and day care, and wearing masks in retail. Together with the leisure summer number this leads to R = 0.88, i.e. makes the situation sub-critical.

Table 2. Contributions to R by activity type and intervention according to our model.

contribution to R
home 0.44
… with cloth / N95 masks 0.20 / 0.02
work 0.17
… @ 75% / 50% 0.10 / 0.04
… with cloth / N95 masks during work 0.06 / 0.01
… @ 50% with N95 masks during work < 0.01
schools 0.15
… @ 75% / 50% 0.07 / 0.02
… with cloth / N95 masks during classes 0.05 / < 0.01
… with N95 masks during classes and 50% attendance < 0.01
day care 0.02
… @ 75% / 50% 0.01 / < 0.01
… with cloth / N95 masks 0.01 / < 0.01
universities 0.23
… @ 75% / 50% 0.11 / 0.03
… with cloth / N95 masks 0.06 / < 0.01
retail and errands 0.09
… @ 75% / 50% 0.06 / 0.03
… with cloth / N95 masks 0.03 / 0.01
leisure (winter) 1.04
… @ 75% / 50% 0.52 / 0.21
… with cloth / N95 masks 0.38 / 0.03
leisure (summer) 0.2
public transport 0.12
… @ 75% / 50% 0.06 / 0.03
… with cloth / N95 masks 0.04 / < 0.01

For these calculations we run the unrestricted model without any interventions and then introduced the interventions described in the left column on 2020–04-01. The reductions to the R values were calculated one week after that, comparing the respective weekly averages. For the mask interventions, the compliance rate is 90%.

It has been pointed out by other studies that the reproduction numbers at home play an important role and reduce the remaining “space” one has available for infections outside home [81]. The reproduction number at home can be reduced by moving persons showing symptoms, and more radically persons identified as contacts by contact tracing, into separate facilities, sometimes called quarantine hotels.

One also notices that all infection contexts can be strongly reduced by wearing masks—this (evidently) even holds for leisure. Clearly, they would need to be worn during the activities, and not just during access and egress. Wearing masks during class at school has hesitantly been adopted in Berlin during November; wearing masks during work, in particular in office buildings, has never been pursued seriously in Germany and is still not obligatory if occupants have at least 10 m2 available per person—which is the value with which our simulations run and which generate the numbers of Table 2.

Evidently, a tricky context is leisure. According to our simulations, leisure alone, in conjunction with home, would be sufficient to keep R above one during winter, and thus needs to be suppressed accordingly. Keeping other activity contexts open without masks implies that leisure needs to be suppressed even further if R < 1 is to be achieved.

Conversely, during summer achieving an R < 1 is relatively easy. This explains why there were few problems during summer in Germany (and most other European countries). Evidently, all of this refers to the original variant of SARS-CoV-2, which was less transmissible than later variants.

Decreasing marginal effect of interventions

In Table 2, for all activity types, a reduction of the participation by 50% reduces the contribution to R by far more than 50%: at work from 0.17 to 0.04, at school from 0.07 to 0.02, etc. In consequence, the next 50% reduction of participation, i.e. closing the activity type completely, will yield a much smaller reduction of infections. This is consistent with the empirical observation that the marginal effect of stay-at-home interventions decreases [6].

From our model, this can be explained as follows (see Fig 16): Assume, for example, that each morning each school child throws a coin and goes to school only when it shows heads; this means that school participation is reduced to 50%. In consequence, if there is an infectious person at school, only half as many other persons have a chance to get infected. (This assumes that they use the same classrooms as before, at half the density.) However, the probability that an undetected infectious person goes to school is also reduced to 50%. Multiplying these two probabilities means that only 50% ⋅ 50% = 25% of the infections happen in this case. That is, the first 50% of the reduction has already 75% of the possible effect.

Fig 16. Effect of dividing a group of 10 persons into two groups of 5 persons each.

Fig 16

In the original situation, each of the 9 susceptible persons (white and cyan) has a probability to get infected of p0, resulting in a expected number of infected persons of 9p0. In the divided situation on the left, the expected number of infections is 4p0. On the right, it is 0. Overall, this results in an expectation value of 12(4p0+0)=2p0. In consequence, when dividing classes and alternating their attendance, the number of infections is reduced from 9p0 to 2p0. For large group sizes, the reduction converges to 1/4. The same holds when each individual attendance is decided randomly with probability 1/2 at the beginning of each day.

More generally, consider an activity in an enclosed space, with N participants, k of them contagious, kN, and conditions such that the linear approximation of Eq (2) for the infection probability holds. In this situation, the expected number of infections is proportional to kN. Now assume that the participation probability at that activity, for each individual person, is reduced from 1 to α < 1. There are two consequences:

  1. The expected number of contagious persons reduces to αk.

  2. The expected number of participants reduces to αN.

The expected number of infections in consequence reduces to αkαN, i.e. α2 as many as before.

Evidently, this means that 1 − 0.752 ≈ 44% of the effect is obtained with the first 25% of the intervention, another 1 − 0.52 − 44% = 31% of the effect are obtained with the next 25% of the intervention, and the remaining 25% of the effect need the remaining 50% of the stay-at-home intervention for this particular activity.

In terms of the management of COVID-19, this implies that it is far better to include each activity type/sector of the economy to some extent, rather than shutting down some sectors completely while leaving some other sectors completely open.

Out of sample prediction

We show the predictive performance of our model by calibrating the simulation on a fixed training set and comparing simulation results into the future against unused data. In this calibration Θ is calibrated such that the Root Mean Squared Logarithmic Error (RMSLE) between hospital cases in the simulation compared to historic data is minimized. For this, the simulation is run with eight different Monte Carlo seeds and then the results are averaged. Because one simulation run is quite computationally expensive, a Tree-structured Parzen Estimator [82], implemented by the Optuna package [83] in Python, is used to sample the parameter space more efficiently.

RMSLE has the advantage that it is less sensitive to the scale of the data than RMSE. That is, relative errors in valleys have as much weight as relative errors on the ridges. This also corresponds to the visual impression of the logarithmic plots often used in epidemics and used throughout the paper. Results using RMSE instead of RMSLE and some more discussion can be found in S3 Text.

We run this calibration up to various dates. From there on, we perform two types of prediction: (a) Fig 17 left: With activity participation levels frozen at the level of the last calibration day (see the second column of Table 3; only during school vacations are school and work activities reduced for prediction dates); (b) Fig 17 right: With activity levels as given from the data also for prediction dates. For both cases, the import is frozen at 4 imported cases per day, while the daily maximum temperature is forecast based on the average over the last 10 years.

Fig 17. Hospitalized persons for different calibration runs compared to real data.

Fig 17

Θ is calibrated such that hospital numbers in the simulation match the real data (red dots) until different points in time as indicated by the legend. After this date, an out of sample prediction is carried out. Until the calibration date real weather and disease import data is used. After the calibration date average weather data from the past ten years is used and the disease import is set to 4 imported cases per day (1 agent per day) LEFT: Activity levels are frozen at the level of the last day of the period used for calibration. RIGHT: Real activity levels are used.—Results are averaged over 30 Monte Carlo seeds.

Table 3. Calibration parameter Θ and activity participation for the different out of sample predictions shown in Fig 17.

run Θ activity participation (if activity level frozen) training error prediction error (frozen activity levels) prediction error (real activity levels)
2020–05–01 1.20e-05 71% 0.372 5.500 0.34
2020–06–01 1.27e-05 88% 0.252 0.187 0.74
2020–07–01 1.29e-05 90% 0.232 0.047 0.11
2020–08–01 1.30e-05 90% 0.200 0.033 0.37
2020–09–01 1.32e-05 96% 0.181 0.442 0.52

RMSLE (= Root Mean Square Logarithmic Error) for the calibration interval (training error) as well as for prediction period between 09–01 and 10–31 (prediction error). The Optuna package does not return confidence intervals for estimated parameters.

One finds that the correct activity level (Fig 17 right) is crucial especially for longer-term predictions: Even with calibration only to the end of April, the model predicts the autumn wave very well, while when the activity numbers are frozen (Fig 17 left), the forecasts have a drift depending on whether the activity level is too low or too high on the day when it is frozen. Particularly striking is the blue curve (“2020–05-01”): on the left, the activity participation level is frozen at 71% (cf. Table 3), while on the right it varies mostly between 80% and 100% as given by the data plotted in Fig 7. This is consistent with the theoretical argument (Sec. Decreasing marginal effect of interventions) that an activity participation of 71% reduces infections to 71% ⋅ 71% ≈ 50% while an activity participation of 90% reduces infections only to 91% ⋅ 90% ≈ 80%. That difference is sufficient to generate the difference between the two curves.

Discussion

Intuition for these results

In an older version of the model [33], we had all contact intensities set to one. The contributions of each activity type to the infection dynamics then in first order corresponded to the average weekly time consumption in the respective activity. For example, averaged over the week including the weekend, school consumes about 5 hours per day for persons going to school. However, since in Berlin only about 10% of the population are school children, https://www.statistik-berlin-brandenburg.de/BasisZeitreiheGrafik/Bas-Schulen.asp?Ptyp=300&Sageb=21001&creg=BBB&anzwer=5 the average time consumption for the school activity is only 0.43 hours per day when taken across the whole population (cf. Table 4). In contrast, there are more persons going to work than to school, thus increasing the weight of work in the infection dynamics (1.83 hours per day). A similar weight comes from the leisure activities, which are not necessarily more hours per week for each individual person, but where all persons contribute to this type of time consumption, resulting in an average of 1.67 hours per day. In consequence, restricting leisure activities had a large effect in that model.

Table 4. Average time consumption of out-of-home activities.

Activity average time consumption [hr] normalized contact intensity ci timeci
day care 0.22 11 2.42
schools 0.43 11 4.73
university 0.13 5.5 0.75
work (incl. business) 1.83 1.47 2.69
shop 0.38 0.88 0.33
errands 0.77 1.47 1.14
leisure 1.67 9.24 15.43
home 16.45 1 16.45

Averaged over a full week including Saturday and Sunday. The remaining time is spent travelling between activities. Contact intensities are taken from Table 1.

In the present model, the time consumptions are now multiplied by the normalized contact intensities in those activity types, cf. Table 4. In consequence, leisure, which already had a large share before, is now multiplied with a large contact intensity, and in consequence now gets even more weight. Work, despite occupying similar amounts of time, is weighted down because of the multiplication by a much smaller contact intensity. On the other end of the scale, public transport has, at full occupancy, a large contact intensity, but the times spent in public transport are considerably smaller than, say, at work. Also, persons in public transport are required to wear masks, while at work they are not.

A complicated case are schools and day care: They occupy large amounts of time, and have a large contact intensity, both somewhat similar to leisure. In consequence, the re-opening of day care in July and of the schools in August should have had strong consequences in the infection numbers but did not. We took that observation as confirmation that their larger-than-average contact intensity is compensated for by a smaller-than-average infectivity and susceptibility (cf. Sec. Children).

For other diseases, for example influenza, all of the above may need to be adapted. For example, children may have a larger infectivity/susceptibility than adults, which then multiplied with their large contact intensity would lead to a large contribution to the infection dynamics. In consequence, these sub-models need to be understood and re-calibrated for each individual communicable disease.

Robustness

The simulation uses one uniform Θ (cf. Eq 1) that remains the same over the whole simulation period. In consequence, the dynamics is driven by other inputs. These are, after the initial disease import (Fig 5), primarily the activity participation (Fig 7) and the temperature-dependent outdoors fraction (Fig 9).

The importance of the activity participation can be taken from Fig 17, where the blue curve (“2020–05-01”) uses exactly the same setup on the left and on the right except for the activity participation level, which on the left is frozen at 71%, and on the right varies mostly between 80% and 100% as given by the data plotted in Fig 7. That was already discussed at the end of Sec. Out of sample prediction.

The importance of the temperature effect can best be taken from the calibration sequence: Fig 6 showing that a Θ smaller than 0.6 is not plausible; Fig 8 showing that reduced activity participation alone is not able to bring infections and resulting hospital levels down to the observed level during summer; Fig 10 showing that a symmetric indoors/outdoors model is able to achieve that but misses the fall wave; and finally Fig 11 showing that the asymmetric indoors/outdoors model, with a significantly higher threshold temperature at the end of the summer, is able to also generate the fall wave. We have checked for other mechanisms driving the fall wave but obvious possibilities, such as the return-to-school or the summer disease import, both make the hospital numbers in fall start going up in the middle of August rather than at the beginning of September. S4 Text also shows that a less asymmetric indoors/outdoors model performs worse. Evidently, it is possible that in reality the virus seasonality is also caused by other aspects [84], and our model absorbs them into the indoors/outdoors model. Still, if one accepts the aerosol infection as major pathway, then the assumption that indoors vs. outdoors activities play a strong role is plausible.

The effect of the contact intensities was already discussed in Sec. Intuition for these results. From that discussion, it follows that the number of infections in an activity context depends, in first order, linearly on the contact intensity of that activity type. In consequence, if a contact intensity in Table 1 or Table 4 is, say, a factor of 2 too large, then the resulting infections (Fig 13 top), share of infections (Fig 13 bottom), and R-value per activity type (Fig 15) in first order get divided by two. In second order, the calibration parameter Θ would have to get increased to bring infections back to the previous level.

S4 Text also shows that the model fit degrades considerably when the mask model is removed.

Comparison to other models

A comparison to compartmental models, in particular to the model of Chang et al. [10] which is at the border between compartmental and agent-based, can be found in S5 Text. The same text contains also a comparison with the model by Aleta et al. [21].

Comparison to other “reductions of R” studies

Table 5 extracts “additional reductions to R” from other studies and compares them to our results. One immediately finds two issues: (A) The categories are not well aligned. For example, “small gathering cancellation” refers to gatherings with 50 persons or less, while other studies cancel gatherings larger than a certain number. Again other studies just consider a “gathering ban”, but at the same time have “event ban” and “venue closure” as separate items. (B) Even where the categories are well aligned, the resulting numbers vary significantly: for example, “closing schools and universities” goes from 16% to 38%.

Table 5. Percent reduction of R in other studies.

Measure Brauner et al. (1st wave) [5] Sharma et al. (2nd wave) [85] Haug et al. “CC” [86] Haug et al. “Lasso” [86] Our model (abs.) (Table 2) Our model R = 2.24
Closing schools and universities 38% 16% 21% 0.38 21%
Closing educational institutions (after implementing protective measures) 7%
Closing schools 0.15 7%
Closures of businesses 35%
Closure of work sector 0.17 8%
Closing some high-risk face-to-face businesses 18%
Closing most nonessential face-to-face businesses 27%
Closing retail and close contact services 12%
Closure of retail and errands sectors 12% 0.09 4%
Gatherings limited to ≤ 1000 23%
Mass gathering cancellation 33% 0%
Gatherings limited to ≤ 100 34%
Gatherings limited to ≤ 10 42%
Small gathering cancellation 35% 22%
Closures of gastronomy 12%
Closures of night clubs 12%
Leisure and entertainment venues 3%
Banning all leisure activities (including gastronomy and private visits) 1.04 92%
Night time curfew 13%
Stricter mask policy (mandatory in most or all shared/public spaces) 12%

Percentages are rounded to integers. To the right are our own results, first in absolute reductions of R, then in percent reductions of R applied to an R of 2.24 (the overall R in the model where these values were taken). Evidently, for a smaller R, our percentage values would be higher.

In part, this is a consequence of the fact that the interventions are not standardized: For example, the number of exemptions in what is called a lockdown varied quite a lot between countries.

Additionally, the transmission mechanisms from policy decision to execution vary, so even if the concept may be the same, the effect may be quite different between countries. For example, our reductions to R caused by school closures come out at the lower end of the range, and lower than those of Brauner et al. [5]. We attribute this to the following two elements: First, the model by Brauner et al. has no initial disease import which is then brought to a halt. In consequence, their approach has to assign all changes in the infection dynamics to the school closures. The school closures in Berlin, with Mar/12 (fri) or Mar/15 (mon) as the last day of school, too late to explain that the infection numbers stopped in the middle of March. Also, Dehning et al. [87] have an additional change point on Mar/7, corroborating that something has changed before the school closures. Second, other than both Brauner et al. and Dehning et al., we have the mobility data of Fig 7 at our disposal. It is clear that there was considerably more societal adaptation around the weekend of Mar/13–14 than just keeping children at home. Brauner et al. themselves write that “the closure of schools … may have caused … behaviour changes. We do not distinguish this indirect signalling effect from the direct effect”. Additionally, in Germany, children staying at home will force their parents to stay at home, thus forcing them into home office. In consequence, some of this may not be signalling, but causal secondary effects. In consequence, our model is more differentiated: What Brauner et al. attribute to the school closures alone is in our model attributed to a combination of school closures, behavioral changes, and the reduction of various other out-of-home activities. Thus, all of the values may be correct: The pure effect of school closures in western countries (with relatively few young people) may not be larger than 7%, but the measurable consequence for R when governments closed schools as their first intervention presumably indeed was much larger.

Clearly, data-driven mechanical models such as ours help clarifying the categories since we can exactly specify what we mean by closing some activity type or wearing a mask at certain activity types. Also, we can differentiate between the transmission from political decision to behavioral execution vs. the consequences of the behavioral execution to the infection dynamics. Finally, we can mechanically include organizational approaches such as contact tracing.

Masks

We have checked our relatively large reductions of R for masks in Table 2 multiple times. They are a consequence of the assumption that N95 masks reduce intake to 2.5%, taken from [67]. The review article [88] comes up with about 5%, a factor of two larger, but still displaying a very large reduction. The same paper [88] also shows that “masks” without a specification of the type has much less of an effect. Finally, there may be the issue that lay people may not be able to use N95 masks at full efficiency. In consequence, our results have to be interpreted once more “mechanically”: They are plausible under the assumption that the fraction of people specified in the model is indeed able to use N95 masks effectively.

Under-reporting, and its variation over time

A known issue with epidemiological data and thus the simulations that build on it is the issue of under-reporting, i.e. that there are more infections in reality than are in the data. Looking at Fig 12, it is clear that our current model assumes only little under-reporting during August to October. This originally led to hospital numbers that were too large; since we cannot reduce the number of infections below the case numbers, this justifies why we reduce simulated hospital numbers by a factor of 2 compared to [15] (cf. Fig 3). This, in turn, implies that, if we want to get the spring hospital numbers right, our simulated infection numbers in spring need to be about a factor of 8 larger than the reported case numbers.

Also note that our simulation includes non-symptomatic cases, which come on top of the symptomatic cases that we show in our figures such as Fig 12; that is, the actual under-reporting is even larger. Still, it is entirely possible that Germany’s testing strategy is missing even more cases, in which case the simulation would need to aim for even larger numbers of infected persons. As long as the number of seropositive persons in Germany remains in the single-digit percentage ranges [89], the predictions made by the simulation are not strongly affected by this issue. Once the infections start to saturate, i.e. approach herd immunity, this will become important. Hopefully, by then systematic antibody screenings will be available, and we will be able to calibrate the model against the case numbers that must have been infected in the past. Given that we have the hospital numbers for control, we expect this to be straightforward.

Making the model more realistic

Evidently, the model can be made (even) more realistic. Important aspects are the adaptation of the daily schedules to to restrictions, the dependence on income, and more realistic contact structures. All three aspects are discussed in S6 Text.

Policy advice

The model was and is used for policy advice. Our regular reports to the government all have a DOI, for example [3] or [4]. Again, see https://depositonce.tu-berlin.de/simple-search?query=modus-covid.

Conclusions

We combine a person-centric human mobility model with a mechanical model of infection and a person-centric disease progression model into an epidemiological simulation model. Different from other models, we take the movements of the persons, including the intervening activities where they can interact with other people, directly from data, which has already been available for transport planning before the pandemics. For privacy reasons, we rely on a process that takes the original mobile phone data, extracts statistical properties, and then synthesizes movement trajectories from the statistical properties; one could use the original mobile phone trajectories directly if they were available. The model is used to replay the epidemics in Berlin. It is shown that the second wave in Berlin can be modelled well with an explicit temperature dependency of the outdoors fraction for leisure activities. The model is then used to evaluate different intervention strategies, such as closing educational facilities, reducing other out-of-home activities, wearing masks, or contact tracing, and to determine differentiated changes of the reproduction number R per intervention.

Supporting information

S1 Appendix. Appendix.

(PDF)

S1 Fig. Reduced activity participation by activity type.

(PDF)

S1 Text. Senozon method.

(PDF)

S2 Text. Model history.

(PDF)

S3 Text. Error metric for calibration.

(PDF)

S4 Text. Robustness runs.

(PDF)

S5 Text. Comparison to other models.

(PDF)

S6 Text. Making the model more realistic.

(PDF)

S1 File

(PDF)

Acknowledgments

We thank Kai Martins-Turner, Dominik Ziemke, Tim Conrad and Natasa Conrad for frequent inputs and discussion. We are grateful to BVG (Berlin public transit operator) for providing the mask compliance rates which they surveyed on a daily basis. The work on the paper was funded by the Ministry of research and education (BMBF) Germany (01KX2022A) and TU Berlin; regular reports can be found through this search: https://depositonce.tu-berlin.de/simple-search?query=modus-covid. Zuse Institute Berlin (ZIB) provided CPU time.

Data Availability

For computer code see https://github.com/matsim-org/matsim-episim. Simulations were computed with version d16656f076640124de0361fc327d3803a80aa466 of the code, started with command java -jar matsim-episim-1.0-SNAPSHOT.jar runParallel --setup org.matsim.run.batch.BerlinSensitivityRuns --params org.matsim.run.batch.BerlinSensitivityRuns$Params. The input data (including the synthetic mobility traces) are made public here: https://doi.org/10.14279/depositonce-11495. The output data used for the figures can be retrieved at: http://dx.doi.org/10.14279/depositonce-12113.

Funding Statement

The work on the paper was funded by the Ministry of research and education (BMBF) Germany (01KX2022A) and TU Berlin. The BMBF Grant also funded the data provided by the commercial company senozon. MB and AN, employed by senozon, worked together with the rest of the team to iterate between input data and simulations until the input data contained all information needed to run the simulation. senozon provided support in the form of salaries for MB and AN, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.

References

  • 1. Horni A, Nagel K, Axhausen KW. The Multi-Agent Transport Simulation MATSim. London, UK: Ubiquity Press; 2016. Available from: 10.5334/baw. [DOI] [Google Scholar]
  • 2. Müller SA, Balmer M, Neumann A, Nagel K. Mobility traces and spreading of COVID-19. medarxiv. 2020. doi: 10.1101/2020.03.27.20045302 [DOI] [Google Scholar]
  • 3. Müller SA, Charlton W, Ewert R, Rakow C, Schlenther T, Nagel K. MODUS-COVID Vorhersage vom 8.4.2020. depositonce. 2020. doi: 10.14279/DEPOSITONCE-10016 [DOI] [Google Scholar]
  • 4. Müller SA, Charlton W, Conrad ND, Ewert R, Conrad T, Nagel K, et al. MODUS-COVID Bericht vom 11.09.2020. depositonce. 2020. doi: 10.14279/DEPOSITONCE-10553 [DOI] [Google Scholar]
  • 5. Brauner JM, Mindermann S, Sharma M, Johnston D, Salvatier J, Gavenčiak T, et al. Inferring the effectiveness of government interventions against COVID-19. Science. 2021;371 (6531). doi: 10.1126/science.abd9338 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Bendavid E, Oh C, Bhattacharya J, Ioannidis JPA. Assessing Mandatory Stay-at-Home and Business Closure Effects on the Spread of COVID-19. Eur J Clin Invest. 2021; p. e13484. doi: 10.1111/eci.13484 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Anderson RM, May RM. Population biology of infectious diseases: Part I. Nature. 1979;280(5721):361–367. doi: 10.1038/280361a0 [DOI] [PubMed] [Google Scholar]
  • 8. Iannelli F, Koher A, Brockmann D, Hövel P, Sokolov IM. Effective distances for epidemics spreading on complex networks. Phys Rev E. 2017;95(1-1):012313. doi: 10.1103/PhysRevE.95.012313 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Pastor-Satorras R, Castellano C, Van Mieghem P, Vespignani A. Epidemic processes in complex networks. Rev Mod Phys. 2015;87(3):925–979. doi: 10.1103/RevModPhys.87.925 [DOI] [Google Scholar]
  • 10. Chang S, Pierson E, Koh PW, Gerardin J, Redbird B, Grusky D, et al. Mobility network models of COVID-19 explain inequities and inform reopening. Nature. 2020. doi: 10.1038/s41586-020-2923-3 [DOI] [PubMed] [Google Scholar]
  • 11. Hufnagel L, Brockmann D, Geisel T. Forecast and control of epidemics in a globalized world. Proc Natl Acad Sci U S A. 2004;101(42):15124–15129. doi: 10.1073/pnas.0308344101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Eubank S, Guclu H, Kumar VSA, Marathe MV, Srinivasan A, Toroczkai Z, et al. Modelling disease outbreaks in realistic urban social networks. Nature. 2004;429(6988):180–184. doi: 10.1038/nature02541 [DOI] [PubMed] [Google Scholar]
  • 13. Halloran ME, Ferguson NM, Eubank S, Longini IM Jr, Cummings DAT, Lewis B, et al. Modeling targeted layered containment of an influenza pandemic in the United States. Proc Natl Acad Sci U S A. 2008;105(12):4639–4644. doi: 10.1073/pnas.0706849105 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Bajardi P, Poletto C, Ramasco JJ, Tizzoni M, Colizza V, Vespignani A. Human mobility networks, travel restrictions, and the global spread of 2009 H1N1 pandemic. PLoS One. 2011;6(1):e16591. doi: 10.1371/journal.pone.0016591 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Ferguson N, Laydon D, Nedjati Gilani G, Imai N, Ainslie K, Baguelin M, et al. Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand; 2020. Available from: 10.25561/77482. [DOI]
  • 16. Chinazzi M, Davis JT, Ajelli M, Gioannini C, Litvinova M, Merler S, et al. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science. 2020;368(6489):395–400. doi: 10.1126/science.aba9757 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Kraemer MUG, Yang CH, Gutierrez B, Wu CH, Klein B, Pigott DM, et al. The effect of human mobility and control measures on the COVID-19 epidemic in China. Science. 2020;368(6490):493–497. doi: 10.1126/science.abb4218 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Davids A, Rand G, Georg CP, Koziol T, Schasfoort J. SABCoM: A Spatial Agent-Based COVID-19 Model. SSRN Electronic Journal. 2020. doi: 10.2139/ssrn.3663320 [DOI] [Google Scholar]
  • 19. Tadic B, Melnik R. Modeling latent infection transmissions through biosocial stochastic dynamics. medRxiv. 2020; p. 2020.07.30.20164491. doi: 10.1371/journal.pone.0241163 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Chen J, Vullikanti A, Hoops S, Mortveit H, Lewis B, Venkatramanan S, et al. Medical costs of keeping the US economy open during COVID-19. Sci Rep. 2020;10(1):18422. doi: 10.1038/s41598-020-75280-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Aleta A, Martín-Corral D, Pastore Y Piontti A, Ajelli M, Litvinova M, Chinazzi M, et al. Modelling the impact of testing, contact tracing and household quarantine on second waves of COVID-19. Nat Hum Behav. 2020. doi: 10.1038/s41562-020-0931-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Kucharski AJ, Klepac P, Conlan AJK, Kissler SM, Tang ML, Fry H, et al. Effectiveness of isolation, testing, contact tracing, and physical distancing on reducing transmission of SARS-CoV-2 in different settings: a mathematical modelling study. Lancet Infect Dis. 2020. doi: 10.1016/S1473-3099(20)30457-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Axhausen KW. Eine ereignisorientierte Simulation von Aktivitätsketten zur Parkstandswahl. Schriftenreihe des Instituts für Verkehrswesen der Universität Karlsruhe. 1989;40. [Google Scholar]
  • 24.Bowman JL, Bradley M, Shiftan Y, Lawton TK, Ben-Akiva M. Demonstration of an activity-based model for Portland. In: World Transport Research: Selected Proceedings of the 8th World Conference on Transport Research 1998. vol. 3. Elsevier, Oxford; 1998. p. 171–184.
  • 25.Infas, DLR. Mobilität in Deutschland 2008 Ergebnisbericht; 2010. Available from: https://www.bmvi.de/SharedDocs/DE/Anlage/G/mid-2008-ergebnisbericht.pdf.
  • 26.Ahrens GA, Ließke F, Wittwer R, Hubrich S, Wittig S. Mobilität in Städten—SrV 2013. Dresden: TU Dresden, Lehrstuhl Verkehrs- und Infrastrukturplanung; 2014. Available from: http://www.tu-dresden.de/srv/.
  • 27.Neumann A, Balmer M. Mobility Pattern Recognition (MPR) und Anonymisierung von Mobilfunkdaten. Senozon Deutschland GmbH and Senozon AG; 2020. Available from: https://senozon.com/wp-content/uploads/Whitepaper_MPR_Senozon_DE.pdf.
  • 28. Ziemke D, Kaddoura I, Nagel K. The MATSim Open Berlin Scenario: A multimodal agent-based transport simulation scenario based on synthetic demand modeling and open data. Procedia Computer Science. 2019;151:870–877. doi: 10.1016/j.procs.2019.04.120 [DOI] [Google Scholar]
  • 29. Smieszek T. A mechanistic model of infection: why duration and intensity of contacts should be included in models of disease spread. Theor Biol Med Model. 2009;6:25. doi: 10.1186/1742-4682-6-25 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Smieszek T, Balmer M, Hattendorf J, Axhausen KW, Zinsstag J, Scholz RW. Reconstructing the 2003/2004 H3N2 influenza epidemic in Switzerland with a spatially explicit, individual-based model. BMC Infect Dis. 2011;11:115. doi: 10.1186/1471-2334-11-115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Hackl J, Dubernet T. Epidemic Spreading in Urban Areas Using Agent-Based Transportation Models. Future Internet. 2019;11(4):92. doi: 10.3390/fi11040092 [DOI] [Google Scholar]
  • 32. Najmi A, Safarighouzhdi F, Miller EJ, MacIntyre R, Rashidi TH. Determination of COVID-19 parameters for an agent-based model: Easing or tightening control strategies. medRxiv. 2020. doi: 10.1101/2020.06.20.20135186 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Müller SA, Balmer M, Charlton B, Ewert R, Neumann A, Rakow C, et al. Using mobile phone data for epidemiological simulations of lockdowns: government interventions, behavioral changes, and resulting changes of reinfections. medRxiv. 2020. doi: 10.1101/2020.07.22.20160093 [DOI] [Google Scholar]
  • 34.Müller SA, Balmer M, Charlton W, Ewert R, Neumann A, Rakow C, et al. A realistic agent-based simulation model for COVID-19 based on a traffic simulation and mobile phone data; 2020. Available from: https://arxiv.org/abs/2011.11453. [DOI] [PMC free article] [PubMed]
  • 35. Manout O, Ciari F. The contribution of everyday activities to the spread of COVID-19. An agent-based simulation from Montreal, Canada. researchgate. 2021. doi: 10.13140/RG.2.2.12578.66245 [DOI] [Google Scholar]
  • 36.Bossert A, Kersting M, Timme M, Schröder M, Feki A, Coetzee J, et al. Limited containment options of COVID-19 outbreak revealed by regional agent-based simulations for South Africa; 2020. Available from: https://arxiv.org/abs/2004.05513.
  • 37. Wulkow H, Conrad TOF, Conrad ND, Müller SA, Nagel K, Schütte C. Prediction of Covid-19 spreading and optimal coordination of counter-measures: From microscopic to macroscopic models to Pareto fronts. PLOS ONE. 2021. doi: 10.1371/journal.pone.0249676 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Apple. COVID-19—Mobility Trends Reports; 2021. https://covid19.apple.com/mobility.
  • 39.Google. COVID-19 Community Mobility Report; 2021. https://www.google.com/covid19/mobility/.
  • 40. Pullano G, Valdano E, Scarpa N, Rubrichi S, Colizza V. Evaluating the effect of demographic factors, socioeconomic factors, and risk aversion on mobility during the COVID-19 epidemic in France under lockdown: a population-based study. Lancet Digit Health. 2020;2(12):e638–e649. doi: 10.1016/S2589-7500(20)30243-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Bonaccorsi G, Pierri F, Cinelli M, Flori A, Galeazzi A, Porcelli F, et al. Economic and social consequences of human mobility restrictions under COVID-19. Proc Natl Acad Sci U S A. 2020;117(27):15530–15535. doi: 10.1073/pnas.2007658117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Eisenmann C, Nobis C, Kolarova V, Lenz B, Winkler C. Transport mode use during the COVID-19 lockdown period in Germany: The car became more important, public transport lost ground. Transp Policy. 2021;103:60–67. doi: 10.1016/j.tranpol.2021.01.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.IVT, WWZ. MOBIS Covid19 Mobility Report; 2021. https://ivtmobis.ethz.ch/mobis/covid19/reports/latest_de.
  • 44. Badr HS, Du H, Marshall M, Dong E, Squire MM, Gardner LM. Association between mobility patterns and COVID-19 transmission in the USA: a mathematical modelling study. Lancet Infect Dis. 2020;20(11):1247–1254. doi: 10.1016/S1473-3099(20)30553-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Fritz C, Kauermann G. On the Interplay of Regional Mobility, Social Connectedness, and the Spread of COVID-19 in Germany; 2020. Available from: https://arxiv.org/abs/2008.03013v2. [DOI] [PMC free article] [PubMed]
  • 46. Jia JS, Lu X, Yuan Y, Xu G, Jia J, Christakis NA. Population flow drives spatio-temporal distribution of COVID-19 in China. Nature. 2020;582(7812):389–394. doi: 10.1038/s41586-020-2284-y [DOI] [PubMed] [Google Scholar]
  • 47. Xiong C, Hu S, Yang M, Luo W, Zhang L. Mobile device data reveal the dynamics in a positive relationship between human mobility and COVID-19 infections. Proc Natl Acad Sci U S A. 2020;117(44):27087–27089. doi: 10.1073/pnas.2010836117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.VSP. Automatisierter und vernetzter öffentlicher Verkehr (AVÖV); 2018. https://www.vsp.tu-berlin.de/projects/2018/avoev/.
  • 49.VSP. RealLabHH; 2020. https://www.vsp.tu-berlin.de/projects/2020/reallabhh/.
  • 50.VSP. MOSAIK-2; 2020. https://www.vsp.tu-berlin.de/projects/2020/mosaik_2/.
  • 51.VSP. KoMoDnext; 2020. https://www.vsp.tu-berlin.de/projects/2020/komodnext/.
  • 52. Sze To GN, Chao CYH. Review and comparison between the Wells-Riley and dose-response approaches to risk assessment of infectious respiratory diseases. Indoor Air. 2010;20(1):2–16. doi: 10.1111/j.1600-0668.2009.00621.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Lelieveld J, Helleis F, Borrmann S, Cheng Y, Drewnick F, Haug G, et al. Model Calculations of Aerosol Transmission and Infection Risk of COVID-19 in Indoor Environments. Int J Environ Res Public Health. 2020;17(21). doi: 10.3390/ijerph17218114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Marr et al. FAQs on Protecting Yourself from aerosol transmission; 2020. https://tinyurl.com/FAQ-aerosols.
  • 55. Hartmann A, Kriegel M. Parameter study for risk assessment in internal spaces regarding aerosols loaded with virus. depositonce. 2020. doi: 10.14279/DEPOSITONCE-10415 [DOI] [Google Scholar]
  • 56. Asadi S, Wexler AS, Cappa CD, Barreda S, Bouvier NM, Ristenpart WD. Aerosol emission and superemission during human speech increase with voice loudness. Sci Rep. 2019;9(1):2348. doi: 10.1038/s41598-019-38808-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Prather KA, Wang CC, Schooley RT. Reducing transmission of SARS-CoV-2. Science. 2020;368(6498):1422–1424. doi: 10.1126/science.abc6197 [DOI] [PubMed] [Google Scholar]
  • 58.Senatsverwaltung für Integration, Arbeit und Soziales. Anlage 1 AV Wohnen—Berlin.de; 2020. https://www.berlin.de/sen/soziales/service/berliner-sozialrecht/kategorie/ausfuehrungsvorschriften/av_wohnen_anlage1-571941.php.
  • 59.Heyder L. Was versteht man unter “Mindestluftwechsel”?; 2015. https://www.baustoffwissen.de/baustoffe/baustoffknowhow/bauelemente/mindestluftwechsel-vorschrift-luftaustausch-lueftung/.
  • 60.Wolters Kluwer Deutschland GmbH. DGUV Information 202-090—Klasse(n)—Räume für Schulen Empfehlungen für gesund… | Schriften | arbeitssicherheit.de; 2020. Available from: https://www.arbeitssicherheit.de/schriften/dokument/0%3A8144291%2C4.html”.
  • 61.Knellwolf B. Coronavirus—Ansteckungsgefahr: Waggons sind keine Covid-19-Hölle—SBB wehrt sich gegen Vorurteil; 2020. https://www.tagblatt.ch/leben/ansteckungsgefahr-waggons-sind-keine-covid-19-holle-sbb-wehrt-sich-gegen-vorurteil-ld.1235317.
  • 62. Rim D, Siegel J, Spinhirne J, Webb A, McDonald-Buller E. Characteristics of cabin air quality in school buses in Central Texas. Atmos Environ. 2008;42(26):6453–6464. doi: 10.1016/j.atmosenv.2008.04.030 [DOI] [Google Scholar]
  • 63.Arbeitsbereich Haustechnik der Fachkommission Haustechnik und Krankenhausbau des Ausschusses für staatlichen Hochbau der Bauministerkonferenz. Planung und Bau von Küchen und Kantinen für 50 bis 1000 Verpflegungsteilnehmer; 2002. Available from: https://www.bauministerkonferenz.de/Dokumente/42319679.pdf.
  • 64.BMJV. Verordnung über Arbeitsstätten; 2020. Available from: http://www.gesetze-im-internet.de/arbst_ttv_2004/.
  • 65.für Arbeitsstätten A. Technische Regeln für Arbeitsstätten ASR A3.6; 2012. Available from: https://www.baua.de/DE/Angebote/Rechtstexte-und-Technische-Regeln/Regelwerk/ASR/pdf/ASR-A3-6.pdf.
  • 66. DIN Deutsches Institut für Normung. Energy performance of buildings—Ventilation for buildings—Part 3: For non-residential buildings—Performance requirements for ventilation and room-conditioning systems (Modules M5-1, M5-4). DIN industry standard. 2019. doi: 10.31030/2596017 [DOI] [Google Scholar]
  • 67. Eikenberry SE, Mancuso M, Iboi E, Phan T, Eikenberry K, Kuang Y, et al. To mask or not to mask: Modeling the potential for face mask use by the general public to curtail the COVID-19 pandemic. Infect Dis Model. 2020;5:293–308. doi: 10.1016/j.idm.2020.04.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Dattner I, Goldberg Y, Katriel G, Yaari R, Gal N, Miron Y, et al. The role of children in the spread of COVID-19: Using household data from Bnei Brak, Israel, to estimate the relative susceptibility and infectivity of children. PLOS Computational Biology. 2021;17(2):1–19. doi: 10.1371/journal.pcbi.1008559 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Davies NG, Klepac P, Liu Y, Prem K, Jit M, CMMID COVID-19 working group, et al. Age-dependent effects in the transmission and control of COVID-19 epidemics. Nat Med. 2020;26(8):1205–1211. doi: 10.1038/s41591-020-0962-9 [DOI] [PubMed] [Google Scholar]
  • 70.WHO. Report of the WHO-China Joint Mission on Coronavirus Disease 2019 (COVID-19). 2020; 2020. https://www.who.int/publications/i/item/report-of-the-who-china-joint-mission-on-coronavirus-disease-2019-(covid-19).
  • 71. He X, Lau EHY, Wu P, Deng X, Wang J, Hao X, et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nature Medicine. 2020;26(5):672–675. doi: 10.1038/s41591-020-0869-5 [DOI] [PubMed] [Google Scholar]
  • 72. Wölfel R, Corman VM, Guggemos W, Seilmaier M, Zange S, Müller MA, et al. Virological assessment of hospitalized patients with COVID-2019. Nature. 2020;581(7809):465–469. doi: 10.1038/s41586-020-2196-x [DOI] [PubMed] [Google Scholar]
  • 73. Dreher M, Kersten A, Bickenbach J, Balfanz P, Hartmann B, Cornelissen C, et al. Charakteristik von 50 hospitalisierten COVID-19-Patienten mit und ohne ARDS. Dtsch Arztebl Int. 2020;117:271–278. doi: 10.3238/arztebl.2020.0271 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Wang L, Xu T, Stoecker TH, Stoecker H, Jiang Y, Zhou K. Machine learning spatio-temporal epidemiological model to evaluate Germany-county-level COVID-19 risk; 2020. Available from: https://arxiv.org/abs/2012.00082.
  • 75.Robert Koch Institute. RKI—SARS-CoV-2 Steckbrief zur Coronavirus-Krankheit-2019 (COVID-19); 2020. https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Steckbrief.html.
  • 76. Ashcroft P, Huisman J, Lehtinen S, Bouman J, Althaus C, Regoes R, et al. COVID-19 infectivity profile correction. Swiss Medical Weekly. 2020;150. [DOI] [PubMed] [Google Scholar]
  • 77.Robert Koch Institute. Aktueller Lage-/Situationsbericht des RKI zu COVID-19; 2020. https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Situationsberichte/Gesamt.html.
  • 78.LaGeSo Berlin. COVID-19 Berlin Dashboard; 2021. https://www.berlin.de/corona/lagebericht/desktop/corona.html.
  • 79. Nishiura H, Oshitani H, Kobayashi T, Saito T, Sunagawa T, Matsui T, et al. Closed environments facilitate secondary transmission of coronavirus disease 2019 (COVID-19). medRxiv. 2020. doi: 10.1101/2020.02.28.20029272 [DOI] [Google Scholar]
  • 80.Statistisches Bundesamt. Ausübung von Sport im Freien in Deutschland 2017; 2017. https://de.statista.com/statistik/daten/studie/1673/umfrage/sport-im-freien.
  • 81. Bock W, Adamik B, Bawiec M, Bezborodov V, Bodych M, Burgard JP, et al. Mitigation and herd immunity strategy for COVID-19 is likely to fail. medRxiv. 2020. doi: 10.1101/2020.03.25.20043109 [DOI] [Google Scholar]
  • 82.Bergstra JS, Bardenet R, Bengio Y, Kégl B. Algorithms for Hyper-Parameter Optimization. In: Shawe-Taylor J, Zemel RS, Bartlett PL, Pereira F, Weinberger KQ, editors. Advances in Neural Information Processing Systems 24. Curran Associates, Inc.; 2011. p. 2546–2554. Available from: https://papers.nips.cc/paper/4443-algorithms-for-hyper-parameteroptimization.pdf.
  • 83. Akiba T, Sano S, Yanase T, Ohta T, Koyama M. Optuna: A Next-generation Hyperparameter Optimization Framework. Proceedings of the 25rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2019. doi: 10.1145/3292500.3330701 [DOI] [Google Scholar]
  • 84. Xu R, Rahmandad H, Gupta M, DiGennaro C, Ghaffarzadegan N, Amini H, et al. The Modest Impact of Weather and Air Pollution on COVID-19 Transmission. SSRN. 2020. doi: 10.2139/ssrn.3593879 [DOI] [Google Scholar]
  • 85. Sharma M, Mindermann S, Rogers-Smith C, Leech G, Snodin B, Ahuja J, et al. Understanding the effectiveness of government interventions in Europe’s second wave of COVID-19. bioRxiv. 2021. doi: 10.1101/2021.03.25.21254330 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Haug N, Geyrhofer L, Londei A, Dervic E, Desvars-Larrive A, Loreto V, et al. Ranking the effectiveness of worldwide COVID-19 government interventions. Nature. 2020. [DOI] [PubMed] [Google Scholar]
  • 87. Dehning J, Zierenberg J, Spitzner FP, Wibral M, Neto JP, Wilczek M, et al. Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions. Science. 2020. doi: 10.1126/science.abb9789 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88. Chu DK, Akl EA, Duda S, Solo K, Yaacoub S, Schünemann HJ, et al. Physical distancing, face masks, and eye protection to prevent person-to-person transmission of SARS-CoV-2 and COVID-19: a systematic review and meta-analysis. Lancet. 2020;395(10242):1973–1987. doi: 10.1016/S0140-6736(20)31142-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Robert Koch Institute. Serologische Untersuchungen von Blutspenden auf Antikörper gegen SARS-CoV-2 (SeBluCo-Studie); 2020. https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Projekte_RKI/SeBluCo_Zwischenbericht.html.

Decision Letter 0

Itzhak Benenson

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

25 May 2021

PONE-D-21-11422

Predicting the effects of COVID-19 related interventions in urban settings by combining activity-based modelling, agent-based simulation, and mobile phone data

PLOS ONE

Dear Dr. Nagel,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jun 26 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Itzhak Benenson, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

3. Thank you for stating the following in the Financial Disclosure section:

'The work on the paper was funded by the Ministry of research and education (BMBF) Germany (01KX2022A) and TU Berlin.'

We note that one or more of the authors are employed by commercial companies: Senozon AZ and Senozon GmbH.

a. Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form.

Please also include the following statement within your amended Funding Statement.

“The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.”

If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement.

b. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc.  

Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to  PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests) . If this adherence statement is not accurate and  there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

c. Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf.

Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests

4. We note that Figure 1 in your submission contains copyrighted images.

All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (a) present written permission from the copyright holder to publish this figure specifically under the CC BY 4.0 license, or (b) remove the figure from your submission:

a. You may seek permission from the original copyright holder of Figure 1 to publish the content specifically under the CC BY 4.0 license.

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission. 

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

b. If you are unable to obtain permission from the original copyright holder to publish this figure under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

5. Please ensure that you refer to Figure 16 in your text as, if accepted, production will need this reference to link the reader to the figure.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The paper “Predicting the effects of COVID-19 related interventions in urban settings by combining activity-based modeling, agent-based simulation, and mobile phone data” proposes a person-centric model that incorporates mobility patterns derived by cell phone usage and numerous other factors with a person-specific variant of a S(E)IR model to better understand how specific political interventions affect the rate of reinfections. Overall, the model offers an immaculate depth of a possible system that explains the progression of the pandemic in Berlin. Still, there are multiple major and minor questions that I would like to be answered before advising the manuscript for publication.

Major Remarks:

-While I generally admire the depth of the proposed model, I am not completely sure from what empirical evidence we really draw the conclusion. In particular, I am not sure to what extent the model choices or the mobility patterns drive the results. In my opinion, there are still multiple unknowns regarding the transmission of COVID-19, e.g., where do they happen, what effective difference is between an N95, medical mask, or cloth. Still, all this missing knowledge is completed with sometimes more or less empirical values in the presented model. I definitely see the sense of having this type of model, but the caveats should be stated clearly in the paper.

-The article would greatly benefit from a discussion of the sensitivity and stability of the system. If I understand the manuscript correctly, only one parameter is chosen in the Calibration, but how sensitive and stable is the system to different values of this coefficient? And what kind of sensitivity checks of all other parameters were carried out? By this I mean some assessment of how robust the conclusions drawn are.

An adequate discussion and communication of the uncertainties in the model are missing. On multiple occasions, this would benefit the manuscript.

- Fig. 4: What do the shaded areas indicate?

- Fig. 6 TOP: The numbers are really low, hence some uncertainty over multiple iterations of the pandemic would be great. Here, one could, e.g., sample different mobility patterns in the Senozon method? Overall, multiple runs of the simulations should enable some type of uncertainty quantification.

-The calibration of the only non-fixed parameter also has associated uncertainty. Here some additional information on how the calibration is performed is needed. Can one understand it as some type of method of moments, where the coefficient is picked so that the observed time series of cases and hospitalizations is conserved as closely as possible in the simulation model? If so it is unclear one differentiates between primary and secondary priority. Some mathematical formulas in the Annex or Supplementary Material would be helping.

In the article prediction using the model is named as a possible application. Therefore, some out-of-sample results using historical data are needed.

-What are the effects of the multiplicative form in equation (1)? Doesn't this form assume that all components are inherently on the same scale? Even though the approximation in (2) is only used for illustrative reasons when does the approximation exactly hold?

Minor Remarks:

-Generally, the presentation of the introduction could be more concise. At the moment, the narrative is based on how prior versions of the model were improved over time with multiple features but it does not become apparent why we even need a person-centric simulation model next to the standard S(E)IR models readily available. Also, I would check if all the citations to prior versions of the model are really needed.

It is also not clearly evident what the section on related work really wants to achieve. A clear motivation of agent-based models would be more than enough but the different types of mobility data should rather be discussed in the Annex.

-Senozon method: Some illustrative examples with a Figure akin to Fig. 1 would be a good addition to the explanation. Is for each person one mobility pattern sample for each day? If so how is prediction performed, where no observations of mobility patterns are available? Also, the section would be more meaning full if some toy examples of an origin-destination matrix and the raw data could be provided. What percentage of people is included in the raw data? Some caveats on the uncertainty associated with the plausible classifications of activities would be fitting. Finally, to be it is not clear how people assumed to be meeting one another given two specific meeting patterns.

-While the title of the manuscript suggests that the focus lies on urban areas, in the text, we find repeated comments that complete Germany is modeled as well. Why are only the results for Berlin reported? Does the sample only include urban areas or what data sources are missing for rural areas?

-Does the simulation assume that each home, school etc has the same constant values named in Table 1? Is this reasonable? Even in Berlin, there is some type of income gap, that probably leads to more space per person in wealthy neighborhoods. How could this affect the findings of the article? A random distribution could be used as an alternative maybe even based on the ZIP code?

-Are the lines in Fig. 4 in- or out-of-sample predictions of the model? And is the blu line the number of people with symptom onset? If so why is it so far of especially in the first few months? In addition, the temporal path of the R-value (Fig. 7) compared to the officially reported value by the RKI would be great to see and assess the proposed model.

-Did I understand the reasoning behind the additive strucure of interventions correctly in that it is based on the definition as a sum over individual reinfections? If so this should be stated more clearly in the manuscript.

-Table 3 is not clear to me. The authors of each study should be directly named together with some classification of the used method. Further, the proposed model should be one column in the table. Also, the star at 0 is not clear to me, how can 0 be significantly different from 0?

Reviewer #2: The manuscript “predicting the effects of covid-19 related interventions in urban settings by combining activity-based modeling, agent-based simulation, and mobile phone data” presents a novel approach to simulate the spread of epidemics in urban environments. As opposed to compartmental models, the model presented in the manuscript presents a personalized spatially explicit framework to track the contact between individuals, the probability of infection in each contact, and the progression of the disease in those individuals who were infected. The model manages to predict the trend of the disease spread in the Berlin metropolitan area. As such, I enjoyed reading the manuscript, as it presents a breakthrough in modeling epidemics, by allowing the insertion of explicit and detailed policy measures.

However, the manuscript contains pitfalls and is often unclear. The authors must take this into account in order for the manuscript to make a valid and clear scientific claim regarding the spread of epidemics in urban environments.

Model fit:

The calibration process of the model manages to fit the model with the general trend of the number of cases and hospitalized patients in berlin. However, the manuscript doesn’t suggest any metric that evaluates the model fit. Even if this kind of metric is not the purpose of the authors’ model evaluation, figure 4 presents the data in a logarithmic scale, in order to show the fit of general trend. However, this type of presentation is a bit deceiving without explicitly informing the reader that the purpose of the figure is to compare the trend and not absolute numbers.

Simulating agents activity:

It is clear that the COVID-19 pandemic changed the travel habits of nearly the entire population, in both short-term and long-term habits. It seems that the simulation conducted takes into account only the short-term travel habits, as it does not simulate replanting of agents' plans of mode choice and destination choice, such as changing the activities to telecommuting and teleducation. Choosing a different mode of transport is very likely given lower capacity or frequency in PT systems, and it is possible to rerun the simulation every n days in order to observe what happens to the modal split. It is true that simulating destination choice is very complex and might not be possible using this framework, however the authors should mention this limitation. In addition, it is not clear from the paper what happens when a person's activity is removed - how does the rest of its plan change?

Section - Infection model:

1. page 8: Equation 1 is not clear enough. M is the sum of which other persons? Those in contact? The explanation is for 2 individuals or for multiple individuals?

2. CI is different for each type of activity, due to the fact that people exhale and inhale more air in different activities (talking, eating, working out etc.). Was this taken into account? Though it affects the wedding rate(sh) more than the actual contact intensity, it is reasonable to materialize it through CI.

Page 9:

The explanation for room capacity was hard to grasp. If I understand correctly, nspaces reflects the fact that there is a smaller chance for two people to meet, however if they do, the chance of infection is higher.

Page 10:

1. Table 1 is unclear. What term does each column come to represent? The air exchange low and high calculation is not detailed in the main body of the text, and is unclear.

2. The calculation of CI needs to be refined in an orderly fashion. I tried to manually calculate the values in table 1 and failed.

3. What is the transition probability from being infectious to showing symptoms? It does not appear in the text.

Sub section - Indoors/outdoors and second wave:

Page 13:

Did leisure activities in Berlin in fact take place outdoors, or that they were also restricted? What fraction of the activities do leisure activities take up?

“Too early” “not steep enough” please refer to the specific location in the graph, by annotating the graph or by explicitly mentioning the coordinates on the graph.

Sub section - Infections per activity type:

Page 13:

The share of each activity frequency in both count and total duration across all agents is important in order to fully understand figure 6.

Sub section - Reductions of R per intervention:

Page 16:

Table 2 - is R calculated over the entire period? A certain date? If so please specify when.

When reporting reference [3], you used percent reduction, however here you use additive reduction while explaining why it is better - you should point out the weakness in percent reduction when first mentioning it(page 2).

Sub section - Intuition for these results:

Page 17:

The intuition for decreasing marginal effect of interventions should be presented in a formal way.

A graphical explanation of intuition for the results would improve the understanding of that part.

General remarks regarding the organization of the paper:

I don’t fully understand the motivation of the authors to move the calibration section to the appendix. It seems that the calibration is a fundamental part of the methods section. Moreover, the presence of the calibration sections in the appendix makes it difficult for the reader to understand the various abbreviations and references that appear later on.

Additionally, the section that discusses simulation runs should be a part of the methods section. The “Decreasing marginal effect of interventions” and “Intuition for these results” subsections should be a part of the discussion, as they reflect upon the results and do not report them.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Decision Letter 1

Itzhak Benenson

10 Aug 2021

PONE-D-21-11422R1

Predicting the effects of COVID-19 related interventions in urban settings by combining activity-based modelling, agent-based simulation, and mobile phone data

PLOS ONE

Dear Dr. Nagel,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please react to the last remarks of the reviewer #1 trying to preserve the length of the paper or even making it shorter 

Please submit your revised manuscript by Sep 24 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Itzhak Benenson, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments (if provided):

Please react to the rest of the Reviewer #1 comments preserving the length of the paper or even making it shorter

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The revision addressed most of my raised points from my review. However, I still see some possible drawbacks of the proposed method, which I would like to be replied to before advising for its publication:

- I like the added section on Out-of-Sample prediction; however, some benchmark comparison to other models (SEIR or mechanistic models, like yesterday is today) would be interesting.

- Why was the RMSLE used for prediction? As far as I know, this is no standard measure. Why not use the RMSE or some proper scoring rule?

- Could you add some explanation or comments on two findings of the predictive assessment: Why is the prediction error in the first run under frozen activity levels so high? And it also seems as if updating the activity levels did, in fact, not lead to a better performance in all runs. What does this tell us about the model? Maybe some plots on how the activities changed in that time would be a starting point for the explanation?

- Generally, the size of the Figures should be consistent. Fig. 17 is a lot smaller than, e.g., Fig. 18

- While I admire the additional work put into extending the paper, I believe that 30 Pages is too long for the main article. I would put some parts of the discussion into the Appendix and most parts of the Appendix into an Online Supplementary Material (if this is possible with Plos One). Maybe also some of the cut discussion from the previous sections “Using mobile device data to observe changes of mobility behavior during COVID-19” and “From reductions of mobility behavior to reductions of infections” could be interesting in the Appendix of the Paper.

- For completeness, a short explanation of the Whitepaper of the Senozon Method in English (the language of the article itself) would be helpful. Besides, some rationale and legitimisation would benefit the article on why a model for transportation is also suitable for modelling COVID hospitalisation

Reviewer #2: I have read the authors reviewed manuscript and responses and find them satisfactory. Therefore, I recommend accepting the manuscript for publication.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Decision Letter 2

Itzhak Benenson

12 Oct 2021

Predicting the effects of COVID-19 related interventions in urban settings by combining activity-based modelling, agent-based simulation, and mobile phone data

PONE-D-21-11422R2

Dear Dr. Nagel,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Itzhak Benenson, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Itzhak Benenson

20 Oct 2021

PONE-D-21-11422R2

Predicting the effects of COVID-19 related interventions in urban settings by combining activity-based modelling, agent-based simulation, and mobile phone data

Dear Dr. Nagel:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor Itzhak Benenson

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Appendix. Appendix.

    (PDF)

    S1 Fig. Reduced activity participation by activity type.

    (PDF)

    S1 Text. Senozon method.

    (PDF)

    S2 Text. Model history.

    (PDF)

    S3 Text. Error metric for calibration.

    (PDF)

    S4 Text. Robustness runs.

    (PDF)

    S5 Text. Comparison to other models.

    (PDF)

    S6 Text. Making the model more realistic.

    (PDF)

    S1 File

    (PDF)

    Attachment

    Submitted filename: Response_to_Reviewers.pdf

    Attachment

    Submitted filename: 2021-09-25 PLOS answer to referees.pdf

    Data Availability Statement

    For computer code see https://github.com/matsim-org/matsim-episim. Simulations were computed with version d16656f076640124de0361fc327d3803a80aa466 of the code, started with command java -jar matsim-episim-1.0-SNAPSHOT.jar runParallel --setup org.matsim.run.batch.BerlinSensitivityRuns --params org.matsim.run.batch.BerlinSensitivityRuns$Params. The input data (including the synthetic mobility traces) are made public here: https://doi.org/10.14279/depositonce-11495. The output data used for the figures can be retrieved at: http://dx.doi.org/10.14279/depositonce-12113.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES