Skip to main content
PLOS One logoLink to PLOS One
. 2021 Apr 29;16(4):e0250778. doi: 10.1371/journal.pone.0250778

Current trends in the application of causal inference methods to pooled longitudinal observational infectious disease studies—A protocol for a methodological systematic review

Heather Hufstedler 1,*, Ellicott C Matthay 2, Sabahat Rahman 3, Valentijn M T de Jong 4, Harlan Campbell 5, Paul Gustafson 5, Thomas Debray 4,6, Thomas Jaenisch 1,7,8, Lauren Maxwell 1, Till Bärnighausen 1,9
Editor: Tim Mathes10
PMCID: PMC8084147  PMID: 33914795

Abstract

Introduction

Pooling (or combining) and analysing observational, longitudinal data at the individual level facilitates inference through increased sample sizes, allowing for joint estimation of study- and individual-level exposure variables, and better enabling the assessment of rare exposures and diseases. Empirical studies leveraging such methods when randomization is unethical or impractical have grown in the health sciences in recent years. The adoption of so-called “causal” methods to account for both/either measured and/or unmeasured confounders is an important addition to the methodological toolkit for understanding the distribution, progression, and consequences of infectious diseases (IDs) and interventions on IDs. In the face of the Covid-19 pandemic and in the absence of systematic randomization of exposures or interventions, the value of these methods is even more apparent. Yet to our knowledge, no studies have assessed how causal methods involving pooling individual-level, observational, longitudinal data are being applied in ID-related research. In this systematic review, we assess how these methods are used and reported in ID-related research over the last 10 years. Findings will facilitate evaluation of trends of causal methods for ID research and lead to concrete recommendations for how to apply these methods where gaps in methodological rigor are identified.

Methods and analysis

We will apply MeSH and text terms to identify relevant studies from EBSCO (Academic Search Complete, Business Source Premier, CINAHL, EconLit with Full Text, PsychINFO), EMBASE, PubMed, and Web of Science. Eligible studies are those that apply causal methods to account for confounding when assessing the effects of an intervention or exposure on an ID-related outcome using pooled, individual-level data from 2 or more longitudinal, observational studies. Titles, abstracts, and full-text articles, will be independently screened by two reviewers using Covidence software. Discrepancies will be resolved by a third reviewer. This systematic review protocol has been registered with PROSPERO (CRD42020204104).

Introduction

The field of medicine has relied heavily on randomized control trials (RCTs) to infer causality. Though considered the gold standard for causal inference, randomization can be unethical or impractical and so cannot always be used to infer causality at the population level. RCTs also often lack external validity [1, 2]—a crucial element for developing evidence-based public health policy. Longitudinal observational research designs offer the opportunity to gather data on a greater number of people over a larger span of time than most longitudinal RCTs. Longitudinal observational studies facilitate the evaluation of interventions where randomization is not ethical, making them invaluable to public health efforts.

In population science, a large sample size is required. A large sample size is also generally required to examine rare exposures/treatments, as is often the case with infectious disease (ID) research. However, conducting single cohort studies large enough to reach such a large sample size can be too expensive and time-consuming. To overcome this problem, scientists often pool data from numerous studies. While the pooling of both aggregate data (AD) and individual patient data (IPD) are valuable, the pooling of IPD can yield more reliable results than AD [3, 4]. Additionally, pooled analyses across diverse cohorts can offer greater variability in exposure and outcome measures, thereby enhancing power and the ability to detect meaningful associations.

In analysing observational data, the famous phrase ‘correlation does not imply causation’ has hindered the use of causal language: academic journals and peers alike often discourage the use of causal terminology. And, although there has been much work done in the field of causal inference with observational data over the last decades [57], many authors still use terms that skirt the issue, employing terms like ‘link’ or ‘associated with’ [7]. However, as some point out, “the proscription against the C-word is harmful to science…”[7].

To infer causality with regards to the health effects of exposures/treatments, health researchers have recently adopted methods, some of which originated in economics, political science, and psychology. Growing use of these methods in epidemiology can enhance the internal validity while maintaining the value of an observational cohort’s external validity. They do so by improving our ability to control for observed and/or unobserved confounders. These causal methods include but are not limited to instrumental variables (IV), which, in simple terms, ‘looks for a randomized experiment embedded in the observational study’ [8]; propensity scores (PS), which can be implemented in several ways, including weighting, matching, or subclassification, e.g., to adjust for covariates, allowing the exposed and unexposed to be more comparable; difference-in-differences (DiD) models are well-suited for pre-/post-interventions or data with shocks in between; and regression models are widely used in medicine to control for observed confounding.

Applications of these methods listed can be seen in the examination of the impact of infectious disease specialist referrals on health outcomes in France where researchers used IV to tackle methodological issues like selection bias and endogeneity [9], and in one study concerned with the effectiveness of a dengue intervention, researchers used propensity score matching to ‘match each treated day with one not treated’, a difference-in-difference (DiD) model to examine the ‘differences between numbers of dengue cases among scaling up phases’, and a linear regression model to estimate the effectiveness of the intervention in the presence of associations between sociodemographic factors’ [10].

Each of these causal methods has assumptions (or, conditions) which must be satisfied in order for the method to yield reliable results: e.g. for IV, relevance, exclusion restriction, exchangeability, and monotonicity or homogeneity; and for propensity score methods (PS), exchangeability, consistency, and positivity. Some of the assumptions, or conditions, required for these methods are testable. Some of the assumptions, or conditions, required for these methods are testable. Some of them, though, are untestable, and requires one to evaluate the feasibility of them, often relying on prior literature, theory, causal models, or background knowledge. Discussing the testing of testable assumptions and evaluation of untestable assumptions in the published research article allows the reader to better understand the rigor with which a researcher approached the issue(s). Although similar to multi-centre single cohort studies, implementing causal methodologies with IPD from several cohorts involves a slightly different process and can be more complex, particularly when accounting for, e.g., differences in types of variables that are captured and the ways they are measured, more extreme heterogeneity in cohort composition, or missing data [11, 12]. While these and other causal inference methods have been useful to the study of infectious disease, it is not well-understood how often these methods are being used, in what ways, whether they are being applied rigorously, whether there are gaps in the reporting or application of these methods, or how these factors have changed over time. One study has reviewed the application of causal inference methods applied to time-dependent confounders and also examined which questions are being investigated using non-randomized exposure variables in cohort data deriving from RCTs. They found that the most commonly-implemented method was marginal structural models (MSM) with inverse probability of treatment weighting (IPTW), with the most common question type being the effect of concomitant medication [13]. However, to our knowledge, there exists no methodological review of the application and reporting of causal methods to pooled, observational, longitudinal infectious disease-related studies.

In this systematic review, we seek to fill this gap. We will search the literature (EBSCO, EMBASE, PubMed and Web of Science) using a combination of MeSH and text terms. We will look for infectious disease studies that pool data from 2+ studies. Applying modern causal inference methods to pooled, longitudinal, observational data has the potential to offer important insights into the causes and consequences of infectious diseases. In the face of the Covid-19 pandemic and in the absence of systematic randomization of exposures or interventions, the saliency of these methods is even more apparent. In the short-term, we hope that this review can inform those researchers who are currently analysing observational data with the hope of inferring causality. In the long-term we expect that findings from this review will lead to concrete recommendations for the conduct and reporting of causal inference methods applied to pooled, longitudinal, observational data in ID applications.

Research question

What are the trends in the conduct and reporting of causal inference methods in ID-related studies using longitudinal, observational data pooled at the participant-level from multiple studies?.

Hypotheses

We hypothesize that the use of novel and modern causal inference methods has increased in observational ID studies over the last 10 years, but that studies will largely fail to report on key aspects of these methods that are necessary to evaluate their application. For example, we expect that reporting on the quantitative evaluation of assumptions for a given casual inference method (e.g. positivity assumption) will be lacking.

Materials and methods

Overview

The protocol for this systematic review has been registered with PROSPERO (CRD42020204104). Our study will test our hypotheses by conducting a systematic review and examine recent trends in the conduct and reporting of causal inference methodology in longitudinal, observational studies pooling data from multiple ID cohorts at the individual level. We will define ‘recent’ as the last 10 years (2009–2019), but will, due to capacity, limit the search to include studies published at three timepoints 2009, 2014, and 2019.

Data collection procedures

The following databases will be searched using a combination of MeSH and text terms that is tailored for each database (see S1 Table):

  1. EBSCO (including Academic Search Complete, Business Source Premier, CINAHL, EconLit, and PsycINFO)

  2. EMBASE

  3. PubMed

  4. Web of Science

We will include studies that 1) used participant-level data, 2) pooled data from longitudinal, observational cohorts in any location; 3) were focused on infectious disease-related outcomes, 4) estimated a causal effect related to a stated causal question (or, what is interpreted by reviewers as a research study motivated by a causal question; see reference to avoidance of causal language in the introduction), 5) are published in the years 2009, 2014, and 2019 (if there is more than one publication date, we will use the electronic publication date), and 6) have full-text accessible through open access, university license, another collaborator on the project, or by requesting the article directly from the authors. We will also include studies that draw data from RCTs if: a) at least one data source pooled in the analysis is drawn from an observational cohort, or b) the study includes only RCTs but at least one exposure or treatment variable analysed was not that which was randomized.

We will exclude studies that exclusively draw data from RCTs to evaluate randomized exposures or treatments. We will also exclude studies using data from a single-centre cohort or multi-centre single-cohort. We will also exclude studies that: 1) do not employ longitudinal data, 2) estimated an effect size that does not correspond to a research question with the goal of inferring causality (e.g. the study is descriptive or focused on prediction), 3) non-human studies, 4) protocols, reviews, commentaries, corrections, editorial, erratum, and 5) studies focused on description, prediction, or prognostics.

Variables

We will extract data on the causal inference designs and methods applied in each study, the quality of reporting on the methods which were applied, and study meta-data (sample size, geographic location of data collection, discipline of parent study, health outcome studied, funding source), etc. See S2 Table for full data extraction sheet.

Examples of data to be collected are:

  • Did the authors take any approach(es) to account for differences in variable definitions and data quality across individual cohorts (e.g. any stated information about harmonization efforts) or statistical methods (e.g. adopting measurement error methods)?

  • How did the authors deal with missing data within and across studies (e.g. multilevel multiple imputation, or separate imputation for each dataset, or complete case analysis)?

  • Do the authors report testing any of the assumptions required for the analysis methods they have chosen to pool the data? Which ones?

  • What approach(es) did the authors apply to account for clustering and heterogeneity at the cohort or study level (whichever units are pooled across)? Did the authors adopt a one-stage or two-stage approach?

  • Did the authors explicitly state and test the assumptions that are required for methods used to account for clustering and heterogeneity?

  • Which causal methods were applied to the pooled data to make causal inferences?

  • Do the authors explicitly state or report testing any of the assumptions required for the analysis methods they have chosen to deliver causal effects?

  • For untestable assumptions (e.g. unmeasured confounding), did the authors do anything to evaluate the plausibility of those assumptions (e.g. negative control exposures or outcomes, quantitative bias analysis)?

  • Did the authors discuss heterogeneity of estimated causal effects and the possible impact on the generalizability of research findings?

Main outcomes

This is a methodological systematic review designed to establish what causal inference methods are used and how they are reported in studies that use longitudinal data from multiple cohorts (such as pooled cohort studies and individual patient data-meta analyses (IPD-MAs)). Expected outcomes of the review are to establish:

  1. Causal inference methods applied in studies using data from multiple cohorts (e.g. instrumental variable approaches, including Mendelian randomization; regression discontinuity; interrupted time series; panel fixed effects; difference-in-differences; G-estimation; multiple regression; propensity score matching; inverse probability of treatment weighting; etc.)

  2. Approaches to account for heterogeneity and clustering of the outcome by cohort or data source

  3. Approaches to account for differences in measured variables, data quality, or missingness across cohorts

  4. Approaches to discussion of methods and the motivating factor(s) in their selection

  5. Practices regarding testing of any required assumptions for the chosen causal inference method

  6. Reporting standards for studies applying causal inference methods to longitudinal data pooled across multiple cohorts

Analysis plan

Study records will be uploaded to Covidence [14] and deduplicated. Two reviewers will independently conduct the title/abstract screening in Covidence, and discrepancies will be resolved by a third reviewer. For results flagged as meeting inclusion criteria or uncertain if they meet inclusion criteria, all efforts will be made to access the full-text through databases, university access and collaborators’ connections, or requesting the articles directly from the authors. Two reviewers will complete the full-text review. Any discrepancies during the full-text review will be resolved by a third reviewer. One author will extract the data using the data extraction form. The screening process will be documented in a PRISMA flow chart.

We will conduct a narrative summary, and present the results in text, tables and figures. We will summarize trends in all variables collected by reporting frequencies over time. For example, we will summarize trends in causal inference methods by tabulating how frequently each method is used over time (2009, 2014, 2019) and by journal discipline (e.g. economics versus public health). We will also evaluate whether the authors presented sufficient detail on the causal inference method employed and the testing of assumptions required for the methods employed. No meta-analysis of the included studies is planned.

Supporting information

S1 Checklist

(DOCX)

S1 Table

(PDF)

S2 Table

(PDF)

Acknowledgments

Membership of the ReCoDID Consortium is available at www.recodid.eu.

Data Availability

All relevant data from this study will be made available upon study completion.

Funding Statement

This project is funded through the RECODID study, which has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under Grant Agreement No. 825746 and the Canadian Institutes of Health Research, Institute of Genetics (CIHR-IG) under Grant Agreement N.01886-000.

References

  • 1.Cartwright N. Are RCTs the Gold Standard? Biosocieties. 2007;2(1):11–20. [Google Scholar]
  • 2.Cartwright N, Munro E. The limitations of randomized controlled trials in predicting effectiveness. J Eval Clin Pract. 2010;16(2):260–6. 10.1111/j.1365-2753.2010.01382.x [DOI] [PubMed] [Google Scholar]
  • 3.Tierney JF, Fisher DJ, Burdett S, Stewart LA, Parmar MKB. Comparison of aggregate and individual participant data approaches to meta-analysis of randomised trials: An observational study. PLoS Med. 2020;17(1):1–22. 10.1371/journal.pmed.1003019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Tudor Smith C, Marcucci M, Nolan S, Iorio A, Sudell M, Riley R, et al. Individual participant data meta-analyses compared with meta-analyses based on aggregate data (Review). 2016; [DOI] [PMC free article] [PubMed]
  • 5.Rubin D. B. Estimating causal effects of treatment in randomized and nonrandomized studies. J Educ Psychol [Internet]. 1974;66(5):688–701. Available from: http://www.fsb.muohio.edu/lij14/420_paper_Rubin74.pdf [Google Scholar]
  • 6.Halloran ME, Struchiner CJ, Epidemiology S, Mar N, Halbran ME, Struchiner CJ. Causal Inference in Infectious Diseases. 1995;6(2):142–51. [DOI] [PubMed] [Google Scholar]
  • 7.Hernan M, Robins J. Causal Inference: What If. Boca Raton: Chapman & Hall/CRC; 2020. [Google Scholar]
  • 8.Baiocchi M, Cheng J, Small DS. Instrumental variable methods of causal inference. Stat Med [Internet]. 2014;33(13):2297–340. Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4201653/ [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Sasikumar M, Boyer S, Remacle-Bonnet A, Ventelou B, Brouqui P. The value of specialist care—infectious disease specialist referrals—why and for whom? A retrospective cohort study in a French tertiary hospital. Eur J Clin Microbiol Infect Dis. 2017;36(4):625–33. 10.1007/s10096-016-2838-y [DOI] [PubMed] [Google Scholar]
  • 10.Quintero J, Pulido NR, Logan J, Ant T, Bruce J, Carrasquilla G. Effectiveness of an intervention for Aedes aegypti control scaled-up under an intersectoral approach in a Colombian city hyperendemic for dengue virus. PLoS One. 2020;15(4):1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lesko CR, Jacobson LP, Althoff KN, Abraham AG, Gange SJ, Moore RD, et al. Collaborative, pooled and harmonized study designs for epidemiologic research: challenges and opportunities. 2018;(February):654–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Resche-Rigon M, White IR. Multiple imputation by chained equations for systematically and sporadically missing multilevel data. Stat Methods Med Res [Internet]. 2018;27(6):1634–49. Available from: http://ovidsp.ovid.com/ovidweb.cgi?T=JS&PAGE=reference&D=medl&NEWS=N&AN=27647809 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Farmer RE, Kounali D, Walker AS, Savović J, Richards A, May MT, et al. Application of causal inference methods in the analyses of randomised controlled trials: A systematic review. Trials. 2018;19(1):1–14. 10.1186/s13063-017-2423-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Covidence systematic review software [Internet]. Melbourne, Australia: Veritas Health Innovation; www.covidence.org

Decision Letter 0

Tim Mathes

22 Feb 2021

PONE-D-20-34304

Current Trends in the Application of Causal Inference Methods to Pooled Longitudinal Observational Infectious Disease Studies -- A Protocol for a Methodological Systematic Review

PLOS ONE

Dear Dr. Hufstedler,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Mar 12 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Tim Mathes

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Does the manuscript provide a valid rationale for the proposed study, with clearly identified and justified research questions?

The research question outlined is expected to address a valid academic problem or topic and contribute to the base of knowledge in the field.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

2. Is the protocol technically sound and planned in a manner that will lead to a meaningful outcome and allow testing the stated hypotheses?

The manuscript should describe the methods in sufficient detail to prevent undisclosed flexibility in the experimental procedure or analysis pipeline, including sufficient outcome-neutral conditions (e.g. necessary controls, absence of floor or ceiling effects) to test the proposed hypotheses and a statistical power analysis where applicable. As there may be aspects of the methodology and analysis which can only be refined once the work is undertaken, authors should outline potential assumptions and explicitly describe what aspects of the proposed analyses, if any, are exploratory.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Is the methodology feasible and described in sufficient detail to allow the work to be replicable?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors described where all data underlying the findings will be made available when the study is complete?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception, at the time of publication. The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above and, if applicable, provide comments about issues authors must address before this protocol can be accepted for publication. You may also include additional comments for the author, including concerns about research or publication ethics.

You may also provide optional suggestions and comments to authors that they might find helpful in planning their study.

(Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript reports on a protocol for a methodological review on how causal inference methods are used in analyzing pooled longitudinal observational studies in infections diseases. The protocol is complete and clear. The complete search strategy and data extraction forms are attached as supplementary material.

Major comments

1. I cannot make any guess on how many articles will be retrieved, but I thought that limiting the search to three years (2009, 2014, 2019) may yield a limited number of studies included. Are there fallback solutions to extend the search is the number of studies would be relatively small? (I acknowledge that this leaves the definition of a “relatively small” number unanswered).

2. I also wondered whether the search strategy would be specific enough to spare a lot of screening time. I am not a specialist of building search equations, but I can only underline the risks of screening hundreds (thousands?) of articles on tile and abstract to include a small number at the end.

3. I understand specificities of pooled data analyses, in particular regarding the need to account for heterogeneity between cohorts, but to some extent this is not markedly different from heterogeneity between facilities in a multicenter single cohort study (that are excluded), at least from a statistical point-of-view. There may be important differences with e.g. systematically missing data that are not recorded in a specific study that require specific methods (e.g. Resche-Rigon, White. Stat Methods Med Res 2018;27:1634-1649), and the paper also mentions approaches to account .for differences in data quality or definitions. But some more discussion on why focus only on pooled analyses could be given.

Minor comments

1. The meaning of “pooling” (that I understood as pooling different observational studies together) could be clarified in the abstract.

2. Is there really a need for both IV and IVA acronyms? The first one is largely sufficient.

Reviewer #2: This manuscript is a protocol for a methodological systematic review the application of causal inference methods to Pooled longitudinal observational infectious disease studies. No statistical meta-analysis will be needed. This study will be a descriptive summary of the publications that applied causal inference methods. It will be interested in looking at the summary rather than this protocol.

Lines 153-154, why do you select studies published at three timepoints instead of all related publications over the last 15 years?

Reviewer #3: Manuscript ID: PONE-D-20-34304

Title: Current Trends in the Application of Causal Inference Methods to Pooled Longitudinal Observational Infectious Disease Studies -- A Protocol for a Methodological Systematic Review

I thank both the Editor and the Authors for the opportunity to review the above-mentioned manuscript.

In brief, this paper reports the protocol for an ambitious review on causal inference methods applied to pooled observational data in infectious disease studies. The paper promises a useful and most-needed systematic review, which will help researchers and clinicians to better understand the respective usages and pros/cons of the different methods under consideration. While I look forward to the results of this systematic review, I have a couple of very minor comments for this protocol. I hope this will help the Authors clarify some points.

***Abstract:

- Line 39: The term “individual-level effects” can be confusing for causal inference researchers, as it may refer to “individual treatment effects” (i.e. the causal effect of an exposure/intervention in a particular individual; note, this quantity is unmeasurable.) I suppose the Authors refer to the individual-study-level effects instead;

- Line 42: Causal methods can address both/either unmeasured and/or measured confounders;

- Line 44: I understand a big C-something has been invading our lives since last year, but I find it a bit out of context since this systematic review includes studies from 2009, 2014 and 2019 – and not 2020, as I understand.

***Introduction:

- The Authors could add one or two sentences on the different assumptions needed for the causal methods under consideration (e.g. IV methods addressed unmeasured confounders but require specific IV assumptions [relevance, no shared causes with the outcome and exclusion restriction]; PS methods rely on the assumption of no unmeasured confounders and require positivity);

- Line 104: The brief one-sentence summary on PS methods is vague and incorrect: PS methods do not necessarily rely on matching (e.g. weighting, standardization, etc. can be applied too) to provide covariate balance;

- Line 119-120: the sentence “with non-randomized exposure data from RCTs” sounds paradoxical. Could the Authors reformulate?

***Methods:

- Line 166: The Authors state that they will include studies that “4) estimated a causal effect related to a stated causal question”. While it sounds fair, I fear that causal questions are unfortunately too rarely explicitly stated in practice (see Hernan MA. The C-Word: Scientific Euphemisms Do Not Improve Causal Inference From Observational Data. Am J Public Health. 2018;108(5):616-619.)

- Amongst the bullet points for the extraction, I would also be interested in the covariate selection for adjustment (e.g. Did the study explicitly specify that covariates adjusted for were confounders [and not mediators] of the causal relationship?)

***Other comments:

- I found some typos throughout, and some sentences quite long and heavy to read (in particular, in the introduction).

- Some key references about causal inference methods would be appreciated – e.g. Hernan and Robins book or others.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Apr 29;16(4):e0250778. doi: 10.1371/journal.pone.0250778.r002

Author response to Decision Letter 0


24 Mar 2021

Thank you very much for the constructive as well as positive feedback. I believe we have addressed all of the constructive feedback in a letter attached to the submission. As I said in the comment previously, I believe we have addressed everything except for the request to change our citations from parentheses to brackets. We have tried for the better part of two days without success, and have chosen to submit without this change as to not miss the new deadline. Our apologies, and I hope that you can assist us in aligning with your formatting requirements. Otherwise, we hope we have addressed every other suggestion or comment satisfactorily. Thank you.

Attachment

Submitted filename: Response to Reviewers_ PlosOne_ 19March2021.docx

Decision Letter 1

Tim Mathes

7 Apr 2021

PONE-D-20-34304R1

Current Trends in the Application of Causal Inference Methods to Pooled Longitudinal Observational Infectious Disease Studies -- A Protocol for a Methodological Systematic Review

PLOS ONE

Dear Dr. Hufstedler,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please check correctness of citation and proofread the manuscript. 

Please submit your revised manuscript by May 22 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Tim Mathes

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Does the manuscript provide a valid rationale for the proposed study, with clearly identified and justified research questions?

The research question outlined is expected to address a valid academic problem or topic and contribute to the base of knowledge in the field.

Reviewer #2: Yes

Reviewer #3: Yes

**********

2. Is the protocol technically sound and planned in a manner that will lead to a meaningful outcome and allow testing the stated hypotheses?

The manuscript should describe the methods in sufficient detail to prevent undisclosed flexibility in the experimental procedure or analysis pipeline, including sufficient outcome-neutral conditions (e.g. necessary controls, absence of floor or ceiling effects) to test the proposed hypotheses and a statistical power analysis where applicable. As there may be aspects of the methodology and analysis which can only be refined once the work is undertaken, authors should outline potential assumptions and explicitly describe what aspects of the proposed analyses, if any, are exploratory.

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Is the methodology feasible and described in sufficient detail to allow the work to be replicable?

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors described where all data underlying the findings will be made available when the study is complete?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception, at the time of publication. The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above and, if applicable, provide comments about issues authors must address before this protocol can be accepted for publication. You may also include additional comments for the author, including concerns about research or publication ethics.

You may also provide optional suggestions and comments to authors that they might find helpful in planning their study.

(Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: My comments have been addressed.

Reviewer #3: I thank the Authors for addressing my previous comments/suggestions. I enjoyed reading the revised manuscript, which I think reads very well.

I have a very minor remark, which may however have its own importance: something wrong may have happened with the editing of the references in the revised manuscript - some citations have been deleted in the main text (in particular, Introduction). Could the Authors amend this, and double-check/proof-read before submitting?

I have no further comments other than this one. I thank the Authors for their work.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Reviewer #3: Yes: T.L. Nguyen

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Apr 29;16(4):e0250778. doi: 10.1371/journal.pone.0250778.r004

Author response to Decision Letter 1


9 Apr 2021

Thank you for the the opportunity to correct and re-submit our protocol. We have added the two citations which were removed in the introduction, and I believe we have caught both instances of the extra periods, commas, or duplicate parentheses and brackets, as well as the 2 instances where PubMed was written as Pubmed. Thank you.

Attachment

Submitted filename: Response to Reviewers_ PlosOne_ 09April2021.docx

Decision Letter 2

Tim Mathes

14 Apr 2021

Current Trends in the Application of Causal Inference Methods to Pooled Longitudinal Observational Infectious Disease Studies -- A Protocol for a Methodological Systematic Review

PONE-D-20-34304R2

Dear Dr. Hufstedler,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Tim Mathes

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Tim Mathes

20 Apr 2021

PONE-D-20-34304R2

Current Trends in the Application of Causal Inference Methods to Pooled Longitudinal Observational Infectious Disease Studies -- A Protocol for a Methodological Systematic Review

Dear Dr. Hufstedler:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Tim Mathes

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Checklist

    (DOCX)

    S1 Table

    (PDF)

    S2 Table

    (PDF)

    Attachment

    Submitted filename: Response to Reviewers_ PlosOne_ 19March2021.docx

    Attachment

    Submitted filename: Response to Reviewers_ PlosOne_ 09April2021.docx

    Data Availability Statement

    All relevant data from this study will be made available upon study completion.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES