Abstract
Background
Longitudinal research is subject to participant attrition. Systemic differences between retained participants and those lost to attrition potentially bias prevalence of outcomes, as well as exposure-outcome associations. This study examines the impact of attrition on the prevalence of child injury outcomes and the association between sociodemographic factors and child injury.
Methods
Participants were recruited as part of the Environments for Healthy Living (EFHL) birth cohort study. Baseline data were drawn from maternal surveys. Child injury outcome data were extracted from hospital records, 2006–2013. Participant attrition status was assessed up to 2014. Rates of injury-related episodes of care were calculated, taking into account exposure time and Poisson regression was performed to estimate exposure-outcome associations.
Results
Of the 2222 participating families, 799 families (36.0%) had complete follow-up data. Those with incomplete data included 137 (6.2%) who withdrew, 308 (13.8%) were lost to follow-up and 978 families (44.0%) who were partial/non-responders. Families of lower socioeconomic status were less likely to have complete follow-up data (p<0.05). Systematic differences in attrition did not result in differential child injury outcomes or significant differences between the attrition and non-attrition groups in risk factor effect estimates. Participants who withdrew were the only group to demonstrate differences in child injury outcomes.
Conclusion
This research suggests that even with considerable attrition, if the proportion of participants who withdraw is minimal, overall attrition is unlikely to affect the population prevalence estimate of child injury or measures of association between sociodemographic factors and child injury.
Keywords: Attrition, child injury, epidemiology, longitudinal research, data linkage, methods
Strengths and limitations of this study.
Uses a mixed active-passive cohort design from a birth cohort study that employed both data linkage and direct participant follow-up to ascertain comprehensive exposure and outcome data.
Ability to examine impact of participant attrition on outcome estimates using complete data.
Relatively small sample size. The adjusted model removed significance even though there was little change in the point estimate and could represent a type 2 error.
Introduction
Longitudinal birth cohort studies are invaluable for exploring the impact of early environments (physical, social and economic) on a child’s ongoing health and development.1 One of the greatest threats to the validity of this research design is bias due to participant attrition.2 Attrition can occur as a result of a participant actively withdrawing from the study, relocating and becoming lost to follow-up or not responding to follow-up waves.
Research examining attrition in longitudinal studies has identified systemic differences between responders and non-responders across a range of individual, family, social, demographic and health characteristics, with disadvantaged populations generally having higher rates of attrition.3–8 Systemic characteristic differences have also been found between participant groups according to the reason for attrition.6 9 10 In line with epidemiological principles, it is anticipated that these differences will result in a bias of outcome prevalence and exposure-outcome associations. However, while evidence of a link between attrition and prevalence bias has been reported in the literature, there is less evidence of attrition leading to bias in exposure-outcome associations, and the bias effect can vary depending on the nature of the outcomes under investigation.3–5 7 11
Within the context of longitudinal birth studies, the impact that attrition bias has on child injury outcomes is an important area for further research. To date, the majority of attrition-injury studies have focused on participants recruited after injury has already occurred, or older populations.12–22 In these studies, attrition has been linked to socioeconomic and participant characteristics, causes of injury, injury severity and treatment plans. Extending this research to include longitudinal research on outcomes for young children in a broader population will greatly enhance our understanding of this relationship.
Using data linkage of routinely collected administrative health records with participant survey data provides researchers new opportunities to assess the impact of attrition bias on participant outcomes in prospective longitudinal studies.23 Data linkage can provide comprehensive access to outcomes data for the total cohort group, allowing comparisons of outcomes to be made between participants who are active and those who have been lost from ‘active’ (survey) follow-up.
In this manuscript, we report the use of a mixed direct participant contact (including surveys) and linked data design in a longitudinal birth cohort study to examine the impact of participant attrition bias on child injury outcome estimates. In particular, the research will 1) identify whether there are systematic differences in attrition and 2) examine the impact of attrition on both the prevalence of child injury outcomes and the association between sociodemographic variables and child injury.
Methods
Study design
This cohort study combined survey and linked administrative health data.
Environments for Healthy Living (EFHL): Griffith Birth Cohort Study
The EFHL study is a longitudinal birth cohort which investigates how social, environmental, neighbourhood, family, maternal and individual factors impact a child’s health and development.24 Pregnant women in their third trimester were recruited between 2006 and 2011 at one of three public maternity hospitals in South East Queensland and Northern New South Wales. Participants completed a questionnaire at baseline and then follow-up surveys were sent 1, 3 and 5 years after the birth of the child. In between these scheduled contacts, regular newsletters were sent and additional substudy project contacts occurred. Returned mail or contact difficulties triggered the use of alternative contact mechanisms supplied at enrolment and updated at follow-ups, including a relative or friends contact details, email and Facebook.24 25The EFHL sample was largely characteristic of births in the region, however, consistent with the public hospital setting from which participants were recruited, there was a higher than national average representation of families with lower incomes, younger maternal age, more overseas born parents and higher proportions of maternal smoking in pregnancy.25 The study methodology and baseline characteristics of the sample have been comprehensively described in other papers.24 25 EFHL has been included on the Australian and New Zealand Clinical Trials Registry (ACTRN12310000931077).
Participants
Participants were families enrolled in the EFHL project at one of the two Queensland based public hospitals in the region (Logan or Gold Coast Hospital). This allowed direct comparison to Queensland Health child hospital records. Participants were selected from those recruited in the years 2006–2010 only, as these cohorts had completed more than one follow-up survey. Cases of maternal and child death were excluded from the analysis as these participants had died (which is a known outcome) and were not lost to follow-up.
Ethical considerations
Ethics approval was provided by the participating hospitals and by the Human Research Ethics Committees of Griffith University and the University of Queensland.
Data sharing
No additional data available.
Participant consent
Written, informed maternal consent for data used in this analysis was obtained for (1) direct participant contact including the completion of baseline and follow-up surveys; (2) release of hospital perinatal records and (3) for data to be linked to the child’s administrative health records including hospital admission and emergency department (ED) presentations.
Data sources and collection
Baseline social, maternal and household variables
Baseline information was collected from hospital perinatal records and from a self-administered survey completed by women during pregnancy. This baseline survey covered a range of maternal, household and community factors. The child characteristics included in this analysis were gender and age, via length of follow-up. Maternal characteristics included the level of education completed; maternal age at baseline and whether the mother smoked during her pregnancy. Household factors included marital status, whether other children resided in the home, housing status (owned, rented, government/boarding house) and household income level. Household income was divided into quintiles.25
Cohort attrition status
Information on participant attrition status was collected from the EFHL participant tracking database. Records were kept on both the participant’s overall recruitment status, as well as on their participation in follow-up surveys up to May 2014.
Participants were categorised into two groups: (1) those with complete follow-up or (2) those with incomplete follow-up. Participants with complete follow-up included those that had returned all follow-up surveys relevant to their age group and who had not withdrawn during the lifetime of the project.
Participants with incomplete follow-up were categorised into one of three groups. (1) The ‘withdrawn’ group included any participant who had actively asked to be withdrawn from direct participant contact and ongoing survey completion. Consent was retained for the use of existing data and linked administrative data. (2) ‘Lost to follow-up’ included those participants who did not respond to follow-up surveys and who could no longer be contacted through their home address, phone, email or emergency contact addresses. (3) ‘Partial or non-responders’ included people who were locatable, had not withdrawn, but had not completed one or more follow-up surveys beyond the initial baseline survey.
Administrative health data
The child’s health records were extracted between 2006 and 2013 from the Queensland Emergency Department Information System (EDIS) and from the Queensland Hospital Admitted Patients Data Collection (QHAPDC). The matching procedure was undertaken by the Queensland Department of Health (the custodians of the data) using linkage software, based on deterministic and probabilistic methods, to link demographic, child and maternal information to hospital records. Deterministic linkage involves the linking of data sets through comparing fields such as name, year of birth and street name with the requirement that the records agree on all characters. Probabilistic linkage involves the use of statistical models and algorithms to estimate the probability of data from different data sets having commonality (eg, the same person/event). Clerical reviews of the data were undertaken to manually inspect uncertain matches in probabilistic linkage.26
Outcome measure
Injury related to a hospital or ED episode of care was the outcome of interest. Multiple admissions, nested admissions and any corresponding hospital or ED presentations relating to the one injury event were identified through dates of presentations, transfer codes combined with diagnostic fields. These records were subsequently collapsed into one episode of care for the injury event.
Injury classification
Injury presentations were classified using International Statistical Classification of Diseases (ICD)-10-AM, Chapter 19 Injury and Poisonings (S00-T98) and Chapter 20 External Causes (U50-Y98), with late effects from injury excluded. EDIS data contain only a single diagnostic field in which to describe the reason for presentation, whereas QHAPDC contains a primary diagnosis field and multiple other diagnostic fields in which external causes and activity codes may be recorded. Researchers checked text descriptions in the EDIS data to identify any injuries that were not captured in the single diagnosis field. Almost all ED records used the one available diagnostic field to classify the nature of the injury (S00-T98). External cause codes (V01-Y98) were used in <5% of ED records. As a result, injury cause codes were only available comprehensively for the inpatient episodes of care. Injury subgroups were defined and matched to the ICD-10-AM chapter subcategories.
Calculation of person-years
The ages of the children and length of follow-up for injury-related hospital treatment varied considerably across participants due to the five recruitment waves. As such, individual person-years (PYs) of exposure time were calculated for each child, based on the time between birth and 31 December 2013 in which he or she was residing in the state of Queensland, alive and eligible for healthcare.
Analysis
Data cleaning and analyses were undertaken using SAS V.9.4 software. The statistical significance of differences between groups was assessed by Pearson’s χ2 test for categorical data. Using the state-wide linked administrative health data, injury-related episodes of care were obtained for all cohort participants. Rates of injury-related episodes of care were calculated, taking into account PYs exposure time, for each factor and by attrition status (rates/10 PYs).
Poisson regression was performed to estimate crude and adjusted rate ratios (RRs) between multiple exposure factors (child, maternal and household characteristics) and the outcome (count of child injury-related episodes of care). All factors significant in the univariate analyses associated with injury were included in the model. The log of individual PYs of exposure time was included as an offset. The final model included child gender, maternal age and maternal education. Complete case analysis was employed with two-sided significance set at a level of 5%.
Results
In total, 2222 families who enrolled in the EFHL study from 2006 to 2010, at the two participating Queensland hospitals, were included in this analysis. The number of children totalled 2245 (including 23 sets of twins). Maternal age ranged from 16 to 48 years with a mean age of 28.9 years (SD±5.98). At the time of enrolment, 14.1% of the households were sole parent families, 28.2% of maternal participants smoked cigarettes, 34.9% had no other children living in the household and 22.7% of the maternal participants had not completed secondary school.
Attrition status and follow-up
Of the 2222 participating families, 799 families (36.0%) had complete follow-up, 137 (6.2%) were withdrawn, 308 (13.8%) were lost to follow-up and 978 families (44.0%) were partial or non-responders (figure 1). There was no significant difference in the proportion of households with complete or incomplete follow-up across the five recruitment waves (p=0.20).
Follow-up consisted of a total of 11 908.3 PYs from birth to 7 years of age, with a mean 5.3 PYs per child (range 0–7). Automated linkage and manual searching by the state health department data linkage unit found records in QHAPDC for 97.1% of the child participants (n=2245), including their birth record.
Baseline demographic, household and child characteristics by attrition status
Baseline demographic and household characteristics differed significantly between participants who had complete follow-up compared with those families who had incomplete follow-up. Attrition families were more likely to have lower levels of maternal education, lower maternal age, have higher rates of maternal smoking, be single parent households, have more children residing in the home, have lower gross household incomes and live in rental or government boarding housing (all p<0.0001). However, there were no differences in the proportion of child gender for active families compared with attrition families (table 1).
Table 1.
Follow-up | p Value | Attrition by type† | ||||
Complete N (%) | Incomplete N (%) |
Withdrawn N (%) | Lost to follow-up N (%) | Partial/non-responders N (%) | ||
Maternal characteristics | 799 | 1423 | 137 | 308 | 978 | |
Maternal level of education missing=12 | ||||||
Tertiary degree | 192 (24.1) | 178 (12.6) | *** | 24 (17.5)*** | 34 (11.0)* | 120 (12.4) *** |
Secondary school or trade | 490 (61.6) | 849 (60.0) | 89 (65.0) | 182 (59.3) | 578 (59.6) | |
Not complete school | 114 (14.3) | 387 (27.4) | 24 (17.5) | 91 (29.6) | 272 (28.0) | |
Maternal age missing=3 | ||||||
35+ years | 201 (25.2) | 228 (16.1) | *** | 17 (12.4)*** | 53 (17.2)*** | 158 (16.2)*** |
30–34 years | 253 (31.7) | 308 (21.7) | 37 (27.0) | 56 (18.2) | 215 (22.0) | |
25–29 years | 228 (28.6) | 416 (29.3) | 48 (35.0) | 83 (26.9) | 285 (29.2) | |
<25 years | 116 (14.5) | 469 (33.0) | 35 (25.6) | 116 (37.7) | 318 (32.6) | |
Smoking during pregnancy missing=9 | ||||||
No | 640 (80.1) | 949 (67.1) | *** | 102 (74.5) | 87 (60.9)** | 660 (68.0) *** |
Yes | 159 (19.9) | 465 (32.9) | 35 (25.5) | 120 (39.1) | 310 (32.0) | |
Household characteristics | 799 | 1423 | 137 | 308 | 978 | |
Marital status missing=18 | ||||||
Two parent family | 732 (92.1) | 1161 (82.4) | *** | 106 (77.4)*** | 247 (81.2)*** | 808 (83.5)*** |
Sole parent family | 63 (7.9) | 248 (17.6) | 31 (22.6) | 57 (18.8) | 160 (16.5) | |
Other children in household missing=14 | ||||||
None | 299 (37.6) | 471 (33.4) | *** | 53 (39.0) | 97 (31.8)* | 321 (33.0)*** |
1–2 children | 431 (54.2) | 731 (51.7) | 70 (51.5) | 154 (50.5) | 507 (52.2) | |
3+ children | 65 (8.2) | 211 (14.9) | 13 (9.5) | 54 (17.7) | 144 (14.8) | |
Household income missing=354 | ||||||
Highest quintile | 170 (24.0) | 175 (15.1) | *** | 17 (15.2) * | 30 (12.1)*** | 128 (16.0)*** |
Fourth quintile | 154 (21.7) | 206 (17.8) | 31 (27.7) | 31 (12.4) | 144 (18.1) | |
Third quintile | 164 (23.1) | 211 (18.2) | 21 (18.7) | 42 (16.9) | 148 (18.6) | |
Second quintile | 129 (18.2) | 259 (22.3) | 19 (17.0) | 65 (26.1) | 175 (21.9) | |
Lowest quintile | 92 (13.0) | 308 (26.6) | 24 (21.4) | 81 (32.5) | 203 (25.4) | |
Home ownership missing=20 | ||||||
Own/mortgage | 427 (53.6) | 437 (31.1) | *** | 62 (45.3) | 49 (16.2)** | 326 (33.7)*** |
Rent | 297 (37.2) | 740 (52.7) | 57 (41.6) | 203 (67.2) | 480 (49.7) | |
Government/boarding house | 73 (9.2) | 228 (16.2) | 18 (13.1) | 50 (16.6) | 160 (16.6) | |
Child characteristics | 808 | 1437 | 137 | 309 | 991 | |
Gender missing=49 | ||||||
Female | 400 (50.1) | 670 (48.0) | ns | 51 (37.5)** | 151 (49.7) | 468 (48.9) |
Male | 399 (49.9) | 727 (52.0) | 85 (62.5) | 153 (50.3) | 489 (51.1) |
*p<0.05, **p<0.01, ***p<0.0001, ns = not significant.
Maternal and household n=2222; children n=2245 (includes twins).
†Each dropout participant group compared with the complete participant group as the reference, using χ2 test.
These baseline characteristic differences were similar for two of the three attrition groups, lost to follow-up and partial/non-responders. However, those families in the withdrawn group were more similar in baseline characteristics to the families in the complete follow-up group, with the exception of maternal age, marital status and gross household income (table 1). Of note, families in the withdrawn group were more likely to have male children than those families who fully participated (p<0.01).
Prevalence rates of childhood injury by attrition status
The total cohort had an overall child injury rate of 2.59/10 PYs, similar to the child injury rate for those families with complete follow-up (2.60/10 PYs) and those families who were lost to follow-up or partial/non-responders (table 2). However, families in the withdrawn group had significantly higher rates of child injury compared with families with complete follow-up with an unadjusted relative risk (RR)=1.32 (95% CI 1.03 to 1.68). While there was little reduction in the effect size after adjusting for covariates, the difference in child injury rates for families who withdrew compared with families with complete follow-up was no longer statistically significant, with an adjusted RR=1.24 (95% CI 0.96 to 1.58) (table 2).
Table 2.
Child participants | N (%) | Sum injuries | Injury rates/10 PYs | Unadjusted RR (95% CI) | Adjusted RR (95% CI)† |
Total cohort | 2245 (100.0) | 941 | 2.59 | 0.99 (0.88 to 1.13) | 0.97 (0.85 to 1.10) |
Complete follow-up | 808 (36.0) | 340 | 2.60 | Reference | Reference |
Incomplete follow-up | 1437 (64.0) | 601 | 2.58 | 0.99 (0.87 to 1.13) | 0.95 (0.82 to 1.09) |
Withdrawn | 137 (6.1) | 80 | 3.48 | 1.32 (1.03 to 1.68) * | 1.24 (0.96 to 1.58) |
Lost to follow-up | 309 (13.8) | 128 | 2.38 | 0.89 (0.72 to 1.09) | 0.81 (0.65 to 1.01) |
Partial/non-responders | 991 (44.1) | 393 | 2.51 | 0.98 (0.85 to 1.13) | 0.94 (0.81 to 1.10) |
Poisson regression, offset=log of length of follow-up time of child.
*p<0.05.
†Adjusted for maternal age, maternal education, maternal smoking status, number of children in household, family marital status, household income, housing status and child gender in two-step model. Final model included maternal age, maternal education level and child gender.
Relationship between baseline sociodemographic variables (exposure) and child injury episodes (outcome) by attrition status
For the total cohort, rates of child injury increased as the level of maternal education, maternal age and gross household income decreased (table 3). Rates of child injury were higher in single parent households, families living in government or boarding house accommodation and for families with male children. This distribution was similarly reflected in both the families with complete follow-up and those lost to attrition, with no statistically significant differences in the demographically stratified rates of child injury when comparing overall attrition status (p>0.05) (table 3).
Table 3.
Total cohort/10 PYs | Follow-up | p-value | Attrition by type † | ||||
Complete/10 PYs | Incomplete/10 PYs | Withdrawn/10 PYs | Lost to follow-up/10 PYs | Partial/non- responders/10 PYs | |||
Maternal characteristics | |||||||
Maternal level of education missing=12 | |||||||
Tertiary degree | 2.03 | 2.12 | 1.92 | ns | 3.06* | 1.72 | 1.74 |
Secondary school or trade | 2.70 | 2.84 | 2.62 | 3.66 | 2.28 | 2.57 | |
Not complete school | 2.73 | 2.43 | 2.83 | 3.23 | 2.87 | 2.77 | |
Maternal age missing=3 | |||||||
35+ years | 2.02 | 1.83 | 2.18 | ns | 2.83 | 2.76 | 1.91 |
30–34 years | 2.67 | 2.86 | 2.52 | 3.93 | 1.50 | 2.55 | |
25–29 years | 2.27 | 2.35 | 2.23 | 2.93 | 1.69 | 2.27 | |
<25 years | 3.27 | 3.76 | 3.14 | 4.12 | 3.13 | 3.04 | |
Smoking during pregnancy missing=9 | |||||||
No | 2.49 | 2.48 | 2.50 | ns | 3.25* | 2.55 | 2.36 |
Yes | 2.84 | 3.10 | 2.75 | 4.17 | 2.15 | 2.84 | |
Household characteristics | |||||||
Marital status missing=18 | |||||||
Two parent family | 2.45 | 2.53 | 2.41 | ns | 3.36 | 2.33 | 2.30 |
Sole parent family | 3.44 | 3.47 | 3.43 | 3.90 | 2.66 | 3.64 | |
Other children in household missing=14 | |||||||
None | 2.70 | 2.84 | 2.61 | ns | 3.09* | 2.66 | 2.51 |
1–2 children | 2.52 | 2.51 | 2.53 | 3.50 | 2.30 | 2.46 | |
3+ children | 2.52 | 1.99 | 2.68 | 4.21 | 2.17 | 2.73 | |
Household income‡ missing=354 | |||||||
Highest quintile | 2.03 | 2.42 | 1.67 | ns | 2.04 | 1.52 | 1.65 |
Fourth quintile | 2.55 | 2.93 | 2.27 | 2.79 | 1.95 | 2.24 | |
Third quintile | 2.45 | 2.33 | 2.55 | 3.57 | 2.92 | 2.27 | |
Second quintile | 2.54 | 2.32 | 2.65 | 3.74 | 2.32 | 2.66 | |
Lowest quintile | 2.92 | 3.53 | 2.74 | 3.15 | 2.36 | 2.85 | |
Home ownership missing=20 | |||||||
Own/mortgage | 2.41 | 2.53 | 2.29 | ns | 3.68* | 1.47 | 2.16 |
Rent | 2.46 | 2.48 | 2.46 | 2.54 | 2.50 | 2.43 | |
Government/boarding house | 3.55 | 3.39 | 3.60 | 5.85 | 2.93 | 3.57 | |
Child characteristics | |||||||
Gender missing=49 | |||||||
Female | 2.18 | 2.21 | 2.16 | ns | 3.13* | 1.89 | 2.15 |
Male | 3.00 | 2.87 | 3.07 | 3.74 | 2.93 | 2.99 |
Children n=2245 (includes twins).
†Each dropout participant group compared with the complete participant group as the reference, using univariate Poisson regression with time offset.
*p<0.05, ns = not significant.
‡Gross annual household incomes standardised in AUD$2010 values.
While the direction of the demographically stratified child injury rates was similar for all attrition groups, the magnitude of this difference was greater in the withdrawn group for a number of baseline factors. Child injury rates for families that withdrew were higher than families with complete follow-up for all baseline factors, but most pronounced with respect to child gender, maternal education, maternal smoking, number of children living in the household and home ownership (p<0.05).
Discussion
This study examined the impact of attrition in child injury research on the prevalence of child injury outcomes and the association between sociodemographic variables and injury (exposure-outcome relationship). There were three key findings. First, participants with incomplete follow-up, as a group, differed from complete responders across most sociodemographic characteristics and there were also differences in these characteristics across attrition types (within the incomplete follow-up group). Second, despite these systematic differences, there were no statistically significant impacts on the overall injury prevalence estimates and risk factor effect estimates, with the exception of the withdrawn group. The withdrawn group differed least, however, from the complete group in sociodemographic characteristics. Third, despite these systemic differences, the direction and nature of the sociodemographic-child injury relationship did not vary across the attrition groups.
The finding that there were systemic differences in the baseline social, maternal and household characteristics of those with complete follow-up versus those lost to attrition is consistent with previous research in this area, with families categorised as having lower socioeconomic status (SES) less likely to remain as active participants in the study.3 5 9 10 Systematic differences in attrition were also evident across attrition groups, as has been demonstrated in previous research.9 10 Interestingly, while families from the ‘lost to follow-up’ and ‘partial/non-responder’ groups were more likely to be of lower socioeconomic status, our withdrawn participants were more similar in their baseline characteristics to participants with complete follow-up. It may be that families of higher sociodemographic status felt more confident to actively withdraw from this research or that there are unmeasured characteristics in these families that lead to higher rates of withdrawal. Further research is warranted given the relatively small sample size of this withdrawn group.
Consistent with previous research,3 4 27 it was anticipated that systemic differences in attrition would result in differences in the prevalence of child injury outcomes across our groups (attrition-child injury), particularly as the relationship between lower sociodemographic factors and child injury has been well documented in the literature.28–31 However, there were no statistically significant impacts of this differential attrition on the overall injury prevalence estimates and risk factor effect estimates. The only apparent suggestion of difference (although with the observed stable effect losing significance with inclusion of multiple variables in the model) was in the withdrawn group, despite this group differing least from the ‘complete follow-up’ group for all measures of SES, compared with the other attrition categories. It should be noted that the withdrawn group was small in absolute terms (ie, <5% of the total cohort) and had the group been larger, it could have impacted on the overall prevalence of the outcome. This suggests that reasons for attrition may need to be examined in epidemiological research, as a high proportion of withdrawals may indicate the potential for bias.
Importantly, the systemic differences in attrition also had little impact on the direction of the relationship between sociodemographic factors and child injury, with lower socioeconomic status families having higher rates of injury across all the attrition groups. This finding adds to a growing body of evidence indicating that selective attrition does not necessarily impact the relationship between a range of exposure-outcome measures.3 4 10 With follow-up rates for cohort studies regularly being reported in the 30%–70% range,32 it is promising that these results suggest exposure-outcome findings from studies with large amounts of attrition may not need to be interpreted with the degree of caution currently applied.
A key strength of this study was the mixed active-passive nature of the cohort design. Prior to the first published results from this study, there had been few birth cohort studies that employed data linkage methodology in combination with active participant contact (survey follow-up) to ascertain comprehensive exposure and outcome data. Thus, this is one of the few studies capable of examining with rigour the questions we have addressed in this research. The key limitation is the relatively small sample size. This was particularly evident in relation to our main finding that it was in the group of participants who withdrew from the study that we found a bias. The adjusted model removed significance of the main finding even though there was little change in the point estimate (from 1.32 to 1.24) and could represent a type 2 error. The second limitation relates to the nature of the study sample, the determinant variables measured, the injury outcome focus of the analysis and the follow-up protocols used. While the research results could be expected to apply to similar circumstances, it is unknown how validly the study findings should be applied beyond the study group we examined. It is possible that in studies with a demanding follow-up protocol, participant attrition may be higher in people whose determinant variables are already affecting outcomes, and hence in these studies attrition may be more likely to be associated with a biased effect estimate.
The findings of this research provide support for relaxing one of the most challenging expectations of epidemiological studies, that is, complete participant follow-up. Even with considerable attrition, when proportions of participants who withdraw from the study are low, attrition bias is unlikely to affect either the population prevalence estimate or measures of association with key study variables.
bmjopen-2016-015584supp001.docx (41KB, docx)
Supplementary Material
Acknowledgments
The research reported in this publication is part of the Environments for Healthy Living (EFHL): Griffith Study of Population Health (Australian and New Zealand Clinical Trials Registry: ACTRN12610000931077). The EFHL project was conceived by Professor Rod McClure, Dr Cate Cameron, Professor Judy Searle and Professor Ronan Lyons. We are thankful for the contributions of the Project Manager, Rani Scott, and the current and past Database Managers. We gratefully acknowledge the administrative staff, research staff and the hospital antenatal and birth suite midwives of the participating hospitals for their valuable contributions to the study, in addition to the expert advice provided by Research Investigators throughout the project.
Footnotes
Contributors: CMC, JMO, ABS, TMD, NS, RJM were involved in study concept and design. JMO and CMC acquired the data. CMC conducted the data management and analysis. CMC, JMO and RJM were all involved in interpretation of data wrote the first draft of the manuscript. CMC, JMO, ABS, TMD, NS, RJM contributed to the critical revision of the manuscript and approved the final version.
Competing interests: None declared.
Patient consent: Obtained from Guardian.
Ethics approval: Griffith University, University of Queensland, The Logan Hospital, Gold Coast Hospital.
Provenance and peer review: Not commissioned; externally peer reviewed.
Data sharing statement: No additional data available.
References
- 1. Western M, Haynes M, Baffour B, et al. ; The case for a New Australian Birth Cohort Study: institute for Social Science Research, University of Queensland, 2015. [Google Scholar]
- 2. Ahern K, Le Brocque R. Methodological Issues in the effects of Attrition: simple Solutions for Social scientists. Field methods 2005;17:53–69. 10.1177/1525822X04271006 [DOI] [Google Scholar]
- 3. Wolke D, Waylen A, Samara M, et al. Selective drop-out in longitudinal studies and non-biased prediction of behaviour disorders. Br J Psychiatry 2009;195:249–56. 10.1192/bjp.bp.108.053751 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Osler M, Kriegbaum M, Christensen U, et al. Loss to follow up did not Bias associations between early life factors and adult depression. J Clin Epidemiol 2008;61:958–63. 10.1016/j.jclinepi.2007.11.005 [DOI] [PubMed] [Google Scholar]
- 5. Greene N, Greenland S, Olsen J, et al. Estimating Bias from loss to follow-up in the Danish National Birth Cohort. Epidemiology 2011;22:1–22. 10.1097/EDE.0b013e31822939fd [DOI] [PubMed] [Google Scholar]
- 6. Ng SK, Scott R, Scuffham PA. Contactable Non-responders show different characteristics compared to lost to Follow-Up participants: insights from an australian longitudinal birth Cohort Study. Matern Child Health J 2016;20:1472–84. 10.1007/s10995-016-1946-8 [DOI] [PubMed] [Google Scholar]
- 7. Howe LD, Tilling K, Galobardes B, et al. Loss to follow-up in cohort studies: bias in estimates of socioeconomic inequalities. Epidemiology 2013;24:1–9. 10.1097/EDE.0b013e31827623b1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Aylward GP, Hatcher RP, Stripp B, et al. Who Goes and who stays: subject loss in a multicenter, longitudinal follow-up study. J Dev Behav Pediatr 1985;6:3–8. [PubMed] [Google Scholar]
- 9. Young AF, Powers JR, Bell SL. Attrition in longitudinal studies: who do you lose? Aust NZJ Public Health 2006;30:353–61. 10.1111/j.1467-842X.2006.tb00849.x [DOI] [PubMed] [Google Scholar]
- 10. Powers J, Tavener M, Graves A, et al. Loss to follow-up was used to estimate Bias in a longitudinal study: a new approach. J Clin Epidemiol 2015;68:870–6. 10.1016/j.jclinepi.2015.01.010 [DOI] [PubMed] [Google Scholar]
- 11. Cornish R, Tilling K, Boyd A, et al. Using linkage to electronic primary care records to evaluate recruitment and nonresponse Bias in the Avon Longitudinal Study of Parents and Children. Epidemiology 2015;26:e41–2. 10.1097/EDE.0000000000000288 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Jourdan C, Bayen E, Bahrami S, et al. Loss to Follow-Up and Social Background in an Inception Cohort of Patients with severe traumatic brain Injury: results from the PariS-TBI Study. J Head Trauma Rehabil 2016;31:E42–E48. 10.1097/HTR.0000000000000147 [DOI] [PubMed] [Google Scholar]
- 13. Tin Tin S, Woodward A, Ameratunga S. Estimating Bias from loss to follow-up in a prospective cohort study of bicycle crash injuries. Inj Prev 2014;20:322–9. 10.1136/injuryprev-2013-040997 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Holavanahalli RK, Lezotte DC, Hayes MP, et al. Profile of patients lost to follow-up in the Burn Injury Rehabilitation Model Systems' longitudinal database. J Burn Care Res 2006;27:703–12. 10.1097/01.BCR.0000238085.87863.81 [DOI] [PubMed] [Google Scholar]
- 15. Blaha RZ, Arnett AB, Kirkwood MW, et al. Factors influencing attrition in a multisite, randomized, clinical trial following traumatic brain injury in adolescence. J Head Trauma Rehabil 2015;30:E33–40. 10.1097/HTR.0000000000000059 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Corrigan JD, Harrison-Felix C, Bogner J, et al. Systematic Bias in traumatic brain injury outcome studies because of loss to follow-up. Arch Phys Med Rehabil 2003;84:153–60. 10.1053/apmr.2003.50093 [DOI] [PubMed] [Google Scholar]
- 17. Gabbe BJ, Cleland H, Watterson DM, et al. Long term outcomes data for the Burns Registry of Australia and New Zealand: is it feasible? Burns 2015;41:1732–40. 10.1016/j.burns.2015.09.005 [DOI] [PubMed] [Google Scholar]
- 18. Wade SL, Gerry Taylor H, Yeates KO, et al. Long-term parental and family adaptation following pediatric brain injury. J Pediatr Psychol 2006;31:1072–83. 10.1093/jpepsy/jsj077 [DOI] [PubMed] [Google Scholar]
- 19. Krellman JW, Kolakowsky-Hayner SA, Spielman L, et al. Predictors of follow-up completeness in longitudinal research on traumatic brain injury: findings from the National Institute on Disability and Rehabilitation Research traumatic brain injury model systems program. Arch Phys Med Rehabil 2014;95:633–41. 10.1016/j.apmr.2013.10.016 [DOI] [PubMed] [Google Scholar]
- 20. de Groot S, Haisma JA, Post MW, et al. Investigation of Bias due to loss of participants in a dutch multicentre prospective spinal cord injury cohort study. J Rehabil Med 2009;41:382–9. 10.2340/16501977-0346 [DOI] [PubMed] [Google Scholar]
- 21. Neuner B, Fleming M, Born R, et al. Predictors of loss to follow-up in young patients with minor trauma after screening and written intervention for alcohol in an urban emergency department. J Stud Alcohol Drugs 2007;68:133–40. 10.15288/jsad.2007.68.133 [DOI] [PubMed] [Google Scholar]
- 22. C-ya L, DeMatteo CA. The challenges of loss to Follow-up in Longitudinal Pediatric acquired brain Injury Research: one research Team’s Experiences, 2013. [Google Scholar]
- 23. Calderwood L, Lessof C. Enhancing longitudinal surveys by linking to administrative data. Methodology of longitudinal surveys 2009:55–72. [Google Scholar]
- 24. Cameron CM, Scuffham PA, Spinks A, et al. Environments for healthy living (EFHL) Griffith birth cohort study: background and methods. Matern Child Health J 2012;16:1896–905. 10.1007/s10995-011-0940-4 [DOI] [PubMed] [Google Scholar]
- 25. Cameron CM, Scuffham PA, Shibl R, et al. Environments for healthy living (EFHL) Griffith birth cohort study: characteristics of sample and profile of antenatal exposures. BMC Public Health 2012;12:1080 10.1186/1471-2458-12-1080 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Queensland Sof. Queensland Health). Queensland Data Linkage Framework. Brisbane 2016. [Google Scholar]
- 27. Mars B, Cornish R, Heron J, et al. Using data linkage to investigate Inconsistent Reporting of Self-Harm and Questionnaire Non-Response. Arch Suicide Res 2016;20:113–41. 10.1080/13811118.2015.1033121 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Hong J, Lee B, Ha EH, et al. Parental socioeconomic status and unintentional injury deaths in early childhood: consideration of injury mechanisms, age at death, and gender. Accident Analysis & Prevention 2010;42:313–9. 10.1016/j.aap.2009.08.010 [DOI] [PubMed] [Google Scholar]
- 29. Kim MH, Subramanian SV, Kawachi I, et al. Association between childhood fatal injuries and socioeconomic position at individual and area levels: a multilevel study. J Epidemiol Community Health 2007;61:135–40. 10.1136/jech.2006.047738 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Laursen B, Nielsen JW. Influence of sociodemographic factors on the risk of unintentional childhood home injuries. Eur J Public Health 2008;18:366–70. 10.1093/eurpub/ckn034 [DOI] [PubMed] [Google Scholar]
- 31. Myhre MC, Thoresen S, Grøgaard JB, et al. Familial factors and child characteristics as predictors of injuries in toddlers: a prospective cohort study: table 1. BMJ Open 2012;2:e000740 10.1136/bmjopen-2011-000740 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Gustavson K, von Soest T, Karevold E, et al. Attrition and generalizability in longitudinal studies: findings from a 15-year population-based study and a Monte Carlo simulation study. BMC Public Health 2012;12:918 10.1186/1471-2458-12-918 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
bmjopen-2016-015584supp001.docx (41KB, docx)