Abstract
Ever since the discovery of APOE ε4 around 25 years ago researchers have been excited about the potential of a blood test for Alzheimer's disease (AD). Since then researchers have looked for genetic, protein, metabolite and/or gene expression markers of AD and related phenotypes. However, no blood test for AD is yet being used in the clinical setting. We first review the trends and challenges in AD blood biomarker research, before giving our personal recommendations to help researchers overcome these challenges. While some degree of consistency and replication has been seen across independent studies, several high-profile studies have seemingly failed to replicate. Partly due to academic incentives there is a reluctance in the field to report predictive ability, to publish negative findings and to independently replicate the work of others. If this can be addressed then we will know sooner whether a blood test for AD or related phenotypes with clinical utility can be developed.
Keywords: Alzheimer disease, blood tests, cohort studies, research design, data reporting, genetics, blood proteins, gene expression, metabolomics
Progress
The identification of genetic markers such as APOE ε4 arguably represented the first step change in progress towards a blood test for late onset Alzheimer's disease (AD), as genetic markers can be measured from blood samples [1]. Since then 19 other significant markers of AD have been identified by a Genome-Wide Association Study (GWAS) [2]. These markers have been combined into a polygenic risk score with ~80k more weakly associated genetic markers achieving an Area Under the Curve (AUC) of 78% for prediction of AD. This compares with 72% achievable with just age, sex and APOE ε4 (the `co-variate only' model) [3]. In a smaller recent study (N ~ 1,600) some of the same authors have shown that the same risk score has an AUC of 84% for predicting pathologically confirmed cases [4], which if confirmed in larger studies may have enough clinical utility to justify the use of genome-wide genotyping in the clinic to aid diagnosis. Even an AUC of 78%, if validated further, may have some utility for recruitment of higher risk individuals to prevention trials [5]. Another promising approach has been to develop polygenic hazard scores to predict age of dementia onset, although the utility of this for the clinic or for enriching trials has not yet been fully assessed [6].
AD polygenic risk scores have also been shown to be associated with brain atrophy [7,8] and cerebrospinal fluid (CSF) amyloid beta [6,7], but to the best of our knowledge they have not yet been shown to be predictive of these phenotypes. In fact, two independent studies (N = 657 and N = 242) have shown that this genetic risk score does not appear to be predictive of amyloid or tau pathology measured from CSF [9,10]. A preliminary report that the polygenic hazard score is better able to predict elevated brain amyloid does not appear to test whether it improves upon a model using age, gender and APOE alone mean its clinical utility is uncertain [11]. If the negative findings are correct then this would be consistent with the idea that biomarkers must be optimised to specific use contexts, e.g. useful markers of AD diagnosis may differ from markers of pathology or progression. For this reason the literature has broadened out from the case-control design to encompass endophenotype designs which look for markers of brain atrophy [12], CSF pathology [13] or cognitive decline [14]. However, it should be noted these findings do not yet seem to have translated into accurate prediction models [15].
Following on from the early genetic work, proteomic researchers joined the search for blood biomarkers of AD. In a clear parallel to earlier genetics research two main approaches were taken: ‘candidate’ studies and ‘discovery’ studies. The AlzBiomarker project has performed a meta-analysis of association studies of ‘candidate’ blood (and CSF) markers with AD, showing that of all candidates in blood, total-tau appears to be the most associated with AD [16]. Predictive results from the use of blood total-tau are only provided for individual small-scale studies, so the reported sensitivity and specificity for predicting AD diagnosis is likely to be overly optimistic, e.g. 97% specificity and 91% sensitivity in Chiu et al., [17].
We have reviewed the 21 ‘discovery’ blood protein studies published between 2002 - 2014, using a wide-range of proteomics techniques and AD-related outcomes. A low consistency of biomarkers identified between studies was observed, but four candidate biomarkers were observed in studies utilizing five independent research cohorts: α-1-antitrypsin, α-2-macroglobulin, apolipoprotein E, and complement C3. When examined in a new dataset these proteins, when combined with age, sex and presence of APOE ε4, had an AUC of 82% for predicting AD diagnosis, versus 79% for the co-variate only model [18]. While superficially better than the genetic risk scores AUC (78% versus 82%), it should be noted that the genetic result is more trustworthy as it comes from a much larger study and shows a greater difference in predictive ability between co-variate only and biomarker models.
In another parallel to genetic research, proteomic studies have also looked-for biomarkers of endophenotypes of AD, which have been reviewed in Baird et al., [19] who see promise in this area, but acknowledge limited success in identification of a reproducible signature. More recently, Ovod et al., [20] have developed an assay for plasma amyloid beta 42 and 40, and used it to show that the ratio has an AUC of 89% for predicting amyloid positivity in a small cohort (N = 41). However, this work has not yet been replicated in independent cohorts.
Blood metabolite studies of AD are more novel, with high profile papers by Mapstone et al., [21] and Proitsi et al., [22]. Mapstone et al., [23] identified ten lipids which could predict conversion from Mild Cognitive Impairment to AD over 2 - 3 years with an AUC of 92%. Proitsi et al., [22] identified 24 metabolites which had an AUC of 71% for predicting AD diagnosis, in a considerably larger study (N = 277 versus N = 85).
Gene expression has also been explored as a potential source of blood biomarkers for AD. Little consistency has been seen in the genes selected by these various studies, leading Han et al., [23] to suggest that greater concordance might be seen at the pathway level. We demonstrated in Voyle et al., [24] a failure to replicate classifiers between independent sample sets, and that simple pathway level summaries of gene expression are no more predictive of AD. Endophenotype approaches have been explored in gene expression studies as well, for example Lunnon et al., [25] show that gene expression is predictive of brain atrophy. However, this work has not yet been replicated in independent cohorts.
More recently, we have been attempting to combine different modalities of biomarker to improve predictive ability. In Voyle et al., [26] we found that five metabolites could be used to predict amyloid positive individuals with 72% accuracy, rising to 79% when combined with levels of the protein fibrinogen gamma. This study was limited in size and requires replication in independent samples. In a similar vein, we showed a marginal improvement in the prediction of CSF amyloid beta levels using genetic risk from the large AD GWAS [2,3] and plasma tau levels (AUC 67% versus 66% for co-variate only model) [9]. Such a small improvement may be artefactual, and even if true is not likely to be useful by itself. Studies seeking to find multi-modal AD blood biomarkers and/or biomarkers of endophenotypes are likely to become more common but have been held back by the sample sizes available, which is a focus for improvement going forward.
Challenges and recommendations
Further progress towards a clinically useful blood test will be slow unless we acknowledge and learn from the limitations of our current approaches, therefore what follows is our personal view on key challenges and important recommendations for future AD blood biomarker research. We draw attention to limitations of existing studies not to dismiss them, but to point out room for improvement in the field and in our own research. What follows will seem obvious to some, but needs to be highlighted as a counter-point to the over-optimism of the field.
The quality of experimental design in this field is variable, although this has been improving over time. One of the most obvious aspects of this is in sample size of non-genetic AD biomarker discovery studies, for example a study seeking to find blood protein AD biomarkers in 2002 used only 18 research participants [27]. Ten years later Doecke et al., [28] achieved N ~ 1000. This seemed a positive trend, but unfortunately, we are not aware of any larger studies published in the five years that followed. In fact, many smaller-scale studies (N ~ 100) are still published, e.g. [20,21,26,29,30]. We should learn from the field of genetics, where small-scale candidate gene studies were plagued with replication issues [31] that were only solved by larger sample size and unbiased approaches. We recommend that this is tackled, in part, using samples from larger cohorts, such as UK Biobank [32] and the Precision Medicine Initiative [33], as well as cohort consortia such as the European Medical Information Framework - AD (http://www.emif.eu/) and Dementia Platform UK (http://www.dementiasplatform.uk).
Additional design considerations involve the appropriateness of the population used, this should be guided by the anticipated context of use of the potential blood tests. Most studies have sought to find a blood test that could be helpful in the diagnosis of Alzheimer's disease, however none have yet been performed in the primary healthcare population in which it would have greatest utility [34]. This has been in large part due to the challenges of recruiting research participants, and the priority given to large scale recruitment rather than to representativeness of populations relative to anticipated context-of-use. Similarly, studies have sought to find AD markers that could be used to identify asymptomatic patients with early signs of AD, but very few have been performed in that population [35,36,37,38]. Another problem of unrepresentative sample populations is that they may not reflect the prevalence of AD related phenotypes (e.g. amyloid positivity) in populations appropriate to the anticipated context-of-use, which could inflate the positive and negative predictive values.
Partly due to the history of this field, which was initially led by clinicians and lab biologists, the level of statistical rigour is understandably variable. One example is the focus of many papers on p-values instead of predictive ability (sensitivity/specificity/positive predictive value etc). Significant p-values, even if replicable, do not necessarily mean that a biomarker is useful for predicting AD. To do so requires a suitably large effect size and a good understanding of confounding factors (e.g. age, gender, APOE ε4, medication use). For readers who may struggle to interpret and understand predictive measures we heartily recommend Tze-Wey Loong's excellent visual explanation [39].
A critically important consideration is cross-validation, i.e. the assessment of predictive models in additional data not used in its construction. The data used in model construction is referred to as the training or in-sample dataset, whereas the independent data used for assessment is called the test or extra-sample data. It is important that predictive performance is reported from the test data, as results from training data can be artefactually better due to overfitting to noise [40]. This is equivalent in importance to blinding in clinical trials in the sense that it helps to protect results from the preconceptions of the researcher. This can be a major concern in studies which report predictive accuracy in training sets only, i.e. where no cross-validation has been performed. In some studies, k-fold cross-validation has been performed, in which the training data is repeatedly split into different training and test subsets and average performance in the test sets given. This is better than no cross-validation, but can still give overly generous predictive performance due to the train and test datasets sharing the same systematic noise, especially in small datasets and when k < 10 [41].
Despite seemingly performing cross-validation, researchers can often subconsciously and artefactually inflate predictive performance. The two most common examples of this are (1) when both train and test data are used for variable selection, for example when variables are selected for model inclusion by ranking the p-values from repeated univariate tests using all available data (and therefore not correctly holding out test data), and (2) by performing correct cross-validation, finding a poor result which does not get reported, and then trying a new model (new variables, or new modelling approach or formula), only reporting models which have good predictive performance. The former mistake is one that we have made ourselves in Kiddle et al., [29], the latter mistake is probably the most common which is evidenced in part by the relative absence of negative results published in this area. The absence of negative results is a form of reporting bias, leading to an overly optimistic impression of this field in the literature. While no evidence of reporting bias for ‘candidate’ blood protein markers of AD is seen in the AlzBiomarker meta-analysis, in the context of discovery and replication of blood biomarker panels we are only aware of the following relevant papers showing negative results, some emerging from our own group [9,24], some consistent with our negative findings [10], and others showing limited [42,43] or no replication [44] of previous high-profile positive results [21,45] when examined by independent researchers. It seems implausible that negative results are truly rare in AD blood biomarker research.
In terms of root causes, we are generally dis-incentivised to share negative or disappointing results, even if they are correct. However, the examples provided above demonstrate that it is simply not true that journals refuse to publish negative results, although we have certainly argued against peer reviewers who fail to see the value in doing so. In a worst case scenario negative results can be published on pre-print servers such as bioRxiv (http://biorxiv.org/). It may be worthwhile for the community to come together to generate a pre-registration platform, and to persuade journals to only publish pre-registered AD biomarker studies, so that groups failing to publish negative findings can be pressured into doing so. While not explicitly focused on blood markers, a very positive move in this direction has been challenges in which predictions are submitted before test data is released, notably the AD DREAM challenge [15], which generated a negative result, and the TADPOLE challenge which is currently open for submission until November 2017 (https://tadpole.grand-challenge.org/).
Another cause could be confirmation bias, the psychological phenomena where individuals seek out evidence that matches their pre-conceived ideas [46]. This can mean that if a researcher believes a blood marker of AD will be found they may disregard evidence to the contrary. This can lead to Hypothesising After Results Are Known, researchers testing many different hypothesis (or models) reporting only the positive results and ignoring the multiple testing problem relating to all the unreported negative results [47]. This has also been called the file drawer effect, meaning that many published results are false positives [48]. John Ioannidis has discussed this in his controversially titled paper – “why most published research findings are false” [49]. While this focuses on p-values, it is likely to apply to studies reporting predictive performance as well. Interestingly, the field of blood biomarkers for AD fits his criteria for risk of high false positive publication rate: (1) small studies, (2) small effect sizes, (3) greater number and lesser pre-selection of tested relationships, (4) flexibility in designs and analyses, (5) financial interests and prejudice, and (6) many teams involved in a scientific field in chase of statistical significance.
The testing of promising biomarkers by independent researchers is essential to progress them towards the clinic [34], however this is rare. This has therefore been a focus of our research [18,50]. Where this have been done for two high-profile studies [21,45] they show a complete failure to replicate [44] or a significantly less promising performance [42,43]. Less promising replications fit with the findings of John Ioannidis [51] who has studied the reasons that discovery results are typically inflated. In terms of clinical utility a high predictive performance is required, meaning that partial replications in well-designed studies almost always rule out the clinical utility of the marker.
Given all the above it is healthy to be sceptical about the potential of biomarkers in the current literature. It is a safe bet to assume that existing AD biomarker candidates at the very least require further validation and at worst are non-replicable, or are not of sufficiently high performance for clinical use. We hope this changes soon, but we believe that our recommendations may help the field to achieve this sooner.
While it is tempting to think that these problems may plague all blood biomarker research, this can be disproved by the example of blood protein markers of aging. Using proteomics techniques that have also been used in AD research we have discovered plasma proteins correlating highly with chronological age, that replicate strongly in an independent cohort [52], in another cohort in serum [50] and in at least two studies by independent researchers [53,54]. This shows that failures to replicate AD biomarkers cannot solely be blamed on the measurement technologies used, but it is certainly true that if a novel technology was able to detect a stronger AD related signal then it is more likely to be replicated.
Given the large reproducibility crisis in science [49], and in this field specifically, how do we improve the way we do research to increase the chance that a reproducible blood test can be found? John Ioannidis has provided general recommendations for researchers to improve the chance of true positive findings that we think are relevant: “large-scale collaborative research, replication culture, registration, sharing, reproducibility practices, better statistical methods, standardization of definitions and analyses, more appropriate (usually more stringent) statistical thresholds, and improvement in study design standards, peer review, reporting and dissemination of research, and training of the scientific workforce” [55]. Other recommendations from John Ioannidis regard institutional changes affecting the incentives for scientists which would be even harder to achieve.
As highlighted by John Ionnadis sharing and quality of reporting is important, and for that reason we strongly recommend that the field adopt the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) reporting guidelines [56]. Specifically for the reporting of pre-analytical variables, we recommend the advice provided by Sid O'Bryant et al., [57].
In terms of sharing we recommend that raw data should be shared as widely as possible, allowing independent statisticians to verify reproducibility of findings. This has been most successfully achieved through the Alzheimer's Disease Neuroimaging Initiative (https://ida.loni.usc.edu/), but is also being done to some extent by other cohorts including the Australian Imaging and Behaviour Longitudinal study of aging (https://ida.loni.usc.edu/) and AddNeuroMed (https://www.synapse.org/#!Synapse:syn4907804). Clinicians leading large data collection need to be won over by the benefits of data sharing, including by inclusion of appropriate authorships (e.g. of clinical consortia), to counter the recent high-profile accusation that data sharing leads to “research parasites” [58]. While data sharing is done to some extent, the field would be greatly improved by researchers also sharing analysis scripts wherever possible. This would allow errors to be spotted, would make reproducibility straightforward, and would greatly assist junior researchers to develop the coding skills that are increasingly important for modern biomedical research. Code sharing can be facilitated by websites such as GitHub (https://github.com/), which allow both private and public sharing of code, as well as version control. Care must of course be taken not to release data that is sensitive, including within the script itself, but we have shown that this can be achieved [59].
Our final recommendation is that the ultimate aim is prediction models useful in a given clinical context, and that we shouldn't be limiting ourselves to looking for markers in blood. Other variables, derived from routine clinical data such as electronic health records, wearables or cognitive tests may have more promise for this purpose. This is not to say analysis of these datasets is not without its own challenges. Electronic health records represent sparse, incomplete and often subjective representations of the disease state with important data often buried within free text narrative. The opportunities here are vast though, and although the datasets represent secondary use data, the data themselves are much larger and more representative of clinical settings than we are typically used to in blood biomarker studies, albeit less controlled in terms of co-variates and missing data. We have established research programmes in this area and through information and extraction toolkits such as Clinical Records Interactive Search [60], CogStack [61] and the KConnect programme (KConnect.eu), rolled out to multiple hospitals. We have used natural language processing to explore questions that include characterising trajectories of cognitive decline with a specific focus on identifying and validating associations, with medications for example [62]. The ubiquitious use of smartphones and wearables devices provides the opportunity for a more objective, continuous and pervasive phenotype, throughout the disease continuum from at risk, early diagnosis through to post diagnosis engagement, compliance and self-management. Such data provides the opportunity to augment our blood biomarker studies and clinical trials. We have established programmes such as the RADAR-CNS (RADAR-CNS.org), a major goal of which is to develop a generalised real-time streaming platform that will enable active (eg through questionnaires) and passive (eg. accelerometry and heart rate using sensors on wearables and devices) remote monitoring, tracking phenotypes such as function and cognition.
Conclusions
The field of AD blood biomarkers has expanded to include genetic, protein, metabolite and gene expression markers, as well as combinations of the above. While some consistency has been seen across independent studies, several high-profile studies have seemingly failed to replicate. At the same time, there appears to be a strong reporting bias for studies seeking to find a biomarker panel with few negative results published, making the true state of play impossible to assess. Will a clinically useful blood test for AD be developed? It is simply too early to say, but we will have a better chance if we can improve the design, analysis and reporting of studies. Many alternative markers exist within health records, from wearables or innovative cognitive tests, and these should also be explored.
Acknowledgements
SJK was supported by an MRC Career Development Award in Biostatistics (MR/L011859/1). Nicola Voyle is funded by the Alzheimer’s Society.
The authors acknowledge infrastructure support from the National Institute for Health Research (NIHR) Biomedical Research Centre for Mental Health. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.
The research leading to these results has received support from the Innovative Medicines Initiative Joint Undertaking under grant agreement no 115372, resources of which are composed of financial contribution from the European Union's Seventh Framework Programme (FP7-/2007-2013) and in-kind contribution from EFPIA companies.
This work was supported by awards to establish the Farr Institute of Health Informatics Research, London, from the Medical Research Council, Arthritis Research UK, British Heart Foundation, Cancer Research UK, Chief Scientist Office, Economic and Social Research Council, Engineering and Physical Sciences Research Council, National Institute for Health Research, National Institute for Social Care and Health Research, and Wellcome Trust (grant MR/K006584/1).
References
- [1].Corder EH, Saunders AM, Strittmatter WJ, Schmechel DE, Gaskell PC, Small GW, Roses AD, Haines JL, Pericak-Vance MA. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer's disease in late onset families. Science. 1993;261(5123):921–923. doi: 10.1126/science.8346443. [DOI] [PubMed] [Google Scholar]
- [2].Lambert JC, Ibrahim-Verbaas CA, Harold D, Naj AC, Sims R, Bellenguez C, Jun G, DeStefano AL, Bis JC, Beecham GW, Grenier-Boley B, et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat Genet. 2013;45:1452–1458. doi: 10.1038/ng.2802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [3].Escott-Price V, Sims R, Bannister C, Harold D, Vronskaya M, Majounie E, Badarinarayan N, Morgan K, Passmore P, Holmes C, Powell J, et al. Common polygenic variation enhances risk prediction for Alzheimer’s disease. Brain. 2015;138:3673–3684. doi: 10.1093/brain/awv268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Escott-Price V, Myers AJ, Huentelman M, Hardy J. Polygenic risk score analysis of pathologically confirmed Alzheimer disease. Ann Neurol. 2017;82:311–314. doi: 10.1002/ana.24999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Hu Y, Li L, Ehm MG, Bing N, Song K, Nelson MR, Talmud PJ, Hingorani AD, Kumari M, Kivimäki M, Xu CF, et al. The Benefits of Using Genetic Information to Design Prevention Trials. Am J Hum Genet. 2013;92(4):547–557. doi: 10.1016/j.ajhg.2013.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Desikan RS, Fan CC, Wang Y, Schork AJ, Cabral HJ, Cupples LA, Thompson WK, Besser L, Kukull WA, Holland D, Chen CH, et al. Genetic assessment of age-associated Alzheimer disease risk: Development and validation of a polygenic hazard score. PLoS Med. 2013;14(3):e1002258. doi: 10.1371/journal.pmed.1002258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Sabuncu MR, Buckner RL, Smoller JW, Lee PH, Fischl B, Sperling RA, ADNI The association between a polygenic Alzheimer score and cortical thickness in clinically normal subjects. Cereb Cortex. 2012;22(11):2653–2661. doi: 10.1093/cercor/bhr348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [8].Lupton MK, Strike L, Hansell NK, Wen W, Mather KA, Armstrong NJ, Thalamuthu A, McMahon KL, de Zubicaray GI, Assareh AA, Simmons A, et al. The effect of increased genetic risk for Alzheimer's disease on hippocampal and amygdala volume. Neurobiol Aging. 2013;40:68–77. doi: 10.1016/j.neurobiolaging.2015.12.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Voyle N, Patel H, Folarin A, Newhouse S, Johnston C, Visser PJ, Dobson RJ, Kiddle SJ, EDAR, DESCRIPA, ADNI Genetic Risk as a Marker of Amyloid-β and Tau Burden in Cerebrospinal Fluid. J Alzheimers Dis. 2017;55(4):1417–1427. doi: 10.3233/JAD-160707. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Darst BF, Koscik RL, Racine AM, Oh JM, Krause RA, Carlsson CM, Zetterberg H, Blennow K, Christian BT, Bendlin BB, Okonkwo OC, et al. Pathway-Specific Polygenic Risk Scores as Predictors of Amyloid-β Deposition and Cognitive Function in a Sample at Increased Risk for Alzheimer's Disease. J Alzheimers Dis. 2017;55(2):473–484. doi: 10.3233/JAD-160195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Tan CH, Fan CC, Mormino EC, Sugrue LP, Broce IJ, Hess CP, Dillion WP, Bonham LW, Yokoyama JS, Karch CM, Brewer JB, et al. Common polygenic variation enhances risk prediction for Alzheimer’s disease. [Accessed 28 July 2017];bioRxiv. 2017 http://www.biorxiv.org/content/early/2017/07/18/165373, Posted 18 July 2017. [Google Scholar]
- [12].Furney SJ, Simmons A, Breen G, Pedroso I, Lunnon K, Proitsi P, Hodges A, Powell J, Wahlund LO, Kloszewska I, Mecocci P, et al. Genome-wide association with MRI atrophy measures as a quantitative trait locus for Alzheimer's disease. Mol Psychiatry. 2011;16(11):1130–1138. doi: 10.1038/mp.2010.123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Deming Y, Li Z, Kapoor M, Harari O, Del-Aguila JL, Black K, Carrell D, Cai Y, Fernandez MV, Budde J, Ma S, et al. Genome-wide association study identifies four novel loci associated with Alzheimer's endophenotypes and disease modifiers. Acta Neuorpathologica. 2017;133(5):839–856. doi: 10.1007/s00401-017-1685-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Sherva R, Tripodis Y, Bennett DA, Chibnik LB, Crane PK, de Jager PL, Farrer LA, Saykin AJ, Shulman JM, Naj A, Green RC, et al. Genome-wide association study of the rate of cognitive decline in Alzheimer's disease. Alzheimers Dement. 2014;10(1):45–52. doi: 10.1016/j.jalz.2013.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Allen GI, Amoroso N, Anghel C, Balagurusamy V, Bare CJ, Beaton D, Bellotti R, Bennett DA, Boehme KL, Boutros PC, Caberlotto L, et al. Crowdsourced estimation of cognitive decline and resilience in Alzheimer's disease. Alzheimers Dement. 2016;12(6):645–653. doi: 10.1016/j.jalz.2016.02.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Olsson B, Lautner R, Andreasson U, Öhrfelt A, Portelius E, Bjerke M, Hölttä M, Rosén C, Olsson C, Strobel G, Wu E, et al. CSF and blood biomarkers for the diagnosis of Alzheimer's disease: a systematic review and meta-analysis. Lancet Neurol. 2016;15(7):673–684. doi: 10.1016/S1474-4422(16)00070-3. [DOI] [PubMed] [Google Scholar]
- [17].Chiu M, Yang S, Horng H, Yang C, Chen T, Ho C. Combined plasma biomarkers for diagnosing mild cognition impairment and Alzheimer’s disease. ACS Chem Neurosci. 2013;4(12):1530–1536. doi: 10.1021/cn400129p. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Kiddle SJ, Sattlecker M, Proitsi P, Simmons A, Westman E, Bazenet C, Nelson SK, Williams S, Hodges A, Johnston C, Soininen H, et al. Candidate blood proteome markers of Alzheimer's disease onset and progression: a systematic review and replication study. J Alzheimers Dis. 2014;38(3):515–531. doi: 10.3233/JAD-130380. [DOI] [PubMed] [Google Scholar]
- [19].Baird A, Westwood S, Lovestone S. Blood-Based Proteomic Biomarkers of Alzheimer's Disease Pathology. Front Neurol. 2015;6:236. doi: 10.3389/fneur.2015.00236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Ovod V, Ramsey KN, Mawuenyega KG, Bollinger JG, Hicks T, Schneider T, Sullivan M, Paumier K, Holtzman DM, Morris JC, Benzinger T, et al. Amyloid β concentrations and stable isotope labeling kinetics of human plasma specific to central nervous system amyloidosis. Alzheimers Dement. 2017 doi: 10.1016/j.jalz.2017.06.2266. (Ahead of print) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Mapstone M, Cheema AK, Fiandaca MS, Zhong X, Mhyre TR, MacArthur LH, Hall WJ, Fisher SG, Peterson DR, Haley JM, Nazar MD, et al. Plasma phospholipids identify antecedent memory impairment in older adults. Nat Med. 2014;20:415–418. doi: 10.1038/nm.3466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Proitsi P, Kim M, Whiley L, Simmons A, Sattlecker M, Velayudhan L, Lupton MK, Soininen H, Kloszewska I, Mecocci P, Tsolaki M, et al. Association of blood lipids with Alzheimer's disease: A comprehensive lipidomics analysis. Alzheimers Dement. 2017;13(2):140–151. doi: 10.1016/j.jalz.2016.08.003. [DOI] [PubMed] [Google Scholar]
- [23].Han G, Wang J, Zeng F, Feng X, Yu J, Cao HY, Yi X, Zhou H, Jin LW, Duan Y, Wang YJ, et al. Characteristic transformation of blood transcriptome in Alzheimer's disease. J Alzheimers Dis. 2013;35(2):373–386. doi: 10.3233/JAD-121963. [DOI] [PubMed] [Google Scholar]
- [24].Voyle N, Keohane A, Newhouse S, Lunnon K, Johnston C, Soininen H, Kloszewska I, Mecocci P, Tsolaki M, Vellas B, Lovestone S, et al. A Pathway Based Classification Method for Analyzing Gene Expression for Alzheimer's Disease Diagnosis. J Alzheimers Dis. 2016;49(3):659–669. doi: 10.3233/JAD-150440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Lunnon K, Sattlecker M, Furney SJ, Coppola G, Simmons A, Proitsi P, Lupton MK, Lourdusamy A, Johnston C, Soininen H, Kłoszewska I, et al. A blood gene expression marker of early Alzheimer's disease. J Alzheimers Dis. 2012;33(3):737–753. doi: 10.3233/JAD-2012-121363. [DOI] [PubMed] [Google Scholar]
- [26].Voyle N, Kim M, Proitsi P, Ashton NJ, Baird AL, Bazenet C, Hye A, Westwood S, Chung R, Ward M, Rabinovici GD, et al. Blood metabolite markers of neocortical amyloid-β burden: discovery and enrichment using candidate proteins. Trans Psychiatry. 2016;6:e719. doi: 10.1038/tp.2015.205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Choi J, Malakowsky CA, Talent JM, Conrad CC, Gracy RW. Identification of oxidized plasma proteins in Alzheimer's disease. J Biochem Biophys Res Commun. 2002;293(5):566–570. doi: 10.1016/S0006-291X(02)00420-5. [DOI] [PubMed] [Google Scholar]
- [28].Doecke JD, Laws SM, Faux NG, Wilson W, Burnham SC, Lam CP, Mondal A, Bedo J, Bush AI, Brown B, De Ruyck K, et al. Blood-based protein biomarkers for diagnosis of Alzheimer disease. Arch Neurol. 2012;293(5):1318–1325. doi: 10.1001/archneurol.2012.1282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].Kiddle SJ, Thambisetty M, Simmons A, Riddoch-Contreras J, Hye A, Westman E, Pike I, Ward M, Johnston C, Lupton MK, Lunnon K, et al. Plasma based markers of [11C] PiB-PET brain amyloid burden. J Alzheimers Dis. 2012;7:e44260. doi: 10.1371/journal.pone.0044260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [30].Henkel AW, Müller K, Lewczuk P, Müller T, Marcus K, Kornhuber J, Wiltfang J. Multidimensional plasma protein separation technique for identification of potential Alzheimer's disease plasma biomarkers: a pilot study. J Neural Transm (Vienna) 2012;119:779–788. doi: 10.1007/s00702-012-0781-3. [DOI] [PubMed] [Google Scholar]
- [31].Ioannidis JP, Ntzani EE, Trikalinos TA, Contopoulos-Ioannidis DG. Replication validity of genetic association studies. Nat Genet. 2001;29:306–309. doi: 10.1038/ng749. [DOI] [PubMed] [Google Scholar]
- [32].Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, Downey P, Elliott P, Green J, Landray M, Liu B, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):e1001779. doi: 10.1371/journal.pmed.1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Collins F. A new iniative on precision medicine. N Engl J Med. 2015;372:793–795. doi: 10.1056/NEJMp1500523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [34].O'Bryant SE, Mielke MM, Rissman RA, Lista S, Vanderstichele H, Zetterberg H, Lewczuk P, Posner H, Hall J, Johnson L, Fong YL, et al. Blood-based biomarkers in Alzheimer disease: Current state of the science and a novel collaborative paradigm for advancing from discovery to clinic. Alzheimers Dement. 2017;13(1):45–58. doi: 10.1016/j.jalz.2016.09.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Kiddle SJ, Steves CJ, Mehta M, Simmons A, Xu X, Newhouse S, Sattlecker M, Ashton NJ, Bazenet C, Killick R, Adnan J, et al. Plasma protein biomarkers of Alzheimer's disease endophenotypes in asymptomatic older twins: early cognitive decline and regional brain volumes. Trans Psychiatry. 2015;5:e584. doi: 10.1038/tp.2015.78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [36].Thambisetty M, Tripaldi R, Riddoch-Contreras J, Hye A, An Y, Campbell J, Sojkova J, Kinsey A, Lynham S, Zhou Y, Ferrucci L, et al. Proteome-based plasma markers of brain amyloid-β deposition in non-demented older individuals. J Alzheimers Dis. 2010;22(4):1099–1109. doi: 10.3233/JAD-2010-101350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [37].Westwood S, Leoni E, Hye A, Lynham S, Khondoker MR, Ashton NJ, Kiddle SJ, Baird AL, Sainz-Fuertes R, Leung R, Graf J, et al. Blood-Based Biomarker Candidates of Cerebral Amyloid Using PiB PET in Non-Demented Elderly. J Alzheimers Dis. 2016;52(2):561–572. doi: 10.3233/JAD-151155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [38].Ijsselstijn L, Dekker LJ, Stingl C, van der Weiden MM, Hofman A, Kros JM, Koudstaal PJ, Sillevis Smitt PA, Ikram MA, Breteler MM, Luider TM. Serum levels of pregnancy zone protein are elevated in presymptomatic Alzheimer's disease. J Proteome Res. 2011;10(11):4902–4910. doi: 10.1021/pr200270z. [DOI] [PubMed] [Google Scholar]
- [39].Loong TW. Understanding sensitivity and specificity with the right side of the brain. BMJ. 2003;327:716–719. doi: 10.1136/bmj.327.7417.716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [40].Moons KG, Kengne AP, Grobbee DE, Royston P, Vergouwe Y, Altman DG, Woodward M. Risk prediction models: II. External validation, model updating, and impact assessment. Heart. 2012;98(9):691–698. doi: 10.1136/heartjnl-2011-301247. [DOI] [PubMed] [Google Scholar]
- [41].Kohavi R. A study of cross-validation and bootstrap for accuracy estimation and model selection; IJCAI’95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2; Stanford, CA. 1995. pp. 1137–1145. [Google Scholar]
- [42].Marksteiner J, Kemmler G, Weiss EM, Knaus G, Ullrich C, Mechtcheriakov S, Oberbauer H, Auffinger S, Hinterhölzl J, Hinterhuber H, Humpel C. Five out of 16 plasma signaling proteins are enhanced in plasma of patients with mild cognitive impairment and Alzheimer's disease. Neurobiol Aging. 2011;32(3):539–540. doi: 10.1016/j.neurobiolaging.2009.03.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43].Li D, Misialek JR, Boerwinkle E, Gottesman RF, Sharrett AR, Mosley TH, Coresh J, Wruck LM, Knopman DS, Alonso A. Plasma phospholipids and prevalence of mild cognitive impairment and/or dementia in the ARIC Neurocognitive Study (ARIC-NCS) Alzheimers Dement (Amst) 2016;3:73–82. doi: 10.1016/j.dadm.2016.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [44].Casanova R, Varma S, Simpson B, Kim M, An Y, Saldana S, Riveros C, Moscato P, Griswold M, Sonntag D, Wahrheit J, et al. Blood metabolite markers of preclinical Alzheimer's disease in two longitudinally followed cohorts of older individuals. Alzheimers Dement. 2016;12(7):815–822. doi: 10.1016/j.jalz.2015.12.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [45].Ray S, Britschgi M, Herbert C, Takeda-Uchimura Y, Boxer A, Blennow K, Friedman LF, Galasko DR, Jutel M, Karydas A, Kaye JA, et al. Classification and prediction of clinical Alzheimer's diagnosis based on plasma signaling proteins. Nat Med. 2007;13(11):1359–1362. doi: 10.1038/nm1653. [DOI] [PubMed] [Google Scholar]
- [46].Wason P. On the failure to eliminate hypotheses in a conceptual task. Quarterly Journal of Experimental Psychology. 1960;12:129–140. [Google Scholar]
- [47].Kerr N. Hypothesizing After Results are Known. Personality and Social Psychology Bulletin. 1998;2(3):196–217. doi: 10.1207/s15327957pspr0203_4. [DOI] [PubMed] [Google Scholar]
- [48].Rosenthal R. The file drawer problem and tolerance for null results. Psychological Bulletin. 1979;86(3):638–641. [Google Scholar]
- [49].Ioannidis J. Why most published research findings are false. PLoS Med. 2005;2(8):e124. doi: 10.1371/journal.pmed.0020124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [50].Voyle N, Baker D, Burnham SC, Covin A, Zhang Z, Sangurdekar DP, Tan Hehir CA, Bazenet C, Lovestone S, Kiddle S, Dobson RJ, et al. Blood Protein Markers of Neocortical Amyloid-β Burden: A Candidate Study Using SOMAscan Technology. J Alzheimers Dis. 2015;46(4):947–961. doi: 10.3233/JAD-150020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [51].Ioannidis J. Why most discovered true associations are inflated. Epidemiology. 2005;19(5):640–648. doi: 10.1097/EDE.0b013e31818131e7. [DOI] [PubMed] [Google Scholar]
- [52].Menni C, Kiddle SJ, Mangino M, Viñuela A, Psatha M, Steves C, Sattlecker M, Buil A, Newhouse S, Nelson S, Williams S, et al. Circulating Proteomic Signatures of Chronological Age. J Gerontol A Biol Sci Med Sci. 2015;70(7):809–816. doi: 10.1093/gerona/glu121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [53].Di Narzo AF, Telesco SE, Brodmerkel C, Argmann C, Peters LA, Li K, Kidd B, Dudley J, Cho J, Schadt EE, Kasarskis A, et al. High-Throughput Characterization of Blood Serum Proteomics of IBD Patients with Respect to Aging and Genetic Factors. PLoS Genet. 2017;13(1):e1006565. doi: 10.1371/journal.pgen.1006565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [54].Sun BB, Maranville JC, Peters JE, Stacey D, Staley JR, Blackshaw J, Burgess S, Jiang T, Paige E, Surendran P, Oliver-Williams C, et al. Consequences Of Natural Perturbations In The Human Plasma Proteome. [Accessed 28 July 2017];bioRxiv. 2017 http://www.biorxiv.org/content/early/2017/05/05/134551, Posted 5 May 2017. [Google Scholar]
- [55].Ioannidis J. How to make more published research true. PLoS Med. 2014;11(10):e1001747. doi: 10.1371/journal.pmed.1001747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [56].Collins G, Reitsma J, Altman D, Moons K. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med. 2015;162(1):55–63. doi: 10.7326/M14-0697. [DOI] [PubMed] [Google Scholar]
- [57].O'Bryant SE, Gupta V, Henriksen K, Edwards M, Jeromin A, Lista S, Bazenet C, Soares H, Lovestone S, Hampel H, Montine T, et al. Guidelines for the standardization of preanalytic variables for blood-based biomarker studies in Alzheimer's disease research. Alzheimers Dement. 2015;11(5):549–560. doi: 10.1016/j.jalz.2014.08.099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [58].Longo D. Data sharing. N Engl J Med. 2016;374:276–277. doi: 10.1056/NEJMe1516564. [DOI] [PubMed] [Google Scholar]
- [59].Kiddle S, Parodi A, Johnston C, Wallace C, Dobson R. Heterogeneity of cognitive decline in dementia: a failed attempt to take into account variable time-zero severity. [Accessed 28 July 2017];bioRxiv. 2016 http://www.biorxiv.org/content/early/2017/07/11/060830, Posted 11 July 2016. [Google Scholar]
- [60].Perera G, Broadbent M, Callard F, Chang CK, Downs J, Dutta R, Fernandes A, Hayes RD, Henderson M, Jackson R, Jewell A, et al. Cohort profile of the South London and Maudsley NHS Foundation Trust Biomedical Research Centre (SLaM BRC) Case Register: current status and recent enhancement of an Electronic Mental Health Record-derived data resource. BMJ Open. 2016;6(3):e008721. doi: 10.1136/bmjopen-2015-008721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [61].Jackson R, Agrawal A, Lui K, Folarin A, Wu H, Groza T, Roberts A, Gorrell G, Song X, Lewsley D, Northwood D, et al. CogStack - Experiences of Deploying Integrated Information Retrieval And Extraction Services In A Large National Health Service Foundation Trust Hospital. [Accessed 28 July 2017];bioRxiv. 2016 doi: 10.1186/s12911-018-0623-9. http://www.biorxiv.org/content/early/2017/04/02/123299, Posted 2 April 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [62].Baker E, Iqbal E, Johnston C, Broadbent M, Shetty H, Stewart R, Howard R, Newhouse S, Khondoker M, Dobson RJB. Trajectories of dementia-related cognitive decline in a large mental health records derived patient cohort. PLoS One. 2017;12(6):e0178562. doi: 10.1371/journal.pone.0178562. [DOI] [PMC free article] [PubMed] [Google Scholar]
