Citation:
Trotti LM. Twice is nice? Test-retest reliability of the Multiple Sleep Latency Test in the central disorders of hypersomnolence. J Clin Sleep Med. 2020;16(suppl_1):17S–18S.
In 2013, the current author and colleagues reported poor Multiple Sleep Latency Test (MSLT) test-retest reliability in people with noncataplectic hypersomnia disorders (NCHD).1 Half of our participants’ MSLT-based diagnoses changed between MSLTs, driven by changes across the 8-minute sleep latency threshold used to define sleepiness and the 2 sleep-onset REM period (SOREM) threshold used to identify narcolepsy. Before our report, there were concerns about the MSLT,2 but the existing literature suggested adequate MSLT retest reliability in people with narcolepsy (mostly with cataplexy, ie, narcolepsy type 1 [NT1]).3,4
Subsequently, studies have confirmed that MSLT retest reliability is poor in NCHD compared to NT1. Combining data across published studies, repeat MSLT confirms NT1 diagnosis in 91% (68/75) of patients4–6 but only in 45% (26/58) of patients with narcolepsy type 2 (NT2)1,4–7 and in 49% (21/43) of patients with idiopathic hypersomnia.1,5,7 Seventy percent of patients with self-reported sleepiness and a normal first MSLT have a normal second MSLT,1,5,7 and 60% have a normal second MSLT if excluding MSLTs on consecutive days. In patients with NT1, the 8-minute threshold is crossed on only 7% (7/107) of repeat tests, the 2-SOREM threshold on only 10% (12/121) of repeat tests, and both are crossed on 7% (7/107) of repeat tests.4–6,8 For NCHD, the 8-minute threshold is crossed on 35% (63/182) of repeat tests, the 2-SOREM threshold on 28% (66/235) of repeat tests, and both on 24% (41/170) of repeat tests.1,4–9
What explains low retest reliability? Broadly, the possibilities are (1) inadvertent bias introduced by study design, (2) MSLT variability despite stable disease phenotype over time, and (3) disease phenotype variability over time. Many studies of repeat MSLT, including ours,1 were retrospective clinical analyses. Testing may have been repeated because initial results were somehow clinically discordant, leading to a selection bias for repeat testing. However, even when repeat testing is performed routinely, nearly 1 in 5 people with NT2 cross the 2-SOREM threshold at 5 years, significantly more than those with NT1.9
The MSLT has some inherent retest variability because it requires strict adherence to specific protocols2 that are challenging in practice.10 Numerous factors affect the MSLT, not limited to age, sleep during the days and weeks before testing, physical activity, medications, nonprescription substances, and momentary state. This finding may explain the improved retest reliability with shorter retest intervals, eg, consecutive days vs months or years, even in the same patients.7 However, across longer intervals, retest reliability remains poor even when excluding medication changes,1,6 and these other sources of variability should not disproportionately affect those with NCHD.
Finally, these diseases may evolve differently over time. In NT1, once hypocretin cell loss is maximal, symptoms (and the MSLT results) would be expected to remain relatively stable. Because the cause or causes of NCHD remain unknown, it is possible that their underlying pathology shows more fluctuation. Epworth Sleepiness Scale scores vary somewhat more over time in patients with NT2 than in those with NT1.9 It is possible that both sleepiness and REM propensity are fluctuating features of NCHD, making poor retest reliability of these measures inevitable.
Poor MSLT test-retest reliability for NCHD has negative consequences for patients, providers, and health care systems. MSLTs are expensive, in both dollars and time, and repeat tests doubly so. Uncertainty about diagnoses impacts patients’ ability to understand and cope with their chronic disease, stymies clinical research, and undermines disease classification that heavily emphasizes the MSLT. Notably, because treatment may only be approved or covered for those with specific diagnoses, especially NT1 and NT2, the lack of MSLT stability can have profound effects on patient access to medications. If only a single MSLT is performed, then it can feel like a roll of the dice—if ≥ 2 SOREMs appear, then the patient can be treated, otherwise not. Yet idiopathic hypersomnia and NT2 cannot be otherwise distinguished on clinical grounds, making this withholding of medications scientifically unjustifiable.11
The MSLT remains a good, if technically difficult, diagnostic test for NT1, but new paradigms are clearly needed for NCHD. Whether this initiative will involve a change in MSLT analysis parameters,12 novel polysomnography measures,13 cognitive testing,14 or ultimately mechanistic biomarkers, attention to the test-retest reliability of these measures is critical.
DISCLOSURE STATEMENT
Work for this study was supported by the National Institutes of Health (R01 NS111280). This work is solely the responsibility of the author and does not necessarily represent the official views of the funding agency. The author reports no conflicts of interest.
REFERENCES
- 1.Trotti LM, Staab BA, Rye DB. Test-retest reliability of the multiple sleep latency test in narcolepsy without cataplexy and idiopathic hypersomnia. J Clin Sleep Med. 2013;9(8):789–795. 10.5664/jcsm.2922 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Arand D, Bonnet M, Hurwitz T, Mitler M, Rosa R, Sangal RB. The clinical use of the MSLT and MWT. Sleep. 2005;28(1):123–144. 10.1093/sleep/28.1.123 [DOI] [PubMed] [Google Scholar]
- 3.Coelho FM, Georgsson H, Murray BJ. Benefit of repeat multiple sleep latency testing in confirming a possible narcolepsy diagnosis. J Clin Neurophysiol. 2011;28(4):412–414. 10.1097/WNP.0b013e31822734a3 [DOI] [PubMed] [Google Scholar]
- 4.Folkerts M, Rosenthal L, Roehrs T, et al. The reliability of the diagnostic features in patients with narcolepsy. Biol Psychiatry. 1996;40(3):208–214. 10.1016/0006-3223(95)00383-5 [DOI] [PubMed] [Google Scholar]
- 5.Lopez R, Doukkali A, Barateau L, et al. Test-retest reliability of the Multiple Sleep Latency Test in central disorders of hypersomnolence. Sleep. 2017;40(12):zsx164. [DOI] [PubMed] [Google Scholar]
- 6.Ruoff C, Pizza F, Trotti LM, et al. The MSLT is repeatable in narcolepsy type 1 but not narcolepsy type 2: a retrospective patient study. J Clin Sleep Med. 2018;14(1):65–74. 10.5664/jcsm.6882 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kwon Y, Kazaglis L, Cho Y, Howell MJ, Mahowald MW. Test-retest reliability of two consecutive mean sleep latency tests in patients with hypersomnia. Sleep Biol Rhythms. 2017;15(4):337–339. 10.1007/s41105-017-0116-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Huang YS, Tafti M, Guilleminault C. Daytime sleepiness with and without cataplexy in Chinese-Taiwanese patients. Sleep Med. 2006;7(5):454–457. 10.1016/j.sleep.2006.05.009 [DOI] [PubMed] [Google Scholar]
- 9.Huang YS, Guilleminault C, Lin CH, Chen CH, Chin WC, Chen TS. Multiple sleep latency test in narcolepsy type 1 and narcolepsy type 2: a 5-year follow-up study. J Sleep Res. 2018;27(5):e12700. 10.1111/jsr.12700 [DOI] [PubMed] [Google Scholar]
- 10.Pataka A, Yoon CH, Poddar A, Riha RL. Assessment of multiple sleep latency testing in adults in Europe. Sleep Med. 2013;14(2):136–139. 10.1016/j.sleep.2012.09.025 [DOI] [PubMed] [Google Scholar]
- 11.Fronczek R, Arnulf I, Baumann CR, Maski K, Pizza F, Trotti LM. To split or to lump? Classifying the central disorders of hypersomnolence. Sleep. 2020;43(8):zsaa044. 10.1093/sleep/zsaa044 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Pizza F, Vandi S, Detto S, et al. Different sleep onset criteria at the multiple sleep latency test (MSLT): an additional marker to differentiate central nervous system (CNS) hypersomnias. J Sleep Res. 2011;20(1 Pt 2):250–256. 10.1111/j.1365-2869.2009.00808.x [DOI] [PubMed] [Google Scholar]
- 13.Olsen AV, Stephansen J, Leary E, et al. Diagnostic value of sleep stage dissociation as visualized on a 2-dimensional sleep state space in human narcolepsy. J Neurosci Methods. 2017;282:9–19. 10.1016/j.jneumeth.2017.02.004 [DOI] [PubMed] [Google Scholar]
- 14.Ramm M, Boentert M, Lojewsky N, Jafarpour A, Young P, Heidbreder A. Disease-specific attention impairment in disorders of chronic excessive daytime sleepiness. Sleep Med. 2019;53:133–140. 10.1016/j.sleep.2018.09.021 [DOI] [PubMed] [Google Scholar]
