Abstract
The Stockholm Hierarchy is a professional consensus created to define the preferred approaches to defining analytical quality. The quality of a laboratory measurement can also be classified by the quality of the limits that the value is compared with, namely reference interval limits and clinical decision limits. At the highest level in the hierarchy would be placed clinical decision limits based on clinical outcome studies. The second level would include both formal reference interval studies (studies of intra and inter-individual variations) and clinical decision limits based on clinician survey. While these approaches are commonly used, they require a lot of resources to define accurately. Placing laboratory experts on the third level would suggest that although they can also define reference intervals by consensus, theirs aren’t as well regarded as clinician defined limits which drive clinical behaviour. Ideally both analytical and clinical considerations should be made, with clinicians and laboratorians both having important information to consider. The fourth level of reference intervals would be for those defined by survey or by regulatory authorities because of the focus on what is commonly achieved rather than what is necessarily correct. Finally, laboratorians know that adopting reference limits from kit inserts or textbook publications is problematic because both methodological issues and reference populations are often not the same as their own. This approach would rank fifth and last. When considering which so called ‘common’ or ‘harmonised reference intervals’ to adopt, both these characteristics and the quality of individual studies need to be assessed. Finally, we should also be aware that reference intervals describe health and physiology while clinical decision limits focus on disease and pathology, and unless we understand and consider the two corresponding issues of test specificity and test sensitivity, we cannot assure the quality of the limits that we report.
Introduction
I was fortunate to be amongst the 100 participants from 27 countries that met at the Nobel Institute in Stockholm in April 1999 to consider the many strategies that can be used to set analytical goals for clinical laboratory tests. The meeting was sponsored by the International Federation of Clinical Chemistry, the International Union of Pure and Applied Chemistry, and the World Health Organization, and included 22 formal presentations from the world’s leaders in analytical quality including Anders Kallner,1 Callum Fraser,2 Per Hyltoft Petersen,3 George Klee,4 Linda Thienpont,5 Sverre Sandberg,6 Carmen Ricós,7 Sharon Ehrmeyer8 and James Westgard.9
The meeting ended with a critical discussion that included all participants and the formulation of a consensus statement regarding a hierarchical approach to the definition of analytical quality specifications (Table 1).10 Working Subgroup 3 of ISO Technical Committee 212 (TC212), the committee that developed the internationally accepted Medical Laboratory Quality Standard (ISO 15189), proposed that this hierarchy should be used for the ongoing revision of ISO 15196 (the analytical quality standard) which still remains uncompleted. Nevertheless, the ‘Stockholm Hierarchy’ represents the last word by TC212 on this issue.11
Table 1.
The Stockholm Hierarchy for analytical quality goals. Adapted from Kenny D, et al.10
|
The hierarchy outlined in Table 1 shows that the highest level of defining analytical quality specifications is based on studies that can evaluate the effect of analytical performance on clinical outcome but rarely available in practice. An imprecise or biased method would logically misclassify patients and potentially lead to suboptimal clinical management and poorer clinical outcome. At the other extreme, ‘state of the art’ analytical quality specifications, such as the expected imprecision of a particular method listed in a kit insert, do not guarantee that the method is fit for the variety of clinical purposes that may exist for that analyte. Analytical goals are essentially required to assure diagnostic classification.12–14 Therefore, the hierarchy is a classification of our confidence in the clinical purpose of the assay rather than an isolated study of analytical quality.
ISO 15189 literally requires medical laboratories to include biological reference intervals or clinical decision limits, whichever is applicable, because a single value has little relevance in isolation. While subsequent values can be compared with the initial value when monitoring and trying to judge if a patient is getting better or worse, the initial result is usually compared with a confidence interval that describes apparently healthy reference individuals (reference interval) or with a limit that defines the likelihood of its indicating a particular disease state (clinical decision limit). The quality of reference intervals and decision limits will affect an individual patient’s classification just as much as the analytical quality of that individual patient’s value(s).
While the laboratory profession has already taken significant steps in the traceability or standardisation of methods, more recently we have become concerned with what appears to be a large variation in reference intervals between laboratories. Professional dialogue and co-operation is now required to reduce the unnecessary variation in reference intervals, especially since the move to electronic health records will otherwise entail a plethora of idiosyncratic laboratory-based reference intervals. These professional efforts in harmonisation of reference intervals are aimed at finding the best available evidence and encouraging their use.
Considering that the Stockholm Hierarchy can be viewed as a quality based hierarchy for judging the clinical worth of laboratory measurements, we should be able to use it in ranking the quality of candidate reference intervals (and decision limits) for common use. The remainder of this article is devoted to illustrating the application of each level of the Stockholm Hierarchy to the derivation and quality of reference intervals and clinical decision limits.
Level 1. Clinical Decision Limits Based on Clinical Outcomes in Specific Clinical Settings
Evidence based medicine is about making clinical decisions based on the evidence of proven clinical outcomes. In the Diabetes Control and Complications Trial (DCCT), HbA1c levels over 8.0% (64 mmol/mol) were associated with worse outcomes than levels below 7.0% (53 mmol/mol) and therefore these were proven clinical decision levels for intervention when monitoring diabetes mellitus.15 More recently the global acceptance of HbA1c as a diagnostic test for diabetes mellitus is based on the understanding that the specific adverse clinical outcomes characteristic of that illness (e.g. retinopathy) start to rise at a DCCT level of 6.5% (48 mmol/mol).16 There are some clinical outcomes present in both diabetes and prediabetes (e.g. coronary vascular disease and stroke) that begin to be noticed at DCCT levels of 5.5% (37 mmol/mol).17 Therefore there are several clinical decision limits that we can apply to HbA1c; (i) 7.0% vs. 8.0% for altering diabetic management, (ii) ≥6.5% for diabetes diagnosis and (iii) ≥5.5% for assessing cardiovascular risk.
The strengths of this approach are the link to clinical outcome and the promise of improving those outcomes with targeted intervention. The weakness is that these clinical decision limits are usually specific to a particular clinical outcome, which may or may not be the foremost clinical question for each individual patient. Another major problem is the amount of effort required to define these limits for each possible clinical outcome.
Similarly, while outcome based clinical decision limits may be defined for a specific population, that clinical outcome may have less relevance in other populations. For example, cholesterol cut-offs based on the risk of cardiovascular disease in adults may not be relevant in paediatrics or obstetrics. Similarly with urate, the upper ‘apparently healthy’ reference limit may be different from the clinical decision limit for predicting gout and from the concentrations associated with metabolic syndrome and pre-eclampsia in pregnancy.
Level 2. Evaluation of the Effect of Reference Limits on Clinical Decisions in General
When addressing analytical quality, this level had two subcategories, one based on data derived from biological variation and another based on clinicians’ opinions. Biological variation involves the study of both intra- and inter-individual variation, and population based reference intervals include both of these components as well as the measurement uncertainty of the assay used to generate them. Clinician opinions are based on surveys of what clinicians believe, and this should be distinguished from clinical decision limits which are based on what clinicians have proven through formal clinical outcome studies. As we lack such high level evidence for all clinical decisions, each clinician often has to weigh up the available evidence when deciding on each individual patient’s management. While this process is celebrated as ‘the art of medicine’ and requires years of experience, the task can be made more objective by expert clinicians creating consensus guidelines.
2a. Reference Intervals Defined by Intra- and Inter-Individual Biological Variation
The advantage of reference intervals is that they represent our best effort at defining the spread of results in health, and results outside a reference interval are assumed to indicate a high relative risk of disease. As it is almost impossible to find individuals in a perfect state of health, a practical alternative suggestion is that reference individuals are in a reasonable state of health and not suffering from significant illness.18 The weakness of this approach is that it is hard to define and find such healthy individuals that equally represent the many physiological variations that can exist in health. Some of those important healthy variations include pregnancy, gender and age (as in infants, children, puberty, young adults, menopausal women and elderly adults). There are also appreciable diurnal variations with fasting, eating, drinking and exercise, as well as monthly variations (menstrual cycle) and circannual variations including diet, sunlight and ambient temperature.
Fortunately, an internationally accepted standard for defining reference intervals exists (CLSI C28-A3).19 In the direct approach, healthy individuals are recruited, re-assessed for health, their samples analysed and the results described using parametric or non-parametric statistics. It takes a major effort to define these populations and analyse them in a consistent or traceable manner. Excellent examples include the NORIP study defining adult reference intervals in Scandinavia,20 the six Asian cities study,21 the CALIPER studies defining paediatric intervals in Canada22 and the paediatric health survey in Germany (KiGGS).23 While it might be assumed that these populations are homogenous, major differences do exist,24 diminishing their general applicability. Furthermore, the methods used in the studies define both an accuracy base and expected imprecision for field methods.25
Indirect methods seek to distinguish reference populations within the mixture of populations that exist in a laboratory database.26,27 They usually depend on an assumption of a Gaussian or log-Gaussian distribution of reference data. Importantly, they do not represent healthy reference intervals because most patients tested in clinical laboratories have a clinical complaint. Nevertheless, since an ideal reference population is one that exactly mirrors the patient except for the disease in question, it can be argued that reference populations defined within a laboratory database are better candidates for comparison: they have had their samples collected, transported and analysed exactly as patients do, and they have similar complaints (e.g. abdominal pain) without the disease (e.g. pancreatitis). In another example, reference populations for phaeochromocytoma should be patients with hypertension (but no phaeochromocytoma)28 rather than completely healthy normotensive populations. The issue of whether the reference population should be in an ideal state of health or more simply not have the disease in question addresses a weakness of ‘healthy’ reference intervals, namely that they are defined by specificity for health rather than sensitivity for disease. A disadvantage of the indirect method is that the reference intervals derived may vary with long-term variations in analytical quality.29 Studies have been conducted that show that the main differences between directly and indirectly defined reference intervals are due to assay bias.30
Both parametric and non-parametric approaches usually describe 95% confidence intervals for the reference population. However, this convention can be modified to 99% confidence intervals where greater specificity is required (e.g. troponin, CA125, alpha fetoprotein) or 90% confidence intervals when greater sensitivity has been accepted as a convention (e.g. testosterone). In other words, healthy reference intervals are by definition expressions of specificity for health; they don’t directly transpose to the likelihood of disease, the more typical clinical question.
2b. Decision Limits Based on Clinicians’ Opinions
While clinicians’ opinions can be used to decide whether a reference interval should have 95% or 99% specificity, this category refers more to the setting of a clinical decision limit based on clinical survey or consensus rather than a limit derived through a clinical outcome study as described initially above. An example of a clinical decision limit based on consensus is an upper reference limit for thyroid-stimulating hormone (TSH) of 2.5 mIU/L by the National Academy of Clinical Biochemistry (NACB).31 The strength of the approach is that clinicians can balance the need for specificity, sensitivity and management. The weakness of this approach is that clinicians may not be aware of the variations among assays that exist despite claims to traceability to a common reference standard.32 Other examples include the definition of testosterone deficiency (<8.5 nmol/L) by specialists advising the Australian pharmaceutical benefits schedule,33 and the recent definition of vitamin D sufficiency at the end of winter in Australia as ≥60 nmol/L.34
Ideally, laboratorians should be involved in the discussions of clinical decision limits as they may be more aware than the clinicians of differences in methodologies and they can also help to ensure that laboratories adopt the recommendations of the group.35–38
3. Published Professional Recommendations
Recommendations can come from national or international expert bodies or from local groups or individuals. They usually refer to laboratory experts defining reference intervals rather than clinicians involved in clinical decision limits. However laboratory experts can promote the adoption of clinical decision limits in preference to reference intervals and include further recommendations on how laboratories should structure their reports.39
Examples of reference intervals determined by laboratory professionals include those promoted by the Auckland Regional Quality Assurance Group (ARQAG),40,41 and those of the Sonic biochemistry reference interval network.42–45 This is not to say that these groups create reference intervals without evidence such as contained in the aforementioned approaches, but that these groups provide a laboratory perspective to the data that is available by pragmatically agreeing on reference intervals that are most appropriate for the populations and methodologies used in their network. Therein lies both the strength and weakness of this approach. Apart from harnessing the combined experience and qualifications of experts in the network, the strength is that these experts can gather all available evidence that applies to their populations and consider any method biases that exist in their methods, including pre-analytical factors. The weaknesses are that the reference intervals will need to be redefined when there is a change to network methods and that they may not be applicable to other networks with differing methods or populations (e.g. ambulant outpatient populations versus recumbent inpatient populations).
4. Reference Limits Set by Regulatory Bodies or Organisers of External Quality Assurance Schemes
When setting analytical quality goals, it is possible to consider the existing achievable performance benchmarks in external quality assurance surveys to establish a standard that can be achieved. It is also possible to compare analytical variation with variation in reference intervals to investigate whether reference interval variation is greater than the variation between methods.46,47 Reference interval surveys have been conducted in the USA48 as well as the UK where the surveys form part of the information for harmonising reference intervals.49,50 A proposed benchmark from such surveys might be the most common interval used or the median reference interval. Postanalytical external quality assurance (EQA) can also serve to improve both the quality of laboratory based interpretation and the reference intervals that laboratories use to make those interpretations.51
Regulatory authorities can also play a large role in determining reference intervals and clinical decision limits. When an assay is approved for use by local regulatory authorities, the approval often includes the expected patient distributions or reference limit for that assay. Subsequent approval of similar tests may be based on their similarity to predicate devices including having the same interpretive limits. In 1994, prostate-specific antigen (PSA) was approved by the US Food and Drug Administration (FDA) as an aid for early prostate cancer detection using a 4.0 μg/L cut-off. Subsequent approvals for PSA also claimed a 4.0 μg/L cut-off for all men even though clinical practice had progressed to using lower reference limits in younger men to define men with more favourable prognosis.52 This issue has also complicated the approval of cardiac troponin I assays when the FDA has a predicate device with a cut-off defined by receiver operating characteristic (ROC) analysis, while cut-offs for new troponin I assays are now being defined by evaluating diagnostic performance at the 99th percentile of the reference population.53
5. Reference Limits Based on the Current State of the Art or Current Publications
This was defined as the lowest level of assessing analytical performance goals because laboratories that compared their analytical performance with published data could not be sure that the published analytical performance fulfilled any quality goals itself. Taking a reference interval from a publication usually does not provide the confidence that the published reference interval was established using the same method as yours let alone that the reference population would be the same as yours. Comparable methods and comparable reference populations are the two essential considerations according to the CLSI C28 guideline when laboratory directors subjectively accept published reference intervals. While a reference interval (or ‘expected values’) from a kit insert that you are using might be assumed to be the same method, this usually requires method validation. Similarly, while a reference interval derived in the USA or Europe might be assumed to be similar to one in Australia, this also requires validation. This is why almost all kit inserts suggest that the laboratory performs their own reference interval studies. Validation of the published reference interval is sometimes impossible when the kit insert does not include a description of the reference population including their age and gender, and too often it fails to recognise important morbidities such as obesity. It may have once been acceptable to extract reference intervals from common textbooks, but this is (by this definition) the worst way to do things. Of course, sometimes there are no reference intervals available for your particular method in children or pregnancy so published values such as provided by reference books by Soldin54 and Gronowski55 can be useful guides. However, the methods used in those publications and their reference populations (e.g. hospitalised children) must be critically evaluated.
Discussion
It should be noted that while outlining the levels of the Stockholm Hierarchy as it would apply to reference intervals and clinical decision limits, I have alternated between these different concepts. For example, level 1 is to do with clinical decision limits, level 2 is either to do with reference intervals or clinical decision limits, and level 3 is mainly to do with reference intervals but could be to do with clinical decision limits, as is also true for levels 4 and 5. In reality, there are two hierarchies, one set that applies to reference intervals and another set that applies to clinical decision limits. The only distinguishing feature is whether we are focusing on limiting false positives in health (specificity) or limiting false negatives in disease (sensitivity). For example, level 1 does not necessarily have to address the clinical outcomes of disease; it could apply to the expected clinical outcome when disease is absent (health), e.g. survival or longevity. Finally, the two aspects, health and disease, can be combined statistically though ROC analysis where cut-offs that balance sensitivity and specificity can be defined. This approach is useful for all the situations where we cannot decide whether test sensitivity or specificity is the more important issue.
I hope I have demonstrated that the Stockholm Hierarchy, initially proposed as a tool to define analytical quality, is just as useful a tool for evaluating reference intervals and clinical decision limits. The recent literature indicates that the inter-relationship between analytical goals, reference intervals, decision limits and their impact on patient classification (or misclassification)56–58 is the current frontier that brings together everything that underlies the value of our clinical laboratory tests.
Table 2.
The Stockholm Hierarchy applied to reference intervals and clinical decision limits.
|
Footnotes
Competing Interests: None declared.
References
- 1.Kallner A. Quality specifications based on the uncertainty of measurement. Scand J Clin Lab Invest. 1999;59:513–6. doi: 10.1080/00365519950185256. [DOI] [PubMed] [Google Scholar]
- 2.Fraser CG. General strategies to set quality specifications for reliability performance characteristics. Scand J Clin Lab Invest. 1999;59:487–90. doi: 10.1080/00365519950185210. [DOI] [PubMed] [Google Scholar]
- 3.Petersen PH. Quality specifications based on analysis of effects of performance on clinical decision-making. Scand J Clin Lab Invest. 1999;59:517–21. doi: 10.1080/00365519950185265. [DOI] [PubMed] [Google Scholar]
- 4.Klee GG, Schryver PG, Kisabeth RM. Analytic bias specifications based on the analysis of effects on performance of medical guidelines. Scand J Clin Lab Invest. 1999;59:509–12. doi: 10.1080/00365519950185247. [DOI] [PubMed] [Google Scholar]
- 5.Thienpont LM. Quality specifications for reference methods. Scand J Clin Lab Invest. 1999;59:535–8. doi: 10.1080/00365519950185300. [DOI] [PubMed] [Google Scholar]
- 6.Sandberg S, Thue G. Quality specifications derived from objective analyses based upon clinical needs. Scand J Clin Lab Invest. 1999;59:531–4. doi: 10.1080/00365519950185292. [DOI] [PubMed] [Google Scholar]
- 7.Ricós C, Alvarez V, Cava F, García-Lario JV, Hernández A, Jiménez CV, et al. Current databases on biological variation: pros, cons and progress. Scand J Clin Lab Invest. 1999;59:491–500. doi: 10.1080/00365519950185229. [DOI] [PubMed] [Google Scholar]
- 8.Ehrmeyer SS, Laessig RH. Effect of legislation (CLIA’88) on setting quality specifications for US laboratories. Scand J Clin Lab Invest. 1999;59:563–7. doi: 10.1080/00365519950185364. [DOI] [PubMed] [Google Scholar]
- 9.Westgard JO. The need for a system of quality standards for modern quality management. Scand J Clin Lab Invest. 1999;59:483–6. doi: 10.1080/00365519950185201. [DOI] [PubMed] [Google Scholar]
- 10.Kenny D, Fraser CG, Hyltoft Petersen P, Kallner A. Strategies to set Global Analytical Quality Specifications in Laboratory Medicine – Consensus agreement. Scand J Clin Lab Invest. 1999;59:585. [PubMed] [Google Scholar]
- 11.Kaplan LA. Determination and application of desirable analytical performance goals: the ISO/TC 212 approach. Scand J Clin Lab Invest. 1999;59:479–82. doi: 10.1080/00365519950185193. [DOI] [PubMed] [Google Scholar]
- 12.Klee GG. Tolerance limits for short-term analytical bias and analytical imprecision derived from clinical assay specificity. Clin Chem. 1993;39:1514–8. [PubMed] [Google Scholar]
- 13.Petersen PH, de Verdier CH, Groth T, Fraser CG, Blaabjerg O, Hørder M. The influence of analytical bias on diagnostic misclassifications. Clin Chim Acta. 1997;260:189–206. doi: 10.1016/s0009-8981(96)06496-0. [DOI] [PubMed] [Google Scholar]
- 14.Hosogaya S, Ozaki Y. [Allowable limits of analytical error which can guarantee the reliability of reference intervals for interpretation of clinical laboratory data. (Japanese)] Rinsho Byori. 2008;56:617–21. [PubMed] [Google Scholar]
- 15.The Diabetes Control and Complications Trial Research Group The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. N Engl J Med. 1993;329:977–86. doi: 10.1056/NEJM199309303291401. [DOI] [PubMed] [Google Scholar]
- 16.Mbanya JC, Henry RR, Smith U. Presidents’ statement on the WHO recommendation on HbA1c for diabetes diagnosis. Diabetes Res Clin Pract. 2011;93:310–1. doi: 10.1016/j.diabres.2011.06.026. [DOI] [PubMed] [Google Scholar]
- 17.Selvin E, Steffes MW, Zhu H, Matsushita K, Wagenknecht L, Pankow J, et al. Glycated hemoglobin, diabetes, and cardiovascular risk in nondiabetic adults. N Engl J Med. 2010;362:800–11. doi: 10.1056/NEJMoa0908359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ritchie RF, Palomaki G. Selecting clinically relevant populations for reference intervals. Clin Chem Lab Med. 2004;42:702–9. doi: 10.1515/CCLM.2004.120. [DOI] [PubMed] [Google Scholar]
- 19.Clinical and Laboratory Standards Institute (CLSI) Defining, establishing, and verifying reference intervals in the clinical laboratory; Approved guideline - Third Edition. CLSI Document C28-A3. Wayne, PA, USA: CLSI; 2008. [Google Scholar]
- 20.Rustad P, Felding P, Franzson L, Kairisto V, Lahti A, Mårtensson A, et al. The Nordic Reference Interval Project 2000: recommended reference intervals for 25 common biochemical properties. Scand J Clin Lab Invest. 2004;64:271–84. doi: 10.1080/00365510410006324. [DOI] [PubMed] [Google Scholar]
- 21.Ichihara K, Itoh Y, Lam CW, Poon PM, Kim JH, Kyono H, et al. Sources of variation of commonly measured serum analytes in 6 Asian cities and consideration of common reference intervals. Clin Chem. 2008;54:356–65. doi: 10.1373/clinchem.2007.091843. [DOI] [PubMed] [Google Scholar]
- 22.Colantonio DA, Kyriakopoulou L, Chan MK, Daly CH, Brinc D, Venner AA, et al. Closing the gaps in pediatric laboratory reference intervals: a CALIPER database of 40 biochemical markers in a healthy and multiethnic population of children. Clin Chem. 2012;58:854–68. doi: 10.1373/clinchem.2011.177741. [DOI] [PubMed] [Google Scholar]
- 23.Kohse KP, Thamm M. KiGGS-the German survey on children’s health as data base for reference intervals. Clin Biochem. 2011;44:479. doi: 10.1016/j.clinbiochem.2011.02.016. [DOI] [PubMed] [Google Scholar]
- 24.Lahti A. Are the common reference intervals truly common? Case studies on stratifying biochemical reference data by countries using two partitioning methods. Scand J Clin Lab Invest. 2004;64:407–30. doi: 10.1080/00365510410006027. [DOI] [PubMed] [Google Scholar]
- 25.Rustad P, Hyltoft Petersen P. Effect of analytical quality on establishing common reference intervals and their use. Scand J Clin Lab Invest. 2004;64:399–406. doi: 10.1080/00365510410006315. [DOI] [PubMed] [Google Scholar]
- 26.Ilcol YO, Aslan D. Use of total patient data for indirect estimation of reference intervals for 40 clinical chemical analytes in Turkey. Clin Chem Lab Med. 2006;44:867–76. doi: 10.1515/CCLM.2006.139. [DOI] [PubMed] [Google Scholar]
- 27.Shine B. Use of routine clinical laboratory data to define reference intervals. Ann Clin Biochem. 2008;45:467–75. doi: 10.1258/acb.2008.008028. [DOI] [PubMed] [Google Scholar]
- 28.Kairisto V, Koskinen P, Mattila K, Puikkonen J, Virtanen A, Kantola I, et al. Reference intervals for 24-h urinary normetanephrine, metanephrine, and 3-methoxy-4-hydroxymandelic acid in hypertensive patients. Clin Chem. 1992;38:416–20. [PubMed] [Google Scholar]
- 29.Matsubara A, Ichihara K, Fukutani S. Determination of reference intervals for 26 commonly measured biochemical analytes with consideration of long-term within-individual variation. Clin Chem Lab Med. 2008;46:691–8. doi: 10.1515/CCLM.2008.140. [DOI] [PubMed] [Google Scholar]
- 30.Arzideh F, Wosniok W, Haeckel R. Reference limits of plasma and serum creatinine concentrations from intra-laboratory data bases of several German and Italian medical centres: Comparison between direct and indirect procedures. Clin Chim Acta. 2010;411:215–21. doi: 10.1016/j.cca.2009.11.006. [DOI] [PubMed] [Google Scholar]
- 31.Baloch Z, Carayon P, Conte-Devolx B, Demers LM, Feldt-Rasmussen U, Henry JF, et al. Laboratory medicine practice guidelines. Laboratory support for the diagnosis and monitoring of thyroid disease. Thyroid. 2003;13:3–126. doi: 10.1089/105072503321086962. [DOI] [PubMed] [Google Scholar]
- 32.Waise A, Price HC. The upper limit of the reference range for thyroid-stimulating hormone should not be confused with a cut-off to define subclinical hypothyroidism. Ann Clin Biochem. 2009;46:93–8. doi: 10.1258/acb.2008.008113. [DOI] [PubMed] [Google Scholar]
- 33.Handelsman DJ. Pharmacoepidemiology of testosterone prescribing in Australia, 1992–2010. Med J Aust. 2012;196:642–5. doi: 10.5694/mja11.11277. [DOI] [PubMed] [Google Scholar]
- 34.Nowson CA, McGrath JJ, Ebeling PR, Haikerwal A, Daly RM, Sanders KM, et al. Vitamin D and health in adults in Australia and New Zealand: a position statement. Med J Aust. 2012;196:686–7. doi: 10.5694/mja11.10301. [DOI] [PubMed] [Google Scholar]
- 35.Mathew TH, Australasian Creatinine Consensus Working Group Chronic kidney disease and automatic reporting of estimated glomerular filtration rate: a position statement. Med J Aust. 2005;183:138–41. doi: 10.5694/j.1326-5377.2005.tb06958.x. [DOI] [PubMed] [Google Scholar]
- 36.Mathew TH, Johnson DW, Jones GR, Australasian Creatinine Consensus Working Group Chronic kidney disease and automatic reporting of estimated glomerular filtration rate: revised recommendations. Med J Aust. 2007;187:459–63. doi: 10.5694/j.1326-5377.2007.tb01357.x. [DOI] [PubMed] [Google Scholar]
- 37.Johnson DW, Jones GR, Mathew TH, Ludlow MJ, Doogue MP, Jose MD, et al. Chronic kidney disease and automatic reporting of estimated glomerular filtration rate: new developments and revised recommendations. Med J Aust. 2012;197:224–5. doi: 10.5694/mja11.11329. [DOI] [PubMed] [Google Scholar]
- 38.Johnson DW, Jones GR, Mathew TH, Ludlow MJ, Chadban SJ, Usherwood T, et al. Chronic kidney disease and measurement of albuminuria or proteinuria: a position statement. Med J Aust. 2012;197:224–5. doi: 10.5694/mja11.11468. [DOI] [PubMed] [Google Scholar]
- 39.Australian Pathology Lipid Interest Group. Appleton CA, Caldwell G, McNeil A, Meerkin M, Sikaris K, et al. Recommendations for lipid testing and reporting by Australian pathology laboratories. Clin Biochem Rev. 2007;28:32–45. [PMC free article] [PubMed] [Google Scholar]
- 40.Jones G, Barker A. Standardisation of reference intervals: an Australasian view. Clin Biochem Rev. 2007;28:169–73. [PMC free article] [PubMed] [Google Scholar]
- 41.Reed M. The New Zealand approach to harmonised reference intervals. Clin Biochem Rev. 2012;33:115–8. [PMC free article] [PubMed] [Google Scholar]
- 42.Taylor N, Meerkin M, Sikaris KA, McNeil A, Garcia Webb P, Guerin M. Patient data defined LFT reference intervals. Clin Biochem Rev. 2001;22:89. [Google Scholar]
- 43.Sikaris KA, Kanowski D, Caldwell G, Sack S, Flatman R. Consensus network reference intervals. Clin Biochem Rev. 2006;27(Suppl.):S34. [Google Scholar]
- 44.Sikaris KA, Lu Z, Kanowski D, Price L, Flatman R, Caldwell G, et al. Defining Sonic network reference intervals for children. Clin Biochem Rev. 2009;30:S20. [Google Scholar]
- 45.Sikaris KA, Lu Z, Kanowski D, Price L, Flatman R, Caldwell G, et al. Defining Sonic network reference intervals for pregnancy. Clin Biochem Rev. 2009;30:S20. [Google Scholar]
- 46.Compton PJ, Stuart MC, Lazarus L. Error in laboratory reference limits as shown in a collaborative quality-assurance program. Clin Chem. 1986;32:845–9. [PubMed] [Google Scholar]
- 47.Zardo L, Secchiero S, Sciacovelli L, Bonvicini P, Plebani M. Reference intervals: are interlaboratory differences appropriate? Clin Chem Lab Med. 1999;37:1131–3. doi: 10.1515/CCLM.1999.165. [DOI] [PubMed] [Google Scholar]
- 48.Friedberg RC, Souers R, Wagar EA, Stankovic AK, Valenstein PN, College of American Pathologists The origin of reference intervals. Arch Pathol Lab Med. 2007;131:348–57. doi: 10.5858/2007-131-348-TOORI. [DOI] [PubMed] [Google Scholar]
- 49.Berg J. The approach to pathology harmony in the UK. Clin Biochem Rev. 2012;33:89–93. [PMC free article] [PubMed] [Google Scholar]
- 50.Berg J, Lane V. Pathology Harmony; a pragmatic and scientific approach to unfounded variation in the clinical laboratory. Ann Clin Biochem. 2011;48:195–7. doi: 10.1258/acb.2011.011078. [DOI] [PubMed] [Google Scholar]
- 51.Sciacovelli L, Zardo L, Secchiero S, Zaninotto M, Plebani M. Interpretative comments and reference ranges in EQA programs as a tool for improving laboratory appropriateness and effectiveness. Clin Chim Acta. 2003;333:209–19. doi: 10.1016/s0009-8981(03)00188-8. [DOI] [PubMed] [Google Scholar]
- 52.Partin AW, Criley SR, Subong EN, Zincke H, Walsh PC, Oesterling JE. Standard versus age-specific prostate specific antigen reference ranges among men with clinically localized prostate cancer: a pathological analysis. J Urol. 1996;155:1336–9. [PubMed] [Google Scholar]
- 53.Apple FS, Morrow D, Zaugg C, Hickey D, Zaharik M, Blackwood JJ, et al. The challenges and concerns companies face pertaining to the US Food and Drug Administration 510(k) process for cardiac biomarkers. Clin Chem. 2012;58:31–8. doi: 10.1373/clinchem.2011.165860. [DOI] [PubMed] [Google Scholar]
- 54.Soldin SJ, Wong EC, Brugnara C, Soldin OP, editors. Pediatric Reference Intervals. 7th Edition. Washington DC, USA: AACC Press; 2011. [Google Scholar]
- 55.Gronowski AM, editor. Handbook of Clinical Laboratory Testing During Pregnancy. Totowa NJ, USA: Humana Press; 2004. [Google Scholar]
- 56.Haeckel R, Wosniok W. A new concept to derive permissible limits for analytical imprecision and bias considering diagnostic requirements and technical state-of-the-art. Clin Chem Lab Med. 2011;49:623–35. doi: 10.1515/CCLM.2011.116. [DOI] [PubMed] [Google Scholar]
- 57.Petersen PH, Sandberg S, Fraser CG. Do new concepts for deriving permissible limits for analytical imprecision and bias have any advantages over existing consensus? Clin Chem Lab Med. 2011;49:637–40. doi: 10.1515/CCLM.2011.117. [DOI] [PubMed] [Google Scholar]
- 58.Petersen PH, Jensen EA, Brandslund I. Analytical performance, reference values and decision limits. A need to differentiate between reference intervals and decision limits and to define analytical quality specifications. Clin Chem Lab Med. 2011;50:819–31. doi: 10.1515/cclm-2011-0844. [DOI] [PubMed] [Google Scholar]