Abstract
Study Design Survey of spine surgeons.
Objective To determine the reliability with which international spine surgeons identify a posterior ligamentous complex (PLC) injury in a patient with a compression-type vertebral body fracture (type A).
Methods A survey was sent to all AOSpine members from the six AO regions of the world. The survey consisted of 10 cases of type A fractures (2 subtype A1, 2 subtype A2, 3 subtype A3, and 3 subtype A4 fractures) with appropriate imaging (plain radiographs, computed tomography, and/or magnetic resonance imaging), and the respondent was asked to identify fractures with a PLC disruption, as well as to indicate if the integrity of the PLC would affect their treatment recommendation.
Results Five hundred twenty-nine spine surgeons from all six AO regions of the world completed the survey. The overall interobserver reliability in determining the integrity of the PLC was slight (kappa = 0.11). No substantial regional or experiential difference was identified in determining PLC integrity or its absence; however, a regional difference was identified (p < 0.001) in how PLC integrity influenced the treatment of type A fractures.
Conclusion The results of this survey indicate that there is only slight international reliability in determining the integrity of the PLC in type A fractures. Although the biomechanical importance of the PLC is not in doubt, the inability to reliably determine the integrity of the PLC may limit the utility of the M1 modifier in the AOSpine Thoracolumbar Spine Injury Classification System.
Keywords: posterior ligamentous complex, PLC, reliability, AOSpine Thoracolumbar Spine Injury Classification System, M1 modifier
Introduction
The two main goals of spine trauma classifications are to allow for communication between surgeons, researchers, and trainees and to guide the treatment. Some thoracolumbar fracture classifications such as the Magerl classification are based mainly on the bony injury morphology, which can be reliably evaluated on plain radiographs as well as computed tomography (CT).1 2 However, because of the complexity of the Magerl system as well as the failure to formally consider the neurologic presentation of the patient, the clinical applicability of this classification system has been questioned.2 3 4 5 6
Recognizing the limitations of the Magerl system, Vaccaro et al published the thoracolumbar injury classification system (TLICS) in 2005.3 The TLICS assigns a numerical value to the fracture morphology, the integrity of the posterior ligamentous complex (PLC), and the neurologic status of the patient, and the treatment recommendation is based on the summation of these three values.3 Excellent reliability and validity of the system has been reported, with up to 96.2% agreement in the final treatment recommendation7 8 9; however, Harrop et al reported only fair reproducibility in the interpretation of the integrity of the PLC (kappa = 0.34).10
Without consistent agreement on the criteria that define the integrity of the PLC, the TLICS cannot properly guide the treatment of thoracolumbar burst fractures. In the TLICS algorithm, a burst fracture in a neurologically intact patient with an intact PLC is awarded 2 points, and an initial trial of nonoperative management is recommended. In a similar patient, if the PLC is clearly disrupted, 5 points are awarded and surgical intervention is warranted. If the status of the PLC is unclear, the TLICS fails to recommend a definitive treatment as the overall score of 4 points (indeterminate) places the treatment recommendation in a gray area, leaving it up to surgeon discretion. This failure of the TLICS, especially in burst-type fractures, has led to significant regional variations in the application of the proposed treatment algorithm. Recent studies from North and South America have reported successful nonoperative treatment of thoracolumbar burst fractures with and without a brace,11 12 and concurrent studies from Europe have reported the successful treatment of similar bony injuries with a combined anterior and posterior fusion.13 14
In 2013, Vaccaro et al published the AOSpine Thoracolumbar Spine Injury Classification System.15 In the new AOSpine classification, fractures are divided into three major types (A, compression; B, tension band injury; C, translational injury), and the neurologic status of the patient is divided into five groups (N0, neurologically intact; N1, transient neurologic deficit; N2, radiculopathy; N3, incomplete spinal cord injury; N4, complete spinal cord injury; NX, unable to asses neurologic status). A definitive ligamentous injury is defined as type B, but if there is uncertainty about ligamentous injury, a patient-specific modifier (M1) is assigned.15 Eventually a Spine Injury Score will be established and used as the basis for a global algorithm for the treatment of the thoracolumbar trauma. However, prior to establishing what value should be awarded to the M1 modifier, it is critical to establish the agreement between surgeons to identify a possible PLC injury in a thoracolumbar compression (type A) fracture.
The TLICS was the first classification to formally recognize the importance of the integrity of the PLC on thoracolumbar fracture stability, and so it is possible that surgeons have become more adept at identifying PLC injuries since Harrop et al reported only fair agreement 1 year after the TLICS was published (kappa = 0.34)3 10; however, if surgeons can still not reliably agree on the presence of PLC injuries, then any classification system that makes the PLC a defining variable in the recommended treatment is bound to have the same shortcomings as the TLICS. The aim of this study is to determine the reliability of spine surgeons in indentifying a PLC injury in a compression-type fracture (type A) from the six AOSpine regions of the world (North America, South America, Europe, Africa, Asia, and the Middle East) and to determine their opinion on the perceived importance of the integrity of the PLC on the treatment algorithm.
Methods
A survey was sent to AOSpine members from the six AO regions of the world. The survey consisted of 10 cases of type A fractures (2 subtype A1, 2 subtype A2, 3 subtype A3, and 3 subtype A4 fractures) with appropriate imaging (plain radiographs, CT, and/or magnetic resonance imaging [MRI]), and the respondent was asked to identify fractures with a PLC disruption, as well as to indicate if the integrity of the PLC would affect their treatment recommendation.
The kappa statistic, as proposed by Cohen, was calculated for measuring the agreement among the investigators to a disruption in the PLC, and this was estimated by region and experience. The Landis and Koch grading system was used to interpret the kappa (Table 1).16 A chi-square test was used to test the independence of categorical responses, and the statistical significance was accepted as p < 0.05. The analysis was performed using the statistical software STATA version 13 (Stata Corporation, College Station, Texas, United States).
Table 1. Landis and Koch grading system16 .
Kappa value | Strength of agreement |
---|---|
<0.20 | Slight |
0.21–0.41 | Fair |
0.41–0.60 | Moderate |
0.61–0.80 | Substantial |
>0.81 | Excellent |
Results
Five hundred twenty-nine surgeons from the six AO regions of the world completed the survey (Table 2). According to the total number of 5,290 assessments, around 84.1% answered the question “Do you see any significant injury to the posterior ligamentous complex (PLC)?” and 83.8% answered the question “Did the status of the posterior ligamentous complex (PLC) matter in your decision making?” The overall agreement on identifying disruption of the PLC was slight (kappa = 0.11). The interobserver reliability was highest in North America (kappa = 0.22), which also was the only region to attain fair (kappa > 0.20) reliability. All other regions demonstrated only slight reliability, and the reliability was the lowest in the Middle East (kappa = 0.07; Table 3). The interobserver reliability demonstrated only a minor improvement in the surgeons with more than 10 years of experience compared with the surgeons with less than 10 years of experience (kappa = 0.13 versus kappa = 0.10, respectively).
Table 2. Demographics of respondents.
Total number | Percent | |
---|---|---|
Region | ||
Europe | 152 | 28.7 |
Asia Pacific | 180 | 34.0 |
Latin America | 79 | 14.9 |
Middle East | 57 | 10.8 |
North America | 33 | 10.8 |
Africa | 7 | 1.3 |
Missing | 21 | 4.0 |
Experience | ||
1–10 y | 241 | 45.6 |
>10 y | 285 | 53.9 |
Missing | 3 | 0.6 |
Table 3. Interobserver reliability by region.
Region | Kappa value |
---|---|
Europe | 0.17 |
Asia Pacific | 0.08 |
Latin America | 0.10 |
Middle East | 0.07 |
North America | 0.22 |
Africa | 0.15 |
Significant (p < 0.001) regional variability was identified in the perceived importance of the integrity of the PLC among all compression-type fractures (A1, A2, A3, and A4; Table 4). Surgeons from North America reported that PLC integrity affected their treatment algorithm in 79.5% of cases, whereas surgeons from Europe reported that it only affected their recommended treatment in 61.2% of cases (Table 4). When only burst fractures (A3 and A4) were included in the analysis, regional variability regarding the importance of the integrity of the PLC on the treatment algorithm remained (p < 0.001; Table 4). No difference was identified in the perceived importance of the PLC according to surgeon experience for all type A fractures, but when only burst fractures were considered, surgeons with more than 10 years of experience were more likely to have the integrity of the PLC affect their treatment recommendation (Table 5).
Table 4. The effect of the integrity of the PLC on the treatment recommendation by region.
Region | Integrity of the PLC affected treatment for type A fractures (%) | Integrity of the PLC affected treatment for type A3/A4 fractures (%) |
---|---|---|
Europe | 61.2 | 54.3 |
Asia Pacific | 73.0 | 67.1 |
Latin America | 63.7 | 54.9 |
Middle East | 76.9 | 73.2 |
North America | 79.5 | 72.2 |
Africa | 61.9 | 44.4 |
Abbreviation: PLC, posterior ligamentous complex.
Note: p < 0.001 for all regions
Table 5. The effect of the integrity of the PLC on the treatment recommendation by experience.
Experience | Integrity of the PLC affected treatment for type A fractures (%) | Integrity of the PLC affected the treatment for type A3/A4 fractures (%) |
---|---|---|
1–10 y | 67.4 | 58.7 |
>10 y | 69.3 | 64.1 |
p value | 0.19 | <0.01 |
Abbreviation: PLC, posterior ligamentous complex.
Discussion
The goal of this study was to determine if international spine surgeons are able to reliably identify a PLC injury in a compression-type fracture (type A) and to determine their perception of the importance of the PLC on the treatment of type A fractures. Globally, we demonstrated only a slight interobserver reliability between spine surgeons (kappa = 0.11), and we demonstrated significant (p < 0.001) regional differences in the perceived importance of the PLC in type A fractures.
We hypothesized that the ability of spine surgeons to agree on the integrity of the PLC had improved substantially over the last 8 years, as it is one of the three main determinants of the TLICS. Instead, we found a substantial decline in the interobserver reliability compared with those reported in 2006 (kappa = 0.11 versus 0.34).10 This decline may be due to the fact that in the previous study, the 30 surgeons who completed the survey were a select group of academic spine surgeons who also helped develop the TLICS. Comparatively, in the current study, there were more than 10 times the number of surgeons surveyed, and none of them were responsible for the development of the AOSpine Thoracolumbar Spine Injury Classification System. Furthermore, in the current study, the surgeons were from throughout the world, which undoubtedly included some surgeons who practice in locations where MRIs are not routinely ordered to evaluate thoracolumbar type A fractures. Finally, and most importantly, the current study focused only on compression (type A) fractures, whereas the previous study had cases from all different fracture morphologies. The overall reliability would likely have been improved if more severe cases with an obvious PLC disruption, such as rotation/translation and distraction morphology, were included, as was the case in previous studies; however, this study focused on presumed type A fractures without clear signs of posterior distraction, because this is the only type in which the integrity of the PLC may be in question.
This study is one of a series of studies being performed to establish the intellectual foundation for a globally accepted algorithm for the treatment of thoracolumbar trauma based on the AOSpine Thoracolumbar Spine Injury Classification System. The failure to demonstrate any meaningful regional or experiential differences in the interobserver reliability of indentifying a PLC injury is consistent with the other studies performed. Schroeder et al demonstrated no regional or experiential difference in the perceived severity of subtypes of the AOSpine Thoracolumbar Spine Injury Classification System and demonstrated no regional or experiential difference in the radiographic interpretation of burst fractures with a single end plate (A3) or both end plates involved (A4).17 Furthermore, Kepler et al demonstrated the globally applicability of the AOSpine Thoracolumbar Spine Injury Classification System by demonstrating moderate (kappa = 0.56) interobserver reliability in a worldwide survey of 100 spine surgeons without prior knowledge of the classification system.18 However, with significant regional and experiential differences identified for the effect of the integrity of the PLC on the treatment algorithm and only slight interobserver reliability in identifying a PLC injury, the current study questions if the M1 modifier should be a pivotal piece in the thoracolumbar treatment algorithm. Using the results of the current and the aforementioned studies, the essential factors for a global algorithm for the treatment of thoracolumbar trauma based around the AOSpine Thoracolumbar Spine Injury Classification System are being identified.
This study has significant limitations, particularly those known to be inherent to survey studies in general. Although the survey was carefully designed by members of the AOSpine Trauma Knowledge Forum and all pertinent images were provided, the respondents did not have the opportunity to review the entire CT and MRI sequence. Undoubtedly, some respondents would have changed their answer if all CTs and MRIs were available; however, the current methodology was chosen for the sake of inclusiveness, as it allowed for 529 surgeons from all over the world to participate.
Conclusion
The interobserver reliability of an international group of spine surgeons to indentify an injury to the PLC in presumed type A injuries without clear signs of posterior distraction is only slight (kappa = 0.11), and there is a significant (p < 0.001) regional variability in how their perception of the integrity of the PLC affects their choice of treatment. These two findings indicate that the treatment algorithm associated with the AOSpine Thoracolumbar Spine Injury Classification System should be based primarily on fracture morphology and the neurologic status of the patient rather than the M1 modifier.
Acknowledgments
This manuscript was the work of the AOSpine Trauma Knowledge Forum. Each author was involved in the initial planning of this manuscript, designing the concept of the manuscript, and critically revising the manuscript. This manuscript is part of a much larger project designed to determine a surgical algorithm based off of the AOSpine Thoracolumbar Injury Classification.
AOSpine is a clinical division of the AO Foundation—an independent, medically guided, not-for-profit organization. The AO has a strong financial independence thanks to the foundation's endowment. The annual operating activities are financed through three pillars: collaboration and support agreements with DePuy Synthes and other industrial partners, return on own financial assets, and other third-party income (e.g., participant fees, R&D projects, memberships).
The AOSpine Knowledge Forums are pathology-focused working groups acting on behalf of AOSpine in their domain of scientific expertise. Each forum consists of a steering committee of up to 10 international spine experts who meet biannually to discuss research, assess the best evidence for current practices, and formulate clinical trials to advance their field of spine expertise. Authors are compensated for their travel and accommodation costs. Study support is provided directly through AOSpine's Research department and AO's Clinical Investigation and Documentation unit. There are no other institutional subsidies, corporate affiliations, or funding sources supporting this work unless clearly documented and disclosed.
Footnotes
Disclosures Gregory D. Schroeder, Grant: Medtronic Chris D. Kepler, Grant: AOSpine; Consultant: HealthGrades John K. Koerner, Grant: Medtronic F. Cumhur Oner, Consultant: Medtronic, Synthes; Grant: AOSpine, Depuy Michael G. Fehlings, Grant: AOSpine Bizhan Aarabi, Grant: AOSpine Marcel Dvorak, Grant: AOSpine Max Reinhold, Grant: AOSpine Frank Kandziora, Grant: AOSpine; Board membership: Eurospine, Dwg, DGOU, AOSpine; Consultant: Depuy, Siemens; Speakers bureau: Depuy, Siemens, AOSpine; Educational development: ESJ-OOT Carlo Bellabarba, Grant: AOSpine North America, AOSpine, Depuy, Omega Jens R. Chapman, Grant: AOSpine; Stocks: Renovis Medical; Royalties from publishers: Evidence-Based Spine-Care Journal, Global Spine Journal; editorial governing board: Evidence-Based Spine-Care Journal, Global Spine Journal, Spine, Journal of Spine; Board membership: AOSpine North America, AOSpine Foundation, CSRS—Membership Committee Luiz R. Vialle, Grant: AOSpine; Board membership: AOSpine Foundation Alexander R. Vaccaro, Grant: AOSpine; Consultant: Medtronic, Stryker, Globus, Stout, Gerson Lehrman Group, Guidepoint Global, Medacorp, Innovative Surgical Design, Orthobullets, Expert testimony, Ellipse; Scientific advisory board: Innovative Surgical Design, Spinicity; Royalties: Thieme, Jaypee, Elsevier, Taylor and Francis, Aesculap, Globus, Medtronic, Stryker, DePuy; Institutional support: Cerapedics; Stocks: Spine Medica, Computational Biodynamics, Progressive Spinal Technologies, Spinology, Small Bone Innovations, Cross Current, In Vivo, Flagship Surgical, Advanced Spinal Intellectual Properties, Cytonics, Bonovo Orthopaedics, Electrocore, Gamma Spine, Location Based Intelligence, FlowPharma, R.S.I., Replication Medica, Globus, Stout Medical, Rothman Institute and Related Properties, Innovative Surgical Design, Spinicity
References
- 1.Denis F. The three column spine and its significance in the classification of acute thoracolumbar spinal injuries. Spine (Phila Pa 1976) 1983;8(8):817–831. doi: 10.1097/00007632-198311000-00003. [DOI] [PubMed] [Google Scholar]
- 2.Magerl F, Aebi M, Gertzbein S D, Harms J, Nazarian S. A comprehensive classification of thoracic and lumbar injuries. Eur Spine J. 1994;3(4):184–201. doi: 10.1007/BF02221591. [DOI] [PubMed] [Google Scholar]
- 3.Vaccaro A R, Lehman R A Jr, Hurlbert R J. et al. A new classification of thoracolumbar injuries: the importance of injury morphology, the integrity of the posterior ligamentous complex, and neurologic status. Spine (Phila Pa 1976) 2005;30(20):2325–2333. doi: 10.1097/01.brs.0000182986.43345.cb. [DOI] [PubMed] [Google Scholar]
- 4.Wood K B, Khanna G, Vaccaro A R, Arnold P M, Harris M B, Mehbod A A. Assessment of two thoracolumbar fracture classification systems as used by multiple surgeons. J Bone Joint Surg Am. 2005;87(7):1423–1429. doi: 10.2106/JBJS.C.01530. [DOI] [PubMed] [Google Scholar]
- 5.Oner F C, Ramos L M, Simmermacher R K. et al. Classification of thoracic and lumbar spine fractures: problems of reproducibility. A study of 53 patients using CT and MRI. Eur Spine J. 2002;11(3):235–245. doi: 10.1007/s00586-001-0364-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Aebi M. Classification of thoracolumbar fractures and dislocations. Eur Spine J. 2010;19 01:S2–S7. doi: 10.1007/s00586-009-1114-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Raja Rampersaud Y, Fisher C, Wilsey J. et al. Agreement between orthopedic surgeons and neurosurgeons regarding a new algorithm for the treatment of thoracolumbar injuries: a multicenter reliability study. J Spinal Disord Tech. 2006;19(7):477–482. doi: 10.1097/01.bsd.0000211237.14211.21. [DOI] [PubMed] [Google Scholar]
- 8.Joaquim A F, Fernandes Y B, Cavalcante R A, Fragoso R M, Honorato D C, Patel A A. Evaluation of the thoracolumbar injury classification system in thoracic and lumbar spinal trauma. Spine (Phila Pa 1976) 2011;36(1):33–36. doi: 10.1097/BRS.0b013e3181c95047. [DOI] [PubMed] [Google Scholar]
- 9.Vaccaro A R Baron E M Sanfilippo J et al. Reliability of a novel classification system for thoracolumbar injuries: the Thoracolumbar Injury Severity Score Spine (Phila Pa 1976) 200631(11, Suppl):S62–S69., discussion S104 [DOI] [PubMed] [Google Scholar]
- 10.Harrop J S, Vaccaro A R, Hurlbert R J. et al. Intrarater and interrater reliability and validity in the assessment of the mechanism of injury and integrity of the posterior ligamentous complex: a novel injury severity scoring system for thoracolumbar injuries. Invited submission from the Joint Section Meeting On Disorders of the Spine and Peripheral Nerves, March 2005. J Neurosurg Spine. 2006;4(2):118–122. doi: 10.3171/spi.2006.4.2.118. [DOI] [PubMed] [Google Scholar]
- 11.Bakhsheshian J, Dahdaleh N S, Fakurnejad S, Scheer J K, Smith Z A. Evidence-based management of traumatic thoracolumbar burst fractures: a systematic review of nonoperative management. Neurosurg Focus. 2014;37(1):E1. doi: 10.3171/2014.4.FOCUS14159. [DOI] [PubMed] [Google Scholar]
- 12.Alcalá-Cerra G, Paternina-Caicedo A J, Díaz-Becerra C, Moscote-Salazar L R, Fernandes-Joaquim A. Orthosis for thoracolumbar burst fractures without neurologic deficit: a systematic review of prospective randomized controlled trials. J Craniovertebr Junction Spine. 2014;5(1):25–32. doi: 10.4103/0974-8237.135213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Schnake K J, Stavridis S I, Kandziora F. Five-year clinical and radiological results of combined anteroposterior stabilization of thoracolumbar fractures. J Neurosurg Spine. 2014;20(5):497–504. doi: 10.3171/2014.1.SPINE13246. [DOI] [PubMed] [Google Scholar]
- 14.Reinhold M, Knop C, Beisse R. et al. Operative treatment of 733 patients with acute thoracolumbar spinal injuries: comprehensive results from the second, prospective, Internet-based multicenter study of the Spine Study Group of the German Association of Trauma Surgery. Eur Spine J. 2010;19(10):1657–1676. doi: 10.1007/s00586-010-1451-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Vaccaro A R, Oner C, Kepler C K. et al. AOSpine thoracolumbar spine injury classification system: fracture description, neurological status, and key modifiers. Spine (Phila Pa 1976) 2013;38(23):2028–2037. doi: 10.1097/BRS.0b013e3182a8a381. [DOI] [PubMed] [Google Scholar]
- 16.Landis J R, Koch G G. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–174. [PubMed] [Google Scholar]
- 17.Schroeder G D, Vaccaro A R, Kepler C K. et al. Establishing the injury severity of thoracolumbar trauma: confirmation of the hierarchical structure of the AOSpine Thoracolumbar Spine Injury Classification System. Spine (Phila Pa 1976) 2015;40(8):E498–E503. doi: 10.1097/BRS.0000000000000824. [DOI] [PubMed] [Google Scholar]
- 18.Kepler C K Vaccaro A R Koerner J D et al. Reliability analysis of the AOSpine thoracolumbar spine injury classification system by a worldwide group of naïve spinal surgeons Eur Spine J 2015; January 20 (Epub ahead of print) [DOI] [PubMed] [Google Scholar]