Abstract
Background:
Epidemiological analyses of sarcoma are limited by the heterogeneity and rarity of the disease. Utilizing population-based surveillance data enabled us to evaluate the contribution of census tract-level socioeconomic status (CT-SES) and race/ethnicity on sarcoma incidence rates.
Methods:
We utilized the Surveillance, Epidemiology and End Results program to evaluate associations between CT-SES and race/ethnicity on the incidence rates of sarcoma. Incidence rate ratios (IRR) and 99% confidence intervals (CI) were estimated from quasi-Poisson models. All models were stratified by broad age groups (pediatric: < 20 years, Adult: 20 – 65 years, Older adult: 65 + years) and adjusted for sex, age and year of diagnosis. Within each age group, we conducted analyses stratified by somatic genome (fusion positive and fusion negative sarcomas) and for subtypes with > 200 total cases. A p-value less than 0.01 was considered statistically significant.
Results:
We included 55,415 sarcoma cases in 35 sarcoma subtype-age group combinations. Increasing CT-SES was statistically significantly associated with 11 subtype-age group combinations, primarily in the older age group strata (8 subtypes), while malignant peripheral nerve sheath tumors in adults were associated with decreasing CT-SES. Nearly every sarcoma subtype-age group combination displayed racial/ethnic disparities in incidence that were independent of CT-SES.
Conclusions:
We found race/ethnicity to be more frequently associated with sarcoma incidence than CT-SES. Our findings suggest that genetic variation associated with ancestry may play a stronger role than area-level SES-related factors in the etiology of sarcoma.
Impact:
These findings provide direction for future etiologic studies of sarcomas.
Introduction:
Sarcomas, which can be broadly categorized as soft tissue sarcomas (STS) or bone sarcomas (BS), comprise a group of over 50 exceedingly rare and heterogeneous mesenchymal neoplasms.(1) The overall incidence of STS and BS is fewer than 5 cases per 100,000 persons per year, although substantial variation in incidence by subtype, age, sex, and race has been described.(2–5) Several factors are associated with sarcoma risk.(6) These include genetic predisposition from both rare variants,(7–9) some of which underlie cancer predisposition syndromes,(10) and common variants.(11–13) Other studies have observed sarcoma development to be associated with environmental (e.g. phenoxyherbicide exposure(14,15), radiotherapy.(16,17)), infectious,(18) and perinatal factors (e.g. birth weight,(19) parental age(20)). However, the etiology for the vast majority of sarcomas remains unknown, largely because the rarity and heterogeneity of the disease preclude sufficiently powered etiologic studies.(6)
Evaluating socioeconomic status (SES) and racial/ethnic disparities in incidence across sarcoma subtypes may provide clues into their etiology. SES can serve as a proxy for locally varying environmental and lifestyle factors that influence health,(21) whereas racial/ethnic disparities that are independent of SES may indicate that genetic variants associated with ancestry contribute to tumor development. However, prior analyses of SES and sarcoma incidence are either limited to analyses of childhood cases or to those that combined distinct subtypes into a single group,(22–25) and few analyses of race-specific incidence patterns considered probable confounding by SES.(2,5) It is therefore difficult to properly attribute differences observed to environment or to genetic variants associated with ancestry.
In this study, we sought to utilize the population-based Surveillance, Epidemiology, and End Results (SEER) program to investigate the independent associations of census tract-level SES (CT-SES) and race/ethnicity with the incidence rate of individually rare subtypes of sarcoma diagnosed across the age span.
Materials and Methods:
Study population
Sarcoma cases were identified from the SEER Census Tract-level SES database, a specialized database that includes cancer cases diagnosed between 2000 and 2015 in the catchment area of 16 SEER registries.(26) The Alaska Native and Louisiana tumor registries are excluded by SEER due to confidentiality concerns and the impact of Hurricane Katrina on population estimates, respectively.(27) We identified a sarcoma as any microscopically confirmed, initial primary malignant tumor with an International Classification of Disease for Oncology, 3rd Edition (ICD-O-3) histology code recognized by the 2013 World Health Organization’s (WHO) Classification of Tumours of Soft Tissue and Bone.(28) We then categorized sarcomas into subtypes according to WHO classifications and the recommendations of an expert sarcoma oncologist (BJW) and pathologist (PM) (Supplementary Table 1). Kaposi sarcoma was excluded because it primarily arises in the setting of HIV infection.(18) Additionally, we classified sarcoma subtypes by their somatic genome as either fusion positive (F +) or fusion negative (F −) according to published data (Supplementary Table 2).(29,30) The (F +) category included subtypes with somatic genomes characterized by simple chromosomal translocations, whereas the (F −) category included subtypes with somatic genomes characterized by complex chromosomal rearrangements.(31) We opted to categorize sarcoma subtypes by their presumed fusion status because evidence suggests that (F −) sarcomas are more likely than (F +) sarcomas to arise from genetic susceptibility.(29,32–34) Cases diagnosed in the year 2000 were excluded because they were not coded under the ICD-O-3 guidelines introduced in 2001.(35)
SES, race/ethnicity, and other covariates
SES was assessed using a composite index of SES measured at the level of the census-tract, defined as a small geographical unit (~ 4,000 people) designed to be homogeneous with respect to population characteristics, economic status, and living conditions.(27) The CT-SES index was generated from a factor analysis of seven SES characteristics identified by Yost et al.(36) Cases were mapped to a census tract based upon their address at diagnosis.(36) CT-SES indices assigned to each census tract were then categorized by SEER into quintiles of equal population size, with the first quintile (Q1) representing the lowest CT-SES and the fifth quintile (Q5) representing the highest.(26)
Cases were identified as having Hispanic ethnicity by SEER based upon the North American Association for Central Cancer Registries Hispanic-Latino identification algorithm.(37) Those without a Hispanic ethnicity were categorized into mutually exclusive race categories as either Non-Hispanic (NH)-white, NH-black, or American Indian/Alaskan Native or Asian Pacific Islander (AIAN/API); the AIAN and API race categories were combined due to sample size constraints. Age at diagnosis was categorized into 9 age-groups (0 – 9 years, 10 - 19 years, 20 - 29 years, 30 – 39 years, 40 – 49 years, 50 - 64 years, 65 – 74 years, 75 - 84 years, 85 +).
The population denominator estimates provided by SEER were stratified by single calendar year, sex, age, race, and Hispanic ethnicity. SEER allocates multiracial populations into one of the four single race categories with a probability proportional to the size of that race in the population.(26)
After excluding cases with missing CT-SES (n = 914) or race/ethnicity (n = 775) variables, we analyzed 97% of the starting dataset (55,415 of 57,044 cases).
Statistical Analyses
Incidence rate ratios (IRR) and 99% confidence intervals (CI) for CT- SES and race/ethnicity, adjusted for age at diagnosis, sex, and year of diagnosis were estimated from quasi-Poisson models, which handle over dispersion by assuming the variance is a linear function of the mean.(38) In all models, log transformed population denominator estimates served as the offset term. Analyses were stratified by broad age group categories (pediatric < 20 years; adult 20 – 65 years; older adult > 65 years) to account for possible differences in sarcoma etiology. Within each age group strata, associations were assessed for sarcomas classified by fusion status (either (F+) or (F-)), as well as for any subtype with more than 200 total cases, a cut-off that was set a priori to maintain adequate power. For each sarcoma subtype, CT-SES was assessed as either a categorical or ordinal variable, with the p-value from the ordinal variable used as a test for trend. For ease of interpretation, we have focused our reporting on the results obtained from analyzing CT-SES as an ordinal variable (Figure 1). In some instances, these results appeared to obscure a non-linear association apparent in the analysis of CT-SES as a categorical variable. We, therefore, present results from analyses of CT-SES as a categorical variable in Supplementary Table 3.
Figure 1.

Multivariable adjusted incidence rate ratios (IRR) for sarcoma by CT-SES stratified by sarcoma subtype and age group, SEER 16 registries (2001 – 2015). The IRR was obtained by evaluating CT-SES as an ordinal variable and error bars represent 99% CIs. Estimates are adjusted for race/ethnicity, age at diagnosis, sex, and year. The analysis of malignant chordoma in the older adult strata did not include race/ethnicity as a covariate due to insufficient sample sizes (i.e. < 5 NHB cases). ES = Ewing sarcoma; RMS = rhabdomyosarcoma; ARMS = Alveolar RMS; F/MF = Fibroblasitc or Myofibroblastic; MPNST = Malignant peripheral nerve sheath tumor; DSRCT = Desmoplastic small round cell tumor, GIST = gastrointestinal stromal tumor; UPS = undifferentiated pleomorphic sarcoma.
A p-value less than 0.01 was considered statistically significant. This threshold adjusted for the multiple primary hypothesis tests evaluated within each sarcoma subtype and age group strata (i.e. the ordinal CT-SES and 3 race/ethnicity comparisons) and controls the overall Type I error rate at approximately 0.05. All reported p-values are two-sided. Datasets were created using SEER*Stat 8.3.6(39) and analyses were performed in R version 3.4.4.(40)
Results:
Table 1 presents the distribution of sarcoma cases by demographic and clinical characteristics. Overall, 55,415 initial primary malignant sarcoma cases were identified (48,348 STS and 7,067 BS). Among all sarcoma cases, 64% were NH-white, 12% were NH-black, 16% were of Hispanic ethnicity and 9% were AIAN/API. The distribution of sarcoma cases from lowest to highest SES quintile was 17%, 18%, 20%, 22%, and 24%, respectively.
Table 1.
Distribution of sarcoma cases by demographic and clinical characteristics, SEER 16 registries (2001 – 2015)
| Total | Sex, % | Race,% | Census Tract SES index, % | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| M | F | NHW | NHB | H | AIAN/API | Q1 | Q2 | Q3 | Q4 | Q5 | ||
| All Ages at diagnosis | ||||||||||||
| Bone sarcomas | 7,067 | 57 | 43 | 63 | 8 | 22 | 7 | 17 | 19 | 20 | 21 | 24 |
| Soft tissue sarcomas | 48,348 | 51 | 49 | 64 | 12 | 15 | 9 | 17 | 18 | 20 | 22 | 24 |
| (F −) sarcomas | 17,391 | 59 | 41 | 67 | 9 | 17 | 8 | 16 | 18 | 20 | 21 | 24 |
| (F +) sarcomas | 13,662 | 53 | 47 | 58 | 14 | 20 | 8 | 17 | 19 | 20 | 21 | 23 |
| Pediatric (<20 years) | ||||||||||||
| Osteosarcoma | 1,569 | 56 | 44 | 45 | 15 | 32 | 8 | 20 | 22 | 20 | 19 | 19 |
| ES of bone | 931 | 61 | 39 | 65 | 3 | 25 | 7 | 15 | 19 | 23 | 18 | 25 |
| ARMS | 489 | 54 | 46 | 45 | 17 | 28 | 10 | 18 | 22 | 18 | 19 | 23 |
| Embryonal RMS | 749 | 61 | 39 | 55 | 14 | 25 | 6 | 18 | 20 | 19 | 19 | 24 |
| F/MF tumors | 518 | 52 | 48 | 48 | 20 | 28 | 5 | 19 | 22 | 16 | 21 | 22 |
| Synovial sarcoma | 302 | 51 | 49 | 52 | 9 | 31 | 8 | 18 | 19 | 20 | 20 | 23 |
| ES of soft tissue | 352 | 56 | 44 | 57 | 5 | 32 | 7 | 16 | 19 | 22 | 21 | 22 |
| (F −) sarcomas | 2,078 | 56 | 44 | 46 | 14 | 32 | 8 | 20 | 22 | 19 | 20 | 20 |
| (F +) sarcomas | 2,762 | 56 | 44 | 55 | 10 | 27 | 8 | 17 | 20 | 20 | 19 | 23 |
| Adult (20 – 65 years) | ||||||||||||
| Osteosarcoma | 1,203 | 56 | 45 | 54 | 13 | 23 | 9 | 21 | 18 | 20 | 19 | 21 |
| ES of bone | 450 | 64 | 36 | 73 | 4 | 19 | 5 | 18 | 20 | 19 | 20 | 23 |
| Chondrosarcoma | 1,699 | 53 | 48 | 72 | 6 | 17 | 5 | 15 | 18 | 20 | 22 | 25 |
| Malignant chordoma | 448 | 61 | 39 | 65 | 5 | 21 | 9 | 16 | 17 | 18 | 22 | 28 |
| Other RMS | 372 | 58 | 42 | 52 | 16 | 22 | 10 | 20 | 19 | 21 | 20 | 20 |
| F/MF tumors | 5,581 | 48 | 52 | 57 | 19 | 15 | 9 | 18 | 18 | 20 | 21 | 24 |
| Synovial sarcoma | 1,366 | 54 | 46 | 56 | 10 | 26 | 8 | 19 | 20 | 19 | 21 | 21 |
| ES of soft tissue | 442 | 52 | 48 | 62 | 5 | 22 | 12 | 18 | 21 | 22 | 20 | 20 |
| MPNST | 1,048 | 55 | 45 | 58 | 15 | 18 | 9 | 22 | 19 | 20 | 20 | 18 |
| DSRCT | 225 | 84 | 16 | 53 | 20 | 20 | 8 | 18 | 23 | 23 | 16 | 20 |
| GIST | 3,966 | 56 | 44 | 54 | 18 | 14 | 14 | 19 | 18 | 20 | 21 | 23 |
| Leiomyosarcoma | 6,036 | 27 | 73 | 59 | 16 | 17 | 9 | 17 | 19 | 19 | 23 | 22 |
| Liposarcoma | 4,796 | 60 | 40 | 63 | 9 | 20 | 9 | 16 | 18 | 20 | 22 | 25 |
| Malignant vascular tumors | 1,027 | 52 | 48 | 63 | 12 | 16 | 10 | 16 | 19 | 21 | 22 | 23 |
| Epithelioid sarcoma | 386 | 57 | 43 | 62 | 11 | 19 | 7 | 17 | 18 | 18 | 24 | 23 |
| UPS | 1,571 | 62 | 38 | 70 | 9 | 15 | 6 | 16 | 17 | 20 | 22 | 26 |
| Unclassified sarcomas | 2,218 | 54 | 46 | 61 | 13 | 17 | 9 | 19 | 18 | 20 | 20 | 22 |
| (F −) sarcomas | 9,114 | 58 | 42 | 64 | 10 | 18 | 8 | 17 | 18 | 20 | 21 | 24 |
| (F +) sarcomas | 9,284 | 52 | 48 | 57 | 15 | 19 | 9 | 18 | 19 | 20 | 21 | 23 |
| Older Adult (65 + years) | ||||||||||||
| Chondrosarcoma | 537 | 53 | 47 | 83 | 3 | 10 | 4 | 13 | 18 | 18 | 24 | 27 |
| Malignant chordoma | 230 | 62 | 38 | NR | NR | NR | NR | 12 | 17 | 16 | 23 | 32 |
| Other RMS | 237 | 43 | 57 | 74 | 9 | 9 | 8 | 14 | 17 | 19 | 24 | 26 |
| F/MF tumors | 1,455 | 50 | 50 | 74 | 7 | 11 | 8 | 12 | 19 | 20 | 23 | 26 |
| MPNST | 252 | 55 | 45 | 75 | 8 | 10 | 8 | 17 | 18 | 21 | 20 | 24 |
| GIST | 3,309 | 48 | 52 | 62 | 15 | 9 | 14 | 17 | 19 | 18 | 22 | 23 |
| Leiomyosarcoma | 3,220 | 41 | 59 | 75 | 11 | 9 | 6 | 16 | 19 | 20 | 21 | 24 |
| Liposarcoma | 2,721 | 62 | 38 | 74 | 5 | 12 | 9 | 14 | 18 | 20 | 23 | 26 |
| Malignant vascular tumors | 1,078 | 58 | 42 | 76 | 4 | 9 | 11 | 14 | 16 | 21 | 26 | 23 |
| UPS | 2,568 | 68 | 32 | 85 | 3 | 6 | 5 | 12 | 17 | 21 | 22 | 27 |
| Unclassified sarcomas | 2,064 | 55 | 45 | 80 | 7 | 8 | 6 | 14 | 18 | 20 | 23 | 25 |
| (F −) sarcomas | 6,199 | 62 | 38 | 79 | 5 | 9 | 7 | 13 | 18 | 21 | 23 | 26 |
| (F +) sarcomas | 1,616 | 51 | 49 | 73 | 9 | 11 | 8 | 15 | 19 | 17 | 24 | 25 |
ES = Ewing sarcoma; RMS = rhabdomyosarcoma; ARMS = Alveolar RMS; F/MF = Fibroblastic or Myofibroblastic; MPNST = Malignant peripheral nerve sheath tumor; DSRCT = Desmoplastic small round cell tumor; GIST = gastrointestinal stromal tumor; UPS = undifferentiated pleomorphic sarcoma; M= Male; F = Female; NH = Non-Hispanic; H = Hispanic; W = White; B = Black; AIAIN/API = American Indian, Alaskan native, Asian Pacific Islander; Q1 corresponds to lowest small-area SES quintile, Q5 corresponds to highest.NR = not reported due to insufficient sample sizes (< 5 cases in a cell).
Socioeconomic status and sarcoma incidence
Figure 1 shows the IRRs for CT-SES, stratified by sarcoma category and age group strata (values are given in Supplementary Table 3). In the pediatric age group, a positive trend in the incidence of fibroblastic/myofibroblastic (F/MF) tumors was observed across CT-SES quintiles (IRR: 1.09, 99% CI 1.00, 1.18; p-value = 0.007). However, the results from CT-SES evaluated as an ordinal variable may have obscured a potential non-linear association in the incidence of F/MF tumors; the IRR comparing CT-SES Q2 vs Q1 (IRR: 1.37 ,99% CI: 0.97, 1.94) appeared higher than the IRR comparing CT-SES Q3 vs Q1 (IRR: 1.04, 99% CI: 0.71, 1.52; Supplementary Table 3). The incidence rates for other sarcoma subtypes evaluated in the pediatric age group were not associated with CT-SES.
In the adult age group strata, F/MF tumors and liposarcoma showed evidence of having a higher incidence in higher CT-SES quintiles (p-value < 0.001 for both), whereas malignant peripheral nerve sheath tumors (MPNST) showed the opposite trend with every increase in CT-SES quintile associated with a 7% lower incidence rate (99% CI: 0.87, 0.99; p-value = 0.002). The other subtypes evaluated in adults were not associated with CT-SES, including all of the bone sarcoma subtypes evaluated.
In the older adult age group strata, most subtypes (8 of 11 evaluated) showed a trend of increased incidence with increasing CT-SES quintiles that reached statistical significance, including those subtypes that were not significantly associated with CT-SES in adults. Other RMS, MPNST, and malignant vascular tumors were the only subtypes evaluated in older adults that were not associated with CT-SES.
In all age groups, we observed an increasing incidence of (F +) sarcomas with increasing CT-SES (p-value < 0.001 in all age groups). A positive trend was also observed for (F −) sarcomas in the older adult aged strata (p-value < 0.001), but the trends in the adult and pediatric age groups were not statistically significant (p-values = 0.02 and 0.08, respectively).
Race/ethnicity and sarcoma incidence
Figure 2 shows the IRRs for race and ethnicity, stratified by sarcoma category and age group strata (values are given in Supplementary Table 4). In the pediatric age group, the IRRs observed in NH-black children compared to NH-white children differed according to sarcoma subtype, with statistically significant IRRs observed for 5 of the 7 subtypes evaluated. Heterogeneous results were also observed among children of Hispanic ethnicity, although we found fewer significant results (3 of the 7 subtypes evaluated) and smaller effect sizes than those comparing incidence rates in NH-black to NH-white children. Among AIAN/API children, the IRRs that reached statistical significance (3 of 7 subtypes) were all less than 1, with some subtypes showing incidence rates that were nearly 50% lower in AIAN/API compared to NH-white children (e.g. Ewing sarcoma of bone, Embryonal RMS, and F/MF tumors).
Figure 2.

Multivariable adjusted incidence rate ratios for sarcoma by race/ethnicity stratified by sarcoma subtype and age group, SEER 16 registries (2001 – 2015). Estimates are adjusted for CT-SES (ordinal), age at diagnosis, sex, and year. Error bars represent 99% CIs. Multivariable analysis of malignant chordoma in older adult strata did not include race/ethnicity as a covariate due to insufficient sample sizes (i.e. < 5 NHB cases). ES = Ewing sarcoma; RMS = rhabdomyosarcoma; ARMS = Alveolar RMS; F/MF = Fibroblastic or Myofibroblastic; MPNST = Malignant peripheral nerve sheath tumor; DSRCT = Desmoplastic small round cell tumor, GIST = gastrointestinal stromal tumor; UPS = undifferentiated pleomorphic sarcoma; NH = non-Hispanic, AIAN/API = American Indian, Alaskan Native, Asian Pacific Islander. NH-White is the reference category.
In adults, several STS subtypes had incidence rates in NH-blacks that were substantially higher (IRR > 1.60) than those observed in NH-whites, including other RMS, F/MF tumors, desmoplastic small round cell tumor (DSRCT), GIST, and leiomyosarcoma. The incidence rates for several subtypes were also significantly different in adults of Hispanic ethnicity compared to NH-white adults (6 of 17 subtypes evaluated), although we again observed weaker associations than those comparing the incidence rates in NH-black adults to NH-white adults. Among adults identified as AIAN/API, we observed incidence rates that were significantly lower than those observed in NH-white adults for several sarcoma subtypes (9 of 17 subtypes evaluated). A notable exception is GIST, the only subtype evaluated to show a significantly higher incidence rate in AIAN/API compared to NH-white adults (IRR: 1.48, 99% CI: 1.31, 1.68).
Several of the results observed in older adults (> 65 years at diagnosis) were dissimilar to those in adults. For example, the incidence of UPS was lower in older adults identified as NH-black (IRR: 0.48, 99% CI: 0.35, 0.64) and Hispanic (IRR: 0.65, 95% CI 0.52, 0.8) compared to NH-white older adults, but showed no difference in incidence across these racial/ethnic categories in adults. Conversely, results among AIAN/API older adults largely concurred with those observed in younger aged individuals. Incidence rates in AIAN/API older adults that reached statistical significance (5 of 10 subtype evaluated) were primarily lower than those observed for NH-whites, except for GIST, which had an incidence rate that was 78% higher in AIAN/API compared to NH-white older adults(99% CI; 1.56, 2.04).
In all age groups, the incidence of (F −) and (F +) sarcomas in NH-black or Hispanic individuals compared to NH-white individuals were heterogeneous. Conversely, the comparative incidence rates in AIAN/API individuals that reached statistical significance consistently showed lower incidence rates for both (F −) and (F +) sarcomas compared to NH-whites in all age groups.
Discussion:
In the current study, we found race/ethnicity to be more often associated with sarcoma incidence than census tract-level SES. Specifically, of the 35 subtype-age group combinations evaluated for an association with CT-SES, 12 were statistically significant, and most occurred in the older age group strata (8 subtypes), with all, except MPNST in adults, having a positive trend in incidence across CT-SES levels. Conversely, nearly every subtype-age group combination displayed racial/ethnic differences in incidence rates that were independent of CT-SES. Notably strong associations included an over two-fold increase in the incidence of GIST among NH-black compared NH-white adults. Overall, our findings suggest that genetic variation associated with ancestry may play a stronger role than area-level SES-related factors in the etiology of STS and BS subtypes.
Prior results regarding the association between SES and sarcoma incidence are limited. A large SEER analysis (n = 1,576 STS and 1,113 BS) found that children aged 0 – 19 years living in higher-income counties of the United States had a higher incidence of STS but not BS,(22) whereas other case-control studies using maternal or household education as a proxy for SES have found mostly null associations.(24,41) In prior analyses, however, individual subtypes of sarcoma were not separately evaluated. With regards to race/ethnicity, several race-specific incidence patterns have been reported, including the dramatic patterns reported herein among minority populations diagnosed with Ewing sarcoma,(42–44) GIST,(45,46) DSRCT,(47) and leiomyosarcoma.(2) Prior analyses, however, have failed to comprehensively account for likely confounding by SES. The results of our analysis build upon prior work by demonstrating the independent associations of CT-SES and race/ethnicity on the incidence of subtypes of sarcoma throughout the age span.
Area-level SES is speculated to represent complex interactions between environmental, social, and cultural influences on health; therefore, we cannot determine which specific aspect may have driven our observed associations. Several possible risk factors for sarcoma have been identified in prior studies,(6) including those we presume to be correlated with both a high SES (e.g. older parental age(20)) and low SES (e.g. residential proximity to industrial facilities,(48,49)), but few of those nominated to date have been definitively established as risk factors.(6) Nonetheless, the variation in incidence across CT-SES supports the possibility that exogenous or lifestyle factors contribute to the development of at least some subtypes of sarcoma. Moreover, that results were heterogeneous across subtypes and age groups highlights the likelihood of etiologic differences in the tumors evaluated (i.e. environmental vs genetic factors).
For example, it is hypothesized that individuals diagnosed with (F −) sarcomas more often harbor rare, pathogenic germline genetic variants than those with (F +) sarcomas, as evidenced by the increased frequency with which (F −) sarcoma cases are diagnosed in cancer predisposition syndromes and as secondary cancers.(29,32–34) That the association with CT-SES appeared weaker in (F-) sarcomas compared to (F +) sarcomas in the pediatric and adult age groups supports this hypothesis, as tumors that are unrelated to SES are perhaps more likely influenced by germline genetics. In the older adult stratum, however, we note dissimilar findings; CT-SES was associated with both classes of tumors and nearly every subtype evaluated (8 of 11). We speculate that CT-SES is unrelated to (F −) sarcomas and other subtypes in younger patients due to the increased frequency with which germline genetic predisposition contributes to sarcoma development early in life.(7,50)
In the pediatric and adult age groups, the incidence of several sarcoma subtypes was unrelated to CT-SES but demonstrated dramatic race-specific incidence patterns. We speculate that these findings may be driven in part by differences in the frequency of genetic variants across ancestries. For example, the binding affinity for EWS-FLI1, the fusion oncoprotein characteristic of Ewing sarcoma,(30) depends upon the length of GGAA microsatellite repeats,(51) which is polymorphic between populations of primarily European and African ancestries.(52) This is consistent with the substantially higher incidence of Ewing sarcoma in NH-white individuals compared to NH-black individuals reported here and elsewhere.(42–44) Whether genetic variants that are associated with ancestry underlie associations with other subtypes is unclear. We note, however, that racial and ethnic categories are imperfect measures of genomic ancestry,(53) and that other social, cultural, or lifestyle factors which are unaccounted for by area-level SES may have confounded our observations. The use of admixture mapping(54,55) may help to resolve the potential mechanisms underlying our observations.
To our knowledge, ours is the largest study to investigate the independent contributions of SES and race/ethnicity on sarcoma incidence. Utilizing the SEER program enabled us to evaluate associations among individually rare subtypes of sarcoma within a racially and ethnically diverse population with reasonably adequate statistical power. Additionally, the use of a comprehensive and widely available SES variable measured at the census tract-level allowed us to control analyses for possible environmental and lifestyle factors, and facilitated informed speculations on the relative contribution of environment vs. genetics in the development of these rare group of cancers.
Nevertheless, we acknowledge several limitations. Although we included only microscopically confirmed sarcomas, there may still be misclassification of tumor subsets represented in our study. Accurately diagnosing sarcomas remains a clinical challenge,(56,57) and recording their diagnostic codes into tumor registries is subject to error.(58),(59) It is possible that misclassification may be greater among individuals with lower SES, as they may have reduced access to specialty sarcoma centers or clinical trials. Furthermore, we classified subtypes as fusion positive or negative according to the predominant somatic mutations in each sarcoma based on prior literature,(29,30) but tumors categorized as (F +) may have lacked a fusion gene and some (F −) sarcoma subtypes may harbor fusion genes yet to be discovered.(60) We also note that area-level SES does not represent individual-level SES, but rather captures the SES characteristics of one’s environment that may also influence health.(61) SEER includes individual-level insurance status for cases but does not include it in calculations of population denominator estimates. This precluded us from using insurance status as a measure of SES in our analysis of incidence rates. Other individual-level SES characteristics (e.g. income or educational attainment) are not publicly available from SEER. Furthermore, the CT-SES measure was assigned to a case based upon their address at diagnosis. This may not account for potentially long latency periods between exposure and diagnosis, during which an individual may have moved between areas of high or low SES. Furthermore, we note that our analysis could not account for environmental exposures that may be associated with sarcoma development, but which are unrelated to CT-SES. Finally, given this is a broad study of rare cancers across the age spectrum, a large number of statistical tests were performed. Some significant findings may therefore be due to chance.
Through this large and comprehensive analysis, we observed sarcoma incidence rates to be more often associated with race/ethnicity than CT-SES. Although future etiologic analyses – particularly genetic studies - are needed to confirm our findings, our results suggest that genetics play a greater role than environmental factors on the etiology of several subtypes of sarcoma.
Supplementary Material
Acknowledgements:
This work was supported by the National Institute of Arthritis and Musculoskeletal and Skin Diseases at the National Institutes of Health “Musculoskeletal Training Grant” (T32 AR05938 to BJD).
Abbreviations:
- CT-SES
census tract-level socioeconomic status
- STS
soft tissue sarcomas
- BS
bone sarcomas
- SEER
Surveillance, Epidemiology and End Results
- ICD-O-3
International Classification of Disease for Oncology, 3rd Edition
- WHO
World Health Organization
- (F +)
Fusion positive
- (F −)
Fusion negative
- CI
confidence interval
- IRR
incidence rate ratio
- NH
non-Hispanic
- AIAN/API
American Indian/Alaskan Native or Asian Pacific Islander
- ES
Ewing sarcoma
- RMS
rhabdomyosarcoma
- ARMS
Alveolar rhabdomyosarcoma
- F/MF
FIbroblasitc or Myofibroblastic
- MPNST
Malignant peripheral nerve sheath tumor
- DSRCT
Desmoplastic small round cell tumor
- GIST
gastrointestinal stromal tumor
- UPS
undifferentiated pleomorphic sarcoma
Footnotes
Conflicts of interest: The authors declare no potential conflicts of interest.
References
- 1.Helman LJ, Meltzer P. Mechanisms of sarcoma development. Nat Rev Cancer. 2003;3:685–94. [DOI] [PubMed] [Google Scholar]
- 2.Toro JR, Travis LB, Hongyu JW, Zhu K, Fletcher CDM, Devesa SS. Incidence patterns of soft tissue sarcomas, regardless of primary site, in the Surveillance, Epidemiology and End Results program, 1978-2001: An analysis of 26,758 cases. Int J Cancer. 2006;119:2922–30. [DOI] [PubMed] [Google Scholar]
- 3.Ng VY, Scharschmidt TJ, Mayerson JL, Fisher JL. Incidence and survival in sarcoma in the United States: A focus on musculoskeletal lesions. Anticancer Res. 2013;33:2597–604. [PubMed] [Google Scholar]
- 4.Ferrari A, Sultan I, Huang TT, Rodriguez-Galindo C, Shehadeh A, Meazza C, et al. Soft tissue sarcoma across the age spectrum: A population-based study from the surveillance epidemiology and end results database. Pediatr Blood Cancer. 2011;57:943–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Rouhani P, Fletcher CDM, Devesa SS, Toro JR. Cutaneous soft tissue sarcoma incidence patterns in the U.S.: An analysis of 12,114 cases. Cancer. 2008;113:616–27. [DOI] [PubMed] [Google Scholar]
- 6.Burningham Z, Hashibe M, Spector L, Schiffman JD. The epidemiology of Sarcoma. EmbaseNature Rev Rheumatol. 2011;2:1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Ballinger ML, Goode DL, Ray-Coquard I, James PA, Mitchell G, Niedermayr E, et al. Monogenic and polygenic determinants of sarcoma risk: an international genetic study. Lancet Oncol. 2016;17:1261–71. [DOI] [PubMed] [Google Scholar]
- 8.Chan SH, Lim WK, Ishak NDB, Li ST, Goh WL, Tan GS, et al. Germline Mutations in Cancer Predisposition Genes are Frequent in Sporadic Sarcomas. Sci Rep. 2017;7:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zhang J, Walsh MF, Wu G, Edmonson MN, Gruber TA, Easton J, et al. Germline Mutations in Predisposition Genes in Pediatric Cancer. N Engl J Med. 2015;373:2336–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Farid M, Ngeow J. Sarcomas Associated With Genetic Cancer Predisposition Syndromes: A Review. Oncologist. 2016;21:1002–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Savage SA, Mirabello L, Wang Z, Gastier-Foster JM, Gorlick R, Khanna C, et al. Genome-wide association study identifies two susceptibility loci for osteosarcoma. Nat Genet. 2013;45:799–803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Postel-Vinay S, Véron AS, Tirode F, Pierron G, Reynaud S, Kovar H, et al. Common variants near TARDBP and EGR2 are associated with susceptibility to Ewing sarcoma. Nat Genet. 2012;44:323–9. [DOI] [PubMed] [Google Scholar]
- 13.Machiela MJ, Grünewald TGP, Surdez D, Reynaud S, Mirabeau O, Karlins E, et al. Genome-wide association study identifies multiple new loci associated with Ewing sarcoma susceptibility. Nat Commun. 2018;9:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hardell L, Sandström A. Case-control study: Soft-tissue sarcomas and exposure to phenoxyacetic acids or chlorophenols. Br J Cancer. 1979;39:711–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Eriksson M, Hardell L, Moller T. Soft-tissue sarcomas and exposure to chemical substances: a case-referent study. Br J Ind Med. 1981;38:27–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Berrington de Gonzalez A, Kutsenko A, Rajaraman P. Sarcoma risk after radiation exposure. Clin Sarcoma Res. 2012;2:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Virtanen A, Pukkala E, Auvinen A. Incidence of bone and soft tissue sarcoma after radiotherapy: A cohort study of 295,712 Finnish cancer patients. Int J Cancer. 2006;118:1017–21. [DOI] [PubMed] [Google Scholar]
- 18.Renwick N, Halaby T, Weverling GJ, Dukers NH, Simpson GR, Coutinho RA, et al. Seroconversion for human herpesvirus 8 during HIV infection is highly predictive of Kaposi’s sarcoma. Aids. 1998;12:2481–8. [DOI] [PubMed] [Google Scholar]
- 19.Ognjanovic S, Carozza SE, Chow EJ, Fox EE, Horel S, McLaughlin CC, et al. Birth characteristics and the risk of childhood rhabdomyosarcoma based on histological subtype. Br J Cancer. 2010;102:227–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Johnson KJ, Carozza SE, Chow EJ, Fox EE, Horel S, McLaughlin CC, et al. Parental age and risk of childhood cancer: a pooled analysis. Epidemiology. 2009;20:475–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Seeman TE, Crimmins E. Social Environment Effects on Health and Aging Integrating Epidemiologic and Demographic Approaches and Perspectives. Ann N Y Acad Sci. 2001; [DOI] [PubMed] [Google Scholar]
- 22.Pan IJ, Daniels JL, Zhu K. Poverty and childhood cancer incidence in the United States. Cancer Causes Control. 2010;21:1139–45. [DOI] [PubMed] [Google Scholar]
- 23.Alston RD, Rowan S, Eden TOB, Moran A, Birch JM. Cancer incidence patterns by region and socioeconomic deprivation in teenagers and young adults in England. Br J Cancer. 2007;96:1760–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kehm RD, Spector LG, Poynter JN, Vock DM, Osypuk TL. Socioeconomic Status and Childhood Cancer Incidence: A Population-Based Multilevel Analysis. Am J Epidemiol. 2018;187:982–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Hampras SS, Moysich KB, Marimuthu SP, Ravid V, Jayaprakash V. Socioeconomic factors and the risk for sarcoma. Eur J Cancer Prev. 2014;23:560–5. [DOI] [PubMed] [Google Scholar]
- 26.Census Tract-Level SES and Rurality Database (2000 - 2015) [Internet] [cited 2019 Oct 3]. Available from: https://seer.cancer.gov/seerstat/databases/census-tract/index.html [Google Scholar]
- 27.Yu M, Tatalovich Z, Gibson JT, Cronin KA. Using a composite index of socioeconomic status to investigate health disparities while protecting the confidentiality of cancer registry data. Cancer Causes Control. 2014;25:81–92. [DOI] [PubMed] [Google Scholar]
- 28.Fletcher CDM, Bridge JA, Beidge JA, Hogendoom PCW MF. WHO classification of tumours of soft tissue and bone. Lyon, Fr: IARC; 2013. [Google Scholar]
- 29.Lupo PJ, Brown AL, Hettmer S. Second malignancy risk among pediatric, adolescent, and young adult survivors of fusion-positive and fusion-negative sarcomas: Results from the SEER database, 1992 through 2012. Cancer. 2016;122:3492–500. [DOI] [PubMed] [Google Scholar]
- 30.Perry JA, Seong BKA, Stegmaier K. Biology and Therapy of Dominant Fusion Oncoproteins Involving Transcription Factor and Chromatin Regulators in Sarcomas. Annu Rev Cancer Biol. 2019;3:299–321. [Google Scholar]
- 31.Bovée JVMG, Hogendoorn PCW. Molecular pathology of sarcomas: Concepts and clinical implications. Virchows Arch. 2010;456:193–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Lupo PJ, Danysh HE, Plon SE, Curtin K, Malkin D, Hettmer S, et al. Family history of cancer and childhood rhabdomyosarcoma: A report from the children’s oncology group and the Utah Population Database. Cancer Med. 2015;4:781–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Archer NM, Amorim RP, Naves R, Hettmer S, Diller LR, Ribeiro KB, et al. An Increased Risk of Second Malignant Neoplasms After Rhabdomyosarcoma: Population-Based Evidence for a Cancer Predisposition Syndrome? Pediatr Blood Cancer. 2016;63:196–201. [DOI] [PubMed] [Google Scholar]
- 34.Pinto N, Hawkins DS. Second Malignant Neoplasms in Rhabdomyosarcoma: Victims of Our Own Success or an Underlying Genetic Predisposition Syndrome? Pediatr Blood Cancer. 2016;63:189–90. [DOI] [PubMed] [Google Scholar]
- 35.North American Association of Central Cancer Registries Guidelines for ICD-O-3 Implementation. NAACCR ICD-O-3 Implementation Work Group; 2000. [Google Scholar]
- 36.Yost K, Perkins C, Cohen R, Morris C, Wright W. Socioeconomic status and breast cancer incidence in California for different race/ethnic groups. Cancer Causes Control. 2001;12:703–11. [DOI] [PubMed] [Google Scholar]
- 37.NAACCR Race and Ethnicity Work Group. NAACCR Guideline for Ehancing Hispanic/Latino Identification: Revised NAACCR Hispanic/Latino Identification Algorithm [NHIA v2.2.1]. Springfield, IL: North American Association of Central Cancer Registries; 2011. [Google Scholar]
- 38.Ver Hoef JM, Boveng PL. Quasi-poisson vs. negative binomial regression: How should we model overdispersed count data? Ecology. 2007;88:2766–72. [DOI] [PubMed] [Google Scholar]
- 39.SEER*Stat Software [Internet] [cited 2019 Oct 15]. Available from: https://seer.cancer.gov/seerstat/
- 40.R core team (2016). R: A language and enironment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria: URL https://www.R-project.org [Internet] R Foundation for Statistical Computing, Vienna, Austra; Available from: https://www.r-project.org [Google Scholar]
- 41.Carozza SE, Puumala SE, Chow EJ, Fox EE, Horel S, Johnson KJ, et al. Parental educational attainment as an indicator of socioeconomic status and risk of childhood cancers. Br J Cancer. 2010;103:136–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Jawad MU, Cheung MC, Min ES, Schneiderbauer MM, Koniaris LG, Scully SP. Ewing sarcoma demonstrates racial disparities in incidence-related and sex-related differences in outcome: An analysis of 1631 cases from the SEER database, 1973-2005. Cancer. 2009;115:3526–36. [DOI] [PubMed] [Google Scholar]
- 43.Polednak AP. Primary bone cancer incidence in black and white residents of New York state. Cancer. 1985;55:2883–8. [DOI] [PubMed] [Google Scholar]
- 44.Parkin DM, Stiller CA, Draper GJ, Bieber CA. The international incidence of childhood cancer. Int J Cancer. 1988;42:511–20. [DOI] [PubMed] [Google Scholar]
- 45.Søreide K, Sandvik OM, Søreide JA, Giljaca V, Jureckova A, Bulusu VR. Global epidemiology of gastrointestinal stromal tumours (GIST): A systematic review of population-based cohort studies. Cancer Epidemiol. 2016;40:39–46. [DOI] [PubMed] [Google Scholar]
- 46.Ma GL, Murphy JD, Martinez ME, Sicklick JK. Epidemiology of gastrointestinal stromal tumors in the era of histology codes: Results of a population-based study. Cancer Epidemiol Biomarkers Prev. 2015;24:298–302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Worch J, Cyrus J, Goldsby R, Matthay KK, Neuhaus J, DuBois SG. Racial differences in the incidence of mesenchymal tumors associated with EWSR1 translocation. Cancer Epidemiol Biomarkers Prev. 2011;20:449–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Zambon P, Ricci P, Bovo E, Casula A, Gattolin M, Fiore AR, et al. Sarcoma risk and dioxin emissions from incinerators and industrial plants: A population-based case-control study (Italy). Environ Heal A Glob Access Sci Source. 2007;6:1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.García-Pérez J, Morales-Piga A, Gómez-Barroso D, Tamayo-Uria I, Pardo Romaguera E, López-Abente G, et al. Risk of bone tumors in children and residential proximity to industrial and urban areas: New findings from a case-control study. Sci Total Environ. 2017;579:1333–42. [DOI] [PubMed] [Google Scholar]
- 50.Mirabello L, Yeager M, Mai PL, Gastier-Foster JM, Gorlick R, Khanna C, et al. Germline TP53 Variants and Susceptibility to Osteosarcoma. J Natl Cancer Inst. 2015;107:1–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Gangwal K, Sankar S, Hollenhorst PC, Kinsey M, Haroldsen SC, Shah AA, et al. Microsatellites as EWS/FLI response elements in Ewing’s sarcoma. Proc Natl Acad Sci U S A. 2008;105:10149–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Beck R, Monument MJ, Watkins WS, Smith R, Boucher KM, Schiffman JD, et al. EWS/FLI-responsive GGAA microsatellites exhibit polymorphic differences between European and African populations. Cancer Genet. 2012;205:304–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Mersha TB, Abebe T. Self-reported race/ethnicity in the age of genomic research: Its potential impact on understanding health disparities. Hum Genomics. 2015;9:1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Winkler CA, Nelson GW, Smith MW. Admixture Mapping Comes of Age. Annu Rev Genomics Hum Genet. 2010;11:65–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Shriner D Overview of admixture mapping. Curr Protoc Hum Genet. 2017;94:1–23. [DOI] [PubMed] [Google Scholar]
- 56.Ray-coquard I, Montesco MC, Coindre JM, Dei tos AP, Lurkin A, Ranchère-vince D, et al. Sarcoma: Concordance between initial diagnosis and centralized expert review in a population-based study within three European regions. Ann Oncol. 2012;23:2442–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Thway K, Fisher C. Histopathological Diagnostic Discrepancies in Soft Tissue Tumours Referred to a Specialist Centre. Sarcoma. 2009;125:2926–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Lyu HG, Haider AH, Landman AB, Raut CP. The opportunities and shortcomings of using big data and national databases for sarcoma research. Cancer. 2019;125:2926–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Lyu HG, Stein LA, Saadat LV, Phicil SN, Haider A, Raut CP. Assessment of the accuracy of disease coding among patients diagnosed with sarcoma. JAMA Oncol. 2018;4:1293–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Mertens F, Antonescu CR, Mitelman F. Gene fusions in soft tissue tumors: Recurrent and overlapping pathogenetic themes. Genes Chromosom Cancer. 2016;55:291–310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Krieger N, Williams DR, Moss NE. Measuring Social Class in US Public Health Research: Concepts, Methodologies, and Guidelines. Annu Rev Public Health. 1997;18:341–78. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
