Abstract
Background
The primary objective of this study is to identify novel copy number variations (CNVs) associated with familial ankylosing spondylitis (AS). A customized genome-wide microarray was designed to detect CNVs and applied to a multiplex AS family with six (6) affected family members. CNVs were detected using the built-in DNA analytics aberration detection method-2 (ADM-2) algorithm. Gene enrichment analysis was performed to observe the segregation. Subsequent validation was performed using real time quantitative fluorescence polymerase reaction (QF-PCR). The frequency of copy number variation for the UGT2B17 gene was then performed on two well-defined AS cohorts. Fisher exact test was performed to quantify the association.
Results
Our family-based analysis revealed ten gene-enriched CNVs that segregate with all six family members affected with AS. Based on the proposed function and the polymorphic nature of the UGT2B17 gene, the UGT2B17 gene CNV was selected for validation using real time QF-PCR with full concordance. The frequency of two copies of the UGT2B17 gene CNV was 0.41 in the Newfoundland AS cases and 0.35 in the Newfoundland controls (OR = 1.26(0.99-1.59); p < 0.05)), whereas the frequency of two (2) copies of the UGT2B17 gene CNV was 0.40 in the Alberta AS cases and 0.39 in the Alberta controls (OR = 1.05(95% CI: 0.83-1.33); p < 0.71)).
Conclusions
A genome-wide microarray interrogation of a large multiplex AS family revealed segregation of the UGT2B17 gene CNV among all affected family members. The association of the UGT2B17 CNV with AS is particularly interesting given the recent association of this CNV with osteoporosis and the proposed function as it encodes a key enzyme that inhibits androgens. However, two copies of the UGT2B17 gene CNV were only marginally significant in a uniplex AS cohort from Newfoundland but not in a uniplex AS cohort from Alberta.
Keywords: Copy number variants (CNV), UGT2B17, Ankylosing spondylitis, Complex autoimmune disease
Background
Ankylosing spondylitis (AS) is the prototypic inflammatory spondyloarthritis with a peak onset between 20 to 30 years [1,2]. Extra-articular features of AS include inflammation of the eyes, skin, bowels, and more rarely the lungs and heart. There is a male preponderance as men are affected 2 to 3 times more frequently than females [3,4].
Genetic factors are of major importance in susceptibility to AS and, in fact, genetic epidemiological studies suggest that AS represents one of the most heritable complex autoimmune diseases with a heritability greater than 90% and a sibling recurrence ratio of at least 52 [2-4]. HLA-B*27, which was first recognized to be associated with AS in 1973, remains the strongest genetic association signal with AS, and it is estimated that HLA-B*27 accounts for 23% of the genetic heritability [2-4]. Despite this strong association, only a small fraction (1-5%) of HLA-B*27 positive individuals develop AS [2-4]. Recent genome-wide association studies (GWAS) in several European populations and in the Han Chinese population have identified up to 15 high priority genes including IL23R, RUNX3, KIF21B, 2p15, IL1R2, PTGER4, ERAP1, IL12B, CARD9, TNFR1/ LTBR, TBKBP1 [5]. These associations were primarily reported based on single nucleotide polymorphisms (SNP) analysis. Recently, an association study revealed the complex interactions between the non-HLA gene, ERAP1, with the HLA-B*27 gene [5,6]. However, the genetic risk described by HLA or non-HLA genes suggests that other genomic variants may contribute to the risk factor for AS.
The premise that CNVs contribute to disease pathogenesis is supported by their capacity to disrupt gene expression and to interrupt functional pathways [7]. In recent years, many CNVs have been associated with complex diseases and the majority of these associations were in autoimmune-mediated diseases, including rheumatoid arthritis, psoriasis, Crohn’s disease, and systemic lupus erythematous [8-11]. Here, we report the first validated CNV associated specifically with AS using a custom genome-wide microarray in a well-defined multiplex AS family.
Results
Multiplex family
The quality control (QC) measures for the custom microarray chip were very good as all samples exhibited <0.25 DLRS. Approximately 1700 CNVs were detected in each individual sample using our custom microarray. Segregated gene-centric CNV analysis (i.e., a CNV that consists of or overlaps with a gene) revealed that 56 CNVs are enriched in affected family members (at least three) and absent in the unaffected family members. Among these 56 CNVs, we have identified ten (9) gene copy number variation regions (CNVRs) that segregate with at least six (6) affected family members (Additional file 1: Table S1). Our microarray analysis revealed multiple duplications within the UGT2B17 gene region that segregate in the affected family members that is absent in unaffected family members. One 7 kb duplication (chr4:69423002–69430016) disrupts exon 3 of the UGT2B17 gene and segregates with six AS affected family members (Figure 1). The breakpoint encompassing the UGT2B17 gene region was covered with probes with 280 bp spacing, providing high resolution to detect genomic aberrations. Validation of this targeted region using real time QF-PCR demonstrated 100% concordance with the microarray analysis (Figure 2). The real time QF-PCR CNV call (>99% confidence) revealed that affected family members carried two (2) copies of the UGT2B17 gene, whereas unaffected members carried a single copy. Eight (8) of the ten (10) family members were HLA-B27 positive including and all six affected family members that carried two (2) copies of the UGT2B17 gene.
Disease prevalence (case control studies)
In the case–control cohorts, real time QF-PCR analysis targeting the identical CNVR (from family analysis) revealed the presence of three (3) alleles (0 copy - homozygous deletion, 1 copy - heterozygous deletion, and 2 copies) within this population for the UGT2B17 gene.
In the Newfoundland population, the frequency of the three alleles was 0.13, 0.51, and 0.35 for 0, 1 and 2 copies in controls, respectively. The frequency of two (2) copies of the UGT2B17 gene CNV was 0.41 in the Newfoundland AS cases and 0.35 in the Newfoundland controls (OR = 1.26 (95% CI:0.99-1.59); p = 0.05). The frequency of two (2) copies of the UGT2B17 gene in the Alberta population was 0.40 and 0.39 for AS cases and controls, respectively (OR = 1.05 (95% CI:0.83-1.33); p = 0.71) (Table 1). The association analysis for the combined cohort (Newfoundland and Alberta population) revealed two (2) copies of the UGT2B17 gene with a frequency of 0.40 and 0.37, for the cases and controls, respectively (association significance p = 0.09).
Table 1.
Population | Copy numbers | AS cases | Healthy controls | OR(95%CI) | P value |
---|---|---|---|---|---|
Newfoundland |
2 |
0.41 |
0.35 |
1.26(0.99-1.59) |
<0.05 |
298 cases |
|
|
|
|
|
299 controls |
|
|
|
|
|
Alberta |
2 |
0.40 |
0.39 |
1.05(0.83-1.33) |
<0.71 |
289 cases |
|
|
|
|
|
285 controls |
Discussion
Copy number variations are increasingly being recognized in complex autoimmune diseases as they are capable of altering gene dosage and consequently affect gene function [12]. The contribution of CNVs in AS pathogenesis remains to be systematically evaluated. The primary goal of this study was to identify functionally relevant CNVs segregating within a multiplex AS family and then subsequently determine the allele frequencies of CNVs in multiple case–control cohorts. The genome-wide CNV analysis performed in this study revealed that increased copy number of the UGT2B17 gene is a potential risk factor for AS.
Given that UGT2B17 is a functional gene encoding an enzyme that metabolizes steroid hormones including testosterone and selected xenobiotics [13-16], it represents an excellent candidate gene for susceptibility to AS. A gene-dosage effect is present as this CNV is associated with urine testosterone level, male insulin sensitivity, fat mass, and prostate-cancer risk (as summarized by Xue) [13]. Importantly, copies of the UGT2B17 gene have recently been associated with osteoporosis (including hip fracture) [14]. The dose of the UGT2B17 gene was significantly associated with bone mineral density, cortical thickness, and buckling ratio. These results support the role of UGT2B17 CNVs in the pathogenesis of osteoporosis. Although AS is characterized by new bone formation, osteoporosis is a well recognized complication of AS and the risk of clinical vertebral fractures is increased in AS patients [17,18]. This complexity with respect to genotype-phenotype correlation requires further investigation.
From an evolutionary perspective, the UGT2B17 gene in humans is highly stratified which renders it more likely to be associated with disease. The deletion allele (i.e., 0 copies) frequency varies significantly among populations. For example, the frequency of the deletion allele is approximately 0.15 in Europeans, ~0.22 in Africans, and ~0.80-0.90 in Asians [7,13]. In contrast, the frequency of two (2) copies of the UGT2B17 gene is rare in Asian populations (~0.01-0.03), ~0.30-0.40 (reports with varying frequencies) in Europeans, and ~0.60 in African populations.
In this study, two copies of the UGT2B17 gene clearly segregates within a large well-defined AS multiplex family. Given the nature and proposed function of the UGT2B17 gene [19,20], the UGT2B17 CNV may contribute to susceptibility for AS within this family. Conflicting results were obtained regarding the excess prevalence of this CNV in large uniplex AS cohorts. A trend was observed in the cohort from Newfoundland. However, this was not replicated in the Alberta cohort. In this study, the deletion (i.e., 0 copies) frequency of the UGT2B17 gene in the Newfoundland population is consistent with the Alberta population. There was a noticeable frequency difference between the Alberta (0.51) and Newfoundland (0.46) populations for a single copy of the UGT2B17 gene. This stratification may represent a contributing factor for the lack of association.
The analysis of CNVs within multiplex families provides a unique opportunity to detect variants that may be family-specific. There are multiple reports in other complex diseases where family specific CNVs manifest with the disease [21]. Although it is a difficult task to obtain more multi-generational pedigrees to follow up the segregation of this CNV, the findings observed within the investigated family suggests that segregation analysis into multiple multi-generation pedigrees is promising. Thus, it is important to assess other large multiplex families to determine if this CNV may be a cause of familial AS.
Conclusions
This is study applied a custom designed genome-wide microarray to investigate a large multiplex AS family revealed segregation of the UGT2B17 gene CNV among all affected family members. The association of the UGT2B17 CNV with AS is particularly interesting given the recent association of this CNV with osteoporosis and the proposed function as it encodes a key enzyme that inhibits androgens. This result provides a basis for further exploration on the role of CNV in AS pathogenesis. Further replication is required for the marginal significance of UGT2B17 association identified with AS cohort from Newfoundland.
Methods
Multiplex AS family
A multiplex AS family from Newfoundland was identified for this study. All members of the multiplex family were “native Newfoundlanders” of North European ancestry. Ten (10) members from this large family participated in the study. Each individual was assessed clinically including imaging studies. Six (6) of the ten (10) family members had AS, as defined by the 1984 modified New York criteria, while the remaining family members were not affected. Informed consent for participation in the study was obtained from participants or, where participants are children, a parent or guardian. After consent was obtained, all participating family members had their DNA collected. All samples involving human subjects reported in this manuscript have been approved by the Memorial University of Newfoundland Human Investigation Committee.
Case–control cohorts
DNA was obtained from two (2) well-defined AS cohorts from the Newfoundland and Alberta populations. All cases and controls were of North European ancestry. We assessed 298 AS cases satisfying the modified New York criteria and 299 ethnically-matched controls from a homogenous population of Newfoundland. The second cohort consisted of 289 AS cases satisfying the modified New York criteria and 285 ethnically-matched controls from Alberta.
Custom microarray
We designed a customized genome-wide microarray based on genomic hotspot breakpoints previously identified [22]. The targeted regions are prone to produce CNVs due to their structural architecture (i.e. presence of segmental duplication, dose sensitive genes etc.). The custom design tiling microarray consisted of 2 × 1 million probes covering the genome with a mean spacing of 280 bp. Prior to CNV analysis, QC measures were applied and the derivative of log ratio spread (DLRS) <0.25 was considered as threshold. CNVs were detected using the built-in Aberration Detection Method-2 (ADM-2) algorithm DNA Analytics v.4.0.85 (Agilent Technologies) using the following criteria: 1) at least 5 probes for a CNV call on GC-corrected intensity; 2) nested filter was set to 2; and 3) log intensity >0.25 for duplication and <−0.25 for deletion. A custom script was applied to detect gene-enriched CNVs (i.e., overlaps with or constitutes a gene) that segregate (at least three cases) within affected AS family members and absent in the unaffected family members.
QF-PCR
To validate the CNV encompassing the UGT2B17 (UDP glucuronosyltransferase 2 family, polypeptide B17) gene in the family members and in the case–control cohorts, copy number estimation of the UGT2B17 gene was performed using the Taqman copy number assay Hs03185327_cn (Life Technologies) using the manufacturer’s recommended protocol. The assay was performed in quadruplicate using 10ng genomic DNA for each sample in a 96-well plate. The 10 μl reaction mix consisted of 2 μl of 2× Taqman Genotyping Master Mix (Life Technologies), 0.5 μl of 20× copy number assay (described above), 0.5 μl of TaqMan RNAse P Copy Number Reference Assay (Life Technologies, part 4403326), 2 μl of water, and 2 μl of 5 ng/μl genomic DNA. Cycling conditions for the reaction were 95°C for 10 min, followed by 40 cycles of 95°C for 15 sec and 60°C for 1 min. Samples were analyzed using the ViiA™ 7 Real-Time PCR System (Life Technologies) and analyzed using CopyCaller Software (Life Technologies, PN 4412907). Three reference (calibrator) DNA HapMap samples (NA10851, NA15510 and NA07048 (Coriell Institute)) plus one non-template control were included with the test samples. Fisher exact test was performed to determine association of the UGT2B17 gene copy number with AS in our case–control data.
Competing interests
The authors have declared that no competing interests exist.
Authors’ contributions
MU, DO and PR conceived the problem and designed the project. MU implemented the methodology and carried out the analysis. WPM, RI, DG, AM, RY, FP, SH, and PR contributed on the patient recruitment. The manuscript was written by MU, DO, and PR. All authors read and approved the final manuscript.
Supplementary Material
Contributor Information
Mohammed Uddin, Email: z06mju@mun.ca.
Walter P Maksymowych, Email: walter.maksymowych@ualberta.ca.
Robert Inman, Email: robert.inman@uhn.on.ca.
Dafna Gladman, Email: dafna.gladman@utoronto.ca.
Alexandra Munn, Email: 8am7@queensu.ca.
Ramin Yazdani, Email: ramin.yazdani@easternhealth.ca.
Fawnda Pellett, Email: fpellett@uhnres.utoronto.ca.
Sean Hamilton, Email: sean.hamilton@easternhealth.ca.
Darren D O’Rielly, Email: darren.orielly@med.mun.ca.
Proton Rahman, Email: prahman@mun.ca.
Acknowledgements
We would like to thank Genome Quebec (GQ) for helping us on microarray array analysis and The Centre for Applied Genomics (TCAG) for QF-PCR validation. This work was funded by the Canadian Arthritis Society.
References
- Brown MA, Laval SH, Brophy S, Calin A. A Recurrence risk modelling of the genetic susceptibility to ankylosing spondylitis. Ann. 2000;59:883–886. doi: 10.1136/ard.59.11.883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown MA, Kennedy LG, MacGregor AJ, Darke C, Duncan E, Shatford JL, Taylor A, Calin A, Wordsworth P. Susceptibility to ankylosing spondylitis in twins: the role of genes, HLA, and the environment. Arthritis Rheum. 1997;40:1823–1828. doi: 10.1002/art.1780401015. [DOI] [PubMed] [Google Scholar]
- Calin A, Marder A, Becks E, Burns T. Genetic differences between B27 positive patients with ankylosing spondylitis and B27 positive healthy controls. Arthritis Rheum. 1983;26(12):1460–1464. doi: 10.1002/art.1780261207. [DOI] [PubMed] [Google Scholar]
- Linden SVD, Valkenburg H, Cats A. The risk of developing ankylosing spondylitis in HLA-B27 positive individuals: a family and population study. Br. 1983;22:18–19. doi: 10.1093/rheumatology/22.1.18. [DOI] [PubMed] [Google Scholar]
- International Genetics of Ankylosing Spondylitis Consortium (IGAS) Identification of multiple risk variants for ankylosing spondylitis through high-density genotyping of immune-related loci. Nat Genet. 2013;45:730–738. doi: 10.1038/ng.2667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reveille JD, Sims AM, Danoy P, Evans DM, Leo P, Pointon JJ, Jin R, Zhou X, Bradbury LA, Appleton LH, Davis JC, Diekman L, Doan T, Dowling A, Duan R, Duncan EL, Farrar C, Hadler J, Harvey D, Karaderi T, Mogg R, Pomeroy E, Pryce K, Taylor J, Savage L, Deloukas P, Kumanduri V, Peltonen L, Ring SM, Whittaker P, Glazov E, Thomas GP, Maksymowych WP, Inman RD, Ward MM, Stone MA, Weisman MH, Wordsworth BP, Brown MA. Australo-Anglo-American Spondyloarthritis Consortium (TASC) Genome-wide association study of ankylosing spondylitis identifies non-MHC susceptibility loci. Nat. 2010;42:123–127. doi: 10.1038/ng.513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Conrad DF, Pinto D, Redon R, Feuk L, Gokcumen O, Zhang Y, Aerts J, Andrews TD, Barnes C, Campbell P, Fitzgerald T, Hu M, Ihm CH, Kristiansson K, Macarthur DG, Macdonald JR, Onyiah I, Pang AW, Robson S, Stirrups K, Valsesia A, Walter K, Wei J, Tyler-Smith C, Carter NP, Lee C, Scherer SW, Hurles ME. Wellcome Trust Case Control Consortium. Origins and functional impact of copy number variation in the human genome. Nature. 2010;464(7289):704–712. doi: 10.1038/nature08516. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McKinney C, Fanciulli M, Merriman ME, Phipps-Green A, Alizadeh BZ, Koeleman BP, Dalbeth N, Gow PJ, Harrison AA, Highton J, Jones PB, Stamp LK, Steer S, Barrera P, Coenen MJ, Franke B, van Riel PL, Vyse TJ, Aitman TJ, Radstake TR, Merriman TR. Association of variation in Fcgamma receptor 3B gene copy number with rheumatoid arthritis in Caucasian samples. Ann Rheum Dis. 2010;69(9):1711–1716. doi: 10.1136/ard.2009.123588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hollox EJ, Huffmeier U, Zeeuwen PL, Palla R, Lascorz J, Rodijk-Olthuis D, van de Kerkhof PC, Traupe H, de Jongh G, den Heijer M, Reis A, Armour JA, Schalkwijk J. Psoriasis is associated with increased β-defensin genomic copy number. Nat. 2008;40(1):23–25. doi: 10.1038/ng.2007.48. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fellermann K, Stange DE, Schaeffeler E, Schmalzl H, Wehkamp J, Bevins CL, Reinisch W, Teml A, Schwab M, Lichter P, Radlwimmer B, Stange EF. A Chromosome 8 Gene-Cluster Polymorphism with Low Human Beta-Defensin 2 Gene Copy Number Predisposes to Crohn Disease of the Colon. Am. 2006;79(3):439–448. doi: 10.1086/505915. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Molokhia M, Fanciulli M, Petretto E, Patrick AL, McKeigue P, Roberts AL, Vyse TJ, Aitman TJ. FCGR3B copy number variation is associated with systemic lupus erythematosus risk in Afro-Caribbeans. Rheumatology. 2011;50(7):1206–1210. doi: 10.1093/rheumatology/keq456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henrichsen CN, Chaignat E, Reymond A. Copy number variants, diseases and gene expression. Hum. 2009;18:R1–R8. doi: 10.1093/hmg/ddp011. [DOI] [PubMed] [Google Scholar]
- Xue Y, Sun D, Daly A, Yang F, Zhou X, Zhao M, Huang N, Zerjal T, Lee C, Carter NP, Hurles ME, Tyler-Smith C. Adaptive Evolution of UGT2B17 Copy-Number Variation. Am. 2008;83:337–346. doi: 10.1016/j.ajhg.2008.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang TL, Chen XD, Guo Y, Lei SF, Wang JT, Zhou Q, Pan F, Chen Y, Zhang ZX, Dong SS, Xu XH, Yan H, Liu X, Qiu C, Zhu XZ, Chen T, Li M, Zhang H, Zhang L, Drees BM, Hamilton JJ, Papasian CJ, Recker RR, Song XP, Cheng J, Deng HW. Genome-wide Copy-Number-Variation Study Identified a Susceptibility Gene, UGT2B17, for Osteoporosis. Am. 2008;83(6):663–674. doi: 10.1016/j.ajhg.2008.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karypidis AH, Olsson M, Andersson SO, Rane A, Ekström L. Deletion polymorphism of the UGT2B17 gene is associated with increased risk for prostate cancer and correlated to gene expression in the prostate. Pharmacogenomics J. 2008;8(2):147–151. doi: 10.1038/sj.tpj.6500449. [DOI] [PubMed] [Google Scholar]
- Olsson M, Lindström S, Häggkvist B, Adami HO, Bälter K, Stattin P, Ask B, Rane A, Ekström L, Grönberg H. The UGT2B17 Gene Deletion is Not Associated with Prostate Cancer Risk. Prostate. 2008;68(5):571–575. doi: 10.1002/pros.20700. [DOI] [PubMed] [Google Scholar]
- Cooper C, Carbone L, Michet CJ, Atkinson EJ, O'Fallon WM, Melton LJ 3rd. Fracture risk in patients with ankylosing spondylitis: a population based study. J Rheumatol. 1994;21(10):1877–1882. [PubMed] [Google Scholar]
- Vosse D, Landewé R, van der Heijde D, van der Linden S, van Staa TP, Geusens P. Ankylosing spondylitis and the risk of fracture: results from a large primary care-based nested case–control study. Ann Rheum Dis. 2009;68(12):1839–1842. doi: 10.1136/ard.2008.100503. [DOI] [PubMed] [Google Scholar]
- Schulze JJ, Lundmark J, Garle M, Skilving I, Ekström L, Rane A. Doping test results dependent on genotype of uridine diphospho-glucuronosyl transferase 2B17, the major enzyme for testosterone glucuronidation. J. 2008;93:2500–2506. doi: 10.1210/jc.2008-0218. [DOI] [PubMed] [Google Scholar]
- Clarke BL, Khosla S. Androgens and Bone. Steroids. 2009;74:296–305. doi: 10.1016/j.steroids.2008.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cook EH Jr, Scherer SW. Copy-number variations associated with neuropsychiatric conditions. Nature. 2008;455(7215):919–923. doi: 10.1038/nature07458. [DOI] [PubMed] [Google Scholar]
- Uddin M, Sturge M, Peddle L, O'Rielly DD, Rahman P. Genome-Wide Signatures of ‘Rearrangement Hotspots’ within Segmental Duplications in Humans. PLoS ONE. 2011;6:e28853. doi: 10.1371/journal.pone.0028853. 10.1371. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.