Abstract
The effect of Copy Number Variants (CNVs) on Type 2 Diabetes (T2D) remains little explored. The present study characterized large rare CNVs in 686 T2D and 194 non-T2D subjects of Mexican ancestry genotyped using the Affymetrix Genome-Wide Human SNP array 5.0. Rare CNVs with ≥ 100 kb length were identified using a stringent strategy based on merging CNVs calls generated using Birdsuit, iPattern and PennCNV algorithms. We applied three different strategies to evaluate the distribution of CNVs in the T2D and non-T2D samples: 1) Burden analysis, 2) Identification of CNVs in loci previously associated to T2D, and 3) Identification of CNVs observed only in the T2D group. In the CNV burden analysis, the T2D group showed a higher proportion of CNVs, and also a higher proportion of CNVs overlapping at least one gene than the non T2D group. Five of the six loci previously associated with T2D had duplications or deletions in the T2D sample, but not the non-T2D sample. A gene-set analysis including genes with CNVs observed only in the T2D group highlighted gene-sets related with sensory perception (olfactory receptors, OR) and phenylpyruvate tautomerase/dopachrome isomerase activity (MIF and DDT genes).
Introduction
Type 2 Diabetes (T2D) is a complex metabolic disorder that constitutes a worldwide public health problem1. The evidence of a strong hereditary component in T2D has led to numerous efforts to study the genetic factors underlying the disease, mainly through genome-wide association studies (GWAS) focused on common variants2–6.
Copy Number Variants (CNVs), defined as DNA deletions or duplications above 1 kb in length, represent a significant proportion of human genetic variation and have been linked to many complex human diseases, but their implication in T2D remains little explored. In particular, large CNVs may encompass many genes and/or regulatory sequences and could be of particular relevance for T2D because of their potential pathological effects7.
The current Mexican population is an admixed population with a genetic background derived from European, Native American and to lesser extent, West African populations8. Mexico has one of the highest prevalences of T2D in the world9. In this study, we characterized large rare CNVs in T2D and non-T2D individuals (controls) of Mexican ancestry that were genotyped with the Affymetrix 5.0 array10. We then applied three different strategies in order to evaluate the distribution of CNVs in the T2D and control samples: 1) Burden analysis, 2) Identification of CNVs in loci previously associated to T2D, and 3) Identification of CNVs observed only in the T2D group.
Results
Subjects
A total of 686 T2D and 194 non-T2D control subjects were analyzed. As outlined in Table 1, the T2D group had more females and showed higher body mass index (BMI) and triglycerides but lower Multidimensional Scaling (MDS) ancestry vector 1 values than the control group (e.g. lower European ancestry). No differences were observed in relation to CNV size and deletion/duplication distribution among comparison groups [Supplementary Figure 4].
Table 1.
Characteristic | T2D group | Control group | P value |
---|---|---|---|
N | 686 | 194 | |
General characteristics | |||
Male/female sex | 205/481 | 89/105 | <0.001* |
Age, years | 49 ± 12 | 50 ± 7 | 0.086 |
BMI, kg/m2 | 29.2 ± 6.4 | 27.4 ± 4.3 | <0.001* |
Total cholesterol, mg/dL | 210.5 ± 57 | 205.5 ± 50 | 0.119 |
Triglycerides, mg/dL | 197.5 ± 133 | 133 ± 63 | <0.001* |
MDS1 ancestry vector | −0.0047 ± 0.0291 | 0.0072 ± 0.0355 | <0.001* |
MDS2 ancestry vector | 0.0018 ± 0.0067 | 0.0018 ± 0.0072 | 0.653 |
MDS3 ancestry vector | 0.0009 ± 0.0055 | 0.0001 ± 0.0061 | 0.094 |
Rare CNVs | |||
Number of Rare CNVs | 526 | 131 | |
Deletions | 189 (35.9) | 37 (28.2) | |
Duplications | 337 (64.1) | 94 (71.8) | 0.097 |
CNVs size in Mb | 214.5 ± 230.3 | 208.2 ± 245.9 | 0.759 |
Deletions | 171.3 ± 164.8 | 156.6 ± 87.4 | 0.335 |
Duplications | 241.3 ± 255 | 254.1 ± 354.1 | 0.543 |
Genic Rare CNVs | |||
Number of Genic Rare CNVs | 358 | 83 | |
Deletions | 86 (24) | 17 (20.5) | |
Duplications | 272 (76) | 66 (79.5) | 0.492 |
CNVs size in Mb | 233.7 ± 245.3 | 257.7 ± 351.1 | 0.282 |
Deletions | 181.7 ± 219.2 | 160.5 ± 78 | 0.552 |
Duplications | 246.1 ± 244.6 | 311.6 ± 354.9 | 0.119 |
Data presented as counts (percentage) or median ± interquartile range.
P value between T2D and control groups by X2 and Mann–Whitney U test (numerical variables were not normally distributed).
T2D: type 2 diabetes; MDS1: Multidimensional scaling; BMI: Body Mass Index (kg/m2) CNVs: Copy number variations. *P < 0.05.
Rare CNV burden analysis
In the first comparative analysis [Fig. 1], the rare CNV burden analysis, the T2D group showed a higher CNV deletion rate (0.25 vs 0.19, corrected p-value = 0.029) and a higher proportion of rare genic CNVs (e.g. CNVs overlapping with at least one gene) than the control group (0.42 vs 0.31, corrected p-value = 0.044). We did not observe significant differences in CNV rate, genic CNV rate, CNV sample proportion, total CNV size and genic CNV enrichment between the T2D and control groups [Table 2]. In the genic CNV regional analysis, ZNF718 CNVs were found inversely associated with T2D (p = 0.048 for Duplication + Deletions). However, it is important to note that all the CNVs identified in this gene were present only in the control group [Table 3].
Table 2.
Rare CNV burden analyzes | T2D group | Control group | Both groups | T2D/ Control ratio | Raw P-value | Corrected P-value |
---|---|---|---|---|---|---|
CNV rate 1 | ||||||
Deletions | 0.25 | 0.19 | 0.26 | 1.30 | 0.127 | 0.029* |
Duplications | 0.51 | 0.48 | 0.49 | 1.04 | 0.437 | 0.475 |
Deletions + Duplications | 0.75 | 0.68 | 0.75 | 1.11 | 0.21 | 0.116 |
CNV sample proportion 2 | ||||||
Deletions | 0.22 | 0.18 | 0.22 | 1.20 | 0.187 | 0.097 |
Duplications | 0.40 | 0.37 | 0.37 | 1.08 | 0.319 | 0.533 |
Deletions + Duplications | 0.55 | 0.48 | 0.51 | 1.13 | 0.112 | 0.269 |
Total CNV size 3 | ||||||
Deletions | 291.9 | 271.1 | 299.30 | 1.08 | 0.386 | 0.258 |
Duplications | 419.2 | 484.0 | 444.90 | 0.87 | 0.816 | 0.85 |
Deletions + Duplications | 420.2 | 466.5 | 449.40 | 0.90 | 0.811 | 0.699 |
Average CNV size 4 | ||||||
Deletions | 258.70 | 263.90 | 259.10 | 0.98 | 0.499 | 0.574 |
Duplications | 341.70 | 360.50 | 336.60 | 0.95 | 0.642 | 0.811 |
Deletions + Duplications | 306.20 | 338.00 | 311.10 | 0.91 | 0.779 | 0.871 |
Genic CNV rate 5 | ||||||
Deletions | 0.32 | 0.16 | 0.27 | 2.03 | 0.079 | 0.073 |
Duplications | 1.25 | 1.20 | 1.14 | 1.04 | 0.467 | 0.641 |
Deletions + Duplications | 1.57 | 1.36 | 1.41 | 1.16 | 0.304 | 0.424 |
Genic CNV proportion 6 | ||||||
Deletions | 0.11 | 0.08 | 0.11 | 1.38 | 0.176 | 0.116 |
Duplications | 0.34 | 0.26 | 0.30 | 1.32 | 0.065 | 0.107 |
Deletions + Duplications | 0.42 | 0.31 | 0.37 | 1.33 | 0.027* | 0.044* |
Genic CNV enrichment 7 | ||||||
Deletions | 0.005 | 0.005 | 0.005 | 0.94 | 0.51 | 0.545 |
Duplications | 0.009 | 0.008 | 0.009 | 1.23 | 0.195 | 0.176 |
Deletions + Duplications | 0.008 | 0.007 | 0.007 | 1.17 | 0.231 | 0.298 |
1Number of rare CNVs per sample.
2Proportion of samples with one or more rare CNVs.
3Total rare CNV kb length spanned.
4Average rare CNV size.
5Number of regions/genes spanned by rare CNVs.
6Proportion of rare CNVs with at least one gene.
7Number of regions/genes per total rare CNVs kb size.
T2D: type 2 diabetes; CNVs: Copy number variations; kb kilobases.
P-values were estimated by permutation (one-sided, 10,000 permutations). Corrected P-values were adjusted by sex, body mass index, triglycerides and ancestry (MDS1). *P < 0.05.
Table 3.
Genic region | Location | Coordinates 1 | Size (Mb) | DGV frequency (%) 2 | CNV type | T2D group (n = 686) | Control group (n = 194) |
---|---|---|---|---|---|---|---|
CNV regions associated with T2D | |||||||
ZNF718 | 4p16.3 | chr4:43,276-146,490 | 103.2 | 0.0252 | Deletions | 0 (0) | 1 (0.52) |
Duplications | 0 (0) | 1 (0.52) | |||||
Deletions + Duplications* | 0 (0) | 2 (1.03) | |||||
Rare CNVs in diabetes candidate genes | |||||||
AMY2B | 1p21.1 | chr1:103,898,844-103,923,672 | 24.8 | 0.0486 | Duplication | 1 (0.15) | 0 (0) |
LPP | 3q28 | chr3:189,413,414-190,080,135 | 666.7 | 0.0026 | Duplication | 1 (0.15) | 0 (0) |
ARL15 | 5p15.2 | chr5:53,216,370-53,642,160 | 425.8 | 0.0430 | Duplication | 0 (0) | 1 (0.52) |
HFE | 6p22.1 | chr6:26,195,487-26,203,448 | 8.0 | 0.0087 | Deletion | 1 (0.15) | 0 (0) |
CDKAL1 | 6p22.3 | chr6:20,642,666-21,339,743 | 697.1 | 0.0000 | Deletion | 1 (0.15) | 0 (0) |
RASGRP1 | 15q14 | chr15:36,567,593-36,644,299 | 76.7 | 0.0339 | Duplication | 1 (0.15) | 0 (0) |
1Coordinates according NCBI v36, hg18.
2Frequency according to the Database of Genomic Variants (DGV).
T2D: type 2 diabetes; CNVs: Copy number variations; kb kilobases; NA: Not available.
P-values were estimated by exact Fisher test. Corrected P-values by sex, body mass index, triglycerides and ancestry (MDS1) were not available. *P < 0.05.
CNVs in candidate genes
We evaluated if rare CNVs in our samples overlapped with a list of 129 loci previously described as being implicated in T2D [Supplementary Table 3]. In the T2D group, we found duplications in the genes AMY2B, LPP and RASGRP1 and deletions in the HFE and CDKAL1 genes. In the control group, we found a deletion in the ARL15 gene [Table 3].
CNVs seen in the T2D group but not in the control group
We observed a total of 76 genic CNVs in the T2D group that were not seen in the control group and also had frequencies below 0.2% in the Database of Genomic Variants (DGV). These CNVs overlapped with 123 genes, 59 of which are listed in the Online Mendelian Inheritance in Man database (www.omim.org ) [Supplementary Table 4].
Gene-sets analysis
A total of 130 genes were included in the gene-set analysis (one gene identified in the genic regional analysis, 6 genes from the candidate gene analysis, and 123 genes from the analysis of CNVs that are present in the T2D group but not in the control group). The gene-set analysis identified 27 gene-sets. Most of the gene-sets related to sensory perception (olfactory receptors –OR-). Two of the gene-sets pointed to phenylpyruvate tautomerase/dopachrome isomerase activity. Duplications are the most common type of CNV observed in these gene-sets [Fig. 2, Supplementary Table 5].
Discussion
T2D is a complex disorder caused by genetic and environmental factors. The high concordance between twins and the disparities in rates of development of T2D from children of parents with and without the diabetes show the influence of genetic factors in the occurrence the disease11. GWAS have emerged as the most successful methodology to discover genetic variants associated with complex diseases4–6.
CNVs represent a significant proportion of human genetic variation. It has been estimated that 79.55% of the genome contains CNVs (http://dgv.tcag.ca/dgv/app/home). In the last few years, there has been an increased interest in the evaluation of the contribution of rare CNVs to complex diseases7,12–14. This type of genomic variation could be relevant for disease risk due to dosage or disrupting effects. For example, genic duplications may create imbalance due to the excess of products of duplicated genes7.
It is possible to characterize CNVs using microarray data. In this study, we characterized large rare CNVs in 686 T2D and 194 control subjects of Mexican ancestry genotyped using the Affymetrix Genome-Wide Human SNP array 5.0. In our comparative analyses, we found that the T2D group showed a higher proportion of CNVs overlapping with at least one gene than the control group. In our genic CNV regional analysis, we observed that CNVs (Duplications + Deletions) in the ZNF718 gene were associated to T2D status. However, all the duplications and deletions were observed in the control group, and this does not support a role of CNVs within this gene in the pathogenesis of T2D. We found CNVs in 6 loci previously associated with T2D (AMY2B, LPP, RASGRP1, HFE, CDKAL1, ARL15). With the exception of the deletion found in the ARL15 gene, all the other CNVs (duplications or deletions) were observed in the T2D group. Finally, we identified a list of 130 genes for further analysis, based on three different strategies: 1) Genic CNV regional analysis, 2) Identification of CNVs overlapping with previously reported T2D genes, and 3) Genic CNVs observed in the T2D but not in the control group. A gene-set analysis including these 130 genes identified 27 gene-sets. Most of the aforementioned gene-sets included OR genes. The remaining gene-sets included the genes MIF and DDT, which are involved in Dopachrome/phenylpyruvate isomerase activity.
Loss of smell has been associated with reductions in fat mass and insulin resistance15. However, it is important to note that the OR gene family is the largest vertebrate gene family and several studies have shown that OR loci are enriched in CNVs because they are often located in segmentally duplicated regions16–20. For example, in a high-resolution study using oligonucleotide tiling microarrays, Hasin et al.16 analyzed 851 OR gene and pseudogene loci, and reported the presence of CNVs in 93 OR genes and 153 pseudogenes. Our gene-set results have been primarily driven by the inclusion of genes with CNVs identified in the T2D group but absent in the control group. It is possible that the enrichment in OR genes observed in our analysis is the result of ORs being prone to the presence of CNVs, in combination with the unbalanced number of T2D and control samples in our study, instead of a functional effect of CNVs located within OR genes on T2D risk. Another gene-set identified in our analysis includes the gene MIF, which codes for the cytokine macrophage migration inhibitory factor that has been previously related with adipose tissue inflammatory processes and with T2D21,22. It would be important to carry out additional research to evaluate if CNVs affecting this gene may play a role in T2D risk.
Similarly to a recent study in Mexican Americans23, we found that duplications are on average larger in size than deletions. However, in contrast to Blackburn’s study, in which deletions were more frequent than duplications (deletion/duplication ratio between 1.56 to 2.75), in our study we observed a larger number of duplications than deletions (deletion/duplication ratio between 0.25 to 0.56). These differences may be due to the type of microarray assay, the CNV calling algorithms and/or the CNV length and selection protocols used in both studies. Our study also found that the T2D group had a higher CNV deletion rate in comparison with the control group. Our CNV deletion and duplication distribution was quite similar to that described in the T2D and control cohorts from the Welcome Trust Case-Control Consortium study24.
The contemporary Mexican population is the result of an admixture process that involved Native American, European and African populations25–28. The average ancestral contributions of this sample have been estimated as 60%, 35% and 5% respectively25. It is well known that the presence of population stratification can inflate the rate of false positives in association studies in admixed populations. One strategy to minimize this problem was to include ancestry proportions as covariates in the statistical analyses. In our study, we used the coordinates of the first MDS axis, which reflects the relative ancestral contributions from European and Native American populations. In addition to the MDS1 ancestry vector, the rare CNV burden analysis was also adjusted by sex, BMI, and triglycerides, given that significant differences for these variables were observed between the T2D and control groups in the initial comparative analysis. Unfortunately, due the small counts, the aforementioned adjustment was not possible neither for ZNF718 gene nor for the diabetes candidate gene analysis.
The lack of detailed clinical information like blood glucose and hemoglobin A1c concentrations, the relatively high percentage of samples dropped during the QC protocol, the limited sample size and the unbalanced number of T2D subjects and controls are the main limitations of our study. With respect to the number of samples removed, Grassi29 reported that 12.8% of the samples genotyped with the Affymetrix Genome-Wide Human SNP array 5.0 were eliminated using only two quality control criteria (gender incompatibilities, and genotyping rate <90%). The higher percentage of samples removed in our study (32.8%) could be explained by the rigorous four-step quality control analysis used to maximize our confidence in the CNV calls.
Despite the limited sample size and the unbalanced number of T2D and control samples, in a post-hoc analysis, the power to detect overall genic CNV proportion differences and ZNF718 gene differences among comparison groups were 0.799 and 0.646 respectively. However, the statistical power decrease to 0.483 when there was a CNV in a control subject but not in T2D and 0.003 when there was a CNV in a T2D subject but not in controls.
Conclusions
In this study conducted in Mexico we found that the T2D group showed a higher proportion of CNVs than the control sample, and also a higher proportion of CNVs overlapping at least one gene than the control group. We found duplications or deletions in six genes that have been previously associated with type 2 diabetes. Finally, gene-sets related with sensory perception (olfactory receptors) and phenylpyruvate tautomerase/dopachrome isomerase activity were enriched in the T2D group.
Methods
Study design and population
As outlined below, we analyzed a total of 686 T2D and 194 non-T2D control subjects selected from the initial sample comprised 1,310 individuals (967 T2D subjects and 343 controls) that were previously included in a GWAS for T2D10. Briefly, the T2D sample included subjects previously diagnosed according to the American Diabetes Association (ADA) criteria30 recruited from Family Medicine Clinics located at the southern part of Mexico City. The control group included healthy blood donors without family history of T2D and with normal glucose tolerance tests, according to the ADA criteria, that were recruited from the Central Blood Bank of the National Medical Center “Siglo XXI”. Both groups were sampled between 2000 and 2005. Information on sex, age, BMI, total cholesterol and triglycerides was available for all the samples. Ancestry proportions were estimated using a genome-wide panel of 3,500 Ancestry Informative Markers (AIMs) with the program ADMIXMAP [http://homepages.ed.ac.uk/pmckeigu/admixmap/index.html] and also a MDS approach based on genome-wide data for six parental population samples (Nahua and Maya from Mexico, Andeans and Aymara from South America, Spanish, CEPH European and Yoruba) and the Mexico City sample.
SNP genotyping
Genotyping was carried out at the Centre for Applied Genomics (Toronto, ON) using the Affymetrix Genome-Wide Human SNP array 5.0 (Affymetrix, Santa Clara, CA) and following standard protocols.
Quality control analysis and CNV analysis
A four-step quality control (QC) procedure was applied to the initial set of 1,310 samples to ensure that ascertainment of CNVs was consistent between comparison groups. Rare CNVs with ≥100 kb length were identified using a stringent strategy based on merging CNVs calls generated using Birdsuite (version 1.5.3)31, iPattern32,33 and PennCNV34 algorithms. At the end of the QC analysis, the dataset comprised 880 individuals (686 T2D subjects and 194 control subjects) and 657 rare large CNVs. [Supplementary Information, Supplementary Figures 1–3 and Supplementary Tables 1 and 2].
Comparative analysis of general characteristics
Comparisons of sex, age, BMI, total cholesterol, triglycerides, ancestry vectors, CNV size and deletion/duplications proportions between the T2D and control groups were carried out with the X2 and Mann–Whitney U tests.
Rare CNV burden analysis
The CNV burden analysis was carried out using PLINK v 1.07 (http://zzz.bwh.harvard.edu/plink/), considering two categories: rare CNVs and genic rare CNVs (e.g. CNVs that overlap with at least one gene). The global CNV burden in the T2D group compared to the control group was tested using seven measures: 1) CNV rate (number of CNVs per sample), 2) CNV sample proportion (proportion of samples with one or more CNVs), 3) total CNV size, 4) average CNV size, 5) genic CNV rate (number of regions/genes spanned by rare CNVs), 6) genic CNV proportion (proportion of rare CNVs overlapping at least one gene) and 7) genic enrichment (number of regions/genes per total CNVs kb size). Statistical significance was evaluated using an adaptive permutation procedure for one-sided tests (i.e., hypothesizing that T2D subjects will show greater burden of rare CNVs than controls). For each of 10,000 permutations, samples were randomly reassigned either T2D or control status. Rare measures were also adjusted by significant variables in the comparative group analysis: sex, BMI, triglycerides and ancestry (coordinates of the first MDS axis, which explains the relative proportion of Native American and European ancestry in the individuals analyzed). CNVs were considered rare if they were found at a frequency of <1% in the Mexican control sample set (n = 194), and showed an overlap of less than 50% of their length with other CNVs or segmental duplications found at a frequency >1%. Rare CNVs were classified as genic based on RefSeq annotations (UCSC, v. March 2006, NCBI v36, hg18).
In a second comparative analysis of burden of rare CNVs (genic regional analysis), we tested for specific genic regions associated with T2D, using Fisher’s exact test to assess for differences between the T2D and the control groups.
CNVs in candidate genes and CNVs seen in the T2D group but not in the control group
After the rare burden analysis and the genic regional analysis, we looked for CNVs present in a list of 129 loci previously described as being implicated in T2D [Supplementary Table 3], and we also looked for CNVs present in T2D subjects but not in the control group.
Gene-sets analysis
A final analysis was carried out to find gene-sets that may be implicated in T2D. Here we considered the results of the genic regional analysis, the analysis of CNVs in candidate genes and the analysis of genic CNVs seen in the T2D group but not in the control group. Enrichment Map plugin v2.2.1 (http://baderlab.org/Software/EnrichmentMap) and Cytoscape software v3.5.1 (http://cytoscape.org/) were used to draw a map of the gene-sets network, in which the nodes represented the gene-sets and the edges corresponded to the overlap between sets.
Ethics Statement
The research was approved by the National Ethical Committee of the Mexican Institute of Social Security (register number 2008-785-073) and by The Ethics Review Office at the University of Toronto. Written informed consent was obtained from each participant. All methods were performed in accordance with the ethical standards and regulations of the institutional research committees and national laws and with the 1964 Helsinki declaration and its later amendments.
Data Availability Statement
The datasets analysed during the current study are not publicly available due to ethical concerns, but are available from the corresponding author on reasonable request.
Electronic supplementary material
Acknowledgements
We would like to thank to Dr. Chao Lu, Dr. Bhooma Thiruvahindrapuram and Dr. Zhuozhi Wang from The Centre for Applied Genomics for the microarray experiments and the Birdsuite and iPattern data processing. We thanks to staff of Biochemistry Research Unit for their excellent technical support. The project was supported by CONACYT SALUD-2008-785-073. IJAM is a recipient of a Consejo Nacional de Ciencia y Tecnología (CONACyT, Mexico) grant 72694.
Author Contributions
M.C. conceived the study question. A.V., E.J.P. and S.W.S. contributed to the analysis plan. I.J.A.M. and D.P. carried out statistical analyses, interpreted the data and drafted the manuscript. All authors reviewed the manuscript
Competing Interests
The authors declare that they have no competing interests.
Footnotes
Electronic supplementary material
Supplementary information accompanies this paper at 10.1038/s41598-017-17361-7.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.World Health Organization. Diabetes Key facts. http://www.who.int/mediacentre/factsheets/fs312/en/ (2017).
- 2.Stumvoll M, Goldstein BJ, van Haeften TW. Type 2 diabetes: principles of pathogenesis and therapy. Lancet. 2005;365(9467):1333–1346. doi: 10.1016/S0140-6736(05)61032-X. [DOI] [PubMed] [Google Scholar]
- 3.Doria A, Patti ME, Kahn CR. The emerging genetic architecture of type 2 diabetes. Cell Metab. 2008;8(3):186–200. doi: 10.1016/j.cmet.2008.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.McCarthy MI, Zeggini E. Genome-wide association studies in type 2 diabetes. Curr Diab Rep. 2009;9(2):164–171. doi: 10.1007/s11892-009-0027-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.McCarthy MI. Genomics, type 2 diabetes, and obesity. N Engl J Med. 2010;363(24):2339–2350. doi: 10.1056/NEJMra0906948. [DOI] [PubMed] [Google Scholar]
- 6.Grotz AK, Gloyn AL, Thomsen SK. Prioritising Causal Genes at Type 2 Diabetes Risk Loci. Curr Diab Rep. 2017;17(9):76. doi: 10.1007/s11892-017-0907-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Lee C, Scherer SW. The clinical context of copy number variation in the human genome. Expert Rev Mol Med. 2010;12:e8. doi: 10.1017/S1462399410001390. [DOI] [PubMed] [Google Scholar]
- 8.Stavenhagen, R. & Carrasco, T. La diversidad étnica y cultural. In: Florescano E: El patrimonio nacional de México, 1st ed. (Consejo Nacional para la Cultura y las Artes) 249–280 (1997).
- 9.International Diabetes Federation. IDF Diabetes Atlas Seventh edition. http://www.diabetesatlas.org/ (2015).
- 10.Parra EJ, et al. Genome-wide association study of type 2 diabetes in a sample from Mexico City and a meta-analysis of a Mexican-American sample from Starr County, Texas. Diabetologia. 2011;54(8):2038–2046. doi: 10.1007/s00125-011-2172-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Barroso I. Genetics of Type 2 diabetes. Diabet Med. 2005;22(5):517–535. doi: 10.1111/j.1464-5491.2005.01550.x. [DOI] [PubMed] [Google Scholar]
- 12.Cook EH, Jr., Scherer SW. Copy-number variations associated with neuropsychiatric conditions. Nature. 2008;455(7215):919–923. doi: 10.1038/nature07458. [DOI] [PubMed] [Google Scholar]
- 13.de Cid R, et al. Deletion of the late cornified envelope LCE3B and LCE3C genes as a susceptibility factor for psoriasis. Nat Genet. 2009;41(2):211–215. doi: 10.1038/ng.313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Diskin SJ, et al. Copy number variation at 1q21.1 associated with neuroblastoma. Nature. 2009;459(7249):987–991. doi: 10.1038/nature08035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Riera CE, et al. The Sense of Smell Impacts Metabolic Health and Obesity. Cell Metab. 2017;26(1):198–211. doi: 10.1016/j.cmet.2017.06.015. [DOI] [PubMed] [Google Scholar]
- 16.Hasin Y, et al. High-resolution copy-number variation map reflects human olfactory receptor diversity and evolution. PLoS Genet. 2009;4(11):e1000249. doi: 10.1371/journal.pgen.1000249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Young JM, et al. Extensive copy-number variation of the human olfactory receptor gene family. Am J Hum Genet. 2008;83(2):228–242. doi: 10.1016/j.ajhg.2008.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Redon R, et al. Global variation in copy number in the human genome. Nature. 2006;444:444–454. doi: 10.1038/nature05329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Korbel JO, et al. Paired-End Mapping Reveals Extensive Structural Variation in the Human. Genome. Science. 2007;318:420–426. doi: 10.1126/science.1149504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Nozawa M, Kawahara Y, Nei M. Genomic drift and copy number variation of sensory receptor genes in humans. Proc Natl Acad Sci USA. 2007;104:20421–20426. doi: 10.1073/pnas.0709956104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kim BS, Pallua N, Bernhagen J, Bucala R. The macrophage migration inhibitory factor protein superfamily in obesity and wound repair. Exp Mol Med. 2015;47:e161. doi: 10.1038/emm.2015.26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Morrison MC, Kleemann R. Role of Macrophage Migration Inhibitory Factor in Obesity, Insulin Resistance, Type 2 Diabetes, and Associated Hepatic Co-Morbidities: A Comprehensive Review of Human and Rodent Studies. Front Immunol. 2015;6:308. doi: 10.3389/fimmu.2015.00308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Blackburn A, et al. Utilizing extended pedigree information for discovery and confirmation of copy number variable regions among Mexican Americans. Eur J Hum Genet. 2013;21:404–409. doi: 10.1038/ejhg.2012.188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Grozeva D, et al. Reduced burden of very large and rare CNVs in bipolar affective disorder. Bipolar Disord. 2013;15(8):893–898. doi: 10.1111/bdi.12125. [DOI] [PubMed] [Google Scholar]
- 25.Martinez-Marignac VL, et al. Admixture in Mexico City: implications for admixture mapping of type 2 diabetes genetic risk factors. Hum Genet. 2007;120(6):807–819. doi: 10.1007/s00439-006-0273-3. [DOI] [PubMed] [Google Scholar]
- 26.Martinez-Fierro ML, et al. Ancestry informative markers and admixture proportions in northeastern Mexico. J Hum Genet. 2009;54(9):504–509. doi: 10.1038/jhg.2009.65. [DOI] [PubMed] [Google Scholar]
- 27.Silva-Zolezzi I, et al. Analysis of genomic diversity in Mexican Mestizo populations to develop genomic medicine in Mexico. Proc Natl Acad Sci USA. 2009;106(21):8611–8616. doi: 10.1073/pnas.0903045106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Galanter JM, et al. Development of a panel of genome-wide ancestry informative markers to study admixture throughout the Americas. PLoS Genet. 2012;8(3):e1002554. doi: 10.1371/journal.pgen.1002554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Grassi MA, et al. Genome-wide meta-analysis for severe diabetic retinopathy. Hum Mol Genet. 2011;20(12):2472–2481. doi: 10.1093/hmg/ddr121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.American Diabetes Association. Standards of medical care in diabetes 2017. Diabetes Care. 40 (Suppl1): S1–S135 (2017).
- 31.Korn JM, et al. Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs. Nat Genet. 2008;40(10):1253–1260. doi: 10.1038/ng.237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Pinto D, et al. Functional impact of global rare copy number variation in autism spectrum disorders. Nature. 2010;466(7304):368–372. doi: 10.1038/nature09146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Pinto D, et al. Comprehensive assessment of array-based platforms and calling algorithms for detection of copy number variants. Nat Biotechnol. 2011;29(6):512–520. doi: 10.1038/nbt.1852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Wang K, et al. PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res. 2007;17(11):1665–1674. doi: 10.1101/gr.6861907. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets analysed during the current study are not publicly available due to ethical concerns, but are available from the corresponding author on reasonable request.