An official website of the United States government
Here's how you know
Official websites use .gov
A
.gov website belongs to an official
government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you've safely
connected to the .gov website. Share sensitive
information only on official, secure websites.
As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsement of, or agreement with,
the contents by NLM or the National Institutes of Health.
Learn more:
PMC Disclaimer
|
PMC Copyright Notice
. Author manuscript; available in PMC: 2017 Feb 11.
Published in final edited form as: Nature. 2016 Aug 11;536(7615):205–209. doi: 10.1038/nature19075
Emergence of a Homo sapiens-specific gene family and chromosome 16p11.2 CNV susceptibility
1Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
2Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
3Laboratory of Genetics, The Salk Institute for Biological Studies, 10010 North Torrey Pines Road, La Jolla, CA 92037, USA
4Department of Chemistry, University of Florence, Via della Lastruccia 3, 50019 Sesto Fiorentino, Florence, Italy
5Magnetic Resonance Center CERM, University of Florence, Via Luigi Sacconi 6, 50019, Sesto Fiorentino, Florence, Italy
7Department of Biology, University of Bari, Bari, Italy
8Howard Hughes Medical Institute, Seattle, WA 98195, USA
9Benaroya Research Institute at Virginia Mason, Seattle, WA 98101, USA
10Center for Academic Research and Training in Anthropogeny (CARTA), 9500 Gilman Drive, La Jolla, CA 92093, USA
✉
Corresponding authors: Evan E. Eichler, Ph.D., University of Washington School of Medicine, Howard Hughes Medical Institute, 3720 15th Ave NE, S413C, Box 355065, Seattle, WA 98195-5065, Phone: (206) 543-9526, eee@gs.washington.edu, Alexandre Reymond, Ph.D., Center for Integrative Genomics, University of Lausanne, Genopode Building, CH-1015 Lausanne, Switzerland., Phone: +41 21 692 3961, alexandre.reymond@unil.ch
6
Present address: Program in Computational and Systems Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
The publisher's version of this article is available at Nature
Abstract
Genetic differences that specify unique aspects of human evolution have typically been identified by comparative analyses between the genomes of humans and closely related primates1, including more recently the genomes of archaic hominins2,3. Not all regions of the genome, however, are equally amenable to such study. Recurrent copy number variation (CNV) at chromosome 16p11.2 accounts for ~1% of autism cases4,5 and is mediated by a complex set of segmental duplications, many of which arose recently during human evolution. We reconstructed the evolutionary history of the locus and identified BOLA2 (bolA family member 2) as a gene duplicated exclusively in Homo sapiens. We estimate that a 95 kbp segment containing BOLA2 duplicated across the critical region ~282 thousand years ago (kya), one of the latest among a series of genomic changes that dramatically restructured the locus during hominid evolution. All humans examined carry one or more copies of the duplication, which nearly fixed early in the human lineage—a pattern unlikely to have arisen so rapidly in the absence of selection (p < 0.0097). We show that the duplication of BOLA2 led to a novel, human-specific in-frame fusion transcript and that BOLA2 copy number correlates with both RNA expression (r = 0.36) and protein level (r = 0.65), with the greatest expression difference between human and chimpanzee in experimentally derived stem cells. Analyses of 152 patients carrying a chromosome 16p11.2 rearrangement showed that >96% of breakpoints occur within the Homo sapiens-specific duplication. In summary, the duplicative transposition of BOLA2 at the root of the Homo sapiens lineage ~282 kya simultaneously increased copy number of a gene associated with iron homeostasis and predisposed our species to recurrent rearrangements associated with disease.
To reconstruct the evolutionary history of the chromosome 16p11.2 region, we generated complete, reference-quality genome sequence6 (Table S1) for one orangutan, two chimpanzee and three human haplotypes (Fig. 1a and Extended Data Fig. 1). Comparison with mouse establishes the orangutan configuration as ancestral. In both humans and chimpanzees, the region has been independently restructured, nearly doubling in length primarily by the differential accumulation of segmental duplications (Fig. 1a and Extended Data Fig. 1a). We find six inversions have occurred in the African great apes within chromosome 16p11.2 (Extended Data Figs. 2–4 and Tables S2, S3), a nonrandom clustering (p < 1 × 10−6), with breakpoints mapping near an ~20 kbp LCR16a (low copy repeat 16a) core duplicon. The core encodes a positively selected gene family (NPIP) that emerged on the human-African great ape lineage7. Only within the human lineage do large (>100 kbp) segmental duplications exist in a direct orientation flanking the autism critical region at breakpoint regions BP4 and BP5 (Extended Data Fig. 5a and Table S4)8, implying that susceptibility to large-scale CNV associated with disease4,5,9 arose specifically within the human species.
Structural differences between human haplotypes are largely restricted to integral changes in the copy number of a 102 kbp block within both the proximal and distal breakpoint regions (Extended Data Fig. 1b). This block is composed of two different segmental duplications originating from chromosome 16: a 72 kbp segment duplicated from chromosome 16p12.1 carrying NPIP and a portion of the SMG1 serine-threonine kinase gene (SMG1P) and a 30 kbp segment carrying three intact genes: BOLA2, SLX1 and SULT1A3 (Fig. 1a and Extended Data Fig. 1b). More than one dozen large-scale structural changes, including six duplicative transpositions (>830 kbp) from elsewhere on chromosome 16, are required to reconcile the organization of human and chimpanzee chromosome 16p11.2 (Extended Data Figs. 3, 4 and Table S3). Assuming a human–chimpanzee divergence time of 6 million years ago (mya)10 and a constant substitution rate, we estimate that a 95 kbp segment including BOLA2 duplicated across the critical region ~282 kya (95% confidence interval: 361–209 kya), around the time when Homo sapiens emerged as a species11 (Fig. 1b, Fig. 2a, Extended Data Fig. 6, and Tables S5–S7).
We examined copy number diversity12 of the duplicated genes mapping to the 102 kbp cassette—BOLA2, SLX1, and SULT1A3—in humans, archaic humans, and apes (Fig. 2b–c, Extended Data Fig. 7, and Tables S8–S10). We found that BOLA2 is duplicated in all Homo sapiens individuals examined, including archaic representatives of Neolithic and Mesolithic populations13, as well as the oldest sequenced archaic human, Ust’-Ishim, estimated to have lived 45 kya14. In sharp contrast, BOLA2 is single copy (i.e., diploid copy number = 2) in nonhuman primates and the archaic hominins Neanderthal2 and Denisova3 (Fig. 2b–c and Table S8), consistent with our phylogenetic point estimate of the duplication age. Human genomes contain from 3 to 8 diploid BOLA2 copies, with at least one copy of the distal duplicate BOLA2B (range = 1–4; mean and median = 2 copies) and at least two copies of the proximal ancestral BOLA2A (range = 2–5 copies; mean and median = 4 copies, Fig. 2c and Table S8).
In light of its recent origin and its potential to promote disease-causing rearrangement, we considered it remarkable that 99.8% of humans carry four or more copies of this segment. Ancient humans such as Ust’-Ishim as well as some of the oldest branches of modern humans (e.g., San and Biaka pygmy15) typically carry five or six copies, indicating that it spread rapidly early in human history. We modeled various evolutionary scenarios by simulation based on the observed genotypes and a realistic model of human demographic history (Extended Data Fig. 8a), assuming neutral evolution16–18. The observed genotypes or genotypes with higher BOLA2B frequencies only in humans were improbable (p < 0.0097, Extended Data Fig. 8b), even when the duplication age parameter was varied by an order of magnitude. Scenarios incorporating recurrent duplication were also deemed unlikely (p < 0.0062). We next implemented a model incorporating the 282 kya age estimate but varying the selection coefficient (s) as an input parameter, yielding a maximum likelihood estimate of s = 0.0015 (Extended Data Fig. 8c). Interestingly, the unique ~550 kbp critical region flanked by BOLA2 duplications showed signatures consistent with a region under positive selection: the absence of archaic introgression19, low diversity (bottom 2.7%) and an excess of rare variants (Extended Data Fig. 8d–e).
Because humans show extensive copy number variation, we assessed whether copy number correlated with mRNA and protein levels. We found a significant correlation between BOLA2 copy number and expression at the RNA level based on analysis of 366 lymphoblastoid cell lines (LCLs)20 (r = 0.36, p = 2.09 × 10−12, Fig. 3a and Tables S11, S12) and at the protein level based on analysis of whole-protein lysates from 34 LCLs (r = 0.64, p = 4.34 × 10−5, Fig. 3b and Tables S13, S14).
We also performed RT-PCR and identified an alternate gene structure composed of the first two exons from BOLA2 joined with three novel 3′ exons from an older segmental duplication containing SMG1P (Fig. 3c). This fusion isoform contains an open reading frame (ORF) predicted to encode a 217 residue protein including 53 residues from BOLA2 and 164 residues from SMG1P. Both canonical and fusion transcripts are co-expressed in a wide variety of tissues and developmental stages (Extended Data Fig. 9). Although the predicted fusion protein cannot be detected by existing antibodies, it is interesting that ribosome profiling data provide evidence that the mRNA is translated (Table S15). Importantly, since the ancestral BOLA2 at BP5 lacked the SMG1P duplication downstream, the origin of the fusion product must have coincided with the juxtaposition of BOLA2 and SMG1P by the tandem 102 kbp segmental duplication ~650–300 kya at BP5. We conclude that this fusion isoform is Homo sapiens-specific.
BOLA2 was previously identified as one of the top 50 genes differentially expressed between humans and nonhuman apes in induced pluripotent stem cells (iPSCs)21, implying that this gene might be particularly relevant early in development. Based on our characterization of the different BOLA2 isoforms, we revisited this observation by quantifying BOLA2 mRNA levels by RNA-seq in human and chimpanzee iPSCs, iPSC-derived neural progenitor cells (NPCs), and eight-week-old neurons. Remarkably, we found the greatest differences in canonical BOLA2 expression at the iPSC state (2-fold) and to a lesser extent in NPCs (1.5-fold) (Fig. 3d and Table S16). Quantification of BOLA2 expression in two primary human embryonic stem cell (ESC) lines revealed transcript levels comparable to human iPSCs (Fig. 3d and Table S16). In contrast, examination of a panel of adult tissues22 revealed no substantial differences in BOLA2 mRNA levels between human and chimpanzee (Extended Data Fig. 9d). As expected, expression of the fusion BOLA2-SMG1P transcript was detected exclusively in human.
The duplication of BOLA2 across the critical region expanded by threefold the size of flanking high-identity, directly oriented sequence blocks (Extended Data Fig. 5a–b and Tables S4, S17, S18), theoretically predisposing the locus to recurrent CNV via unequal crossover (Extended Data Fig. 5c) specifically in the human lineage. To test this, we refined breakpoint locations in autism and developmental delay patients carrying either the chromosome 16p11.2 microduplication or microdeletion event23. Using whole-genome sequence (WGS) data and a molecular inversion probe (MIP) assay24, we localized breakpoints in 152 patients corresponding to 105 independent rearrangement events (Fig. 4a, Extended Data Fig. 10, and Table S19). We found 96% (101 of 105) of the disease-causing rearrangement breakpoints map within the Homo sapiens-specific duplication containing BOLA2 (Fig. 4b). Thus, the expansion of this segment rendered the chromosome 16p11.2 locus susceptible to recurrent rearrangement.
In summary, the level of genetic difference between humans and chimpanzees for chromosome 16p11.2 stands in sharp contrast to the oft-quoted 99% genetic identity between the species. The region has undergone extensive inversion and duplication, including a 95 kbp segment containing BOLA2 that duplicated after our divergence with ancient hominins. This event contributes more derived sequence specific to Homo sapiens than 35,500 previously reported human-specific single-nucleotide variants and indels combined2. The rapid rise and dispersal of this duplicated segment at the root of Homo sapiens (~282 kya) are unlikely to have occurred under neutral evolution but rather are consistent with modest positive selection (s = 0.0015). The estimated strength of selection on the BOLA2 duplication is an order of magnitude weaker than what is typically observed for recent positive selection (such as the emergence of lactase persistence ~10 kya25) but an order of magnitude stronger than nearly neutral mutations. Remarkably, the BOLA2 duplication rapidly rose to high frequency in humans despite predisposing our species to recurrent CNV associated with disease. The expansion of this segment resulted in the formation of a novel fusion transcript and dramatic BOLA2 expression differences between chimpanzee and human iPSCs. Although the phenotypic consequences of increased BOLA2 expression and the novel fusion transcript await future in vivo characterization, it is known that BOLA2 physically interacts in a heterotrimeric complex with GLRX3 (glutaredoxin 3)26. This complex is conserved from prokaryotes to humans27 and was shown to have a role in iron sensing in yeast28. In vertebrates, BOLA2 has been hypothesized to play important roles in iron regulation29 and iron-sulfur protein biogenesis30. We speculate that the expansion of this conserved gene may enhance iron utilization and homeostasis, especially during human embryonic development.
Single-molecule, real-time (SMRT) sequencing was used to generate high-quality sequence6 from bacterial artificial chromosome (BAC) clones obtained from genomic libraries. Clone sequences were assembled using HGAP and error-corrected using Quiver31. Contig assembly was performed using Sequencher (Gene Codes Corporation, Ann Arbor, MI) and validated by FISH. Copy number genotyping of genes and segmental duplications was performed using a read-depth method12 and WGS data from humans32,33, nonhuman primates34, and archaic genomes2,3,13,14, as well as single molecule MIPs (smMIPs)35 targeted to paralogous sequence variants24. We estimated evolutionary timing of segmental duplication events based on comparative sequencing and phylogenetic analyses (neighbor-joining method), adjusting branch lengths for trees that failed the Tajima’s relative rate test and assuming divergence times of 6 mya (human–chimpanzee)10 and 15 mya (human–orangutan). Evolutionary conservation analysis of BOLA2 was performed by maximum likelihood (PAML). Likelihoods of BOLA2B fixation under different scenarios were assessed using the coalescent simulators ms17 and msms18, adapting a previously published demographic model16. BOLA2 copy number estimates were correlated (Pearson’s r) using RNA-seq quantifications20 (PEER-normalized RPKM) and Western blot BOLA2 densities in human LCLs grown in complete RPMI medium and lysed in RIPA buffer. After SDS-PAGE and transfer to PVDF membrane, blots were incubated with an anti-BOLA2 antibody (Santa Cruz Biotechnology, Dallas, TX) and an anti-actin antibody (Sigma) for normalization purposes. Band densities were quantified using the Bio1D software. BOLA2 coding DNA sequence (CDS) was cloned using the Gateway system (Invitrogen, Carlsbad, CA). HeLa cells were transfected with cytomegalovirus-BOLA2 CDS (both 10 and 17 kDa forms) and analyzed by Western blotting. BOLA2 gene models were established via RT-PCR, cloning, and capillary sequencing. RNA-seq data were generated from previously described ESC and iPSC lines21, as well as iPSC lines differentiated into NPCs and neurons. BOLA2 mRNA expression was quantified in transcripts per million (TPM) with Kallisto36 (version 0.42.1) using a custom catalog of transcripts including all human RefSeq transcripts with the three BOLA2 isoforms. Breakpoints of chromosome 16p11.2 rearrangements were refined using Illumina whole-genome shotgun sequencing37,38 and smMIP analysis24,35,37 of patient DNA obtained from the Simons Variation in Individuals Project (Simons VIP)23 and Simons Simplex Collection (SSC)39. All procedures for clinical assessment and blood extraction were approved by the institutional review boards (IRBs) of participating institutions, and informed consent was obtained for participation in this research.
We thank families at the participating Simons VIP and SSC sites, as well as the Simons VIP Consortium. Approved researchers can obtain the Simons VIP dataset, the SSC dataset, and/or biospecimens by applying at https://base.sfari.org. We thank M. Chaisson for SMRT WGS data, B. Vernot for archaic introgression data, B.J. Nelson and K. Munson for technical assistance, M.L. Gage for editorial comments, and T. Brown for assistance with manuscript preparation. This work was supported by the Paul G. Allen Foundation (grant #11631 to E.E.E.), the Simons Foundation Autism Research Initiative (SFARI #303241 to E.E.E. and #274424 to A.R.), the U.S. National Institutes of Health (NIH grant 2R01HG002385 to E.E.E.), the Swiss National Science Foundation (31003A_160203 and CRSII33-133044 to A.R.), and funds from NIH TR01 MH095741, the Helmsley Charitable Fund, the Mathers Foundation, and the JPB Foundation (to F.H.G.). X.N. was supported by a U.S. National Science Foundation Graduate Research Fellowship under grant #DGE-1256082. G.G. was awarded a Pro-Women Scholarship from the Faculty of Biology and Medicine, University of Lausanne. M.H.D. is supported by U.S. National Institute of Mental Health grant 1F30MH105055-01. L.B. is supported by the EC grant N653706, project iNEXT. S.C.B. and F.C. were supported by Ente Cassa di Risparmio grant (ID no. 2013/7201). E.E.E. is an investigator of the Howard Hughes Medical Institute. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
X.N., G.G., M.H.D., A. Reymond, and E.E.E. designed the study. X.N., G.G., M.H.D., M.M., J.H., L.D., L.H., C. Baker, A. Raja, and K.P. contributed to sequencing and assembly of haplotypes. X.N. developed the evolutionary model, with input from G.G. P.H.S. genotyped aggregate copy number from WGS data. X.N. and M.H.D. performed MIP experiments and analyzed WGS data to genotype paralog-specific copy number and refine rearrangement breakpoints. N.J. performed massively parallel sequencing. J.G.S., M.H.D., and X.N. performed population genetic simulations, with input from J.M.A. G.G. analyzed RNA-seq data from LCLs, performed Western blots, and assessed the correlation of expression with copy number. I.N., C. Benner, and M.C.N.M. performed and analyzed RNA-seq experiments over in vitro differentiation of experimentally derived primate stem cells, with supervision from F.H.G. O.P., G.G., and X.N. analyzed RNA-seq data from different human and nonhuman primate tissues. J.H. performed inversion density simulations using data provided by F.A. and M.V. G.C. and F.A. performed FISH experiments. F.C., S.C.B., H.A.F.S., and L.B. performed functional experiments and provided insights into potential effects of increased BOLA2 dosage. W.J.T. and C.T.A. constructed a BAC library. X.N. and E.E.E. wrote the paper, with input and approval from all coauthors.
Author Information
Clone sequences, haplotype contig sequences, MIP data, and RNA-seq data for NPCs and neurons are available at NCBI BioProject (accession number PRJNA325679 in the NCBI BioProject database, https://www.ncbi.nlm.nih.gov/bioproject/). Patient WGS and MIP data are available at SFARI base (accession numbers SFARI_SVIP_WGS_1 and SFARI_SVIP_MIPS_1). Reprints and permission information is available at www.nature.com/reprints. The authors declare competing financial interests: details accompany the full-text HTML version of the paper at www.nature.com/nature.
COMPETING FINANCIAL INTERESTS
E.E.E. is on the scientific advisory board (SAB) of DNAnexus, Inc., and is a consultant for the Kunming University of Science and Technology (KUST) as part of the 1000 China Talent Program.
References
1.King MC, Wilson AC. Evolution at two levels in humans and chimpanzees. Science (New York, NY) 1975;188:107–116. doi: 10.1126/science.1090005. [DOI] [PubMed] [Google Scholar]
2.Prufer K, et al. The complete genome sequence of a Neanderthal from the Altai Mountains. Nature. 2014;505:43–49. doi: 10.1038/nature12886. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Meyer M, et al. A high-coverage genome sequence from an archaic Denisovan individual. Science (New York, NY) 2012;338:222–226. doi: 10.1126/science.1224344. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Weiss LA, et al. Association between microdeletion and microduplication at 16p11.2 and autism. The New England journal of medicine. 2008;358:667–675. doi: 10.1056/NEJMoa075974. [DOI] [PubMed] [Google Scholar]
5.Kumar RA, et al. Recurrent 16p11.2 microdeletions in autism. Human molecular genetics. 2008;17:628–638. doi: 10.1093/hmg/ddm376. [DOI] [PubMed] [Google Scholar]
6.Huddleston J, et al. Reconstructing complex regions of genomes using long-read sequencing technology. Genome research. 2014;24:688–696. doi: 10.1101/gr.168450.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Johnson ME, et al. Positive selection of a gene family during the emergence of humans and African apes. Nature. 2001;413:514–519. doi: 10.1038/35097067. [DOI] [PubMed] [Google Scholar]
8.Zufferey F, et al. A 600 kb deletion syndrome at 16p11.2 leads to energy imbalance and neuropsychiatric disorders. Journal of medical genetics. 2012;49:660–668. doi: 10.1136/jmedgenet-2012-101203. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Jacquemont S, et al. Mirror extreme BMI phenotypes associated with gene dosage at the chromosome 16p11.2 locus. Nature. 2011;478:97–102. doi: 10.1038/nature10406. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Patterson N, Richter DJ, Gnerre S, Lander ES, Reich D. Genetic evidence for complex speciation of humans and chimpanzees. Nature. 2006;441:1103–1108. doi: 10.1038/nature04789. [DOI] [PubMed] [Google Scholar]
11.Auton A, et al. A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Sudmant PH, et al. Diversity of human copy number variation and multicopy genes. Science (New York, NY) 2010;330:641–646. doi: 10.1126/science.1197005. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Lazaridis I, et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature. 2014;513:409–413. doi: 10.1038/nature13673. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Fu Q, et al. Genome sequence of a 45,000-year-old modern human from western Siberia. Nature. 2014;514:445–449. doi: 10.1038/nature13810. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Tishkoff SA, et al. The genetic structure and history of Africans and African Americans. Science (New York, NY) 2009;324:1035–1044. doi: 10.1126/science.1172257. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Yang MA, Harris K, Slatkin M. The projection of a test genome onto a reference population and applications to humans and archaic hominins. Genetics. 2014;198:1655–1670. doi: 10.1534/genetics.112.145359. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Hudson RR. Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics (Oxford, England) 2002;18:337–338. doi: 10.1093/bioinformatics/18.2.337. [DOI] [PubMed] [Google Scholar]
18.Ewing G, Hermisson J. MSMS: a coalescent simulation program including recombination, demographic structure and selection at a single locus. Bioinformatics (Oxford, England) 2010;26:2064–2065. doi: 10.1093/bioinformatics/btq322. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Vernot B, et al. Excavating Neandertal and Denisovan DNA from the genomes of Melanesian individuals. Science (New York, NY) 2016;352:235–239. doi: 10.1126/science.aad9416. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Lappalainen T, et al. Transcriptome and genome sequencing uncovers functional variation in humans. Nature. 2013;501:506–511. doi: 10.1038/nature12531. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Marchetto MC, et al. Differential L1 regulation in pluripotent stem cells of humans and apes. Nature. 2013;503:525–529. doi: 10.1038/nature12686. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Brawand D, et al. The evolution of gene expression levels in mammalian organs. Nature. 2011;478:343–348. doi: 10.1038/nature10532. [DOI] [PubMed] [Google Scholar]
23.Simons Variation in Individuals Project (Simons VIP): a genetics-first approach to studying autism spectrum and related neurodevelopmental disorders. Neuron. 2012;73:1063–1067. doi: 10.1016/j.neuron.2012.02.014. [DOI] [PubMed] [Google Scholar]
24.Nuttle X, et al. Rapid and accurate large-scale genotyping of duplicated genes and discovery of interlocus gene conversions. Nature methods. 2013;10:903–909. doi: 10.1038/nmeth.2572. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Bersaglieri T, et al. Genetic signatures of strong recent positive selection at the lactase gene. American journal of human genetics. 2004;74:1111–1120. doi: 10.1086/421051. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Li H, Mapolelo DT, Randeniya S, Johnson MK, Outten CE. Human glutaredoxin 3 forms [2Fe-2S]-bridged complexes with human BolA2. Biochemistry. 2012;51:1687–1696. doi: 10.1021/bi2019089. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Li H, Outten CE. Monothiol CGFS glutaredoxins and BolA-like proteins: [2Fe-2S] binding partners in iron homeostasis. Biochemistry. 2012;51:4377–4389. doi: 10.1021/bi300393z. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Kumanovics A, et al. Identification of FRA1 and FRA2 as genes involved in regulating the yeast iron regulon in response to decreased mitochondrial iron-sulfur cluster synthesis. The Journal of biological chemistry. 2008;283:10276–10286. doi: 10.1074/jbc.M801160200. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Haunhorst P, et al. Crucial function of vertebrate glutaredoxin 3 (PICOT) in iron homeostasis and hemoglobin maturation. Molecular biology of the cell. 2013;24:1895–1903. doi: 10.1091/mbc.E12-09-0648. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Banci L, Camponeschi F, Ciofi-Baffoni S, Muzzioli R. Elucidating the Molecular Function of Human BOLA2 in GRX3-Dependent Anamorsin Maturation Pathway. Journal of the American Chemical Society. 2015;137:16133–16143. doi: 10.1021/jacs.5b10592. [DOI] [PubMed] [Google Scholar]
31.Chin CS, et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nature methods. 2013;10:563–569. doi: 10.1038/nmeth.2474. [DOI] [PubMed] [Google Scholar]
32.Abecasis GR, et al. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56–65. doi: 10.1038/nature11632. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Sudmant PH, et al. Global diversity, population stratification, and selection of human copy-number variation. Science (New York, NY) 2015;349:aab3761. doi: 10.1126/science.aab3761. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Prado-Martinez J, et al. Great ape genetic diversity and population history. Nature. 2013;499:471–475. doi: 10.1038/nature12228. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Hiatt JB, Pritchard CC, Salipante SJ, O'Roak BJ, Shendure J. Single molecule molecular inversion probes for targeted, high-accuracy detection of low-frequency variation. Genome research. 2013;23:843–854. doi: 10.1101/gr.147686.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Bray N, Pimentel H, Melsted P, Pachter L. Near-optimal RNA-Seq quantification. 2015 doi: 10.1038/nbt.3519. arXiv, 1505.02710. [DOI] [PubMed] [Google Scholar]
37.Nuttle X, Itsara A, Shendure J, Eichler EE. Resolving genomic disorder-associated breakpoints within segmental DNA duplications using massively parallel sequencing. Nature protocols. 2014;9:1496–1513. doi: 10.1038/nprot.2014.096. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Fischbach GD, Lord C. The Simons Simplex Collection: a resource for identification of autism genetic risk factors. Neuron. 2010;68:192–195. doi: 10.1016/j.neuron.2010.10.006. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.