Abstract
We are jointly proposing a new name for a protein domain of approximately 65 amino acids that has been previously termed NBPF or DUF1220. Our two labs independently reported the initial studies of this domain, which is encoded almost entirely within a single gene family. The name Neuroblastoma Breakpoint Family (NBPF) was applied to this gene family when the first identified member of the family was found to be interrupted in an individual with neuroblastoma.
Prior to this discovery, the PFAM database had termed the domain DUF1220, denoting it as one of many protein domains of unknown function. PFAM intends “DUF” nomenclature to serve only as a temporary placeholder until more appropriate names are proposed based on research findings.
We believe that additional studies of this domain, primarily from our laboratories over the past 10 years, have advanced our understanding of these sequences to the point where a new name for the domain is warranted. Because considerable data link the domain to human-specific evolution, brain expansion and cognition, we believe a name reflecting these findings would be appropriate. With this in mind, we have chosen to name the domain (and the repeat that encodes it) Olduvai. The gene family will remain NBPF for now. The primary domain subtypes will retain their previously assigned names (e.g. CON1-3; HLS1-3), and the three-domain block that expanded dramatically in the human lineage will be termed the Olduvai triplet.
The new name refers to Olduvai Gorge, a site in East Africa that was the source of major anthropological discoveries in the early to mid-1900s. We also chose the name as a tribute to the scientists who made important contributions to the early studies of human origins and our African genesis.
Keywords: DUF1220, NBPF, protein domain, human brain evolution, copy number, gene duplication, genome evolution, Olduvai Gorge
Protein domains are portable units within proteins that can serve important biological functions. They have been implicated in a broad range of key biological phenomena, including development, disease and evolution 1, 2. Here we jointly propose a new name for a protein domain of approximately 65 amino acids that has been previously termed NBPF 3 or DUF1220 4. Our two labs independently reported the initial studies of this domain, which is encoded almost entirely within a single gene family. The name Neuroblastoma Breakpoint Family (NBPF) was applied to this gene family when the first identified member of the family was found to be interrupted in an individual with neuroblastoma 3, 5. Prior to this discovery, the PFAM database had termed the domain DUF1220, denoting it as one of many protein domains of unknown function 6. PFAM intends “DUF” nomenclature to serve only as a temporary placeholder until more appropriate names are proposed based on research findings. We believe that additional studies of this domain, primarily from our laboratories over the past 10 years, have advanced our understanding of these sequences to the point where a new name for the domain is warranted.
Key findings relevant to assigning a name to the domain are as follows:
1. The domain has been repeatedly linked with human-specific evolution. The haploid human genome is estimated to encode approximately 300 copies of the NBPF/DUF1220 domain, while the copy number for other species is substantially lower: great apes 90–120, monkeys 30–40, and all other mammalian species 1–9 7, 8. The increase in humans (at least 165 additional human copies) represents the largest human lineage-specific copy number increase of any coding region in the genome. These findings, involving the copy number of a protein coding domain, provide strong support for an involvement in human-specific evolution.
2. The domain has been linked with human brain evolution and cognitive function. Over the past 10 years we have published several papers on NBPF/DUF1220 protein domains and the NBPF gene family. These have implicated the copy number of the domain in human brain evolution 7, 9–11, brain size-related phenotypes 7, 9–11, brain disorders (autism/schizophrenia/micro- and macrocephaly) 9, 12–14, and measures of cognitive function 13, 15. Also, our finding of a robust linear association between NBPF/DUF1220 copy number and brain size across primate species was confirmed by an independent study 16.
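The copy-number comparison in the first finding can be made concrete with a little arithmetic. The following is only an illustrative sketch: the figures are the approximate haploid estimates quoted above, and the names `HUMAN_COPIES`, `OTHER_LINEAGES` and `extra_human_copies` are hypothetical, introduced here for the example.

```python
# Approximate haploid copy numbers quoted in the text, as (low, high) ranges.
# All names in this block are hypothetical and exist only for illustration.
HUMAN_COPIES = 300
OTHER_LINEAGES = {
    "great apes": (90, 120),
    "monkeys": (30, 40),
    "other mammals": (1, 9),
}


def extra_human_copies(human, other_range):
    """Return (min_extra, max_extra): how many more copies humans carry
    than a lineage whose copy number lies within other_range."""
    low, high = other_range
    return human - high, human - low


for lineage, copy_range in OTHER_LINEAGES.items():
    min_extra, max_extra = extra_human_copies(HUMAN_COPIES, copy_range)
    print(f"{lineage}: human excess of {min_extra} to {max_extra} copies")
```

With these rounded inputs, the human excess over even the highest great ape estimate is 180 copies, which is consistent with the "at least 165 additional human copies" stated above.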
Given the above research findings, a new name for this protein domain that reflects its link to human-specific evolution would be appropriate. We propose the name “Olduvai” (ohl’-du-vi), abbreviated as “ODV” when necessary. This name refers to Olduvai Gorge, which is located in the Rift Valley of Eastern Africa. Olduvai has been the site of key paleoanthropological discoveries related to human origins and has been called “the Cradle of Mankind” and “the Grand Canyon of Human Evolution” 17. Deposits at the gorge are estimated to cover a time span from 2.1 million to 15,000 years ago, and the fossil remains that have been identified there are thought to represent more than 60 hominins (members of the human lineage). These findings are believed to constitute the most continuous known record of human evolution over the past 2 million years, and the longest known archaeological record of the development of stone-tool industries. Olduvai Gorge was designated part of a UNESCO World Heritage site in 1979 18. Just as the protein domain appears to be important to human-specific evolution, so too has Olduvai Gorge provided key insights into human evolutionary origins. We believe both are central to the story of what made us human.
Finally, we have also chosen this name because it reflects an appreciation for the important contributions of the scientists who made major anthropological discoveries in Africa in the early to mid-20th century that stimulated further research into human origins and our African genesis.
While we believe that the domain and repeat should now be called “Olduvai”, we also propose that, for now, the gene family name should remain NBPF. In summary, the NBPF domain and DUF1220 domain will be termed the Olduvai domain, and the NBPF repeat and DUF1220 repeat will be termed the Olduvai repeat. The primary gene family that encodes these sequences will continue to be called NBPF, and the primary domain subtypes will retain established nomenclature (CON1-3, HLS1-3) 8. The three-domain block, composed of HLS1, HLS2 and HLS3 subtypes, that is tandemly repeated within several human NBPF genes and responsible for the great majority of additional human copies of the domain, will be called the Olduvai triplet.
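For groups updating annotation pipelines or databases, the renaming summarized above can be expressed as a simple lookup table. This is a minimal sketch, not an official mapping file: the names `LEGACY_TO_CURRENT`, `UNCHANGED`, `OLDUVAI_TRIPLET` and `current_name` are hypothetical, and the exact label strings used in any given database are an assumption.

```python
# Legacy labels mapped to the proposed nomenclature.
# All identifiers here are hypothetical and exist only for illustration.
LEGACY_TO_CURRENT = {
    "NBPF domain": "Olduvai domain",
    "DUF1220 domain": "Olduvai domain",
    "NBPF repeat": "Olduvai repeat",
    "DUF1220 repeat": "Olduvai repeat",
}

# Names that the proposal explicitly leaves unchanged:
# the gene family (NBPF) and the primary domain subtypes.
UNCHANGED = {"NBPF"} | {f"CON{i}" for i in (1, 2, 3)} | {f"HLS{i}" for i in (1, 2, 3)}

# The three-domain block that is tandemly repeated in several human NBPF genes.
OLDUVAI_TRIPLET = ("HLS1", "HLS2", "HLS3")


def current_name(term):
    """Return the proposed name for a legacy label.

    Retained names and labels not covered by the proposal are returned as-is.
    """
    if term in UNCHANGED:
        return term
    return LEGACY_TO_CURRENT.get(term, term)


print(current_name("DUF1220 domain"))
print(current_name("NBPF"))
```

Keeping the retained names in a separate set makes it explicit that the gene family and subtype labels are deliberately untouched rather than overlooked.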
The Olduvai domain hyper-amplification in the human lineage was one of the most extreme and rapid copy number expansions in the human genome, and we look forward to additional studies that may provide further insights into the role this protein domain family plays in human disease and evolution.
Acknowledgments
The authors thank Jonathon Davis, Veronica Searles Quick, Ilea Heft, Laura Dumas, Vanessa Andries and Karl Vandepoele for constructive discussions.
Funding Statement
JMS is funded by NIH R01 grant MH108684. FVR is supported by the Foundation Against Cancer – Belgium, the Research Foundation – Flanders (FWO-Vlaanderen), and the Queen Elisabeth Medical Foundation (G.S.K.E.), Belgium.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
[version 1; referees: 2 approved]
References
1. Mark M, Rijli FM, Chambon P: Homeobox genes in embryogenesis and pathogenesis. Pediatr Res. 1997;42(4):421–429. doi:10.1203/00006450-199710000-00001
2. Li W-H: Molecular Evolution. Sinauer Associates, Sunderland, Massachusetts; 1997.
3. Vandepoele K, Van Roy N, Staes K, et al.: A novel gene family NBPF: Intricate structure generated by gene duplications during primate evolution. Mol Biol Evol. 2005;22(11):2265–2274. doi:10.1093/molbev/msi222
4. Popesco MC, Maclaren EJ, Hopkins J, et al.: Human lineage-specific amplification, selection, and neuronal expression of DUF1220 domains. Science. 2006;313(5791):1304–1307. doi:10.1126/science.1127980
5. Vandepoele K, Andries V, Van Roy N, et al.: A constitutional translocation t(1;17)(p36.2;q11.2) in a neuroblastoma patient disrupts the human NBPF1 and ACCN1 genes. PLoS One. 2008;3(5):e2207. doi:10.1371/journal.pone.0002207
6. Bateman A, Coin L, Durbin R, et al.: The Pfam protein families database. Nucleic Acids Res. 2004;32(Database issue):D138–41. doi:10.1093/nar/gkh121
7. Dumas L, Sikela JM: DUF1220 domains, cognitive disease, and human brain evolution. Cold Spring Harb Symp Quant Biol. 2009;74:375–382. doi:10.1101/sqb.2009.74.025
8. O’Bleness MS, Dickens CM, Dumas LJ, et al.: Evolutionary history and genome organization of DUF1220 protein domains. G3 (Bethesda, Md). Genes/Genomes/Genetics. 2012;2(9):977–986. doi:10.1534/g3.112.003061
9. Dumas LJ, O’Bleness MS, Davis JM, et al.: DUF1220-domain copy number implicated in human brain-size pathology and evolution. Am J Hum Genet. 2012;91(3):444–454. doi:10.1016/j.ajhg.2012.07.016
10. Keeney JG, Dumas L, Sikela JM: The case for DUF1220 domain dosage as a primary contributor to anthropoid brain expansion. Front Hum Neurosci. 2014;8:427. doi:10.3389/fnhum.2014.00427
11. Keeney JG, Davis JM, Siegenthaler J, et al.: DUF1220 protein domains drive proliferation in human neural stem cells and are associated with increased cortical volume in anthropoid primates. Brain Struct Funct. 2015;220(5):3053–60. doi:10.1007/s00429-014-0814-9
12. Davis JM, Searles VB, Anderson N, et al.: DUF1220 dosage is linearly associated with increasing severity of the three primary symptoms of autism. PLoS Genet. 2014;10(3):e1004241. doi:10.1371/journal.pgen.1004241
13. Davis JM, Searles Quick VB, Sikela JM: Replicated linear association between DUF1220 copy number and severity of social impairment in autism. Hum Genet. 2015;134(6):569–575. doi:10.1007/s00439-015-1537-6
14. Searles Quick V, Davis JM, Olincy A, et al.: DUF1220 copy number is associated with schizophrenia risk and severity: Implications for understanding autism and schizophrenia as related diseases. Transl Psychiatry. 2015;5(12):e697. doi:10.1038/tp.2015.192
15. Davis JM, Searles VB, Anderson N, et al.: DUF1220 copy number is linearly associated with increased cognitive function as measured by total IQ and mathematical aptitude scores. Hum Genet. 2015;134(1):67–75. doi:10.1007/s00439-014-1489-2
16. Zimmer F, Montgomery SH: Phylogenetic analysis supports a link between DUF1220 domain number and primate brain expansion. Genome Biol Evol. 2015;7(8):2083–2088. doi:10.1093/gbe/evv122
17. Ardrey R: African Genesis. Collins, London, UK; 1961.
18. Olduvai Gorge. In: Encyclopedia Britannica.