Skip to main content
F1000Research logoLink to F1000Research
. 2016 Jan 20;4:474. Originally published 2015 Aug 5. [Version 2] doi: 10.12688/f1000research.6823.2

Possible identification of CENP-C in fish and the presence of the CENP-C motif in M18BP1 of vertebrates.

Leos Kral 1,a
PMCID: PMC4830207  PMID: 27127616

Version Changes

Revised. Amendments from Version 1

This update incorporates revisions suggested by the reviewer, includes common name categories of species names utilized in this study in the legend to figure 1 and makes a clear distinction between lobe-finned fishes in which CENP-C contains the "CENP-C motif" vs. non-lobe-finned fishes (specifically teleost fishes and shark) which lack an obvious "CENP-C motif".

Abstract

The centromeric protein CENP-C is a base component of the kinetochore. This protein, along with CENP-A has been shown to adaptively evolve in a number of animal and plant species. In order to determine if CENP-C also evolves in fish species, I attempted to retrieve fish CENP-C sequences from GenBank. No Teleostei CENP-C sequences were found either by name or by BLASTP searches with the vertebrate CENP-C motif sequence. A number of putative Teleostei protein sequences were identified in GenBank that have homology to the C-terminal cupin domain of vertebrate CENP-C. These proteins only have partial homology to the CENP-C motif, but evidence is presented that makes it likely that these fish proteins are orthologs of CENP-C. Interestingly, it was also discovered that the CENP-C motif sequence is also mostly present in M18BP1 proteins of fish and some other vertebrates but not in mammals. This finding may have implications for CENP-C and M18BP1 assembly in centromeric regions of different vertebrate taxa.

Keywords: CENP-C, M18BP1, centromeric proteins, teleostei, kinetochore, CENP-C motif, cupin domain protein

Introduction

The kinetochore is a structure that connects chromosomal centromeric DNA to microtubules during mitosis and meiosis 1. The centromere is epigenetically defined by the deposition of nucleosomes that contain the histone H3 variant CENP-A 2. Centromeric protein CENP-C is required for both the recruitment of new CENP-A to the centromeric region as well as the initial assembly of the kinetochore. The CENP-C protein is generally considered to be ubiquitous in all eukaryotic taxa since homologs of CENP-C have been identified in yeast 3 and Drosophila 4 as well as many plants and vertebrates 5. While CENP-C evolves so rapidly that very little homology is observed between distantly related taxa, a conserved CENP-C motif has been identified across all lineages studied 5. This conserved motif should, therefore, be of utility to identify CENP-C orthologs in other species.

CENP-A has been initially shown to evolve adaptively in Drosophila 6, in members of the Bressicaceae family 7 and more recently in primates 8 and in percid fishes 9, 10. CENP-C has also been shown to evolve adaptively in a number of animal and plant species 5 as well as in primates 8. In an effort to determine if CENP-C also evolves adaptively in fish species, searches were conducted in GenBank for Teleostei proteins that had been already identified as CENP-C or for genes that had been annotated as coding for CENP-C. No such teleost fish proteins or genes were found. BLASTP searches of just the Teleostei subset of GenBank were performed with the conserved vertebrate CENP-C motif and these too failed to find identified fish CENP-C proteins or genes. However, a set of orthologous C-terminal cupin domain containing genes have been identified in the elephant shark Callorhinchus milii and several teleost fish species that, while lacking most of the conserved CENP-C motif, have features that make these likely to be fish CENP-C orthologs.

Methods

Standard BLASTP searches were performed on the NCBI blast server. The vertebrate CENP-C motif NVRRTKRXRLKPLEYWRGERVBY used in BLASTP searches in this study was obtained from an alignment of 25 species including the lobe-finned fish Latimeria chalumnae, amphibians, reptiles, birds and mammals ( Supplementary File S1). Retrieved sequences were aligned with the MUSCLE alignment feature in Geneious (version 6.1) sequence analysis software.

Results and discussion

BLASTP searches with the vertebrate CENP-C motif identified CENP-C proteins from a variety of taxa, including plants, but did not identify any CENP-C in non-lobe-finned fish lineages. It is possible that CENP-C may be absent in ray-finned fish, but the ubiquity of this protein in other lineages and the central role of this protein in centromeric function make this unlikely. A C-terminal cupin domain protein encoded by a shark gene annotated in GenBank as CENP-C was used to identify homologs in Teleostei genomes by BLASTP. The retrieved teleost fish homologs were annotated as either calponin homology domain containing protein, neurofilament heavy polypeptide-like protein, or myb-like protein. The cupin domains of these proteins share significant homology to the cupin domain of other vertebrate CENP-C proteins ( Figure 1). Within vertebrate CENP-C proteins the RxxRxxxxPLxYWxGERxxY sequence defines identities within the CENP-C motif located within about 100 amino acids upstream of the cupin domain ( Figure 2). However, within the shark and teleost fish C-terminal cupin domain-containing protein sequences, only some of these CENP-C motif sequence identities were present ( Figure 3) and, therefore, unambiguous identity of these proteins as CENP-C was not obvious.

Figure 1. Homology of shark and teleost fish putative CENP-C cupin domains to the cupin domains of other vertebrate CENP-C proteins.

Figure 1.

Lobe-finned fish: Latimeria chalumnae. Amphibian: Xenopus laevis. Mammals: Homo sapiens, Equus caballus, Canis lupus familiaris. Reptile: Chelonia mydas. Bird: Gallus gallus. Shark: Callorhinchus milii. Teleost fish: Danio rerio, Oryzias latipes, Poecilia formosa, Poecilia reticulata, Maylandia zebra, Stegastes partitus.

Figure 2. Conserved domains and sequences in vertebrate CENP-C.

Figure 2.

( A) Diagram of human CENP-C. ( B) Amino acids that are identical in the CENP-C motif in vertebrates in which CENP-C has been identified. ( C) Conserved sequence in the CENP-C central region that is homologous to part of the CENP-C motif. Amino acid locations within human CENP-C protein of conserved sequences are indicated at the beginning and end of each sequence.

Figure 3. Conserved regions homologous to a portion of the CENP-C motif present in putative non-lobe-finned fish CENP-C.

Figure 3.

( A) Diagram of putative Stegastes partitus CENP-C. ( B) Alignment of fish central region sequences that contain the conserved RxxxxPxxWW sequence. ( C) Alignment of the fish cupin domain proximal sequences that contain the conserved RxxxxPxxWW sequence. Amino acids matching the conserved sequence identities are highlighted in red. Amino acid locations within each species’ protein are indicated at the beginning and end of each sequence.

In a recent study that examined the interaction between CENP-C conserved domains and CENP-A containing nucleosomes (or nucleosomes containing histone H3 modified with a CENP-A C-terminal tail), Kato et al. 11 identified within the conserved central region of CENP-C a RxSxxPSxWW consensus sequence ( Figure 2) that is similar to the core portion of the CENP-C motif. Mutations of the arginine to alanine or the tryptophans to alanine in this sequence prevented the binding of this central region to the nucleosomes. So, functionally, the RxxxxPxxWW portion of the central region sequence is important to centromeric binding of CENP-C. Furthermore, mutations of the arginine, tyrosine and tryptophan in the core CENP-C motif RxxxxPxxYW also reduce the binding affinity the CENP-C to the nucleosomes 11. A mutation of arginine to alanine in this core portion of the CENP-C motif was previously shown to prevent the binding of Xenopus CENP-C to centromeres 12.

An alignment of the putative shark and teleost fish CENP-C proteins identified two conserved regions that contained the RxxxxPxxWW sequences ( Figure 3). The placement of these sequences corresponds roughly to the locations of the central portion and the CENP-C motif of the vertebrate CENP-C ( Figure 2). Therefore, it is likely that the combination of the C-terminal cupin domain and the presence of these centromeric nucleosome binding regions in positions generally corresponding to the locations of the central region and the CENP-C motif identifies these teleost genes as possible CENP-C orthologs. It will be necessary, of course, to verify if this protein is actually found at non-lobe-finned fish centromeres. It should be noted, however, that the distance between the cupin domain and the “CENP-C motif” position is about twice as long in the putative teleost fish CENP-C in comparison to this distance in CENP-C of other vertebrates. It is interesting that the putative shark “CENP-C motif” location lacks the tryptophans of the RxxxxPxxWW sequence and that Poecilia reticulata has a replacement of the first tryptophan in the conserved central region sequence ( Figure 3). However, depending on other factors acting in the assembly of the centromere in various taxa, it may be possible that just one of those conserved RxxxxPxxWW sequences may be necessary for centromeric binding of the putative non-lobe-finned fish CENP-C. Indeed, the conserved central region is not present in chicken CENP-C 13 and no homology to the RxxxxPxxWW portion of the conserved central region is detectable in CENP-C of other birds and reptiles. Yet a deletion mutant of chicken CENP-C in which the central portion had been removed was able to rescue CENP-C deficient chicken cells and also co-localized with CENP-T at centromeres 13. This demonstrates that, at least in chickens, the central region is not necessary for CENP-A binding and, presumably, the C-terminal region which contains the CENP-C motif is sufficient for that purpose.

Interestingly, BLASTP searches of the Teleostei subset of GenBank retrieved centromeric protein M18BP1 sequences. This protein is recruited to centromeres by CENP-C 14, 15 and along with centromeric proteins Mis18α and Mis18β functions in the recruitment of CENP-A to centromeres 16. The M18BP1 protein contains almost the entire vertebrate CENP-C motif in all vertebrates examined except in mammals ( Figure 4). It appears that the CENP-C motif sequence is not exclusive to just CENP-C. Since both CENP-C and M18BP1 associate with centromeres and with each other, it is tempting to speculate that what has generally been regarded as a CENP-C motif sequence facilitates the interaction of both of these proteins with centromeric nucleosomes. Furthermore, since mammalian M18BP1 lacks this CENP-C motif, it is possible that mammalian M18BP1 may be more dependent on association with CENP-C to localize to the centromere than the M18BP1 of other vertebrate taxa.

Figure 4. Alignment of regions of vertebrate CENP-C and M18BP1 that contain the CENP-C motif sequences.

Figure 4.

The vertebrate CENP-C motif sequence identities ( Figure 2A) are highlighted in red.

Funding Statement

This study was supported by a Faculty Research Grant funded by the University of West Georgia.

I confirm that the funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 2; referees: 3 approved]

Supplementary material

Supplementary File S1. Alignment of the C-terminal portion of vertebrate CENP-C proteins.

The vertebrate CENP-C motif containing consensus sequence utilized in BLASTP searches spans amino acids 11 to 33 and is highlighted in red.

References

  • 1. Cheeseman IM: The kinetochore. Cold Spring Harb Perspect Biol. 2014;6(7):a015826. 10.1101/cshperspect.a015826 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Black BE, Brock MA, Bédard S, et al. : An epigenetic mark generated by the incorporation of CENP-A into centromeric nucleosomes. Proc Natl Acad Sci U S A. 2007;104(12):5008–5013. 10.1073/pnas.0700390104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Meluh PB, Koshland D: Evidence that the MIF2 gene of Saccharomyces cerevisiae encodes a centromere protein with homology to the mammalian centromere protein CENP-C. Mol Biol Cell. 1995;6(7):793–807. 10.1091/mbc.6.7.793 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Heeger S, Leismann O, Schittenhelm R, et al. : Genetic interactions of separase regulatory subunits reveal the diverged Drosophila Cenp-C homolog. Genes Dev. 2005;19(17):2041–2053. 10.1101/gad.347805 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Talbert PB, Bryson TD, Henikoff S: Adaptive evolution of centromere proteins in plants and animals. J Biol. 2004;3(4):18. 10.1186/jbiol11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Malik HS, Henikoff S: Adaptive evolution of Cid, a centromere-specific histone in Drosophila. Genetics. 2001;157(3):1293–1298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Cooper JL, Henikoff S: Adaptive evolution of the histone fold domain in centromeric histones. Mol Biol Evol. 2004;21(9):1712–1718. 10.1093/molbev/msh179 [DOI] [PubMed] [Google Scholar]
  • 8. Schueler MG, Swanson W, Thomas PJ, et al. : Adaptive evolution of foundation kinetochore proteins in primates. Mol Biol Evol. 2010;27(7):1585–1597. 10.1093/molbev/msq043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Fountain DM, Kral LG: Isolation and Characterization of the Etheostoma tallapoosae (Teleostei: Percidae) CENP-A Gene. Genes (Basel). 2011;2(4):829–840. 10.3390/genes2040829 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Abbey HN, Kral LG: Adaptive Evolution of CENP-A in Percid Fishes. Genes (Basel). 2015;6(3):662–671. 10.3390/genes6030662 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Kato H, Jiang J, Zhou BR, et al. : A conserved mechanism for centromeric nucleosome recognition by centromere protein CENP-C. Science. 2013;340(6136):1110–1113. 10.1126/science.1235532 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Milks K, Moree B, Straight AF: Dissection of CENP-C-directed centromere and Kinetochore assembly. Mol Biol Cell. 2009;20(19):4246–4255. 10.1091/mbc.E09-05-0378 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Nagpal H, Hori T, Furukawa A, et al. : Dynamic changes in CCAN organization through CENP-C during cell-cycle progression. Mol Biol Cell. 2015;26(21):3768–3776. 10.1091/mbc.E15-07-0531 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Moree B, Meyer CB, Fuller CJ, et al. : CENP-C recruits M18BP1 to centromeres to promote CENP-A chromatin assembly. J Cell Biol. 2011;194(6):855–871. 10.1083/jcb.201106079 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Dambacher S, Deng W, Hahn M, et al. : CENP-C facilitates the recruitment of M18BP1 to centromeric chromatin. Nucleus. 2012;3(1):101–110. 10.4161/nucl.18955 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Fujita Y, Hayashi T, Kiyomitsu T, et al. : Priming of centromere for CENP-A recruitment by human hMis18alpha, hMis18beta, and M18BP1. Dev Cell. 2007;12(1):17–30. 10.1016/j.devcel.2006.11.002 [DOI] [PubMed] [Google Scholar]
F1000Res. 2016 Apr 22. doi: 10.5256/f1000research.8419.r13524

Referee response for version 2

Inna Lermontova 1

The CENP-C protein is required for the assembly and proper function of kinetochores in most eukaryotes. In this paper the author has identified the putative CENP-C protein homologue of Teleostei using BLASTP searches. This protein does not contain a typical CENP-C motif, but a motif with a partial homology to CENP-C. However, presence of the Cupin domain at the C-terminus and two CENP-C-like motifs at the central and proximal regions, respectively, have allowed the author to conclude that the identified protein is a CENP-C ortholog.

Additionally, the author has identified the CENP-C motif in M18BP1 proteins of fishes and some other vertebrates. It is interesting that the CENP-C motif cannot be identified in M18BP1 of mammals. From my point of view, this is very important finding of the current study. Therefore, I recommend to show in the paper the structure of the M18BP1 protein homologues of vertebrates with the SANTA, SANT and CENP-C domains. These domains are present in M18BP1 proteins of most vertebrates, but M18BP1 proteins of mammals have only SANTA and SANT domains and in Latimeria chalumnae only SANTA and CENP-C domains are present.  I suggest to mention this in the paper and to discuss it.

I think the study is a significant contribution in the field of centromere research and the information presented in the paper is applicable by experimental biologists.

Minor comments:

  • I suggest to change the title to: “Identification of the putative CENP-C protein homologue in fish and the CENP-C motif in M18BP1 of vertebrates.”

  • Abstract: "[…]but evidence is presented that makes it likely[…]" Please write which evidence.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2016 Apr 4. doi: 10.5256/f1000research.8419.r12934

Referee response for version 2

Nikolina Sekulic 1,2

The paper describes identification of putative CENP-C proteins in fish and points to the presence of CENP-C motif in M18BP1protein of many vertebrates but not mammals.

CENP-C is an essential protein in building centromeres that interacts directly with CENP-A, epigenetic mark of centromeres, and its underlining DNA. Lack of conservation between CENP-C of different species has been attributed to adaptive evolution of the protein. Great evolutional variability has made it difficult to identify CENP-C proteins based only on simple sequence homology search. The author has noticed the lack of annotated CENP-C protein in fish and undertook an effort to use targeted database searches to identify putative CENP-C proteins.

The paper fills in the gap of knowledge in the field.  It presents interesting and valuable findings but I suggest a couple of changes that would help improve it.  

 

  1. The central region of the human CENP-C should be labeled as 426-537 and not 426-552. For references see Caroll CW et al., 2010 1; Kato H et al., 2013 2; Falk SJ et al., 2015 3, Falk SJ et al., 2016 4

  2. The author couldn’t identify CENP-C motif in the putative proteins in fish but has instead identified a region that is more similar to the central region. Here previous findings on the role of central region and CENP-C motif should be clarified, emphasising that central region specifically binds CENP-A nucleosomes (which targets it to the centromere) (References: Caroll CW et al., 2010 1; Falk SJ e t al. 2015 3) while CENP-C motif binds both types of nucleosomes non-selectively (H3 and CENP-A) with lower affinity (Kato H et al., 2013 2). This can be followed with an exception in the chicken system, as suggested by the previous reviewer, Tatsuo Fukagawa.

In this context, I find the absence of CENP-C motif less surprising, since the major function of CENP-C protein (its association with CENP-A nucleosomes) comes from central region anyway – making identified candidate proteins very likely fish CENP-C homologs. Subsequently, identification of the CENP-C motif in M18BP1 proteins is also less surprising knowing that it might just be a nucleosome binding module.   

  • In my opinion, region that has similarity to central region should be named “central region-like motif” instead of “cupin-proximal region”.  I find this name very confusing.

 

Minor comments:

 

  1. For the more lay audience, specify that teleost is the most populated branch of fish phylogenetic tree

  2. In figures 1 and 4, make a distinction between fish and other species, and indicate amino-acid specifications within each species protein like in figures 2 and 3.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

References

  • 1. Carroll CW, Milks KJ, Straight AF: Dual recognition of CENP-A nucleosomes is required for centromere assembly. J Cell Biol.2010;189(7) : 10.1083/jcb.201001013 1143-55 10.1083/jcb.201001013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Kato H, Jiang J, Zhou BR, Rozendaal M, Feng H, Ghirlando R, Xiao TS, Straight AF, Bai Y: A conserved mechanism for centromeric nucleosome recognition by centromere protein CENP-C. Science.2013;340(6136) : 10.1126/science.1235532 1110-3 10.1126/science.1235532 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Falk SJ, Guo LY, Sekulic N, Smoak EM, Mani T, Logsdon GA, Gupta K, Jansen LE, Van Duyne GD, Vinogradov SA, Lampson MA, Black BE: Chromosomes. CENP-C reshapes and stabilizes CENP-A nucleosomes at the centromere. Science.2015;348(6235) : 10.1126/science.1259308 699-703 10.1126/science.1259308 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Falk SJ, Lee J, Sekulic N, Sennett MA, Lee TH, Black BE: CENP-C directs a structural transition of CENP-A nucleosomes mainly through sliding of DNA gyres. Nat Struct Mol Biol.2016;23(3) : 10.1038/nsmb.3175 204-8 10.1038/nsmb.3175 [DOI] [PMC free article] [PubMed] [Google Scholar]
F1000Res. 2016 Jan 28. doi: 10.5256/f1000research.8419.r11994

Referee response for version 2

Tatsuo Fukagawa 1

I found that author almost addressed my concerns. The quality of paper has been improved and the paper is of indexable quality.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2015 Sep 25. doi: 10.5256/f1000research.7336.r10498

Referee response for version 1

Tatsuo Fukagawa 1

The centromere protein CENP-C is well known as an essential component for functional kinetochore assembly. Due to importance of this molecule, CENP-C must be conserved in Fish species. The author performed BLASTP searches with the conserved CENP-C motif sequence, but any CENP-C homologues in Fish lineages were not identified with this sequence. However, as there is a putative CENP-C sequence in shark genome, BLASTP searches were carried out with C-terminal domain sequence of putative shark CENP-C. Then, the author identified CENP-C candidates from various teleost genomes. Although the author does not show that candidate proteins localize to centromeres, these candidates contains related sequences of CENP-C motif, which were a little divergent from the vertebrate consensus sequences. Interestingly, the authors found that various vertebrate M18BP1s, which are recruited to centromeres by CENP-C, contain the CENP-C motif sequence, but mammalian homologues do not contain the motif sequence.

This is an observation article and finding of the CENP-C motif in the M18BP1 sequence is interesting. However, to improve the quality of the paper, the author should revise the manuscript. My specific concerns are following.

 

  1. As the author recognizes, it is necessary to verify whether CENP-C candidates from teleost genome really localize to Fish centromeres. As the author obtained a candidate from Zebrafish genome, such an experiment is not difficult with the Zebrafish experimental system. If the author added localization data, Figure 2 would be interesting.

  2. Related to Figure 2. If the author shows sequence comparison of Cupin domain in teleost sequences, it would be helpful.

  3. When the author discuss about central region of human CENP-C, it may be better to cite a recent paper by Nagpa l et al. (Mol. Biol. Cell, 2015), which says that central region sequence does not exist in chicken CENP-C. Then, the author can emphasize that the central motif is really important for CENP-A binding.

  4. The author described some results of the analysis in the Introduction. This is not necessary and it would be better to cut of redundant description.

I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

F1000Res. 2016 Jan 12.
Leos Kral 1

I have revised the manuscript to address concerns #2, #3 and #4. Unfortunately, I do not have the resources to carry out the localization experiment (concern #1). My main motivation in publishing these observations is to bring awareness of this issue to individuals who may have the resources and interest to follow up with the relevant experiments.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials


    Articles from F1000Research are provided here courtesy of F1000 Research Ltd

    RESOURCES