Skip to main content
BioMed Research International logoLink to BioMed Research International
. 2022 Oct 4;2022:1581714. doi: 10.1155/2022/1581714

Comparative Genomic Characterization of Relaxin Peptide Family in Cattle and Buffalo

Muhammad Saif-ur Rehman 1, Faiz-ul Hassan 1, Zia-ur Rehman 2, Hafiz Noubahar Hussain 3, Muhammad Adnan Shahid 1, Muhammad Mushahid 1, Borhan Shokrollahi 4,
PMCID: PMC9553489  PMID: 36246983

Abstract

Relaxin family peptides significantly regulate reproduction, nutrient metabolism, and immune response in mammals. The present study aimed to identify and characterize the relaxin family peptides in cattle and buffalo at the genome level. The genomic and proteomic sequences of cattle, buffalo, goat, sheep, horse, and camel were accessed through the NCBI database, and BLAST was performed. We identified four relaxin peptides genes (RLN3, INSL3, INSL5, and INSL6) in Bos taurus, whereas three relaxin genes (RLN3, INSL3, and INSL6) in Bubalus bubalis. Evolutionary analysis revealed the conserved nature of relaxin family peptides in buffalo and cattle. Physicochemical properties revealed that relaxin proteins were thermostable, hydrophilic, and basic peptides except for INSL5 which was an acidic peptide. Three nonsynonymous mutations (two in RLN3 at positions A16 > T and P29 > A, and one in INSL6 at position R32 > Q) in Bos taurus, whereas two nonsynonymous mutations (one in RLN3 at positions G105 > w and one in INSL3 at position G22 > R) in Bubalus bubalis, were identified. INSL3 had one indel (insertion) at position 55 in Bos taurus. Gene duplication analysis revealed predominantly segmental duplications (INSL5/RLN3 and INSL6/INSL3 gene pairs) that helped expand this gene family, whereas Bubalus bubalis showed primarily tandem duplication (INSL3/RLN3). Our study concluded that relaxin family peptides remained conserved during the evolution, and gene duplications might help to adapt and enrich specific functions like reproduction, nutrient metabolism, and immune response. Further, the nonsynonymous mutations identified potentially affect these functions in buffalo.

1. Introduction

The relaxin peptide family comprises seven peptides with significant structural similarities but low sequence resemblance. It includes seven genes, relaxin like RLN1, -2, and -3, and insulin-like INSL3, -4, -5, and -6 in most mammals [1], but their number varies. These peptides show a high sequence resemblance with insulin due to the presence of six cysteine residues that provide the 2 interchain and 1 intrachain disulphide linkages. Each relaxin peptide family member is constituted of two chains called A and B chains [2]. These chains are connected by two disulfide bonds present between them and one disulfide bond within the A chain. Each chain contained the cysteine residues together with distinctive disulfide bonding, which are found conserved across all family members [2].

The RLN1 and RLN2 are present in humans and higher primates like apes. Both are referred to as relaxin as human RLN2 is an orthologue to RLN1 in other mammals [1, 3]. RLN3 was first identified in 2002 and is considered the common ancestral gene for all relaxin peptides [2, 4]. RLN1 and its orthologue RLN2 play an important role in reproduction in mammals [5], but in bovines, these genes have been lost during the evolution and have no traces in the genome [6]. RLN3 has been shown to play a role in nutrient metabolism in cattle [7]. INSL3 plays a crucial role in testicular descent by promoting the growth and development of the gubernaculum ligament [8]. INSL4 is highly expressed in the placenta and might be involved in bone development [5, 9]. Its receptor is yet to be identified. INSL5 with its cognate receptor RXFP4 has been suggested to play important role in immune response regulation, signal transmission to CNS through the vagus nerve, and autocrine/paracrine function within the intestinal tract [10]. INSL6 role in sperm production has been determined in buffalo [11].

Relaxin family peptides have important role in ruminant reproduction and nutrient metabolism as mentioned above. The availability of genomic data has provided opportunity to perform genome-wide characterization of protein families using the different available bioinformatics tools. Many studies have been conducted to identify the evolutionary relationships, physicochemical characteristics, comparative amino acid analysis, effects of mutations, and gene duplications in important protein families in ruminants [1215]. The present study was conducted to characterize relaxin family peptides in cattle (Bos taurus) and buffalo (Bubalus bubalis) at genome-wide level in order to better understand their evolutionary significance, physicochemical properties, and gene duplications events.

2. Materials and Methods

2.1. Genome-Wide Identification of Relaxin Peptides

The whole genome and proteomic sequence data of buffalo (UOA WB 1), cattle (ARS-UCD1.2), sheep (Oar rambouillet v1.0), goat (ARS1), camel (CamDro3), and horse (EquCab3.0) were obtained from the National Center for Biotechnology Information (NCBI) database [16]. A genome-wide BLAST and HMM search was performed to look for all putative relaxin peptide genes in Bos taurus, Bubalus bubalis, and other targeted species [17]. The cattle (Bos taurus), buffalo (Bubalus bubalis), goat (Capra hircus), sheep (Ovis aries), horse (Equus caballus), and camel (Camelus dromedarius) relaxin peptide sequences were also validated through the UniProt database search [18]. All accession numbers of sequences used for this study are presented in Table S1. No information was available for INSL5 gene annotation of Bubalus bubalis in the databases.

2.2. Phylogenetic Analysis

Relaxin peptide amino acid sequences from Bos taurus, Bubalus bubalis, Capra hircus, Ovis aries, Equus caballus, and Camelus dromedarius were aligned in ClustalW. Further, the neighbor-joining (NJ) tree was constructed through the MEGA7 software [19].

2.3. Gene Structure, Motif Patterns, and Conserved Domain Analysis of Relaxin Peptide Family

The conserved motifs in the dataset were analyzed using the MEME suite [20]. As a query, the relaxin protein sequences were submitted in Fasta format, and a site distribution with one occurrence of each site was determined for each sequence. The motifs' minimum and maximum widths were found to be 6 and 50, respectively. The number of themes was limited to ten. The Gene Structure Display Server (GSDS) [21] was used to import all CDs and genomic sequences. The final gene structure was exhibited and illustrated using the genome annotation data in general feature format utilizing in-house scripts in the TBtools software (GFF).

2.4. Physicochemical Properties of Relaxin Proteins

The online available ProtParam tool [22] was used to evaluate the physicochemical characteristics of relaxin peptides including molecular weight (MW), amino acid count (AA), isoelectric point (pI), and aliphatic index (A.I.). Besides, it also included instability index (II) and the grand average of hydropathicity (GRAVY).

2.5. Multiple Sequence Analysis (MSA)

Multiple align show [23] was used online to explore the mutations and indels in the relaxin peptides using the aligned sequences of Bos taurus, Bubalus bubalis, Capra hircus, and Ovis aries.

2.6. Mutational Analysis

Further, the mutations found in Bos taurus and Bubalus bubalis relaxin peptide sequences were subsequently examined using several online tools (PolyPhen-2, MUpro, PROVEAN, IMutant, PhD-SNP, SIFT, SNAP2, PredictSNP, Meta-SNP, and SNAP) to determine their effects on protein structure and functions.

2.7. Nuclear Hormone Receptor Sites Identification

The NHR scan [24] was employed to predict nuclear hormone receptor binding sites. Using genomic sequences in Fasta format, NHR scan was performed. The cumulative probability of entering match states was 0.01 using the NHR scan.

2.8. Synteny Analysis and Gene Duplications

To find collinear genes, the whole genomes of cattle and buffalo were blasted to each other. The dual-synteny map was constructed using the TBtools after submitting annotation files for both genomes, including information about collinear genes and chromosomal IDs.

Chromosomal locations of relaxin genes were obtained from genomic resources of respective specie. An annotated genome file was saved as a general feature format (GFF) file and was fed into the MCScanX programme [25], which was subsequently used to plot the gene locations on chromosomes and presented in TB tools. In addition, the relaxin peptide gene collinearity plots for Bos taurus and Bubalus bubalis were generated. Further, for the Bos taurus and Bubalus bubalis relaxin peptide gene family, pairwise alignment of homologous gene pairs of relaxin peptide genes using MEGA7 [19] with the MUSCLE method was utilized to analyze the frequency of duplications. DnaSP v6.0 [26] was also used to determine pairwise synonymous substitutions per synonymous site (ks) and nonsynonymous substitutions per nonsynonymous site (ka) that were corrected for multiple hits. Synonymous mutations are referred as silent mutatins which can result in altered DNA sequence but does not change the encoded amino acid (evolutionary neutral mutations), whereas nonsynonymous mutations cause change in both DNA and protein sequence (evolutionary important mutations).

3. Results

3.1. Phylogenetic Analysis

Results revealed that relaxin peptide family consisted of four genes (RLN3, INSL3, INSL5, and INSL6) in all representative species, except for Bubalus bubalis which had 3 of them excluding INSL5 (Figure 1). All four genes were clustered into two sister clades. From top to downward, clade 1 included INSL3 and INSL6, whereas clade 2 included INSL5 and RLN3. Overall phylogenetic analysis of the relaxin peptide family revealed that the “Bos taurus and Bubalus bubalis,” “Capra hircus and Ovis aries,” and “Camelus dromedarius and Equus caballus” had close similarities between them. However, INSL3 showed more evolutionary similarities between “Bos taurus and Capra hircus” than between “Bos taurus and Bubalus bubalis.”

Figure 1.

Figure 1

Evolutionary relationship of relaxin peptides (RLN3, INSL3, INSL5, INSL6) in six mammalian species.

3.2. Structural Categorization of the Relaxin Peptide Family

The examination of gene organization, motif patterns, and conserved domains in the relaxin peptide family of four targeted species, including Bos taurus, Bubalus bubalis, Capra hircus, and Ovis aries, were performed to carry out the structural characterization of the relaxin peptide family while taking account of their evolutionary relationships (Figures 2(a)2(d)). The top ten MEME-conserved motifs were investigated to look for conserved domains (Table 1). No conserved domain was detected in the Pfam search for these motifs. Further, all genes were investigated to look for conserved domains across all targeted species (Figure 3(c)). The insulin/insulin-like growth factor (IIGF) domain was found conserved across all targeted species, which was further validated through a conserved domain database (CDD) BLAST. A gene structural analysis revealed the evolutionarily conserved nature of relaxin family genes across the studied species (Figure 3(d)). The same gene across different species revealed a similar number of introns and exons.

Figure 2.

Figure 2

Graphical representation of motif patterns (b), conserved domains (c), and gene structure (d) of relaxin peptide family genes corresponding to their phylogenetic relationships (a) in four mammalian species.

Table 1.

Top ten differentially conserved motifs detected in relaxin peptide family (RLN3, INSL3, INSL5, and INSL6).

Motif Protein sequence Length Pfam domain
MEME-1 REAAATEAARKLCGRHFIRAVVKLCGGSRWSREEG 35
MEME-2 RGLSEKCCKKGCTKSELLTLC 21
MEME-3 PEYQYPEVBLPFESELEEAVASSEILPLTKEPIEFYGKNTBKIGTPSNLF 50
MEME-4 DPALNPAPQPLSQEEAIHNMK 21
MEME-5 TQLLSZASEKVESFIPDRSESSQTTFPVW 29
MEME-6 MRALVLLLLALAVLL 15
MEME-7 LPGGDYELLRKLZGL 15
MEME-8 WGNHPQRK 8
MEME-9 HLLHGLMA 8
MEME-10 GDRDPL 6

Figure 3.

Figure 3

(a–d) Comparative amino acid analysis of the relaxin peptide family(RLN3, INSL3, INSL5, and INSL6) in Bos taurus, Bubalus bubalis, Capra hircus, and Ovis aries.

3.3. Physico-chemical Properties of the Relaxin Proteins

The physicochemical properties like location on the chromosome, exon count, molecular weight (Da), number of amino acids (A.A) in each peptide, aliphatic index (A.I.), isoelectric point (pI), instability index (II), and grand average of hydropathicity index (GRAVY) of the relaxin peptides were evaluated in cattle through the ProtParam tool (Table 2). The molecular weight of relaxin peptides ranged from 14 to 24 kDa. RLN3 and INSL3 were located on chromosomes 7, whereas INSL5 and INSL6 were located on chromosomes 3 and 8, respectively. All relaxin peptide genes had an exons count of 2 and a variable length of the peptides with amino acid residues. The pI values revealed all relaxin peptides were basic except for INSL5 which was slightly acidic. The AI values suggested the thermostable nature of all peptides having AI values greater than 65. Moreover, II values greater than 40 revealed that all peptide members of the relaxin family are unstable in vitro. Negative GRAVY values suggested the hydrophilic nature of relaxin peptides.

Table 2.

Physicochemical properties of the relaxin family peptides in Bos taurus.

Gene Chromosome Exon count MW (kDa) A.A pI AI II GRAVY
RLN3 7 2 14.75 135 7.60 86.00 55.82 −0.162
INSL3 7 2 14.38 132 8.69 96.21 67.33 −0.145
INSL5 3 2 15.36 135 6.89 73.04 76.93 −0.389
INSL6 8 2 24.00 205 9.14 77.51 66.54 −0.717

MW: molecular weight; A.A: number of amino acids; pI: isoelectric point; AI: aliphatic index; II: instability index; GRAVY: grand average of hydropathicity index.

3.4. Identification of Mutations in Relaxin Peptides

Comparative amino acid analysis was performed by aligning the protein sequences of the relaxin peptides of buffalo, cattle, goat, sheep, camel, and horse in multiple align show to look for indels and single amino acid variations in Bos taurus and Bubalus bubalis (Figures 33(d)).

In RLN3 protein, mutations were observed in Bos taurus at positions A16 > T, P29 > A, and A62 > T and Bubalus bubalis at position G105 > W (Figure 3). INSL3 had one indel (insertion) at position 55 in Bos taurus (Figure 3(b)). In INSL3, mutations were observed in Bubalus bubalis at positions G22 > R, V86 > M, and V88 > I, whereas no mutation was observed in Bos taurus (Figure 3(b)). INSL5 sequence of Bubalus bubalis was not found in the database. INSL6 had a mutation in Bos taurus at position R32 > Q, whereas no mutation was detected in Bubalus bubalis.

Additionally, the mutations observed in Bos taurus and Bubalus bubalisthrough a comparative amino acid analysis were further analyzed through different online available mutational analysis tools to predict the functional effects of these mutations (Table S2). In Bos taurus, a total of three nonsynonymous mutations were predicted, two in RLN3 at positions A16 > T and P29 > A, and one in INSL6 at position R32 > Q. In Bubalus bubalis, a total of two nonsynonymous mutations were predicted, one in RLN3 at positions G105 > w, G22 > R, and one in INSL3 at position G22 > R.

3.5. NHR Patterns in Relaxin Peptides

The Bos taurus nuclear hormone receptor sites (NHRs) were searched for all four relaxin peptides (RLN3, INSL3, INSL5, and INSL6) (Figures 4(a)4(d)). A total of 23 NHRs were found of which RLN3 had 3, INSL3 had 2, INSL5 had 6, and INSL6 had 12. The number of direct repeats (DR) identified in RLN3, INSL3, INSL5, and INSL6 were 2, 1, 3, and 6, respectively. The number of everted repeats (ER) identified in RLN3, INSL3, INSL5, and INSL6 were 1, 0, 2, and 3, respectively. The number of inverted repeats (IR) identified in RLN3, INSL3, INSL5, and INSL6 were 0, 1, 1, and 3, respectively.

Figure 4.

Figure 4

NHR scans patterns of RLN3 (a), INSL3 (b), INSL5 (c), and INSL6 (d) in Bos taurus.

3.6. Synteny Analysis and Gene Duplications

Collinearity analysis showed that relaxin family genes were randomly distributed over 2 chromosomes in both cattle and buffalo (Figure 5). In Bos taurus, relaxin peptide genes were present on chromosomes 7 and 8, whereas in Bubalus bubalis, these genes were located on chromosomes 3 and 9.

Figure 5.

Figure 5

Synteny plot between Bos taurus and Bubalus bubalis genomes.

Further, the gene duplication analysis was performed to look for segmental or tandem duplication gene pairs in the relaxin peptide family of Bos taurus and Bubalus bubalis (Table 3). In Bos taurus, two segmental duplication events were observed between INSL5/RLN3 and INSL6/INSL3 gene pairs, whereas in Bubalus bubalis, one tandem duplication was detected between INSL3/RNL3 gene pair. The number of nonsynonymous substitutions per nonsynonymous site/number of synonymous substitutions per synonymous site (ka/ks) ratios was determined for this duplicated event. Bos taurus segmental duplicated pairs INSL5/RLN3 and INSL6/INSL3 showed 0.68 and 0.60 ka/ks ratios, respectively, whereas Bubalus bubalis tandem duplication pair INSL3/RNL3 showed 0.74 ka/ks ratio.

Table 3.

Analysis of duplicated gene pairs and their ka/ks values of relaxin peptide family in Bos taurus and Bubalus bubalis.

Bos taurus
Gene pair
Chromosome Duplication ka ks ka/ks
INSL5/RLN3 3/7 SD 0.32 0.47 0.68
INSL6/INSL3 8/7 SD 0.33 0.56 0.60

Bubalus bubalis
Gene pair
Chromosome Duplication ka ks ka/ks
INSL3/RLN3 9/9 TD 0.37 0.50 0.74

ka: number of nonsynonymous substitutions per nonsynonymous site; ks: number of synonymous substitutions per synonymous site; SD: segmental duplication; TD: tandem duplication.

4. Discussion

4.1. Phylogenetic Analysis

In recent years, genomic sequencing technology, particularly next-generation sequencing, has advanced significantly, resulting in the accessibility of sequenced genomes for many important organisms, opening up a new path for understanding the genomic architecture at the molecular level of diverse animal species [27]. Comparative genomics allows for the discovery of new genes and functional components [28, 29]. Advances in bioinformatics have enabled the utilization of genomic data and look into the protein family evolutionary history, comparative amino acid analysis, gene duplications and prediction of mutations, and their functional and structural effects [1215].

The relaxin peptide family has been found to contain seven members in most mammals, including relaxin-like genes RLN1, RLN2, and RLN3 and insulin-like genes INSL3, INSL4, INSL5, and INSL6 [30]. In our analysis, we have found four genes (RNL3, INSL3, INSL5, and INSL6) in Bos taurus, Capra hircus, Ovis aries, Camelus dromedarius, and Equus caballus, whereas three genes (RNL3, INSL3, and INSL6) in Bubalus bubalis from the sequenced genome of these species. Further, these genes were grouped into two sister major clades, clade 1 included INSL3 and INSL6, whereas clade 2 included INSL5 and RLN3. A variable number of RLN/INSL peptides were also observed in different vertebrates [3133]. These variations could be explained on the basis of gene loss and fixation during the evolution and adaption to specific niches. Overall phylogenetic analysis of the relaxin peptide family revealed that the “Bos taurus and Bubalus bubalis,”, “Capra hircus and Ovis aries,” and “Camelus dromedarius and Equus caballus” had close similarities between them. Previous studies also revealed evolutionary similarities between these species [12, 13]. However, INSL3 peptide showed more evolutionary similarity between “Bos taurus and Capra hircus” than between “Bos taurus and Bubalus bubalis”. Researchers have been fascinated by relaxin evolution for decades. Relaxins are renowned for their high sequence variability across closely related species, although unexpected parallels have been found between quite different species like pigs and whales [34].

4.2. Structural Features of Relaxin Peptides

The examination of gene organization, motif patterns, and conserved domains of the relaxin peptide family of four targeted species including Bos taurus, Bubalus bubalis, Capra hircus, and Ovis aries revealed the conserved nature of relaxin genes across the targeted species. The insulin/insulin-like growth factor (IIGF) domain was found conserved in all relaxin family genes across all targeted species. The insulin/IGF system (IIGFs) regulates a wide range of physiological processes, including development, linear growth, and aging, as well as metabolism, homeostasis, and central nervous system activities [35, 36]. This domain is important for proper function, and any dysregulation in this domain can result in abnormal growth, increased development and progression of numerous cancers, and pathologic ailments associated with chronic inflammation and fibrosis [37, 38].

4.3. Physico-chemical Properties of Relaxin Peptides

The physicochemical properties of the relaxin peptide family proteins were evaluated in Bos taurus through the ProtParam tool (Table 2). The molecular weight of relaxin peptides ranged from 14 to 24 kDa. The aliphatic index (AI) tells about the thermostability of globular proteins, and values greater than 65 show greater thermostability [39]. In our study, all relaxin family peptides were found thermostable. In vitro stability of proteins can be inferred through the instability index (II), and the II value lower than 40 indicates the in vitro stability of proteins [40], as in our case, all relaxin family peptides showed in vitro instability having values greater than 40. The GRAVY values tell about the hydropathicity of protein, the negative GRAVY values show hydrophilic nature, whereas the positive GRAVY values show hydrophobic nature of proteins [41], as in our case, all relaxin family peptides showed hydrophilic nature having negative GRAVY values.

4.4. Comparative Mutational Analysis

Comparative genomics is a large-scale, integrated technology for the comparison of two or more genomes. Comparative studies at various levels of the genomes may be conducted to obtain distinct perspectives on the organisms [35, 42]. We aligned the sequences of four species Bos taurus, Bubalus bubalis, Capra hircus, and Ovis aries in multiple align show to look for indels and single amino acid variations in Bos taurus and Bubalus bubalis. All relaxin family peptides were found well conserved with few amino acid variations in Bos taurus and Bubalus bubalis. INSL3 had one indel (insertion) at position 55 in Bos taurus. Indels and mutations have all played a part in the divergence of gene family members from their progenitors [43]. Further, the mutational analysis of observed single amino acid variations predicted three nonsynonymous mutations (two in RLN3 at positions A16 > T and P29 > A and one in INSL6 at position R32 > Q) in Bos taurus, whereas two nonsynonymous mutations (one in RLN3 at positions G105 > w, G22 > R, and one in INSL3 at position G22 > R) in Bubalus bubalis. RLN3 gene was observed to play role in feed efficiency in cattle [7]. INSL3 is a gender-specific gene that is produced in Leydig cells of male adult and fetus and plays a key role in testicular descent [44]. Higher level of INSL3 gene was observed in female ruminant blood with male fetus. Mutations in the INSL3 gene resulted in failure of testicular descent (cryptorchidism) [45, 46]. INSL6 was detected to play a role during spermatogenesis [11]. The deficiency of INL6 in mice resulted in a decline in sperm production and immotility [47]. Mutations in these genes can interfere with functions like feed metabolism, testicular descent, and spermatogenesis in bovines.

4.5. NHR Patterns in Relaxin Peptides

Diverse biochemical mechanisms are involved in gene regulation and information flow from the DNA to the protein that is transcription and translation, and understanding of these mechanisms is necessary to explore the cell dynamics [48]. Nuclear receptors bind to target genes at sites referred as hormone response elements (HREs) and help to regulate the transcription. These HREs are usually located in the 5-flanking region of target genes. Even though HREs are primarily found near the primary promoter, they can sometimes be found several kilobases upstream away from the start of the transcription site in enhancer regions [49]. Most of the time, a single NHR has been found to impact many genes, and sometimes, many NHRs have competition for one target gene and result in overlapping networks for the target genes [50]. This competition for the same target gene sometimes results in reduced expression of the gene. The expression of the gene can also be reduced if NHR bind with negative HREs [49]. The pattern of NHR sites in the relaxin peptide family in Bos taurus was investigated. A total of 23 NHR sites were detected. In total, 12 direct repeats (DR), 6 everted repeats (ER), and 5 inverted repeats (IR) were found in the Bos taurus relaxin genes.

4.6. Synteny Analysis and Gene Duplications

Chromosomal regions common between two genomes with the same homologous genes order as in common ancestor sites are called synteny blocks [51]. Different species originating from the common ancestor in the same tree of life can be compared using syntenic relationships, which will give an idea about the chromosomal structure and number variation between species [52, 53]. Synteny analysis revealed that relaxin peptide genes were randomly located over 2 chromosomes in both Bos taurus and Bubalus bubalis. In Bos taurus, relaxin genes were present on chromosomes 7 and 8, whereas in Bubalus bubalis, these genes were located on chromosomes 3 and 9. Further, the gene duplication events were examined for Bos taurus and Bubalus bubalis. Gene duplications have evolutionary significance as it is believed that during the evolution, whole genome duplications occurred and only 5 to 10% of duplicated genomes got fixed to perform specific functions, while others were lost in the process [54, 55]. These duplication events helped in the expansion of genome size and increased complexities to perform specific functions as indicated by two rounds of duplications hypothesis (2R hypothesis) [56, 57]. In our study, the Bos taurus relaxin peptide family showed predominantly segmental duplications (INSL5/RLN3 and INSL6/INSL3 gene pairs) that helped in the expansion of this gene family, whereas Bubalus bubalis showed predominantly tandem duplication (INSL3/RLN3). Our results are in agreement with Liu et al. [58], and they explained that segmental duplications are predominant in cattle genome and these duplications in bovine genomes are enriched with specific biological processes related to digestion, lactation, immunity, and reproduction. Further, the ka/ks ratios were lower than 1 for all these observed duplications, indicating the purifying pressure for these duplication events [59].

5. Conclusions

Our study revealed four relaxin peptide family genes (RLN3, INSL3, INSL5, and INSL6) in Bos taurus, whereas three relaxin peptide genes (RLN3, INSL3, and INSL6) in Bubalus bubalis in contrast to seven genes in most of the mammals. The loss of genes might be the result of adaptation to specific niches during evolution. Relaxin family peptides remained conserved during evolution. Nonsynonymous mutations in RLN3, INSL3, and INLS6 can interfere with biological functions like spermatogenesis, testicular descent, and feed metabolism in bovines. The segmental duplication in Bos taurus and the tandem duplication in Bubalus bubalis of relaxin family peptides helped in enrichment to specific functions like reproduction and feed metabolism during evolution.

Acknowledgments

This study was funded by the “National Centre for Livestock Breeding, Genetics and Genomics (NCLBGG),”, Subcentre-UAF, Pakistan.

Data Availability

All data are shown within the manuscript.

Conflicts of Interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

Authors' Contributions

Muhammad Saif-ur Rehman and Faiz-ul Hassan contributed equally to this manuscript.

Supplementary Materials

Supplementary Materials

Table S1: Relaxin peptides gene family members' nucleotide and protein sequences accessions from NCBI database (Genebank accessions) for Bos taurus, Bubalus bubalis, Capra hircus, Ovis aries, Camelus dromedarius and Equus caballus. Table S2: Mutational effects predicted through different online software in relaxin peptides of Bos taurus and Bubalus bubalis.

References

  • 1.Wilkinson T. N., Speed T. P., Tregear G. W., Bathgate R. A. D. Evolution of the relaxin-like peptide family. BMC Evolutionary Biology . 2005;5(1):14–17. doi: 10.1186/1471-2148-5-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Jiaying Chan L., Akhter Hossain M., Samuel C. S., Separovic F., Wade J. D. The relaxin peptide family – structure, function and clinical applications. Protein and Peptide Letters . 2011;18(3):220–229. doi: 10.2174/092986611794578396. [DOI] [PubMed] [Google Scholar]
  • 3.Evans B. A., Fu P., Tregear G. W. Characterization of two relaxin genes in the chimpanzee. The Journal of Endocrinology . 1994;140(3):385–392. doi: 10.1677/joe.0.1400385. [DOI] [PubMed] [Google Scholar]
  • 4.Hsu S. Y. T. New insights into the evolution of the relaxin-LGR signaling system. Trends in Endocrinology and Metabolism . 2003;14(7):303–309. doi: 10.1016/S1043-2760(03)00106-1. [DOI] [PubMed] [Google Scholar]
  • 5.Bathgate R. A. D., Halls M. L., van der Westhuizen E. T., Callander G. E., Kocan M., Summers R. J. Relaxin family peptides and their receptors. Physiological Reviews . 2013;93(1):405–480. doi: 10.1152/physrev.00001.2012. [DOI] [PubMed] [Google Scholar]
  • 6.Malone L., Opazo J. C., Ryan P. L., Hoffmann F. G. Progressive erosion of the Relaxin1 gene in bovids. General and Comparative Endocrinology . 2017;252:12–17. doi: 10.1016/j.ygcen.2017.07.011. [DOI] [PubMed] [Google Scholar]
  • 7.Perkins S. D., Key C. N., Marvin M. N., et al. Effect of residual feed intake on hypothalamic gene expression and meat quality in Angus-sired cattle grown during the hot season1,2. Journal of Animal Science . 2014;92(4):1451–1461. doi: 10.2527/jas.2013-7020. [DOI] [PubMed] [Google Scholar]
  • 8.Ivell R., Hartung S., Anand-Ivell R. A. Insulin-like factor 3: where are we now? Annals of the New York Academy of Sciences . 2005;1041(1):486–496. doi: 10.1196/annals.1282.073. [DOI] [PubMed] [Google Scholar]
  • 9.Chassin D., Laurent A., Janneau J.-L., Berger R., Bellet D. Cloning of a new member of the insulin gene superfamily (INSL4) expressed in human placenta. Genomics . 1995;29(2):465–470. doi: 10.1006/geno.1995.9980. [DOI] [PubMed] [Google Scholar]
  • 10.Hechter D., Vahkal B., Tiede T., Good S. V. Reviewing the physiological roles of the novel hormone-receptor pair INSL5-RXFP4: a protective energy sensor? Journal of Molecular Endocrinology . 2022;69(1):R45–R62. doi: 10.1530/JME-21-0241. [DOI] [PubMed] [Google Scholar]
  • 11.Huang Y., Fu Q., Pan H., et al. Spermatogenesis-associated proteins at different developmental stages of buffalo testicular seminiferous tubules identified by comparative proteomic analysis. Proteomics . 2016;16(14):2005–2018. doi: 10.1002/pmic.201500547. [DOI] [PubMed] [Google Scholar]
  • 12.Rehman S. U., Feng T., Wu S., et al. Comparative genomics, evolutionary and gene regulatory regions analysis of casein gene family in Bubalus bubalis. Frontiers in Genetics . 2021;12, article 662609 doi: 10.3389/fgene.2021.662609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Rehman M. S., Hassan F. U., Rehman Z. U., Ishtiaq I., Rehman S. U., Liu Q. Molecular characterization of TGF-beta gene family in buffalo to identify gene duplication and functional mutations. Genes . 2022;13(8):p. 1302. doi: 10.3390/genes13081302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Rehman M. S., Mushtaq M., Hassan F., Rehman Z., Mushahid M., Shokrollahi B. Comparative genomic characterization of insulin-like growth factor binding proteins in cattle and Buffalo. BioMed Research International . 2022;2022:15. doi: 10.1155/2022/5893825.5893825 [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 15.Rehman S. U., Nadeem A., Javed M., et al. Genomic identification, evolution and sequence analysis of the heat-shock protein gene family in buffalo. Genes . 2020;11(11):p. 1388. doi: 10.3390/genes11111388. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.NCBI: National Center for Biotechnology Information. April 2022, https://www.ncbi.nlm.nih.gov/
  • 17.Finn R. D., Clements J., Eddy S. R. HMMER web server: interactive sequence similarity searching. Nucleic Acids Research . 2011;39:W29–W37. doi: 10.1093/nar/gkr367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Uniprot. Protein databse. April 2022, https://www.uniprot.org/
  • 19.Kumar S., Stecher G., Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Molecular Biology and Evolution . 2016;33(7):1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Bailey T. L., Boden M., Buske F. A., et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Research . 2009;37(Web Server):W202–W208. doi: 10.1093/nar/gkp335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hu B., Jin J., Guo A.-Y., Zhang H., Luo J., Gao G. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics . 2015;31(8):1296–1297. doi: 10.1093/bioinformatics/btu817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Gasteiger E., Hoogland C., Gattiker A., Wilkins M. R., Appel R. D., Bairoch A. The proteomics protocols handbook . Humana Totowa, NJ; 2005. Protein identification and analysis tools on the ExPASy server; pp. 571–607. [Google Scholar]
  • 23.MSA: multiple align show. April 2022, https://www.bioinformatics.org/sms/multi_align.html.
  • 24.NHR-SCAN: Nuclear hormone receptor scan. April 2022, http://nhrscan.genereg.net/cgi-bin/nhr_scan.cgi.
  • 25.Wang Y., Tang H., DeBarry J. D., et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Research . 2012;40(7):e49–e49. doi: 10.1093/nar/gkr1293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Rozas J., Ferrer-Mata A., Sánchez-DelBarrio J. C., et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Molecular Biology and Evolution . 2017;34(12):3299–3302. doi: 10.1093/molbev/msx248. [DOI] [PubMed] [Google Scholar]
  • 27.Tantia M. S., Vijh R. K., Bhasin V., et al. Whole-genome sequence assembly of the water buffalo (Bubalus bubalis) The Indian Journal of Animal Sciences . 2011;81(5):p. 38. [Google Scholar]
  • 28.Rehman S. U., Hassan F., Luo X., Li Z., Liu Q. Whole-genome sequencing and characterization of buffalo genetic resources: recent advances and future challenges. Animals . 2021;11(3):p. 904. doi: 10.3390/ani11030904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Li W., Bickhart D. M., Ramunno L., Iamartino D., Williams J. L., Liu G. E. Genomic structural differences between cattle and river buffalo identified through comparative genomic and transcriptomic analysis. Data in brief . 2018;19:236–239. doi: 10.1016/j.dib.2018.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Hoffmann F. G., Opazo J. C. Evolution of the relaxin/insulin-like gene family in placental mammals: implications for its early evolution. Journal of Molecular Evolution . 2011;72(1):72–79. doi: 10.1007/s00239-010-9403-6. [DOI] [PubMed] [Google Scholar]
  • 31.Good-Avila S. V., Yegorov S., Harron S., et al. Relaxin gene family in teleosts: phylogeny, syntenic mapping, selective constraint, and expression analysis. BMC Evolutionary Biology . 2009;9(293):1–19. doi: 10.1186/1471-2148-9-293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Park J.-I., Semyonov J., Chang C. L., Yi W., Warren W., Hsu S. Y. T. Origin of INSL3-mediated testicular descent in therian mammals. Genome Research . 2008;18(6):974–985. doi: 10.1101/gr.7119108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Park J.-I., Semyonov J., Yi W., Chang C. L., Hsu S. Y. T. Regulation of receptor signaling by relaxin A chain motifs. The Journal of Biological Chemistry . 2008;283(46):32099–32109. doi: 10.1074/jbc.M806817200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Schwabe C., Böllesbach E. E., Heyn H., Yoshioka M. Cetacean relaxin. The Journal of Biological Chemistry . 1989;264(2):940–943. doi: 10.1016/S0021-9258(19)85033-0. [DOI] [PubMed] [Google Scholar]
  • 35.Belfiore A., Frasca F., Pandini G., Sciacca L., Vigneri R. Insulin receptor isoforms and insulin receptor/insulin-like growth factor receptor hybrids in physiology and disease. Endocrine Reviews . 2009;30(6):586–623. doi: 10.1210/er.2008-0047. [DOI] [PubMed] [Google Scholar]
  • 36.Belfiore A., Malaguarnera R., Vella V., et al. Insulin receptor isoforms in physiology and disease: an updated view. Endocrine Reviews . 2017;38(5):379–431. doi: 10.1210/er.2017-00073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Bowers L. W., Rossi E. L., O’Flanagan C. H., deGraffenried L. A., Hursting S. D. The role of the insulin/IGF system in cancer: lessons learned from clinical trials and the energy balance-cancer link. Frontiers in Endocrinology . 2015;6:p. 77. doi: 10.3389/fendo.2015.00077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Avgerinos K. I., Spyrou N., Mantzoros C. S., Dalamaga M. Obesity and cancer risk: emerging biological mechanisms and perspectives. Metabolism . 2019;92:121–135. doi: 10.1016/j.metabol.2018.11.001. [DOI] [PubMed] [Google Scholar]
  • 39.Atsushi I. Thermostability and aliphatic index of globular proteins. Journal of Biochemistry . 1980;88(6):1895–1898. [PubMed] [Google Scholar]
  • 40.Guruprasad K., Reddy B. V. B., Pandit M. W. Correlation between stability of a protein and its dipeptide composition: a novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Engineering, Design & Selection . 1990;4(2):155–161. doi: 10.1093/protein/4.2.155. [DOI] [PubMed] [Google Scholar]
  • 41.Kyte J., Doolittle R. F. A simple method for displaying the hydropathic character of a protein. Journal of Molecular Biology . 1982;157(1):105–132. doi: 10.1016/0022-2836(82)90515-0. [DOI] [PubMed] [Google Scholar]
  • 42.Wei L., Liu Y., Dubchak I., Shon J., Park J. Comparative genomics approaches to study organism similarities and differences. Journal of Biomedical Informatics . 2002;35(2):142–150. doi: 10.1016/S1532-0464(02)00506-3. [DOI] [PubMed] [Google Scholar]
  • 43.Spiller B., Gershenson A., Arnold F. H., Stevens R. C. A structural view of evolutionary divergence. Proceedings of the National Academy of Sciences . 1999;96(22):12305–12310. doi: 10.1073/pnas.96.22.12305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Anand-Ivell R., Hiendleder S., Viñoles C., et al. INSL3 in the ruminant: a powerful indicator of Gender- and genetic-specific feto-maternal dialogue. PLoS One . 2011;6(5, article e19821) doi: 10.1371/journal.pone.0019821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Canto P., Escudero I., Söderlund D., et al. A novel mutation of the insulin-like 3 gene in patients with cryptorchidism. Journal of Human Genetics . 2003;48(2):0086–0090. doi: 10.1007/s100380300012. [DOI] [PubMed] [Google Scholar]
  • 46.Feng S., Cortessis V. K., Hwang A., et al. Mutation analysis of INSL3 and GREAT/LGR8 genes in familial cryptorchidism. Urology . 2004;64(5):1032–1036. doi: 10.1016/j.urology.2004.06.051. [DOI] [PubMed] [Google Scholar]
  • 47.Burnicka-Turek O., Shirneshan K., Paprotta I., et al. Inactivation of insulin-like factor 6 disrupts the progression of spermatogenesis at late meiotic prophase. Endocrinology . 2009;150(9):4348–4357. doi: 10.1210/en.2009-0201. [DOI] [PubMed] [Google Scholar]
  • 48.Davidson E. H. Genomic regulatory systems: in development and evolution . Elsevier; 2001. [Google Scholar]
  • 49.Aranda A., Pascual A. Nuclear hormone receptors and gene expression. Physiological Reviews . 2001;81(3):1269–1304. doi: 10.1152/physrev.2001.81.3.1269. [DOI] [PubMed] [Google Scholar]
  • 50.Cotnoir-White D., Laperrière D., Mader S. Evolution of the repertoire of nuclear receptor binding sites in genomes. Molecular and Cellular Endocrinology . 2011;334(1–2):76–82. doi: 10.1016/j.mce.2010.10.021. [DOI] [PubMed] [Google Scholar]
  • 51.Vergara I. A., Chen N. Large synteny blocks revealed between Caenorhabditis elegans and Caenorhabditis briggsae genomes using OrthoCluster. BMC Genomics . 2010;11(1):p. 516. doi: 10.1186/1471-2164-11-516. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Zhang G., Li B., Li C., Gilbert M. T. P., Jarvis E. D., Wang J. Comparative genomic data of the Avian Phylogenomics Project. Gigascience . 2014;3(1):p. 2047. doi: 10.1186/2047-217X-3-26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Howe K. L., Bolt B. J., Cain S., et al. WormBase 2016: expanding to enable helminth genomic research. Nucleic Acids Research . 2016;44(D1):D774–D780. doi: 10.1093/nar/gkv1217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature . 2001;409:860–921. doi: 10.1038/35057062. [DOI] [PubMed] [Google Scholar]
  • 55.Venter J. C., Adams M. D., Myers E. W., et al. The sequence of the human genome. Science . 2001;291(5507):1304–1351. doi: 10.1126/science.1058040. [DOI] [PubMed] [Google Scholar]
  • 56.Holland P. W. H., Garcia-Fernàndez J., Williams N. A., Sidow A. Gene duplications and the origins of vertebrate development. Development . 1994;1994:125–133. doi: 10.1242/dev.1994.Supplement.125. [DOI] [PubMed] [Google Scholar]
  • 57.Wolfe K. H. Yesterday’s polyploids and the mystery of diploidization. Nature Reviews. Genetics . 2001;2(5):333–341. doi: 10.1038/35072009. [DOI] [PubMed] [Google Scholar]
  • 58.Liu G. E., Ventura M., Cellamare A., et al. Analysis of recent segmental duplications in the bovine genome. BMC Genomics . 2009;10(571):1–16. doi: 10.1186/1471-2164-10-571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Hurst L. D. The Ka/Ks ratio: diagnosing the form of sequence evolution. Trends in genetics: TIG . 2002;18(9):486–487. doi: 10.1016/S0168-9525(02)02722-1. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Materials

Table S1: Relaxin peptides gene family members' nucleotide and protein sequences accessions from NCBI database (Genebank accessions) for Bos taurus, Bubalus bubalis, Capra hircus, Ovis aries, Camelus dromedarius and Equus caballus. Table S2: Mutational effects predicted through different online software in relaxin peptides of Bos taurus and Bubalus bubalis.

Data Availability Statement

All data are shown within the manuscript.


Articles from BioMed Research International are provided here courtesy of Wiley

RESOURCES