Abstract
The distribution of fat among both invertebrate and vertebrate groups is heterogeneous. Studies have shown that fatty acid‐binding proteins (FABPs), which mainly bind and transport fatty acids, play important roles in the regulation of fat storage and distribution. However, the systematic and genome‐wide investigation of FABP genes in organisms with a heterogeneous fat distribution remains in its infancy. The availability of the complete genomes of Caenorhabditis elegans, Callorhinchus milii, and other organisms with a heterogeneous fat distribution allowed us to systematically investigate the gene structure and phylogeny of FABP genes across a wide range of phyla. In this study, we analyzed the number, structure, chromosomal location, and phylogeny of FABP genes in 18 organisms from C. elegans to Homo sapiens. A total of 12 types of FABP genes were identified in the 18 species, and no single organism exhibited all 12 fatty acid‐binding genes (FABPs). The absence of a specific FABP gene in tissue may be related to the absence of fat storage in the corresponding tissue. The genomic loci of the FABP genes were diverse, and their gene structures varied. The results of the phylogenetic analysis and the observation of conserved gene synthesis of FABP family genes/proteins suggest that all FABP genes may have evolved from a common ancestor through tandem duplication. This study not only lays a strong theoretical foundation for the study of fat deposition in different organisms, but also provides a new perspective regarding metabolic disease prevention and control and the improvement of agricultural product quality.
Keywords: evolution, FABPs, fat distribution, fatty acid‐binding proteins, gene structure
To clarify the relationships between fatty acid‐binding proteins (FABPs) and the tissues in which fat is deposited, the number, gene structure, conserved synteny, and evolution of FABP family genes in 18 species were systemically studied. In total, 142 FABPs were identified. The absence of a specific FABP gene in tissue may be related to the absence of fat storage in the corresponding tissue.
Abbreviations
- fabps
fish and invertebrates’ fatty acid‐binding genes
- fabps
fish and invertebrates’ fatty acid‐binding proteins
- FABPs
mammalian fatty acid‐binding genes
- FABPs
mammalian fatty acid‐binding proteins
- LBP
lipid‐binding protein
- ML
maximum‐likelihood
- NJ
neighbor‐joining
- WAT
white adipose tissue
From Caenorhabditis elegans to Homo sapiens, all animal species have found a way to store excess energy in the form of fat for future needs. Worms (C. elegans) store fat in the intestine [1, 2], and sharks store fat in the liver [3, 4]. However, in most species, fat is stored mainly in white adipose tissue (WAT) to provide energy during periods when energy demands exceed caloric intake [5]. The location of WAT varies in different species. For example, the most efficient site for WAT is the intra‐abdominal region, as observed in most amphibians and reptiles; in nearly all mammals except for pinnipeds (seals) and certain small cetaceans (whales and dolphins) and in many birds, adipose tissue is partitioned into a dozen or more discrete depots that are widely distributed around the body [6]; fish mainly store fat in the liver, muscle, and mesentery; and in the platypus, fat is stored in the tail [7]. Fat distribution plays an important role in the risk of developing metabolic syndrome. Increased intra‐abdominal/visceral fat promotes a high risk of obesity, diabetes, and other metabolic diseases, whereas increased subcutaneous fat in the thighs and hips is responsible for little or no risk [2]. Notably, adipose tissues present in different parts of poultry and livestock play an important role in product quality [8].
Many studies have shown that the storage and distribution of fat are related to age, sex, hormones, fat synthesis, fat decomposition transport, and other factors [9, 10, 11, 12]. Among these factors, fatty acid‐binding proteins (FABPs) are critical mediators of fat storage and distribution [13, 14, 15, 16] that bind fatty acids and other lipid ligands. Epidermal FABPs (EFABPs) are differentially expressed between human omental and subcutaneous adipose tissue [11], and elevated levels of adipocyte FABPs (AFABPs) have been found in pericardial fat tissue and are associated with cardiac dysfunction in obese people [17]. AFABP and HFABP (muscle and heart FABP) have been considered candidate genes for pig fatness traits. AFABP is involved in the regulation of intramuscular fat accretion in mammals such as Duroc pigs, mice, chickens, and rabbits [18, 19, 20].
Fatty acid‐binding proteins are a family of small cytosolic proteins that bind hydrophobic ligands (mainly fatty acids) noncovalently. They are 14‐ to 15‐kDa proteins of 126–134 amino acids and are named after the first tissue from which they are isolated or identified [12]. There are 12 known FABPs in vertebrate and invertebrate animals. In mammals, nine different FABPs with a tissue‐specific distribution have been identified: FABP1 (LFABP, liver), FABP2 (IFABP, intestinal), FABP3 (HFABP, muscle and heart), FABP4 (AFABP, adipocyte), FABP5 (EFABP, epidermal), FABP6 (IlFABP, ileal), FABP7 (BFABP, brain), FABP8 (MFABP, myelin), and FABP9 (TFABP, testis) [21]. In teleost fish, two members of the FABP family, FABP10 (Lb‐FABP, liver basic FABP) and FABP11, have been discovered [22]. In addition, FABP12 has been identified in humans, rats, and mice [23]. Despite the variable sequence identity of these proteins, whose amino acid sequence similarity ranges from 20% to 70%, all FABPs share a conserved tertiary structure composed of 10 antiparallel beta‐sheets and two alpha‐helixes. The orthogonal β‐sheets wrap around a large solvent‐accessible ligand‐binding cavity centered at one end of the barrel, where the helix–turn–helix motif is proposed to act as a ‘portal’ to allow ligand entry [24].
Evolutionary studies have shown that FABPs evolved via successive gene duplications, generating a large number of tissue‐specific homologs [24, 25, 26]. However, the systematic investigation of FABP genes in organisms with a heterogeneous fat distribution in which fat is located in various organs remains in its infancy. The number, structure, sequence identity, and phylogeny of FABP genes in these organisms are unclear. Thus, based on the complete genomes of C. elegans, Callorhinchus milii [27], and other organisms, we systematically analyzed FABP gene organization in 18 species, including organisms with a heterogeneous fat distribution and important domestic economic species (Table 1). We found that although 12 FABP genes have been identified in the selected species, not all types of fatty acid‐binding genes (FABPs) are found in certain organisms. In addition to the ‘canonical’ fabp gene structure including four exons and three introns, some atypical protein‐coding genes exist, including seven exon–six intron, five exon–four intron, and three exon–two intron genes. Alternative splicing, a common phenomenon in humans and other species, greatly increases the diversity of FABP proteins. Based on the phylogenetic analysis and gene synthesis of FABP family genes/proteins, we verified that all FABP genes could have evolved from an ancestral FABP gene through gene duplication. In this study, the differences in fat deposition sites between different species were considered in terms of the transport of fatty acids. This work provides a new research hypothesis and direction for the study of ectopic fat deposition in humans and domestic economic species.
Table 1.
FABPs a | |||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Organisms | LFABP (FABP1) | IFABP (FABP2) | HFABP (FABP3) | AFAPB (FABP4) | EFABP (FABP5) | ILFABP (FABP6) | BFABP (FABP7) | MFABP (FABP8) | TFABP (FABP9) | Lb‐FABP (FABP10) | FABP11 | FABP12 | Gene number |
Human (H. sapiens) | FABP1 | FABP2 | FABP3 | FABP4 | FABP5 | FABP6 | FABP7 | FABP8 | FABP9 | N | N | FABP12 | 10 |
Rat (R. norvegicus) | FABP1 | FABP2 | FABP3 | FABP4 | FABP5 | FABP6 | FABP7 | FABP8 | FABP9 | N | N | FABP12 | 10 |
Mouse (M. musculus) | FABP1 | FABP2 | FABP3 | FABP4 | FABP5 | FABP6 | FABP7 | FABP8 | FABP9 | N | N | FABP12 | 10 |
Cow (B. taurus) | FABP1 | FABP2 | FABP3 | FABP4 | FABP5 | FABP6 | FABP7 | FABP8 | FABP9 | N | N | FABP12 | 10 |
Pig (S. scrofa) | FABP1 | FABP2 | FABP3 | FABP4 | FABP5 | FABP6 | FABP7 | FABP8 | FABP9 | N | N | N | 9 |
Dolphin (T. truncatus) | FABP1 | FABP2 | FABP3 | FABP4 | FABP5 | FABP6 | N | FABP8 | N | N | N | N | 7 |
Platypus (O. anatinus) | FABP1 | FABP2 | N | N | FABP5 | N | N | FABP8 | N | N | N | N | 4 |
Chicken (G. gallus) | FABP1 | FABP2 | FABP3 | FABP4 | FABP5 | FABP6 | FABP7 | FABP8 | N | FABP10 | N | N | 9 |
Xenopus (X. tropicalis) | fabp1 | fabp2 | fabp3 | fabp4 | N | fabp6 | fabp7 | fabp8 | N | fabp10 | N | N | 8 |
Anole lizard (A. carolinensis) | FABP1 | FABP2 | N | FABP4 | FABP5 | FABP6 | FABP7 | FABP8 | N | FABP10 | N | N | 8 |
Zebrafish (D. rerio) | fabp1a, fabp1b.1, fabp1b.2 | fabp2 | fabp3 | N | N | fabp6 |
fabp7a fabp7b |
N | N | fabp10a, fabp10b | fabp11a, fabp11b | N | 12 |
Tilapia (O. niloticus) | fabp1b.1 | fabp2 | fabp3 | N | N | fabp6 | fabp7a | N | N | fabp10a, fabp10b | fabp11a, fabp11b | N | 9 |
Fugu (T. rubripes) | fabp1b.1 | fabp2 | fabp3 | N | N | fabp6 |
fabp7 fabp7a |
N | N |
fabp10a fabp10b |
fabp11a | N | 9 |
Medaka (O. latipes) | fabp1b.1 | fabp2 | fabp3 | N | N | fabp6 | fabp7, fabp7a | N | N | fabp10a, fabp10b | fabp11a, fabp11b | N | 10 |
Shark (C. milii) | ENSCMIG00000013098, ENSCMIG00000013107 | ENSCMIG00000003354 | ENSCMIG00000017535 | N | N | N | ENSCMIG00000011741 | N | N | ENSCMIG00000016027 | N | N | 6 |
Ciona (C. intestinalis) | ENSCING00000012923 | ENSCING00000014756 | N | N | N | ENSCING00000014865 | ENSCING00000002061 | ENSCING00000005575 | N | N | N | N | 5 |
Fruit fly (D. melanogaster) | Fabp | 1 | |||||||||||
Worm (C. elegans) | lbp‐5, lbp‐6, lbp‐7, lbp‐8, lbp‐9 | 5 |
The upper or lower part of gene nomenclature refers to Ensembl database.
Results and Discussion
FABP family gene identification from C. elegans to Homo sapiens
Fatty acid‐binding proteins are members of the superfamily of lipid‐binding proteins (LBPs). The primary role of all FABP family members is the regulation of fatty acid uptake and intracellular transport. Thus, we wondered how many fabp genes exist in species with different adipose tissue depots to regulate fat transport and storage. We found that the presence or absence of 12 FABP genes (FABP1‐FABP12) varies from C. elegans to H. sapiens (Table 1). Among these genes, FABP1 and FABP2 are conserved in 16 chordates, from Ciona intestinalis to H. sapiens. FABP3, FABP6, and FABP7 are conserved in vertebrate genomes. FABP4, FABP5, and FABP8 have been identified in humans, mice, birds, amphibians, and reptiles but not in fish. FABP9 and FABP12 are found only in mammals. In contrast to FABP10 in chickens, amphibians, reptiles, and fish, FABP11 has only been identified in teleosts. On the other hand, some species exhibit limited copies of FABPs such as fbpb1, fabp2, fabp5, fabp6, fabp7, and fabp8. For example, four FABPs (FABP1, FABP2, FABP5, and FABP8) are found in the genome of the platypus (Ornithorhynchus anatinus). According to Ensembl, live‐like, intestinal‐type, brain‐like, and myelin P2 protein‐like fish and invertebrates’ fatty acid‐binding genes (fabps) exist in C. intestinalis. In Drosophila melanogaster, only one fabp gene can be found. In C. elegans, there are nine lbp genes, lbp‐1 to lbp‐9. lbp‐1, lbp‐2, lbp‐3, and lbp‐4 are exclusive to C. elegans, and their products localize to the extracellular region. These four genes are not orthologs of FABP1, FABP2, FABP3, and FABP4 in mammals. Based on the HomoloGene annotation of the NCBI (https://www.ncbi.nlm.nih.gov/homologene), lbp ‐5, lbp‐6, lbp‐7, and lbp‐8 are orthologs of FABP4 in mammals. lbp‐9 is an ortholog of FABP8 in humans and FABP11a/11b in fish. With the updated shark (C. milii) genome, we identified five fabps (fabp1, fabp2, fabp3, fabp7, and fabp10) in cartilaginous fish. fabp1, fabp2, fabp3, fabp6, fabp7, fabp10, and fabp11 have been identified in the genomes of teleost fish. More gene copies have appeared in teleost fish, followed by divergence. For example, in zebrafish, there is only one copy of fabp2, fabp3, and fabp6, while there are three copies of the fabp1 gene (fabp1a, fabp1b.1, and fabp1b.2), and there are two copies of fabp7, fabp10, and fabp11. In total, 142 FABPs were identified in the 18 collected organisms (Tables 1 and S1). In Table S1, we list the gene annotations, including the gene ID, transcript ID, splice variant number, RNA/protein length, exon/intron number, and differing genome sites.
Considering the relationship between FABP genes and fat deposit tissue, we propose that the absence of the FABP5 (EFABP, epidermal) gene in Xenopus and fish may be related to the absence of subcutaneous WAT in these species. In addition, the lack of the FABP4 (AFABP, adipocyte) gene in fish may account for the deposition of fat in the liver, muscle, and mesentery in most fish rather than in adipocytes. Indeed, in zebrafish, it has been proven that fabp11a, and not fabp4, plays a key role during adipogenesis [28, 29]. However, the mechanism regulating the differential distribution of fat is complex, and for many FABPs, the precise physiological function is not completely understood; thus, how FABPs regulate the fat deposited in different tissues also requires further study.
Diversity of the gene structure of FABPs
As reported previously, most functional FABP genes exhibit a ‘canonical’ structure including four exons and three introns [26]. However, some protein‐coding genes with an atypical structure also exist (Figs 1, S1 and Table 2). For example, seven exons and six introns are found in FABP6 of dolphin (Tursiops truncates); there are five exons and four introns in FABP2 of Rattus norvegicus and FABP10 of Anolis carolinensis, respectively; there are three exons and two introns in FABP5 of T. truncates, fabp1 of Xenopus tropicalis, fabp10b of Takifugu rubripes, fabp7a of Oryzias latipes, and FABP1 of C. intestinalis; and in C. elegans, lbp‐5, lbp‐6, lbp‐7, and lbp‐8 are composed of two exons and one intron. In addition, alternative splicing may result in two (FABP6 and FABP7 of humans), two (FABP5 and FABP7 of pigs), one (fabp6 of medaka), three (fabp1, fabp2, and fabp10 of sharks), or one (lbp‐9 of C. elegans) protein encoded by genes exhibiting an unrepresentative gene structure (Fig. 1 and Table 2). Additionally, some lncRNAs, retained introns, and nonsense‐mediated decay are observed in FABP genes derived from alternative splicing (Figs 1, S1, and Table S1).
Table 2.
Organism‐types | Ensembl gene ID | Location | Splice variants | Transcript ID | bp | Protein | Exons | Introns |
---|---|---|---|---|---|---|---|---|
Rat‐FABP2 | ENSRNOG00000024947 | 2:227,080,924‐227,083,501:1 | 1 | ENSRNOT00000029871.3 | 399 | 132aa | 5 | 4 |
Dolphin‐FABP5 | ENSTTRG00000005377 | scaffold_98333:298,656‐300,529:1 | 1 | ENSTTRT00000005366.1 | 327 | 108aa | 3 | 2 |
Dolphin‐FABP6 | ENSTTRG00000012640 | scaffold_106877: 69,634‐93,441:‐1 | 1 | ENSTTRT00000012640.1 | 516 | 171aa | 7 | 6 |
Xenopus‐fabp1 | ENSXETG00000001938 | GL172916.1: 603,458‐605,661:1 | 1 | ENSXETT00000004132.3 | 384 | 128aa | 3 | 2 |
Anole‐FABP10 | ENSACAG00000005346 | GL343498.1: 40,727‐43,888:1 | 1 | ENSACAT00000005335.3 | 382 | 125aa | 5 | 4 |
Fugu‐ fabp10b | ENSTRUG00000020508 | 12:585,852‐586,715:‐1 | 1 | ENSTRUT00000051675.1 | 393 | 130aa | 3 | 2 |
Medaka‐fabp7a | ENSORLG00000013475 | 22:8,108,362‐8,108,904:‐1 | 1 | ENSORLT00000016902.2 | 375 | 132aa | 3 | 2 |
Ciona ‐fabp1 | ENSCING00000014865 | 7: 5,911,360‐5,912,785:1 | 1 | ENSCINT00000026940.2 | 561 | 128aa | 3 | 2 |
C. elegans‐lbp5 | WBGene00002257 | I: 6,731,269‐6,732,280:‐1 | 1 | W02D3.7 | 485 | 136aa | 2 | 1 |
C. elegans‐lbp6 | WBGene00002258 | I: 6,728,290‐6,729,376:‐1 | 1 | W02D3.5 | 533 | 135aa | 2 | 1 |
C. elegans‐lbp7 | WBGene00002259 | V: 13,879,552‐13,880,213 :1 | 1 | T22G5.2 | 517 | 137aa | 2 | 1 |
C. elegans‐lbp8 | WBGene00002260 | V: 13,890,679‐13,891,364 :1 | 1 | T22G5.6 | 622 | 137aa | 2 | 1 |
Human‐FABP6 | ENSG00000170231 | 5:160,187,367‐160,238,735:1 | 4 | ENST00000393980.8 | 756 | 177aa | 7 | 6 |
Human‐FABP7 | ENSG00000164434 | 6:122,779,716‐122,784,074:1 | 2 | ENST00000356535.4 | 2086 | 166aa | 3 | 2 |
Pig‐FABP5 | ENSSSCG00000006153 | 4:55,276,621‐55,282,908:‐1 | 2 | ENSSSCT00000054753.1 | 1740 | 135aa | 5 | 4 |
Pig ‐FABP7 | ENSSSCG00000004234 | 1:39,871,298‐39,876,298:‐1 | 2 | ENSSSCT00000038263.1 | 2183 | 132aa | 3 | 2 |
Medaka‐fabp6 | ENSORLG00000012622 | 17:19,211,982‐19,214,509:1 | 2 | ENSORLT00000047118.1 | 2359 | 164aa | 3 | 2 |
Shark‐fabp1 | ENSCMIG00000013098 | KI635913.1:4,310,897‐4,329,941:1 | 5 | ENSCMIT00000030904.1 | 1181 | 142aa | 3 | 2 |
Shark‐fabp2 | ENSCMIG00000003354 | I635873.1:8,109,897‐8,115,283:1 | 3 | ENSCMIT00000006086.1 | 1361 | 147aa | 2 | 1 |
shark‐fabp10 | ENSCMIG00000016027 | KI635890.1:6,798‐9,285:‐1 | 2 | ENSCMIT00000038686.1 | 1301 | 135aa | 2 | 1 |
C. elegans‐lbp9 | WBGene00021486 | V: 2,068,449‐2,069,143:1 | 2 | Y40B10A.1a | 565 | 152aa | 3 | 2 |
As shown in Fig. 1, which shows the gene structure of 142 fabps, we found that the second and third exons are conserved in almost all protein‐coding FABP genes with four exons and three introns. Although the exon/intron positions are similar in all genes, the length, especially that of the intron, is variable. For example, the lengths of the first intron of FABP1 vary from 74 bp (FABP1 in medaka) to 5239 bp (FABP1 in platypus) because of the differences in genome size among different species. In contrast, the length of exons varies relatively little. In almost all FABPs, the length of the second exon is 173 bp, except in FABP11, in which the length of the second exon is 176 bp. The third exon of most FABPs (FABP3, FABP4, FABP5, FABP7, FABP8, FABP9, FABP11, and FABP12) consists of 102 bp nucleotides; however, there are 93, 108, 90, and 90 bp nucleotides in the third exons of FABP1, FABP2, FABP6, and FABP10, respectively (Fig. 1).
Alternative splicing of FABP genes
Alternative splicing is a common phenomenon in eukaryotes that greatly increases the diversity of proteins that can be encoded by genes [30]. For example, in humans, ~ 95% of multiexon genes undergo alternative splicing [31]. According to the Ensembl annotation, 33 of the 142 FABPs (Fig. 1 and Table S2) exhibit splice variants, except those in the genomes of Bos taurus, T. truncates, Gallus gallus, X. tropicalis, Oreochromis niloticus, and C. intestinalis. Notably, in humans, with the exception of FABP2 and FABP9, eight other FABP genes exhibit alternative splicing; FABP1 and FABP3 to FABP6 exhibit four splice variants, whereas FABP7, FABP8, and FABP12 exhibit two splice variants. In addition, in D. melanogaster, there is only one FABP gene, but it exhibits three splice variants, with all transcripts retaining the second exon during alternative splicing (Fig. 1). Mapping these variants to those in mammals, we found that the second and third exons in mammals are combined to form the second exon (272 bp) in D. melanogaster. Therefore, exon skipping and intron retention are involved in fabp post‐transcriptional splicing.
The conserved synteny of FABP genes
Conserved synteny (the colocalization of genes on chromosomes) is sometimes used to describe the preservation of the precise order of genes on a chromosome passed down from a common ancestor [32], although many geneticists reject this use of the term [33]. Many studies have indicated that all FABPs are likely to have arisen from common ancestral genes through duplication and diversification [25, 34, 35]. Thus, we analyzed the synteny of FABP family genes. As shown by the genetic linkage maps in Fig. 2, we found that FABP4 and FABP8 are always located on the same chromosome in mammals, aves, and amphibians. In mammals, FABP4, FABP5, FABP8, FABP9, and FABP12 are arranged in sequential order on human chromosome 8, rat chromosome 2, mouse chromosome 3, and cow chromosome 14, respectively. In mice, FABP2 is also located on chromosome 3, slightly separated from FABP4, FABP5, FABP8, FABP9, and FABP12. In chickens, FABP4, FABP5, and FABP8 are adjacent to each other on chromosome 2, while FABP2/FABP1 and FABP3/FABP10 are paired on chromosomes 4 and 23, respectively. The sequence arrangement of FABP4/FABP5/FABP8/FABP9 on a single chromosome and the variation of the FABP gene cluster in different tetrapod lineages indicate that the FABP family genes may have arisen through a series of unequal crossover events during meiosis, resulting in continuous tandem duplications in vertebrate lineages [36]. In fish, fabp3 is clustered with fabp10 or/and fabp11. For example, in sharks, fabp3/fabp10 are colocalized on scaffold KI635890.1, but in zebrafish and tilapia, fabp3/fabp11a are located on chromosome 19 and scaffold GL81161.1, respectively; however, in takifugu and medaka, fabp3/fabp10b/11a are located on chromosomes 12 and 11, respectively. In zebrafish, the duplicated fabp1 genes fabp1b.1 and fabp1b.2 form a gene cluster on chromosome 8. In C. elegans, the lbp5/lbp6 and lbp7/lbp8/lbp9 gene clusters are positioned on chromosomes I and V (Fig. 2), respectively, suggesting that they might have arisen via tandem gene duplication.
Phylogenetic relationship between the FABPs of organisms with a heterogeneous fat distribution
Thus far, FABP genes have only been found in vertebrates and invertebrates, and evolutionary studies have distinguished major subfamilies that could have been derived from a single ancestral gene close to the time of the vertebrate/invertebrate split [12]. In this work, to understand the evolutionary relationships of organisms with a heterogeneous fat distribution, we downloaded all selected genes encoding proteins, which varied in length from 108 to 189 amino acids. Multiple sequence alignment of proteins using ClustalW showed that the identities of the proteins ranged from 8.8% (fabp1 in shark vs. fabp5 in dolphin) to 96.9% (FABP7 in mouse vs. FABP7 in rat). The similarity of the amino acid sequences of FABPs of the same type in different organisms is greater than that between FABPs of different types in the same organism. For example, FABP1 from 18 different species consistently displays sequence identities higher than 60%, while the identities of 10 FABPs (FABP1‐FABP9, FABP10) in the human genome are as low as 23% on average (Table S3).
We further constructed a protein neighbor‐joining (NJ) tree using mega6 (Biodesign Institute, Tempe, AZ, USA ) [37]. As shown in the NJ tree (Fig. 3), all FABPs were split into two clades: FABP1, FABP6, and FABP10 cluster in one clade, and FABP2, FABP3, FABP4, FABP5, FABP7, FABP8, FABP9, FABP11, and FABP12 colocalize in another clade. Moreover, the invertebrate FABPs of Nematoda and Drosophila cluster into the FABP2/FABP3/FABP4/FABP5/FABP7/FABP8/FABP9/FABP11/FABP12 clades, which suggests that the FABP1/FABP6/FABP10 gene cluster may have diverged from another cluster before vertebrate/invertebrate divergence. In teleost fish, fabp7a/fabp7b and fabp11a/fabp11b duplicates share a common node, but fabp1a/fabp1b and fabp10a/fabp10b do not share a common node, although all of these sequences cluster in the same clade with all other sister fish and invertebrates’ fatty acid‐binding proteins (fabps). However, fabp7 and fabp8 in C. intestinalis do not cluster into the corresponding clade with other species. The topology of the maximum‐likelihood (ML) tree (Fig. S2) is similar to that of the NJ tree. Together with the conserved FABP gene synteny analysis, we confirmed that the current FABP family gene/protein set might have resulted from multiple rounds of duplications and splicing editing divergence during evolution. These results regarding FABP evolution are consistent with the reports of Schaap [25] and M. Wright [35].
As shown by the studies of M. Wright and his colleagues [35, 36, 38, 39, 40, 41], teleost fishes possess many copies of fabps genes owing to a whole‐genome duplication even that occurred early in the teleost radiation. In addition, these authors proposed that the two copies of fabp7a/fabp7b, fabp10a/fabp10b, and fabp11a/fabp11b in the teleost fish genome may have resulted from a fish‐specific duplication event [29, 39], although the duplicated zebrafish fabp1b.1 and fabp1b.2 sequences are tandemly arrayed on chromosome 8. The fabp1b.1 and fabp1b.2 genes of zebrafish are paralogs that were presumably duplicated by unequal crossing‐over during meiosis [35].
Conclusions
To clarify the relationships between FABPs and the tissues in which fat is deposited, the number, gene structure, conserved synteny, and evolution of FABP family genes in 18 species were systemically studied. There are ten, nine, eight, eight, seven, five, five, one, and five types of FABP family genes in mammals, chickens, Xenopus, anole lizards, teleosts, sharks, Ciona, fruit flies, and worms, respectively (Table 1). In total, 142 FABPs were identified in the selected species (Tables 1 and Table S1). We propose that the loss of a particular FABP may be related to the lack of fat storage in the corresponding tissue. However, since there are many members of the FABP family, how they individually or interactively regulate the distribution of fat in different tissues requires further study.
Materials and methods
FABP orthologs identified from C. elegans to Homo sapiens
In this study, we chose humans, rats, mice, dolphins, platypuses, chickens, Xenopus, anole lizards, teleost fish, sharks, fruit flies, and worms as representative organisms with a heterogeneous fat distribution, and we chose cows, pigs, tilapia, fugu, and medaka as representatives of domestic economic species. Different FABPs are typically identified based on the reciprocal best hits (RBHs) in two genomes using Ensembl BioMart (http://asia.ensembl.org/biomart/martview/1aaa80b6fabd0febf1bc9c3d3c2fb519). Using human FABP1, FABP2, FABP3, FABP4, FABP5, FABP6, FABP7, FABP8, FABP9, and FABP12, we searched for orthologs in the rat (R. norvegicus), mouse (Mus musculus), cow (B. taurus), pig (Sus scrofa), dolphin (Tursiops truncatus), platypus (O. anatinus), chicken (G. gallus), Xenopus (X. tropicalis), anole lizard (A. carolinensis), zebrafish (Danio rerio), tilapia (O. niloticus), fugu (T. rubripes), medaka (O. latipes), Ciona (C. intestinalis), worm (C. elegans), and fruit fly (D. melanogaster) genomes with Ensembl BioMart. Similarly, using zebrafish fabp10a, fabp10b, fabp11a, and fabp11b, we performed searches for orthologs in the genomes of the 17 other organisms.
As previously reported, FABPs evolved through successive gene duplications, generating a large number of tissue‐specific homologs. However, a high degree of gene duplication, particularly in distantly related organisms, hinders ortholog identification by different methods. To avoid obtaining false orthology from Ensembl Compara, we performed NCBI BLASTP and TBLASTN searches using the default parameters to detect orthology. Then, we manually verified the missing genes using OrthoDB (https://www.orthodb.org/), ZFIN (Zebrafish International Resource Center database, http://zfin.org/), and WormBase (https://wormbase.org/#01-23-6).
Gene structure and alternative splicing analysis
Gene structure and splicing variant information was obtained by referencing the Ensembl database (http://asia.ensembl.org/index.html). We manually searched and downloaded information about genetic structure and splice variants for each FABP. Then, we drew the gene structures including every fabps transcript with PowerPoint (PPT).
Gene and protein sequence retrieval
Most FABP protein sequences were retrieved from the UniProt database (https://www.uniprot.org/uploadlists/); the other transcript sequences from T. truncatus, C. milii, D. melanogaster, and C. elegans were downloaded from Ensembl (http://asia.ensembl.org/index.html) and NCBI (https://www.ncbi.nlm.nih.gov/gene/). The protein lengths of all FABPs ranged from 108 to 189 amino acids.
FABP genetic linkage map
The chromosomal or scaffold locations of the FABP genes were derived from Ensembl genome databases. Based on the provided genetic linkage data, we drew genetic linkage maps using MapDraw 2.1 [42] in Excel.
Sequence alignments and reconstruction of gene/protein trees
The FABP protein sequences were aligned using the ClustalW algorithm with default parameters and then manually checked. In addition to all the FABP amino acid sequences identified in this study, 144 FABP sequences were also included for phylogenetic analysis. LCN1 (accession number NP_002288), which belongs to the lipocalin family of calycins and has a size (158 amino acids) comparable with those of iLBPs, was used as an outgroup. Phylogenetic trees were constructed using both the NJ and ML methods implemented in mega6 software (bootstrap = 1000) [37]. Evolview v3 [43] was used to visualize and annotate the NJ tree.
Conflict of interest
All authors declare no conflict of interest.
Author contributions
YZ conceived and designed the experiments. YZ and JZ performed the experiments. YZ and JZ analyzed the data. YZ, JZ, YR, RL, LY, and GN contributed materials/analysis tools. YZ wrote the paper.
Supporting information
Acknowledgements
We would like to thank the other members of our laboratory for their comments on this manuscript. This work was supported by the National Natural Science Foundation of China (31872581 and 31672671), the Key Research Projects of Henan Higher Education Institutions (19B240002 and 20B240002), and the Innovation Scientists and Technicians Troop Construction Projects of Henan Province (CXTD2016043).
References
- 1. McKay RM, McKay JP, Avery L and Graff JM (2003) C. elegans: a model for exploring the genetics of fat storage. Dev Cell 4, 131–142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Gesta S, Tseng YH and Kahn CR (2007) Developmental origin of fat: tracking obesity to its source. Cell 131, 242–256. [DOI] [PubMed] [Google Scholar]
- 3. Sheridan MA (1988) Lipid dynamics in fish: aspects of absorption, transportation, deposition and mobilization. Comp Biochem Physiol B 90, 679–690. [DOI] [PubMed] [Google Scholar]
- 4. Chen K, Durand D and Farach‐Colton M (2000) NOTUNG: a program for dating gene duplications and optimizing gene family trees. J Comput Biol 7, 429–447. [DOI] [PubMed] [Google Scholar]
- 5. Ottaviani E, Malagoli D and Franceschi C (2011) The evolution of the adipose tissue: a neglected enigma. Gen Comp Endocrinol 174, 1–4. [DOI] [PubMed] [Google Scholar]
- 6. Pond CM (1992) An evolutionary and functional view of mammalian adipose tissue. Proc Nutr Soc 51, 367–377. [DOI] [PubMed] [Google Scholar]
- 7. Lev‐Ran A (2001) Human obesity: an evolutionary approach to understanding our bulging waistline. Diabetes Metab Res Rev 17, 347–362. [DOI] [PubMed] [Google Scholar]
- 8. Weil C, Lefevre F and Bugeon J (2013) Characteristics and metabolism of different adipose tissues in fish. Rev Fish Biol Fish 23, 157–173. [Google Scholar]
- 9. Blouin K, Boivin A and Tchernof A (2008) Androgens and body fat distribution. J Steroid Biochem Mol Biol 108, 272–280. [DOI] [PubMed] [Google Scholar]
- 10. Power ML and Schulkin J (2008) Sex differences in fat storage, fat metabolism, and the health risks from obesity: possible evolutionary origins. Br J Nutr 99, 931–940. [DOI] [PubMed] [Google Scholar]
- 11. Perez‐Perez R, Ortega‐Delgado FJ, Garcia‐Santos E, Lopez JA, Camafeita E, Ricart W, Fernandez‐Real JM and Peral B (2009) Differential proteomics of omental and subcutaneous adipose tissue reflects their unalike biochemical and metabolic properties. J Proteome Res 8, 1682–1693. [DOI] [PubMed] [Google Scholar]
- 12. Esteves A and Ehrlich R (2006) Invertebrate intracellular fatty acid binding proteins. Comp Biochem Physiol C Toxicol Pharmacol 142, 262–274. [DOI] [PubMed] [Google Scholar]
- 13. Wajchenberg BL (2000) Subcutaneous and visceral adipose tissue: their relation to the metabolic syndrome. Endocr Rev 21, 697–738. [DOI] [PubMed] [Google Scholar]
- 14. Montani JP, Carroll JF, Dwyer TM, Antic V, Yang Z and Dulloo AG (2004) Ectopic fat storage in heart, blood vessels and kidneys in the pathogenesis of cardiovascular diseases. Int J Obes Relat Metab Disord. 28 (Suppl 4), S58–S65. [DOI] [PubMed] [Google Scholar]
- 15. Li A, Wu L, Wang X, Xin Y and Zan L (2016) Tissue expression analysis, cloning and characterization of the 5'‐regulatory region of the bovine FABP3 gene. Mol Biol Rep 43, 991–998. [DOI] [PubMed] [Google Scholar]
- 16. Laprairie RB, Denovan‐Wright EM and Wright JM (2016) Subfunctionalization of peroxisome proliferator response elements accounts for retention of duplicated fabp1 genes in zebrafish. BMC Evol Biol 16, 147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Choromanska B, Mysliwiec P, Dadan J, Hady HR and Chabowski A (2011) [The clinical significance of fatty acid binding proteins]. Postepy Hig Med Dosw (Online) 65, 759–763. [DOI] [PubMed] [Google Scholar]
- 18. Gerbens F, Jansen A, van Erp AJ, Harders F, Meuwissen TH, Rettenberger G, Veerkamp JH and te Pas MF (1998) The adipocyte fatty acid‐binding protein locus: characterization and association with intramuscular fat content in pigs. Mamm Genome 9, 1022–1026. [DOI] [PubMed] [Google Scholar]
- 19. Liu ZW, Fan HL, Liu XF, Ding XB, Wang T, Sui GN, Li GP and Guo H (2015) Overexpression of the A‐FABP gene facilitates intermuscular fat deposition in transgenic mice. Genet Mol Res 14, 2742–2749. [DOI] [PubMed] [Google Scholar]
- 20. Migdal L, Koziol K, Palka S, Migdal W, Otwinowska‐Mindur A, Kmiecik M, Migdal A, Maj D and Bieniek J (2018) Single nucleotide polymorphisms within rabbits (Oryctolagus cuniculus) fatty acids binding protein 4 (FABP4) are associated with meat quality traits. Livestock Sci 210, 21–24. [Google Scholar]
- 21. Chmurzynska A (2006) The multigene family of fatty acid‐binding proteins (FABPs): function, structure and polymorphism. J Appl Genet 47, 39–48. [DOI] [PubMed] [Google Scholar]
- 22. Alejandra Crovetto C and Leon Cordoba O (2016) Structural and biochemical characterization and evolutionary relationships of the fatty acid‐binding protein 10 (Fabp10) of hake (Merluccius hubbsi). Fish Physiol Biochem 42, 149–165. [DOI] [PubMed] [Google Scholar]
- 23. Liu R‐Z, Li X and Godbout R (2008) A novel fatty acid‐binding protein (FABP) gene resulting from tandem gene duplication in mammals: transcription in rat retina and testis. Genomics 92, 436–445. [DOI] [PubMed] [Google Scholar]
- 24. Storch J and McDermott L (2009) Structural and functional analysis of fatty acid‐binding proteins. J Lipid Res 50 (Suppl), S126–S131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Schaap FG, van der Vusse GJ and Glatz JF (2002) Evolution of the family of intracellular lipid binding proteins in vertebrates. Mol Cell Biochem 239, 69–77. [PubMed] [Google Scholar]
- 26. Zheng Y, Blair D and Bradley JE (2013) Phyletic distribution of fatty acid‐binding protein genes. PLoS ONE 8, e77636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Venkatesh B, Lee AP, Ravi V, Maurya AK, Lian MM, Swann JB, Ohta Y, Flajnik MF, Sutoh Y, Kasahara M et al (2014) Elephant shark genome provides unique insights into gnathostome evolution. Nature 505, 174–179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Flynn EJ 3rd, Trent CM and Rawls JF (2009) Ontogeny and nutritional control of adipogenesis in zebrafish (Danio rerio). J Lipid Res 50, 1641–1652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Salmeron C (2018) Adipogenesis in fish. J Exp Biol 221, 1–11. [DOI] [PubMed] [Google Scholar]
- 30. Black DL (2003) Mechanisms of alternative pre‐messenger RNA splicing. Annu Rev Biochem 72, 291–336. [DOI] [PubMed] [Google Scholar]
- 31. Pan Q, Shai O, Lee LJ, Frey BJ and Blencowe BJ (2008) Deep surveying of alternative splicing complexity in the human transcriptome by high‐throughput sequencing. Nat Genet 40, 1413–1415. [DOI] [PubMed] [Google Scholar]
- 32. Dawson DA, Akesson M, Burke T, Pemberton JM, Slate J and Hansson B (2007) Gene order and recombination rate in homologous chromosome regions of the chicken and a passerine bird. Mol Biol Evol 24, 1537–1552. [DOI] [PubMed] [Google Scholar]
- 33. Passarge E, Horsthemke B and Farber RA (1999) Incorrect use of the term synteny. Nat Genet 23, 387. [DOI] [PubMed] [Google Scholar]
- 34. Plant K, Thumser A and Plant N (2006) Evolution of the fatty acid‐binding protein family: an ordered increase in gene number, correlating to more refined mechanisms of lipid processing. Toxicology 226, 73–74. [Google Scholar]
- 35. Venkatachalam AB, Parmar MB and Wright JM (2017) Evolution of the duplicated intracellular lipid‐binding protein genes of teleost fishes. Mol Genet Genom 292, 699–727. [DOI] [PubMed] [Google Scholar]
- 36. Karanth S, Denovan‐Wright EM, Thisse C, Thisse B and Wright JM (2008) The evolutionary relationship between the duplicated copies of the zebrafish fabp11 gene and the tetrapod FABP4, FABP5, FABP8 and FABP9 genes. FEBS J 275, 3031–3040. [DOI] [PubMed] [Google Scholar]
- 37. Tamura K, Stecher G, Peterson D, Filipski A and Kumar S (2013) MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol 30, 2725–2729. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Denovan‐Wright EM, Pierce M, Sharma MK and Wright JM (2000) cDNA sequence and tissue‐specific expression of a basic liver‐type fatty acid binding protein in adult zebrafish (Danio rerio). Biochim Biophys Acta 1492, 227–232. [DOI] [PubMed] [Google Scholar]
- 39. Parmar MB and Wright JM (2013) Comparative genomic organization and tissue‐specific transcription of the duplicated fabp7 and fabp10 genes in teleost fishes. Genome 56, 691–701. [DOI] [PubMed] [Google Scholar]
- 40. Bayir M, Bayir A and Wright JM (2015) Divergent spatial regulation of duplicated fatty acid‐binding protein (fabp) genes in rainbow trout (Oncorhynchus mykiss). Comp Biochem Physiol D Genomics Proteomics 14, 26–32. [DOI] [PubMed] [Google Scholar]
- 41. Venkatachalam AB, Fontenot Q, Farrara A and Wright JM (2018) Fatty acid‐binding protein genes of the ancient, air‐breathing, ray‐finned fish, spotted gar (Lepisosteus oculatus). Comp Biochem Physiol Part D Genomics Proteomics 25, 19–25. [DOI] [PubMed] [Google Scholar]
- 42. Liu RH and Meng JL (2003) [MapDraw: a microsoft excel macro for drawing genetic linkage maps based on given genetic linkage data]. Yi Chuan 25, 317–321. [PubMed] [Google Scholar]
- 43. Subramanian B, Gao S, Lercher MJ, Hu S and Chen W‐H (2019) Evolview v3: a webserver for visualization, annotation, and management of phylogenetic trees. Nucleic Acids Res 47, W270–W275. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.