Abstract
Intramuscular fat (IMF) content is related to insulin resistance, which is an important prediction factor for disorders, such as cardiovascular disease, obesity and type 2 diabetes in human. At the same time, it is an economically important trait, which influences the sensorial and nutritional value of meat. The deposition of IMF is influenced by many factors such as sex, age, nutrition, and genetics. In this study Nellore steers (Bos taurus indicus subspecies) were used to better understand the molecular mechanisms involved in IMF content. This was accomplished by identifying differentially expressed genes (DEG), biological pathways and putative regulatory factors. Animals included in this study had extreme genomic estimated breeding value (GEBV) for IMF. RNA-seq analysis, gene set enrichment analysis (GSEA) and co-expression network methods, such as partial correlation coefficient with information theory (PCIT), regulatory impact factor (RIF) and phenotypic impact factor (PIF) were utilized to better understand intramuscular adipogenesis. A total of 16,101 genes were analyzed in both groups (high (H) and low (L) GEBV) and 77 DEG (FDR 10%) were identified between the two groups. Pathway Studio software identified 13 significantly over-represented pathways, functional classes and small molecule signaling pathways within the DEG list. PCIT analyses identified genes with a difference in the number of gene-gene correlations between H and L group and detected putative regulatory factors involved in IMF content. Candidate genes identified by PCIT include: ANKRD26, HOXC5 and PPAPDC2. RIF and PIF analyses identified several candidate genes: GLI2 and IGF2 (RIF1), MPC1 and UBL5 (RIF2) and a host of small RNAs, including miR-1281 (PIF). These findings contribute to a better understanding of the molecular mechanisms that underlie fat content and energy balance in muscle and provide important information for the production of healthier beef for human consumption.
Introduction
Intramuscular fat (IMF, also known as marbling) represents the amount of fat accumulated between muscle fibers or within muscle cells, which is the sum of phospholipids (present in cell membranes), and triglycerides (lipid droplets). Understanding the biological and functional mechanisms that regulate IMF content is an interesting issue in meat science and human medicine. Intramuscular fat content is a polygenic trait regulated by many genes involved directly, or indirectly in adipogenesis and fat metabolism [1]. The deposition of IMF is influenced by many factors such as sex, age, breed, nutrition, and genetics [2].
High intramuscular fat content (marbling) has been associated with tenderness, juiciness and consumer satisfaction [3]. At the same time, red meat consumption or more specifically saturated fat consumption has been associated with human diseases, such as cardiovascular disease, obesity and colon cancer [4]. These associations have created a demand for beef with both low fat and high quality by consumers. In addition, consumers in some countries have different preferences of IMF amount. For example, consumers in Asia and North America desire beef with high IMF, while Europeans prefer lean beef, with low IMF [5].
RNA-Seq has been used to provide insight into biological and molecular mechanisms of complex traits and diseases. Recent RNA-Seq studies of pigs with extreme IMF phenotypes have defined differentially expressed genes (DEG) and potential gene networks important in lipid and fatty acid metabolism in the liver [6]. In beef cattle, DEG were identified in subcutaneous fat from Bos taurus crossed steers, which revealed that expression pattern depends on the genetic background [7]. Two studies have investigated DEG due to IMF deposition in Bos taurus, Bos indicus and their crosses [8, 9], however the biological and functional mechanisms involved with IMF amount are still unclear. From the published literature, lipid metabolism is considerably different between nonruminants and ruminants, but gene expression studies in ruminant are relatively scarce.
Understanding the biological mechanisms involved with complex traits requires analysis of genetic networks, as well as determination of relationships between genes and networks. A novel algorithm for the partial correlation coefficient with information theory (PCIT) mathematical method was developed to determine the relationships between all genes and networks [10]. Previously, the PCIT algorithm approach identified causal regulatory changes in myostatin gene expression in beef cattle [11] by co-expression network analysis and the regulatory and phenotypic impact factor methods (RIF and PIF). These co-expression methods provide powerful incite into the changes that occur in the network makeup and wiring between different treatment groups. These methods allow subtle changes in networks to be detected even when many genes in a pathway are not differentially expressed.
The objective of this study was to identify differentially expressed genes, pathways and putative regulators associated with IMF variation in Longissimus dorsi muscle of extreme IMF GEBV Nellore steers to better understanding of IMF metabolism.
Materials and Methods
Ethics statement
All experimental procedures involving steers were approved by the Institutional Animal Care and Use Committee Guidelines from Brazilian Agricultural Research Corporation—EMBRAPA and sanctioned by the president Dr. Rui Machado.
Animals, samples and phenotypes
Three hundred and ten Nellore steers from the Brazilian Agricultural Research Corporation (EMBRAPA/Brazil) experimental breeding herd, raised between 2009 and 2011 were included in this study, as described by Cesar and collaborators [12]. These steers were sired by 34 unrelated sires, and were selected to represent the main genealogies used in Brazil according to the National Summary of Nellore produced by the Brazilian Association of Zebu Breeders (ABCZ) and National Research Center for Beef Cattle. Animals were raised in feedlots under identical nutrition and handling conditions until slaughter at an average age of 25 months. Samples from LD muscle located between the 12th and 13th ribs were collected in two moments: at slaughter for RNA sequencing analysis, and 24 hours after slaughter for the intramuscular fat (IMF) content measurement.
Approximately 100 g samples of beef collected for IMF content analysis were lyophilized and ground to a fine powder. Five g of this ground, lyophilized tissue was used to obtain IMF. The Ankom XT20 lipids equipment was used to determine lipid content according to the procedure of AOCS (Official Procedure Am 5–04) for IMF extraction [13]. Restricted maximum likelihood analysis was performed to estimate variance components, heritability and Genomic Best Linear Unbiased Prediction (GBLUP) using ASREML software [14] on 310 total animals. The SNP markers information was obtained as described by Cesar and collaborators (2014) using BovineHD 770 k BeadChip (Infinium BeadChip, Illumina, San Diego, CA). SAS PROC MIXED was used to test independent sources for significance. Fixed effects included contemporary group classes (animals with the same origin, birth year and slaughter date) and hot carcass weight as a covariate. Animal and residuals were fitted as random effects [12]. The animal model used in this analysis was, y = Xb + Zu + e, where y is the vector of observations, which represented the trait of interest (dependent variable), X and Z are the design or incidence matrices for the vectors of fixed and random effects in b and u, respectively, and e was the vector of random residuals. The expected variance of vector u is Var(a) = I σ2 m; where σ2 m is the variance explained by markers, and I is the identity matrix. The variance of vector u was G σ2 m for the genomic analyses where G is the genomic relationship matrix derived from SNP markers using allele frequencies as suggested by VanRaden [15], with σ2 m being the marker-based additive genetic variance. A group of 14 animals were selected based on their extreme genomic estimated breeding values (GEBVs) for IMF (seven high and seven low). To verify the difference in IMF level between the high and low group a Student's t-test was performed using R package. The genomic estimated breeding values (GEBVs) for other muscle characteristics such as ribeye area and backfat thickness were also calculated to ascertain these animal were not extreme for another characteristic.
RNA extraction, quality analysis, library preparation and sequencing
Total RNA was extracted from 100 mg of frozen LD muscle that was collected at slaughter using the TRIzol reagent (Life Technologies, Carlsbad, CA). RNA integrity was verified by Bioanalyzer 2100 (Agilent, Santa Clara, CA, USA). Only samples with RIN > 8 were used. A total of 2μg of total RNA from each sample was used for library preparation according to the protocol described in the TruSeq RNA Sample Preparation kit v2 guide (Illumina, San Diego, CA). Libraries average size was estimated using the Agilent Bioanalyzer 2100 (Agilent, Santa Clara, CA, USA) and quantified using quantitative PCR with the KAPA Library Quantification kit (KAPA Biosystems, Foster City, CA, USA). Quantified, samples were diluted and pooled (three pools of six samples each). Three lanes of a sequencing flowcell, using the TruSeq PE Cluster kit v3-cBot-HS kit (Illumina, San Diego, CA, USA), were clusterized and sequenced using HiScanSQ equipment (Illumina, San Diego, CA, USA) with a TruSeq SBS Kit v3-HS (200 cycles), according to manufacturer instructions. The sequencing analyses were performed at the Genomics Center at ESALQ, Piracicaba, São Paulo, Brazil.
Quality control and read alignment
Sequencing adaptors and low-complexity reads were removed in an initial data-filtering step. Quality control and reads statistics were estimated with FASTQC version 0.10.1 software [http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc/]. Tophat v. 1.2.0 software [16] was used to map reads to the UMD3.1 Bos taurus reference assembly available at Ensembl [http://www.ensembl.org/Bos_taurus/Info/Index/]. A reference-guided assembly was performed using Cufflinks version 2.0.2, with a minimum alignment count per locus of 10 per transcript [17], to identify novel transcripts. A combination of novel transcripts identified by Cufflinks and those from the reference GTF file at Ensembl were used as the reference for read quantification for each transcript. The abundance (read counts) of mRNAs for all annotated genes, was calculated using HTSeq version 0.5.4 software [http://www-huber.embl.de/users/anders/HTSeq/] [18]. Only sequence reads that uniquely mapped to known chromosomes (excluding reads mapped to unassigned contigs) were used in this analysis.
Identification of differential expressed genes and pathway analysis
Differentially expressed genes were identified using the DESeq software according to the protocol proposed by Anders and collaborators [18], which uses read counts that fall into annotated genes and perform statistical analysis based on the table of counts to discover quantitative changes in expression levels between experimental groups. Prior to statistical analysis, the read count data was filtered as follows: i) transcripts with zero counts were removed (unexpressed); ii) transcripts with less than 1 read per sample on average were removed (very lowly expressed); iii) transcripts that were not present in at least three samples were removed (rarely expressed). After filtering, a total of 16,101 transcripts were analyzed for differential expression using the “nbinomTest” function of DESeq to fit transcript expression level as a negative binomial distribution. Exploratory diagnostic plots were generated to check the dispersion estimates. Benjamini-Hochberg [19] methodology was used to control the false discovery rate (FDR) at 10%. Transcript annotations were retrieved with a perl script to query the Ensembl database using the Ensembl Perl Application Program Interface (API). Transcripts that lacked annotation information were annotated using the Genome-to-seq and GOanna for GO annotations based on sequence homology by Basic Local Alignment Search Tool (BLAST) at AgBase [20]. A two-tiered approach was taken to detect pathway level changes in gene ontology in the extreme IMF samples RNAseq profiles. First, enrichment analysis of curated gene ontology terms was completed with the Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.7 tool [21] using the list of genes that presented FDR < 10%. Second, a literature based pathway enrichment analysis was performed using Pathway Studio [22] from a list of genes that presented FDR < 20%.
PCIT and differential hubbing network analysis
Expression values were normalized as the number of fragments per kilobase of exon per million reads (FPKM) as reported in Cufflinks output [17] for co-expression analysis. A modified version of the PCIT algorithm [23] was used to identify differential hubbing (DH) for all transcripts [11]. A shell script pipeline was developed to summarize all DH results. The gene list used for PCIT included all transcripts detected in our study, but only those with a direct and partial correlation ≥ 0.90 were used for the DH analysis. The DH was computed by the difference of significant connections of a transcript between the low and high IMF group. The top ten most positively and negatively DH transcripts were identified for further investigation.
Regulatory Impact Factor (RIF) network co-expression analysis and phenotypic impact factor (PIF) scores
The regulatory impact factor (RIF) and phenotypic impact factor (PIF) scores were calculated as described [24] to predict which transcripts were potential regulators of gene expression differences between the high and low IMF groups. The RIF calculations presented here were modified from the original method and the complete list of expressed transcripts were tested as potential regulators and only transcripts with a significant partial correlation ≥ 0.90 from PCIT were included in the RIF and PIF score estimates.
The PIF values represent a way to rank genes based on the magnitude of the expression of a gene and the difference in the expression of that gene between two treatments. A high PIF value would indicate that a given gene would likely be closely related to changes in phenotype. The RIF1 value allows genes to be ranked as potential regulators of networks based largely on changes in correlation between two treatment levels (i.e. differential wiring). The RIF2 value allows genes to be ranked as potential regulators of networks with more of an emphasis on how expression changes of a potential regulator may predict the abundance of genes DE due to treatment differences [24]. Thus, RIF2 ranks genes as potential biomarkers tracking key differences in gene expression related to treatment differences.
Results
Phenotypic groups, mapping and annotation
A summary of the IMF phenotypic data expressed as a percentage, GEBVs, and the total number of reads mapped against the Bos taurus UMD3.1 reference genome assembly following quality control are shown in Table 1. The genetic variance, residual variance and heritability for IMF obtained from this population were 0.196, 0.490 and 0.29±0.16, respectively. The GEBV for IMF values for all 310 animals were ranked and seven animals with high GEBV for IMF (H) and seven with low (L) were selected for RNA-Seq analysis. This strategy, to select the animals with extreme GEBV was performed for two main reasons: (1) the correlation between the raw IMF values (percentage of IMF) and GEBV was high (r = 0.76) (S1 Fig) and (2) the GBLUP procedure used genomic information from all relatives and can account for environmental effects [25]. The T-test showed that the IMF averages for two groups were statistically different (p-value = 2.016e-09). The cattle used in this study were not extreme individuals for either subcutaneous fat thickness or longissimus muscle area (S1 Table). These results indicate that the samples selected for IMF were not divergent for other characteristics of muscle measured in this population.
Table 1. Intramuscular fat percentage (IMF), genomic estimated breeding values (GEBV) and mapped reads for all animals within group (Low and High) based on IMF GEBV.
Animal | IMF (%) | GEBV | Mapped reads (M) 1 |
---|---|---|---|
Low1 2 | 1.70 | -0.31 | 23.09 |
Low2 | 1.94 | -0.29 | 11.13 |
Low3 | 1.86 | -0.24 | 25.11 |
Low4 | 1.60 | -0.59 | 13.32 |
Low5 | 1.32 | -0.77 | 14.34 |
Low6 | 1.58 | -0.50 | 14.58 |
Low7 | 1.62 | -0.57 | 17.85 |
High1 3 | 4.42 | 0.44 | 22.65 |
High2 | 4.35 | 0.57 | 17.53 |
High3 | 4.38 | 0.71 | 17.58 |
High4 | 5.27 | 0.85 | 16.31 |
High5 | 5.02 | 0.47 | 15.73 |
High6 | 4.74 | 0.81 | 15.13 |
High7 | 4.35 | 0.61 | 13.76 |
Mean Low | 1.56 ± 0.20 2 | -0.45 | 17.04 |
Mean High | 4.65 ± 0.37 2 | 0.65 | 19.39 |
1M million reads
2 Standard deviation (SD)
A total of 364.18 million (M) 100 bp paired-end reads were obtained from three lanes of an Illumina HiScanSQ. The average number of total reads per sample was 26 M, with an estimated sequencing depth of 40X coverage (i.e. the expected mean coverage at all transcripts). The mean number of total mapped reads estimated by Tophat [16] was 17 M, for an average of 65.38% paired reads mapped. All transcripts were annotated using the Ensembl Perl API or AgBase tools [20], genome2seq and GOanna. A total of 16,101 genes were analyzed after filtering out lowly expressed genes. Correlations among mean gene expression levels between both the H and L groups was high (r = 0.99).
Differential expression analysis and differentially expressed genes
The DESeq R package was used to identify DEG. This statistical package uses a parametric method, which relies on assumptions regarding the distribution of sampled data [18]. The inference relies on estimation of the typical relationship between the mean of gene expression levels and their data’s dispersion (square of the coefficient of biological variation). Dispersion plot of the 16,101 expressed genes identified in both groups (H and L) is presented in S2 Fig. Using this parametrical approach and extending Fisher’s exact test to the data following a negative binomial distribution, 77 DEG (false discovery rate (FDR 10%) were detected (S3 Fig and S2 Table) between the H and L IMF groups. The histogram of p values demonstrates the presence of DEG (S4 Fig). Of the 77 DEG, 41 genes were up-regulated and 36 down-regulated in the L group (S2 Table). The expression level, fold change, p-value and annotation of all 16,101 genes identified are showed in the S3 Table.
Functional enrichment and pathways
A gene-annotation enrichment analysis was performed using the DAVID software [21] and knowledgebase to capture enriched biological terms from the DEG list (FDR < 10%). This analysis identified (Table 2) two significant KEGG pathways based on Benjamini-Hochberg (BH) methodology [19]: focal adhesion (BH-adj < 0.02) and Extracellular matrix (ECM)-receptor interaction (BH-adj < 0.056). A gene set enrichment analysis (GSEA) was performed to identify over-represented pathways in DEG (FDR < 20%) using Pathway Studio. The GSEA detected 13 pathways (FDR < 10%) that contained 27 DEG (Table 3). These DEG are associated with multiple pathways, which could indicate an inter-relation between the pathways. An over-representation of genes were identified in the following functional classes by Pathway Studio: L-cysteine, Zn2+, H2O2, Mg2+, ATP, retinoic acid, NO, ROS, oxidized LDL complex, inflammatory cytokines, and protein tyrosin-kinase. Pathway Studio also identified genes associated with IL-1ß and NF-kB pathways. Fig 1 shows the genes associated with the L-cysteine small molecule, and the genes associated with the retinoic acid and inflammatory cytokine functional classes are shown in S5 and S6 Figs, respectively, where genes colored in red are overexpressed in animals of Low group and those in blue are overexpressed in H group.
Table 2. Functional enrichment and significant category (BH-adj < 0.10) are shown from DEG (FDR < 0.10) comparing high and low IMF GEBV animals.
Category | Term | Count 1 | % 2 | P-Value | Benjamini |
---|---|---|---|---|---|
KEGG_PATHWAY | Focal adhesion | 6 | 8.0 | 4.9E-4 | 2.0E-2 |
GOTERM_BP_FAT | cellular protein catabolic process | 7 | 9.3 | 2.3E-4 | 4.6E-2 |
GOTERM_BP_FAT | protein catabolic process | 7 | 9.3 | 3.5E-4 | 4.7E-2 |
GOTERM_BP_FAT | cellular macromolecule catabolic process | 7 | 9.3 | 5.4E-4 | 5.3E-2 |
KEGG_PATHWAY | ECM-receptor interaction | 4 | 5.3 | 2.7E-3 | 5.6E-2 |
SP_PIR_KEYWORDS | basement membrane | 3 | 4.0 | 6.9E-4 | 6.0E-2 |
GOTERM_BP_FAT | macromolecule catabolic process | 7 | 9.3 | 1.0E-3 | 7.9E-2 |
GOTERM_BP_FAT | modification-dep. protein catabolic process | 6 | 8.0 | 1.2E-3 | 8.0E-2 |
GOTERM_BP_FAT | modification-dep. macromolecule catabolic process | 6 | 8.0 | 1.2E-3 | 8.0E-2 |
GOTERM_BP_FAT | proteolysis involved in cell. protein catabol. process | 7 | 9.3 | 2.3E-4 | 8.8E-2 |
GOTERM_BP_FAT | muscle tissue development | 4 | 5.3 | 1.9E-3 | 9.1E-2 |
GOTERM_BP_FAT | striated muscle tissue development | 4 | 5.3 | 1.6E-3 | 9.1E-2 |
1 Number of differentially expressed genes (DEG) involved in the term
2 Percentage of DEG involved in the term (involved DEG/total DEG)
Table 3. Gene Set Enrichment Analysis, significant functional classes and pathways (p < 0.10) are shown from DEG (FDR < 0.20) comparing high and low IMF GEBV animals.
Name | Type | Total number of Neighbors | Number of Measured Neighbors | Gene Set Seed | Neighbors | padj 1 |
---|---|---|---|---|---|---|
Neighbors of L-cysteine | Pathway | 366 | 5 | L-cysteine | MT3, GRM4, PGM1, TGM2, ELN | 0.011 |
Neighbors of Zn2+ | Pathway | 905 | 6 | Zn2+ | MT3, ADAMTS12, PGM1, TGM2, SLC2A4, STAT5A | 0.021 |
Neighbors of oxidized LDL | Pathway | 424 | 5 | oxidized LDL | PTAFR, COL4A2, TGM2, SERPINE2, SLC2A4 | 0.023 |
Neighbors of inflammatory cytokine | Pathway | 774 | 9 | inflammatory cytokine | MT3, ADAMTS12, FBXO32, CRYAB, TGM2, SERPINE2, SLC2A4, ELN, STAT5A | 0.025 |
Neighbors of H2O2 | Pathway | 1142 | 8 | H2O2 | MT3, FBXO32, CRYAB, HTATIP2, TGM2, SLC2A4, TGFB1I1, ELN | 0.030 |
Neighbors of Mg2+ | Pathway | 562 | 5 | Mg2+ | ADAMTS12, PGM1, TGM2, REM1, ELN | 0.033 |
Neighbors of ATP | Pathway | 1167 | 9 | ATP | GRM4, PGM1, FBXO32, CRYAB, SLC4A4, TGM2, SLC2A4, GYPC, NRP1 | 0.061 |
Neighbors of IL1B | Pathway | 1191 | 12 | IL1B | MT3, CSRP3, RGS16, FBXO32, CRYAB, TGM2, SERPINE2, SLC2A4, SPARC, ELN, STAT5A, NRP1 | 0.063 |
Neighbors of retinoic acid | Pathway | 1605 | 12 | retinoic acid | SLC7A4, CSRP3, PTAFR, RGS16, CPM, TGM2, SLC2A4, FLNA, SPARC, ELN, STAT5A, ROCK2 | 0.074 |
Neighbors of NF-kB | Pathway | 1125 | 9 | NF-kB | MT3, PTAFR, RGS16, FBXO32, TGM2, SLC2A4, ELN, STAT5A, FSCN1 | 0.085 |
Neighbors of protein tyrosine kinase | Pathway | 823 | 7 | protein tyrosine kinase | PGM1, PTAFR, RGS16, SLC4A4, SLC2A4, STAT5A, BCR | 0.092 |
Neighbors of NO | Pathway | 862 | 8 | NO | CSRP3, PTAFR, FBXO32, TGM2, SLC2A4, SPARC, ELN, STAT5A | 0.097 |
Neighbors of ROS | Pathway | 787 | 8 | ROS | MT3, PTAFR, SLC4A4, TGM2, SLC2A4, TGFB1I1, ELN, STAT5A | 0.098 |
1 padj—p value adjusted for multiple testing with the Benjamini-Hochberg procedure, which controls false discovery rate (FDR).
PCIT and differential hubbing analysis
The PCIT algorithm was used to identify differential hubbing (DH) between the H and L IMF groups. Differential hubbing (DH) or differential connectivity (DC) is the difference in the number of significant partial correlations a gene has between two different states (i.e. compared between high and low groups), as computed by the PCIT algorithm. In other words, DH is the change in the number of significant connections between two states. The significance of a correlation is determined using an information theory approach in the case of the PCIT method.
The top ten negative and positive DH values are shown in Tables 4 and 5, respectively. All DH results are presented in S4 Table. The modified version of the PCIT algorithm used in this study allowed all transcripts to be tested as putative regulators genes without prior knowledge of their function [10]. Among the top ten positive DH, three have been previously reported as associated with adipogenesis and adipose metabolism: ANKRD26, HOXC5 and PPAPDC2 (Fig 2). Among the top ten negative DH, two were identified by literature as good putative regulatory genes: Zinc finger protein, friend of GATA (FOG) family member 1 (ZFPM1) and zinc finger protein 90 (ZFP90) (Fig 3).
Table 4. Top 10 negative differentially hubbed (DH) genes comparing H and L groups of GEBV for IMF.
Ensembl Gene ID | Ensembl Transcript ID | Gene Symbol | DH | Transcript Biotype |
---|---|---|---|---|
ENSBTAG00000006639 | ENSBTAT00000008722 | CSTF1 | -216 | protein_coding |
ENSBTAG00000022922 | ENSBTAT00000055503 | ANKRD26 | -216 | protein_coding |
ENSBTAG00000018343 | ENSBTAT00000024407 | NCR3 | -210 | protein_coding |
ENSBTAG00000006990 | ENSBTAT00000009187 | MYRF | -207 | protein_coding |
ENSBTAG00000025809 | ENSBTAT00000036501 | ABHD8 | -204 | protein_coding |
ENSBTAG00000015208 | ENSBTAT00000061091 | LRRC16B | -204 | protein_coding |
ENSBTAG00000009769 | ENSBTAT00000012885 | Novel gene | -200 | processed_pseudogene |
ENSBTAG00000009634 | ENSBTAT00000012696 | HOXC5 | -199 | protein_coding |
ENSBTAG00000011050 | ENSBTAT00000014674 | PPAPDC2 | -198 | protein_coding |
ENSBTAG00000022922 | ENSBTAT00000031151 | ANKRD26 | -198 | protein_coding |
Table 5. Top 10 positive differentially hubbed (DH) genes comparing H and L groups of GEBV for IMF.
Ensembl Gene ID | Ensembl Transcript ID | Gene Symbol | DH | Transcript Biotype |
---|---|---|---|---|
ENSBTAG00000007531 | ENSBTAT00000009911 | NCF4 | 302 | protein_coding |
ENSBTAG00000015766 | ENSBTAT00000047968 | ZFPM1 | 297 | protein_coding |
ENSBTAG00000006638 | ENSBTAT00000008723 | BCL2L12 | 296 | protein_coding |
ENSBTAG00000011590 | ENSBTAT00000015396 | GLIPR1 | 293 | protein_coding |
ENSBTAG00000025398 | ENSBTAT00000035669 | Novel gene | 292 | Novel protein coding |
ENSBTAG00000007488 | ENSBTAT00000032517 | ZPF90 | 285 | protein_coding |
ENSBTAG00000004206 | ENSBTAT00000005510 | LRRC55 | 285 | protein_coding |
ENSBTAG00000014491 | ENSBTAT00000019267 | ATP6V1G2 | 282 | protein_coding |
UNANNOTATED | CUFF.25603.1 | - | 278 | - |
ENSBTAG00000002823 | ENSBTAT00000003653 | MPZL1 | 277 | protein_coding |
Regulatory impact factor and phenotypic impact factor
The DH approach is not applicable to assess the importance of each DEG on differences in phenotype, so Hudson, Reverter and Dalrymple [11] proposed a new metric approach called “phenotypic impact factor” (PIF). PIF is based on DEG numerical properties, which “weigh” the contribution of the DEG based on the per unit differences in phenotype across groups of phenotypically extreme individuals. These authors also proposed the “regulatory impact factor” (RIF) to have a metric that accounts for changes in the molecular wiring of networks represented by changes in gene-gene correlation as well as changes in gene expression and PIF in response to changes in regulators expression level. Two types of RIF scores were developed. The RIF1 score prioritizes regulators that have a greater impact on the changes in wiring (i.e. correlation) in the network, whereas RIF2 score prioritizes regulators whose changes in expression mostly reflect the changes in expression of DEG. A complete list of all RIF1, RIF2 and PIF results are presented in S5, S6 e S7 Tables, respectively. A list of the top 10 RIF1 and RIF2 genes are presented in Tables 6 and 7, respectively.
Table 6. Top 10 genes identified by Regulatory Impact Factor 1 (RIF1) score by contrasting high minus low IMF groups.
Ensembl Gene ID | Ensembl Transcript ID | Gene Symbol | RIF1* ,^ | Transcript Biotype |
---|---|---|---|---|
Top Positive RIF1 | ||||
ENSBTAG00000047048 | ENSBTAT00000063303 | Novel gene | 9.27 | protein coding |
ENSBTAG00000010148 | ENSBTAT00000013392 | Novel gene | 8.88 | processed_pseudogene |
ENSBTAG00000046926 | ENSBTAT00000063044 | GPX4 | 7.92 | protein_coding |
ENSBTAG00000047278 | ENSBTAT00000063308 | NDUFB11 | 7.71 | protein_coding |
ENSBTAG00000046850 | ENSBTAT00000065286 | Novel gene | 7.64 | pseudogene |
Top Negative RIF1 | ||||
ENSBTAG00000011682 | ENSBTAT00000015510 | GLI2 | -1.34 | protein_coding |
ENSBTAG00000001574 | ENSBTAT00000002064 | GPATCH2 | -1.31 | protein_coding |
ENSBTAG00000019283 | ENSBTAT00000025675 | FAM186B | -1.31 | protein_coding |
ENSBTAG00000013066 | ENSBTAT00000044139 | IGF2 | -1.31 | protein_coding |
ENSBTAG00000037440 | ENSBTAT00000055187 | Novel gene | -1.30 | protein coding |
*Note: RIF1 scores are presented as Z score normalized values.
Table 7. Top 10 genes identified by Regulatory Impact Factor 2 (RIF2) score by contrasting high minus low IMF groups.
Ensembl Gene ID | Ensembl Transcript ID | Gene Symbol | RIF2* | Transcript Biotype |
---|---|---|---|---|
Top Positive RIF1 | ||||
ENSBTAG00000016997 | ENSBTAT00000022603 | LOC511180 | 7.60 | protein coding |
ENSBTAG00000015371 | ENSBTAT00000020429 | GABPB2 | 7.19 | protein coding |
ENSBTAG00000047712 | ENSBTAT00000065359 | Novel gene | 6.75 | processed pseudogene |
ENSBTAG00000027879 | ENSBTAT00000040201 | MPC1 | 6.71 | protein coding |
ENSBTAG00000006398 | ENSBTAT00000008388 | TOMM7 | 6.70 | protein coding |
Top Negative RIF2 | ||||
ENSBTAG00000030592 | ENSBTAT00000031821 | UBL5 | -10.27 | protein coding |
ENSBTAG00000012803 | ENSBTAT00000017007 | SRP19 | -10.19 | protein coding |
ENSBTAG00000007199 | ENSBTAT00000009472 | LOC529535 | -8.34 | pseudogene |
ENSBTAG00000047194 | ENSBTAT00000063248 | Novel gene | -8.30 | protein coding |
ENSBTAG00000019949 | ENSBTAT00000026574 | DBR1 | -8.13 | protein coding |
*Note: RIF2 scores are presented as Z score normalized values.
The RIF1 analysis identified many novel or pseudo genes as potential regulators that lead to differences between the low and high IMF groups. Two particularly interesting regulators included GLI2 and IGF2. The RIF2 score identified additional candidate regulators, including two strong candidates: MPC1 and UBL5. The PIF analysis identified many small RNAs as having a large impact on the IMF phenotype. In fact, 9 of the top 10 most highly ranked PIF genes are small RNAs.
To further investigate the functional associations of the 3,000 genes with higher scores of PIF, the functional term enrichment at DAVID database (http://www.david.abcc.ncifcrf.gov) was performed (S8 Table). Several significant clusters were enriched (BH-adj < 0.10), including GO terms of biological processes related to translation, generation of precursor metabolites and energy, ATP synthesis coupled proton transport, tricarboxylic acid cycle, acetyl-CoA catabolic process, acetyl-CoA metabolic process, posttranscriptional regulation of gene expression, protein catabolic process, mRNA metabolic process, RNA processing, RNA splicing, intracellular protein transport, intracellular transport and protein folding.
Discussion
Intramuscular fat (IMF) quantity is an economically important trait, which influences the sensorial and nutritional value of meat. In addition, it is related to insulin resistance, which is an important predictive factor for disorders, such as cardiovascular disease, obesity and type 2 diabetes in human. In beef cattle, adipose tissue development is of significant interest because the deposition and composition of IMF are involved with organoleptic characteristics, consumer preference, public health, and producer profitability. Fat deposition is a consequence of the balance between energy intake and energy expenditure [26], which involves complex biological processes. Adipogenesis depends on a cascade of transcriptional factors activation [27] and events that are still unclear in species such as ruminants. As reviewed by Cristancho and Lazar [28] the adipogenesis process can be divided into three phases: commitment, transition and terminal differentiation. The commitment phase involves the conversion of mesenchymal stem cells (MSCs) to committed white preadipocytes, when occurs a dramatic alteration in cell shape mediated by extracellular matrix (ECM) factors. The committed white preadipocyte then goes through an epigenomic transition phase, when adipogenic stimuli (such as insulin and glucocorticoids) stimulates transcription factors that will induce peroxisome proliferator-activated receptor gamma (PPARg). In the terminal differentiation phase, committed white adipocyte becomes mature white adipocyte by induction of metabolic genes involved with triacylglycerol synthesis and degradation [28].
The present transcriptome study performed two strategies to identify differentially expressed genes (DEG), biological pathways and putative regulatory factors. The first strategy identified DEG between two groups (H and L) and biological pathways [18]. The second strategy identified putative regulatory factors and pathways [11]. In the first strategy, 77 significantly DEG between the groups were identified (S1 Table). It should be noted that in this study, the myofibers were not separated from intramuscular fat. Therefore, it is not possible to determine if the differential expression observed is the direct result of differences in the content of intramuscular fat or if there is an actual change in gene expression in either just the myofibers or intramuscular fat. Some of the genes and pathways herein identified agree with previous gene expression studies in preadipocyte differentiation process [29] and in IMF in cattle [30, 31].
The animals of low GEBV IMF group presented higher expression levels of genes associated with the first phase of adipocyte differentiation (MSC to committed white adipocytes). Some of these genes (ROCK2, SPARC, ELN, RGS16, TGM2, and FLNA) are involved in actinomyosin cytoskeleton remodeling, controlling the expression of adipogenic WNT genes [28]. Secreted protein acidic and rich in cysteine (SPARC) was the first matricellular protein to be linked to the accumulation of white adipose tissue [32]. SPARC inhibits differentiation of preadipocytes into adipocytes, by favoring osteoblastocyte differentiation. In this study the expression level of SPARC was higher in the lower GEBV IMF cattle, similar to that reported by Nie and Sage [33] in mice.
Animals with high GEBV IMF showed higher expression level of HIV-1 tat interactive protein 2 (HTATIP2) and signal transducer and activator of transcription 5A (STAT5A), which participate in molecular processes during the epigenomic transition phase, when insulin and glucocorticoids can lead to changes in the chromatin conformation, inducing the DNase I hypersensitivity “hotspots” [34]. In these “hotspots” Mandrup and collaborators [35] identified transcription factor motifs and binding of CCAAT-enhancer-binding proteins (C/EBP), glucocorticoid receptor (GR) and retinoid X receptor (RXR) and signal transducer and activator of transcription. STAT5 is activated by kinases associated with transmembrane receptors and play role in cell growth and division, apoptosis and inflammation [36]. STAT5 mediates energy homeostasis in response to endogenous cytokines and herein showed greater expression level in the group with high GEBV for IMF. HTATIP2, also more expressed in high GEBV for IMF, acts as a redox sensor, which is linked to regulation of nuclear import and is important in the transition phase [37].
Previously, Wang and collaborators [31] used spotted microarray to identify DEG due to different intramuscular fat content in Wagyu x Hereford and Piedmontese x Hereford crossbreds (Bos taurus). Similar to this study (Table 2), they identified DEG with the over-represented GO terms: muscle development and extracellular structure. Among the DEG identified by Wang and collaborators were SPARC, RGS and STAT family genes, which also were identified in this study. Lee and collaborators [30] evaluated gene expression due to differing IMF levels in Bos taurus coreanae animals and identified GO terms associated with biological adhesion, ECM, metabolic and development processes, which corroborate the findings of this study (Table 2). Lee and collaborators [30] identified some protein components of ECM (ITGA1, ITGB1, COL2A1, COL11A1, and COL11A2) significantly up-regulated in IMF, as identified herein.
In this study, PPARg and fatty acid synthase (FAS), major genes associated with terminal differentiation and fat deposition, presented low expression level in both H and L groups (S2 Table) and the difference of expression level was not significant between these groups. This could be explained by the fact that the animals used in the current study were in early stage of fat deposition (between 1.6 and 4.6% of IMF), while studies that identified terminal differentiation genes employed animal with 7.08% of IMF [38]. These findings corroborate with previous studies, which demonstrated that some Bos taurus breeds are genetically predisposed to deposit intramuscular fat earlier than Bos indicus breeds [32, 39, 40]. Lee and collaborators [31] evaluated differences in gene expression between three different adipose tissues (omental, subcutaneous and intramuscular). FASN, FABP4, LPL, THRSP, DGAT1 and PPARg, all involved in adipogenesis and lipid metabolism, were significantly down-regulated in the intramuscular fat tissue as compared to other tissues. Based on these results, the authors suggested that those genes showed lower metabolic activities in intramuscular tissue of animals over 30 months old. Previous study [41] also reported that the PPARg expression level was different in the Longissimus muscle from Holstein and Charolais bulls slaughtered at 18 mo of age.
Similar to Huff and collaborators [42], De Jager and collaborators [6], presented the correlation between the PPARg gene set (TF, ACSS2, ACLY, PPARg, CEBPA and CYB5A) and IMF in Wagyu x Hereford and Piedmontese x Hereford (Bos taurus crosses) crosses, and Brahman (Bos indicus). These authors reported that the correlation between IMF percentage and PPARg gene set was weak in animal with lower IMF percentage (1.9% in average), i.e. in animals of Brahman breed.
Enrichment analyses detected 13 over-represented functional classes and pathways related to: 1) Small molecule signaling (L-cysteine, Zn2+, H2O2, Mg2+, ATP, retinoic acid, NO, IL-1ß, NF-kB and ROS); 2) Complex molecules (oxidized LDL); 3) Functional classes (inflammatory cytokines and protein tyrosin-kynase) as central regulators, which are involved in molecular mechanisms of adipogenesis (Table 3).
The L-cysteine pathway (Fig 1) regulates the expression of ELN and MT3 that are involved in ECM process during the commitment phase of adipogenesis and have higher expression in animals with low GEBV for IMF. L-cysteine is involved in acylation, a covalent modification of intracellular polypeptides by the addition of C16 palmitic acid by a thioester linkage to cysteine residues of proteins [42, 43].
Elastin (ELN) protein is a component of the extracellular matrix (ECM) and with others EMC components (collagen and thrombospondin) are required for the expansion of fat mass process in obese animals [44]. Metallothionein-3 (MT3) is a zinc-binding protein and was observed that in null MT3 male mice there is an increase in weight, resulting in obesity [45]. This obesity was caused by reduced energy expenditure and not from increased feed intake. This result suggests that MT3 expression is negatively associated with fat variation, and agrees with our observation that animals in the low GEBV for IMF group show high expression level of MT3. On the other hand, phosphoglucomutase 1 (PGM1), which is involved with glycogen storage and is associated with the transition phase of adipogenesis, had higher expression in the animals with high GEBV for IMF. PGM1 is associated with diseases such as hypertension, obesity and cardiovascular disease. It is overexpressed in skeletal muscle from insulin resistant humans [46]. The higher expression of PGM1 in animals with in high IMF GEBV corroborates the findings reported by Nguyen and collaborators [46] in human skeletal muscle from obese individuals.
The retinoic acid and inflammatory cytokine pathways contain several genes that are in common, such as STAT5A, ELN, transglutaminase 2 (TGM2) and solute carrier family 2 (facilitated glucose transporter) member 4 (SLC2A4, also called GLUT4). These genes are involved in different phases of adipogenesis. Glucose transporter 4 (GLUT4) is a major regulator of glucose uptake in adipocytes (insulin regulation), which participates the adipogenic stimulation process. In mice with selective reduction of GLUT4 there is a reduction in insulin-stimulated glucose uptake in adipocytes and consequently insulin resistance [47]. In this study, GLUT4 had higher expression level in animal with low GEBV for IMF. The pathways identified in this study agree with previously published studies that showed the influence of retinoic acid and inflammatory cytokine systems on transcription process of peroxisome proliferator-activated receptor (PPAR) family, which regulate the PPARs expression level in human, mouse and ruminant [48].
The second strategy (PCIT, RIF1, RIF2 and PIF) was performed to better understand the correlations between expressed genes and to identify putative regulators of adipogenesis. These methods revealed regulators known to be involved in adipose development, obesity and retinoic acid signaling.
Differential hubbing (DH) or differential connectivity (DC) is the difference in the number of connections, measured as partial correlations a gene has in two different states. Previous studies discovered that a gene list of extreme values of DH showed higher percentage of transcriptional regulator factors than a list obtained by differential expression (DE) analysis [11]. In this study, the DH analysis using PCIT identified a host of interesting candidate regulators of IMF variation. Biologically interesting regulators include: ankyrin repeat domain 26 (ANKRD26) and phosphatidic acid phosphatase type 2 domain containing 2 (PPAPDC2), homeobox genes, such as HOXC5, zinc finger protein, friend of GATA (FOG) family member 1 (ZFPM1), and zinc finger protein 90 (ZFP90) as putative causal regulatory genes.
Ankyrin repeat domain 26 (ANKRD26) is expressed in the hypothalamus, brain, liver, adipose tissue and skeletal muscle and is located within the cell membrane. In humans and in mice, ANKRD26 is responsible for white adipose tissue insulin response and appetite control [49]. Interestingly, one of the pathways enriched in this study, retinoic acid, has been demonstrated to be associated with adipose metabolism and Sahab and collaborators [50] showed that down-regulation of RAR could impact the expression of ANKRD26.
The homeobox family of genes is a conserved family of transcription factors, which play important roles in morphogenesis, metabolism [51] and differentiation of adipocytes [52].
Phosphatidic acid phosphatase type 2 domain containing 2 (PPAPDC2) is located within the endoplasmic reticulum and nuclear envelope in mammalian cells. Phosphatidic acid is a new class of lipid mediators, which are involved in in cell growth, proliferation and reproduction pathways such as a regulatory factor [53]. In Saccharomyces cerevisiae, phosphatidic acid was reported as an essential metabolic intermediate and a signaling lipid [54].
Zinc-finger protein, friend of GATA (FOG) family member 1 (ZFPM1) and zinc finger protein 90 (ZFP90) are important putative regulator genes in lipid metabolism. GATA family proteins are zinc-finger transcription factors, which recognize GATA motifs in DNA. GATA plays an important role in transcriptional regulation of many genes [55]. ZFP90 is associated with RNA polymerase II core, which is involved in negative regulation of transcription (GO:0001078). Previous study also showed the zinc-finger protein as transcriptional regulators comparing the expression profiles between adipogenic and non-adipogenic fibroblasts [28].
The RIF1 analysis identified as putative regulators IGF2 and GLI2, and RIF2 identified MPC1 and UBL5. The identification of IGF2 as a potential regulator of the degree of IMF has been suggested previously in pigs [56] and in mice [57], as it is involved in adipocyte proliferation [58]. The GLI2 gene, although not documented to play a role in adipose development, is a regulator of cellular proliferation and is known to be regulated by retinoic acid signaling [59], which is consistent with the signaling mechanisms identified by Pathway Studio. The MPC1 gene is a part of a mitochondrial pyruvate carrier complex and has been implicated in the mitochondrial response to insulin and PPARg signaling [60]. The UBL5 gene has been suggested to impact adipose deposition in both the pig and human. The UBL5 is a candidate gene for meat quality and IMF in the pig [61] and also associated with body fat and fat accumulation related to metabolic dysfunction and diabetes in humans [62, 63]. Previous study also reported that PPARg proteasomal degradation is ubiquitin-dependent and the stable overexpression of ubiquitin gene reduces PPARg protein levels and suppress adipocyte differentiation in human cell [64].
One of the top PIF genes was microRNA-1281, previously identified as an adipose expressed miRNA that may regulate the lipid metabolism gene EP300 [65]. The functional enrichment by DAVID database showed that the PIF approach was well suited to identify putative regulators involved with the phenotype studied (intramuscular fat variation).
Associations between GEBVs and RNA abundance are based on the assumption that changes in RNA levels are related to genetic differences in IMF potential in animals with extreme GEBVs for IMF since environmental variation known to impact IMF is account for within the genetic prediction model. However, it is possible that environmental factors could still impact which RNAs are identified as differentially expressed.
In this study, changes in gene expression in this study may indicate changes in IMF, muscle or other cell types within muscle that impact the content of IMF. A portion of the DE genes identified in this study may be related to differences in the lipid content within intramuscular adipocytes or extracellular matrix (ECM) deposition. Changes in genes related to ECM proteins deposition in this study are consistent with previous studies that indicate that increased ECM proteins deposition is associated with increased lipid content within intramuscular adipocytes [66, 67]. This may provide an explanation for DE genes related to ECM growth.
The genes identified by association and network analysis in this study could be related to any of the physiological processes involved in creating variation in IMF. It is difficult to determine if IMF deposition was altered versus other processes like lipid filling in intramuscular adipocytes or other changes in muscle physiology since IMF deposition rates were not measured over time in this study. The lack of FA and TAG synthesis pathway genes may indicate that the DE genes in this study may be related to the lipid content or filling of intramuscular adipocytes which results in the observed variation in IMF levels. Since this study provides novel information about transcriptional differences in the Nellore Bos indicus breed, it is possible that novel genes may be detected that impact the variation in IMF levels. This may represent genetic or physiological differences in Nellore compared to other breeds. Further studies are needed to discern the difference between these two possible explanations.
Conclusions
The present study showed the complexity of Longissimus dorsi muscle transcriptome and the molecular mechanisms involved in lipid metabolism related to differences in IMF in extreme IMF GEBV Nellore cattle. Differential gene expression and pathway enrichment analysis identified a number of genes and pathways related to adipogenesis and lipid metabolism. These changes in gene expression and associated pathways indicate that animals in the high GEBV for IMF mature earlier in respect to IMF content. At the same age, animals with low GEBV for IMF had a higher expression of genes related to the commitment phase of adipogenesis, whereas animal with high GEBV for IMF had higher expression of genes related to the transition phase. Furthermore, there appears to be differences between Bos indicus and Bos taurus in regards of IMF deposition. This study indicated that retinoic acid signaling, IGF2 and ANKRD26 are important regulators of molecular mechanisms related to IMF content and adipogenesis process. These findings contribute to a better understanding of the molecular mechanisms underlying fat deposition and energy balance in muscle of ruminant, and may provide important information to other species, such as human and mouse.
Supporting Information
Acknowledgments
This study was conducted with funding from EMBRAPA (Macroprograma 1, 01/2005) and FAPESP (processes number 2011/00005-7, 2012/02383-1 and 2012/23638-8). LR, GBM and LC were granted CNPq fellowships. The authors would like to acknowledge the collaborative efforts among EMBRAPA, University of São Paulo, Iowa State University.
Data Availability
All sequencing data are available from the European Nucleotide Archive (ENA) of European Bioinformatics Institute (EBI) (accession number PRJEB6732).
Funding Statement
Brazilian Agricultural Research Corporation (EMBRAPA) – https://www.embrapa.br – process number: (Macroprograma 1, 01/2005). Foundation for Research Support of the State of São Paulo (FAPESP) - https://www.fapesp.br – processes number: 2011/00005-7, 2012/02383-1 and 2012/23638-8. LCAR, GBM and LLC were granted CNPq fellowships. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Michal JJ, Zhang ZW, Gaskins CT, Jiang Z (2006) The bovine fatty acid binding protein 4 gene is significantly associated with marbling and subcutaneous fat depth in Wagyu x Limousin F2 crosses. Anim Genet 37: 400–402. [DOI] [PubMed] [Google Scholar]
- 2. Rule DC, MacNeil MD, Short RE (1997) Influence of sire growth potential, time on feed, and growing-finishing strategy on cholesterol and fatty acids of the ground carcass and longissimus muscle of beef steers. J Anim Sci 75: 1525–1533. [DOI] [PubMed] [Google Scholar]
- 3. O'Quinn TG, Brooks JC, Polkinghorne RJ, Garmyn AJ, Johnson BJ, Starkey JD, et al. (2012) Consumer assessment of beef strip loin steaks of varying fat levels. J Anim Sci 90: 626–634. 10.2527/jas.2011-4282 [DOI] [PubMed] [Google Scholar]
- 4. Salter AM (2013) Impact of consumption of animal products on cardiovascular disease, diabetes, and cancer in developed countries. Anim Front 3: 20–27. [Google Scholar]
- 5. Hocquette JF, Van Wezemael L, Chriki S, Legrand I, Verbeke W, Farmer L, et al. (2013) Modelling of beef sensory quality for a better prediction of palatability. Meat Sci 97: 316–322. 10.1016/j.meatsci.2013.07.031 [DOI] [PubMed] [Google Scholar]
- 6. Ramayo-Caldas Y, Mach N, Esteve-Codina A, Corominas J, Castello A, Ballester M, et al. (2012) Liver transcriptome profile in pigs with extreme phenotypes of intramuscular fatty acid composition. BMC Genomics 13: 547 Available: http://www.biomedcentral.com/content/pdf/1471-2164-13-547.pdf. Accessed 18 October 2013. 10.1186/1471-2164-13-547 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Jin W, Olson EN, Moore SS, Basarab JA, Basu U, Guan LL (2012) Transcriptome analysis of subcutaneous adipose tissues in beef cattle using 3' digital gene expression-tag profiling. J Anim Sci 90: 171–183. 10.2527/jas.2011-4229 [DOI] [PubMed] [Google Scholar]
- 8. De Jager N, Hudson NJ, Reverter A, Barnard R, Cafe LM, Greenwood PL, et al. (2013) Gene expression phenotypes for lipid metabolism and intramuscular fat in skeletal muscle of cattle. J Anim Sci 91: 1112–1128. 10.2527/jas.2012-5409 [DOI] [PubMed] [Google Scholar]
- 9. Ramayo-Caldas Y, Fortes MR, Hudson NJ, Porto-Neto LR, Bolormaa S, Barendse W, et al. (2014) A marker-derived gene network reveals the regulatory role of PPARGC1A, HNF4G, and FOXP3 in intramuscular fat deposition of beef cattle. J Anim Sci 92: 2832–2845. 10.2527/jas.2013-7484 [DOI] [PubMed] [Google Scholar]
- 10. Koesterke L, Milfeld K, Vaughn MW, Stanzione D, Koltes JE, Weeks NT, et al. (2013) Optimizing the PCIT algorithm on stampede's Xeon and Xeon Phi processors for faster discovery of biological networks In Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery. San Diego, California: ACM: 1–8. [Google Scholar]
- 11. Hudson NJ, Reverter A, Dalrymple BP (2009) A differential wiring analysis of expression data correctly identifies the gene containing the causal mutation. PLoS Comput Biol 5: e1000382 Available: http://www.ploscompbiol.org/article/fetchObject.action?uri=info%3Adoi%2F10.1371%2Fjournal.pcbi.1000382&representation=PDF. Accessed 12 January 2014. 10.1371/journal.pcbi.1000382 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Cesar AS, Regitano LC, Mourão GB,Tullio RR, Lanna DP, Nassu RT, et al. (2014) Genome-wide association study for intramuscular fat deposition and composition in Nellore cattle. BMC Genet 15: 39 Available: http://www.biomedcentral.com/content/pdf/1471-2156-15-39.pdf. Accessed 10 April 2014. 10.1186/1471-2156-15-39 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. AOCS (2004) Rapid Determination of Oil/Fat Utilizing High Temperature Solvent Official Procedure Am 5–04 oil. Official Methods and Recommended Practices of the AOCS. Am. Oil Chem. Soc., Champaign, IL. [Google Scholar]
- 14. Gilmour AR, Gogel BJ, Cullis BR, Thompson R (2009) ASReml User Guide Release 3.0 Hemel Hempstead, HP1 1ES, UK: VSN International Ltd. [Google Scholar]
- 15. VanRaden PM (2008) Efficient methods to compute genomic predictions. J Dairy Sci 91: 4414–4423. 10.3168/jds.2007-0980 [DOI] [PubMed] [Google Scholar]
- 16. Trapnell C, Pachter L, Salzberg SL (2009) TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25: 1105–1111. 10.1093/bioinformatics/btp120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10: R25 Available: http://genomebiology.com/content/pdf/gb-2009-10-3-r25.pdf. Accessed 18 October 2013. 10.1186/gb-2009-10-3-r25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Anders S, Huber W(2009) Differential expression analysis for sequence count data. Genome Biol 11: R106 Available: http://genomebiology.com/content/pdf/gb-2010-11-10-r106.pdf. Accessed 4 August 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Benjamini Y, Hochberg Y (1995) Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J Roy Statist Soc Ser B (Methodological) 57: 289–300. [Google Scholar]
- 20. McCarthy FM, Gresham CR, Buza TJ, Chouvarine P, Pillai LR, Kumar R, et al. (2011) AgBase: supporting functional modeling in agricultural organisms. Nucleic Acids Res 39: 497–506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Huang da W, Sherman BT, Lempicki RA (2009) Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 37: 1–13. 10.1093/nar/gkn923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Nikitin A, Egorov S, Daraselia N, Mazo I (2003) Pathway studio—the analysis and navigation of molecular networks. Bioinformatics 19: 2155–2157. [DOI] [PubMed] [Google Scholar]
- 23. Reverter A, Chan EK (2008) Combining partial correlation and an information theory approach to the reversed engineering of gene co-expression networks. Bioinformatics 24: 2491–2497. 10.1093/bioinformatics/btn482 [DOI] [PubMed] [Google Scholar]
- 24. Reverter A, Hudson NJ, Nagaraj SH, Perez-Enciso M, Dalrymple BP (2010) Regulatory impact factors: unraveling the transcriptional regulation of complex traits from expression data. Bioinformatics 26: 896–904. 10.1093/bioinformatics/btq051 [DOI] [PubMed] [Google Scholar]
- 25. Sosnicki AA, Newman S (2010) The support of meat value chains by genetic technologies. Meat Sci 86: 129–137. 10.1016/j.meatsci.2010.04.034 [DOI] [PubMed] [Google Scholar]
- 26. Gregor MF, Hotamisligil GS (2011) Inflammatory mechanisms in obesity. Annu Rev Immunol 29: 415–445. 10.1146/annurev-immunol-031210-101322 [DOI] [PubMed] [Google Scholar]
- 27. Gregoire FM, Smas CM, Sul HS (1998) Understanding adipocyte differentiation. Physiol Rev 78: 783–809. [DOI] [PubMed] [Google Scholar]
- 28. Cristancho AG, Lazar MA (2011) Forming functional fat: a growing understanding of adipocyte differentiation. Nat Rev Mol Cell Biol 12: 722–734. 10.1038/nrm3198 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Lee SH, Park EW, Cho YM, Kim SK, Lee JH, Jeon JT, et al. (2007) Identification of differentially expressed genes related to intramuscular fat development in the early and late fattening stages of hanwoo steers. J Biochem Mol Biol 40:757–764. [DOI] [PubMed] [Google Scholar]
- 30. Lee HJ, Jang M, Kim H, Kwak W, Park W, Hwang JY, et al. (2013) Comparative Transcriptome Analysis of Adipose Tissues Reveals that ECM-Receptor Interaction Is Involved in the Depot-Specific Adipogenesis in Cattle. PLoS One 8(6):e66267 Available: http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0066267. Accessed 10 December 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Wang YH, Byrne KA, Reverter A, Harper GS, Taniguchi M, McWilliam SM, et al. (2005) Transcriptional profiling of skeletal muscle tissue from two breeds of cattle. Mamm Genome 16: 201–210. [DOI] [PubMed] [Google Scholar]
- 32. Bradshaw AD, Graves DC, Motamed K, Sage EH (2003) SPARC-null mice exhibit increased adiposity without significant differences in overall body weight. Proc Natl Acad Sci U S A 100: 6045–6050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Nie J, Sage EH (2009) SPARC functions as an inhibitor of adipogenesis. J Cell Commun Signal 3: 247–254. 10.1007/s12079-009-0064-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Steger DJ, Lazar MA (2011) Adipogenic hotspots: where the action is. EMBO J 30: 1418–1419. 10.1038/emboj.2011.80 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Mandrup S, Hager GL (2012) Modulation of chromatin access during adipocyte differentiation. Nucleus 3: 12–15. 10.4161/nucl.18928 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Stoecklin E, Wissler M, Schaetzle D, Pfitzner E, Groner B (1999) Interactions in the transcriptional regulation exerted by Stat5 and by members of the steroid hormone receptor family. J Steroid Biochem Mol Biol 69: 195–204. [DOI] [PubMed] [Google Scholar]
- 37. Gerhard DS, Wagner L, Feingold EA, Shenmen CM, Grouse LH, Schuler G, et al. (2004) The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). Genome Res 14: 2121–2127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Dinh TT, Blanton JR Jr, Riley DG, Chase CC Jr, Coleman SW, Philips WA, et al. (2010) Intramuscular fat and fatty acid composition of longissimus muscle from divergent pure breeds of cattle. J Anim Sci 88: 756–766. 10.2527/jas.2009-1951 [DOI] [PubMed] [Google Scholar]
- 39. Lehnert SA, Reverter A, Byrne KA, Wang Y, Nattrass GS, Hudson NJ, et al. (2007) Gene expression studies of developing bovine longissimus muscle from two different beef cattle breeds. BMC Dev Biol 7: 95 Available: http://www.biomedcentral.com/content/pdf/1471-213X-7-95.pdf. Accessed 18 October 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Wang YH, Bower NI, Reverter A, Tan SH, De Jager N, Wang R, et al. (2009) Gene expression patterns during intramuscular fat development in cattle. J Anim Sci 87: 119–130. 10.2527/jas.2008-1082 [DOI] [PubMed] [Google Scholar]
- 41. Huff PW, Ren MQ, Lozeman FJ, Weselake RJ, Wegner J (2004) Expression of peroxisome proliterator-activated receptor (PPARγ) mRNA in adipose and muscle tissue of Holstein and Charolais cattle. Can J Anim Sci 84: 49–55. [Google Scholar]
- 42. Milligan G, Parenti M, Magee AI (1995) The dynamic role of palmitoylation in signal transduction. Trends Biochem Sci 20: 181–187. [DOI] [PubMed] [Google Scholar]
- 43. Resh MD (2006) Palmitoylation of ligands, receptors, and intracellular signaling molecules. Sci STKE 359: re14 Available: http://stke.sciencemag.org/cgi/reprint/sigtrans;2006/359/re14.pdf. Accessed 18 October 2013. [DOI] [PubMed] [Google Scholar]
- 44. Nicoloff G, Weiss AS, Iotova V, Tzaneva V, Petrova C, Domuschieva N, et al. (2006) Abnormal levels of serum antielastin antibodies in children with diabetes mellitus type 1. J Investig Med 54: 461–467. [DOI] [PubMed] [Google Scholar]
- 45. Byun HR, Kim DK, Koh JY (2011) Obesity and downregulated hypothalamic leptin receptors in male metallothionein-3-null mice. Neurobiol Dis, 44: 125–132. 10.1016/j.nbd.2011.06.012 [DOI] [PubMed] [Google Scholar]
- 46. Nguyen LL, Kriketos AD, Hancock DP, Caterson ID, Denyer GS (2006) Insulin resistance does not influence gene expression in skeletal muscle. J Biochem Mol Biol 39: 457–463. [DOI] [PubMed] [Google Scholar]
- 47. Abel ED, Peroni O, Kim JK, Kim YB, Boss O, Hadro E, et al. (2001) Adipose-selective targeting of the GLUT4 gene impairs insulin action in muscle and liver. Nature 409: 729–733. [DOI] [PubMed] [Google Scholar]
- 48. Bionaz M, Chen S, Khan MJ, and Loor JJ (2013). Functional Role of PPARs in Ruminants: Potential Targets for Fine-Tuning Metabolism during Growth and Lactation. PPAR Res 2013: 684159 10.1155/2013/684159 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Raciti GA, Bera TK, Gavrilova O, Pastan I (2011) Partial inactivation of Ankrd26 causes diabetes with enhanced insulin responsiveness of adipose tissue in mice. Diabetologia 54: 2911–2922. 10.1007/s00125-011-2263-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Sahab ZJ, Hall MD, Zhang L, Cheema AK, Byers SW (2010) Tumor Suppressor RARRES1 Regulates DLG2, PP2A, VCP, EB1, and Ankrd26. J Cancer 1: 14–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Arcioni L, Simeone A, Guazzi S, Zappavigna V, Boncinelli E, Mavilio F (1992) The upstream region of the human homeobox gene HOX3D is a target for regulation by retinoic acid and HOX homeoproteins. EMBO J 11: 265–277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Cowherd RM, Lyle RE, Miller CP, Mcgehee RE Jr (1997) Developmental profile of homeobox gene expression during 3T3-L1 adipogenesis. Biochem Biophys Res Commun 237: 470–475. [DOI] [PubMed] [Google Scholar]
- 53. Wang X, Devaiah SP, Zhang W, Welti R (2006) Signaling functions of phosphatidic acid. Prog Lipid Res 45: 250–278. [DOI] [PubMed] [Google Scholar]
- 54. Loewen CJ, Gaspar ML, Jesch SA, Delon C, Ktistakis NT, Henry SA, et al. (2004) Phospholipid metabolism regulated by a transcription factor sensing phosphatidic acid. Science 304: 1644–1647. [DOI] [PubMed] [Google Scholar]
- 55. Fujiwara Y, Browne CP, Cunniff K, Goff SC, Orkin SH (1996) Arrested development of embryonic red cell precursors in mouse embryos lacking transcription factor GATA-1. Proc Natl Acad Sci U S A 93: 12355–12358. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Aslan O, Hamill RM, Davey G, McBryan J, Mullen AM, Gispert M, et al. (2012) Variation in the IGF2 gene promoter region is associated with intramuscular fat content in porcine skeletal muscle. Mol Biol Rep 39: 4101–4110. 10.1007/s11033-011-1192-5 [DOI] [PubMed] [Google Scholar]
- 57. Keller MP, Choi Y, Wang P, Davis DB, Rabaglia ME, Oler AT, et al. (2008) A gene expression network model of type 2 diabetes links cell cycle regulation in islets with diabetes susceptibility. Genome Res 18:706–716. 10.1101/gr.074914.107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Khandekar MJ, Cohen P, Spiegelman BM (2011) Molecular mechanisms of cancer development in obesity. Nature Reviews Cancer 11: 886–895. 10.1038/nrc3174 [DOI] [PubMed] [Google Scholar]
- 59. Ribes V, Le Roux I, Rhinn M, Schuhbaur B, Dolle P (2009) Early mouse caudal development relies on crosstalk between retinoic acid, Shh and Fgf signalling pathways. Development 136: 665–676. 10.1242/dev.016204 [DOI] [PubMed] [Google Scholar]
- 60. Colca JR, VanderLugt JT, Adams WJ, Shashlo A, McDonald WG, Liang J, et al. (2013) Clinical proof-of-concept study with MSDC-0160, a prototype mTOT-modulating insulin sensitizer. Clin Pharmacol Ther 93: 352–359. 10.1038/clpt.2013.10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Cepica S, Ovilo C, Masopust M, Knoll A, Fernandez A, Lopez A, et al. (2012) Four genes located on a SSC2 meat quality QTL region are associated with different meat quality traits in Landrace x Chinese-European crossbred population. Anim Genet 43: 333–336. 10.1111/j.1365-2052.2011.02252.x [DOI] [PubMed] [Google Scholar]
- 62. Bozaoglu K, Curran JE, Elliott KS, Walder KR, Dyer TD, Rainwater DL, et al. (2006) Association of genetic variation within UBL5 with phenotypes of metabolic syndrome. Hum Biol 78: 147–159. [DOI] [PubMed] [Google Scholar]
- 63. Jowett JB, Elliott KS, Curran JE, Hunt N, Walder KR, Collier GR, et al. (2004) Genetic variation in BEACON influences quantitative variation in metabolic syndrome-related phenotypes. Diabetes 53: 2467–2472. [DOI] [PubMed] [Google Scholar]
- 64. Kim JH, Park KW, Lee EW, Jang WS, Seo J, Shin S, et al. (2014) Suppression of PPARγ through MKRN1-mediated ubiquitination and degradation prevents adipocyte differentiation. Cell Death Differ 21: 594–603. 10.1038/cdd.2013.181 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Romao JM, Jin W, He M, McAllister T, Guan le L (2014) MicroRNAs in bovine adipogenesis: genomic context, expression and function. BMC Genomics 15:137 Available: http://www.biomedcentral.com/content/pdf/1471-2164-15-137.pdf. Accessed 18 May 2014. 10.1186/1471-2164-15-137 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Hausman GJ, Dodson MV, Ajuwon K, Azain M, Barnes KM, Guan LL, et al. (2009) Board-invited review: The biology and regulation of preadipocytes and adipocytes in meat animals. J Anim Sci 87: 1218–1246. 10.2527/jas.2008-1427 [DOI] [PubMed] [Google Scholar]
- 67. Nakajima I, Muroya S, Tanabe R, Chikuni K (2002) Extracellular matrix development during differentiation into adipocytes with a unique increase in type V and VI collagen. Biol Cell 94: 197–203. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All sequencing data are available from the European Nucleotide Archive (ENA) of European Bioinformatics Institute (EBI) (accession number PRJEB6732).