Abstract
Background
Transcription has a substantial genetic control and genetic dissection of gene expression could help us understand the genetic architecture of complex phenotypes such as meat quality in cattle. The objectives of the present research were: 1) to perform eQTL and sQTL mapping analyses for meat quality traits in longissimus dorsi muscle; 2) to uncover genes whose expression is influenced by local or distant genetic variation; 3) to identify expression and splicing hot spots; and 4) to uncover genomic regions affecting the expression of multiple genes.
Results
Eighty steers were selected for phenotyping, genotyping and RNA-seq evaluation. A panel of traits related to meat quality was recorded in longissimus dorsi muscle. Information on 112,042 SNPs and expression data on 8588 autosomal genes and 87,770 exons from 8467 genes were included in an expression and splicing quantitative trait loci (QTL) mapping (eQTL and sQTL, respectively). A gene, exon and isoform differential expression analysis previously carried out in this population identified 1352 genes, referred to as DEG, as explaining part of the variability associated with meat quality traits. The eQTL and sQTL mapping was performed using a linear regression model in the R package Matrix eQTL. Genotype and year of birth were included as fixed effects, and population structure was accounted for by including as a covariate the first PC from a PCA analysis on genotypic data. The identified QTLs were classified as cis or trans using 1 Mb as the maximum distance between the associated SNP and the gene being analyzed. A total of 8377 eQTLs were identified, including 75.6% trans, 10.4% cis, 12.5% DEG trans and 1.5% DEG cis; while 11,929 sQTLs were uncovered: 66.1% trans, 16.9% DEG trans, 14% cis and 3% DEG cis. Twenty-seven expression master regulators and 13 splicing master regulators were identified and were classified as membrane-associated or cytoskeletal proteins, transcription factors or DNA methylases. These genes could control the expression of other genes through cell signaling or by a direct transcriptional activation/repression mechanism.
Conclusion
In the present analysis, we show that eQTL and sQTL mapping makes possible positional identification of gene and isoform expression regulators.
Keywords: Cis effect, Differentially expressed gene, Expression master regulator, Meat quality, Splicing master regulator and trans effect
Background
Little knowledge exists about transcription variation patterns across the genome as well as how much of this variability is under genetic control. Regulatory variation is proposed as a primary factor associated with phenotypic variability [1] and based on some estimates, gene expression can be classified as medium-highly heritable [2]. Both eQTL and sQTL can be classified into cis (local) and trans (distant) effects. A large fraction of human genes is enriched for cis regulation and in some cases, a cis effect is able to explain trans effects associated with its harboring gene. On the other hand, trans regulation is more difficult to identify and explain [1], but it allows for the identification of “hot spots”, which are also known as master regulators, with transcriptional control over a suite of genes usually involved in the same biological pathway [3]. Therefore, trans regulation might be suggested as the primary factor determining phenotypic variation in complex phenotypes [2].
Since transcription has a substantial genetic control, eQTL and sQTL mapping provides information about genetic variant with modulatory effects on gene expression [4] which are useful for understanding the genetic architecture of complex phenotypes. This mapping allows for uncovering of genomic regions associated with transcription regulation of genes which can be related to phenotypic variation when they colocalize with QTLs (cis and trans effects), providing a molecular basis for the phenotype-genotype association [5]. The eQTL and sQTL mapping can also uncover master regulators and suites of genes related to a particular phenotype (trans effect). Using an eQTL approach, Gonzales-Prendes [6] investigated the genetic regulation of porcine genes associated with uptake, transport, synthesis, and catabolism of lipids. About 30% of these genes were regulated by cis- and/or trans-eQTLs and provided a first description of the genetic regulation of porcine lipid metabolism. Steibel et al. [7] identified 62 unique eQTLs in porcine loin muscle tissue and observed strong evidence for local regulation of lipid metabolism-related genes, such as AKR7A2 and TXNDC12. Higgins et al. [8] carried out an eQTL analysis for residual feed intake, average daily gain and feed intake to identify functional effects of GWAS-identified variants. The eQTL analysis allowed them to identify variants useful both for genomic selection of RFI and for understanding the biology of feed efficiency. Genome sequence-based imputation and association mapping identified a cluster of 17 non-coding variants spanning MGST1 highly associated with milk composition traits [9] in cattle. A subsequent eQTL mapping revealed a strong MGST1 eQTL underpinning these effects and demonstrated the utility of RNA sequence-based association mapping.
The objectives of the present research were: 1) to perform eQTL and sQTL mapping analyses for meat quality traits in longissimus dorsi muscle; 2) to uncover genes whose expression is influenced by local or distant genetic variation; 3) to identify expression and splicing hot spots; and 4) to uncover genomic regions affecting the expression of multiple genes (multigenic effects).
Results
On average, 39.8 million paired-end RNA-Seq reads per sample were available for mapping, and out of these, 34.9 million high-quality paired-end RNA-Seq reads were uniquely mapped to the Btau_4.6.1 reference genome. The mean fragment inner distance was equal to 144 ± 64 bps.
Expression QTL mapping
A total of 8377 eQTLs were identified in the present population (Fig. 1). The most frequently identified types of eQTLs were trans (75.6%) followed by cis (10.4%) (Fig. 2a). Only 12.5% of the eQTLs were classified as DEG trans and 1.5% as DEG cis. The majority of SNPs with trans and DEG trans effects were associated with the expression of only one gene (76.2 and 84.0%, respectively).
Expression cis and DEG cis eQTL analysis
A total of 868 cis and 125 DEG cis eQTLs were uncovered. SNPs rs110591035 and rs456174577 were cis eQTLs and were highly associated with expression of LSM2 Homolog, U6 Small Nuclear RNA And MRNA Degradation Associated (LSM2) (p-value = 5.8 × 10− 9) and Sterol O-Acyltransferase 1 (SOAT1) (p-value = 4.4 × 10− 7) genes, respectively. Additional file 1 presents all significant eQTLs based on the effective number of independent tests.
Expression trans and DEG trans eQTL analysis, and master regulators
Twenty-seven SNPs (Table 1) distributed in 22 clusters (Fig. 1) were identified and used to map potential master regulator genes. Figure 3 shows a network for the identified master regulators and their 674 associated genes (Additional file 2). Out of the 27 master regulators, nine membrane-associated proteins, three cytoskeletal proteins, four transcription factors, and one DNA methylase were identified. No clear classification was evident for the remaining 10 genes. Additional file 3 shows least-squares mean plots for SNP effect on transformed gene counts for seven of the identified master regulators.
Table 1.
SNP location | SNP name | Clustera | MAF (%) | Number | % DEG | Harboring gene or closest genes b |
---|---|---|---|---|---|---|
of eQTLs | eQTLs | |||||
1: 119,758,395 | rs378343630 | 1 | 4 | 62 | 0.0 | TM4SF1 |
2: 11,594,176 | rs208227436 | 2 | 3 | 26 | 7.7 | ZNF804A |
2: 25,653,736 | rs211476449 | 3 | 3 | 36 | 13.9 | GAD1 |
3: 102,943,677 | rs135786834 | 4 | 2 | 32 | 0.0 | KDM4A |
5: 27,001,953 | rs441241989 | 5 | 8 | 27 | 18.5 | CSAD |
5: 27,834,250 | rs110130901 | 5 | 8 | 25 | 24.0 | KRT7 |
5: 105,380,442 | rs207649022 | 6 | 24 | 76 | 69.7 | NTF3 - KCNA5 |
7: 92,439,344 | rs110618957 | 7 | 2 | 34 | 8.8 | POLR3G - GPR98 |
8: 95,625,807 | ARS_BFGL_N-GS_65636 | 8 | 3 | 111 | 8.1 | ENSBTAG00000047350 - OR13F1 |
8: 104,345,143 | rs378706947 | 8 | 2 | 22 | 9.1 | ALAD |
10: 8,457,276 | Bovine-HD1000002801 | 9 | 30 | 27 | 74.1 | PDE8B |
11: 46,753,639 | rs211218494 | 10 | 8 | 37 | 13.5 | PSD4 |
11: 46,785,388 | rs209448226 | 10 | 8 | 37 | 13.5 | 5S_rRNA - PAX8 |
13: 54,009,694 | rs135144232 | 11 | 4 | 24 | 8.3 | ENSBTAG00000011638 |
14: 74,732,269 | rs208451702 | 12 | 3 | 24 | 8.3 | RUNX1T1 |
15: 79,202,054 | rs41781450 | 13 | 37 | 20 | 35.0 | OR4X1 - OR4S1 |
15: 79,564,333 | rs109630111 | 13 | 24 | 36 | 2.8 | ENSBTAG00000035487 |
16: 62,544,863 | rs456174577 | 14 | 2 | 36 | 0.0 | TOR1AIP1 |
17: 30,508,078 | BTB_00676236 | 15 | 41 | 34 | 23.5 | INTU - FAT4 |
18: 56,858,212 | rs41891374 | 16 | 5 | 20 | 20.0 | C18H19ORF41 - MYH14 |
18: 57,361,426 | rs383445569 | 16 | 4 | 41 | 17.1 | KLK4 |
18: 61,257,126 | No SNP name | 17 | 49 | 133 | 2.3 | ENSBTAG00000000336 - ENSBTAG00000046961 |
19: 42,754,262 | rs377935001 | 18 | 2 | 34 | 0.0 | TTC25 |
22: 16,367,834 | rs110289782 | 19 | 11 | 24 | 50.0 | ENSBTAG00000030533 - ZNF445 |
26: 12,930,282 | rs42085062 | 20 | 4 | 23 | 26.1 | PCGF5 |
27: 31,921,721 | rs136162903 | 21 | 3 | 25 | 0.0 | KCNU1 |
28: 4,877,558 | rs207999887 | 22 | 5 | 34 | 5.9 | SNORA25 - SIPA1L2 |
aCluster number used in Fig. 1
bBolded genes were selected as master regulators when the associated SNP was intergenic; underlined gene names were identified as expressed in skeletal muscle in the present analysis.
Multigenic effects based on the eQTL analysis
Table 2 shows the number of eQTLs identified by gene where the expression of the top genes seems to be influenced by multiple genomic regions (multigenic effects). The Solute Carrier Family 43 Member 1 (SLC43A1), Unc-51 Like Autophagy Activating Kinase 2 (ULK2), Myosin Light Chain 1 (MYL1), PHD Finger Protein 14 (PHF14), and Enolase 3 (ENO3) are the top five genes based on the number of eQTL regulators.
Table 2.
eQTL analysis | sQTL analysis | ||||
---|---|---|---|---|---|
Gene | N | eQTL Type | Gene | N | sQTL Type |
SLC43A1 | 126 | Trans | TTN | 324 | DEG Trans |
LOC100848703 | 64 | Trans | TXN2 | 99 | Trans |
ULK2 | 43 | Trans | NEB | 63 | DEG Trans |
MYL1 | 40 | Trans | TCEB2 | 43 | Trans |
ENO3 | 36 | Trans | LOC100851645 | 36 | DEG Trans |
PHF14 | 36 | Trans | CREB5 | 33 | DEG Trans |
PKM | 32 | Trans | USF2 | 33 | DEG Trans |
ZBTB12 | 31 | Trans | MYH7 | 28 | Trans |
PGAM2 | 30 | Trans | PON3 | 26 | Trans |
ACTA1 | 28 | Trans | MYOM3 | 26 | Trans |
SNX19 | 25 | Trans | RSPO2 | 25 | Trans |
LDHA | 25 | Trans | METTL22 | 25 | Trans |
RPL5 | 23 | Trans | MAP 3 K14 | 25 | Trans |
ALDH4A1 | 23 | DEG Trans | UBR3 | 25 | Trans |
PLSCR3 | 22 | Trans | PAPD4 | 25 | Trans |
CHURC1 | 22 | Trans | BAZ1A | 24 | Trans |
TNNI2 | 22 | Trans | ITPR1 | 23 | Trans |
GPD1 | 21 | Trans | MYH1 | 23 | Trans |
TMTC2 | 21 | Trans | SVIL | 22 | Trans |
UCK2 | 21 | DEG Trans | ZDHHC4 | 22 | Trans |
LRRC42 | 20 | Trans | FILIP1L | 22 | DEG Trans |
HSPG2 | 21 | Trans | |||
UBR4 | 21 | Trans | |||
KTN1–2 | 21 | Trans | |||
DST | 21 | DEG Trans | |||
MYBPC1 | 20 | Trans |
Splicing QTL mapping
The cis and trans sQTLs identified in the present analysis are presented in Fig. 4 and highlight the effects on DEG. A total of 11,929 sQTLs were uncovered. The most frequently identified type of sQTL was trans (Fig. 2b). Trans, DEG trans, cis and DEG cis effects were identified in 66.1, 16.9, 14.0 and 3.0% of the cases, respectively. The majority of SNPs with trans and DEG trans effects were associated with the expression of only one exon (88.4 and 88.9%, respectively).
Splicing cis and DEG cis analysis
Additional file 1 shows all cis and DEG cis sQTLs uncovered using the effective number of independent tests. Since the number of significant cis sQTLs detected using these thresholds was very high, only associations with a p-value ≤2 × 10− 4 were used for further analysis. A total of 2222 cis sQTLs were identified and two of the most interesting genes are Titin (TTN) and TEK Receptor Tyrosine Kinase (TEK).
Splicing trans and DEG trans sQTL analysis, and master regulators
Out of the 13 splicing master regulator genes identified in the present analysis (Table 3), four encode for proteins located in the extracellular space. Four other genes encode for plasma and/or organelle associated membrane or cytoskeletal proteins, and two other genes encode for transcription factors. Mechanisms associated with splicing regulation for the remaining three master regulators were not evident. A total of 231 genes (Additional file 4) were associated with these 13 master regulators and were included in a regulation network (Additional file 5). The master regulators ZNF804A, ALAD, OR13F1, and ENSBTAG00000000336 were determined simultaneously as expression and splicing master regulators. Markers inside these four genes were able to explain variability in the fraction of exon counts in 28 (ZNF804A), 192 (ALAD), 22 (OR13F1) and 25 (ENSBTAG00000000336) genes across the genome. The most important uncovered master regulators associated with splicing were selected for further discussion.
Table 3.
SNP location | SNP name | Clustera | MAF (%) | Number | % DEG | Harboring gene or closest genesb |
---|---|---|---|---|---|---|
of sQTLs | sQTLs | |||||
1: 144,604,558 | rs381222773 | 1 | 7 | 33 | 9.1 | PDE9A - WDR4 |
2: 11,594,176 | rs208227436 | 2 | 3 | 28 | 17.9 | ZNF804A |
2: 84,792,003 | rs208053623 | 3 | 4 | 21 | 19.0 | DNAH7 |
4: 5,827,343 | rs381476620 | 4 | 42 | 21 | 33.3 | ZPBP - VWC2 |
8: 92,924,658 | rs382101207 | 5 | 3 | 23 | 13.0 | RNF20 |
8: 93,336,078 | BTB_01634267 | 5 | 4 | 20 | 20.0 | PLEKHB2 - SNORA19 |
8: 95,762,113 | rs136343964 | 5 | 3 | 22 | 13.6 | OR13F1 |
8: 104,345,143 | rs378706947 | 6 | 2 | 192 | 27.1 | ALAD |
14: 57,184,022 | rs210798753 | 7 | 2 | 24 | 50.0 | PKHD1L1 |
15: 25,536,733 | rs209617551 | 8 | 2 | 34 | 35.3 | SNORA3 |
15: 35,729,304 | rs382501844 | 9 | 4 | 33 | 9.1 | NUCB2 - ENSBTAG00000032859 |
16: 75,296,157 | rs41821837 | 10 | 4 | 20 | 20.0 | SYT14 - DIEXF |
18: 61,257,126 | 11 | 49 | 25 | 44.0 | ENSBTAG00000000336 - ENSBTAG00000046961 |
aCluster number used in Fig. 4
bBolded genes were selected as master regulators when the associated SNP was intergenic; underlined gene names were identified as expressed in skeletal muscle in the present analysis
Two different clusters were uncovered in the Functional Annotation Clustering analysis using the whole list of regulated genes across clusters (Additional file 6). Some of the identified terms in these clusters were Carbon metabolism, ATP binding and Nucleotide-binding, showing that genes in these clusters might have a complex splicing regulation.
Multigenic effects based on the sQTL analysis
A variety of genes seem to have a complex transcriptional control based on the ratio of exon counts (Table 2) and some of them are: Titin (TTN), Nebulin (NEB), Elongin B (TCEB2), CAMP Responsive Element Binding Protein 5 (CREB5) and Upstream Transcription Factor 2, C-Fos Interacting (USF2).
Discussion
Expression QTL mapping
Expression cis and DEG cis eQTL analysis
LSM2 and SOAT1 harbor some highly significant cis eQTLs. LSM2 binds to other members of the ubiquitous and multifunctional family Sm-like (LSM) in order to form RNA-processing complexes. These complexes are involved in processes such as stabilization of the spliceosomal U6 snRNA, mRNA decay and guide site-specific pseudouridylation of rRNA [10]. Lu et al. [11] identified two missense polymorphisms in SOAT1 associated with cholesterol in plasma and triglyceride levels in mice since they are able to increase enzyme activityG. None of these two genes were identified as DEG, therefore they must be more involved in skeletal muscle homeostasis.
Expression trans and DEG trans eQTL analysis, and master regulators
The 27 master regulators identified in the eQTL analysis could contribute to gene expression control by promoting cell signaling or by direct transcriptional activation/repression mechanisms. A number of structural proteins and transcription regulators were identified as master regulators. Neurotrophin 3 (NTF3), Glutamate Decarboxylase 1 (GAD1), FAT Atypical Cadherin 4 (FAT4), Transmembrane 4 L Six Family Member 1 (TM4SF1), Transmembrane 4 L Six Family Member 1 (TM4SF1) and Keratin 7 (KRT7) encode for transmembrane or cytoskeletal proteins. Zinc Finger Protein 804A (ZNF804A), Paired Box 8 (PAX8), Lysine Demethylase 4A (KDM4A) and RUNX1 Translocation Partner 1 (RUNX1T1 or Myeloid Translocation Gene on 8q22-MTG8) encode for transcription factors or histone demethylases. NTF3, TM4SF1, and KDM4A are further discussed.
NTF3 was identified as a master regulator in the present analysis since rs207649022 was able to explain variation in the expression of 76 genes (Table 1), 69.7% of which were DEG genes (Fig. 3b). Since NTF3 was associated with a number of DEGs, this master regulator was able to explain variability in gene expression associated with meat quality. The Neurotrophic Factor gene family regulates myoblast and muscle fiber differentiation. It also coordinates muscle innervation and functional differentiation of neuromuscular junctions [12]. Mice with only one functional copy of the NTF3 gene showed a smaller cross-sectional fiber area and more densely distributed muscle fibers [13]. Upregulation of NTF3, stimulated by the transcription factor POU3F2, is present during neuronal differentiation [14]. The neocortex has multiple layers originated by cell fate restriction of cortical progenitors and NTF3 induces cell fate switches by controlling a feedback signal between postmitotic neurons and progenitors. Therefore, changes in NTF3 expression can modulate the amount of tissue present in each neocortex layer [15].
NTF3 was identified in a previous study as highly associated with cooking loss [16] pointing out that markers inside this locus are able to explain variation at both the phenotype and gene expression level. This implicates NTF3 as a positional and functional gene with a potential role in meat quality. These effects are probably not due to cis regulation on NTF3 given that the number of reads mapped to this gene was extremely low and it did not surpass the threshold used in order to be included in the DEG analysis (average = 6.7, min = 0; max = 23). However, NTF3 could be actively expressed in earlier developmental stages and then expressed at a basal level, exerting control on expression regulation later on when cellular morphology has been completely established. A Functional Annotation Clustering analysis for the NTF3 regulated genes indicated that the master regulator NTF3 could be involved in the regulation of specific mechanisms and pathways related to Mitochondrion, Transit peptide and Mitochondrion inner membrane (Additional file 6).
The expression of 62 genes was associated with rs378343630, a marker located in the TM4SF1 master regulator. This gene encodes a plasma transmembrane protein and belongs to a gene family involved in signal transduction processes; thus, it modulates development, growth, and motility [17]. TM4SF1 physically interacts with the membrane and some cytoskeleton-associated proteins to form cell projections named ‘nanopodia’ [18], which are described as frequently identified in multiple types of cancer. This gene is highly expressed in pancreatic cancer cells and stimulates metastasis by upregulating Discoidin Domain Receptor Tyrosine Kinase 1 (DDR1), Matrix Metallopeptidase 2 (MMP2) and Matrix Metallopeptidase 9 (MMP9) [19]. In liver, TM4SF1 reduced apoptosis and promoted cell migration by upregulating MMP-2, MMP-9 and VEGF, and downregulating Caspase-3 and Caspase-9 [17]. Upregulation of miR-9 produces downregulation of TM4SF1, MMP2, MMP9 and VEGF in colorectal carcinoma, inhibiting cell migration and invasion [20]. In esophageal cancer stem-like cells, downregulation of miR-141 increases TM4SF1 expression, self-renewal ability and promotes cell invasion [21]. The Functional Annotation Clustering analysis for TM4SF1 found an overrepresentation of the transcription, DNA-templated term (Additional file 6); thus, TM4SF1 could be involved in the regulation of specific mechanisms and pathways associated with transcription in longissimus dorsi muscle. Neither TM4SF1 nor any gene in this cluster was identified as DEG; therefore they might be more related to skeletal muscle homeostasis than meat quality.
The KDM4A cluster has 32 regulated genes associated with rs135786834; KDM4A encodes a histone lysine demethylase able to modify trimethylated H3-K9/K36 to dimethylated products, contributing to gene expression, cellular differentiation and cancer development [22]. Histone H3K9 methylation promotes the silencing of muscle-specific genes in proliferating myoblasts and derepression of these genes is required to initiate muscle differentiation. Expression of a KDM4A isoform named DN-JMJD2A is upregulated during differentiation of myoblasts into myotubes promoting myotube formation and transcriptionally activating muscle-specific genes such as MyoD [23]. The only DEG master regulator identified in the present analysis was KDM4A and this master regulator harbors rs135786834, an SNP associated with expression of 32 genes by trans association. Therefore, changes in the expression of KDM4A did not show evidence of promoting the expression of genes related to meat quality.
Multigenic effects based on the eQTL analysis
Some of the most interesting genes identified in this analysis were ULK2, MYL1, and PHF14. Forty-three SNPs were associated with ULK2 expression. ULK2 encodes a serine/threonine-protein kinase required for autophagy under nutrient-deprived conditions [24]. Downregulation of ULK2 activates mTOR c1 signaling, promoting cell proliferation [25]. The MYL1 gene encodes a fast-twitch regulatory light chain of myosin in skeletal muscle; downregulation of MYL1 alters myocyte morphology and muscle structure, and generates congenital myopathy in zebrafish [26]. A total of 40 and 36 polymorphisms were associated with the expression of MYL1 and PHF14, respectively. PHF14 is ubiquitously expressed and its protein has multiple PHD fingers, a domain present in chromatin-binding proteins which are able to recognize particular epigenetic marks on histone tails. The PHF14 knockout in mice generates neonatal lethality and severe structural changes in multiple organs especially lungs. PHF14 is an epigenetic regulator required for the normal development of multiple organs [27], and it is probably involved in skeletal muscle homeostasis.
Splicing QTL mapping
Splicing cis and DEG cis analysis
The TTN gene harbors a highly significant DEG cis sQTL (p-value = 2.0 × 10− 7) and encodes a central sarcomeric protein. Some TTN mutations are associated with skeletal-muscle diseases such as tibial muscular dystrophy [28]. Fernandez-Marmiesse et al. [29] identified a non-sense mutation in a TTN exon only present in a fetal skeletal isoform and associated with a neuromuscular disorder; histologically, this mutation promotes sarcomeric deposition of a filamentous material. A DEG cis sQTL (p-value = 5.1 × 10− 7) was identified in the TEK gene. This gene encodes a receptor for Angiopoietin-1 (ANGPT1), and its signaling pathway is critical for migration, sprouting and survival of endothelial cells; TEK activates the SHC Adaptor Protein 1 (SHC1), a protein involved in triggering the Ras/mitogen-activated protein kinase pathway, regulating migration and endothelial organization [30]. Therefore, cis sQTLs in TTN and TEK could explain variation in the expression of these genes and variation in meat quality-related phenotypes.
Trans and DEG trans splicing QTL analysis, and master regulators
Similarly, as the identified expression master regulators, the splicing master regulators can be grouped as transcription regulators and structural proteins. Small Nucleolar RNA, H/ACA Box 3A (SNORA3), Small Nucleolar RNA, H/ACA Box 19 (SNORA19), Ring Finger Protein 20 (RNF20), and Zinc Finger Protein 804A (ZNF804A) could be classified as transcription regulators. Phosphodiesterase 9A (PDE9A), Olfactory Receptor Family 13 Subfamily F Member (OR13F1), Dynein Axonemal Heavy Chain 7 (DNAH7) and Von Willebrand Factor C Domain Containing 2 (VWC2) can be identified as structural proteins.
Small non-coding RNAs such as SNORA3 and SNORA19 modulate stability, folding and interaction with proteins and more recently, functions such as mRNA editing, alternative splicing and posttranscriptional gene silencing were discovered [31]. However, no clear function of SNORA3 and SNORA19 is described. Exon expression of 34 exons from 17 genes and 20 exons from 15 genes were associated with the polymorphisms rs209617551 (SNORA3) and BTB_01634267 (SNORA19), respectively.
Expression of 23 exons from 19 genes was associated with rs382101207, an SNP located in Ring Finger Protein 20 (RNF20). Upregulation of RNF20 stimulates H2B monoubiquitination and methylation at H3K4 and H3K79; it promotes expression of Homeobox genes, a group of transcription factors [32]. RNF20 also regulates expression of H2A and H2B histones, p53, several proto-oncogenes and promotes cell migration and tumorigenesis [33]. The RNF20/RNF20 (Bre1 complex) is documented as a tumor suppressor by upregulating a set of tumor suppressor genes and by contributing to genomic stability maintenance. Bre1 deficient cells present a high frequency of DNA double-strand breaks (DSB), and abundant aberrant RNA-DNA structures (R-loops), indicators of replication stress and genomic instability [32].
Pierce et al. [1] theorized that a high proportion of trans associations are caused by cis effects. However, no cis QTL was identified in any expression or splicing master regulator. This result suggests that, in the present analysis, trans effects might contribute significantly to phenotypic variation related to skeletal muscle homeostasis and meat quality.
Multigenic effects based on the sQTL analysis
The large number of sQTLs identified in genes like TTN (324) and NEB (63) could be related to gene size, since these genes are 275 and 219 kb long, respectively, which would increase the probability of being involved in trans regulation. On the other hand, some relatively short genes such as TCEB2 (9.9 kb) and USF2 (3.9 kb) also had a large number of sQTLs (43 and 33, respectively) indicating a possible complex splicing regulation.
A total of 324 and 67 polymorphisms were associated with TTN and NEB ratio exon counts, respectively. TTN and NEB are involved in assembly and mechanical activity of striated muscles. Both proteins are large sarcomere filament-binding proteins expressed in skeletal muscle and multiple splicing events in the bovine homologous are described. In the human brain, NEB acts as an actin filament stabilizer and regulates neuronal length. It is also involved in myofibrillogenesis, modulates thin filament length and allows proper muscle contraction [34]. TTN, NEB, and USF2 were identified as DEG; therefore, sQTL regulation could contribute to phenotypic variability associated with meat quality in longissimus dorsi and skeletal muscle homeostasis.
Gene expression and splicing regulation mechanisms by plasma and organelle associated proteins
The cell cytoskeleton provides cellular mechanical constraints and extracellular matrix stiffness [35]. However, structural proteins are involved in multiple biological processes different from the organizational ones, with signaling and cell fate being some of the most important. Cell signaling is crucial since it orchestrates cellular responses to different microenvironmental stimuli, and transcription repression-activation and splicing regulation are influenced by signaling proteins. A number of receptors, transmembrane linkers, cytoskeletal fibers and membrane-associated transcription factors were previously associated with transcription repression-activation.
The OR4A47, GPR98, PDE9A, OR13F1 and SYT14 master regulators were also described as transmembrane protein-coding genes and this type of molecule is involved in cell signaling processes. Pandey et al. [36] reported that estrogen can signal using diverse receptors, the G Protein-Coupled Estrogen Receptor 1 (GPR30) being one of them. Stimulation of GPR30 by estrogen activates a transcription factor network that upregulates Cellular Communication Network Factor 2 (CCN2), promoting proliferation and cell migration. The master regulators GAD1 and TM4SF1 encode transmembrane linkers similar to the integrin family. Integrins can modulate signal transduction cascades involved in cell survival, proliferation, differentiation and organ development [37]. The dimer ITGA1-ITGB1 can stall Epidermal Growth Factor Receptor (EGFR) signaling by stimulating Protein Tyrosine Phosphatase, Non-Receptor Type 2 (PTPN2). The cytoplasmic domain of ITGA1 interacts with PTPN2 and decreases EGFR phosphorylation after Epidermal Growth Factor (EGF) stimulation [38].
The cytoskeletal protein-coding genes KRT7, FAT4, MYH14, and DNAH7 were identified as master regulators. Some cytoskeletal proteins might drive transcription regulation and promote cellular mechanisms such as growth and apoptosis. Flouriot et al. [35] reported that the actin network can regulate Myocardin Related Transcription Factor A (MRTFA) subcellular localization, a protein involved in growth-quiescence switch. High F/G actin ratio or mutant MRTFA cells showed higher global biosynthetic activity and open chromatin state, associated with extensive histone modifications. In Drosophila, Hippo tumor suppressor pathway controls organ size, and proteins such as Yorkie (human homologous Yes Associated Protein 1 -YAP), a transcriptional coactivator, and Hpo and Warts kinases (human homologous Serine/Threonine Kinase 3 -STK3- and Large Tumor Suppressor Kinase 1 -LATS1, respectively) belong to this pathway. YAP is negatively regulated by STK3 and LATS1. F-actin accumulation promotes overgrowth in Drosophila imaginal discs through modulating the activity of the Hippo pathway [39].
Applicability of the present results and future analysis
The present results provide biological support to some of the previously identified pQTLs related to complex phenotypes in cattle and could contribute to discovery of potential causative polymorphisms. pQTL and eQTL colocalization for NTF3 (cooking loss) and GPR98 (tenderness) was evident in the present population [16]; however, more research is required in order to be able to determine if these genes harbor actual causative markers associated with meat quality. The use of causative polymorphisms in genomic prediction is the ideal scenario since it is not affected by recombination events between the actual pQTL and the marker being genotyped, over time. In this respect, research showed that polymorphisms associated with expression regulation such as eQTLs and sQTLs can explain an important proportion of the genetic variance present on complex phenotypes in cattle.
Lopdell et al. [4] identified a set of 3695 distinct eQTL variants for milk, fat and protein yield and showed that they have increased the predictive ability for milk composition related phenotypes. DGAT1, MGST1, and GPAT4 were identified as the most highly predictive regions. A 1 Mbp region nearby DGAT1 harbors three polymorphisms that are able to explain a high amount of the SNP variance in the set. Xiang et al. [40] classified 17,669,372 imputed variants into 30 sets of markers. This classification included categories such as inter-species conserved markers, polymorphisms associated with metabolic traits (several milk metabolites), expression regulation related polymorphisms (gene and exon expression QTLs, sQTLs, and allele-specific expression QTLs), and markers with evolutionary roles. An index was constructed for each marker using the amount of genetic variance explained by them for a total of 34 complex traits in cattle. Conserved markers, polymorphisms associated with metabolic traits and expression regulation related markers were able to explain the highest amount of genetic variance. Later, this index was applied to another population composed of 7551 individuals and it was determined that high ranking variants significantly increased genetic variance estimates and genomic prediction accuracies for milk, fat and protein yield.
However, other research has found difficult to illustrate the potential use of eQTL and sQTL mapping on the predictive ability for complex phenotypes. The research of Berg et al. [41] was focused on identifying pQTLs caused by eQTL for milk, fat and protein yield, and calving interval. No strong evidence of association between pQTL and eQTL effects were evident.
The results reported by Berg et al. [41] could indicate that most eQTLs are able to explain a very small fraction of the variance associated to pQTLs; however, it is important to highlight that lack of power for eQTL effect estimation and long-range LD could contribute the difficulty of identifying pQTLs and eQTL colocalization. Additionally, the relationship between pQTL and eQTL effect could be dependent on the genetic architecture of the phenotype being assessed and its degree of transcriptional control. In this respect, Lopdell et al., [4] noticed that predictions for milk, fat and protein yield using eQTL variants did not surpass R2, of 0.5 since all the QTL effects present in these traits are not due to expression effects. Furthermore, eQTLs in related tissues or eQTLs present at different stages of development could contribute to these phenotypes as well.
In order to identify causative polymorphisms, the present results require validation through eQTL and sQTL mapping on additional populations with Angus, Brahman and mixed breed composition. After validation, candidate genes would also need to be confirmed using in-vitro and in-vivo analysis. For the assessment of proteins described as eQTL and sQTL associated transcription factors, techniques such as Electrophoretic mobility shift assay (EMSA) and Chip-seq could be used in order to identify actual DNA-protein interaction able to regulate gene expression of the potential target genes. To support eQTL and sQTL master regulator activity for structural proteins able to activate signaling cascades and gene expression, knockout and knockdown trials could verify if these proteins could module this biological activity. Finally, to identified cis regulations, reporter gene experiments can be used.
Conclusions
The mapping analysis performed in this study provides a holistic insight into the regulatory network architecture in longissimus dorsi muscle in an Angus-Brahman population.
Multiple cis eQTLs and sQTLs effects were identified and genes such as LSM2, SOAT1, TTN and TEK are a few examples of potential expression and splicing regulatory genes. A total of 27 expression and 13 splicing master regulator genes were uncovered, mainly cytoskeletal or membrane-associated proteins, transcription factors and DNA methylases. Cytoskeletal proteins provide mechanical constraints to the cell, but they are also involved in processes such as signaling. Signaling is crucial since it coordinates cellular responses to different stimuli, and transcription repression-activation and splicing regulation are influenced by structural proteins. The ZNF804A, ALAD, OR13F1 and ENSBTAG00000000336 genes were identified as both expression and splicing master regulators.
It is shown that eQTL and sQTL mapping makes possible positional identification of potential expression and splicing master regulators. The present analysis identified master regulators associated with gene and isoform expression in skeletal muscle but was also focused on uncovering master regulators related to genes whose expression is able to explain variability in meat quality-related phenotypes (DEG genes) in cattle. The genes PDE8B, NTF3, ZNF445, and OR4S1 can be highlighted as eQTL master regulators associated with a high proportion of DEG genes. The sQTL master regulators PKHD1L1, ENSBTAG00000000336, SNORA3, and VWC2 were the regulators most frequently associated with DEG genes. These master regulators could contribute to phenotypic variability through modulating the expression of key genes whose expression is able to explain variability in the complex meat quality phenotype.
Methods
Cattle population and phenotypic data
The University of Florida Institutional Animal Care and Use Committee number 201003744 approved the present research protocol. A total of 120 steers from the University of Florida Beef Unit multibreed Angus-Brahman herd born between 2013 and 2014 were used in this study [42]. This population can be classified into six different groups based on breed composition. In terms of Angus composition, the grouping was the following: 1 = 100 to 80%; 2 = 79 to 65%; 3 = 64 to 60% (Brangus); 4 = 59 to 40%; 5 = 39 to 20%; 6 = 19 to 0% [42].
These animals were kept with their dams on bahiagrass pastures (Paspalum notatum) until weaning and received a complete mineral supplement (UF University Special Hi-Cu Mineral, University of Florida, Gainesville, Florida), and bermudagrass (Cynodon dactylon) hay and cotton-seed (Gossypium spp.) meal in the winter months (mid-December to mid-March). The calves were kept on bahiagrass pastures and fed bahiagrass hay, concentrate (1.6–3.6 kg of soy hull pellets per day; 14.0% CP; 488 Pellet Medicated Weaning Ration, Lakeland Animal Nutrition, Lakeland, Florida) and a mineral supplement until yearling.
Yearling steers were transported to a contract feeder (2014: Suwannee Farms, O Brien, Florida; 2015: Quincey Farms, Chiefland, Florida), where they were provided a standard feedlot diet based on corn, protein, vitamins, and minerals until they reached a subcutaneous fat thickness over the ribeye of approximately 1.27 cm [43]. Cattle were transported to a commercial processing facility (FPL Food LLC., Augusta, Georgia) 1 day prior to harvest. Steers were harvested under USDA-FSIS inspection using captive bolt stun. The average slaughter weight was 573.34 ± 54.79 kg at 12.91 ± 8.69 months. After splitting the carcass, five to ten g of the longissimus dorsi muscle was collected, snapped-frozen in liquid nitrogen and stored at − 80 °C until RNA was extracted.
Phenotypes recorded on these steers included tenderness, connective tissue and juiciness determined by a sensory panel, and marbling, cooking loss and WBSF according to the American Meat Science Association Sensory Guidelines [44]. Marbling was assessed on the ribeye muscle at the 12th/13th rib interface after ribbing the carcass and was recorded on a numerical scale by visual appraisal 48 h postmortem. The grading was as follows: Practically Devoid = 100–199, Traces = 200–299, Slight = 300–399, Small = 400–499, Modest = 500–599, Moderate = 600–699, Slightly Abundant = 700–799, Moderately Abundant = 800–899, Abundant = 900–999.
From each animal, two 2.54 cm steaks from the 12th/13th rib interface of the longissimus dorsi muscle were collected, aged for 14 days at 4 °C, and stored at − 20 °C at the Meat Science Laboratory of the University of Florida. Frozen steaks were allowed to thaw at 4 °C for 24 h and cooked to an internal temperature of 71 °C on an open-hearth grill.
After cooking, the first steak was cooled at 4 °C for 18 to 24 h and six cores with a 1.27-cm diameter and parallel to the muscle fiber were sheared with a Warner-Bratzler head attached to an Instron Universal Testing Machine (model 3343; Instron Corporation, Canton, MA). The Warner-Bratzler head moved at a cross head speed of 200 mm/min. The average peak load (kg) of six cores from the same animal was analyzed. The weight lost during cooking was recorded and cooking loss was expressed as a percentage of the cooked weight out of the thaw weight. The second steak was cooked and assessed by the sensory panel. The sensory panel consisted of eight to 11 trained members, and six animals were assessed per session. Two 1 × 2.54 cm samples from each steak were provided to each panelist. Sensory panel measurements analyzed by the sensory panelists included: tenderness (8 = extremely tender, 7 = very tender, 6 = moderately tender, 5 = slightly tender, 4 = slightly tough, 3 = moderately tough, 2 = very tough, 1 = extremely tough), juiciness (8 = extremely juicy, 7 = very juicy, 6 = moderately juicy, 5 = slightly juicy, 4 = slightly dry, 3 = moderately dry, 2 = very dry, 1 = extremely dry), and connective tissue (8 = none detected, 7 = practically none, 6 = traces amount, 5 = slight amount, 4 = moderate amount, 3 = slightly abundant, 2 = moderately abundant, 1 = abundant amount). For each phenotype, the average score by steak from all members of the panel was analyzed.
Marbling, WBSF, cooking loss, juiciness, tenderness and connective tissue were included in a principal component (PC) analysis using PROC FACTOR procedure of SAS [45], and a composited meat quality index for each animal was constructed using the first three PCs. The meat quality index was determined using the following formula:
Where PCSij is the PC score of the animal i for the PCj and PCWj is the weight of the PCj (eigenvalue). The amount of variance explained by PC1, PC2 and PC3 were 44.26, 20.04 and 13.29%, respectively. The 120 animals were ranked using the meat quality index and 80 animals with extreme values were selected and used for RNA sequencing.
Genotyping and data quality control
Genomic DNA was extracted from blood using the DNeasy Blood & Tissue kit (Qiagen, Valencia, CA) and stored at − 20 °C. All animals were genotyped with the commercial GGP Bovine F-250 chip (GeneSeek, Inc., Lincoln, NE) which contains 221,077 single nucleotide polymorphisms (SNPs). After excluding markers with a minor allele frequency lower than 3% (at least 2 animals out of 80 for the less frequent genotype) and a calling rate < 0.9, a total of 112,042 SNPs were included in the association analysis. Quality control was implemented with JMP genomics 6.0 software [46]. The genotype data is available in the European Variation Archive website, accession number PRJEB24746.
RNA extraction, RNA-seq library preparation and sequencing
Total RNA was extracted from muscle using TRIzol reagent (Thermo Fisher Scientific, Waltham, MA, USA) according to the manufacturer’s protocol (Invitrogen, catalog no. 15596–026). RNA concentration was measured by NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA) and integrity was verified by formaldehyde gel. The mRNA samples were stored at − 80 °C. Total RNA samples were sent to RAPiD Genomics LLC (Gainesville, Florida, United States) for mRNA isolation, RNA-seq library preparation and sequencing procedures. mRNA isolation was performed using oligo-dT attached magnetic beads prior to its reverse transcription and synthesis of double-stranded cDNA. One RNA-seq library for each sample was constructed, multiplexed, and sequenced based on protocols of Illumina HiSeq 3000 PE100 platform (Illumina, San Diego, CA, USA). All samples were sequenced on 8 lanes, generating 2 × 101 nts paired-end reads. RNA-seq data are available at the European Nucleotide Archive, accession number PRJEB31379.
Read quality control, paired-end read alignment and paired-end read counting
The pipeline described by Korpelainen et al. (2014) [47] was used to generate an index for the Btau_4.6.1 reference genome and to create a gene, exon and isoform expression files. Tophat 2.1.0 [48], Bowtie2 2.3.4 [49], Picard [50] and samtools [51] were used to generate the Btau_4.6.1 index. Eight forward and eight reverse FASTQ files per sample were concatenated in separated FASTQ files and analyzed with FastQC 0.9.6 [52] to check quality of the raw sequence reads. Read trimming was performed with PRINSEQ 0.20.4 software [53] using 3 bp sliding windows and 20 as phred threshold. Reads with more than 2 ambiguous bases were excluded from the analysis. Cutadapt version 1.8.1 software [54] was used to trim adapters and reads shorter than 50 nts were excluded.
Tophat 2.1.0 [48] and Bowtie2 2.3.4 [49] were used to perform paired-end read mapping against the Btau_4.6.1 reference genome [55]. HTSeq 0.9.1 software [56] was used to estimate gene paired-end read counts for all annotated genes, including paired-end reads uniquely mapped to known chromosomes. Cufflinks 2.2.1.1 [57, 58] was used to assemble transcripts and estimate transcript abundance in FPKM (Fragments Per Kilobase of exon per Million fragments mapped). Exon counts per gene were determined using the RNA-Sequencing differential expression analysis pipeline DEXSeq [59]. Genes and exons with less than 10 counts across all 80 samples were excluded from the analysis. Indexing and sorting of the alignment files were performed using Samtools 1.9 software [51].
Differentially expressed genes, exons and isoforms associated with meat quality
Differential expression analysis was performed to identify genes, exons and isoforms whose expression was associated with meat quality. The procedures described by Seo et al. [60], Love et al. [61] and Jia et al. [62] were used to identify differential expression. Genes and exons with less than 10 counts, and isoforms with less than 10 FPKM across samples were excluded from the analysis.
The R package edgeR [63] was used to obtain normalized gene counts by applying the trimmed mean of M-values (TMM) normalization method. The R packages sfsmisc and MASS [64–66] were used to apply Huber’s M-estimator based robust regression including all 80 samples used for RNA sequencing. The meat quality index was used as a response variable. Gene expression was treated as a covariate and year of birth of the animal as fixed effects. A PCA analysis was carried out with the “PCA for population structure” work-flow of JMP [46], and population structure was accounted for by including its first PC as covariate in the model. Genes whose association test had a p-value lower than 0.05 were included in the DEG list. The same analysis was performed for exon expression, and genes with at least three associated exons were included in the DEG list.
Out of the 80 samples used for RNA sequencing, 40 (20 high and 20 low performance based on WBSF, tenderness or marbling) were included in the DEG analysis. The R package DESeq2 version 1.20.0 [61] was used to identify DEG genes, including year of birth, breed group and a categorical classification of each animal based on phenotype as fixed effects in the analysis. The categorical classification was as follows: tender vs tough using WBSF or tenderness and high vs low using marbling. Genes with a Benjamini-Hochberg adjusted p-value lower than 0.05 were determined as DEG for WBSF and lower than 0.1 as DEG for tenderness and marbling. The DEG isoform analysis was performed with MetaDiff [62]. Breed group, year of birth, and the same categorical classification based on phenotype fitted in the DESeq2 analysis were included as fixed effects in the association model.
A total of 8799 genes, 93,349 exons, and 4471 isoforms from 957 genes were included in the DEG analysis. Expression of 1352 genes was identified as associated with meat quality traits using the differential expression analysis (Additional file 7).
eQTL and sQTL mapping
The R package Matrix eQTL was used to perform the QTL mapping [67] using 112,042 SNPs and 8588 genes (eQTL mapping) or 87,770 exons from 8467 genes (sQTL mapping) located in autosomes. A linear regression model was used where the SNP genotypes were coded as 0, 1 or 2. For the eQTL analysis, gene counts were transformed using the tool varianceStabilizingTransformation from the R package DESeq2 version 1.20.0 [61] in order to solve heteroscedasticity [8]. In the sQTL analysis, we used the fraction of counts mapped to a specific exon out of the total counts mapped to its gene [68]. This fraction was converted to an integer value by keeping three decimal digits and multiplying by 1000 and then transformed using the tool varianceStabilizingTransformation. Gene and fraction exon counts were included as response variables, and SNP genotype and year of birth of the animal as fixed effects. The first PC from the “PCA for population structure” work-flow of JMP [46] was included as a covariate in the model to control for population structure. A cis QTL was defined as an SNP located no more than 1 Mb upstream of the transcription start site or downstream of the transcription end site of an annotated gene, and cis and trans QTLs were analyzed separately.
Bonferroni trans and cis p-value thresholds were calculated using the effective number of independent tests implemented in the R function “simpleM_Ex” [69]. For the trans associations, the total number of tests was 112,042, and 42,246 was its corresponding effective number of independent tests. Therefore, the p-value corrected for multiple testing for the trans effects was equal to 1.18 × 10− 6 for both, trans eQTLs and sQTLs. However, in order to maximize the number of eQTLs and sQTLs hotspots a less stringent threshold was used. The final trans association thresholds used for eQTLs and sQTLs were 1 × 10− 5 and 1 × 10− 6, respectively. An effective number of independent tests per each gene was calculated in order to determine cis p-value thresholds. An in-house script written in Java was used to group all SNPs by gene and to generate the file inputs for the R function “simpleM_Ex” [70]. The Bonferroni cis p-value thresholds are presented in the Additional file 8. However, since the number of cis sQTLs was very high using these thresholds, a more stringent threshold was implemented. The final cis sQTL association threshold was 2 × 10− 4.
Polymorphisms associated with expression of at least 20 genes in the case of eQTL and at least 20 exons in the case of sQTL were considered hot spots. The harboring gene or the adjacent gene in which biological function was somewhat related to transcription regulation was defined as master regulators.
Functional annotation clustering analysis
A functional classification analysis using DAVID Bioinformatic Resources 6.8 server [71] was performed for each cluster of genes associated with a master regulator.
Supplementary information
Acknowledgements
Not Applicable.
Abbreviations
- DEG
Differentially expressed gene
- eQTL
Expression quantitative trait loci
- pQTL
Phenotypic quantitative trait loci
- SNP
Single nucleotide polymorphism
- sQTL
Splicing quantitative trait loci
Authors’ contributions
JL conducted all analyses and drafted the manuscript; ME assisted with the analysis and manuscript; RM conceived and assisted with the analyses and manuscript. All authors read and approved the final manuscript. All authors read and approved the final manuscript.
Funding
Financial support provided by Florida Agricultural Experiment Station Hatch FLA-ANS-005548, Florida Beef Council, and Florida Beef Cattle Association – Beef Enhancement Fund Award 022962. The funders were not involved in the study design or collection, analysis, or interpretation of the data.
Availability of data and materials
Genotype data are available on the EVA website, accession number PRJEB24746. https://www.ebi.ac.uk/ena/data/view/PRJEB24746. RNA-seq data are available at the European Nucleotide Archive, accession number PRJEB31379, https://www.ebi.ac.uk/ena/data/search?query=PRJEB31379.
Ethics approval and consent to participate
The research protocol was approved by the University of Florida Institutional Animal Care and Use Committee number 201003744.
Consent for publication
Not Applicable.
Competing interests
The authors declare that they have no competing interests.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Joel D. Leal-Gutiérrez, Email: joelleal@ufl.edu
Mauricio A. Elzo, Email: maelzo@ufl.edu
Raluca G. Mateescu, Email: raluca@ufl.edu
Supplementary information
Supplementary information accompanies this paper at 10.1186/s12864-020-6520-5.
References
- 1.Pierce Brandon L., Tong Lin, Chen Lin S., Rahaman Ronald, Argos Maria, Jasmine Farzana, Roy Shantanu, Paul-Brutus Rachelle, Westra Harm-Jan, Franke Lude, Esko Tonu, Zaman Rakibuz, Islam Tariqul, Rahman Mahfuzar, Baron John A., Kibriya Muhammad G., Ahsan Habibul. Mediation Analysis Demonstrates That Trans-eQTLs Are Often Explained by Cis-Mediation: A Genome-Wide Analysis among 1,800 South Asians. PLoS Genetics. 2014;10(12):e1004818. doi: 10.1371/journal.pgen.1004818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Stranger BE, Nica AC, Forrest MS, Dimas A, Bird CP, Beazley C, et al. Population genomics of human gene expression. Nat Genet. 2007;39:1217–1224. doi: 10.1038/ng2142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Liu H, Luo X, Niu L, Xiao Y, Chen L, Liu J, et al. Distant eQTLs and non-coding sequences play critical roles in regulating gene expression and quantitative trait variation in maize. Mol Plant. 2017;10:414–426. doi: 10.1016/j.molp.2016.06.016. [DOI] [PubMed] [Google Scholar]
- 4.Lopdell T, Tiplady K, Littlejohn M. Proceedings of the World Congress on genetics applied to livestock production. 2018. Using RNAseq data to improve genomic selection in dairy cattle. [Google Scholar]
- 5.Khokhar W, Hassan MA, Reddy ASN, Chaudhary S, Jabre I, Byrne LJ, et al. Genome-wide identification of splicing quantitative trait loci (sQTLs) in diverse ecotypes of Arabidopsis thaliana. Front Plant Sci. 2019;10:1–13. doi: 10.3389/fpls.2019.01160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.González-Prendes R, Quintanilla R, Amills M. Investigating the genetic regulation of the expression of 63 lipid metabolism genes in the pig skeletal muscle. Anim Genet. 2017;48:606–610. doi: 10.1111/age.12586. [DOI] [PubMed] [Google Scholar]
- 7.Steibel Juan Pedro, Bates Ronald O., Rosa Guilherme J. M., Tempelman Robert J., Rilington Valencia D., Ragavendran Ashok, Raney Nancy E., Ramos Antonio Marcos, Cardoso Fernando F., Edwards David B., Ernst Catherine W. Genome-Wide Linkage Analysis of Global Gene Expression in Loin Muscle Tissue Identifies Candidate Genes in Pigs. PLoS ONE. 2011;6(2):e16766. doi: 10.1371/journal.pone.0016766. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Higgins MG, Fitzsimons C, McClure MC, McKenna C, Conroy S, Kenny DA, et al. GWAS and eQTL analysis identifies a SNP associated with both residual feed intake and GFRA2 expression in beef cattle. Sci Rep. 2018;8:1–12. doi: 10.1038/s41598-018-32374-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Littlejohn MD, Tiplady K, Fink TA, Lehnert K, Lopdell T, Johnson T, et al. Sequence-based association analysis reveals an MGST1 eQTL with pleiotropic effects on bovine Milk composition. Sci Rep. 2016;6:1–14. doi: 10.1038/srep25376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Fernandez C, Pannone B, Chen X, Fuchs G, Wolin S. Rab11 in Recycling Endosomes Regulates the Sorting and Basolateral Transport of E-Cadherin. Mol Biol Cell. 2004;15:2842–2852. doi: 10.1091/mbc.e04-10-0867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lu Z, Yuan Z, Miyoshi T, Wang Q, Su Z, Chang CC. Identification of Soat1 as a quantitative trait locus gene on mouse chromosome 1 contributing to hyperlipidemia. PLoS One. 2011;6:1–8. doi: 10.1371/journal.pone.0025344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Chevrel G, Hohlfeld R, Sendtner M. The role of neurotrophins in muscle under physiological and pathological conditions. Muscle Nerve. 2006;33:462–476. doi: 10.1002/mus.20444. [DOI] [PubMed] [Google Scholar]
- 13.Sheard PW, Bewick GS, Woolley AG, Shaw J, Fisher L, Fong SW, et al. Investigation of neuromuscular abnormalities in neurotrophin-3-deficient mice. Eur J Neurosci. 2010;31:29–41. doi: 10.1111/j.1460-9568.2009.07032.x. [DOI] [PubMed] [Google Scholar]
- 14.Lin YMJ, Hsin IL, Sun HS, Lin S, Lai YL, Chen HY, et al. NTF3 is a novel target gene of the transcription factor POU3F2 and is required for neuronal differentiation. Mol Neurobiol. 2018;55:8403–8413. doi: 10.1007/s12035-018-0995-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Parthasarathy S, Srivatsa S, Nityanandam A, Tarabykin V. Ntf3 acts downstream of Sip1 in cortical postmitotic neurons to control progenitor cell fate through feedback signaling. Development. 2014;141:3324–3330. doi: 10.1242/dev.114173. [DOI] [PubMed] [Google Scholar]
- 16.Leal-Gutiérrez JD, Elzo MA, Johnson DD, Hamblen H, Mateescu RG. Genome wide association and gene enrichment analysis reveal membrane anchoring and structural proteins associated with meat quality in beef. BMC Genomics. 2019;20:151. doi: 10.1186/s12864-019-5518-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Huang YK, Fan XG, Qiu F. TM4SF1 promotes proliferation, invasion, and metastasis in human liver cancer cells. Int J Mol Sci. 2016;17:1–19. doi: 10.3390/ijms17050661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Zukauskas A, Merley A, Li D, Ang LH, Sciuto TE, Salman S, et al. TM4SF1: a tetraspanin-like protein necessary for nanopodia formation and endothelial cell migration. Angiogenesis. 2011;14:345–354. doi: 10.1007/s10456-011-9218-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yang JC, Zhang Y, He SJ, Li MM, Cai XL, Wang H, et al. TM4SF1 promotes metastasis of pancreatic Cancer via regulating the expression of DDR1. Sci Rep. 2017;7:3–10. doi: 10.1038/srep45895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Park Y, Lee S, Kim S, Liu Y, Lee M, Shin J, et al. MicroRNA-9 suppresses cell migration and invasion through downregulation of TM4SF1 in colorectal cancer. Int J Oncol. 2016;48:2135–2413. doi: 10.3892/ijo.2016.3430. [DOI] [PubMed] [Google Scholar]
- 21.Xue L, Yu X, Jiang X, Deng X, Mao L, Guo L, et al. TM4SF1 promotes the self-renewal of esophageal cancer stem-like cells and is regulated by miR-141. Oncotarget. 2017;8:19274–19284. doi: 10.18632/oncotarget.13866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Guerra-Calderas L, González-Barrios R, Herrera LA, de León DC, Soto-Reyes E. The role of the histone demethylase KDM4A in cancer. Cancer Genet. 2015;208:215–224. doi: 10.1016/j.cancergen.2014.11.001. [DOI] [PubMed] [Google Scholar]
- 23.Verrier Laure, Escaffit Fabrice, Chailleux Catherine, Trouche Didier, Vandromme Marie. A New Isoform of the Histone Demethylase JMJD2A/KDM4A Is Required for Skeletal Muscle Differentiation. PLoS Genetics. 2011;7(6):e1001390. doi: 10.1371/journal.pgen.1001390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Lee EJ, Tournier C. The requirement of uncoordinated 51-like kinase 1 (ULK1) and ULK2 in the regulation of autophagy. Autophagy. 2011;7:689–695. doi: 10.4161/auto.7.7.15450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Jung CH, Seo M, Otto NM, Kim DH. ULK1 inhibits the kinase activity of mTORC1 and cell proliferation. Autophagy. 2011;7:1212–1221. doi: 10.4161/auto.7.10.16660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ravenscroft G, Zaharieva I, Bortolotti CA, Lambrughi M, Pignataro M, Borsari M, et al. Bi-allelic mutations in MYL1 cause a severe congenital myopathy. Hum Mol Genet. 2018;27:4263–4272. doi: 10.1093/hmg/ddy320/5095322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Huang Q, Zhang L, Wang Y, Zhang C, Zhou S, Yang G, et al. Depletion of PHF14, a novel histone-binding protein gene, causes neonatal lethality in mice due to respiratory failure. Acta Biochim Biophys Sin Shanghai. 2013;45:622–633. doi: 10.1093/abbs/gmt055. [DOI] [PubMed] [Google Scholar]
- 28.Hackman P, Vihola A, Haravuori H, Marchand S, Sarparanta J, de Seze J, et al. Tibial Muscular Dystrophy Is a Titinopathy Caused by Mutations in TTN, the Gene Encoding the Giant Skeletal-Muscle Protein Titin. Am J Hum Genet. 2002;71:492–500. doi: 10.1086/342380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Fernández-Marmiesse A, Carrascosa-Romero MC, Alfaro Ponce B, Nascimento A, Ortez C, Romero N, et al. Homozygous truncating mutation in prenatally expressed skeletal isoform of TTN gene results in arthrogryposis multiplex congenita and myopathy without cardiac involvement. Neuromuscul Disord. 2017;27:188–192. doi: 10.1016/j.nmd.2016.11.002. [DOI] [PubMed] [Google Scholar]
- 30.Audero E, Cascone I, Maniero F, Napione L, Arese M, Lanfrancone L, et al. Adaptor ShcA protein binds tyrosine kinase Tie2 receptor and regulates migration and sprouting but not survival of endothelial cells. J Biol Chem. 2004;279:13224–13233. doi: 10.1074/jbc.M307456200. [DOI] [PubMed] [Google Scholar]
- 31.Bratkovič T, Rogelj B. The many faces of small nucleolar RNAs. Biochim Biophys Acta Gene Regul Mech. 2014;1839:438–443. doi: 10.1016/j.bbagrm.2014.04.009. [DOI] [PubMed] [Google Scholar]
- 32.Chernikova SB, Razorenova OV, Higgins JP, Sishc BJ, Nicolau M, Dorth JA, et al. Deficiency in mammalian histone H2B ubiquitin ligase Bre1 (Rnf20/Rnf40) leads to replication stress and chromosomal instability. Cancer Res. 2012;72:2111–2119. doi: 10.1158/0008-5472.CAN-11-2209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Shema E, Tirosh I, Aylon Y, Huang J, Ye C, Moskovits N, et al. The histone H2B-specific ubiquitin ligase RNF20/hBREl acts as a putative tumor suppressor through selective regulation of gene expression. Genes Dev. 2008;22:2664–2676. doi: 10.1101/gad.1703008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bang ML, Li X, Littlefield R, Bremner S, Thor A, Knowlton KU, et al. Nebulin-deficient mice exhibit shorter thin filament lengths and reduced contractile function in skeletal muscle. J Cell Biol. 2006;173:905–916. doi: 10.1083/jcb.200603119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Flouriot G, Huet G, Boujrad N, Demay F, Pakdel F, Michel D. The actin/MKL1 signalling pathway influences cell growth and gene expression through large-scale chromatin reorganization and histone post-translational modifications. Biochem J. 2014;461:257–268. doi: 10.1042/BJ20131240. [DOI] [PubMed] [Google Scholar]
- 36.Pandey DP, Lappano R, Albanito L, Madeo A, Maggiolini M, Picard D. Estrogenic GPR30 signalling induces proliferation and migration of breast cancer cells through CTGF. EMBO J. 2009;28:523–532. doi: 10.1038/emboj.2008.304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Zhang Y, Wang H. Integrin signalling and function in immune cells. Immunology. 2012;135:268–275. doi: 10.1111/j.1365-2567.2011.03549.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Mattila E, Pellinen T, Nevo J, Vuoriluoto K, Arjonen A, Ivaska J. Negative regulation of EGFR signalling through integrin-α1β1-mediated activation of protein tyrosine phosphatase TCPTP. Nat Cell Biol. 2004;7:78–85. doi: 10.1038/ncb1209. [DOI] [PubMed] [Google Scholar]
- 39.Sansores-Garcia L, Bossuyt W, Wada KI, Yonemura S, Tao C, Sasaki H, et al. Modulating F-actin organization induces organ growth by affecting the hippo pathway. EMBO J [internet] Nat Publ Group. 2011;30:2325–2335. doi: 10.1038/emboj.2011.157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Xiang R, van den Berg I, MacLeod IM, Hayes BJ, Prowse-Wilkins CP, Wang M, et al. Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits. Proc Natl Acad Sci. 2019;116:19398–19408. doi: 10.1073/pnas.1904159116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Van Den Berg I, Hayes BJ, Chamberlain AJ, Goddard ME. Overlap between eQTL and QTL associated with production traits and fertility in dairy cattle. BMC Genomics. 2019;20:1–18. doi: 10.1186/s12864-018-5379-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Elzo M.A., Mateescu R., Thomas M.G., Johnson D.D., Martinez C.A., Rae D.O., Wasdin J.G., Driver M.D., Driver J.D. Growth and reproduction genomic-polygenic and polygenic parameters and prediction trends as Brahman fraction increases in an Angus-Brahman multibreed population. Livestock Science. 2016;190:104–112. doi: 10.1016/j.livsci.2016.06.011. [DOI] [Google Scholar]
- 43.Elzo MA, Mateescu RG, Johnson DD, Scheffler TL, Scheffler JM, Carr C, et al. Genomic-polygenic and polygenic predictions for nine ultrasound and carcass traits in Angus-Brahman multibreed cattle using three sets of genotypes. Livest Sci. 2017;202:58–66. doi: 10.1016/j.livsci.2017.05.027. [DOI] [Google Scholar]
- 44.Belk KE, Dikeman ME, Calkins CR, Andy King D, Shackelford SD, Hale D, et al. Research Guidelines for Cookery, Sensory Evaluation, and Instrumental Tenderness Measurements of Meat. 2015. [Google Scholar]
- 45.SAS Institute, Inc., Cary, NC. United States;
- 46.JMP®, Version 13. SAS Institute Inc., Cary, NC, 1989–2007.
- 47.Korpelainen E, Tuimala J, Somervuo P, Huss M, Wong G. RNA-seq data analysis a practical approach. 1. London: CRC Press Taylor & Francis Group; 2014. [Google Scholar]
- 48.Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg S. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:0–9. doi: 10.1186/gb-2013-14-4-r36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Langmead B, Salzberg SL. Fast gapped-read alignment with bowtie 2. Nat Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Picard [Internet]. Available from: http://broadinstitute.github.io/picard/. Accessed 25 Oct 2018.
- 51.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Andrews S, Krueger F, Segonds-Pichon A, Biggins L, Krueger C, Wingett S. FastQC. 2018. [Google Scholar]
- 53.Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011;27:863–864. doi: 10.1093/bioinformatics/btr026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
- 55.http://emea.support.illumina.com/sequencing/sequencing_software/igenome.html##. Available from: http://emea.support.illumina.com/sequencing/sequencing_software/igenome.html#. Accessed 20 June 2018.
- 56.Anders S, Pyl P, Huber W. HTSeq-A Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, Van Baren MJ, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28:511–515. doi: 10.1038/nbt.1621. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L. Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat Biotechnol. 2013;31:46–53. doi: 10.1038/nbt.2450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Anders S, Reyes A, Huber W. Detecting diferential usage of exons from RNA-seq data. Genome Res. 2012;22:2008–2017. doi: 10.1101/gr.133744.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Seo M, Kim K, Yoon J, Jeong JY, Lee HJ, Cho S, et al. RNA-seq analysis for detecting quantitative trait-associated genes. Sci Rep. 2016;6:1–12. doi: 10.1038/srep24375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:1–21. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Jia C, Guan W, Yang A, Xiao R, Tang WHW, Moravec CS, et al. MetaDiff: differential isoform expression analysis using random-effects meta-regression. BMC Bioinformatics. 2015;16:1–12. doi: 10.1186/s12859-015-0623-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Maechler M. sfsmisc: Utilities from “Seminar fuer Statistik” ETH Zurich. R package version 1.1–2. 2018. [Google Scholar]
- 65.Venables WN, Ripley BD. Modern applied statistics with S. 4. New York: Springer; 2002. [Google Scholar]
- 66.R Core Team . R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2018. [Google Scholar]
- 67.Shabalin AA. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics. 2012;28:1353–1358. doi: 10.1093/bioinformatics/bts163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Pickrell J, Marioni J, Pai A, Degner J, Engelhardt B, Nkadori E, et al. Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature. 2010;464:768–772. doi: 10.1038/nature08872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Gao X, Becker LC, Becker DM, Starmer JD, Province M. Avoiding the high Bonferroni penalty in genome-wide association studies. Genet Epidemiol. 2010;34:100–105. doi: 10.1002/gepi.20430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Gao X, Starmer J, Martin ER. A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms. Genet Epidemiol. 2008;32:361–369. doi: 10.1002/gepi.20310. [DOI] [PubMed] [Google Scholar]
- 71.Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Genotype data are available on the EVA website, accession number PRJEB24746. https://www.ebi.ac.uk/ena/data/view/PRJEB24746. RNA-seq data are available at the European Nucleotide Archive, accession number PRJEB31379, https://www.ebi.ac.uk/ena/data/search?query=PRJEB31379.