Skip to main content
eLife logoLink to eLife
. 2023 Feb 6;12:e80500. doi: 10.7554/eLife.80500

A transcriptome atlas of leg muscles from healthy human volunteers reveals molecular and cellular signatures associated with muscle location

Tooba Abbassi-Daloii 1,, Salma el Abdellaoui 1, Lenard M Voortman 2, Thom TJ Veeger 3, Davy Cats 4, Hailiang Mei 4, Duncan E Meuffels 5, Ewoud van Arkel 6, Peter AC 't Hoen 1,7,, Hermien E Kan 3,8,, Vered Raz 1,
Editors: Christopher L-H Huang9, Mone Zaidi10
PMCID: PMC9988256  PMID: 36744868

Abstract

Skeletal muscles support the stability and mobility of the skeleton but differ in biomechanical properties and physiological functions. The intrinsic factors that regulate muscle-specific characteristics are poorly understood. To study these, we constructed a large atlas of RNA-seq profiles from six leg muscles and two locations from one muscle, using biopsies from 20 healthy young males. We identified differential expression patterns and cellular composition across the seven tissues using three bioinformatics approaches confirmed by large-scale newly developed quantitative immune-histology procedures. With all three procedures, the muscle samples clustered into three groups congruent with their anatomical location. Concomitant with genes marking oxidative metabolism, genes marking fast- or slow-twitch myofibers differed between the three groups. The groups of muscles with higher expression of slow-twitch genes were enriched in endothelial cells and showed higher capillary content. In addition, expression profiles of Homeobox (HOX) transcription factors differed between the three groups and were confirmed by spatial RNA hybridization. We created an open-source graphical interface to explore and visualize the leg muscle atlas (https://tabbassidaloii.shinyapps.io/muscleAtlasShinyApp/). Our study reveals the molecular specialization of human leg muscles, and provides a novel resource to study muscle-specific molecular features, which could be linked with (patho)physiological processes.

Research organism: Human

Introduction

Skeletal muscles have grosso modo similar functions, generate the force for mobility and skeleton support, and maintain the body homeostasis. However, skeletal muscles differ in biomechanical and physiological features. These features include the size and contractile properties of the motor units and myofibers, differences in shortening velocity, resistance to fatigue, and differences in innervation and perfusion (Valentine, 2017). Yet, the molecular and cellular differences that contribute to this muscle specialization are not fully understood. A molecular atlas for different skeletal muscles could assist in deciphering the molecular basis of muscle-specific physiological features. Such an atlas may also be used to study differential muscle involvement in various conditions, such as muscular dystrophies, myopathies, differences in regenerative potential, physiological compensation in sports and sarcopenia.

During aging and in muscle diseases, some muscles are affected earlier than others. This is often referred as muscle involvement pattern, which can be characteristic of a given muscular pathology (Carlier et al., 2011; Raz et al., 2015; Albayda et al., 2018; Brogna et al., 2018; Diaz-Manera et al., 2018; Servián-Morilla et al., 2020). Several studies suggested that muscle-specific intrinsic molecular factors may explain this muscle involvement pattern (Kang et al., 2005; Rahimov et al., 2012; Huovinen et al., 2015; Raz et al., 2016; Terry et al., 2018; Hettige et al., 2020; Xi et al., 2020). For example, differences in the cellular pathways and myofiber type (slow- and fast-twitch myofibers) composition between muscles could play a role (De Micheli et al., 2020; Rubenstein et al., 2020; Xi et al., 2020), but may not fully explain the muscle involvement patterns.

Most of the studies characterizing the molecular variation between muscles were performed in mice (Campbell et al., 2001; Porter et al., 2001; Haslett et al., 2005; von der Hagen et al., 2005; Raz et al., 2018; Terry et al., 2018; Hettige et al., 2020), where muscle-specific mRNA profiles were linked to distinct myofiber type composition (Campbell et al., 2001; Raz et al., 2018; Hettige et al., 2020).

Since human muscle-related pathologies are not always recapitulated in mouse models (van Putten et al., 2020), understanding molecular variations between skeletal muscles should be performed in human samples. Only a few studies compared mRNA profiles between muscles from healthy human adults, and these studies face several limitations. Skeletal muscles are highly affected by age (McCormick and Vasilaki, 2018; Aversa et al., 2019), yet, the age range in previous studies was broad (Kang et al., 2005; Huovinen et al., 2015). Moreover, the number of sampled muscles and subjects were limited (Abbassi-Daloii et al., 2020). A study using postmortem material (Kang et al., 2005) only partly reflects molecular composition in living muscles due to storage in cooling conditions. Understanding muscle involvement in different pathologies can benefit from a molecular atlas of human muscles.

We generated a transcriptome atlas from six leg muscles and two locations from one muscle to explore molecular variations within and between muscles. Paired samples were obtained from 20 healthy male subjects of 25±3.6 years old. We show that the seven muscle tissues clustered into three groups, distinguished by cell type composition and mRNA expression profiles. We confirmed the transcriptome analyses with large-scale quantitative immunohistochemistry and RNA in situ hybridization procedures. We discuss the value of this skeletal muscle atlas resource to understand human health and pathologies affecting skeletal muscle tissues.

Results

Transcriptome atlas of adult human skeletal muscles

To determine molecular signatures marking leg muscles, we generated a transcriptome atlas of human skeletal muscles by sequencing biopsies from five upper leg muscles, gracilis (GR), semitendinosus (ST), rectus femoris (RF), vastus lateralis (VL), and vastus medialis (VM) muscles, and one lower leg muscle, gastrocnemius lateralis (GL; Figure 1A). We also investigated molecular differences within one muscle by including biopsies from the middle and distal end of the semitendinosus muscle (STM and STD, respectively). These two biopsies were treated as independent muscle samples in subsequent analyses (Figure 1B and C). In total, 128 samples from 20 individuals (aged 25±3.6 yr) were analyzed (Figure 1—figure supplement 1), making this currently the largest freely available human muscle atlas. Supplementary file 1 shows the sample characteristics.

Figure 1. An overview of biopsies’ location and the study workflow.

(A) A schematic overview of the leg muscles. Arrows point to the muscles that were included in this study. The biopsies, with exception of STM (semitendinosus-middle), were taken from the distal area. (B–D) The study overview includes cryosectioning, RNA-isolation and sequencing (B) data analysis (C) and validations (D). Created with BioRender.

Figure 1.

Figure 1—figure supplement 1. Analysis framework.

Figure 1—figure supplement 1.

A flowchart summarizing the analysis framework used to detect molecular signatures characterizing distinct skeletal muscles. Following pre-processing, muscle-specific signatures were identified using three approaches: differential transcript usage analysis (in purple), differential expression analysis (in blue), cell type composition analysis (in yellow), gene co-expression network analysis (in green), and functional enrichment analysis (in red).
Figure 1—figure supplement 2. RNA isolation protocols.

Figure 1—figure supplement 2.

The PCA plot shows the effect of the RNA isolation protocols. The scatter plot of PC1 (x-axis) and PC2 (y-axis) shows GR muscle from five individuals from which RNA was isolated using two RNA isolation protocols.
Figure 1—figure supplement 3. The overview of RNA-seq samples.

Figure 1—figure supplement 3.

An X indicates samples that are not present in the final transcriptome dataset. Fastq files available from the European Genome Archive, Dataset ID: EGAS00001005904.
Figure 1—figure supplement 4. The quality control and batch correction of RNA-seq data.

Figure 1—figure supplement 4.

(A–B) Scatter plots of PC1 (x-axis) and PC2 (y-axis) before (A) and after (B) batch correction. Each dot presents a sample labeled by muscle tissue. Each library preparation batch is shown with a different color. The re-sequenced batch is denoted in black. (C–D) Box plots show the analysis of variance before (C) and after (D) batch correction. Y-axis shows the percentage of variance explained by different factors. The x-axis shows the known biological (muscle and individual, shown in green) and technical (RIN score, concentration, batch, library size, shown in red) factors. The RNA isolation protocol effect is captured in the individual effect.

Variation in cell type composition between different muscles

Skeletal muscle is a heterogeneous tissue containing multiple cell types. The differences in the abundance of these cell types can be reflected in bulk RNA-seq profiles. Therefore, we used RNA-seq data to first explore possible cell type heterogeneity between leg muscles. We summarized the expression level of genes marking each cell type present in human skeletal muscles by calculating their first principal components (eigenvectors; Supplementary file 2). We used the eigenvalues of the eigenvectors representing the different cell types to cluster the muscles (Figure 2A) and to identify cell types with significant differences in relative abundance between muscles (Figure 2—figure supplement 1). The muscle tissues clustered into three groups, Group 1 (G1): GR, STM, and STD; Group 2 (G2): RF, VL, and VM; GL was the only muscle in Group 3 (G3) (Figure 2A). The relative abundance of endothelial cells was statistically the most different between muscles, with higher abundance in G2 and G3 than in G1 (Figure 2A–B, Figure 2—figure supplement 1). Other cell types marking blood vessels, namely pericytes, post-capillary venule (PCV) endothelial cells, natural killer (NK) cells, T and B cells, and myeloid cells, clustered together with the endothelial cells and all showed a higher abundance in G2 and G3 compared with G1 (Figure 2A–B). These results could suggest a higher capillary density and blood perfusion in/of the muscles in G2 and G3.

Figure 2. Muscles cluster into three main groups based on cell type composition.

(A) The heatmap shows the mean eigenvalues of genes marking each cell type across all the individuals. Each row shows a muscle, and each column shows a cell type. FAP stands for fibro-adipogenic progenitors. (B) The boxplot shows the eigenvalues for the endothelial cells, fast-twitch, and slow-twitch myofibers per muscle. The boxes reflect the median and interquartile range.

Figure 2.

Figure 2—figure supplement 1. Cell type composition pairwise comparison between muscles.

Figure 2—figure supplement 1.

The heatmap shows the differences between each pair of muscles. The statistically significant variation between muscles was tested by ANOVA followed by the post-hoc pairwise comparisons. Each row corresponds to a pairwise comparison, and each column shows a cell type. Color-coded cells show the corresponding t-ratio for the differences in eigenvalue of a cell type in each pairwise comparison. The significant differences (Tukey p-value <0.05) are colored red (significantly higher eigenvalues in muscle M1) or blue (significantly higher eigenvalues in muscle M2). The non-significant differences are colored from pink (relatively higher eigenvalues in M1) to light blue (relatively higher eigenvalues in M1).
Figure 2—figure supplement 2. Cell types’ eigenvalues.

Figure 2—figure supplement 2.

Each boxplot shows the eigenvalues of a cell types across different muscles. The boxes reflect the median and interquartile range.

Genes marking fast-twitch myofibers showed overall higher expression levels in G1, while slow-twitch genes were higher in G2 and G3 (Figure 2A–B). Differences in the relative abundance of non-muscle cell types, pericytes, immune cells, and endothelial cells, distinguished G3 from the G1 and G2 muscles (Figure 2—figure supplement 2, Figure 2A–B).

While STM and STD showed significant differences in the relative abundance of genes marking endothelial cells (higher expression in STD) and slow-twitch myofibers (higher expression in STM), there were no significant differences between ST and GR, and within the G2 muscles. This suggests that differences between regions of the same muscle may be larger than differences between distinct muscles (Figure 2—figure supplement 1).

Further study of differences in myofiber type composition between groups of muscles

To confirm the differences in myofiber types between muscles, we performed immunofluorescence staining for all muscles with a mixture of antibodies to three MyHC isoforms and anti-laminin antibody (Figure 3A). We developed a semi-automated image processing workflow to segment the myofibers using laminin staining and to quantify the fluorescence intensity of each MyHC isoform per myofiber. We next identified myofiber types by clustering all the myofibers using the MFI values of the three MyHC isoforms. The vast majority of the myofibers (94%) were assigned to three major clusters (Figure 3B). Each myofiber cluster had a major MyHC isoform (Figure 3C). Consistent with our study in human vastus lateralis muscle (Raz et al., 2020), the results here suggest that the myofibers are generally not purely type -I, -IIA, or -IIX but contain a mix of myosin heavy chain isoforms. We observed relatively high correlations from 0.55 to 0.62 between normalized gene expression of the dominant MyHC in each cluster and the proportion of myofibers assigned to the corresponding cluster (Figure 3D–F, Figure 3—figure supplement 1A). This correlation demonstrates the reliability of our RNA-seq-based assessment of MyHC expression.

Figure 3. Myofiber type composition is consistent with the expression level of genes marking fast and slow-twitch myofibers.

(A) A representative immunostaining image. The overlay of each myosin heavy chain isoform and laminin are shown separately. Scale bar is 1000 μm. (B) A 3-D scatterplot of the MyHC isoforms MFI. Each dot represents a myofiber. Myofibers in the three largest clusters are denoted with red (Cluster 1), blue (Cluster 2), and green (Cluster 3). The objects with low MFI values for all the isoforms are denoted in yellow (Cluster 4, ~2% of all the dots). In gray are ~4% of myofibers assigned to many small clusters. (C) Density plots show MFI distribution for each MyHC isoform in the three largest clusters. MFI values are scaled (without centering) and transformed. (D–F) Scatterplots show the proportion of the assigned myofibers to each of the largest clusters and the normalized expression of the gene coding the isoform with a relatively higher expression in that specific myofiber cluster. (G) The boxplot shows the proportion of myofibers in the three largest clusters per muscle. Each muscle is depicted with a different color, with G1 muscles in blue, G2 muscles in red and the G3 muscle in gray. The boxes reflect the median and interquartile range.

Figure 3.

Figure 3—figure supplement 1. The association of myofiber clusters with MyHC expression, and myofiber composition differences between muscles.

Figure 3—figure supplement 1.

(A) Scatterplots show correlation between the proportion of the assigned myofibers per cluster and MyHC isoform normalized expression levels. MYH2 codes for MyHC2A, MYH7 codes for MyHC1 and MYH1 codes for MyHC2X. (B) The heatmap shows the differences between each pair of muscles. The statistically significant variation between muscles was tested by ANOVA followed by the post-hoc pairwise comparisons. Each row corresponds to a myofiber cluster, and each column shows a pairwise comparison. Color-coded cells show the corresponding t-ratio for the differences in proportions of myofiber in each pairwise comparison. The significant differences (Tukey p-value <0.05) are colored red (significantly higher proportions in muscle M1) or blue (significantly higher proportions in muscle M2). The non-significant differences are colored from pink (relatively higher proportions in M1) to light blue (relatively higher proportions in M1).

In agreement with the results of the RNA-seq cell type composition analysis (Figure 2A–B), the quantitative histology analysis demonstrated a higher proportion of slow-twitch (oxidative) myofibers and a lower proportion of MyHC2X-dominated myofibers in G2 and GL (G3) muscles than in G1 muscles (Figure 3G, Figure 3—figure supplement 1B). The quantitative histology analysis further showed that G2 muscles had a higher proportion of MyHC2A-dominated myofibers than the G3 muscle (Figure 3G, Figure 3—figure supplement 1B), highlighting a distinct myofiber type composition of the GL muscle.

The myofiber composition results further showed a higher proportion of MyHC2X-dominated myofibers in STD than in STM, whereas STM had a higher proportion of MyHC1 and MyHC2A-dominated myofibers (Figure 3—figure supplement 1B). This confirms the existence of regional differences within a muscle (Bindellini et al., 2021).

Higher capillary density in GL

The RNA-seq cell type composition analysis suggested a higher proportion of endothelial and other cell types marking blood vessels in the GL muscle than in other muscles. To confirm this observation, we immunostained the endothelial cells using antibodies against Endoglin (ENG) and CD31 proteins (Tey et al., 2019). We included cryosections of GL and STM with the largest differences in the expression of genes marking endothelial cells (Figure 4A). We observed a higher proportion of CD31-positive areas in GL (Figure 4B), which was consistent with higher CD31 RNA expression levels in this muscle (Figure 4C).

Figure 4. Immunostaining confirms higher capillary density in GL compared with STM muscles.

Figure 4.

A) Representative images of STM and GL cross-sections immunostained with CD31 (green), ENG (red), and laminin (white). An enlargement of the boxed region is shown on the right: merged and separate channels (CD31 and ENG). Examples of objects recognized as capillaries are depicted by yellow arrows. Examples of objects that were not considered as capillaries due to circularity values are shown by red arrows. Scale bar 1000 μm. (B) The box plot shows the percentage of CD31 positive area in the two muscles. (C) The box plot shows the normalized expression of CD31 gene in the two muscles. (D) The boxplot shows the estimated capillary density in the two muscles. The capillary density was defined as the number of objects (3–51 µm2) with an overlap between CD31 and ENG per unit cross-sectional area of the muscle. The boxes reflect the median and interquartile range (N=19 per muscle). The red dots on the boxes show the mean. **** p-value <1 × 10–6 (linear mixed-model).

Next, we determined the muscles’ capillary density by counting small circular objects stained positive for both CD31 and ENG (Wehrhan et al., 2011). We observed a higher capillary density in GL compared with STM (Figure 4D). This observation is consistent with a higher proportion of endothelial cell types in GL compared with muscles in G1 or G2.

Gene and transcript expression profiles and molecular pathways distinguishing muscle clusters

We next investigated whether muscle-specific gene and transcript expression profiles, not explained by cell type composition, could also be found in our dataset. To this end, we determined the differentially expressed genes (DEGs; Figure 5—figure supplement 1A, Supplementary file 3) and the differentially expressed transcripts (DETs; Figure 5—figure supplement 1B, Supplementary file 4) for every pairwise comparison. DEGs that were driven by differences in cell type composition were excluded (Pearson’s R>0.5 between gene expression levels and the eigenvector of any cell type). Similarly, the transcripts originating from genes related to differences in cell type composition were excluded from the list of DETs. The proportion of DEGs and DETs that were not driven by cell type composition but discriminated each pair of muscles are shown in Figure 5A and Figure 5B, respectively. The muscles clustered in a similar way as was observed in the cell type composition analysis: GR, STM, and STD (G1), RF, VL, and VM (G2), and GL (G3) (Figure 5A–B).

Figure 5. Gene expression differences between three groups of muscles not driven by cell type composition.

A) Symmetric heatmap plot shows the percentage of DEGs in different pairwise comparisons. Genes with a high Pearson correlation (R>0.5) with the eigenvector of any cell type (cell type related genes) are excluded. (B) Symmetric heatmap plots show the percentage of DETs in pairwise comparisons. Transcripts originating from cell type related genes are excluded. (C) as in (B), but for the number of genes having at least a significant transcripts usage difference. (D) Symmetric heatmap plot shows the number of modules that were not driven by cell type composition and were significantly different in each pairwise comparison. Each row or column in A–D) represents a muscle. (E) The heatmap shows the modules that reflect the intrinsic differences between groups of muscles. Each row represents a muscle, and each column shows a muscle-related module that was not driven by cell type composition. Color-coded cells show the corresponding average of eigenvalues across all individuals (N=20). Modules with an overall higher expression in G1 or G3 are underlined by a blue or gray dashed line, respectively. The red dashed line underlines the modules with an overall higher expression in both G2 and G3. The black asterisks show modules with the largest differences between the three groups of muscles.

Figure 5.

Figure 5—figure supplement 1. DEA and WGCNA also clustered muscles in three groups.

Figure 5—figure supplement 1.

(A) Symmetric heatmap shows the proportion of all differentially expressed genes in different pairwise comparisons. Each row or column represents a muscle. (B) As in A, but here for the proportion of all differentially expressed transcripts. (C) As in (A) but here for the number of genes with at least a significant transcript usage difference. (D) as in A, but here for the number of modules that were significantly different in each pairwise comparison.
Figure 5—figure supplement 2. Overview of genes with significant transcript usage difference.

Figure 5—figure supplement 2.

The heatmap shows the number of transcripts with significant usage differences in each pair of muscles. The genes with at least one significant transcript usage difference are listed in rows. The pairwise comparisons with at least one transcript usage difference are shown in columns. Red asterisks highlight genes with a high Pearson correlation (R>0.5) with the eigenvector of any cell type.
Figure 5—figure supplement 3. WGCNA modules.

Figure 5—figure supplement 3.

A) Each row corresponds to a pairwise comparison and each column shows a muscle-related. Color-coded cells show the corresponding t-ratio for the differences in eigenvalue of a module in each pairwise comparison. The significant differences (Tukey p-value <0.05) are colored red (significantly higher eigenvalues in M1) or blue (significantly higher eigenvalues in M2). The insignificant differences are colored from pink (relatively higher eigenvalues in M1) to light blue (relatively higher eigenvalues in M1). (B) The table shows the intersection of the genes in the modules (columns) with genes marking different cell types (rows). Color-coded cells show the corresponding intersection number. The red asterisks show modules driven by cell type composition.

We then investigated the transcript usage differences in each pair of muscles using the IsoformSwitchAnalyzeR algorithm (Vitting-Seerup and Sandelin, 2019). We found a total of 79 genes with at least one transcript usage difference in any pairwise comparison of muscles (Figure 5—figure supplement 1C, Supplementary file 5). Out of 79 significant genes, 14 were excluded as they were related to the differences in cell type composition (Figure 5—figure supplement 2). Figure 5C shows the number of genes with significant transcript usage differences in each pair of muscles. The limited number of genes with significant transcript usage differences compare to DEGs and DETs analyses could be, because we could not account for the individual effect in this analysis in the same as for the DEG and DET analyses.

To further study muscle-specific expression profiles, we applied weighted gene co-expression network analysis (WGCNA). We identified 34 modules of co-expressed genes (Supplementary file 6). For each module, we calculated the module eigengene (ME) that represents gene expression levels of the genes in the module. We then implemented a pairwise comparison to find modules showing significant differences in every pairwise comparison (Figure 5—figure supplement 1D, Figure 5—figure supplement 3A). Out of the 34 modules, 27 showed a difference between at least two muscles (module size range: 38–1459; containing 10,695 genes in total). Nine out of the 27 muscle-related modules had at least five genes marking a specific cell type and were therefore defined as modules driven by differences in cell type composition and were not considered for further analysis (Figure 5—figure supplement 3B). Figure 5D shows the remaining modules that were not driven by cell type composition, nevertheless distinguished pairs of muscles. We then plotted the mean eigenvalues of muscle-related modules in a heatmap (Figure 5E) to determine the clustering of muscles based on the expression patterns of genes in the modules. The WGCNA-based clustering was consistent with the cell type composition and differential expression groups (Figure 5E). In total, 7 out of the 18 muscle-related modules demonstrated higher expression levels in G1, four modules had higher expression in G2 and G3, and three modules demonstrated higher expression levels in G3 only (Figure 5E). In addition, although none of the modules showed distinct expression patterns between muscles in G2 and between ST and GR, M.21 module showed higher expression levels in STM than STD (Figure 5—figure supplement 3A).

To explore the molecular and cellular pathways in the three groups, functional enrichment analysis was performed in the muscle-related modules. The most significantly enriched biological processes and molecular functions within these modules are listed in Table 1 (a complete list is in Supplementary file 6).

Table 1. Top enrichment results of muscle-related modules not driven by cell type composition.

Module Term FDR #Enriched genes Hub genes
Higher expression in G1 (GR, STM, and STD)
M.30 (75 genes) IRE1-mediated unfolded protein response 4×10–4 4
M.32 (236 genes) Hormone-mediated signaling pathway 9×10–3 11 CARM1, WBP2, ZBTB7A
M.7 (308 genes) Negative regulation of nucleobase-containing compound metabolic process 6×10–4 51 CREBBP, DAB2IP, FOXK2, LARP1, RXRA, THRA
Chromatin organization 1×10–2 27 ARID1B, CREBBP, HUWE1
M.17 (176 genes) Chromatin modifying enzymes 4×10–3 11 HCFC1, SETD1A
Higher expression in G2 (RF, VL, and VM) and G3 (GL)
M.31 (945 genes) RNA splicing 6×10–5 54 SNRNP70
Histone modification 3×10–3 47 KAT2A
M.33 (538 genes) Apical junction complex 2.4×10–2 11 MICALL2
M.13 (300 genes) Mitochondrion 4×10–45 122 AIFM1, ATP5F1A, ATP5F1B, CKMT2, COQ9, DLD, DLST, FH, GHITM, HADHA, HADHB, IMMT, MFN2, NDUFS2, PDHA1, PDHB, TRAP1, UQCRC2
M.14 (162 genes) Anterior/posterior pattern specification 4×10–3 7 HOXA11
Higher expression in G3 (GL)
M.5 (190 genes) Regulation of lipid metabolic process 8×10–4 15 ADIPOQ, ADRA2A, CIDEA, LEP, LGALS12, PDE3B, SCD
M.25 (188 genes) Ameboidal-type cell migration 5×10–3 15 CFL1, PML, TGFB1
Positive regulation of muscle cell differentiation 9×10–3 6 EHD2, ENG, NIBAN2, TGFB1
M.11 (299 genes) Golgi membrane 2×10–3 30 ASAP2, MAN1A1
Regulation of nervous system development 8×10–3 30 IQGAP1

Higher expression of mitochondrial genes in G2 and G3 muscles consistent with higher proportion of slow myofibers

In the M.13 module, with higher expression in G2 (VL, VM, and RF) and G3 (GL), the mitochondrial-related genes were enriched (Table 1). Eighteen out of 122 genes enriched for mitochondria in this module were hub genes, highly interconnected genes in the module (Table 1). To assess a potential impact on mitochondrial metabolic processes, we mapped the 122 genes to mitochondrial pathways (Figure 6). The most enriched processes were the respiratory electron transport chain in oxidative phosphorylation, the tricarboxylic acid (TCA) cycle, and beta-oxidation. This observation suggests a higher oxidative metabolism in G2 and G3, which is consistent with a higher proportion of slow myofibers.

Figure 6. A schematic representation of genes with higher expression in G2 and G3, related to oxidative phosphorylation and metabolic pathways in the mitochondria.

Figure 6.

60 (out of the 122) mitochondrial genes with higher expression in G2 and G3 are shown in red. The electron transport chain, lysin and tryptophan catabolism, TCA cycle, and beta-oxidation are shown. The hub genes are underlined and in bold. Created with BioRender.

Homeobox transcription factors contribute to the mRNA diversity between the three groups of muscles

An enrichment for ‘anterior/posterior pattern specification’ in M.14 was observed, with higher expression in G2 and G3 (Table 1). This module included HOX hub genes (Figure 7A). To assess whether the diversity between the groups of muscles was associated with the pattern of HOX gene expression, we plotted the normalized expression of all expressed HOX genes across all samples (Figure 7B). Remarkably, clustering based on HOX gene expression clearly separated the G1 from the G2 and G3 muscles (Figure 7B). Additionally, we observed that VL muscle sub-clustered apart from the two other muscles in G2 (Figure 7B). Moreover, 11 out of 36 HOX genes were assigned to three of the muscle-related modules (M.14, M.30, and M.32), which showed the largest differences between the three groups of muscles (Figure 7B). From each of these three muscle-related modules, one HOX gene was selected (HOXA11, HOXA10 and HOXC10) to further confirm the differences in expression between muscles using the in situ hybridization (ISH) procedure (Figure 8A). We included samples from GL and STM showing the largest difference in HOX gene expressions. HOXA11 was excluded from quantification analysis as it showed a low signal-to-noise ratio, in line with a lower expression level of RNA-seq-based assessment of HOXA11 compare to HOXA10 and HOXC10. The HOX signal was mainly localized in myofibers (Figure 8A). Per sample, the average number of foci per myofiber was calculated revealing a higher number of HOXA10 and HOXC10 single molecule RNAs in STM compared with GL (Figure 8B). The ISH results were consistent with the RNA-seq data (Figure 8B–C), further demonstrating the robustness of our RNA-seq data.

Figure 7. The expression patterns of HOX genes cluster muscles in the same groups.

Figure 7.

A) The graph shows the co-expression subnetwork of HOX genes and genes related to anterior/posterior pattern specification assigned to the M.14 module. Diamonds indicate transcription factors while other genes are indicated by circles. Pink and purple nodes represent the hub genes and non-hub genes, respectively. The genes related to anterior/posterior pattern specification have a black border. The edge thickness reflects the degree of topological overlap. Topological overlap is defined as a similarity measure between each pair of genes in relation to all other genes in the network. High topological overlaps indicate that genes share the same neighbors in the co-expression network. (B) Normalized expression of all HOX genes (scaled by row) represented as a heatmap. The hierarchical clustering was generated using the normalized expression values. Each row represents a gene and each column represents a sample. The side color of columns indicates different muscles. The module in which the gene assigned is given between parentheses. Eleven highlighted HOX genes are assigned into muscle-related modules which showed the largest differences between the groups of muscles (M.14, M.30, and M.32).

Figure 8. Distinct expression of HOX genes confirmed by RNAscope.

(A) Representative in situ hybridization images of HOXC10 and HOXA10 in cryosections of STM and GL. The image is a merge image of the channels used for laminin, DAPI, HOXC10 and HOXA10 staining. Scale bar is 100 μm. (B) The boxplots show the average number of foci per myofiber (y-axis) in STM and GL muscles (x-axis). (C) The boxplots show the normalized expression of HOXA10 (top) and HOXC10 (bottom) in STM and GL muscles. The boxes reflect the median and interquartile range (N = 12 per muscle). The red dots on the boxes show the mean. **** p-value < 1 × 10–6 (linear mixed-model).

Figure 8.

Figure 8—figure supplement 1. A representative image of negative control probes in the RNAscope experiment in STM muscle cryosections.

Figure 8—figure supplement 1.

Scale bar is 100 μm.

Web application for exploring transcriptome atlas of human skeletal muscles

To facilitate data reuse and exploration of the human skeletal muscle atlas, we developed a web application (https://tabbassidaloii.shinyapps.io/muscleAtlasShinyApp), enabling users to look up the sample information and the expression of any gene of interest. In addition, users can explore the list of genes used for the cell type composition analysis and their expression levels across all the samples. Furthermore, users can list and visualize the differentially expressed genes and the modules and their hub genes.

Discussion

We generated a large skeletal muscle transcriptome atlas from 20 young healthy males. We included six leg muscles and two locations within one muscle. The atlas presented in this study is unique in terms of the number of muscles, the individuals included, and the age range of the participants. We confirmed the RNA-seq analysis using large-scale quantitative immunohistochemistry and mRNA in situ hybridization. Based on cell type composition, differential gene and transcript expression analyses, differential transcript usage analysis and WGCNA, the seven leg muscle tissues consistently clustered into three groups: (G1) GR, STM, and STD; (G2) VL, VM, and RF; (G3) GL. The muscles in G2 and G3 (VL, VM, RF and GL) showed higher proportions of slow myofiber types and higher capillary densities. GL, the only lower leg muscle, was distinct from VL, VM, and RF in its lower proportion of type 2 A myofibers and a higher proportion of non-muscle cells. HOXA10 and HOXC10 expressions were lower in VL, VM, RF and GL than in GR, STM, and STD muscles.

Molecular diversity between muscles in different anatomical locations

The muscles included in this study mobilize and stabilize the knee joint. Muscles of the hamstrings (ST and GR) clustered together (G1), and muscles of the quadriceps (RF, VL and VM) clustered together (G2). The Hamstrings and quadriceps alternate in contraction and relaxation to flex, extend and stabilize the knee and aid in moving the thigh. GL, the only lower leg muscle in our set, is in the posterior side of the leg, allowing flexion of the knee and plantar flexion of the ankle. Our study suggests that there is little molecular diversity between muscles of the same group, as compared to muscles in different groups of muscles.

We observed a higher proportion of fast-twitch myofibers in G1 compared with G2 and G3. This could be due to the role of the hamstrings in activities that require a large power output since fast-twitch myofibers are used more in these activities than slow-twitch myofibers (Bottinelli et al., 1999; Willigenburg et al., 2014, Camic et al., 2015). Slow-twitch myofibers have a higher mitochondrial content compared with fast-twitch myofibers (Berchtold et al., 2000; Gouspillou et al., 2014). Consistently, G2 and G3 muscles showed higher expression of genes encoding for mitochondrial proteins and a higher ratio of slow-twitch myofibers compared with G1. Slow-twitch muscles are also supplied by a denser capillary network (Nishiyama, 1965; Murakami et al., 2010; Korthuis, 2011). Indeed, we observed a higher capillary density and higher endothelial cells in G3. The three groups of muscles also differed by the expression of HOX genes, specifically, HOXA and HOXC family members. Hox genes establish the anterior/posterior patterning during vertebrate embryonic limb development (Zakany and Duboule, 2007). Interestingly, the development of these groups of leg muscles differs in developmental time (Diogo et al., 2019), consistent with the expression of Hox genes (Zakany and Duboule, 2007). Hox genes expression is not limited to embryonic development, but was found also in adult mouse muscles (Houghton and Rosenthal, 1999; Yoshioka et al., 2021), and Hoxa10 gene was differentially expressed across adult limb mouse muscles (Yoshioka et al., 2021). Moreover, Yoshioka et al., 2021 demonstrated that Hoxa10 expression in adult satellite cells affects muscle regeneration in mice. Here, we show that both HOXA and HOXC gene families are expressed in myofibers, and their expression levels differ between leg muscles. Yoshioka et al., 2021 also showed the expression of HOX genes in adult human muscle tissues. Yet, Terry et al., 2018 concluded that the expression pattern of Hox genes in adult muscles is insufficient to explain the mRNA expression diversity in adult mouse skeletal muscles. Whether HOX genes are transcriptionally active in adult myofibers is a subject for future studies.

Potential relevance to muscle disease and aging

In several muscle-related diseases like muscular dystrophies (MDs), muscle weakness and pathological features like replacement of muscle tissue with fat starts in specific muscles and spreads to others as the disease progresses (Emery, 2002). This pattern differs between diseases, and the reason for the disease-specific involvement pattern is unknown. Exploring the molecular signatures that contribute to the differences between muscles may elucidate the pathophysiology of these diseases. For example, in Duchenne muscular dystrophy (DMD), which is caused by mutations in the DMD gene, the quadriceps is involved earlier, whereas the hamstring muscles are less involved, and the GR is spared (Wokke et al., 2014; Hooijmans et al., 2017). The observed higher expression level of the DMD gene in ST and GR may be related to the late involvement in DMD patients during disease progression. Accessing the expression level of genes and implementing a quantitative approach (Veeger et al., 2021) to evaluate the association between leg muscle architectural characteristics and gene expression levels could be performed for other muscle diseases.

Regional differences within muscles

The molecular and cellular differences between the samples from distal and middle locations of ST were larger than the differences between ST and GR (Figure 2—figure supplement 1, Figure 2—figure supplement 2). One module of co-expressed genes, M.21, showed a different expression pattern between STM and STD. This module was enriched for the cellular amino acid catabolic process and monocarboxylic acid catabolic process (Supplementary file 7). While the distal side of the ST muscle has a rounded tendon, the differences between STM and STD cannot be explained by contamination of tendon tissue or closer proximity to the tendon, because we did not find a difference in the estimated tenocyte proportions between biopsies collected from the distal and middle parts of the muscle (Figure 2—figure supplement 1, Figure 2—figure supplement 2). The myofiber composition was different between the distal and medial parts of the ST muscle (Figure 2—figure supplement 1, Figure 3—figure supplement 1B). A divergent myofiber type composition of biopsies from superficial and deep areas of the same human muscle was reported by Johnson et al., 1973 for the GL, RF, VL, VM, adductor magnus, soleus, and tibialis anterior muscles in the leg and thigh. Bindellini et al., 2021 also reported different proportions of MyHC2A myofibers in distal and middle parts of tibialis anterior in mice.

Interindividual differences were larger than differences between muscles

Despite the narrow age range and an inclusion of only one gender in our study, we observed that the percentage of variance explained by the individual surpassed the variance explained by the muscles (Figure 1—figure supplement 4D). This is in agreement with findings from Kang et al., 2005. The inter-individual variations are possibly resulting from genetic and environmental (activity, exercise, diet, etc.) factors. To account for inter-individual variation, we included the individual as a random effect in the statistical models in cell type and differential expression analyses. In the WGCNA, we constructed a consensus gene co-expression network by merging the co-expression networks separately constructed per individual. In the downstream steps of the WGCNA workflow, the same as in cell type analysis and differential expression analyses, individuals were included as a random effect in the models. Only after properly accounting for interindividual differences, we could identify the intrinsic differences between leg muscles.

Study limitations

Differences in cell type composition between muscles are best captured using single-cell sequencing. Previous single-cell (De Micheli et al., 2020; Rubenstein et al., 2020; Xi et al., 2020) and single nucleus (Orchard et al., 2021; Perez et al., 2021) studies reported the cellular composition of adult human muscles, where single nucleus profiling is preferred because myofibers cannot be dispersed into single cell suspensions. The high costs associated with single-cell technologies are currently prohibitive for performing large scale analyses of >100 samples such as performed in our study. Here, we evaluated differences in cellular composition by deconvolution of bulk RNA-seq based on marker genes reported in single-cell studies. This approach appeared to be suitable for analyzing differences in cellular composition between large sets of samples, as we observed good consistency with immunohistochemistry-based analyses of myofiber type and endothelial cell composition. A limitation of the deconvolution approach is, however, that this only captures cell types for which discriminative marker genes are available.

We further acknowledge that RNA expression levels do not necessarily match protein abundance in muscles and do not reflect post-translational modifications (Greenbaum et al., 2003; Liu et al., 2016). Although a protein atlas could relate to muscle cell function better than RNA expression profiles, generating a genome-wide proteome in skeletal muscles is challenging, as muscle proteomes are dominated by the high abundance of high-molecular-weight sarcomeric proteins, and capturing the low abundance proteins is challenging. Despite this limitation, we showed consistency between results obtained by mRNA expression profiling and immunohistochemical staining of the proteins that were in focus in our study.

In summary, we demonstrated divergent molecular and cellular compositions between skeletal muscles in different anatomically adjacent locations. Overall, the consistency of the gene expression patterns, and the results obtained from the immunohistochemistry and RNA in situ hybridization experiments indicate the high accuracy and reliability of the transcriptome atlas generated in this study. Therefore, this atlas provides a resource for exploring the molecular characteristics of muscles and studying the association between molecular signatures, muscle (patho)physiology and biomechanics.

Materials and methods

Subject characteristics and biopsy collection

Healthy male subjects (aged 18–32) undergoing surgery of the knee for anterior cruciate ligament (ACL) reconstruction using hamstring autografts were recruited from outpatient clinics of two hospitals: Erasmus Medical Center and Medisch Centrum Haaglanden. Inclusion criteria included age, sex, and the amount of routine exercise. Subjects eligible for reconstructive ACL surgery were mobile, had full range of knee motion, minimal to no knee swelling and had physiotherapy until the surgery.

A total of seven biopsies were taken from six different leg muscles (Figure 1A). To study molecular differences within the muscle, two biopsies from the middle and distal sides of the semitendinosus muscle (STM and STD, respectively) were collected. During the surgery, the tendons of the gracilis (GR) and semitendinosus muscles were used to reconstruct the ACL, and biopsies from these muscles were taken directly from the graft after harvesting the autografts at the beginning of the operation. After the ACL construction, biopsies from gastrocnemius lateralis (GL) rectus femoris (RF), vastus lateralis (VL), and vastus medialis (VM) muscles were taken by percutaneous biopsy (modified Bergstrom, 1975) using a minimally invasive biopsy needle. All biopsies were immediately frozen in liquid nitrogen and were kept at –80 °C.

The study was approved by the local Medical Ethical Review Board of The Hague Zuid-West and the Erasmus Medical Centre and conducted in accordance with the ethical standards stated in the 1964 Declaration of Helsinki and its later amendments (ABR number: NL54081.098.16). All subjects provided written informed consent prior to participation.

Sample processing, RNA isolation, and cDNA library preparation

Biopsies were cryosectioned for RNA isolation, immunofluorescence staining, and in situ hybridization. For each sample, three cryosections of 16 µm thick were collected onto SuperFrost slides (Thermo Fisher Scientific, 12372098) and stored at −20 °C prior to staining. For in situ hybridization, the cryosections were mounted on SuperFrost Plus Adhesion slides (Thermo Fisher Scientific, 12625336) and stored at −80 °C. For the RNA isolation, cryosections were transferred into MagNA lyser green beads tubes (Roche, 3358941001). Then, they were homogenized in QIAzol lysis reagent (Qiagen, 79306) using the MagNA Lyser. Subsequently, total RNA was purified with chloroform. For samples from a subset of individuals, RNA was precipitated with isopropyl alcohol (Supplementary file 1). For the other samples total RNA was mixed with an equal volume of 70% ethanol and further purified with miRNeasy Mini Kit (217004, Qiagen) using the manufacturer’s protocol (Supplementary file 1). To evaluate the effect of two different RNA isolation protocols, RNA from five GR samples were isolated with both protocols (Figure 1—figure supplement 2). For both protocols, DNA was removed using RNAse-free DNAse set (Qiagen, 79254) using the manufacturer’s protocol. RNA integrity was assessed with the Agilent 2100 Bioanalyzer using Eukaryote Total RNA Nano chips according to the manufacturer’s protocol (Agilent BioAnalyzer, 824.070.709) (Supplementary file 1).

Poly(A) library preparation was performed in four batches each with 39 samples at Leiden Genome Technology Center (LGTC, the Netherlands). Information on the RNA isolation protocol and library preparation batches used for each sample can be found in Supplementary file 1. Samples from different muscles and individuals were equally distributed in each library batch to minimize a batch effect bias. Approximately 200 ng of total RNA was used as starting material. mRNA was enriched using oligo dT beads (polyA +bead-based enrichment), fragmented, and converted to cDNA using random hexamers and SuperScript III (Invitrogen). End-repair, A-tailing, and adapter ligation were performed using NEBNext chemistry (New England Biolabs) and xGen dual index UMI adapters (Integrated DNA Technologies) according to the manufacturer’s protocol. Finally, USER digest (New England Biolabs) and 15 cycles of library amplification were performed. Libraries were purified with XP beads and analyzed for size and purity on a Bioanalyzer DNA HS chip (Agilent BioAnalyzer, 5067–1504).

Bulk RNA-sequencing and analysis

Illumina sequencing was performed by GenomeScan BV (Leiden, the Netherlands) on a Novaseq-6000 producing paired-end 2×150 bp reads. Fastq files were processed using the BioWDL pipeline for processing RNA-seq data (v3.0.0, https://zenodo.org/record/3713261#.X4GpD2MzYck) developed by the sequencing analysis support core (SASC) team at LUMC. The BioWDL pipeline performs FASTQ pre-processing, RNA-seq alignment, deduplication using unique molecular identifiers (UMIs), variant calling, and read quantification. FastQC (v0.11.7; ) (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) was used for checking raw read QC. Adapter clipping was performed using Cutadapt (v2.4; Martin, 2011) with default settings, followed by checking the QC using FastQC. RNA-Seq reads’ alignment was performed using STAR (v2.7.3a; Dobin et al., 2013) against the GRCh38 reference genome. PCR duplications were removed based on UMIs using UMI-tools (v0.5.5; Smith et al., 2017). Gene read quantification was performed using HTSeq-count (v0.11.2; Anders et al., 2015). The expression of known transcripts were quantified using StringTie (3.7.3)(Pertea et al., 2015). Ensembl version 98 (http://sep2019.archive.ensembl.org/) was used for genes and transcripts annotation. Samples with less than 5 M reads assigned to annotated exons were re-sequenced or excluded from all downstream analyses. A SNP calling was performed using GATK4 (v4.1.0.0; McKenna et al., 2010). Possible sample swapping was checked using an SNP panel with 50 SNPs (Yousefi et al., 2018). The similarity for calls of these SNPs showed that two samples in the same RNA isolation batch were swapped. We revised the labels of these two samples in our dataset for downstream analyses.

We performed all the analyses in RStudio Software (v1.3.959; RStudio-Team, 2020) using R Statistical Software (v4.0.2) (R Development Core Team, 2020). Samples with more than 5 M reads assigned to annotated exons were included in all downstream analyses (Figure 1—figure supplement 3). The HTSeq count table was used to create a DGEList object using the edgeR Bioconductor package (v3.30.3) (Robinson et al., 2010). The filterByExpr function from the edgeR Bioconductor package was used to keep genes with 10 or more reads in at least 16 samples (the number of samples in the smallest muscle group). The dataset was normalized using the calcNormFactors function (considering trimmed mean of M-values [TMM] method) from the edgeR Bioconductor package.

Quality control and batch effect correction

We performed principal component analysis (PCA) to evaluate the main difference between samples in an unsupervised manner. Log-transformed expression values after normalization by counts per million (CPM) were used to calculate principal components using the base function prcomp with the center and scale argument set to TRUE.

We then performed the analysis of variance to determine the factors driving gene expression variations. We estimated the contribution of known biological (muscle tissues and Individuals) and technical (RNA isolation protocol, RIN score, initial RNA concentration, library preparation batch, sequencing lane, and library size) factors on variation of gene expression. Data transformed by the voom function from the limma Bioconductor package (v3.44.3; Law et al., 2014; Ritchie et al., 2015) was used to fit a linear model for each gene. We included all biological and technical factors as fixed effects and fitted the following linear model:

voom-transformedexpressiongenemuscle+individual+RNAisolationprotocol+RINscore+concentration+librarypreparationbatch+sequencinglane+library

We used ANOVA from the car R package (v3.0–10; Fox and Weisberg, 2019) to estimate the relative contribution of each of these factors in the total variation of gene expression. Outcomes from both PCA and ANOVA revealed a strong library preparation batch effect (Figure 1—figure supplement 4A and C), while the effect of other technical factors (RNA isolation protocol, initial RNA concentration, RIN score, and library size) was minimal (Figure 1—figure supplement 4C). Accordingly, the HTSeq count table was corrected for the batch effect by the ComBat-seq Bioconductor package (Zhang et al., 2020). The muscle was included in the ComBat-seq model to preserve possible molecular differences between muscles. The ComBat-Seq count table was used to create a DGEList object, followed by removing the low expressed genes and additional normalization using filterByExpr and calcNormFactors functions, respectively.

Similarly, the batch effect was corrected for the StringTie transcript count table using the same ComBat-seq’s model. Next, a DGEList object was generated and the low expressed transcripts were filtered considering the same threshold used for gene dataset. The transcripts were further filtered to only include the transcripts of genes that were present in the filtered gene dataset. Finally, the transcript dataset was normalized with TMM method.

Outcomes of PCA and ANOVA confirmed the proper removal of the batch effect (Figure 1—figure supplement 4B and D). In addition, the percentage of variance explained by the individual was found to be bigger than the variance explained by the muscle (Figure 1—figure supplement 4D). We, therefore, included the individual as a random effect in all different analyses. Moreover, the RIN score was not considered as an exclusion criterion as it did not contribute to gene expression variation (Figure 1—figure supplement 4D).

Cell type composition estimation

We collected lists of genes marking different cell types that are present in human skeletal muscles from different studies (Smith et al., 2013; Kendal et al., 2019; Perucca Orfei et al., 2019; Rubenstein et al., 2020; Supplementary file 2). The expression of genes marking each cell type was summarized by their eigenvector (first principal component). We subsequently fitted a linear-mixed model to the eigenvector of each cell type using the lmer function from the lmerTest R package (3.1–3; Kuznetsova et al., 2017). These models included muscle as a fixed effect and individual as a random effect shown in the formula below:

eigenvectorcelltypemuscle+1individual+error

We tested the significance of fixed effects with the ANOVA from the car R package. The Benjamini-Hochberg false-discovery rate (FDR) was applied to adjust for multiple testing. We conducted post-hoc pairwise comparisons using the lsmeans R package (v2.30–0; Lenth, 2016) to identify a significant difference in the expression level of genes marking different cell types between different muscles. We used the pheatmap function from the pheatmap R package (v1.0.12; https://CRAN.R-project.org/package=pheatmap) with the difficult setting to draw all the heatmaps.

Differential expression analysis (DEA)

We used the voom-transformed data to fit linear mixed-effects models for each gene using the lmer function from the lmerTest R package. The individual and muscle were included in the models as a random-effect and a fixed-effect, respectively, similarly to the formula-2. The voom precision weights showing the mean-variance trend for each observation were incorporated into the models. We tested the significance of fixed effects with the ANOVA from the car R package and the FDR was applied to adjust for multiple testing. We conducted post-hoc pairwise comparisons using the lsmeans R package to identify significant differences between each pair of muscles.

We calculated the Pearson correlation between differentially expressed genes (DEGs, FDR <0.05) and the eigenvector of each cell type using the cor and cor.test from the stats R package. We adjusted for multiple testing using the FDR. DEGs which were significantly associated with a cell type eigenvector (Pearson correlation >0.5 and FDR <0.05) were defined as cell type related.

To find differentially expressed transcripts (DETs), we performed the DEA using the same model (formula-2) for the voom-transformed transcript data. We defined cell type related DETs as transcripts originating from genes with significant association with a cell type eigenvector.

Differential transcript usage analysis

We computed the isoform usage differences using IsoformSwitchAnalyzeR Bioconductor package (v1.20.0; Vitting-Seerup and Sandelin, 2019). Breifly, the Ballgown input table files generated by StringTie were imported by the importIsoformExpression function. A switchAnalyzeRlist was created considering the muscle and batch in the design matrix using importRdata function. The transcripts of genes that were not present in the filtered gene dataset were filtered out using subsetSwitchAnalyzeRlist function. Finally, isoformSwitchAnalysisCombined function was used to determine splice switch changes (differential transcript usage) between each pair of muscles. The significant splice switch changes were identified using significant FDR (<0.05).

Consensus gene co-expression network analysis

In order to construct a gene network, we used the weighted gene co-expression network analysis algorithm using the WGCNA R package (v1.69) (Langfelder and Horvath, 2008). We used the voom transformed data as an input. In order to calibrate the parameters of the network, we used the approach published by our group (Abbassi-Daloii et al., 2020). Briefly, prior knowledge of gene interactions from a pathway database was used to select the most optimal set of WGCNA parameters. We used the biweight midcorrelation (median-based) function in WGCNA of the signed hybrid type to define the adjacency matrix. We performed a full parameter sweep, testing various combinations of settings for power (6, 8, 10, 12, 14, 18, and 22), minClusterSize (15, 20, and 30), deepSplit (0, 2, and 4), and CutHeight (0.1, 0.15, 0.2, 0.25, and 0.3). These different settings were assessed using the knowledge network obtained from the Reactome database using g:ProfileR2 R package (v0.2.0) (Kolberg et al., 2020). All possible pairs of genes were assigned into four different groups: (1) in the same module and in the same pathway, (2) in the same module but not in the same pathway, (3) not in the same module but in the same pathway and (4) neither in the same module nor in the same pathway. The enrichment factor (No.pairsgroup1×No.pairsgroup4No.pairsgroup2×No.pairsgroup3) was calculated. The optimal set of parameters with the highest enrichment factor was: power: 8, MinModuleSize: 20, deepSplit: 0, Cut Height: 0.2. To identify gene co-expression networks that were consistent across individuals, we constructed first co-expression networks for each individual separately and merged these subsequently into a consensus co-expression network. To achieve this, the adjacency matrices per individual were raised to power 8 and converted into topological overlap matrices (TOM). TOM of some individuals may be overall lower or higher than TOM of other individuals. To account for this, we performed percentile (0.95) normalization over all the TOMs. The consensus TOM was then calculated by taking the elementwise 40th percentile of the TOMs. The consensus TOM was used to calculate the TOM dissimilarity matrix (dissTOM=1-TOM) which was then input to agglomerative hierarchical clustering (Langfelder and Horvath, 2012). Finally, modules were identified using a dynamic tree-cutting algorithm from the resulting dendrogram (Langfelder et al., 2008) specifying MinModuleSize =20 and deepSplit =0. The module labeled ‘gray’ was not considered in the analysis as it consisted of genes that did not assign to any specific module. The summary expression measure for each module, the module eigengene (ME), was calculated (Zhang and Horvath, 2005). Modules with similar expression profiles were merged at the threshold of 0.2. In addition, we calculated the intramodular connectivity to identify highly interconnected genes, called hub genes, per module.

Module-muscle association

To identify modules that differ in expression levels between muscles (named as muscle-related modules), we fitted linear mixed-effect models on the module eigengenes (MEs) using the lmer function from the lmerTest R package. These models included individual as a random-effect and muscle as a fixed-effect, similar to formula-2. We tested the significance of fixed effects with ANOVA from the car R package. We used ranova from the LmerTest R package to test the significance of random effects. To identify significant differences between each pair of muscles, we used a post-hoc multiple comparison tests as implemented in the lsmeans R package.

We performed a functional enrichment analysis for the muscle-related modules using ClueGO App (v2.5.7) (Bindea et al., 2009) in Cytoscape (v3.8.1) (Kohl et al., 2011). We used the CyREST API (Ono et al., 2015) to execute the ClueGO by R script (http://www.ici.upmc.fr/cluego/cluegoDocumentation.shtml). Pathways and gene annotations from Kyoto Encyclopedia of Genes and Genomes (KEGG), Gene Ontology (GO), Reactome, and WikiPathways (WP) were included. The Benjamini-Hochberg FDR was applied to adjust for multiple testing. The annotations with any differentially expressed genes or hub genes or a transcription factor were included. To eliminate the redundant annotations, we only included an annotation with the lowest FDR for each ‘GoGroups’ defined by ClueGO and the annotations marked as ‘LeadingGoTerm’ by ClueGO.

We next determined muscle-related modules which showed the largest differences between the three groups of muscles. These modules were selected based on the FDR of GlueGO enrichment (<0.01) and the F-value of the genes resulting from DEA in each module (third quantile >5.5).

Immunofluorescence staining, imaging, image analysis

The immunofluorescence staining included myofiber typing and capillary staining. Prior to the staining, slides were allowed to equilibrate to room temperature, blocked for 30 min using 5% milk powder (FrieslandCampina, Amersfoort, The Netherlands) in phosphate-buffered saline containing 0.05% tween (PBST).

Myofiber type composition

The myofiber immunostaining, image quantification and myofiber type composition analysis are detailed in a STAR protocol (Abbassi-Daloii et al., 2023).

Myosin staining

The antibodies for three myosin heavy chain (MyHC) isoforms (MyHC1, MyHC2A, and MyHC2X) and laminin were used as described by Riaz et al., 2016. Briefly, cryosections were stained with rabbit anti‐laminin (1:1000, Sigma-Aldrich, L9393) and mouse anti‐6H1 (1:5, DSHB; AB_2314830) detecting MyHC2X, for two hours at room temperature. Following the PBST washing, the secondary antibodies goat anti‐rabbit‐conjugated‐Alexa Fluor 750 (1:1000, Thermo Fisher Scientific, A21039) and goat anti‐mouse‐conjugated‐Alexa Fluor 488 (1:1000, A11001, Thermo Fisher Scientific) were incubated for an hour at room temperature. After PBST washing, sections were incubated overnight at four degrees with a mix of fluorescently conjugated monoclonal antibodies: BA‐D5‐conjugated‐Alexa Fluor 350 (1:600, DSHB, AB_2235587) and SC‐71‐conjugated‐Alexa Fluor 594 (1:700, DSHB, AB_2147165), detecting MyHC1 and MyHC2A, respectively. Lastly, after washing with PBST, the cryosections were mounted with ProLong Gold antifade reagent (P36930, Thermo Fisher Scientific) and stored at four degrees prior to imaging.

Image acquisition, processing, and quantification

The stained slides were imaged with the Axio Scan.Z1 slidescanner (Carl Zeiss, Germany) using the ZEN Blue software (v2.6), capturing the entire section. The images were acquired with a 10×/0.45 Plan-Apochromat objective lens and the same image settings were used for all slides.

After imaging all cryosections, a shading profile was calculated using the ‘Shading Reference From Tile Image’ in ZEN Lite (v3.3) for each channel in each slide. This procedure produces a shading profile for each channel per slide and does not apply the shading correction. To improve the accuracy of the shading profile, we calculated the median over all the shading profiles over all scanned slides for each channel. These median shading profiles were then used to perform the shading correction using ‘Shading Correction’ in ZEN Lite (v3.3).

Further image processing was performed using Fiji (v 1.51) (Schindelin et al., 2012). Since the aggregated dataset is relatively large, we created a modular set of Fiji macros that process each step independently.

First, we converted the slidescanner datasets from the Carl Zeiss Image format (CZI) to multichannel 16 bit TIFF files using BioFormats (Linkert et al., 2010). In this step, the images were 4 x downsampled, by averaging, to improve the processing speed and reduce the dataset size. After downsampling the effective pixel size was 2.6 µm.

Next, we applied a semi-automated process to generate tissue masks from the laminin channel to determine the (parts of) cryosections to be quantified. To generate masks, we first used an automated procedure, inspired by ‘ArtefactDetectionOnLaminin’ method from MuscleJ (Mayeuf-Louchart et al., 2018). Subsequently, a manual step was incorporated to check and correct the generated masks. For each sample, we performed manual corrections to remove artifacts such as tissue folds, out-of-focus regions, scratch, and dirt objects.

Then, we generated ‘masked’ copies of the laminin channel. To reduce any possible artifacts due to this binary mask, we applied a gaussian blur of 4 pixels to the masks and we set the pixel values of the laminin channel that were outside the mask to the median intensity of these pixels. The masked laminin images were then fed into the Ilastik pixel classification algorithm (Berg et al., 2019). In Ilastik we trained a classifier to identify two classes: ‘myofiber boundary’ and ‘not myofiber boundary’. This classifier was then used to process all images in this dataset. This classification step greatly improved the subsequent laminin segmentation outputs.

Next, the laminin objects were segmented based on the output of the previous step. In short, the image was slightly blurred with a Gaussian Blur, after which the image was segmented using the Fiji method ‘Find Maxima’ with output ‘Segmented Particles’, followed by binary dilation, and closing. Finally, the regions-of-interest (ROI; individual laminin segmented objects) were generated using the ‘Analyze Particle’ method from Fiji.

After laminin segmentation, we measured the mean fluorescence intensity (MFI) as well as other properties in ROIs for all three channels using the Fiji measurement: ‘Mean gray value’. In addition, we recorded the ‘Area’, ‘Standard deviation’, ‘Modal gray value’, ‘Min & max gray value’, ‘Shape descriptors’, and ‘Median’ features. We also quantified the results of the pixel-classification step by measuring its ‘Mean gray value’ in each ROI as well as on the border (3-pixel enlargement) of each ROI. This quantification allows the assessment of the myofiber ‘segmentation certainty’, the certainty is high when the pixel-classification is high for the ‘myofiber boundary’ class all around the myofiber and low in the interior of the myofiber.

Myofiber type composition analysis

First, we filtered out the non-myofiber objects since the laminin segmentation was automatic. We applied a percentile filtering for a ‘segmentation certainty’ on the cross-sectional area (CSA) and the circularity values. The objects with (1) pixel-classification on the object boundary less than 5th percentile or (2) pixel-classification in the interior of the object greater than 95th percentile or (3) CSA less than 10th percentile or greater than 99th percentile or (4) circularity greater than 1st percentile were excluded. Samples from different muscles were pooled for all different filtering criteria except for the filtering for CSA, as the density distributions of CSA were found to differ between different muscles. In the next step, we selected the cryosection with the largest number of myofibers for each sample for further analysis. Samples with a minimum of a hundred myofibers were included in the myofiber type analysis. The final dataset contained 1,287,729 myofibers from 96 samples, with a median of 888 myofibers per sample. As previously described by Raz et al., 2020, per myofiber, the MFI values for each of three MyHC isoforms were scaled per sample (without centering). Subsequently, the composition of myofiber types was determined by clustering of the transformed (natural logarithm) MFI values. Each myofiber was assigned to a cluster using the mean-shift algorithm (bandwidth (h)=0.02), a density-based clustering approach, implemented in the LPCM R package (v0.46–7) (Cheng, 1995; Durham University and Einbeck, 2011). All the small clusters, with less than 1% from the total myofibers, were excluded. Then, per myofiber type cluster, the proportions of the total myofibers were calculated per sample.

Capillary density

Staining and image acquisition

Sections were stained with the primary antibodies: anti-human CD105 (endoglin, ENG) biotin-conjugated (1:100, BioLegend, 323214), anti-human CD31-Alexa Fluor 594 conjugated (1:400, BioLegend, 303126), and rabbit anti‐laminin for two hours at room temperature. After PBST washing, the slides were incubated with streptavidin-Alexa Fluor 647 conjugated (1:500, Life Technologies, S21374) and goat anti‐rabbit Alexa Fluor 750-conjugated for an hour. After final PSBT washing, nuclei were counterstained with 4’,6-diamidino-2-phenylindole (DAPI) (0.5 μg/mL, Sigma-Aldrich) and were mounted with ProLong Gold antifade reagent. Cryosections were imaged with Axio Scan.Z1 slide scanner.

Image processing and quantification

We used Fiji macros created for the myofiber composition analysis to convert CZI files to TIFF files, to generate the masks, and for the laminin segmentation. We then measured the cross-sectional area for laminin segmented objects using the Fiji ‘Area’ measurement. Next, a Gaussian Blur filter with an σ value set to 1 was implemented on the CD31 channel, followed by thresholding using setAutoThreshold (“Li dark” algorithm) and processing using Watershed algorithm to separate touching and overlapping cells. The lumens were filled using the Fill Holes algorithm in Fiji. We then measured the properties in ROIs using the Fiji measurements: ‘Area’, ‘Mean gray value’, ‘Standard deviation’, and ‘Shape descriptors’. We then implemented the same processing on the ENG channel to select the ROIs but measured the ‘Mean gray value’ and ‘Standard deviation’ in the CD31 channel to determine the CD31 and ENG colocalization.

For the image quantification, we first calculated the ratio between the total positively stained areas for CD31 and the total area of the muscle section, expressed as a percentage. We then determined the capillaries as the objects with (1) positive signals for both CD31 and ENG (Wehrhan et al., 2011), (2) larger than 3 µm2 and smaller than 51 µm2 (Poole et al., 2013), and (3) circularity larger than 0.5. Finally, we defined capillary density as the number of capillaries per unit (µm²) of muscle area.

RNAscope in situ hybridization

We detected single-molecule RNA using Multiplex Fluorescent Reagent Kit v2 (ACDBio, 323135) according to the manufacturer’s protocol for fresh-frozen cryosections, with the following adjustments to optimize the experiment for human muscles: fixation with 4% paraformaldehyde at 4 degrees for an hour, and all washing steps with washing buffer were performed three times for 2 min each. The protocol was optimized on control muscle cryosections by negative and positive probe sets provided by ACDBio. Figure 8—figure supplement 1 shows a representative image of negative control probes in a muscle cryosection. We performed the hybridization using probes for Hs-HOXA11 (ACDBio, 1061891-C1), Hs-HOXA10 (ACDBio, 867141-C2), and Hs-HOXC10 (ACDBio, 803141-C3). Following the completion of the RNA probe hybridization, we carried out an immunostaining step at room temperature to label myofibers with rabbit anti‐laminin followed by secondary labeling with goat anti‐rabbit‐conjugated‐Alexa Fluor 555 (1:1000, Abcam, ab150078). Lastly, following PBST washing, the nuclei were counterstained with DAPI (ACDbio, 323110). Cryosections were mounted with ProLong Gold antifade reagent. Slides were imaged with a Leica SP8 confocal microscope, equipped with a white light laser (WLL) source (Leica Microsystems, Germany) using a 40 x/1.3 OIL objective. For each sample, multiple tiles at different regions across the muscle cryosection were images with seven z-planes (z-step size = 0.35 μm). The images for DAPI and HOXA11 channels were acquired using a HyD 2 detector with 414 nm-532nm excitation lasers and with 504 nm-543nm excitation lasers, respectively. A HyD 4 detector was used to image anti-laminin and HOXA10 channels with 558 nm-585nm excitation lasers and with 603 nm-665nm excitation lasers, respectively. A HyD 5 detector was used to image HOXC10 channel with 675 nm-800nm excitation lasers. The same image settings were used for all samples.

We performed the image processing in multiple steps and created a modular set of Fiji macros that process each step independently. We first merged and converted the Leica Image File (LIF) to a multichannel 16 bit TIFF file using the Grid/Collection Stitching Plugin (Preibisch et al., 2009). We segmented myofibers using the following steps: (1) creating the maximum intensities projections of the laminin channel, (2) creating ‘probability’ maps of the laminin channel in Ilastik, (3) adding a point selection to the TIFF files, which seed the watershed, and (4) implementing watershed segmentation with two halting points for user interaction, first watershed segmentation and then making the ROI list (individual segmented myofibers) generated using the ‘Analyze Particle’ command.

After myofiber segmentation, we implemented a Gaussian Blur filter with an σ value set to 1 on each probe channel. We then applied the color threshold settings using setAutoThreshold (‘RenyiEntropy dark’ algorithm). Finally, for each probe channel, we measured the foci properties in each segmented myofiber using the Fiji measurements: ‘Area’, ‘Mean gray value’, ‘Standard deviation’, and ‘Shape descriptors’.

The RNA foci were defined as speckles smaller than 3.5 µm2 with circularity above 0.98. We excluded HOXA11 from further analysis due to a low signal-to-noise ratio. Based on the negative controls, we defined threshold values to filter out false-positive signals for the 2 other HOX genes. These threshold values were set such that approximately all the foci in the negative control were classified as negative. Finally, to compare the expression of two genes between muscles, we calculated the average number of foci per myofiber per sample.

Availability of data and scripts

All scripts are publicly available on GitHub: https://github.com/tabbassidaloii/HumanMuscleTranscriptomeAtlasAnalyses, (Abbassi-Daloii, 2023 copy archived at swh:1:rev:4d885b7612c66147c99a26c4503559deabc610ac). The raw data is publicly available at the European Genome Archive (Dataset ID: EGAS00001005904, https://ega-archive.org). Figure 1C and Figure 1—figure supplement 1 show our analyses workflow used to explore genes contributing to the intrinsic differences between muscles.

Graphical user interface

The muscle transcriptomics atlas is available for exploration through a graphical user interface (https://tabbassidaloii.shinyapps.io/muscleAtlasShinyApp/) implemented using shiny, a web application framework for application shiny R package (v1.5.0)(Chang et al., 2020).

Gene network visualization

The subnetwork was exported and visualized in Cytoscape (v3.8.1).

Acknowledgements

We thank Susan Kloet, and the personnel from the Leiden genome technology center (LGTC) in the LUMC for providing the sequencing support. HK is member of the European Reference Network for Rare Neuromuscular Diseases [ERN EURO-NMD].

This project was funded by the Netherlands Organization for Scientific Research (NWO, under research program VIDI, Grant # 917.164.90) and the Association Française contre les Myopathies (AFM Telethon; Grant # 22506). We thank the personnel of the Sequence Analysis Support Core (SASC) in the LUMC for their support in data pre-processing and data submission at the European Genome Archive. This work was partially funded by a grant to the Netherlands X-omics Initiative (Dutch Research Council (NWO), project 184.034.019)

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Peter AC 't Hoen, Email: peter-bram.thoen@radboudumc.nl.

Hermien E Kan, Email: H.E.Kan@lumc.nl.

Vered Raz, Email: v.raz@lumc.nl.

Christopher L-H Huang, University of Cambridge, United Kingdom.

Mone Zaidi, Icahn School of Medicine at Mount Sinai, United States.

Funding Information

This paper was supported by the following grants:

  • Nederlandse Organisatie voor Wetenschappelijk Onderzoek 917.164.90 to Hermien E Kan.

  • Association France Myopathies 22506 to Vered Raz.

  • Netherlands X-omics Initiative 184.034.019 to Peter AC 't Hoen.

Additional information

Competing interests

No competing interests declared.

No competing interests declared.

Author contributions

Resources, Data curation, Formal analysis, Methodology, Writing – original draft.

Visualization.

Software, Visualization, Methodology.

Writing – review and editing.

Software.

Data curation.

Resources.

Resources.

Conceptualization, Resources, Software, Supervision, Methodology.

Conceptualization, Supervision, Funding acquisition, Project administration, Writing – review and editing.

Conceptualization, Supervision, Funding acquisition, Methodology, Project administration, Writing – review and editing.

Ethics

The study was approved by the local Medical Ethical Review Board of The Hague Zuid-West and the Erasmus Medical Centre and conducted in accordance with the ethical standards stated in the 1964 Declaration of Helsinki and its later amendments (ABR number: NL54081.098.16). All subjects provided written informed consent prior to participation.

Additional files

Supplementary file 1. Samples’ metadata.
elife-80500-supp1.xls (51.5KB, xls)
Supplementary file 2. A combined list of the genes marking different cell types.
elife-80500-supp2.xls (85.5KB, xls)
Supplementary file 3. The result of differential expression analysis for genes.
elife-80500-supp3.xls (16.3MB, xls)
Supplementary file 4. The result of differential expression analysis for transcripts.
elife-80500-supp4.xls (63.1MB, xls)
Supplementary file 5. The result of differential transcript usage analysis.
elife-80500-supp5.xls (1.7MB, xls)
Supplementary file 6. List of genes – modules.
elife-80500-supp6.xls (1.7MB, xls)
Supplementary file 7. List of enriched biological processes and molecular functions within modules.
elife-80500-supp7.xls (152.5KB, xls)
MDAR checklist

Data availability

The raw data is publicly available at the European Genome Archive (Dataset ID: EGAS00001005904, https://ega-archive.org/). The muscle transcriptomics atlas is available for exploration through a graphical user interface (https://tabbassidaloii.shinyapps.io/muscleAtlasShinyApp/).

The following previously published datasets were used:

Abbassi-Daloii T. 2022. The raw data is publicly available at the European Genome Archive. European Genome Archive. EGAS00001005904

References

  1. Abbassi-Daloii T, Kan HE, Raz V, ’t Hoen PAC. Recommendations for the analysis of gene expression data to identify intrinsic differences between similar tissues. Genomics. 2020;112:3157–3165. doi: 10.1016/j.ygeno.2020.05.026. [DOI] [PubMed] [Google Scholar]
  2. Abbassi-Daloii T. HumanMuscleTranscriptomeAtlasAnalyses. swh:1:rev:d09eff958b768ded8e39ad6a312063504190bd98Software Heritage. 2023 https://archive.softwareheritage.org/swh:1:dir:4b3aa1c7cf4367c3fcd8ac827bd4761ce1bc0adf;origin=https://github.com/tabbassidaloii/HumanMuscleTranscriptomeAtlasAnalyses;visit=swh:1:snp:0002bc13a132565e1ed7870f0d486c159d8c7f0a;anchor=swh:1:rev:d09eff958b768ded8e39ad6a312063504190bd98
  3. Abbassi-Daloii T, el Abdellaoui S, Kan HE, van den Akker E, ’t Hoen PAC, Raz V, Voortman LM. Quantitative analysis of myofiber type composition in human and mouse skeletal muscles. STAR Protocols. 2023;4:102075. doi: 10.1016/j.xpro.2023.102075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Albayda J, Christopher-Stine L, Iii COB, Paik JJ, Tiniakou E, Billings S, Uy OM, Burlina P. Pattern of muscle involvement in inclusion body myositis: a sonographic study. Clin Exp Rheumatol. 2018;36:996–1002. [PMC free article] [PubMed] [Google Scholar]
  5. Anders S, Pyl PT, Huber W. HTSeq—a python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Aversa Z, Zhang X, Fielding RA, Lanza I, LeBrasseur NK. The clinical impact and biological mechanisms of skeletal muscle aging. Bone. 2019;127:26–36. doi: 10.1016/j.bone.2019.05.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Berchtold MW, Brinkmeier H, Müntener M. Calcium ion in skeletal muscle: its crucial role for muscle function, plasticity, and disease. Physiological Reviews. 2000;80:1215–1265. doi: 10.1152/physrev.2000.80.3.1215. [DOI] [PubMed] [Google Scholar]
  8. Berg S, Kutra D, Kroeger T, Straehle CN, Kausler BX, Haubold C, Schiegg M, Ales J, Beier T, Rudy M, Eren K, Cervantes JI, Xu B, Beuttenmueller F, Wolny A, Zhang C, Koethe U, Hamprecht FA, Kreshuk A. Ilastik: interactive machine learning for (BIO) image analysis. Nature Methods. 2019;16:1226–1232. doi: 10.1038/s41592-019-0582-9. [DOI] [PubMed] [Google Scholar]
  9. Bergstrom J. Percutaneous needle biopsy of skeletal muscle in physiological and clinical research. Scandinavian Journal of Clinical and Laboratory Investigation. 1975;35:609–616. doi: 10.3109/00365517509095787. [DOI] [PubMed] [Google Scholar]
  10. Bindea G, Mlecnik B, Hackl H, Charoentong P, Tosolini M, Kirilovsky A, Fridman WH, Pagès F, Trajanoski Z, Galon J. ClueGO: a cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics. 2009;25:1091–1093. doi: 10.1093/bioinformatics/btp101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bindellini D, Voortman LM, Olie CS, van Putten M, van den Akker E, Raz V. Discovering fiber type architecture over the entire muscle using data-driven analysis. Cytometry. Part A. 2021;99:1240–1249. doi: 10.1002/cyto.a.24465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bottinelli R, Pellegrino MA, Canepari M, Rossi R, Reggiani C. Specific contributions of various muscle fibre types to human muscle performance: an in vitro study. Journal of Electromyography and Kinesiology. 1999;9:87–95. doi: 10.1016/s1050-6411(98)00040-6. [DOI] [PubMed] [Google Scholar]
  13. Brogna C, Cristiano L, Tartaglione T, Verdolotti T, Fanelli L, Ficociello L, Tasca G, Battini R, Coratti G, Forcina N, De Santis R, Norcia G, Carnicella S, Colosimo C, Carlier P, Pane M, Mercuri E. Functional levels and MRI patterns of muscle involvement in upper limbs in Duchenne muscular dystrophy. PLOS ONE. 2018;13:e0199222. doi: 10.1371/journal.pone.0199222. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Camic CL, Kovacs AJ, Enquist EA, McLain TA, Hill EC. Muscle activation of the quadriceps and hamstrings during incremental running. Muscle & Nerve. 2015;52:1023–1029. doi: 10.1002/mus.24901. [DOI] [PubMed] [Google Scholar]
  15. Campbell WG, Gordon SE, Carlson CJ, Pattison JS, Hamilton MT, Booth FW. Differential global gene expression in red and white skeletal muscle. American Journal of Physiology. Cell Physiology. 2001;280:C763–C768. doi: 10.1152/ajpcell.2001.280.4.C763. [DOI] [PubMed] [Google Scholar]
  16. Carlier RY, Laforet P, Wary C, Mompoint D, Laloui K, Pellegrini N, Annane D, Carlier PG, Orlikowski D. Whole-Body muscle MRI in 20 patients suffering from late onset Pompe disease: involvement patterns. Neuromuscular Disorders. 2011;21:791–799. doi: 10.1016/j.nmd.2011.06.748. [DOI] [PubMed] [Google Scholar]
  17. Chang W, Cheng J, Allaire JJ, Xie Y, McPherson J. Shiny: web application framework for R. 1.7.4R Project. 2020 https://cran.r-project.org/web/packages/shiny/index.html
  18. Cheng Y. Mean shift, mode seeking, and clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1995;17:790–799. doi: 10.1109/34.400568. [DOI] [Google Scholar]
  19. De Micheli AJ, Spector JA, Elemento O, Cosgrove BD. A reference single-cell transcriptomic atlas of human skeletal muscle tissue reveals bifurcated muscle stem cell populations. Skeletal Muscle. 2020;10:19. doi: 10.1186/s13395-020-00236-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Diaz-Manera J, Fernandez-Torron R, LLauger J, James MK, Mayhew A, Smith FE, Moore UR, Blamire AM, Carlier PG, Rufibach L, Mittal P, Eagle M, Jacobs M, Hodgson T, Wallace D, Ward L, Smith M, Stramare R, Rampado A, Sato N, Tamaru T, Harwick B, Rico Gala S, Turk S, Coppenrath EM, Foster G, Bendahan D, Le Fur Y, Fricke ST, Otero H, Foster SL, Peduto A, Sawyer AM, Hilsden H, Lochmuller H, Grieben U, Spuler S, Tesi Rocha C, Day JW, Jones KJ, Bharucha-Goebel DX, Salort-Campana E, Harms M, Pestronk A, Krause S, Schreiber-Katz O, Walter MC, Paradas C, Hogrel J-Y, Stojkovic T, Takeda S, Mori-Yoshimura M, Bravver E, Sparks S, Bello L, Semplicini C, Pegoraro E, Mendell JR, Bushby K, Straub V, Jain COS Consortium Muscle MRI in patients with dysferlinopathy: pattern recognition and implications for clinical trials. Journal of Neurology, Neurosurgery, and Psychiatry. 2018;89:1071–1081. doi: 10.1136/jnnp-2017-317488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Diogo R, Siomava N, Gitton Y. Development of human limb muscles based on whole-mount immunostaining and the links between ontogeny and evolution. Development. 2019;146:20. doi: 10.1242/dev.180349. [DOI] [PubMed] [Google Scholar]
  22. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. Star: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Durham University. Einbeck J. Bandwidth selection for mean-shift based unsupervised learning techniques: a unified approach via self-coverage. Journal of Pattern Recognition Research. 2011;6:175–192. doi: 10.13176/11.288. [DOI] [Google Scholar]
  24. Emery AEH. The muscular dystrophies. Lancet. 2002;359:687–695. doi: 10.1016/S0140-6736(02)07815-7. [DOI] [PubMed] [Google Scholar]
  25. Fox J, Weisberg S. An R Companion to Applied Regression. Los Angeles: SAGE; 2019. [Google Scholar]
  26. Gouspillou G, Sgarioto N, Norris B, Barbat-Artigas S, Aubertin-Leheudre M, Morais JA, Burelle Y, Taivassalo T, Hepple RT. The relationship between muscle fiber type-specific PGC-1α content and mitochondrial content varies between rodent models and humans. PLOS ONE. 2014;9:e103044. doi: 10.1371/journal.pone.0103044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Greenbaum D, Colangelo C, Williams K, Gerstein M. Comparing protein abundance and mRNA expression levels on a genomic scale. Genome Biology. 2003;4:117. doi: 10.1186/gb-2003-4-9-117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Haslett JN, Kang PB, Han M, Kho AT, Sanoudou D, Volinski JM, Beggs AH, Kohane IS, Kunkel LM. The influence of muscle type and dystrophin deficiency on murine expression profiles. Mammalian Genome. 2005;16:739–748. doi: 10.1007/s00335-005-0053-8. [DOI] [PubMed] [Google Scholar]
  29. Hettige P, Tahir U, Nishikawa KC, Gage MJ. Comparative analysis of the transcriptomes of EDL, psoas, and soleus muscles from mice. BMC Genomics. 2020;21:808. doi: 10.1186/s12864-020-07225-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Hooijmans MT, Niks EH, Burakiewicz J, Anastasopoulos C, van den Berg SI, van Zwet E, Webb AG, Verschuuren JJGM, Kan HE. Non-Uniform muscle fat replacement along the proximodistal axis in Duchenne muscular dystrophy. Neuromuscular Disorders. 2017;27:458–464. doi: 10.1016/j.nmd.2017.02.009. [DOI] [PubMed] [Google Scholar]
  31. Houghton L, Rosenthal N. Regulation of a muscle-specific transgene by persistent expression of Hox genes in postnatal murine limb muscle. Developmental Dynamics. 1999;216:385–397. doi: 10.1002/(SICI)1097-0177(199912)216:4/5&#x0003c;385::AID-DVDY7&#x0003e;3.0.CO;2-G. [DOI] [PubMed] [Google Scholar]
  32. Huovinen S, Penttilä S, Somervuo P, Keto J, Auvinen P, Vihola A, Huovinen S, Pelin K, Raheem O, Salenius J, Suominen T, Hackman P, Udd B. Differential isoform expression and selective muscle involvement in muscular dystrophies. The American Journal of Pathology. 2015;185:2833–2842. doi: 10.1016/j.ajpath.2015.06.018. [DOI] [PubMed] [Google Scholar]
  33. Johnson MA, Polgar J, Weightman D, Appleton D. Data on the distribution of fibre types in thirty-six human muscles. An autopsy study. Journal of the Neurological Sciences. 1973;18:111–129. doi: 10.1016/0022-510x(73)90023-3. [DOI] [PubMed] [Google Scholar]
  34. Kang PB, Kho AT, Sanoudou D, Haslett JN, Dow CP, Han M, Blasko JM, Lidov HGW, Beggs AH, Kunkel LM. Variations in gene expression among different types of human skeletal muscle. Muscle & Nerve. 2005;32:483–491. doi: 10.1002/mus.20356. [DOI] [PubMed] [Google Scholar]
  35. Kendal A, Layton T, Al-Mossawi H, Brown R, Loizou C, Rogers M, Sharp M, Dakin S, Appleton L, Carr A. Identification of Human Tendon Cell Populations in Healthy and Diseased Tissue Using Combined Single Cell Transcriptomics and Proteomics. bioRxiv. 2019 doi: 10.1101/2019.12.09.869933. [DOI]
  36. Kohl M, Wiese S, Warscheid B. Cytoscape: software for visualization and analysis of biological networks. Methods in Molecular Biology. 2011;696:291–303. doi: 10.1007/978-1-60761-987-1_18. [DOI] [PubMed] [Google Scholar]
  37. Kolberg L, Raudvere U, Kuzmin I, Vilo J, Peterson H. Gprofiler2 -- an R package for gene list functional enrichment analysis and namespace conversion toolset G: profiler. F1000Research. 2020;9:F1000Res. doi: 10.12688/f1000research.24956.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Korthuis RJ. In: Skeletal Muscle Circulation. Korthuis RJ, editor. San Rafael; 2011. Anatomy of skeletal muscle and its vascular; pp. 1–2. [Google Scholar]
  39. Kuznetsova A, Brockhoff PB, Christensen RHB. lmertest package: tests in linear mixed effects models. Journal of Statistical Software. 2017;82:1–26. doi: 10.18637/jss.v082.i13. [DOI] [Google Scholar]
  40. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. doi: 10.1186/1471-2105-9-559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Langfelder P, Zhang B, Horvath S. Defining clusters from a hierarchical cluster tree: the dynamic tree cut package for R. Bioinformatics. 2008;24:719–720. doi: 10.1093/bioinformatics/btm563. [DOI] [PubMed] [Google Scholar]
  42. Langfelder P, Horvath S. Fast R functions for robust correlations and hierarchical clustering. Journal of Statistical Software. 2012;46:11. doi: 10.18637/jss.v046.i11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Law CW, Chen Y, Shi W, Smyth GK. Voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biology. 2014;15:R29. doi: 10.1186/gb-2014-15-2-r29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Lenth RV. Least-squares means: the R package lsmeans. Journal of Statistical Software. 2016;69:1–33. doi: 10.18637/jss.v069.i01. [DOI] [Google Scholar]
  45. Linkert M, Rueden CT, Allan C, Burel JM, Moore W, Patterson A, Loranger B, Moore J, Neves C, Macdonald D, Tarkowska A, Sticco C, Hill E, Rossner M, Eliceiri KW, Swedlow JR. Metadata matters: access to image data in the real world. The Journal of Cell Biology. 2010;189:777–782. doi: 10.1083/jcb.201004104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Liu Y, Beyer A, Aebersold R. On the dependency of cellular protein levels on mRNA abundance. Cell. 2016;165:535–550. doi: 10.1016/j.cell.2016.03.014. [DOI] [PubMed] [Google Scholar]
  47. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.Journal. 2011;17:10. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
  48. Mayeuf-Louchart A, Hardy D, Thorel Q, Roux P, Gueniot L, Briand D, Mazeraud A, Bouglé A, Shorte SL, Staels B, Chrétien F, Duez H, Danckaert A. MuscleJ: a high-content analysis method to study skeletal muscle with a new fiji tool. Skeletal Muscle. 2018;8:25. doi: 10.1186/s13395-018-0171-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. McCormick R, Vasilaki A. Age-Related changes in skeletal muscle: changes to life-style as a therapy. Biogerontology. 2018;19:519–536. doi: 10.1007/s10522-018-9775-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Research. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Murakami S, Fujino H, Takeda I, Momota R, Kumagishi K, Ohtsuka A. Comparison of capillary architecture between slow and fast muscles in rats using a confocal laser scanning microscope. Acta Medica Okayama. 2010;64:11–18. doi: 10.18926/AMO/32859. [DOI] [PubMed] [Google Scholar]
  52. Nishiyama A. Histochemical studies on the red, white and intermediate muscle fibers of some skeletal mescles. II. the capillary distribution on three types of fibers of some skeletal muscles. Acta Med Okayama. 1965;19:191–198. [PubMed] [Google Scholar]
  53. Ono K, Muetze T, Kolishovski G, Shannon P, Demchak B. CyREST: turbocharging cytoscape access for external tools via a restful API. F1000Research. 2015;4:478. doi: 10.12688/f1000research.6767.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Orchard P, Manickam N, Ventresca C, Vadlamudi S, Varshney A, Rai V, Kaplan J, Lalancette C, Mohlke KL, Gallagher K, Burant CF, Parker SCJ. Human and rat skeletal muscle single-nuclei multi-omic integrative analyses nominate causal cell types, regulatory elements, and SNPs for complex traits. Genome Research. 2021;31:2258–2275. doi: 10.1101/gr.268482.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Perez K, McGirr J, Limbad C, Doi R, Nederveen JP, Nilsson MI, Tarnopolsky M, Campisi J, Melov S. Single Nuclei Profiling Identifies Cell Specific Markers of Skeletal Muscle Aging, Sarcopenia and Senescence. medRxiv. 2021 doi: 10.1101/2021.01.22.21250336. [DOI] [PMC free article] [PubMed]
  56. Pertea M, Pertea GM, Antonescu CM, Chang TC, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nature Biotechnology. 2015;33:290–295. doi: 10.1038/nbt.3122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Perucca Orfei C, Viganò M, Pearson JR, Colombini A, De Luca P, Ragni E, Santos-Ruiz L, de Girolamo L. In vitro induction of tendon-specific markers in tendon cells, adipose- and bone marrow-derived stem cells is dependent on TGFβ3, BMP-12 and ascorbic acid stimulation. International Journal of Molecular Sciences. 2019;20:149. doi: 10.3390/ijms20010149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Poole DC, Copp SW, Ferguson SK, Musch TI. Skeletal muscle capillary function: contemporary observations and novel hypotheses. Experimental Physiology. 2013;98:1645–1658. doi: 10.1113/expphysiol.2013.073874. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Porter JD, Khanna S, Kaminski HJ, Rao JS, Merriam AP, Richmonds CR, Leahy P, Li J, Andrade FH. Extraocular muscle is defined by a fundamentally distinct gene expression profile. PNAS. 2001;98:12062–12067. doi: 10.1073/pnas.211257298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Preibisch S, Saalfeld S, Tomancak P. Globally optimal stitching of tiled 3D microscopic image acquisitions. Bioinformatics. 2009;25:1463–1465. doi: 10.1093/bioinformatics/btp184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Rahimov F, King OD, Leung DG, Bibat GM, Emerson CP, Kunkel LM, Wagner KR. Transcriptional profiling in facioscapulohumeral muscular dystrophy to identify candidate biomarkers. PNAS. 2012;109:16234–16239. doi: 10.1073/pnas.1209508109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Raz Y, Henseler JF, Kolk A, Riaz M, van der Zwaal P, Nagels J, Nelissen RGHH, Raz V. Patterns of age-associated degeneration differ in shoulder muscles. Frontiers in Aging Neuroscience. 2015;7:236. doi: 10.3389/fnagi.2015.00236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Raz Y, Henseler JF, Kolk A, Tatum Z, Groosjohan NK, Verwey NE, Arindrarto W, Kielbasa SM, Nagels J, ’t Hoen PAC, Nelissen RGHH, Raz V. Molecular signatures of age-associated chronic degeneration of shoulder muscles. Oncotarget. 2016;7:8513–8523. doi: 10.18632/oncotarget.7382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Raz V, Riaz M, Tatum Z, Kielbasa SM, ’t Hoen PAC. The distinct transcriptomes of slow and fast adult muscles are delineated by noncoding rnas. FASEB Journal. 2018;32:1579–1590. doi: 10.1096/fj.201700861R. [DOI] [PubMed] [Google Scholar]
  65. Raz Y, van den Akker EB, Roest T, Riaz M, van de Rest O, Suchiman HED, Lakenberg N, Stassen SA, van Putten M, Feskens EJM, Reinders MJT, Goeman J, Beekman M, Raz V, Slagboom PE. A data-driven methodology reveals novel myofiber clusters in older human muscles. FASEB Journal. 2020;34:5525–5537. doi: 10.1096/fj.201902350R. [DOI] [PubMed] [Google Scholar]
  66. R Development Core Team . Vienna, Austria: R Foundation for Statistical Computing; 2020. https://www.r-project.org/index.html [Google Scholar]
  67. Riaz M, Raz Y, van Putten M, Paniagua-Soriano G, Krom YD, Florea BI, Raz V. PABPN1-dependent mrna processing induces muscle wasting. PLOS Genetics. 2016;12:e1006031. doi: 10.1371/journal.pgen.1006031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK. Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Research. 2015;43:e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Robinson MD, McCarthy DJ, Smyth GK. EdgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. RStudio-Team RStudio: integrated development environment for R. R Project. 2020 https://www.r-project.org/conferences/useR-2011/abstracts/180111-allairejj.pdf
  71. Rubenstein AB, Smith GR, Raue U, Begue G, Minchev K, Ruf-Zamojski F, Nair VD, Wang X, Zhou L, Zaslavsky E, Trappe TA, Trappe S, Sealfon SC. Single-cell transcriptional profiles in human skeletal muscle. Scientific Reports. 2020;10:229. doi: 10.1038/s41598-019-57110-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Servián-Morilla E, Cabrera-Serrano M, Johnson K, Pandey A, Ito A, Rivas E, Chamova T, Muelas N, Mongini T, Nafissi S, Claeys KG, Grewal RP, Takeuchi M, Hao H, Bönnemann C, Lopes Abath Neto O, Medne L, Brandsema J, Töpf A, Taneva A, Vilchez JJ, Tournev I, Haltiwanger RS, Takeuchi H, Jafar-Nejad H, Straub V, Paradas C. POGLUT1 biallelic mutations cause myopathy with reduced satellite cells, α-dystroglycan hypoglycosylation and a distinctive radiological pattern. Acta Neuropathologica. 2020;139:565–582. doi: 10.1007/s00401-019-02117-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Smith LR, Meyer G, Lieber RL. Systems analysis of biological networks in skeletal muscle function. Wiley Interdisciplinary Reviews. Systems Biology and Medicine. 2013;5:55–71. doi: 10.1002/wsbm.1197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Smith T, Heger A, Sudbery I. UMI-tools: modeling sequencing errors in unique molecular identifiers to improve quantification accuracy. Genome Research. 2017;27:491–499. doi: 10.1101/gr.209601.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Terry EE, Zhang X, Hoffmann C, Hughes LD, Lewis SA, Li J, Wallace MJ, Riley LA, Douglas CM, Gutierrez-Monreal MA, Lahens NF, Gong MC, Andrade F, Esser KA, Hughes ME. Transcriptional profiling reveals extraordinary diversity among skeletal muscle tissues. eLife. 2018;7:e34613. doi: 10.7554/eLife.34613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Tey SR, Robertson S, Lynch E, Suzuki M. Coding cell identity of human skeletal muscle progenitor cells using cell surface markers: current status and remaining challenges for characterization and isolation. Frontiers in Cell and Developmental Biology. 2019;7:284. doi: 10.3389/fcell.2019.00284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Valentine BA. In: Pathologic Basis of Veterinary Disease. Valentine BA, editor. Elsevier; 2017. Skeletal muscle; pp. 908–953. [Google Scholar]
  79. van Putten M, Lloyd EM, de Greef JC, Raz V, Willmann R, Grounds MD. Mouse models for muscular dystrophies: an overview. Disease Models & Mechanisms. 2020;13:dmm043562. doi: 10.1242/dmm.043562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Veeger TTJ, van Zwet EW, Al Mohamad D, Naarding KJ, van de Velde NM, Hooijmans MT, Webb AG, Niks EH, de Groot JH, Kan HE. Muscle architecture is associated with muscle fat replacement in Duchenne and Becker muscular dystrophies. Muscle & Nerve. 2021;64:576–584. doi: 10.1002/mus.27399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Vitting-Seerup K, Sandelin A. IsoformSwitchAnalyzeR: analysis of changes in genome-wide patterns of alternative splicing and its functional consequences. Bioinformatics. 2019;35:4469–4471. doi: 10.1093/bioinformatics/btz247. [DOI] [PubMed] [Google Scholar]
  82. von der Hagen M, Laval SH, Cree LM, Haldane F, Pocock M, Wappler I, Peters H, Reitsamer HA, Hoger H, Wiedner M, Oberndorfer F, Anderson LVB, Straub V, Bittner RE, Bushby KMD. The differential gene expression profiles of proximal and distal muscle groups are altered in pre-pathological dysferlin-deficient mice. Neuromuscular Disorders. 2005;15:863–877. doi: 10.1016/j.nmd.2005.09.002. [DOI] [PubMed] [Google Scholar]
  83. Wehrhan F, Stockmann P, Nkenke E, Schlegel KA, Guentsch A, Wehrhan T, Neukam FW, Amann K. Differential impairment of vascularization and angiogenesis in Bisphosphonate-associated osteonecrosis of the jaw-related mucoperiosteal tissue. Oral Surgery, Oral Medicine, Oral Pathology, Oral Radiology, and Endodontics. 2011;112:216–221. doi: 10.1016/j.tripleo.2011.02.028. [DOI] [PubMed] [Google Scholar]
  84. Willigenburg NW, McNally MP, Hewett TE. In: Hamstring and Quadriceps Injuries in Athletes: A Clinical Guide. Kaeding CC, Borchers JR, editors. Boston, MA: Springer US; 2014. Quadriceps and hamstrings strength in athletes; pp. 15–28. [DOI] [Google Scholar]
  85. Wokke BH, van den Bergen JC, Versluis MJ, Niks EH, Milles J, Webb AG, van Zwet EW, Aartsma-Rus A, Verschuuren JJ, Kan HE. Quantitative MRI and strength measurements in the assessment of muscle quality in Duchenne muscular dystrophy. Neuromuscular Disorders. 2014;24:409–416. doi: 10.1016/j.nmd.2014.01.015. [DOI] [PubMed] [Google Scholar]
  86. Xi H, Langerman J, Sabri S, Chien P, Young CS, Younesi S, Hicks M, Gonzalez K, Fujiwara W, Marzi J, Liebscher S, Spencer M, Van Handel B, Evseenko D, Schenke-Layland K, Plath K, Pyle AD. A human skeletal muscle atlas identifies the trajectories of stem and progenitor cells across development and from human pluripotent stem cells. Cell Stem Cell. 2020;27:158–176. doi: 10.1016/j.stem.2020.04.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Yoshioka K, Nagahisa H, Miura F, Araki H, Kamei Y, Kitajima Y, Seko D, Nogami J, Tsuchiya Y, Okazaki N, Yonekura A, Ohba S, Sumita Y, Chiba K, Ito K, Asahina I, Ogawa Y, Ito T, Ohkawa Y, Ono Y. Hoxa10 mediates positional memory to govern stem cell function in adult skeletal muscle. Science Advances. 2021;7:24. doi: 10.1126/sciadv.abd7924. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Yousefi S, Abbassi-Daloii T, Kraaijenbrink T, Vermaat M, Mei H, van ’t Hof P, van Iterson M, Zhernakova DV, Claringbould A, Franke L, ’t Hart LM, Slieker RC, van der Heijden A, de Knijff P, BIOS consortium. ’t Hoen PAC. A SNP panel for identification of DNA and RNA specimens. BMC Genomics. 2018;19:90. doi: 10.1186/s12864-018-4482-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Zakany J, Duboule D. The role of Hox genes during vertebrate limb development. Current Opinion in Genetics & Development. 2007;17:359–366. doi: 10.1016/j.gde.2007.05.011. [DOI] [PubMed] [Google Scholar]
  90. Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Statistical Applications in Genetics and Molecular Biology. 2005;4:Article17. doi: 10.2202/1544-6115.1128. [DOI] [PubMed] [Google Scholar]
  91. Zhang Y, Parmigiani G, Johnson WE. combat-seq: batch effect adjustment for RNA-seq count data. NAR Genomics and Bioinformatics. 2020;2:lqaa078. doi: 10.1093/nargab/lqaa078. [DOI] [PMC free article] [PubMed] [Google Scholar]

Editor's evaluation

Christopher L-H Huang 1

Skeletal muscle groups varied in biomechanical and can show a diffential involvement pattren in muscular disorders. Molecular characteristics of leg muscles from healthy young human males is presented in this study. It provides the first comparitive of gene signatures that will be valuable to understand muscle involvement in normal and abnormal physiological and pathological conditions.

Decision letter

Editor: Christopher L-H Huang1

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article "A transcriptome atlas of leg muscles from healthy human volunteers reveals molecular and cellular signatures associated with muscle location" for consideration by eLife. Your article has been reviewed by 2 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Mone Zaidi as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

A) Points raised by reviewer 1:

– The differences in functions because of different proportions of slow and fast fibers are well known and self evident. Because of the large interindividual variability which was greater than the variations between the different muscles under study the authors should discuss more how to methodologically enable higher levels of distinction between the different muscles.

– Analyzing the observed expression differences according to genetic myopathy gene defect categories such as: sarcolemmal genes, dystroglycanopathy genes, myofibrillar genes, autophagy related genes, chaperones, ion channels etc. would have been a preferred approach as to understand the differences and sometimes very selective involvement of certain muscles in the different genetic muscle diseases

– Instead of comparing 5 thigh muscles and one lower leg muscle maybe comparing six different lower leg muscles could have yielded more detail?

– The separation of different fast-slow fiber types could be further detailed by using the double IHC staining technique which identifies correctly all specific type I, IIA and IIX fibers and also the IIA/IIX hybrid fibers and the I/IIA hybrids for quantification.

– Another aspect not discussed would be the expression of different isoforms of the muscle genes which could have major influence on the functional profiles and the susceptibility or resistance to degeneration with certain gene defects.

B) Points raised by reviewer 2:

These data provide an impressive dataset of human muscle gene expression from young healthy individuals and will serve as a reference for future investigation of muscle pathology or ageing. The conclusions of this paper are mostly well supported by the data, but some aspects of the analyses by immunohistochemistry need to be clarified.

1) The method to cluster fibres according to their relative expression of myosin isoforms needs to be further explained (Figure 3). Fibres were clustered into three main categories according to the fluorescence intensity of MyHC1, MyHC2A and MyHC2X after immunohistochemistry. Figure 3B shows a continuum of the three intensities rather than clearly separated clusters and the authors should explain how fluorescence intensity thresholds were used to define such clusters. In addition, the differences in average Mean Fluorescence Intensity (MFI) values between clusters appear quite low: Max(MyHC1)/Min(MyHC1)=0.725/0.653=1.11;

Max(MyH2A)/Min(MyH2A)=0.806/0.535=1.50;

Max(MyH2X)/Min(MyH2X)=0.718/0.651=1.10.

These differences are low, notably compared to the gene expression data where these myosin isoforms have expression levels spread over 2 logs (Figures 3D-F). The authors should show specific examples of quantification, show specific quantification of fluorescence over MFI of background, and add statistics of expression differences based on MFI for each MyHC between clusters.

Figures 3D-F show correlations between the % of fibres in a given fluorescence cluster as a function of the expression by RNAseq of the corresponding MyHC defining this cluster. To strengthen these correlations, the authors should show the % of fibres in a given fluorescence cluster as a function of the expression of all three MyHC isoforms (e.g. % of fibres in cluster 1 = f(expression MYH7) or f(expression MYH1), same for clusters 2 and 3).

2) The authors assess blood vessel density in GL by immunohistochemistry (Figure 4). Figure 4A does not mention the muscle corresponding to the cryosection presented, and presenting STM and GL sections side-by-side would help to understand the conclusions of this figure. Displaying the green and red arrows as well on the CD31 and ENG panels and showing higher magnifications would help to understand which regions were defined or not as capillaries.

3) The authors validate HOX expression patterns by RNA-scope (Figure 8). Figure 8A does not mention the muscle corresponding to the cryosection presented. Maybe it would help show STM and GL sections stained for HOXA10 and HOXC10 side-by-side. Also, although the authors mention that signal specificity of RNA-scope probes was verified using negative controls, it would be helpful to show these controls and validate these probes on muscle sections known not to express HOX genes (head-derived?).

Finally, gene expression datasets from human muscle samples have already been generated. As discussed by the authors, these studies were limited in terms of the number of samples, large variation of donor ages, sample conservation before processing, etc. Nevertheless, it would be helpful to put the main findings of this paper (cell type composition, blood vessel density, fibre types ratios, HOX genes expression, mitochondrial processes, etc) into context and assess, if possible and if the data is available, whether similar findings can be concluded from previous datasets.

eLife. 2023 Feb 6;12:e80500. doi: 10.7554/eLife.80500.sa2

Author response


Essential revisions:

A) Points raised by reviewer 1:

1) The differences in functions because of different proportions of slow and fast fibers are well known and self-evident. Because of the large interindividual variability which was greater than the variations between the different muscles under study the authors should discuss more how to methodologically enable higher levels of distinction between the different muscles. Instead of comparing 5 thigh muscles and one lower leg muscle maybe comparing six-even different lower leg muscles could have yielded more detail?

We address this comment in two parts:

– Methodology to overcome inter individual differences:

In the revised version we extended the discussion how we dealt with the inter individual variability to optimize the discovery of genes with different expression levels between muscles (page 9, lines 337-350). We added sentences on how we accounted for the differences between individuals in the statistical models we employed and in the consensus network. In brief, in both cell type analysis and differential expression analyses, we included individuals as a random effect to the statistical model. In the co-expression network analysis (WGCNA), a network was constructed for each individual followed by a consensus network construction to correct for the individual’s effect. In the downstream steps of WGCNA, the same as the models above, individuals were included as a random effect.

– Muscles inclusion:

The muscles in this study are all leg muscles, but in terms of their role in locomotion, the muscles above or under the knee and the hamstrings and quadriceps have different roles (Willigenburg, McNally et al. 2014, Camic, Kovacs et al. 2015).

Muscle inclusion was determined by the surgeons of the ACL surgery. We included as many muscles as possible.

2) Another aspect not discussed would be the expression of different isoforms of the muscle genes which could have a major influence on the functional profiles and the susceptibility or resistance to degeneration with certain gene defects.

In our initial submission, we focused on genes’ analysis (combining multiple transcripts per gene). Following the reviewer's suggestion, we now added transcript-level differential expression analysis and transcript usage analysis: we quantified transcript expression levels using StringTie. After batch effect correction (the same approach used for the gene expression count table), we performed differential expression analysis. The approach is detailed in the material and method section (page 14, lines 535-538). The results are shown in Figure 5B.

Briefly, the results showed that the muscles clustered into the same groups as genes, considering the proportion of transcript with differential expression (DETs) in each pair of muscles. One panel has been added to Figure 5 and one to Figure 5 —figure supplement 1.

Collectively, these results imply that not only transcriptional but also post-transcriptional mechanisms contribute to the intrinsic differences between different leg muscles.

B) Points raised by reviewer 2:

These data provide an impressive dataset of human muscle gene expression from young healthy individuals and will serve as a reference for future investigation of muscle pathology or ageing. The conclusions of this paper are mostly well supported by the data, but some aspects of the analyses by immunohistochemistry need to be clarified.

We appreciate this positive evaluation of our resources.

1) The method to cluster fibres according to their relative expression of myosin isoforms needs to be further explained (Figure 3). Fibres were clustered into three main categories according to the fluorescence intensity of MyHC1, MyHC2A and MyHC2X after immunohistochemistry. Figure 3B shows a continuum of the three intensities rather than clearly separated clusters and the authors should explain how fluorescence intensity thresholds were used to define such clusters. In addition, the differences in average Mean Fluorescence Intensity (MFI) values between clusters appear quite low:

Max(MyHC1)/Min(MyHC1)=0.725/0.653=1.11;

Max(MyH2A)/Min(MyH2A)=0.806/0.535=1.50;

Max(MyH2X)/Min(MyH2X)=0.718/0.651=1.10.

These differences are low, notably compared to the gene expression data where these myosin isoforms have expression levels spread over 2 logs (Figures 3D-F). The authors should show specific examples of quantification, show specific quantification of fluorescence over MFI of background, and add statistics of expression differences based on MFI for each MyHC between clusters.

A detailed protocol for myofiber typing, wet lab, computer analysis (including clustering) and examples of MFI quantification can be found in a manuscript that we recently published: https://doi.org/10.5281/zenodo.7186929. This paper has been accepted for publication in STAR protocols and it is cited in the text (page 16, lines 613-615).

To find the myofiber clusters, we used the meanshift method, an unsupervised approach that clusters the myofibers based on the MyHC MFI. In this method, there is no MFI threshold, but clustering is made by the bandwidth value, i.e the distance/size scale of the kernel function. In other words, the size of the “window” used to calculate the mean.

We agree that using manual imaging background correction should be applied, also to eliminate differences between imaging sessions. We always applied it in our previous studies. In our study, imaging was carried out with a slide scanner in one batch. Since we obtained myofibers negative for each one of the fluorophores there was not need for background correction. The MFI of all three MyHC isoforms were scaled (without centering) and transformed prior to calculations or clustering. This is now specifically indicated in the legend to figure 3C. In our studies we consider the distribution of MFI values instead of the mean calculated by min and max values as suggested by the reviewer. Our approach does not assume normal distribution of the dataset, which is often the case in imaging dataset.

We have not implemented any statistics comparing the MFI for each MyHC isoform between clusters. However, the choice of bandwidth value is arbitrary and will affect the number and the size of clusters. Therefore, we selected the bandwidth where the number of myofiber clusters was biologically relevant (e.g. choosing a bandwidth that resulted in 10 myofiber clusters would not be biologically relevant).

As correctly spotted by the reviewer, Figure 3B (clustering 3D plot) shows a large variation in each cluster. As a result, the mean MFI values in the table (Figure 3C) are not representative of the cluster. Therefore, we replaced the table with density plots (one for each cluster) showing MFI distribution for each MyHC isoform (new Panel 3C). We believe this presentation is clearer.

Figures 3D-F show correlations between the % of fibres in a given fluorescence cluster as a function of the expression by RNAseq of the corresponding MyHC defining this cluster. To strengthen these correlations, the authors should show the % of fibres in a given fluorescence cluster as a function of the expression of all three MyHC isoforms (e.g. % of fibres in cluster 1 = f(expression MYH7) or f(expression MYH1), same for clusters 2 and 3).

As suggested by reviewer, we included a panel to Figure 3 —figure supplement 1A, showing the proportion of myofibers in each cluster as a function of the expression of all three MyHC isoforms. The additional panel confirms that the observed differences in MYH RNA expression are consistent with differences in the myofiber compositions and the assignments of cluster 1 as predominantly type 2A, cluster 2 as predominantly type 1 and cluster as predominantly 2X.

2) The authors assess blood vessel density in GL by immunohistochemistry (Figure 4). Figure 4A does not mention the muscle corresponding to the cryosection presented, and presenting STM and GL sections side-by-side would help to understand the conclusions of this figure. Displaying the green and red arrows as well on the CD31 and ENG panels and showing higher magnifications would help to understand which regions were defined or not as capillaries.

The figure is updated based on the reviewer recommendation:

3) The authors validate HOX expression patterns by RNA-scope (Figure 8). Figure 8A does not mention the muscle corresponding to the cryosection presented. Maybe it would help show STM and GL sections stained for HOXA10 and HOXC10 side-by-side. Also, although the authors mention that signal specificity of RNA-scope probes was verified using negative controls, it would be helpful to show these controls and validate these probes on muscle sections known not to express HOX genes (head-derived?).

We added an image zoom-into a single myofiber from each muscle.

Unfortunately, since this procedure is highly costly, we did not stain sections from muscles with no expression of these genes. We included the images from a negative control, where the muscle was stained with a bacterial mRNA probe (Figure 8 —figure supplement 1),

4) Finally, gene expression datasets from human muscle samples have already been generated. As discussed by the authors, these studies were limited in terms of the number of samples, large variation of donor ages, sample conservation before processing, etc. Nevertheless, it would be helpful to put the main findings of this paper (cell type composition, blood vessel density, fibre types ratios, HOX genes expression, mitochondrial processes, etc) into context and assess, if possible and if the data is available, whether similar findings can be concluded from previous datasets.

To our knowledge, this is the first study comparing gene expression patterns from human 6 leg muscles in a single study. Most gene expression studies comparing different muscles have been performed in mice. In the discussion, we thus compared our results to earlier immunohistochemistry-based studies in humans on the fiber type composition and capillary density. Moreover, we put our results on HOX gene expression in the context of earlier mouse gene expression studies (page 8, lines 289-304).

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Abbassi-Daloii T. 2022. The raw data is publicly available at the European Genome Archive. European Genome Archive. EGAS00001005904

    Supplementary Materials

    Supplementary file 1. Samples’ metadata.
    elife-80500-supp1.xls (51.5KB, xls)
    Supplementary file 2. A combined list of the genes marking different cell types.
    elife-80500-supp2.xls (85.5KB, xls)
    Supplementary file 3. The result of differential expression analysis for genes.
    elife-80500-supp3.xls (16.3MB, xls)
    Supplementary file 4. The result of differential expression analysis for transcripts.
    elife-80500-supp4.xls (63.1MB, xls)
    Supplementary file 5. The result of differential transcript usage analysis.
    elife-80500-supp5.xls (1.7MB, xls)
    Supplementary file 6. List of genes – modules.
    elife-80500-supp6.xls (1.7MB, xls)
    Supplementary file 7. List of enriched biological processes and molecular functions within modules.
    elife-80500-supp7.xls (152.5KB, xls)
    MDAR checklist

    Data Availability Statement

    The raw data is publicly available at the European Genome Archive (Dataset ID: EGAS00001005904, https://ega-archive.org/). The muscle transcriptomics atlas is available for exploration through a graphical user interface (https://tabbassidaloii.shinyapps.io/muscleAtlasShinyApp/).

    The following previously published datasets were used:

    Abbassi-Daloii T. 2022. The raw data is publicly available at the European Genome Archive. European Genome Archive. EGAS00001005904


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES