Proteins underlie the ability to respond dynamically to the challenges and changing cellular environments of life. Through their various functions, modes of regulation, and complex signalling networks, proteins within cells/tissues (proteome) form the very foundation of biology. Mass spectrometry (MS)-based proteomics is a technology-driven approach for the characterisation of the proteome1. Over the last two-three decades, its accurate, unbiased and high-throughput nature has revolutionised the ability to understand biological mechanisms (functional proteomics) at a systems-wide scale. This has allowed the characterisation of both canonical and novel proteins and their accompanying transduction pathways important for physiology, health, and disease2.
Skeletal muscles account for 30–40% of total body weight and functions as the principal contractile organ of the body by generating power and force3. Skeletal muscle is a highly heterogeneous tissue consisting of various cell types including myofibers, endothelial cells, fibro-adipogenic progenitors, muscle stem cells, and other mononuclear cells such as resident or infiltrating immune cells. Skeletal muscle myofibers are the most abundant and are typically characterised into either slow oxidative (type I) or fast glycolytic (type II) fibres, where they play an indispensable role in maintaining posture, movement and functional independence4. Skeletal muscle is also an active endocrine organ capable of secreting circulatory factors (myokines) that regulate inter tissue crosstalk5 and has an extraordinary capacity for regeneration6. Furthermore, skeletal muscles are central regulators of whole-body metabolism by serving as primary sites of glucose storage and uptake in response to insulin ( ~ 80% post-prandial)7. Nonetheless, in states of metabolic dysfunction, skeletal muscles reduce their sensitivity to insulin and ability to uptake glucose (insulin resistance)8. In turn, the continued progression of insulin resistance, hyperinsulinemia and hyperglycemia can lead to severe comorbidities such as cardiovascular disease9,10. Together, these conditions represent one of the most common causes of burden and disability worldwide11.
Exercise is one of the most effective ways to promote muscle health and combat insulin resistance. Research efforts over the last two decades have collectively pointed towards its ability to increase insulin sensitivity, stimulate muscle synthesis and increase muscle strength/function depending on the type of exercise modality12,13,14. Nevertheless, exercise alone is not sufficient to prevent these functional declines, nor are there any currently approved therapeutics that specifically target skeletal muscles15. As such, understanding the signalling pathway underlying these conditions is vital for identifying novel therapeutic targets to improve musculoskeletal health and longevity.
Indeed, over the last two-three decades, MS-based proteomics have been used to better understand how skeletal muscle signal transduction is regulated under various settings of health and disease. In this review, we aim not to discuss the techniques nor procedures used for the preparation and analysis of skeletal muscles by proteomics, in which readers are directed to comprehensive reviews by Dowling and colleagues16 and Cervone and colleagues17. Instead, we discuss the technical challenges unique to the field and how they can be overcome to allow coverage of the spatial distribution, fibre-type differences, and MuSC of the skeletal muscle proteome. We also critically evaluate the individual merits of these approaches using the latest case studies. We describe how recent advances in skeletal muscle proteomics has increased the understanding of the molecular responses to exercise and in diseases, with a keen focus on metabolism and insulin resistance. Throughout the review, we present current limitations, highlight overlooked shortcomings, and discuss future opportunities in the field.
Challenges of skeletal muscle proteomics
One of the major challenges in the proteomic analysis of skeletal muscles is the exceptionally high dynamic range, combined with the presence of large and abundant contractile proteins. These proteins generate thousands of peptides from tryptic digestion which saturate mass spectrometric detection of peptides from low abundance proteins. Geiger et al., explored the proteome of 28 mouse tissues revealing that the top 100 most abundant proteins in skeletal muscle contribute to more than 85% of the proteome, while in organs such as the kidney, they contributed to <40% of the proteome18 (Fig. 1A). In fact, comparative proteomic analysis of mouse skeletal muscle and C2C12 myotubes revealed that the top 10 most abundant skeletal muscle proteins collectively make up over 50% of total protein mass19.

A Relative intensity of the top 100 most abundant proteins in various mouse tissues. B Number of theoretical tryptic peptides generated from top 10 most abundant proteins in various mouse tissues. C Number of observed tryptic peptides in mouse skeletal muscle and heart. Figure (A) adapted from18. C adapted from20,21.
To further demonstrate how large and high abundant proteins can hypothetically influence protein detection, we calculated the number of theoretical tryptic peptides produced from the top 10 most abundant proteins in mouse skeletal muscle compared to various other tissues. Figure 1B shows that the top 10 most abundant proteins in skeletal muscle generate almost 10 times more theoretical tryptic peptides compared to other tissues with the exception of the lung. Interestingly, while heart tissues also contain large and abundant contractile proteins, many of these (i.e., titin, myosin, and nebulin) are expressed at lower levels compared to skeletal muscles. Instead, the top 10 most abundant heart proteins consist of many mitochondrial proteins such as ATP5A1 and ATP5B, which produce fewer tryptic peptides. We re-analysed our previously published heart and skeletal muscle proteomics data from identical mice where samples were prepared in parallel, peptides fractionated into the same 12 fractions, analysed on the identical LC-MS/MS system, and processed with the identical proteome database/search algorithm pipeline20,21. We identified a total of 59,187 peptides spanning 5407 proteins from mouse skeletal muscles while 75,513 peptides spanning 7578 proteins were identified from heart tissues (Fig. 1C). A total of 4,664 peptides (7.9%) were identified in skeletal muscle from the large and abundant contractile proteins Titins and Myosins, while only 3583 peptides (4.7%) were identified from these same proteins in the heart. Taken together, the complexity, size and abundance of the skeletal muscle proteome makes deep proteome coverage extremely challenging.
Separation and fractionation techniques for skeletal muscle proteomics
Comprehensive coverage of the skeletal muscle proteome requires protein and/or peptide level separation and fractionation. Early analysis of the skeletal muscle proteome of mouse gastrocnemius22, rat abdominal23, and human vastus lateralis24 were separated using a 2D gel electrophoresis approach, yielding 71, 74, and 107 protein identifications using matrix-assisted laser desorption/ionization-MS (MALDI-MS), respectively. Expectedly, most of these proteins were of myofibiliar and sarcomeric origins.
The introduction of more advanced liquid chromatography separation coupled to electrospray ionization and tandem mass spectrometry has propelled the field forward enormously, with Højlund et al. leveraging the high resolution Fourier Transform Ion Cyclotron Resonance MS to identify 954 unique protein groups in the vastus lateralis muscle in healthy men, revealing canonical proteins involved in the myofibrillar apparatus, calcium homeostasis and glucose, glycogen and lipid metabolism25. The authors detected 212 mitochondrial proteins which accounted for ~22% of the skeletal muscle proteome, compared to the expected 4.8% found in the total human proteome26.
In 2015, one of the deepest skeletal muscle proteome analyses was performed using isoelectric focusing-based fractionation of peptides derived from C57BL/6 J mouse triceps paired with C2C12 myotubes19. This allowed peptides identified in the C2C12 myotubes to be matched to peptides identified in the mouse skeletal muscles based on MS1 features and peptide retention time (commonly referred to as “Match Between Runs” [MBR]). This approach led to identification of 10,218 proteins across the model systems (C2C12: 9880 and muscle: 8309 proteins) whereas when authors analysed raw files from skeletal muscle and C2C12 separately, 30% less proteins were detected in skeletal muscle (5887 proteins). While MBR is incredibly powerful and able to significantly boost peptide identification, it may not be suitable for every application and without additional filtering, false positive identifications have been documented27. Nevertheless, this strategy combined with peptide level fractionation enabled quantification of low abundance proteins such as transcription factors—MYOD1, MYOG, MEF2C, nuclear receptors—SMAD1 and NOTCH3, and circadian clock proteins—BMAL, CRY1 and FBXL3. These authors also provided an estimation of the absolute abundance of the mouse skeletal muscle proteome using Intensity-based absolute quantification (iBAQ)28, which to date, still represents an invaluable resource to the field.
In addition to protein/peptide level fractionation at a whole muscle level, subcellular fractionation of skeletal muscle is another attractive approach to obtain greater proteome coverage of discrete organelles. Considering the importance and abundance of mitochondrial proteins, it is unsurprising that a majority of subcellular skeletal muscle proteomics studies have focused on the mitochondria. Lefort et al. employed a differential centrifugation approach to specifically enrich mitochondria in human vastus lateralis muscles29, revealing the identification of 487 mitochondrial proteins including canonical proteins involved in the electron transport chain (ETC), TCA (tricarboxylic acid) cycle, fatty acid oxidation and mitochondrial transporters. The authors also identified 9 out of the 13 mtDNA (mitochondrial DNA)-encoded proteins. Adelnia et al. then extended this investigation by performing an integrated proteomics analysis of mitochondrial energy capacity in 57 healthy adults (22 F and 35 M)19. The authors assessed in vivo mitochondrial function (31 P magnetic resonance spectroscopy) paired with LC-MS/MS of the vastus lateralis muscle, revealing 4300 proteins, for which 253 were associated with increased and 93 proteins with decreased oxidative capacity, respectively. Enrichment analysis revealed that the 30% of the proteins positively associated with energetics were localised in the mitochondria with many of the proteins being involved in complex I, ETC, fatty acid metabolism, and critic acid cycle. Since these initial investigations, several additional studies have applied similar proteomic approaches to study skeletal muscle mitochondrial changes in Duchenne muscular dystrophy (DMD)30, exercise training31,32,33,34, ageing35 and insulin sensitivity36. In addition to mitochondrial isolations, Martin et al. performed sucrose-based fractionation to isolate nuclei and identified 854 nuclear enriched proteins with an enhanced coverage of proteins associated with DNA/chromatin-binding compared to analyses of whole cell lysate (WCL)37.
Biochemical-based fractionation is another common approach to increase depth of coverage by separating classes of proteins with different physico-chemical properties. Recently, Roberts et al., separated human vastus lateralis muscle biopsies into myofibril (MyoF) and non-myofibril fractions utilising a corona nanoparticle strategy38 to identify approximately 5645 non-MyoF and 2611 MyoF proteins identified in each individual39. This approach uses multiple nanoparticles with distinct physicochemical properties that bind specific proteins. However, it should be noted that the authors did not report on the precision or accuracy of the approach. Moreover, a comprehensive analysis of how different lysate compositions influence protein capture on the nanoparticle has, to our knowledge, not been published for skeletal muscle. Martinez-Val et al. established a spatial proteomic approach using biochemical fraction and identified discrete changes in the subcellular distribution of ribosome subunits following electrical stimulation of mouse tibialis anterior (TA) muscles40. Finally, an additional spatial proteomics approach was recently introduced by Schmidt et al., which combined serial cryosectioning of mouse soleus muscle with proteomics to characterise the myotendinous junction (MTJ) proteome41. Using zonation-based clustering, the authors elegantly profiled the muscle-tendon transition. Many of the enriched proteins were also identified by serial extraction of semitendinosus muscle-tendon samples with detergents of increasing stringency42.
While these various fractionation approaches have expanded the coverage and understanding of the skeletal muscle proteome, the extended upstream separation and handling steps during sample preparation will increase technical variation and may negatively influence statistical power to detect biological changes during quantitative experiments. To evaluate this, we generated new data to compare the technical variance and proteome depth when analysing whole skeletal muscle extracted with harsh denaturing conditions (whole cell lysate [WCL]) versus a more gentle biochemical subcellular fractionation (SC) approach. A single mouse gastrocnemius muscle was powdered under liquid nitrogen and aliquoted in 6 equal amounts to rule out any biological or spatial variation. Three aliquots were extracted in harsh 6 M guanidine via repeated tip-probe sonication and heating at 95 °C representing the WCL. The remaining three aliquots were extracted with gentle mechanical lysis using (Omni International; TH-02) and four fractions generated with the Thermo Scientific Subcellular Fractionation Kit using progressively increased detergent concentration to extract organelles/proteins. SDS-PAGE and total protein staining revealed overall good fractionation with many protein bands unique to specific fractions (Fig. 2A). Analysis by single-shot LC-MS/MS using data-independent acquisition (DIA) identified a total of 2167 proteins in the WCL while 4185 in the combined four subcellular fractions (Fig. 2B). Using the SC fractions as a spectral library for the analysis of the WCL increased the number of identified proteins to 3563. The median coefficient of variation (CV) in the WCL was 20.9% which significantly increased to 31.5%, 34.6%, 28.0, and 26.4% in the SC fractions F1-F4, respectively (Fig. 2C). These data demonstrate that fractionation can increase proteome depth but often comes at the cost of increased technical variation and time. The decision to take this penalty during quantitative experiments will ultimately depend on the biological variation, intervention effect size and cohort size/replicate numbers.

A Total protein stain by SyproRuby of mouse skeletal muscle whole cell lysate (WCL) and four subcellular fractions. B Number of proteins identified in WCL and subcellular fractions by single shot LC-MS/MS with DIA. C Coefficient of variation of the proteins identified in all 3 technical replicates of the WCL and subcellular fractions.
In summary, subcellular or biochemical fractionation of the skeletal muscle proteome is an extremely important approach to increase coverage depth or to characterise the spatial distribution of distinct cellular compartment/cell types. Nonetheless, under some experimental conditions, fractionation may increase technical variation and negatively impact statistical power. While advances in fractionation and separation techniques coupled to analysis with next generation mass spectrometers has enabled deeper coverage, rapid and comprehensive identification of the skeletal muscle proteome remains incomplete. For example, Lai et al., have identified >9200 proteins from non-fractionated brain tissue43, and several groups have recently identified >10,000 proteins from non-fractionated human cells lines in <1 h of acquisition time44,45.
Fibre type specific proteome
Skeletal muscles contain a heterogenous mix of fibre types, which can be broadly characterised as slow-twitch (type I) or fast-twitch (type IIA, IIX/D, and IIB) depending on Its myosin heavy chain (MHC) expression4. Rodent skeletal muscles are known to contain the full spectrum of fibre types, whereas human skeletal muscles only contain type I, IIA, and IIX/D fibres.
Classically, type I fibres are characterised by the presence of slow-twitch MHC I (encoded by the MYH7 gene) and typically contract slowly (slow speed of shortening) but have a high oxidative capacity (from oxidative phosphorylation) to support prolonged energy utilisation (thus, these fibres are relatively fatigue resistant). Type II fibres such as IIA, IIX/D, and IIB, are characterised by the presence of MHCIIa (encoded by MYH2), MHCIIx/d (encoded by MYH1) and MHCIIb (encoded by MYH4), respectively. They generally contract faster (IIA < IIX/D < IIB) but are more fatigable (IIA < IIX/D < IIB) than type I fibres, with IIA fibres utilising a combination of oxidative and glycolytic metabolism for fuel generation, while IIX/D and IIB fibres prioritises glycolytic metabolism46,47. For example, the mouse fast-twitch extensor digitorum longus (EDL) muscle is known to be composed of mainly type II fibres ( > 90%) while mouse slow-twitch soleus muscle is composed of ~60% type II and ~40% type I fibres48,49,50. In practice, muscles often exist as a spectrum of fibre types rather than comprising of pure fibres only and generally follow a sequential fibre-type transition pattern I ↔ IIA ↔ IIX/D ↔ IIB51. Today, the use of mass spectrometry-based proteomics has enabled a deep exploration of how the structural, metabolic and functional profile of muscles can differ depending on their fibre type composition.
Drexler et al., investigated these differences by characterising the proteome of the soleus (generally more oxidative) and EDL muscles (generally more glycolytic) in the SILAC mice, revealing 2,163 shared and 551 regulated protein groups between the two52. The authors identified canonical proteins involved in Ca2+ handling, metabolism and contractility that differ between the two skeletal muscle types. Whilst whole muscle analysis of slow and fast twitch muscles can reveal differences in their proteomes53, the existence of mix fibre type composition can affect the interpretation of more subtle phenotypes. To tackle this, Murgia et al., employed single fibre proteomics label free quantification (LFQ) to characterise proteomic differences in type I, IIA, IIX/D and IIB mouse muscle fibres, stratified by fibre typing based on the presence of MYH isoforms. Through their integration of single fibre with whole muscle proteomics in the MaxQuant environment and MBR, they were able to quantify almost 3000 proteins per single fiber54. The authors managed to characterise canonical proteins such as Myomesin 3, which is upregulated in I and IIA but not IIX/D or IIB fibres, and novel proteins including STAC3—a protein involved in excitation-contraction coupling which was upregulated in IIX/D and IIB, and Mitsugumin-53/Trim72—a protein involved in membrane repair and was upregulated in type I fibres. Moreover, the authors reveal an unexpected degree of mitochondrial specialisation with the major mitochondrial pathways—OXPHOS, beta oxidation and the TCA cycle all showing significant variation in different fibre types55. The same group then took this one step further using a similar experimental approach to investigate the human muscle fibre proteome, revealing 3852 unique protein groups with 404 differentially regulated proteins between fibre types56. Many of these proteins show significant differences in abundance across the organisms and confirm that the fibers richest in mitochondria are the slow type I fibers in humans and the type IIA fibers in rodents.
While early single-fiber proteomics studies focused on freshly isolated fibers from rodents and humans, recent advancements, including optimized workflows utilizing freeze-dried samples, have expanded research possibilities by allowing the use of biobank samples from previous studies. For example, Deshmukh et al. profiled isolated single muscle fibres from rapidly snap frozen, freeze-dried human muscle biopsies57, coupled to proteomic analysis of primary human cells using LFQ, revealing 3360 unique proteins groups and 471 regulated proteins between fibre types. The same research group extended the analysis to the largest single muscle fiber proteome study where >1000 single muscle fibres were analysed58. They utilized highly sensitive instrumentation combined with a short 21-min chromatographic gradient, achieving impressive coverage of nearly 3000 proteins. This efficient workflow allows for the analysis of up to 60 single fibers daily with high reproducibility and depth. It revealed substantial heterogeneity within each fiber type and demonstrated that expression of MYH isoforms alone does not define muscle fibre’s molecular profile as variations in metabolic, ribosomal, and cell junction proteins also contribute significantly to overall diversity. These findings highlight the unique insights afforded by single-fiber analyses. It is also important to note that the use of different protein quantification algorithms can significantly affect interpretation of results. Momenzadeh et al., [PMID: 37463334], analyzed 53 human single fibers by either iBAQ [PMID: 21593866] or by MaxLFQ [PMID: 24942700], revealing that iBAQ quantification was more reliable than MaxLFQ in assigning MYH isoforms based on established literature. For example, expression of MYH4, a specialized isoform found in rodents but are present in negligible quantity in humans, was 0.3% of total MYHs when using IBAQ, compared to 10% of total MYH when using MaxLFQ.
Nonetheless, an important limitation of the above fibre-type specific proteomic analyses is that stratification was performed primarily on myosin isoform expression and no phenotypic assessment was performed on individual fibres. To overcome this limitation, Ng et al. functionally characterised calcium handling and contractile properties of mechanically skinned single fibres combined with paired proteomic analysis of the same fibre59. An advantage of this approach is rather than stratifying fibres based on myosin expression, the authors grouped fibers based on functional parameters and correlation analysis was used to associate these functional phenotypes to the abundance of 1612 proteins in a pairwise manner. For example, fibre-type specific sensitivity of the contractile apparatus to calcium is mediated by the expression of Troponin C isoforms which each bind a different number of calcium ions, and hence an expected positive correlation of the slow troponin C isoform (TNNC1) and a negative correlation of the fast troponin C (TNNC2) with the concentration of calcium needed to elicit 50% of maximal force (also known as the pCa50) was observed. In addition to these known associations, hundreds of additional novel associations were identified such as a positive correlation between ribosomal protein subunits and specific force. An important limitation to the analysis performed by Ng et al. is that phenotypic analysis of the contractile apparatus and calcium handling was performed on mechanically skinned fibers prior to proteomics analysis. This involves removal of the sarcolemma and the authors show that this results in loss of ~22% of the proteome compared to analysis of intact fibers. In a similar analysis, Seaborne et al. performed a paired proteomic analysis of myofibers isolated from human patients with nemaline myopathy coupled with analysis of the myosin super-relaxation state (SRX)60. Correlation analysis identified several associations including proteins with ATPase activity which is important given the role of myosin SRX and skeletal muscle energy expenditure. While this analysis provides new insights into potential functional roles of fibre-type protein expression and phenotypic differences, it is important to note that these associations may not be causal in nature, and both studies have not performed additional validation studies.
On the whole, proteomic analysis, especially at a single fibre level have provided novel insights into protein expression changes across fibre-types and how they change during interventions such as exercise. However, current single fiber proteomic analysis is still limited to the detection of ~3000 proteins. Looking ahead, it is important that the field further enhances methodology to improve sensitivity and mechanistically explore if these proteins contribute to the various contractile and/or metabolic differences between fibre-types. For example, research should now focus on genetic studies that over-express (e.g using fiber-type specific promotors) or knockout these proteins in specific fibre-types and assess their functional role at either the whole muscle- or single fibre-level in settings of health and/or disease.
Muscle stem cells
Skeletal muscle stem cells (MuSCs) or satellite cells, are the resident stem cells of the skeletal muscles, responsible for homeostatic regulation and regeneration following injury61. In the face of injury, quiescent satellite cells rapidly activate and re-enter the cell cycle to start proliferating, with some daughter cells differentiating to form myoblasts, and others returning to quiescence to replenish the stem cell pool. Myoblast then terminally differentiate and fuse to form multinucleated myofibres62. This section will primarily focus on studies that investigated the stem cell proteome in adult skeletal muscles rather than during development.
Zhang et al. presented one of the first characterisations of the skeletal muscle stem cell proteome. The authors utilized FACS to isolate GFP labelled MuSCs, revealing 441 MuSC exclusive proteins by 1D-gel electrophoresis, including canonical markers such as CD34, integrin α7, caveolin-1, Numb and β1-integrin63. The authors identified Protein Arginine Methyltransferase 5 (PRMT5), which upon MuSC-specific KO, resulted in the complete abolishment of muscle regeneration 7, 14 days and 4 months after injury in adults. Nonetheless, these results were not seen when KO was applied under an embryonic setting, which challenges the conventional view that embryonic and adult myogenesis are largely regulated by the same molecular cues. In 2021, Schüler et al. preformed one of the largest characterisations of the mouse MuSC by quantifying >4500 proteins across 3 age groups (3 [young], 18 [old] and 26 [geriatric] months) by single-shot label-free DIA64. Subsequent integration with whole-muscle proteomics revealed age-dependent changes in the extracellular matrix (ECM) and dysregulation of MuSC ECM proteins, including Lrp1, Egfr, Cd44 and Smoc2 of the integrin pathway. Nonetheless, a limitation of these studies is that MuSCs were isolated by first dissociating the tissue via enzymatic digestion, followed by FACS – and this process is known to activate quiescent MuSCs. Given the rapid activation of MuSC in response to stress/injury, capturing the non-activated state was performed by, Zeng et al., using an in vivo paraformaldehyde (PFA) fixing to preserve the bona fide quiescence proteome65. Central to this approach is a de-crosslinking step which is typically performed in conditions of high detergents and at elevated temperature. Resultantly, the authors characterised over ~3000 proteins, including CPEB1, which resulted in impaired activation (via decreases in MyoD—a marker of activation) and proliferation (via decreased Edu uptake—a marker of cycling) upon knockdown. Together, the authors reveal the novel role of CPEB1 as a regulator of stem cell activation. Overall, this approach holds great promise, however, the additional purification steps can result in increased technical variability and decreased statistical power as discussed.
Despite being a relatively new field, great strides have been made in characterising the proteins and pathways important for MuSC function via mass spectrometry-based proteomics. We anticipate that the combination of in vivo fixation combined with recent improvements in sample preparation, such as SP366 and others, will improve recovery of the limited protein amounts recovered from fixed FACS sorted MuSCs. Furthermore, the use of alternative detergents and/or solvents such as the MS-compatible surfactant – n-Dodecyl-β-D-Maltoside (DDM) also warrants further investigation67. In the future, the combination of improved sample recovery and miniaturised sample preparation68 will likely expand MuSC analysis to enable the quantification of low abundant PTMs such as phosphorylation. However, it is important to note that while PFA crosslinking can fix and preserve in vivo protein abundance over the time course of several minutes, whether this can preserve protein phosphorylation which can be recycled over a time frame of seconds, warrants further comprehensive analysis. Furthermore, cross-linking and de-crosslinking reaction times and conditions should be assessed to investigate potential artefacts and how these steps can influence endogenous quantification. Finally, investigating how the proteomes of non-satellite MuSCs, such as mesenchymal stem cells, remodel to maintain skeletal muscle homeostasis, would be valuable69. Overall, elucidating the proteins, their modifications and signal transduction pathways that regulate MuSC activation in vivo holds great promise to understand the regenerative capacity of skeletal muscles.
link
