Skip to main content
Plants logoLink to Plants
. 2023 Dec 30;13(1):111. doi: 10.3390/plants13010111

Proteomic Profiles of Whole Seeds, Hulls, and Dehulled Seeds of Two Industrial Hemp (Cannabis sativa L.) Cultivars

Jan Bárta 1, Pavel Roudnický 2, Markéta Jarošová 1, Zbyněk Zdráhal 2, Adéla Stupková 1, Veronika Bártová 1,*, Zlatuše Krejčová 3, Jan Kyselka 4, Vladimír Filip 4, Václav Říha 3, František Lorenc 5, Jan Bedrníček 5, Pavel Smetana 5
Editors: Maria Iorizzi, Ioan Grozescu, Adina-Elena Segneanu
PMCID: PMC10780685  PMID: 38202419

Abstract

As a source of nutritionally important components, hemp seeds are often dehulled for consumption and food applications by removing the hard hulls, which increases their nutritional value. The hulls thus become waste, although they may contain valuable protein items, about which there is a lack of information. The present work is therefore aimed at evaluating the proteome of hemp (Cannabis sativa L.) at the whole-seed, dehulled seed, and hull levels. The evaluation was performed on two cultivars, Santhica 27 and Uso-31, using LC-MS/MS analysis. In total, 2833 protein groups (PGs) were identified, and their relative abundances were determined. A set of 88 PGs whose abundance exceeded 1000 ppm (MP88 set) was considered for further evaluation. The PGs of the MP88 set were divided into ten protein classes. Seed storage proteins were found to be the most abundant protein class: the averages of the cultivars were 65.5%, 71.3%, and 57.5% for whole seeds, dehulled seeds, and hulls, respectively. In particular, 11S globulins representing edestin (three PGs) were found, followed by 7S vicilin-like proteins (four PGs) and 2S albumins (two PGs). The storage 11S globulins in Santhica 27 and Uso-31 were found to have a higher relative abundance in the dehulled seed proteome (summing to 58.6 and 63.2%) than in the hull proteome (50.5 and 54%), respectively. The second most abundant class of proteins was oleosins, which are part of oil-body membranes. PGs belonging to metabolic proteins (e.g., energy metabolism, nucleic acid metabolism, and protein synthesis) and proteins related to the defence and stress responses were more abundant in the hulls than in the dehulled seeds. The hulls can, therefore, be an essential source of proteins, especially for medical and biotechnological applications. Proteomic analysis has proven to be a valuable tool for studying differences in the relative abundance of proteins between dehulled hemp seeds and their hulls among different cultivars.

Keywords: industrial hemp, hulls, dehulled seeds, proteomic profile, 11S globulin

1. Introduction

Hemp (Cannabis sativa L.), also known as industrial hemp, has long been cultivated as a source of solid and durable fibre in the stems, oily seeds, and active compounds useful in medicine [1,2]. Compared to Cannabis indica, most industrial hemp cultivars contain low levels of the drug tetrahydrocannabinol (THC), which has enabled an increase in its acreage in much of the world today. In addition to the valuable fibre, the seeds represent another important and economically exploitable component [3].

Hemp seeds are nutritionally very valuable; on average, whole seeds contain 25–35% fat, 20–28% protein, 25–37% carbohydrate (including fibre), 5–6% ash, and 4–8% moisture [1,3,4]. A large proportion of hemp seeds are used (mainly through cold pressing) to produce hemp oil, which is valued for its high content of polyunsaturated fatty acids. A number of papers [5,6,7,8,9] report a polyunsaturated fatty acid content of over 70% in the oil, mainly the essential n-6 linoleic acid (around 55%) and n-3 alpha-linolenic acid (12–18%) and minor levels of n-6 gamma-linolenic acid (1–3%) and n-3 stearidonic acid (<1%).

In addition to the oil, hemp seeds also contain easily digestible protein with a good amino acid composition, in which arginine plays an important role. Compared to soybean proteins, hemp seed proteins do not contain protease inhibitors in large amounts and are also considered less allergenic than proteins from other plant seeds [10]. Like the seeds of other dicotyledonous plants, hemp seed proteins are mainly represented by the globulin fraction (around 75%), with the rest being albumins [11,12].

Globulins are predominantly represented by the legumin-like 11S protein family [2,13], which, in hemp seed, is represented by edestin, which has a hexameric structure in its native state [12,14,15]. The edestin monomer has a molecular weight (MW) of 52–54 kDa and is composed of acidic (MW around 34 kDa) and basic (MW 20–25 kDa) polypeptides, which are linked together by a disulphide bond [12]. Edestin is a storage protein in the seed and is present in 60–80% of the total protein pool of hemp seeds [2,14]. The minor fraction of globulins is represented by the trimeric 7S protein (MW monomer 47-48 kDa), one of the vicilin-like proteins [4,16]. Albumin is mainly represented by a 10 kDa protein (2S albumin), which is 18% by weight of sulphur amino acids (cysteine, methionine) and, therefore, belongs to the group of sulphur-rich proteins [12].

Regarding the study of the protein profile of hemp seeds, several proteomic studies are available that deal with the analysis of proteins in whole seeds [17,18], in protein isolates [4,10], or in products after extrusion treatment [19]. Some studies [20,21,22] also present proteomic techniques as a tool suitable for verifying the authenticity of foods and their composition or assessing foods for the presence of allergens, which can be used to verify the addition of flours prepared from oilseeds (including hemp seeds) to food products. Most of the works [4,10,18,19,22] mentioned edestins or 11S globulins as major proteins in hemp seeds. Other proteins identified included proteins responsible for the biocatalysis of metabolic pathways, proteins related to nucleic acid and protein synthesis, proteins associated with membrane structures and functions, and stress proteins [17,18].

The above-mentioned studies [4,10,17,18,19] addressed protein number and identification using available databases but did not compare the relative abundance of identified items within the hemp seed protein pool. Moreover, in these studies, only one hemp cultivar was always used as the source of seed material, so information on varietal differences is lacking. Hemp seeds (more correctly, hemp achene) are dehulled for direct consumption and other food applications, which involves removing the hard, difficult-to-eat hull consisting of the pericarp and the seed coat layers, which contain primarily insoluble and indigestible fibre. On the other hand, dehulling increases the concentration of fat and protein in the dehulled seed [23]. It may thus alter the relative abundance of protein items in the dehulled seed compared to the whole (unhulled) seed. The hulls themselves can be finely ground, and the resulting powder (flour) can be used to fortify bakery, meat, or other food products with fibre and vegetable protein. However, the protein profiles of the hulls in terms of the presence and relative abundance of the different protein items have not been investigated, and information is completely lacking. Knowledge of the differences between the proteomes of the pericarp and the dehulled seed may be essential from the biological, nutritional, and technological points of view, as well as from the point of view of potential health risks and food safety.

Nowadays, the gel-free approach utilising LC-MS/MS analysis is the most efficient tool allowing the deep coverage of proteomes. The perspectives and potential of seed proteomics were recently reviewed by Smolikova et al. [24].

To increase the knowledge about the possibilities of using hemp seeds, dehulled seeds, and hulls as protein sources, the objectives of this study were (i) to perform proteomic characterisation at the level of whole seeds, dehulled seeds, and hulls; (ii) to evaluate the differences in the relative abundance of significant proteins and protein classes in these seed products in two industrial hemp cultivars.

2. Results and Discussion

2.1. Protein and Fat Contents in Original and Defatted Seed Product Samples

Protein and fat are the most valuable components of hemp seeds. As shown in Table 1, for both cultivars evaluated, Uso-31 and Santhica 27, the protein content of whole original seeds was found to be around 26–28% of dry matter (DM), and the fat content was around 29% of DM. The fat content of the whole seed corresponds to the usually reported data [1]; the protein content found is higher than or at the upper limit of the already-reported values of 20–25% [1] or 21.3–28.1% [3].

Table 1.

Dry matter, protein, and fat contents in evaluated hemp seed samples (mean ± standard deviation).

Material Cultivar Variant Dry Matter Content (% FM) Protein Content (% DM) Fat Content (% DM)
Original Uso-31 whole seeds 92.01 ± 0.07 b 25.79 ± 1.68 e 29.40 ± 0.87 b
hulls 91.57 ± 0.13 bc 13.40 ± 0.16 h 6.19 ± 0.19 d
dehulled seeds 94.84 ± 0.07 a 35.35 ± 0.07 d 49.07 ± 0.41 a
Santhica 27 whole seeds 91.39 ± 0.05 bc 27.74 ± 1.70 e 28.65 ± 0.63 b
hulls 91.39 ± 0.86 bc 19.20 ± 0.62 g 15.43 ± 0.92 c
dehulled seeds 93.87 ± 0.62 a 35.61 ± 0.31 cd 49.34 ± 0.79 a
Defatted Uso-31 whole seeds 89.96 ± 0.17 d 37.85 ± 0.83 c 1.18 ± 0.26 e
hulls 91.19 ± 0.09 bc 17.08 ± 0.18 g 0.05 ± 0.09 e
dehulled seeds 91.38 ± 0.25 bc 71.74 ± 0.15 a 0.27 ± 0.15 e
Santhica 27 whole seeds 89.76 ± 0.06 d 35.83 ± 0.89 cd 0.64 ± 0.13 e
hulls 90.63 ± 0.20 cd 21.79 ± 0.21 f 0.23 ± 0.19 e
dehulled seeds 90.09 ± 0.25 d 68.50 ± 0.44 b 0.04 ± 0.04 e

DM—dry matter; FM—fresh matter; Different letters in columns indicate the statistically significant difference at p < 0.05 (Tukey HSD test).

The dehulled original (non-defatted) seed has increased protein and fat contents, with values of 35% and 49%, respectively, in the two cultivars evaluated compared to the original whole seed (Table 1), which is roughly consistent with the literature reports [23,25]. In contrast, according to the literature data, significantly lower protein and fat contents were found in hulls, which have a high fibre content [25]. In the case of hulls, the effect of the cultivar on the separation of the hull from the rest of the seed seems to be more pronounced, as the reduction in the protein and fat contents of hulls relative to the whole seed does not occur in the same proportion in the two cultivars evaluated. In the case of the cultivar Uso-31, there was a reduction in the protein content of the hulls from 25.8 to 13.4% of DM (relative reduction to 51.9% content of original seed) and a reduction in the fat content from 29.4 to 6.2% (relative reduction to 21.1%). In the case of Santhica 27, there was a lower reduction in the protein and fat contents of the hulls, close to the values in the whole seed: the protein content was reduced from 27.7 to 19.2% (relative reduction to 69.3%), and the fat content was reduced from 28.7 to 15.4% (relative reduction to 53.7%).

For the defatted variants of the samples, which were subsequently subjected to detailed proteomic analysis, the fat content was reduced to below 1% of dry matter in almost all samples due to defatting with an organic solvent. The protein content increased in all samples due to defatting, with the dehulled seed samples showing an increase in protein content to above 65%, the value reported as the minimum protein content for protein concentrate products [26]. The increase in protein content is correlated with the original fat content removal. Thus, the highest growth in protein content was obtained in the dehulled seeds, from 35.4 to 71.7% of DM for the cultivar Uso-31 and from 35.6 to 68.5% of DM for the cultivar Santhica 27. Shen et al. [27] also observed increased protein content in flour and protein isolates prepared from dehulled seeds compared to the non-dehulled variants. However, they found only 41.8% protein content (Nx6.25) and 8.8% residual fat content (at 4.3% moisture content) for flour prepared from dehulled seeds after mechanical defatting, which is lower than our findings.

The results indicate that the mechanical separation of the hulls from hemp seeds is one of the two processes that significantly affect the production of hemp seed products with different primary nutrient contents. The second process should be defatting [28], either through mechanical pressing and fat extraction using an organic solvent or direct solvent use. Combining both processes (dehulling and defatting) can be essential to obtain concentrates with a higher protein content.

2.2. Characterisation of Seed Protein Pool

The essential features of the protein profiles of defatted seed, defatted hull, and dehulled seed samples are evident after one-dimensional SDS-PAGE (Figure 1). Four main zones of protein bands (A–D) are clear for all three types of samples of the two cultivars evaluated. However, the protein bands in the four zones mentioned above have different intensities. The A-zone region, with the weakest staining intensity, should represent a 7S vicilin-like protein with an MW of 47–48 kDa, similarly reported by several studies [12,29,30,31]. The most intense protein bands are those in zones B and C, which represent acidic (MW 30–35 kDa) and basic (MW 18–20 kDa) polypeptides of the 11S protein (edestin) monomer. The D zone proteins (MW 15 kDa and less) should represent the albumin fraction of the protein. The presence of edestin and the albumin fraction in SDS-PAGE gels has also been described by Liu et al. [29], Liu et al. [30], and Fang et al. [32].

Figure 1.

Figure 1

SDS-PAGE profiles of evaluated defatted products derived from seeds of two hemp cultivars under reducing conditions (WS—defatted flour from whole seeds; H—defatted flour from milled hulls; DS—defatted flour from dehulled hemp seeds, M—protein molecular weight standard ROTI®Mark Tricolor Xtra, Carl Roth GmbH + Co. KG, Karlsruhe, Germany).

Based on the evaluation of the proteomic data, 2833 protein groups (all belonging to Cannabis sativa L.) were identified, and their relative protein abundances in individual samples were determined and expressed in ppm. For further assessment of the protein pool, only protein groups (hereafter referred to as proteins or protein items) with a relative abundance greater than 1000 ppm (0.1%) in at least one of the three independent replicates for one of the three sample types (original or defatted whole seed, hulled seed, or hull) were considered. Eighty-eight of these major proteins were found (hereafter referred to as collection MP88). Their list, names, and primary data (molecular weight; the name of the corresponding protein family), including a more detailed characterisation of the proteins in the areas of biological process, molecular function, and cellular component, obtained from the UniProt KB database, are given in Table 2. Based on this detailed characterisation, the MP88 proteins were divided into ten protein classes: seed storage proteins (SP), oleosins (OL), other membrane components (MC), proteins involved in energy and metabolism (EM), translation-related proteins (TR), proteins related to DNA and RNA metabolism (DM), stress response and defence proteins (SD), proteins related to photosynthesis (PS), cytoskeleton and transport proteins (CT), and uncharacterised proteins (UP).

Table 2.

Description of evaluated proteins (information from UniProt KB database) included in MP88 collection.

Accession Protein Name Mol. Mass (Da) Protein Families Biological Process—Molecular Function—Cellular Component
Seed storage proteins (SP)
A0A7J6GWL5 Cupin type-1 domain-containing protein 109,151 11S seed storage protein—globulins family x—nutrient reservoir activity—x
A0A7J6DTA7 Cupin type-1 domain-containing protein 52,031 11S seed storage protein—globulins family x—nutrient reservoir activity—x
A0A7J6E205 Cupin type-1 domain-containing protein 62,389 11S seed storage protein—globulins family x—nutrient reservoir activity—membrane
A0A7J6H2R3 Cupin type-1 domain-containing protein 108,332 x—x—membrane
A0A7J6GLH5 Cupin type-1 domain-containing protein 108,083 x—x—membrane
A0A7J6HAT3 Cupin type-1 domain-containing protein 25,314 x—x—x
A0A7J6G321 Cupin type-1 domain-containing protein 55,360 x—x—x
A0A7J6H292 Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain-containing protein 16,752 2S seed storage albumins family; Plant LTP family x—lipid binding; nutrient reservoir activity—x
A0A7J6DXD1 Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain-containing protein 17,558 2S seed storage albumins family; Plant LTP family x—lipid binding; nutrient reservoir activity—x
Oleosins (OL)
A0A7J6EJ89 Oleosin 15,410 Oleosin family reproductive process; post-embryonic development—x—membrane; monolayer-surrounded lipid storage body
A0A7J6F0Y4 Oleosin 17,355 Oleosin family reproductive process; post-embryonic development—x—membrane; monolayer-surrounded lipid storage body
A0A7J6H4F6 Oleosin 16,884 Oleosin family reproductive process; post-embryonic development—x—membrane; monolayer-surrounded lipid storage body
Other membrane components (MC)
A0A7J6I425 Verticillium wilt resistance-like protein 124,317 Receptor-like protein family x—x—plasma membrane
A0A7J6F280 Peroxygenase 27,243 Caleosin family x—x—membrane
A0A7J6G6U3 Uncharacterized protein 71,299 x—oxidoreductase activity—membrane
A0A7J6DNR1 Uncharacterized protein 64,236 x—x—membrane
A0A7J6DPR7 Uncharacterized protein 20,132 x—x—membrane
A0A7J6ENC9 Uncharacterized protein 19,908 x—transmembrane transporter activity—membrane
A0A7J6GSC3 Aquaporin TIP3-2 27,504 MIP/aquaporin (TC 1.A.8) family x—channel activity—membrane
A0A7J6I7S9 Uncharacterized protein 41,884 x—x—membrane
Proteins involved in energy and metabolism (EM)
A0A7J6G6Z3 Glyceraldehyde 3-phosphate dehydrogenase NAD(P) binding domain-containing protein 31,874 Glyceraldehyde-3-phosphate dehydrogenase family x—NAD binding; oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor —x
A0A7J6E4U9 Triose-phosphate isomerase 30,435 Triosephosphate isomerase family glycolytic process—triose-phosphate isomerase activity—x
A0A7J6EFG0 Uncharacterized protein 46,262 x—oxidoreductase activity—x
A0A7J6EJG0 Protein disulfide-isomerase 56,660 Protein disulfide isomerase family x—protein disulfide isomerase activity—endoplasmic reticulum lumen
A0A7J6E5J2 Fructose-bisphosphate aldolase 38,370 Class I fructose-bisphosphate aldolase family glycolytic process—fructose-bisphosphate aldolase activity—x
A0A7J6EZ77 Malate dehydrogenase 36,570 LDH/MDH superfamily, MDH type 1 family malate metabolic process; tricarboxylic acid cycle—L-malate dehydrogenase activity—x
A0A7J6HK40 Glyceraldehyde 3-phosphate dehydrogenase NAD(P) binding domain-containing protein 31,950 Glyceraldehyde-3-phosphate dehydrogenase family x—NAD binding; oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor—x
A0A7J6GRW8 Phosphopyruvate hydratase 46,384 Enolase family glycolytic process—magnesium ion binding; phosphopyruvate hydratase activity—phosphopyruvate hydratase complex
A0A7J6EG53 Peptidyl-prolyl cis-trans isomerase 18,171 Cyclophilin-type PPIase family protein folding; protein peptidyl-prolyl isomerization—peptidyl-prolyl cis-trans isomerase activity—x
A0A7J6FNP6 Malate synthase 63,098 Malate synthase family glyoxylate cycle; tricarboxylic acid cycle—malate synthase activity—glyoxysome
A0A7J6E8J3 Malate dehydrogenase 36,478 LDH/MDH superfamily, MDH type 2 family malate metabolic process; tricarboxylic acid cycle; malate metabolic process; tricarboxylic acid cycle—L-malate dehydrogenase activity—membrane
A0A7J6DST1 NADP-dependent oxidoreductase domain-containing protein 38,559 Aldo/keto reductase family x—oxidoreductase activity—x
A0A7J6HKH5 Nucleoside-diphosphate kinase 16,290 NDK family CTP biosynthetic process; GTP biosynthetic process; UTP biosynthetic process—nucleoside diphosphate kinase activity—x
A0A7J6HTX3 Tyrosinase copper-binding domain-containing protein 66,851 Tyrosinase family pigment biosynthetic process—catechol oxidase activity; metal ion binding—x
A0A7J6HQA0 NADH-cytochrome b5 reductase 31,100 Flavoprotein pyridine nucleotide cytochrome reductase family x—cytochrome-b5 reductase activity, acting on NAD(P)H—membrane
Stress response and defence proteins (SD)
A0A7J6DVP5 rRNA N-glycosylase 28,760 Ribosome-inactivating protein family defence response; negative regulation of translation—rRNA N-glycosylase activity; toxin activity—x
A0A7J6FXL4 Dehydrin 29,312 Plant dehydrin family response to abscisic acid; response to cold; response to water deprivation—metal ion binding—x
A0A7J6I6U6 Uncharacterized protein 59,963 LEA type 4 family x—x—x
A0A7J6G382 Uncharacterized protein 17,523 Small heat shock protein (HSP20) family response to stress—x—x
A0A7J6HZ01 Annexin 36,065 Annexin family response to stress—calcium ion binding; calcium-dependent phospholipid binding—x
A0A7J6I4E9 18 kDa seed maturation protein 15,710 LEA type 1 family embryo development ending in seed dormancy—x—x
A0A7J6FL33 Late embryogenesis abundant protein D-29 30,417 x—x—x
A0A7J6GTA4 SHSP domain-containing protein 17,897 Small heat shock protein (HSP20) family x—x—x
A0A7J6FPH5 Lactoylglutathione lyase 32,764 Glyoxalase I family x—lactoylglutathione lyase activity; metal ion binding—x
A0A7J6DT98 Peroxiredoxin 24,133 Peroxiredoxin family, Prx6 subfamily x—thioredoxin-dependent peroxiredoxin activity—x
A0A7J6GJC9 Dehydroascorbate reductase 23,719 GST superfamily, DHAR family ascorbate glutathione cycle—glutathione dehydrogenase (ascorbate) activity; glutathione transferase activity—x
A0A7J6F3P9 Catalase 57,402 Catalase family hydrogen peroxide catabolic process; response to oxidative stress—catalase activity; heme binding; metal ion binding—x
A0A7J6HTR6 Glutaredoxin domain-containing protein 14,424 Glutaredoxin family, CPYC subfamily x—x—x
A0A7J6FRN5 Alcohol dehydrogenase 41,112 Zinc-containing alcohol dehydrogenase family x—oxidoreductase activity; zinc ion binding—x
Proteins related to DNA and RNA metabolism (DM)
A0A7J6EYT6 PPC domain-containing protein 30,708 x—minor groove of adenine-thymine-rich DNA binding—x
A0A7J6E9G4 Histone H4 11,409 Histone H4 family x—DNA binding; protein heterodimerization activity; structural constituent of chromatin—nucleosome; nucleus
A0A7J6E6Z7 Histone H2A 15,148 Histone H2A family x—DNA binding; protein heterodimerization activity; structural constituent of chromatin—nucleosome; nucleus
A0A7J6E9K3 Histone H2B 16,235 Histone H2B family x—DNA binding; protein heterodimerization activity; structural constituent of chromatin —nucleosome; nucleus
A0A7J6HBN5 ATP-dependent RNA helicase 33,486 DEAD box helicase family x—ATP binding; hydrolase activity; RNA binding; RNA helicase activity—x
Translation-related proteins (TR)
A0A7J6HCW3 Elongation factor 1-alpha 49,259 TRAFAC class translation factor GTPase superfamily, Classic translation factor GTPase family, EF-Tu/EF-1A subfamily x—GTP binding; GTPase activity; translation elongation factor activity—x
A0A7J6GTP8 60S acidic ribosomal protein P0 33,920 Universal ribosomal protein uL10 family ribosome biogenesis—x—ribonucleoprotein complex; ribosome
A0A7J6HHK4 Translation elongation factor EF1B beta/delta subunit guanine nucleotide exchange domain-containing protein 25,124 EF-1-beta/EF-1-delta family x—translation elongation factor activity—eukaryotic translation elongation factor 1 complex
A0A7J6F1D8 KH type-2 domain-containing protein 28,920 Universal ribosomal protein uS3 family translation—RNA binding; structural constituent of ribosome—ribonucleoprotein complex; ribosome
A0A7J6HVI4 Ribosomal_L28e domain-containing protein 16,563 Eukaryotic ribosomal protein eL28 family translation—structural constituent of ribosome—ribonucleoprotein complex; ribosome
A0A7J6E6C3 Ribosomal protein L7 30,458 Universal ribosomal protein uL30 family maturation of LSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA)—structural constituent of ribosome—cytosolic large ribosomal subunit
A0A7J6I4S9 Ribosomal protein L6 N-terminal domain-containing protein 25,942 Eukaryotic ribosomal protein eL6 family translation—structural constituent of ribosome—ribonucleoprotein complex; ribosome
A0A7J6I083 60S acidic ribosomal protein P1 16,784 Eukaryotic ribosomal protein P1/P2 family translational elongation—structural constituent of ribosome—ribonucleoprotein complex6; ribosome
A0A7J6F1J9 40S ribosomal protein S14 16,389 Universal ribosomal protein uS11 family translation—structural constituent of ribosome—ribonucleoprotein complex; ribosome
A0A7J6GA02 Ribosomal_S7 domain-containing protein 22,343 Universal ribosomal protein uS7 family translation—RNA binding; structural constituent of ribosome—small ribosomal subunit
A0A7J6G6F9 60S ribosomal protein L22-2; peroxidase 14,052 Eukaryotic ribosomal protein eL22 family; Peroxidase family translation; hydrogen peroxide catabolic process; response to oxidative stress—structural constituent of ribosome; heme binding; lactoperoxidase activity—ribonucleoprotein complex; ribosome; extracellular region
A0A7J6I796 60S ribosomal protein L12 17,790 Universal ribosomal protein uL11 family translation—structural constituent of ribosome—ribonucleoprotein complex; ribosome
A0A7J6HT66 40S ribosomal protein S9-2 23,142 Universal ribosomal protein uS4 family translation—rRNA binding; structural constituent of ribosome—small ribosomal subunit
A0A7J6ENY3 RRM domain-containing protein 11,250 x—nucleic acid binding—x
A0A7J6FAW9 40S ribosomal protein S18 17,635 Universal ribosomal protein uS13 family translation—rRNA binding; structural constituent of ribosome—ribonucleoprotein complex; ribosome
A0A7J6I9M7 Ribosomal_L18e/L15P domain-containing protein 20,840 Eukaryotic ribosomal protein eL18 family translation—mRNA binding; structural constituent of ribosome—ribonucleoprotein complex; ribosome
A0A7J6FX59 KOW domain-containing protein 16,767 Universal ribosomal protein uL24 family translation—RNA binding; structural constituent of ribosome—large ribosomal subunit
A0A7J6F744 Ribosomal_L14e domain-containing protein 15,331 Eukaryotic ribosomal protein eL14 family translation—RNA binding; structural constituent of ribosome—ribosome
A0A7J6E9P9 60S ribosomal protein L27 15,794 Eukaryotic ribosomal protein eL27 family translation—structural constituent of ribosome—ribonucleoprotein complex; ribosome
A0A7J6H424 50S ribosomal protein L23, chloroplastic 17,400 Universal ribosomal protein uL23 family; DHBP synthase family translation; riboflavin biosynthetic process—mRNA binding; rRNA binding; structural constituent of ribosome; 3,4-dihydroxy-2-butanone-4-phosphate synthase activity; GTP binding—ribonucleoprotein complex; ribosome
A0A7J6EFK0 40S ribosomal protein S17 16,185 Eukaryotic ribosomal protein eS17 family translation—structural constituent of ribosome—ribonucleoprotein complex; ribosome
A0A7J6G5C7 60S ribosomal protein L23 14,997 Universal ribosomal protein uL14 family translation—structural constituent of ribosome—ribonucleoprotein complex; ribosome
A0A7J6GWW0 40S ribosomal protein S26 14,763 Eukaryotic ribosomal protein eS26 family translation—structural constituent of ribosome; nucleotidyltransferase activity—ribonucleoprotein complex; ribosome
A0A7J6E4Y6 60S acidic ribosomal protein P3 11,903 Eukaryotic ribosomal protein P1/P2 family translational elongation—structural constituent of ribosome—ribonucleoprotein complex; ribosome
Proteins related to photosynthesis (PS)
A0A7J6GUI2 Chlorophyll a-b binding protein, chloroplastic 28,276 Light-harvesting chlorophyll a/b-binding (LHC) protein family photosynthesis; light harvesting—chlorophyll binding—chloroplast thylakoid membrane; photosystem I; photosystem II
A0A7J6GSM8 Chlorophyll a-b binding protein, chloroplastic 28,429 Light-harvesting chlorophyll a/b-binding (LHC) protein family photosynthesis, light harvesting—chlorophyll binding—chloroplast thylakoid membrane; photosystem I; photosystem II
Cytoskeleton and transport proteins (CT)
A0A7J6I6X2 MD-2-related lipid-recognition domain-containing protein 16,057 intracellular sterol transport—sterol binding—x
A0A7J6I828 Actin 41,726 Actin family x—ATP binding—cytoplasm; cytoskeleton
A0A0M5M1Z3 ATP synthase subunit alpha 55,324 ATPase alpha/beta chains family actin filament bundle assembly; actin filament capping—actin filament binding; ATP binding; proton-transporting ATP synthase activity, rotational mechanism—mitochondrial inner membrane; proton-transporting ATP synthase complex, catalytic core F(1)
Uncharacterised proteins (UP)
A0A7J6FR97 Uncharacterized protein 35,774 x—x—x
A0A7J6FP03 Uncharacterized protein 16,800 x—x—x
A0A7J6I334 Uncharacterized protein 15,922 x—x—x
A0A7J6GUW7 Uncharacterized protein 13,786 x—x—x
A0A7J6HWC7 Uncharacterized protein 17,135 x—x—x

x—information in UniProt KB database is not available.

It should be noted that the number of proteins classified in these classes and their relative abundances in the samples of the three types reflect the condition of the maturing/mature seed. This state is associated with physiological processes involving the accumulation of di- and oligosaccharides, the synthesis of storage proteins, LEA proteins and heat-shock proteins, and the activation of antioxidant defences, in addition to the increasing dry matter content and the physical, structural changes in the cells [33]. In comparison to our experimental setup, Park et al. [17], in their study of the proteomic profiling of Cheungsam (Cannabis sativa L.) seeds using a combination of 2D PAGE and nano-LC-MS/MS, found 168 identified unique protein spots out of a total of 1102 spots resolved by the 2D PAGE separation of the extracted hemp seed protein mixture due to the lack of hemp protein sequences. Park et al. [17] used the database information of rice and Arabidopsis genomes, which had been completely sequenced at the time, to identify the proteins. The identified proteins were classified into 13 categories according to their function, as listed in the SWISS-PROT and NCBI databases.

2.2.1. Seed Storage Proteins

The SP class represents the most abundant group: seed storage proteins. The MP88 dataset included nine proteins. Three proteins (A0A7J6GWL5, A0A7J6DTA7, A0A7J6E205) were classified into the 11S family of seed storage proteins, commonly referred to as edestin in hemp seeds [12,14]; then, two proteins (A0A7J6H292, A0A7J6DXD1) were assigned to 2S albumins, and the remaining four proteins (A0A7J6H2R3, A0A7J6GLH5, A0A7J6HAT3, and A0A7J6G321) belong to the globulin family, and some of them belong to the 7S family (see Table 2). The names of these proteins do not directly imply that they are 7S proteins, but additional information found in the UniProt KB database suggests their similarity to 7S vicilin-like proteins. The occurrence of the three types of seed storage proteins mentioned above agrees with most papers that have investigated hemp seed proteins [10,16,22,34].

The main storage protein in hemp seeds is edestin. Docimo et al. [34], in their study on the molecular characterisation of the edestin gene family, found seven genes encoding seven forms of edestin, which can be divided into two types of edestin based on the sequence similarity—edestin 1 (includes four forms) and edestin 2 (involves three forms). In subsequent work [16], edestin type 3 was additionally found, which resembles type 1 more than type 2. The three above items of 11S proteins that we found are not directly identified as edestin in the UniProt KB database version used for database searching (they are named Cupin type-1 domain-containing protein) but show high sequence similarity to the edestins described in previous works [16,34]. The storage protein A0A7J6GWL5 has, according to the UniProt KB database, an MW of 109.15 kDa and is composed of a sequence of 953 amino acids, showing a close identity (99.8%) to edestin 1 (A0A090DLH8) in region 8-511, as reported in the UniProt database by, e.g., Mamone et al. [10] and Kotecka-Majchrzak et al. [22]. The second found 11S storage protein item (A0A7J6DTA7) has an MW of 52.03 kDa, is composed of 456 amino acids, and is consistent with forms of edestin 2—with item A0A090DLI7 at 98.1%, A0A090CXP9 at 97.8%, and A0A090CXP8 at 98.1%. The form of edestin 3 was not found in our experiment.

2.2.2. Proteins Involved in Energy and Metabolism

The class of proteins involved in energy and metabolism is represented by 15 proteins in the MP88 set. The most significantly represented items are those related to glycolysis, the tricarboxylic acid cycle (TCA), and other central metabolic processes, which are closely related to the physiological state of mature stored seed with preserved germination potential [33]. The importance of enzymes related to glycolysis, TCA, and the glyoxylate cycle for the metabolism of maturing oilseeds was also confirmed by Hajduch et al. [35].

Specifically, of the glycolytic enzymes, two glyceraldehyde-3-phosphate dehydrogenase items were represented in the MP88 set (items A0A7J6G6Z3 and A0A7J6HK40), in addition to triosephosphate isomerase (A0A7J6E4U9), fructose-bisphosphate aldolase (A0A7J6E5J2), and phosphopyruvate hydratase or enolase (A0A7J6GRW8). The TCA cycle is represented by two items of malate dehydrogenase (A0A7J6EZ77 and A0A7J6E8J3), and the glyoxylate cycle is represented by the enzyme malate synthase (A0A7J6FNP6).

Other important items with catalytic activity are involved in protein modification (protein disulfide-isomerase (A0A7J6EJG0) and peptidyl-prolyl cis-trans isomerase (A0A7J6EG53)), nucleotide metabolism (nucleoside-diphosphate kinase (A0A7J6HKH5)), etc. Other enzymes are related to electron and proton transfer in energy metabolism, e.g., NADH-cytochrome b5 reductase (A0A7J6HQA0) and ATP synthase (A0A0M5M1Z3). Due to their membrane localisation, these enzymes could also be classified as membrane components. Similarly, Park et al. [17] reported that the groups of proteins and enzymes related to basic metabolism and energy production in hemp seeds are the most numerous and diverse group of proteins. Their dataset also mainly lists enzymes of glycolysis, enzymes of the TCA cycle, and other important metabolic pathways.

2.2.3. Membrane Proteins

The representation of membrane proteins, which include two classes, namely, oleosins (OL) and other membrane components (MC), is significant. The OL are proteins involved in the structure of oil-body membranes called oleosomes [36]. In hemp seed cells, these are droplet-like structures (droplets) with a diameter of 3–5 µm, in whose membranes the membrane-specific oleosin proteins play an important structural role, with an estimated MW of ≈15 kDa [37]. In addition to oleosins, caleosins and steroleosins have also been reported as fat-body membrane proteins [38]. In the group of major proteins, three oleosin items (A0A7J6EJ89, A0A7J6F0Y4, A0A7J6H4F6) were found with MWs ranging from 15.41 to 17.36 kDa, which together represent about 8% of hemp seed proteins. The second highest relative abundance of oleosins in the hemp seed protein pool (after seed storage proteins) is thus closely related to the high fat accumulation in hemp seed tissue. Among the items of the MC class, which is represented by eight proteins, one can find a caleosin protein with peroxygenase activity (A0A7J6F280) and, e.g., aquaporin (A0A7J6GSC3), which functions as a transmembrane channel. Most of the other proteins are designated as uncharacterised proteins.

2.2.4. Proteins Involved in Stress Response and Defence

A total of 14 items were found in the MP88 dataset that can be classified as SD. These include LEA (late embryogenesis abundant) proteins (A0A7J6FL33, A0A7J6I6U6), including dehydrins (A0A7J6FXL4) as well as heat-shock proteins (A0A7J6GTA4, A0A7J6G382) and proteins associated with antioxidant protection against ROS (Reactive Oxygen Species), such as catalase (A0A7J6F3P9), peroxiredoxin (A0A7J6DT98), dehydroascorbate reductase (A0A7J6GJC9), glutaredoxin domain-containing protein (A0A7J6HTR6), and lactoylglutathione lyase or glyoxylase I (A0A7J6FPH5).

From the spectrum of stress-related proteins listed above, it is evident that the maturing/ripe seed and the developing/developed embryo must cope with progressive desiccation, temperature changes, and oxidative stress or oxygen deficiency. The occurrence of alcohol dehydrogenase in plants has been linked to oxygen deficiency.

Regarding seed defence, the occurrence of the enzyme rRNA N-glycosidase (A0A7J6DVP5) is significant. This enzyme generally belongs to the group of ribosome-inactivating proteins (RIPs), which are toxic because they inhibit proteosynthesis and are among the defence proteins [39]. Despite its toxicity, this group of proteins has the potential for possible applications in medicine or agriculture [40].

Park et al. [17] found, in their work on mature industrial hemp seeds, a similar spectrum of stress- or defence-related proteins—heat-shock proteins, glyoxylate reductase, dehydroascorbate reductase, alcohol dehydrogenase, and peroxiredoxin; in addition, they reported superoxide dismutase, which was also found in our samples, not in the MP88 set but in the set of all proteins. Aiello et al. [18] reported only Glyoxalase I and several variants of heat-shock proteins in their work focused on the proteomic characterisation of hemp seeds.

2.2.5. Other Protein Classes

Common cellular proteins include those involved in nucleic acid metabolism and subsequent protein synthesis during translation. The items included in the DM class are mainly represented by histone protein subunits (A0A7J6E9G4, A0A7J6E6Z7, A0A7J6E9K3). The TR class has the most significant number of items in the set of major proteins, as well as the EM class; in most cases (20 out of 24 items), these are ribosome subunits, e.g., two items of 60S ribosomal protein (A0A7J6GTP8, A0A7J6E9P9), as well as 40S (A0A7J6GWW0) and 50S (A0A7J6H424) ribosomal proteins and several other proteins involved in translation, e.g., elongation factor 1-alpha (A0A7J6HCW3) or EF1_GNE domain-containing protein (A0A7J6HK4).

The chloroplastic chlorophyll a-b-binding protein represents the PS class, and the CT class is represented by a protein related to intracellular sterol transport and by a protein from the actin family. Six items listed in the database had no characteristics and were therefore assigned to the UP class.

2.3. Effect of Cultivar and Dehulling on the Hemp Seed Protein Profile

The relative abundance of protein items in the MP88 set in the total protein profiles of whole seeds, hulls, and dehulled seeds with respect to the two cultivars studied (Uso-31, Santhica 27) is shown in Table 3. The percentages of the protein classes (including the group of proteins not included in the MP88 set) are expressed using pie charts in Figure 2.

Table 3.

Relative abundance of the MP88 collection proteins (in ppm) in defatted hemp seed products.

Accession Protein Name Uso-31 Santhica 27
Whole Seed Hulls Dehulled Seeds Whole Seed Hulls Dehulled Seeds
A0A7J6GWL5 Cupin type-1 domain-containing protein 292,953 abc 249,256 cd 301,796 ab 265,551 bcd 225,575 d 316,781 a
A0A7J6DTA7 Cupin type-1 domain-containing protein 336,178 ab 290,488 bc 358,906 a 319,042 abc 278,002 c 350,150 a
A0A7J6E205 Cupin type-1 domain-containing protein 2991 b 734 c 5698 a 1767 bc 1546 bc 2397 b
A0A7J6H2R3 Cupin type-1 domain-containing protein 6299 ab 7324 a 6399 ab 5745 b 6426 ab 6214 ab
A0A7J6GLH5 Cupin type-1 domain-containing protein 3410 a 3532 a 3127 a 2708 a 2851 a 2780 a
A0A7J6HAT3 Cupin type-1 domain-containing protein 5750 bc 7593 a 5372 c 5890 bc 6958 ab 5461 c
A0A7J6G321 Cupin type-1 domain-containing protein 16,117 b 18,916 a 15,978 b 10,262 d 13,227 c 10,338 d
A0A7J6H292 Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain-containing protein 19,306 a 21,984 a 17,488 a 15,188 a 12,731 a 15,075 a
A0A7J6DXD1 Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain-containing protein 1052 ab 817 bc 1297 a 929 bc 681 c 909 bc
A0A7J6EJ89 Oleosin 16,729 bc 7800 d 26,828 a 12,215 cd 10,393 cd 21,992 ab
A0A7J6F0Y4 Oleosin 40,499 a 41,904 a 38,200 a 38,126 a 44,509 a 38,168 a
A0A7J6H4F6 Oleosin 25,020 a 25,250 a 29,762 a 24,470 a 28,268 a 28,554 a
A0A7J6I425 Verticillium wilt resistance-like protein 988 a 1458 a 1525 a 1547 a 1874 a 1371 a
A0A7J6F280 Peroxygenase 4675 ab 5415 a 4283 b 4015 b 4225 b 4176 b
A0A7J6G6U3 Uncharacterized protein 1205 a 1115 ab 1164 a 749 c 898 bc 721 c
A0A7J6DNR1 Uncharacterized protein 145 b 81 b 52 b 13,567 a 16,616 a 95 b
A0A7J6DPR7 Uncharacterized protein 1549 ab 1264 bc 1966 a 703 d 775 d 968 cd
A0A7J6ENC9 Uncharacterized protein 1498 c 2198 ab 1177 c 1561 bc 2410 a 1395 c
A0A7J6GSC3 Aquaporin TIP3-2 2236 a 2728 a 2585 a 3037 a 3023 a 2988 a
A0A7J6I7S9 Uncharacterized protein 2546 abc 824 d 3336 a 1959 bc 1464 cd 2655 ab
A0A7J6G6Z3 Glyceraldehyde 3-phosphate dehydrogenase NAD(P) binding domain-containing protein 3130 bc 4596 a 2123 c 3207 bc 4012 ab 2765 c
A0A7J6E4U9 Triose-phosphate isomerase 2120 a 2489 a 1862 a 1912 a 2233 a 2080 a
A0A7J6EFG0 Uncharacterized protein 2577 b 3347 a 2398 b 1993 cd 2455 b 1774 d
A0A7J6EJG0 Protein disulfide-isomerase 948 bc 1218 ab 823 c 1315 ab 1597 a 1239 ab
A0A7J6E5J2 Fructose-bisphosphate aldolase 1637 cd 2756 b 1158 d 2346 b 3594 a 1745 c
A0A7J6EZ77 Malate dehydrogenase 999 bc 1648 a 732 c 1154 b 1552 a 836 bc
A0A7J6HK40 Glyceraldehyde 3-phosphate dehydrogenase NAD(P) binding domain-containing protein 926 ab 1276 a 684 b 898 ab 1277 a 702 b
A0A7J6GRW8 Phosphopyruvate hydratase 701 bc 1043 a 638 c 807 b 1142 a 560 c
A0A7J6EG53 Peptidyl-prolyl cis-trans isomerase 4223 b 6554 a 3436 b 3861 b 6019 a 3035 b
A0A7J6FNP6 Malate synthase 816 bc 1225 a 591 d 878 b 1284 a 667 cd
A0A7J6E8J3 Malate dehydrogenase 995 bc 1289 ab 730 c 1129 bc 1792 a 939 bc
A0A7J6DST1 NADP-dependent oxidoreductase domain-containing protein 7132 a 5649 ab 7532 a 1840 b 5212 ab 7102 a
A0A7J6HKH5 Nucleoside-diphosphate kinase 1096 b 1555 a 539 c 1050 b 1602 a 802 bc
A0A7J6HTX3 Tyrosinase copper-binding domain-containing protein 266 d 1265 b 1 e 611 c 1660 a 10 e
A0A7J6HQA0 NADH-cytochrome b5 reductase 959 bc 1153 ab 340 d 1244 ab 1389 a 607 cd
A0A7J6DVP5 rRNA N-glycosylase 485 bc 2044 a 60 d 331 c 584 b 71 d
A0A7J6FXL4 Dehydrin 1169 a 983 a 1413 a 1457 a 1723 a 1265 a
A0A7J6I6U6 Uncharacterized protein 877 b 1277 a 711 bc 840 bc 1297 a 634 c
A0A7J6G382 Uncharacterized protein 5578 b 7698 a 4057 c 2919 de 3976 cd 2535 e
A0A7J6HZ01 Annexin 1282 bc 1481 b 1040 cd 1193 cd 1791 a 961 d
A0A7J6I4E9 18 kDa seed maturation protein 3226 abc 3796 a 2444 bc 3612 a 3344 ab 2104 c
A0A7J6FL33 Late embryogenesis abundant protein D-29 2567 b 2793 b 1986 b 2815 b 3770 a 2076 b
A0A7J6GTA4 SHSP domain-containing protein 1359 b 2012 a 1083 c 693 de 929 cd 536 e
A0A7J6FPH5 Lactoylglutathione lyase 2462 ab 2739 a 1947 c 2097 bc 2472 ab 1897 c
A0A7J6DT98 Peroxiredoxin 3473 bc 4054 b 2623 c 4150 b 6328 a 3130 bc
A0A7J6GJC9 Dehydroascorbate reductase 868 a 1052 a 703 ab 480 b 733 ab 724 ab
A0A7J6F3P9 Catalase 600 bc 1393 a 323 d 777 b 1421 a 399 cd
A0A7J6HTR6 Glutaredoxin domain-containing protein 2690 ab 1809 b 3306 a 2262 ab 2009 ab 2969 ab
A0A7J6FRN5 Alcohol dehydrogenase 843 b 1160 a 620 cd 690 bc 1129 a 452 d
A0A7J6EYT6 PPC domain-containing protein 1681 b 1234 b 2057 b 14,310 a 1344 b 2149 b
A0A7J6E9G4 Histone H4 4793 cd 9218 a 3183 e 5008 c 7431 b 3664 de
A0A7J6E6Z7 Histone H2A 1523 cd 3063 a 829 d 1950 bc 2576 ab 1207 cd
A0A7J6E9K3 Histone H2B 1251 bc 1921 a 1166 bc 1116 bc 1558 ab 858 c
A0A7J6HBN5 ATP-dependent RNA helicase 686 d 1147 b 464 e 821 c 1293 a 655 d
A0A7J6HCW3 Elongation factor 1-alpha 2695 bc 3321 ab 2066 c 2668 bc 3676 a 2345 c
A0A7J6GTP8 60S acidic ribosomal protein P0 882 cd 1325 b 749 d 1117 bc 1698 a 825 cd
A0A7J6HHK4 Translation elongation factor EF1B beta/delta subunit guanine nucleotide exchange domain-containing protein 811 abc 946 ab 593 c 932 ab 1065 a 749 bc
A0A7J6F1D8 KH type-2 domain-containing protein 696 de 899 bc 515 e 973 b 1239 a 727 cd
A0A7J6HVI4 Ribosomal_L28e domain-containing protein 1077 bc 1341 ab 728 c 1301 abc 1864 a 1283 abc
A0A7J6E6C3 Ribosomal protein L7 691 cd 1031 ab 530 d 854 bc 1163 a 618 d
A0A7J6I4S9 Ribosomal protein L6 N-terminal domain-containing protein 620 cd 815 b 562 d 759 bc 1061 a 636 bcd
A0A7J6I083 60S acidic ribosomal protein P1 753 d 977 bc 803 cd 1045 ab 1204 a 854 bcd
A0A7J6F1J9 40S ribosomal protein S14 1583 bc 1859 b 1190 c 1582 bc 2413 a 1300 c
A0A7J6GA02 Ribosomal_S7 domain-containing protein 804 ab 893 ab 593 b 703 b 1360 a 683 b
A0A7J6G6F9 60S ribosomal protein L22-2; peroxidase 1520 bc 1897 ab 1152 c 1664 bc 2307 a 1241 c
A0A7J6I796 60S ribosomal protein L12 869 bc 1133 ab 685 c 948 bc 1279 a 785 c
A0A7J6HT66 40S ribosomal protein S9-2 915 b 1336 a 544 c 961 b 1286 a 728 bc
A0A7J6ENY3 RRM domain-containing protein 397 b 1022 a 403 b 1068 a 1321 a 525 b
A0A7J6FAW9 40S ribosomal protein S18 733 b 1031 ab 673 b 1000 ab 1212 a 806 b
A0A7J6I9M7 Ribosomal_L18e/L15P domain-containing protein 886 bcd 1093 b 663 d 936 bc 1458 a 793 cd
A0A7J6FX59 KOW domain-containing protein 982 ab 955 ab 815 b 804 b 1453 a 953 ab
A0A7J6F744 Ribosomal_L14e domain-containing protein 1065 b 1480 a 683 c 1047 b 1470 a 929 bc
A0A7J6E9P9 60S ribosomal protein L27 835 b 1374 a 767 b 1211 a 1216 a 951 b
A0A7J6H424 50S ribosomal protein L23, chloroplastic 782 ab 767 ab 662 b 784 ab 1018 a 777 ab
A0A7J6EFK0 40S ribosomal protein S17 607 bc 793 b 545 c 726 bc 1017 a 589 bc
A0A7J6G5C7 60S ribosomal protein L23 651 b 603 bc 371 c 805 ab 1065 a 647 b
A0A7J6GWW0 40S ribosomal protein S26 880 bc 1124 ab 722 c 1100 ab 1460 a 826 bc
A0A7J6E4Y6 60S acidic ribosomal protein P3 540 a 570 a 588 a 759 a 1068 a 627 a
A0A7J6GUI2 Chlorophyll a-b binding protein, chloroplastic 434 bc 2459 a 54 c 864 b 2696 a 155 bc
A0A7J6GSM8 Chlorophyll a-b binding protein, chloroplastic 1449 c 6038 a 203 d 2560 b 5606 a 522 d
A0A7J6I6X2 MD-2-related lipid-recognition domain-containing protein 1140 a 1477 a 1082 a 1430 a 1696 a 1194 a
A0A7J6I828 Actin 973 a 1218 a 928 a 1019 a 1255 a 806 a
A0A0M5M1Z3 ATP synthase subunit alpha 549 bc 1081 a 377 d 629 b 1035 a 405 cd
A0A7J6FR97 Uncharacterized protein 1215 a 1071 a 1257 a 1047 a 1123 a 1174 a
A0A7J6FP03 Uncharacterized protein 12,415 ab 14,264 a 9720 b 13,312 ab 16,039 a 10,073 b
A0A7J6I334 Uncharacterized protein 1235 bc 1552 a 852 d 1149 cd 1479 ab 948 cd
A0A7J6GUW7 Uncharacterized protein 786 ab 856 ab 643 b 777 ab 1061 a 692 b
A0A7J6HWC7 Uncharacterized protein 1506 cd 2038 ab 1028 d 1570 bc 2260 a 1045 d

Different lowercase letters in the rows indicate a statistically significant difference at the p < 0.05 (Tukey HSD test) among the defatted hemp seed products of two cultivars (for each protein separately).

Figure 2.

Figure 2

Pie charts of protein class sum proportions (in %) derived from relative abundance values in evaluated hempseed materials (WS—defatted flour from whole seeds; H—defatted flour from milled hulls; DS—defatted flour from dehulled hemp seeds).

The presented results (Table 3) confirmed the assumption that the hemp seed’s main class of proteins is SP, especially the 11S globulin edestin. The relative abundance of 11S globulins (summed over the three MP88 items) in whole seeds was 58.6 and 63.2% for the Santhica 27 and Uso-31 cultivars, respectively, when expressed as percentages, approximately corresponding to the literature-reported range of 60–80% [2,14]. Liu et al. [29] reported a range of relative edestin proportions of 57–76% when studying nine protein isolates prepared from seeds of different Cannabis sativa cultivars/genotypes, but this is the relative representation of edestin among the storage proteins (11S, 7S, and 2S), not the representation of edestin in the overall protein profile of hemp seeds. All three 11S protein items (A0A7J6GWL5, A0A7J6DTA7, A0A7J6E205) of the MP88 set were found to have a conclusively (except in one case) higher relative abundance in hulled seeds than in hulls. Considering the accumulation of reserve compounds in the inner seed part, this finding confirms the reported role of 11S proteins and edestins as seed reserve proteins [2,14]. The finding is also interesting concerning the potential food applications of hemp seed proteins, as edestin is their most valuable component [34].

The protein items A0A7J6H2R3, A0A7J6GLH5, A0A7J6HAT3, and A0A7J6G321, which could be classified as 7S proteins or vicilin-like proteins, represent about 3% of the seed protein pool in whole seeds of both cultivars, which is significantly less than that reported by other authors: Sun et al. [4] reported 5% and Liu et al. [29] reported a range of 9–19% (but here, as mentioned above, it is the proportion of storage proteins). In contrast to the 11S proteins, the relative abundance of 7S proteins was inconclusively higher in the hulls than in the dehulled seeds in all cases (see Table 3).

The third group of storage proteins is represented by the 2S proteins (A0A7J6H292, A0A7J6DXD1), which are classified as albumins. Collectively, their representation in the whole seed was only 2.04% (cv. Uso-31) or 1.61% (cv. Santhica 27), which is lower than the values published by other authors: Sun et al. [4] reported 13%, and Liu et al. [29] reported a range of 12–24% within a set of nine cultivars (here again, the values are proportions of storage proteins). The predominance of 2S albumins in the hulls or dehulled seeds is not at all clear.

The second most represented group of SP is the class of oleosins. Because the inner part of the seed coat accumulates more oil as a natural energy reserve, one would expect a higher representation of oleosins in dehulled seeds as key proteins that are part of the fat bodies, in which fat is stored in the cells. The results showed that the total oleosins increased with dehulling, from 8.2 (WS) to 9.4% (DS) and from 7.5 (WS) to 8.9 (DS) in Uso-31 and Santhica 27, respectively. However, this is due to the presence of the oleosin A0A7J6EJ89, which is significantly higher in the dehulled seeds than in the hulls of both cultivars. The other two proteins (A0A7J6F0Y4, A0A7J6H4F6) have no clear differences in the different parts of the seed.

Either way, the oleosin protein fraction is important in food applications as a natural emulsification system [36,37,38] or as a lipid-based delivery system in the food industry or in cosmetics and pharmaceuticals [39,40].

The protein classes EM, SD, DM, TR, and CT are collectively more abundant in the hulls (proteins of some classes are even significantly more abundant) than in the hulled seeds, which generally indicates that higher metabolic activity is taking place in the hull cells. This can be explained by the function of the hull tissues, i.e., the pericarp and seed coat layers, which form the natural barrier of the hemp achene and are thus forced to react more strongly to environmental stimuli, particularly abiotic stressors. Most of the proteins classified as SD in the MP88 dataset have a higher relative abundance in the hull than in the dehulled seed, except for dehydrin (A0A7J6FXL4), glutaredoxin domain-containing protein (A0A7J6HTR6), and dehydroascorbate reductase (A0A7J6GJC9). N-glycosidase rRNA (A0A7J6DVP5) is significantly more abundant in hulls (8-fold in cv. Santhica 27, 30-fold in cv. Uso-31) than in hulled seeds. This finding confirms the function of tRNA N-glycosidase as a defence protein [39]. The higher abundance in the hull predisposes the hull to be a potential source of this enzyme for biomedical applications. The RIP proteins have been found to have important biological properties, such as anticancer, antiviral, and neurotoxic activities [40,41]. Both chlorophyll-binding proteins (A0A7J6GUI2, A0A7J6GSM8) were significantly overrepresented in the hulls (compared to dehulled seeds) of both cultivars, which can be explained by the fact that chlorophyll is mainly found in the innermost layer of the seed coat, which is part of the hull [42].

Dehulling is an important technological process that removes the hard hull of the hemp seed, allowing the seeds to be consumed directly and used more widely in food products. Removing the hull eliminates or reduces the contents of antinutrients, improves the sensory properties of the dehulled seeds, and enhances their digestibility [42,43]. Previous work [27,43,44], and our presented results, have confirmed that dehulling can also be an important technological step in producing hemp seed protein concentrates and isolates, while this step removes big amounts of ballast substances (especially fibre components). At the same time, the proteomic results indicate the importance of hemp seed hulls as a source of a specific protein. The hulls often represent only the waste from hemp seed processing, while it could be a source of a large spectrum of proteins or as a raw material for a fibre flour product. On the other hand, the finding of a higher abundance of N-glycosidase rRNA also points to the possible risks of using products from the hulls for (direct) human consumption or the fact that the cultivar itself and its selection may play a major role in the occurrence of risk proteins.

3. Materials and Methods

3.1. Hemp Seed Samples and Their Preparation

The seeds of two cultivars of industrial hemp (Cannabis sativa L.) with low THC content (up to 0.2%), Uso-31 (registered in EUPVP—Common Catalogue; National Listing Netherlands; variety ID: 214015) and Santhica 27 (registered in EUPVP—Common Catalogue; National Listing France; variety ID: 213998), from the 2020 harvest were obtained from Hemp Production CZ, Ltd. Three variants of samples from both cultivars were provided for analysis: whole hemp seeds, seed hulls and dehulled hemp seeds. Hemp seeds were dehulled using dehulling and separating equipment (TFYM 1000; Liaoning Qiaopai Machineries Co., Ltd., Jinzhou, China). Before analyses, the samples were disintegrated using a Grindomix GD200 knife mill (Retsch, Haan, Germany) at 10,000 rpm for 1 min. The ground samples were then subjected to defatting using the organic solvent n-hexane. To the sample powder in the plastic tube, n-hexane was added at a ratio of 1:3 (w/v), and after mixing and extraction for 2 h at room temperature (with shaking of the sample mixture every 30 min), centrifugation (rpm 4500, 10 min, 20 °C) was performed, the solvent was carefully removed from the tube, and then the whole fat extraction process was repeated twice. After defatting, the sample pellets were left free to dry in the laboratory fume hood. Three defatted sample variants—defatted whole seeds (WS), defatted hulls (H), and defatted dehulled seeds (DS)—were obtained by the above-described treatment.

3.2. Dry Matter, Protein, and Fat Contents

Dry matter, protein, and fat contents were determined for the original (with fat) and defatted versions of the hulls, whole seeds, and dehulled seeds. The dry matter content was determined gravimetrically by drying the samples at 105 °C for 3 h in an oven to constant weight. The protein content was determined using the modified Dumas combustion method using a rapid N cube (N/Protein analysis) instrument (Elementar Analysen System, Langenselbold, Germany). Each sample was analysed in triplicate, and the protein content was calculated as the nitrogen content multiplied by a factor of 6.25. The fat content was measured using the Soxhlet extraction method using an ANKOM XT 10 Extractor (ANKOM Technology, Macedon, USA), according to the manufacturer’s manual. Petroleum ether was used as an extraction reagent. The fat content was calculated from the weight differences in the sample before and after extraction.

3.3. Sodium Dodecyl Sulphate Polyacrylamide Gel Electrophoresis

The defatted sample variants were extracted with SDS extraction buffer (0.065 M Tris-HCl, pH 6.8; 2% (w/v) SDS; 5% (v/v) 2-sulphanylethanol) in a ratio of 1:10 (w/v) at 4 °C for 4 h. Protein separation was carried out in triplicate using cooled dual vertical slab units (SE 600; Hoefer Scientific Instruments, Holliston, MA, USA) with a discontinuous gel system (4% stacking and 12% resolving gel) in reducing conditions [45]. Protein detection was performed by using Coomassie Brilliant Blue R-250.

3.4. LC-MS/MS Analysis

Proteins for LC-MS/MS analysis were extracted in SDT buffer (4% SDS, 0.1M DTT, 0.1M Tris/HCl, pH 7.6) in a thermomixer (Eppendorf ThermoMixer C, 60 min, 95 °C, 750 rpm). After that, all samples were centrifuged (15 min, 20,000× g), and the supernatants (ca. 100 μg of total protein) were used for filter-aided sample preparation (FASP), as described elsewhere [46], using 0.75 μg of trypsin (sequencing grade; Promega). Proteins were digested overnight (18 h) at 37 °C. The resulting peptides were analysed using LC-MS/MS.

LC-MS/MS analyses of all peptides were performed using an UltiMate 3000 RSLCnano system connected to an Orbitrap Exploris 480 spectrometer (Thermo Fisher Scientific, Waltham, MA, USA). Prior to LC separation, tryptic digests were concentrated and desalted online using a trapping column (Acclaim PepMap 100 C18, 300 μm ID, 5 mm long, 5 μm particles, Thermo Fisher Scientific). After washing the trapping column with 0.1% FA, the peptides were eluted in backflush mode (flow 500 nL·min−1) from the trapping column onto an analytical column (EASY-Spray column, 75 μm ID, 250 mm long, 2 μm particles, Thermo Fisher Scientific), where peptides were separated using a 90 min gradient program (flow rate 300 nL·min−1, 3–37% of mobile phase B; mobile phase A: 0.1% FA in water; mobile phase B: 0.1% FA in 80% ACN). Both columns were heated to 40 °C.

MS data were acquired in a data-dependent strategy (cycle time 2 s). The survey scan range was set to m/z 350–2000 with a resolution of 120,000 (at m/z 200), a normalised target value of 250%, and a maximum injection time of 500 ms. HCD MS/MS spectra (isolation window m/z 1.2, 30% relative fragmentation energy) were acquired from m/z 120 with a relative target value of 50% (intensity threshold 5 × 103), a resolution of 15,000 (at m/z 200), and a maximum injection time of 50 ms. Dynamic exclusion was enabled for 45 s.

3.5. Proteomic Data Processing

For data processing, we used MaxQuant software (v2.0.3.0) [47] with an inbuilt Andromeda search engine [48]. A search was performed against protein databases of Cannabis sativa (30,194 protein sequences, version from 24 February 2022, downloaded from https://www.uniprot.org/proteomes/UP000583929, accessed on 1 January 2020) and cRAP contaminants (112 sequences, version from 22 November 2018, downloaded from http://www.thegpm.org/crap). Modifications were set for the database search as follows: oxidation (M), deamidation (N, Q), and acetylation (Protein N-term) as variable modifications, with carbamidomethylation (C) as a fixed modification. Enzyme specificity was tryptic with two permissible missed cleavages. Only peptides and proteins with a false discovery rate threshold under 0.01 were considered. Relative protein abundance was assessed using protein intensities calculated using MaxQuant. The intensities of reported proteins were further evaluated using a software container environment (https://github.com/OmicsWorkflows/KNIME_docker_vnc; version 4.1.3a). The processing workflow is available upon request, and it covers, in short, reverse hits and contaminant protein group (cRAP) removal, protein group intensities’ log2 transformation, and normalisation (loessF). For the purpose of this article, protein groups reported by MaxQuant are referred to as proteins or protein items.

3.6. Statistical Analysis

The program Statistica 12 (StatSoft Power Solutions Inc., Palo Alto, CA, USA) was used for the data analysis. Data were subjected to analyses of variance using the two-way ANOVA method, and the means were compared using the Tukey HSD test. Differences between the variants were considered significant at p < 0.05 unless stated otherwise.

4. Conclusions

In total, 2833 proteins were identified in this proteomic study of the whole seeds, hulls, and dehulled seeds of two industrial hemp (Cannabis sativa L.) cultivars. Of this number, only 88 proteins accounted for 81.5–91.4% of the relative quantity of total proteins. The proteins within this set, reflecting the physiological state of the mature and stored seeds, were classified into 10 classes according to molecular and biological functions. According to the literature, we confirmed most of all three types of storage proteins—11S (edestin), 7S (vicilin-like proteins) globulins, and 2S albumins. As expected, the 11S storage globulins were found in increased abundance, mainly in the dehulled seeds; in contrast, 7S globulins were more abundant in the hulls, and 2S albumins were ambiguously represented within the two cultivars evaluated.

The relative quantification of the data revealed that the second most abundant protein class (next to storage proteins) included oleosins as key proteins of the oil-body membranes. Metabolically important classes of proteins (e.g., proteins related to energy acquisition, nucleic acid metabolism, or protein synthesis) and proteins that are part of defence mechanisms or stress responses are more abundant in the hulls. The hulls of the hemp seed can thus be an important source of valuable proteins for use in food, medical, or biotechnological applications.

Although protein samples of two cultivars were evaluated, it became very clear that the cultivar is an important factor in the relative abundance of proteins, and the selection of a suitable cultivar will be important not only in terms of hemp cultivation as a field crop and in terms of seed-to-oil processing but also in the utilisation of the protein component of the seed.

Author Contributions

Conceptualisation, J.B. (Jan Bárta); performance of the research work, analysis, and data interpretation, J.B. (Jan Bárta), P.R., M.J., Z.Z., A.S., V.B., Z.K., J.K., F.L., J.B. (Jan Bedrníček) and P.S.; writing—original draft preparation, J.B. (Jan Bárta) and V.B.; writing—review and editing, J.B. (Jan Bárta), V.B. and F.L.; supervision, J.B. (Jan Bárta) and Z.Z.; project administration and funding acquisition, J.B. (Jan Bárta), V.Ř., V.F. and Z.Z. All authors have read and agreed to the published version of the manuscript.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

Authors Zlata Krejčová and Václav Říha were employed by the company HEMP PRODUCTION CZ, Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Funding Statement

This work was supported by the Ministry of Agriculture of the Czech Republic (Project No. NAZV QK 1910302). The work was also supported by the Ministry of Education, Youth and Sports of CR from the European Regional Development Fund-Project “SINGING PLANT” (No. CZ.02.1.01/0.0/0.0/16_026/0008446). CIISB, Instruct-CZ Centre of Instruct-ERIC EU consortium, funded by MEYS CR infrastructure project LM2023042, is gratefully acknowledged for the financial support of the measurements taken at the CEITEC Proteomics Core Facility. Computational resources were provided by the e-INFRA CZ project (ID:90254), supported by the Ministry of Education, Youth and Sports of the Czech Republic.

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Callaway J.C. Hempseed as a nutritional resource: An overview. Euphytica. 2004;140:65–72. doi: 10.1007/s10681-004-4811-6. [DOI] [Google Scholar]
  • 2.Burton R.A., Andres M., Cole M., Cowley J.M., Augustin M.A. Industrial hemp seed: From the field to value-added food ingredients. J. Cannabis Res. 2022;4:45. doi: 10.1186/s42238-022-00156-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Farinon B., Molinari R., Costantini L., Merendino N. The Seed of Industrial Hemp (Cannabis sativa L.): Nutritional Quality and Potential Functionality for Human Health and Nutrition. Nutrients. 2020;12:1935. doi: 10.3390/nu12071935. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Sun X., Sun Y., Li Y., Wu Q., Wang L. Identification and Characterization of the Seed Storage Proteins and Related Genes of Cannabis sativa L. Front. Nutr. 2021;8:678421. doi: 10.3389/fnut.2021.678421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Kriese U., Schumann E., Weber W.E., Beyer M., Brühl L., Matthäus Oil content, tocopherol composition and fatty acid patterns of the seeds of 51 Cannabis sativa L. genotypes. Euphytica. 2004;137:339–351. doi: 10.1023/B:EUPH.0000040473.23941.76. [DOI] [Google Scholar]
  • 6.Lan Y., Zha F., Peckrul A., Hanson B., Johnson B., Rao J., Chen B. Genotype x Environmental Effects on Yielding Ability and Seed Chemical Composition of Industrial Hemp (Cannabis sativa L.) Varieties Grown in North Dakota, USA. J. Am. Oil. Chem. Soc. 2019;96:1417–1425. doi: 10.1002/aocs.12291. [DOI] [Google Scholar]
  • 7.Alonso-Esteban J.I., González-Fernández M.J., Fabrikov D., Torija-Isasa E., Sánchez-Mata M.d.C., Guil-Guerrero J.L. Hemp (Cannabis sativa L.) Varieties: Fatty Acid Profiles and Upgrading of γ-Linolenic Acid–Containing Hemp Seed Oils. Eur. J. Lipid Sci. Technol. 2020;122:1900445. doi: 10.1002/ejlt.201900445. [DOI] [Google Scholar]
  • 8.Xu Y., Zhao J., Hu R., Wang W., Griffin J., Li Y., Sun X.S., Wang D. Effect of genotype on the physicochemical, nutritional, and antioxidant properties of hempseed. J. Agric. Food Res. 2021;3:100119. doi: 10.1016/j.jafr.2021.100119. [DOI] [Google Scholar]
  • 9.Golimowski W., Teleszko M., Marcinkowski D., Kmiecik D., Grygier A., Kwaśnica A. Quality of Oil Pressed from Hemp Seed Varieties: ‘Ealina 8FC’, ‘Secueni Jubileu’ and ‘Finola’. Molecules. 2022;27:3171. doi: 10.3390/molecules27103171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Mamone G., Picariello G., Ramondo A., Nicolai M.A., Ferranti P. Production, digestibility and allergenicity of hemp (Cannabis sativa L.) protein isolates. Food Res. Int. 2019;115:562–571. doi: 10.1016/j.foodres.2018.09.017. [DOI] [PubMed] [Google Scholar]
  • 11.Miernyk J.A., Hajduch M. Seed proteomics. J. Proteom. 2011;74:389–400. doi: 10.1016/j.jprot.2010.12.004. [DOI] [PubMed] [Google Scholar]
  • 12.Aluko R.E. Hemp Seed (Cannabis sativa L.) Proteins: Composition, Structure, Enzymatic Modification, and Functional or Bioactive Properties. In: Nadathur S.R., Wanasundara J.P.D., Scanlin L., editors. Sustainable Protein Sources. Academic Press; San Diego, CA, USA: 2017. pp. 121–132. [Google Scholar]
  • 13.Tang C.-H., Ten Z., Wang X.-S., Yang X.-Q. Physicochemical and Functional Properties of Hemp (Cannabis sativa L.) Protein Isolate. J. Agric. Food Chem. 2006;54:8945–8950. doi: 10.1021/jf0619176. [DOI] [PubMed] [Google Scholar]
  • 14.Wang Q., Xiong Y.L. Processing, Nutrition, and Functionality of Hempseed Protein: A Review. Compr. Rev. Food Sci. Food Saf. 2019;18:936–952. doi: 10.1111/1541-4337.12450. [DOI] [PubMed] [Google Scholar]
  • 15.Bárta J., Bártová V., Jarošová M., Švajner J. Proteins of Oilseed Cakes, Their Isolation and Usage Possibilities. Chem. Listy. 2021;115:472–480. [Google Scholar]
  • 16.Ponzoni E., Brambilla I.M., Galasso I. Genome-wide identification and organization of seed storage protein genes of Cannabis sativa. Biol. Plant. 2018;62:693–702. doi: 10.1007/s10535-018-0810-7. [DOI] [Google Scholar]
  • 17.Park S.-K., Seo J.-B., Lee M.-Y. Proteomic profiling of hempseed proteins from Cheungsam. Biochim. Biophys. Acta Proteins Proteom. 2012;1824:374–382. doi: 10.1016/j.bbapap.2011.10.005. [DOI] [PubMed] [Google Scholar]
  • 18.Aiello G., Fasoli E., Boschin G., Lammi C., Zanoni C., Citterio A., Arnoldi A. Proteomic characterization of hempseed (Cannabis sativa L.) J. Proteom. 2016;147:187–196. doi: 10.1016/j.jprot.2016.05.033. [DOI] [PubMed] [Google Scholar]
  • 19.Leonard W., Zhang P., Ying D., Nie S., Tindal E., Fang Z. Transformation of hempseed (Cannabis sativa L.) oil cake proteome, structure and functionality after extrusion. Food Chem. 2022;384:132499. doi: 10.1016/j.foodchem.2022.132499. [DOI] [PubMed] [Google Scholar]
  • 20.Stoyke M., Becker R., Brockmeyer J., Jira W., Popping B., Uhlig S., Wittke S. German Government Official Methods Board Points the Way Forward: Launch of a New Working Group for Mass Spectrometry for Protein Analysis to Detect Food Fraud and Food Allergens. J. AOAC Int. 2019;102:1280–1285. doi: 10.5740/jaoacint.19-0056. [DOI] [PubMed] [Google Scholar]
  • 21.Kotecka-Majchrzak K., Sumara A., Fornal E., Montowska M. Identification of species-specific peptide markers in cold-pressed oils. Sci. Rep. 2020;10:19971. doi: 10.1038/s41598-020-76944-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kotecka-Majchrzak K., Sumara A., Fornal E., Montowska M. Proteomic analysis of oilseed cake: A comparative study of species-specific proteins and peptides extracted from ten seed species. J. Sci. Food Agric. 2021;101:297–306. doi: 10.1002/jsfa.10643. [DOI] [PubMed] [Google Scholar]
  • 23.Alonso-Esteban J.I., Pinela J., Ćirić A., Calhelha R.C., Soković M., Ferreira I.C.F.R., Barros L., Torija-Isasa E., Sánchez-Mata M.d.C. Chemical composition and biological activities of whole and dehulled hemp (Cannabis sativa L.) seeds. Food Chem. 2022;374:131754. doi: 10.1016/j.foodchem.2021.131754. [DOI] [PubMed] [Google Scholar]
  • 24.Smolikova G., Gorbach D., Lukasheva E., Mavropolo-Stolyarenko G., Bilova T., Soboleva E., Tsarev A., Romanovskaya E., Podolskaya E., Zhukov V., et al. Bringing New Methods to the Seed Proteomics Platform: Challenges and Perspectives. Int. J. Mol. Sci. 2020;21:9162. doi: 10.3390/ijms21239162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.House J.D., Neufeld J., Leson G. Evaluating the Quality of Protein from Hemp Seed (Cannabis sativa L.) Products Through the use of the Protein Digestibility-Corrected Amino Acid Score Method. J. Agric. Food Chem. 2010;58:11801–11807. doi: 10.1021/jf102636b. [DOI] [PubMed] [Google Scholar]
  • 26.Boye J.I., Barbana C. Food and Industrial Bioproducts and Bioprocessing. Wiley; Hoboken, NJ, USA: 2012. Protein Processing in Food and Bioproduct Manufacturing and Techniques for Analysis; pp. 85–113. [Google Scholar]
  • 27.Shen P., Gao Z., Xu M., Rao J., Chen B. Physicochemical and structural properties of proteins extracted from dehulled industrial hempseeds: Role of defatting process and precipitation pH. Food Hydrocoll. 2020;108:106065. doi: 10.1016/j.foodhyd.2020.106065. [DOI] [Google Scholar]
  • 28.Chen H., Xu B., Wang Y., Li W., He D., Zhang Y., Zhang X., Xing X. Emerging natural hemp seed proteins and their functions for nutraceutical applications. Food Sci. Hum. Wellness. 2023;12:929–941. doi: 10.1016/j.fshw.2022.10.016. [DOI] [Google Scholar]
  • 29.Liu M., Toth J.A., Childs M., Smart L.B., Abbaspourrad A. Composition and functional properties of hemp seed protein isolates from various hemp cultivars. J. Food Sci. 2023;88:942–951. doi: 10.1111/1750-3841.16467. [DOI] [PubMed] [Google Scholar]
  • 30.Liu M., Childs M., Loos M., Taylor A., Smart L.B., Abbaspourrad A. The effects of germination on the composition and functional properties of hemp seed protein isolate. Food Hydrocoll. 2023;134:108085. doi: 10.1016/j.foodhyd.2022.108085. [DOI] [PubMed] [Google Scholar]
  • 31.Potin F., Lubbers S., Husson F., Saurel R. Hemp (Cannabis sativa L.) Protein Extraction Conditions Affect Extraction Yield and Protein Quality. J. Food Sci. 2019;84:3682–3690. doi: 10.1111/1750-3841.14850. [DOI] [PubMed] [Google Scholar]
  • 32.Fang B., Gu Z., Ohm J.-B., Chen B., Rao J. Reverse micelles extraction of hemp protein isolate: Impact of defatting process on protein structure, functionality, and aromatic profile. Food Hydrocoll. 2023;135:108158. doi: 10.1016/j.foodhyd.2022.108158. [DOI] [Google Scholar]
  • 33.Angelovici R., Galili G., Fernie A.R., Fait A. Seed desiccation: A bridge between maturation and germination. Trends Plant Sci. 2010;15:211–218. doi: 10.1016/j.tplants.2010.01.003. [DOI] [PubMed] [Google Scholar]
  • 34.Docimo T., Caruso I., Ponzoni E., Mattana M., Galasso I. Molecular characterization of edestin gene family in Cannabis sativa L. Plant Physiol. Biochem. 2014;84:142–148. doi: 10.1016/j.plaphy.2014.09.011. [DOI] [PubMed] [Google Scholar]
  • 35.Hajduch M., Matusova R., Houston N.L., Thelen J.J. Comparative proteomics of seed maturation in oilseeds reveals differences in intermediary metabolism. Proteomics. 2011;11:1619–1629. doi: 10.1002/pmic.201000644. [DOI] [PubMed] [Google Scholar]
  • 36.Nikiforidis C.V. Structure and functions of oleosomes (oil bodies) Adv. Colloid. Interface Sci. 2019;274:102039. doi: 10.1016/j.cis.2019.102039. [DOI] [PubMed] [Google Scholar]
  • 37.Garcia F.L., Ma S., Dave A., Acevedo-Fani A. Structural and Physicochemical Characteristics of Oil Bodies from Hemp Seeds (Cannabis sativa L.) Foods. 2021;10:2930. doi: 10.3390/foods10122930. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Lopez C., Novales B., Rabesona H., Weber M., Chardot T., Anton M. Deciphering the properties of hemp seed oil bodies for food applications: Lipid composition, microstructure, surface properties and physical stability. Food Res. Int. 2021;150:110759. doi: 10.1016/j.foodres.2021.110759. [DOI] [PubMed] [Google Scholar]
  • 39.Puri M., Kaur I., Perugini M.A., Gupta R.C. Ribosome-inactivating proteins: Current status and biomedical applications. Drug Discov. Today. 2012;17:774–783. doi: 10.1016/j.drudis.2012.03.007. [DOI] [PubMed] [Google Scholar]
  • 40.Bolognesi A., Bortolotti M., Maiello S., Battelli M.G., Polito L. Ribosome-Inactivating Proteins from Plants: A Historical Overview. Molecules. 2016;21:1627. doi: 10.3390/molecules21121627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Citores L., Iglesias R., Ferreras J.M. Antiviral Activity of Ribosome-Inactivating Proteins. Toxins. 2021;13:80. doi: 10.3390/toxins13020080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kim E.-S., Kwon T.-H., Park S.-H. Structural Characteristics of Shells in a Fibrous Cultivar of Cannabis sativa L. J. Nat. Fibers. 2023;20:2216951. doi: 10.1080/15440478.2023.2216951. [DOI] [Google Scholar]
  • 43.Shen P., Gao Z., Xu M., Ohm J.-B., Rao J., Chen B. The impact of hempseed dehulling on chemical composition, structure properties and aromatic profile of hemp protein isolate. Food Hydrocoll. 2020;106:105889. doi: 10.1016/j.foodhyd.2020.105889. [DOI] [Google Scholar]
  • 44.Sharma S., Tamilselvan T., Shakeb M., Prabhasankar P. Hydrothermal treatment of hemp seeds (Cannabis sativa L.): Impact on its dehulling yield, fatty acid profile and nutritional characteristics. J. Sci. Food Agric. 2023;103:2681–2689. doi: 10.1002/jsfa.12328. [DOI] [PubMed] [Google Scholar]
  • 45.Laemmli U.K. Cleavage of Structural Proteins during the Assembly of the Head of Bacteriophage T4. Nature. 1970;227:680–685. doi: 10.1038/227680a0. [DOI] [PubMed] [Google Scholar]
  • 46.Wiśniewski J.R., Zougman A., Nagaraj N., Mann M. Universal sample preparation method for proteome analysis. Nat. Methods. 2009;6:359–362. doi: 10.1038/nmeth.1322. [DOI] [PubMed] [Google Scholar]
  • 47.Cox J., Mann M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 2008;26:1367–1372. doi: 10.1038/nbt.1511. [DOI] [PubMed] [Google Scholar]
  • 48.Cox J., Neuhauser N., Michalski A., Scheltema R.A., Olsen J.V., Mann M. Andromeda: A Peptide Search Engine Integrated into the MaxQuant Environment. J. Proteome Res. 2011;10:1794–1805. doi: 10.1021/pr101065j. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.


Articles from Plants are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES