Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2022 Mar 22;12:4873. doi: 10.1038/s41598-022-08833-6

Unraveling structural and conformational dynamics of DGAT1 missense nsSNPs in dairy cattle

Rajesh Kumar Pathak 1, Byeonghwi Lim 1, Yejee Park 1, Jun-Mo Kim 1,
PMCID: PMC8940929  PMID: 35318385

Abstract

Cattle are domestic animals that have been nourishing humans for thousands of years. Milk from cattle represents a key source of high-quality protein, fat, and other nutrients. The nutritional value of milk and dairy products is closely associated with the fat content, providing up to 30% of the total fat consumed in the human diet. The fat content in cattle milk represents a major concern for the scientific community due to its association with human health. The relationship between milk fat content and diacylglycerol o-acyltransferase 1 gene (DGAT1) is well described in literature. Several studies demonstrated the difference in fat contents and other milk production traits in a wide range of cattle breeds, to be associated with missense non-synonymous single nucleotide polymorphisms (nsSNPs) of the DGAT1 gene. As a result, an nsSNPs analysis is crucial for unraveling the DGAT1 structural and conformational dynamics linked to milk fat content. DGAT1-nsSNPs are yet to be studied in terms of their structural and functional impact. Therefore, state-of-the-art computational and structural genomic methods were used to analyze five selected variants (W128R, W214R, C215G, P245R, and W459G), along with the wild type DGAT1. Significant structural and conformational changes in the variants were observed. We illustrate how single amino acid substitutions affect DGAT1 function, how this contributes to our understanding of the molecular basis of variations in DGAT1, and ultimately its impact in improving fat quality in milk.

Subject terms: Computational biology and bioinformatics, Systems biology

Introduction

Cattle milk is the most-consumed milk worldwide and is recognized as a nutritionally superior food due to its capacity to satisfy nutritional and physiological requirements1. It has a high calcium, magnesium, and potassium content, minerals thought to be under-consumed in adult diets2. Additionally, it contains several water-soluble vitamins and is a good source of thiamine (B1), riboflavin (B2), and cobalamin (B12). Vitamins A and D can be found in high concentrations in milk, depending on the fat level2. Milk fats, which are primarily made up of triglycerides (one molecule of glycerin and three fatty acids), are one of the most important indicators of milk quality. It is vital in conferring microbial resistance and hence preserving animal homeostasis3. The nutritional value of milk and dairy products is closely associated with the fat content of milk, making up approximately 30% of the total fat consumed in the human diet4. Several studies have shown the potential impact of dietary fatty acids on human nutrition and recommended limits for dietary fat consumption57. Milk fat is made up of a complex mixture of lipids, which is composed of approximately 70% saturated fatty acids and 30% unsaturated fatty acids. While some fatty acids such as phospholipids are crucial for human health and nutrition, they are also essential constituents of cell membranes8. However, the high content of saturated fatty acids in milk is generally undesirable because it is associated with a variety of illnesses; including obesity, elevated blood cholesterol, and cardiovascular-related problems7,9.

The search for candidate genes that improve milk quality, particularly its fat content, has become increasingly important in recent years (Fig. 1). Most key traits of farm animals have economic importance and are associated with polygenes positioned at quantitative trait loci (QTLs), dispersed throughout the genome. Initial research identified the approximate positions of milk-related QTLs, including one controlling milk fat content at BTA14's proximal 3-cm marker interval3,1012. The diacylglycerol o-acyltransferase 1 gene (DGAT1) has been investigated as a key candidate and functional gene for fat trait and milk production11,13. It is found in the centromeric region of Bos taurus autosome 14 and encodes acyl-coenzyme A:diacylglycerol acyltransferase, a protein implicated in fat metabolism in cattle breeds11,13,14. Genome-wide association studies (GWAS) identified several single nucleotide polymorphisms (SNPs) and their association with milk fat trait(s) of interest by using linkage disequilibrium that exists among causative variants, and one or more genetic marker15. All the missense variants or non-synonymous SNPs (nsSNPs) are not structurally or functionally affecting; however, numerous deleterious variants can affect the physiological system of an organism. Furthermore, experimental studies have suggested that one-third of non-synonymous variants are deleterious16. Several reports spanning a wide range of cattle breeds investigated the impact of the K232A variant on milk fat content and composition, as well as other milk production traits17. Nevertheless, a computational investigation of missense variants of the DGAT1 and their crucial role in milk fat is currently lacking from the literature. As a result, we set out to undertake a complete in silico examination of the impact of DGAT1 variants linked to milk production in dairy cattle. We started with a list of 73 missense variants collected from the Ensembl using the BioMart data mining tool18,19. Finally, five variants of the DGAT1 protein, W128R, W214R, C215G, P245R, and W459G, were chosen and analyzed in detail utilizing all-atom molecular dynamics (MD) simulations for 100 ns to examine their temporal evolution impact. Our study provides novel insights into our understanding of the structural and conformational dynamics of wild type as well as DGAT1 variants and their role in improving milk quality in the future.

Figure 1.

Figure 1

Schematic pathway of triacylglycerol biosynthesis. It illustrates the role of DGAT1 in triacylglycerol synthesis that is responsible for fat content in the milk.

Results

Screening of significant nsSNPs

A total of 73 nsSNPs predicted by SIFT were first analyzed to identify the ones that might have a significant influence (Supplementary Table S1). Out of the 73 nsSNPs, 5 were considered deleterious based on PROVEAN prediction. An nsSNP with a PROVEAN score of < − 2.5 was regarded as deleterious, while nsSNPs with a score > − 2.5 were considered neutral. PROVEAN predicted that 64 of the 73 nsSNPs would be deleterious while the remaining 9 would be neutral. Of the 64 PROVEAN predicted nsSNPs, 5 nsSNPs (W128R, W214R, C215G, P245R, and W459G) were found significant based on their lowest score (Table 1). The PROVEAN score for 5 shorted nsSNPs ranged between − 8.543 and − 11.385.The high deleterious nature of these selected nsSNPs was also confirmed by different other sequence-based tools including PredictSNP, MAPP, PhD-SNP, PolyPhen-1, PolyPhen-2, and SNAP (Supplementary Table S2).

Table 1.

A list of five selected missense nsSNPs predicted as deleterious by different sequence based tools. SIFT, PROVEAN, PredictSNP, MAPP, PhD-SNP, PolyPhen-1, PolyPhen-2, and SNAP were employed.

S.N. SNP ID Location Variant alleles Substitution Overall prediction
1 rs466331549 14:610211 T/C W128R Deleterious
2 rs451872523 14:610865 T/C W214R Deleterious
3 rs440590330 14:610868 T/A/G C215G Deleterious
4 rs456383648 14:611059 C/G P245R Deleterious
5 rs474703848 14:612411 T/G W459G Deleterious

Structural modeling, analysis and visualization of DGAT1 and its screened nsSNPs

SwissModel was used to generate the 3D structure of DGAT1 protein. UCSC Chimera was used to visualize the predicted model (Fig. 2A). The SPDB Viewer and the steepest descent approach were used to remove the bad contract between protein atoms and stabilize the stereochemical properties of the DGAT1 model through energy minimization. Furthermore, the DGAT1 model stability was verified using the SAVES server, while the model's reliability of the backbone of torsion angles was assessed using PROCHECK. The latter computes the amino acid residues that fall within the Ramachandran plot's existing zones. For the DGAT1 model, the Ramachandran plot analysis revealed that 86.9% of residues fell in the most favored regions, 12.3% in additional allowed regions, 0.8% in generously allowed regions, and none fell in disallowed regions. ProSA also confirmed the DGAT1 model's reliability. The model's quality was further assessed by superimposing the predicted structure over the experimentally obtained structure and calculating the atoms RMSD using UCSF Chimera. This revealed that the predicted model is good and quite similar to the template (Fig. 2B). The model's energy profile and Z-score value were calculated using a distance-based pair potential, which determines the interaction energy per residue. The modeled DGAT1 had a Z-score of − 5.63, and the template had a Z-score of − 5.83; thus, indicating the good quality of the model (where the energy of ProSA in negative reflects the model's reliability) (Fig. 2C,D). Further, mutagenesis wizard from PyMOL was used for generating the 3D model of the W128R, W214R, C215G, P245R, and W459G variants of DGAT1 protein structure. The mutagenesis wizard can be used to make amino acid residues mutation/changes in the 3D structure of a protein at any position.

Figure 2.

Figure 2

(A) 3D model of DGAT1. (B) Superimposed structure of DGAT1 with its template structure 6VYI_A. (C) Overall quality of DGAT1 model depicting a z-score of –5.63 (Native conformation to its template). (D) Overall quality of template (6VYI_A) depicting a z-score of -5.83.

Evaluating the structural effects of deleterious nsSNPs of DGAT1

Structure-based tools were used to screen modeled structure of DGAT1 variants W128R, W214R, C215G, P245R, and W459G for their structural impact. DynaMut, CUPSAT, and I-mutant were used for this purpose. DynaMut uses a consensus-based technique centered on protein motion and graph-based signatures to estimate the effects of missense variations on protein stability. CUPSAT assesses the amino acid environment of the mutation site using amino acid atom potentials and torsion angles to predict protein stability changes due to point mutations. Lastly, I-mutant uses SVM-based predictors to estimate how a protein's stability changes as a result of variations. Based on the change in Gibbs free energy value, it can be determined whether a mutation would mostly destabilize, largely stabilize, or have a weak influence on the protein. Based on the comparative analysis of these tools, it was predicted that the selected nsSNPs are destabilizing the structure of DGAT1 protein, which might ultimately affect milk production and fat content. The results are shown in Table 2.

Table 2.

List of selected significant deleterious nsSNPs and their effects on DGAT1 structure predicted by structure-based tools.

SNP ID Variant DynaMut CUPSAT I-mutant
rs466331549 W128R Destabilizing Stabilizing Decrease
rs451872523 W214R Destabilizing Destabilizing Decrease
rs440590330 C215G Destabilizing Destabilizing Decrease
rs456383648 P245R Stabilizing Destabilizing Decrease
rs474703848 W459G Destabilizing Destabilizing Decrease

Further, these DGAT1 nsSNPs were visualized structurally using the HOPE server (Fig. 3). Table 3 shows the structural effects of these five deleterious variants. All variant residues, except P245R, were smaller than the wild type residues. As the mutant residue arginine is larger than the wild type residue proline, for the P245R nsSNP residue it was predicted that it would not fit in the core of the protein. Furthermore, arginine is positively charged while the wild type residue at 245 proline is neutral, which may cause problems during protein folding.

Figure 3.

Figure 3

Close-up view of wild type (green) and variant (red) amino acid residues of selected deleterious nsSNPs visualized using the HOPE server. (A) W128R, (B) W214R, (C) C215G, (D) P245R, and (E) W459G.

Table 3.

Structural impacts of DGAT1 variants due to amino acid changes.

Variant Structural change Properties of amino acid Domain

W128R

W214R

graphic file with name 41598_2022_8833_Figa_HTML.gif The mutant residue is smaller than the wild type residue. The wild type residue charge is neutral while the mutant residue charge is positive. The wild type residue is more hydrophobic than the mutant residue The mutated residue is located in a domain that is important for the main activity of the protein. Mutation of the residue might disturb protein function
C215G graphic file with name 41598_2022_8833_Figb_HTML.gif The mutant residue is smaller than the wild type residue. The wild type residue is more hydrophobic than the mutant residue The mutated residue is located in a domain that is important for the main activity of the protein. Mutation of the residue might disturb protein function
P245R graphic file with name 41598_2022_8833_Figc_HTML.gif The mutant residue is bigger than the wild type residue. The wild type residue charge is neutral, while the mutant residue charge is positive. The wild type residue is more hydrophobic than the mutant residue The mutated residue is located in a domain that is important for the main activity of the protein. Mutation of the residue might disturb protein function
W459G graphic file with name 41598_2022_8833_Figd_HTML.gif The mutant residue is smaller than the wild type residue. The wild type residue is more hydrophobic than the mutant residue The mutated residue is located in a domain that is important for the main activity of the protein. Mutation of the residue might disturb protein function

Conservation analysis and identification of functional regions in DGAT1

The level of residue conservation offers an approximate of the structural and functional impact of any protein due to variations. In nature, a deleterious variant of a highly conserved residue will almost certainly be damaging. The 3D structure of DGAT1 was used as an input query to determine the conservation level based on the score predicted by the ConSurf server (Fig. 4). The ConSurf results demonstrated that two of the selected variants, W214R and W459G, were highly conserved obtaining a conservation score of 9. The conservation scores of other selected variants, W128R, C215G, and P245R were 7, 6, and 6, respectively. However, the distribution of conserved residues was random. The position from 252 to 479 was predicted to be an area that contain numerous conserved residues. Furthermore, average variability and conservation residues (score of 5) were also found among this region. This analysis provides a better understanding of the functional regions in DGAT1 and their impact on milk production. Since the selected variants are found in conserved regions they might influence the fat content of milk.

Figure 4.

Figure 4

DGAT1 conservation analysis performed using ConSurf and depicting variable as well as conserved residues (score from 1 to 9 based on color).

MD simulation analysis for decoding the complex nature of DGAT1 wild type and its variants

The conformational differences between the wild type and variant proteins were decoded using MD simulations. An MD simulation of 100 ns was performed using Gromacs after generating six systems (DGAT1-wild-type, DGAT1-W128R, DGAT1-W214R, DGAT1-C215G, DGAT1-P245R, and DGAT1-W459G). For trajectory analysis, Gromacs utilities were employed (Fig. 5).

Figure 5.

Figure 5

Conformational dynamic analysis of DGAT1-wild type and its variants using MD simulation. (A) RMSD, (B) RMSF, (C) Rg, and (D) SASA.

Analysis of conformational stability

The wild type and variant DGAT1 proteins' RMSDs were calculated to predict their respective stabilities. The RMSD discrepancies created during MD simulation can be used to measure the protein's stability in relation to its structure20,21. The RMSD deviations observed during the MD simulation can be used to measure the protein's stability in relation to its structure. Smaller RMSD deviations indicate a more stable protein structure. The backbone RMSD was plotted against time to measure structural conformational variations (Fig. 5A).The RMSD of the DGAT1-wild type averaged 0.68 nm. Whereas the RMSD values of the variants DGAT1-W128R, DGAT1-W214R, DGAT1-C215G, DGAT1-P245R, and DGAT1-W459G were 0.62, 0.67, 0.55, 0.61, and 0.55 nm, respectively.

Analysis of structural flexibility

We computed structural flexibility for all variants, including the wild type structure, after assessing the RMSD. In nature, protein structures are dynamic, requiring structural flexibility to maintain their properties22. Figure 5B shows a plot of RMSF values against all residues. The average predicted RMSF value of DGAT1-wild type was 0.20 nm and for variants DGAT1-W128R, DGAT1-W214R, DGAT1-C215G, DGAT1-P245R, and DGAT1-W459G were 0.21, 0.22, 0.20, 0.18, and 0.20 nm respectively.

Analysis of structural compactness

The Rg values of all six systems were used to examine the structural compactness of the DGAT1 structure before and after generating variations. The mechanism of structural compactness, stability, and folding of a protein structure can be derived using the time evolution of Rg values23. We calculated and plotted the Rg values of DGAT1-wild type, DGAT1-W128R, DGAT1-W214R, DGAT1-C215G, DGAT1-P245R, and DGAT1-W459G systems from the MD trajectories. The average Rg were 2.36, 2.36, 2.35, 2.37, 2.36, and 2.37 nm, respectively (Fig. 5C).

Analysis of solvent accessible surface area

The average SASA values for DGAT1-wild type, DGAT1-W128R, DGAT1-W214R, DGAT1-C215G, DGAT1-P245R, and DGAT1-W459G were 240, 240, 236, 237, 240, and 241 nm2, respectively and were plotted over 100 ns of MD simulation (Fig. 5D). The variation in the SASA values of W214R and C215G signified the folding nature of DGAT1 structure upon change in amino acid residue. The other variant showed similar conformational stability to the wild type.

Analysis of intramolecular hydrogen bond

In this study, the number of intramolecular hydrogen bonds was used to predict the effect of variants on the DGAT1 protein structure. Figure 6 shows the number of hydrogen bonds formed by the wild type and variant proteins as a function of time. The average number of intramolecular hydrogen bonds for DGAT1-wild type, DGAT1-W128R, DGAT1-W214R, DGAT1-C215G, DGAT1-P245R, and DGAT1-W459G were 287, 297, 294, 294, 299, and 301, respectively.

Figure 6.

Figure 6

Intramolecular hydrogen bonds calculation illustrated as a time-dependent change. DGAT-Wild (black), DGAT1-W128R (red), DGAT1-W214R (green), DGAT1-C215G (blue), DGAT1-P245R (yellow), and DGAT1-W459G (brown).

Principal component analyses

Principal component analysis (PCA) was employed to visualize the protein global motions by using a small number of principal motions, which are defined by eigenvectors and eigenvalues. We examined whether the first few principal motions or eigenvectors playing a key role in the protein motions. Therefore, first 50 eigenvectors against eigenvalue of the DGAT1-wild type and its variant proteins were plotted in Fig. 7A. The percentage of motion for wild type and variants was computed. The first 5 principal components accounted for 75.84% of the DGAT1-wild type protein motion, and 78.34%, 84.93%, 77.31%, 79.39%, and 78.26% of DGAT1-W128R, DGAT1-W214R, DGAT1-C215G, DGAT1-P245R, and DGAT1-W459G protein motion, respectively. Additionally, a 2D plot for assessing the wild type and variant proteins’ dynamics was produced. We imagine a cluster of stable states in wild type and variants based on PC1 and PC2 projection (Fig. 7B).

Figure 7.

Figure 7

Principal component analysis using molecular dynamics simulation data. (A) First 50 eigenvectors against eigenvalue of the DGAT1-wild type and its variants. (B) First two eigenvectors depicting the projection of protein motion in phase space for DGAT1-wild type and its variants.

Analysis of PPIs network for visualizing interacting partners of DGAT1

The PPIs network construction and analysis based on highest confidence score using the STRING database revealed that the DGAT1 network has 35 nodes, 122 edges, and an average predicted node degree of 6.97 (Fig. 8). The predicted interacting partners of DGAT1 with their respective scores are provided in Supplementary Table S3. These proteins work along with DGAT1 to perform common tasks. The analysis predicted that the network is mostly involved in different biological processes, which includes retinal metabolic process (GO:0042574), Monoacylglycerol biosynthetic process (GO:0006640), Retinoic acid biosynthetic process (GO:0002138),Triglyceride biosynthetic process (GO:0019432), Diacylglycerol biosynthetic process (GO:0006651), Sphingosine metabolic process (GO:0006670), Retinol metabolic process (GO:0042572), Acylglycerol biosynthetic process (GO:0046463), Retinoid metabolic process (GO:0001523), Terpenoid biosynthetic process (GO:0016114), Phospholipid dephosphorylation (GO:0046839), Primary alcohol metabolic process (GO:0034308), Triglyceride metabolic process (GO:0006641), Cellular hormone metabolic process (GO:0034754), Olefinic compound metabolic process (GO:0120254), Cellular aldehyde metabolic process (GO:0006081), Polyol metabolic process (GO:0019751), Alcohol metabolic process (GO:0006066), Lipid modification (GO:0030258), Cellular lipid metabolic process (GO:0044255), Lipid biosynthetic process (GO:0008610), Lipid metabolic process (GO:0006629), Small molecule metabolic process (GO:0044281), Regulation of biological quality (GO:0065008), and Metabolic process (GO:0008152).

Figure 8.

Figure 8

Protein–protein interactions (PPIs) network generated by STRING depicting the interacting partners of DGAT1; Red color node represent query protein, filled nodes have known 3D structure and empty nodes represent proteins of unknown 3D structure. Besides, pink color edges represent experimentally determined interaction, green-neighborhood, red-gene fusion, blue-gene co-occurrence, sky blue-database curated, dark-coexpression, kelly-text mining and purple colour edges represent protein homology based interactions.

Discussion

Milk from cattle is an essential part of the human diet. Its impact on health is influenced by a variety of elements, including its fat composition. Fatty acids in milk come from two sources; they transfer from the blood via milk-producing cells and synthesize de novo in milk-producing cells24,25. Successful lactation is controlled and regulated by the metabolism of adipose tissues. The metabolism in adipose tissues of dairy cattle has been well-documented in the scientific literature26,27. Previous studies revealed alterations in enzymes that controlled the metabolism of lipid in adipose tissues during lactation28,29. These investigations were followed up by other researchers and found that the adaptation of metabolic activities is managed by many endocrine and neurocrine systems, which include growth hormone, insulin, and the sympathetic nervous system27,30. When compared to ordinary or lower genetic merit cattle, these studies found and consistently demonstrated that dairy cattle’s with high genetic merit for milk production had higher rates of subcutaneous adipose tissue lipolysis and lower rates of lipogenesis27,31,32. Therefore, adipose tissue is important for the success and efficiency of lactation, and recent research has produced various data sets on genetic differences and changes in adipose tissue gene transcription in dairy cattle27.

DGAT1 is regarded as one of the essential components in the metabolism of glycerol lipids and is involved in the final phase of triglyceride synthesis in adipocytes. Previous studies confirmed its role in maintaining the fat content and composition of fatty acids in milk3. Fatty acid concentration is known to be regulated by a variety of physiological and environmental variables. Many studies have shown that concentrations change during lactation25. A previous study demonstrated that the fatty acid profile is significantly affected during the lactation stage, it was found that only two were unaffected out of 40 analyzed33. Besides, another research investigated that lactation phase affected all fatty acid groups, but in different ways due to different SNPs25. The K231A, DGAT1 variant is responsible for a change in fat and protein percentage during lactation and is associated with the highest reduction of fat percentage34. Therefore, DGAT1 is documented as one of the key candidate genes for maintaining fat quality in milk. In recent years, this gene attracted researchers due to its importance in producing healthy milk, and its nsSNPs are liable for milk quality issues. In the present study, we tried to predict the conformational and structural dynamics of DGAT1 and its missense nsSNPs through structural genomics approaches for its possible utilization in research associated with milk production.

The current work was primarily interested in nsSNPs and their regulatory role in milk production. Therefore, only nsSNPs were investigated, and their structural as well as functional effects on the DGAT1 protein were predicted. The deleterious nature of selected nsSNPs was analysed by various sequence and structure-based tools. Based on the obtained score, 5 selected significant nsSNPs i.e. W128R, W214R, C215G, P245R, and W459G along with wild-type structure were further evaluated through structural modeling and molecular dynamics simulation, followed by network analysis. Based on RMSD results, the variants seemed to be more stable than the DGAT1-wild type conformation. Therefore, it was concluded that the increased stability could have a deleterious impact on overall function of DGAT1. More research is needed to clarify why enhanced stability can be associated with a deleterious effect.

During structural flexibility analysis through RMSF, minimal fluctuations were observed in P245R, whereas other systems showed similar type of behaviour, besides high pick was observed for the W128R and P245R between residues 225–250 due to variations in amino acid residue compared to the wild type structure. During Rg analysis, the DGAT1-wild type showed an Rg peak of 2.47 after 90 ns, which was achieved 40 ns after the peaks obtained for the other variants. Furthermore, the DGAT1-C215G and DGAT1-W459G variants average Rg values were predicted to be higher than the others, indicating less compactness and more flexibility.

To define the area accessible for solvent, SASA analysis was conducted. Higher SASA values represent a unstable conformation of the protein, while lower SASA values represent conformationally stable protein35. During analysis, it was observed that the SASA values of W214R and C215G varied, indicating the folding nature of the DGAT1 structure as amino acid residues were changed. In addition, when compared to wild type, other variants demonstrated similar conformational stability. The protein stability is sustained by hydrogen bonding. A more stable structure has more hydrogen bonds, while the least stable structure has fewer hydrogen bonds36. The loss or gain of hydrogen bonds caused by deleterious nsSNPs can influence the structure and function of the protein36. Due to the gain of hydrogen bonds, the variants' stability was enhanced. This was in line with the results obtained following the RMSD, RMSF, and Rg analyses.

In general, proteins carry out their unique tasks through a series of atomic motions. As a result, the collective atomic motion of a target protein is utilized as a parameter to determine its stability23,37. Therefore, PCA analysis was conducted; the results showed that the variants cover a wide range of phase spaces when compared to the wild type. The significant increase in total motion might transform the protein function. Overall, compared to the wild type protein, high correlated motions were observed in variants. Therefore, the conformational behavior of variants was altered due to amino acid change.

Network analysis has been promoted as one of the most effective methods for decoding the key regulatory elements in biological systems and their intricate molecular machinery38,39. Here, the purpose of network analysis was to identify an association of DGAT1 with other proteins in a holistic view based on the information available in the STRING database. The results of network analysis clearly demonstrated the interacting partners of DGAT1 and their involvement in different biological processes. Therefore, it was concluded that DGAT1 variants might affect several biological processes, since it might be working as a master regulator.

Materials and methods

Dataset collection

The missense variants of the DGAT1 gene were collected from Ensembl (https://asia.ensembl.org/index.html) using the BioMart18 data mining tool19. Based on Sorting Intolerant from Tolerant (SIFT) predictions, a total of 73 nsSNPs were analyzed initially40. The amino acid sequence of DGAT1 was retrieved from UniProt (https://www.uniprot.org/) database (UniProt ID: Q8MK44) in fasta format41.

Prediction of deleterious nsSNPs through sequence-based tools

Several standard tools were used to predict the functional effect of nsSNPs on the protein. Based on reliable prediction using all employed tools, nsSNPs were classified as deleterious. SIFT (https://sift.bii.a-star.edu.sg/) and Protein Variation Effect Analyzer (PROVEAN: http://provean.jcvi.org/index.php) was used to estimate the effect of amino acid substitution on protein structure and function40,42. Furthermore, PredictSNP43 (https://loschmidt.chemi.muni.cz/predictsnp/) that access different other tools related to SNP prediction such as MAPP, PhD-SNP, PolyPhen-1, PolyPhen-2, and SNAP was used to evaluate the nature of variants4448.

Molecular modeling of DGAT1 and its variants

The DGAT1 target protein sequence was compared to the PDB database (https://www.rcsb.org/) using NCBI BLASTp49 to find a suitable template and decide the 3D modelling strategy. Further, SwissModel (https://swissmodel.expasy.org/interactive) was used to model the wild type DGAT1 3D structure, 6VYI chain A was used as a template50. SPDB viewer was used to refine the model through energy minimization51. The Ramachandran plot of the predicted model was examined using the PROCHECK52 from the Structural analysis and verification (SAVES) server (https://saves.mbi.ucla.edu/). Protein structure analysis (ProSA) was used to determine the overall quality of the model (https://prosa.services.came.sbg.ac.at/prosa.php)53. UCSF Chimera was used to visualize and superimpose the DGAT1 model with its structural homolog 6VYI_A54. The amino acid variations W128R, W214R, C215G, P245R, and W459G were structurally incorporated into the DGAT1 model using the PyMOL software's mutagenesis wizard (https://pymol.org/2/).

Prediction of effects of missense variant on protein through structure-based tools

The three structure-based methods were used to investigate the changes in protein stability induced by deleterious predicted nsSNPs. DynaMut (http://biosig.unimelb.edu.au/dynamut/), CUPSAT (http://cupsat.tu-bs.de/), and I-Mutant (http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgi) predicted the influence of nsSNPs on protein stability and dynamics5557. Further, The Have Your Protein Explained (HOPE) server (https://www3.cmbi.umcn.nl/hope/) was used to visualize and analyze the variants58.

Prediction of functional and conserved regions

The evolutionary conservation of a protein or a nucleic acid demonstrates a balance between the macromolecule's natural tendency to mutate and the overall need to maintain structural integrity and function. Here, the ConSurf (https://consurf.tau.ac.il/) server was used to predict the evolutionarily conserved amino acid residues sites of DGAT1. A 3D structure of DGAT1 was used as an input query to determine the conservation score. The conservation score ranges from 1 to 9, with 1 indicating that the residue is highly changeable and 9 indicating that it is well conserved59.

Molecular dynamics (MD) simulation

To explore the structural impact of DGAT1 protein due to the variation, all-atom MD simulations were performed using GROMACS 2018.160. All simulations were run within the High-Performance computing (HPC) facility using a GPU node. The topology of the wild type and DGAT1 variants was generated using the Amber99sb-ildn force field61. The systems were neutralised and subjected to the steepest energy minimization to achieve a maximal force below 1000 kJ/mol/nm; thus, reducing steric hindrance. The particle mesh Ewald (PME) approach was used to determine long-range electrostatic interactions62. We employed a radius cut-off of 1.0 nm for Lennard–Jones and Coulomb interactions, and the LINCS algorithm for H-bond length constrains63. Long-range electrostatics was predicted using the PME approach with 1.6-Å Fourier grid spacing, while short-range non-bonded interactions were predicted using a 10-Å cut-off distance. All bonds, including H-bonds, were fixed using shake algorithms. Systems were equilibrated following energy minimization, and then position-restraint simulations were run under NVT and NPT conditions to maintain the volume, temperature, and pressure64. The final MD simulation was carried for 100 ns. Root–mean–square deviation (RMSD), root–mean–square fluctuation (RMSF), radius of gyration (Rg), solvent-accessible surface area (SASA), hydrogen bonds, and principal component analysis (PCA) were calculated to predict the conformational and structural behaviour of DGAT1 and its variants; Gromacs gmx ‘rms’, ‘rmsf’, ‘gyrate’, ‘sasa’, ‘hbond’, and ‘covar’ were used to analyze and visualize the generated files (https://plasma-gate.weizmann.ac.il/Grace/).

Prediction of protein–protein interactions (PPIs) network

STRING (https://string-db.org/) database version: 11.5, a large collection of known and predicted protein–protein interactions, was used to analyze DGAT1 interactions. The network was generated at the highest confidence score (0.900) with no more than 50 interactors in 1st shell using Bos taurus database. This analysis is necessary and important in the context of this study because DGAT1 is a crucial protein linked with milk production and fat content. As, when any variations occur in this protein, its interacting partners may be affected, so that the functions that the network is involved is also affected65.

Conclusion

Non-synonymous SNPs are one of the most common genetic variation linked to a variety of dysfunctions in the target gene. Analysis of nsSNPs can provide insight into their functional role and associated pathways as well as offer solution for related problems. Studying nsSNPs through experimental approaches is time consuming and costly. The computational research presented here demonstrates a quick and cost-effective method for screening deleterious nsSNPs associated with structural and conformational destabilization of the protein. The impact of amino acid substitutions of highly deleterious variants shorted amongst the 73 predicted DGAT1 variants (W128R, W214R, C215G, P245R, and W459G) were determined using sequence and structure-based tools. Finally, these variants were investigated using MD simulations of 100 ns. Furthermore, the interacting partners of DGAT1 are also predicted to investigating its role in different biological processes. MD simulation studies have revealed that the deleterious effects of these variants may be attributable to structural changes in DGAT1. Based on our RMSD, RMSF, Rg, SASA, H-bonding, and PCA results, significant conformational changes were observed due to amino acid variation. Thus, the present study provides novel insight into the impact of missense nsSNPs on the structural and conformational dynamics of DGAT1. These findings could be helpful to direct future studies in improving milk quality and fat content in dairy cattle for ensuring nutritional security of the rapidly growing world populations.

Supplementary Information

Supplementary Tables. (824.4KB, pdf)

Acknowledgements

This research was supported by the Chung-Ang University Graduate Research Scholarship in 2020. This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (NRF-2022R1A2C1005830). This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2018R1A6A1A03025159).

Author contributions

J.M.K. designed the experiments and supervised the research; R.K.P. performed experiments, analyse results and wrote the manuscript. B.L. helped in SNP mining. Y.P. contributed to the revision of manuscript and provided valuable inputs. All authors read and approved the final manuscript.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-022-08833-6.

References

  • 1.Roy D, Ye A, Moughan PJ, Singh H. Composition, structure, and digestive dynamics of milk from different species—A review. Front. Nutr. 2020;7:577759. doi: 10.3389/fnut.2020.577759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Magan JB, O’Callaghan TF, Kelly AL, McCarthy NA. Compositional and functional properties of milk and dairy products derived from cows fed pasture or concentrate-based diets. Compr. Rev. Food Sci. Food Saf. 2021;20:2769–2800. doi: 10.1111/1541-4337.12751. [DOI] [PubMed] [Google Scholar]
  • 3.Li F, et al. DGAT1 K232A polymorphism is associated with milk production traits in Chinese cattle. Anim. Biotechnol. 2021;32:427–431. doi: 10.1080/10495398.2020.1711769. [DOI] [PubMed] [Google Scholar]
  • 4.Mansbridge RJ, Blake JS. Nutritional factors affecting the fatty acid composition of bovine milk. Br. J. Nutr. 1997;78(Suppl 1):S37–47. doi: 10.1079/bjn19970133. [DOI] [PubMed] [Google Scholar]
  • 5.Brouwer IA, Wanders AJ, Katan MB. Trans fatty acids and cardiovascular health: Research completed? Eur. J. Clin. Nutr. 2013;67:541–547. doi: 10.1038/ejcn.2013.43. [DOI] [PubMed] [Google Scholar]
  • 6.Connor WE. Importance of n-3 fatty acids in health and disease. Am. J. Clin. Nutr. 2000;71:171S–175S. doi: 10.1093/ajcn/71.1.171S. [DOI] [PubMed] [Google Scholar]
  • 7.Cruz VAR, et al. Genome-wide association study for milk fatty acids in holstein cattle accounting for the DGAT1 gene effect. Animals (Basel). 2019 doi: 10.3390/ani9110997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Spector AA, Yorek MA. Membrane lipid composition and cellular function. J. Lipid Res. 1985;26:1015–1035. doi: 10.1016/S0022-2275(20)34276-0. [DOI] [PubMed] [Google Scholar]
  • 9.Palombo V, et al. Genome-wide association study of milk fatty acid composition in Italian Simmental and Italian Holstein cows using single nucleotide polymorphism arrays. J. Dairy Sci. 2018;101:11004–11019. doi: 10.3168/jds.2018-14413. [DOI] [PubMed] [Google Scholar]
  • 10.Heyen DW, et al. A genome scan for QTL influencing milk production and health traits in dairy cattle. Physiol. Genom. 1999;1:165–175. doi: 10.1152/physiolgenomics.1999.1.3.165. [DOI] [PubMed] [Google Scholar]
  • 11.Grisart B, et al. Positional candidate cloning of a QTL in dairy cattle: Identification of a missense mutation in the bovine DGAT1 gene with major effect on milk yield and composition. Genome Res. 2002;12:222–231. doi: 10.1101/gr.224202. [DOI] [PubMed] [Google Scholar]
  • 12.Riquet J, et al. Fine-mapping of quantitative trait loci by identity by descent in outbred populations: Application to milk production in dairy cattle. Proc. Natl. Acad. Sci. USA. 1999;96:9252–9257. doi: 10.1073/pnas.96.16.9252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Winter A, et al. Association of a lysine-232/alanine polymorphism in a bovine gene encoding acyl-CoA:diacylglycerol acyltransferase (DGAT1) with variation at a quantitative trait locus for milk fat content. Proc. Natl. Acad. Sci. USA. 2002;99:9300–9305. doi: 10.1073/pnas.142293799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bobbo T, Tiezzi F, Penasa M, De Marchi M, Cassandro M. Short communication: Association analysis of diacylglycerol acyltransferase (DGAT1) mutation on chromosome 14 for milk yield and composition traits, somatic cell score, and coagulation properties in Holstein bulls. J. Dairy Sci. 2018;101:8087–8091. doi: 10.3168/jds.2018-14533. [DOI] [PubMed] [Google Scholar]
  • 15.Bouwman AC, Visker MH, van Arendonk JA, Bovenhuis H. Genomic regions associated with bovine milk fatty acids in both summer and winter milk samples. BMC Genet. 2012;13:93. doi: 10.1186/1471-2156-13-93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Tokuriki N, Tawfik DS. Stability effects of mutations and protein evolvability. Curr. Opin. Struct. Biol. 2009;19:596–604. doi: 10.1016/j.sbi.2009.08.003. [DOI] [PubMed] [Google Scholar]
  • 17.Scotti, E. et al. DGAT1 p. K232A polymorphism in dairy and dual purpose Italian cattle breeds. Ital. J. Anim. Sci. 9, 1. 10.4081/ijas.2010.e16 (2010).
  • 18.Kinsella RJ, et al. Ensembl BioMarts: A hub for data retrieval across taxonomic space. Database (Oxford). 2011;2011:bar030. doi: 10.1093/database/bar030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Yates AD, et al. Ensembl 2020. Nucleic Acids Res. 2020;48:D682–D688. doi: 10.1093/nar/gkz966. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Singh DB, Pathak RK. Experimental Protocols in Biotechnology. Springer; 2020. pp. 95–117. [Google Scholar]
  • 21.Rai SK, Pathak RK, Singh DB, Bhatt A, Baunthiyal M. Chemo-informatics guided study of natural inhibitors targeting rho GTPase: A lead for treatment of glaucoma. In Silico Pharmacol. 2021;9:1–11. doi: 10.1007/s40203-020-00061-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Singh DB, Pathak RK. Bioinformatics: Methods and Applications. Elsevier; 2021. [Google Scholar]
  • 23.Shukla R, Singh TR. High-throughput screening of natural compounds and inhibition of a major therapeutic target HsGSK-3 β for Alzheimer’s disease using computational approaches. J. Genet. Eng. Biotechnol. 2021;19:1–17. doi: 10.1186/s43141-021-00163-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Kesek M, Szulc T, Zielak-Steciwko A. Genetic, physiological and nutritive factors affecting the fatty acid profile in cows' milk—A review. Anim. Sci. Pap. Rep. 2014;32:95–105. [Google Scholar]
  • 25.Kesek-Wozniak MM, Wojtas E, Zielak-Steciwko AE. Impact of SNPs in ACACA, SCD1, and DGAT1 genes on fatty acid profile in bovine milk with regard to lactation phases. Animals-Basel. 2020 doi: 10.3390/ani10060997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.McNamara JP. Ruminant Nutrition Symposium: A systems approach to integrating genetics, nutrition, and metabolic efficiency in dairy cattle. J. Anim. Sci. 2012;90:1846–1854. doi: 10.2527/jas.2011-4609. [DOI] [PubMed] [Google Scholar]
  • 27.McNamara JP, Huber K, Kenez A. A dynamic, mechanistic model of metabolism in adipose tissue of lactating dairy cattle. J. Dairy Sci. 2016;99:5649–5661. doi: 10.3168/jds.2015-9585. [DOI] [PubMed] [Google Scholar]
  • 28.Shirley JE, Emery RS, Convey EM, Oxender WD. Enzymic changes in bovine adipose and mammary tissue, serum and mammary tissue hormonal changes with initiation of lactation. J. Dairy Sci. 1973;56:569–574. doi: 10.3168/jds.S0022-0302(73)85220-8. [DOI] [PubMed] [Google Scholar]
  • 29.Yang YT, Baldwin RL. Lipolysis in isolated cow adipose cells. J. Dairy Sci. 1973;56:366–374. doi: 10.3168/jds.S0022-0302(73)85179-3. [DOI] [PubMed] [Google Scholar]
  • 30.McNamara JP, Murray CE. Sympathetic nervous system activity in adipose tissues during pregnancy and lactation of the rat. J. Dairy Sci. 2001;84:1382–1389. doi: 10.3168/jds.S0022-0302(01)70169-5. [DOI] [PubMed] [Google Scholar]
  • 31.Sumner-Thomson JM, Vierck JL, McNamara JP. Differential expression of genes in adipose tissue of first-lactation dairy cattle. J. Dairy Sci. 2011;94:361–369. doi: 10.3168/jds.2010-3447. [DOI] [PubMed] [Google Scholar]
  • 32.Rocco SM, McNamara JP. Regulation of bovine adipose tissue metabolism during lactation. 7. Metabolism and gene expression as a function of genetic merit and dietary energy intake. J. Dairy Sci. 2013;96:3108–3119. doi: 10.3168/jds.2012-6097. [DOI] [PubMed] [Google Scholar]
  • 33.Bainbridge ML, Cersosimo LM, Wright AD, Kraft J. Content and composition of branched-chain fatty acids in bovine milk are affected by lactation stage and breed of dairy cow. PLoS ONE. 2016;11:e0150386. doi: 10.1371/journal.pone.0150386. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Bovenhuis H, Visker MH, van Valenberg HJ, Buitenhuis AJ, van Arendonk JA. Effects of the DGAT1 polymorphism on test-day milk production traits throughout lactation. J. Dairy Sci. 2015;98:6572–6582. doi: 10.3168/jds.2015-9564. [DOI] [PubMed] [Google Scholar]
  • 35.Singh DB, Tripathi T. Frontiers in Protein Structure, Function, and Dynamics. Springer; 2020. [Google Scholar]
  • 36.Pace CN, et al. Contribution of hydrogen bonds to protein stability. Protein Sci. 2014;23:652–661. doi: 10.1002/pro.2449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Singh DB. Computer-Aided Drug Design. Springer; 2020. [Google Scholar]
  • 38.Pathak RK, Singh DB. Advances in Agri-Food Biotechnology. Springer; 2020. pp. 409–426. [Google Scholar]
  • 39.Kim S, et al. Genome-wide identification of candidate genes for milk production traits in Korean Holstein Cattle. Animals (Basel). 2021 doi: 10.3390/ani11051392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Sim NL, et al. SIFT web server: Predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 2012;40:W452–457. doi: 10.1093/nar/gks539. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.UniProt C. UniProt: The universal protein knowledgebase in 2021. Nucleic Acids Res. 2021;49:D480–D489. doi: 10.1093/nar/gkaa1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Choi Y, Chan AP. PROVEAN web server: A tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics. 2015;31:2745–2747. doi: 10.1093/bioinformatics/btv195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Bendl J, et al. PredictSNP: Robust and accurate consensus classifier for prediction of disease-related mutations. PLoS Comput. Biol. 2014;10:e1003440. doi: 10.1371/journal.pcbi.1003440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Stone EA, Sidow A. Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity. Genome Res. 2005;15:978–986. doi: 10.1101/gr.3804205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Capriotti E, Calabrese R, Casadio R. Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information. Bioinformatics. 2006;22:2729–2734. doi: 10.1093/bioinformatics/btl423. [DOI] [PubMed] [Google Scholar]
  • 46.Ramensky V, Bork P, Sunyaev S. Human non-synonymous SNPs: Server and survey. Nucleic Acids Res. 2002;30:3894–3900. doi: 10.1093/nar/gkf493. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Adzhubei IA, et al. A method and server for predicting damaging missense mutations. Nat. Methods. 2010;7:248–249. doi: 10.1038/nmeth0410-248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Bromberg Y, Rost B. SNAP: Predict effect of non-synonymous polymorphisms on function. Nucleic Acids Res. 2007;35:3823–3835. doi: 10.1093/nar/gkm238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 50.Schwede T, Kopp J, Guex N, Peitsch MC. SWISS-MODEL: An automated protein homology-modeling server. Nucleic Acids Res. 2003;31:3381–3385. doi: 10.1093/nar/gkg520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Johansson MU, Zoete V, Michielin O, Guex N. Defining and searching for structural motifs using DeepView/Swiss-PdbViewer. BMC Bioinform. 2012;13:173. doi: 10.1186/1471-2105-13-173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Laskowski RA, Macarthur MW, Moss DS, Thornton JM. Procheck—A program to check the stereochemical quality of protein structures. J. Appl. Crystallogr. 1993;26:283–291. doi: 10.1107/S0021889892009944. [DOI] [Google Scholar]
  • 53.Wiederstein M, Sippl MJ. ProSA-web: Interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res. 2007;35:W407–410. doi: 10.1093/nar/gkm290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Pettersen EF, et al. UCSF Chimera—A visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
  • 55.Rodrigues CH, Pires DE, Ascher DB. DynaMut: Predicting the impact of mutations on protein conformation, flexibility and stability. Nucleic Acids Res. 2018;46:W350–W355. doi: 10.1093/nar/gky300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Parthiban V, Gromiha MM, Schomburg D. CUPSAT: Prediction of protein stability upon point mutations. Nucleic Acids Res. 2006;34:W239–242. doi: 10.1093/nar/gkl190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Capriotti E, Fariselli P, Casadio R. I-Mutant2.0: Predicting stability changes upon mutation from the protein sequence or structure. Nucleic Acids Res. 2005;33:W306–W310. doi: 10.1093/nar/gki375. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Venselaar H, Beek TAH, Kuipers RKP, Hekkelman ML, Vriend G. Protein structure analysis of mutations causing inheritable diseases. An e-Science approach with life scientist friendly interfaces. BMC Bioinform. 2010 doi: 10.1186/1471-2105-11-548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Ashkenazy H, et al. ConSurf 2016: An improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. 2016;44:W344–350. doi: 10.1093/nar/gkw408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Van Der Spoel D, et al. GROMACS: Fast, flexible, and free. J. Comput. Chem. 2005;26:1701–1718. doi: 10.1002/jcc.20291. [DOI] [PubMed] [Google Scholar]
  • 61.Aliev AE, et al. Motional timescale predictions by molecular dynamics simulations: Case study using proline and hydroxyproline sidechain dynamics. Proteins. 2014;82:195–215. doi: 10.1002/prot.24350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Darden T, York D, Pedersen L. Particle Mesh Ewald—An N.Log(N) method for ewald sums in large systems. J. Chem. Phys. 1993;98:10089–10092. doi: 10.1063/1.464397. [DOI] [Google Scholar]
  • 63.Hess B, Bekker H, Berendsen HJC, Fraaije JGEM. LINCS: A linear constraint solver for molecular simulations. J. Comput. Chem. 1997;18:1463–1472. doi: 10.1002/(Sici)1096-987x(199709)18:12&#x0003c;1463::Aid-Jcc4&#x0003e;3.0.Co;2-H. [DOI] [Google Scholar]
  • 64.Abraham MJ, et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX. 2015;1:19–25. doi: 10.1016/j.softx.2015.06.001. [DOI] [Google Scholar]
  • 65.Szklarczyk D, et al. The STRING database in 2021: Customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 2021;49:D605–D612. doi: 10.1093/nar/gkaa1074. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Tables. (824.4KB, pdf)

Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES