Abstract
With the ongoing laboratory restrictions, it is often challenging for bioscience students to make satisfactory progress in their projects. A long-standing practice in multi-disciplinary research is to use computational and theoretical method to corroborate with experiment findings. In line with the lack of opportunity to access laboratory instruments, the pandemic situation is a win-win scenario for scholars to focus on computational methods. This communication outline some of the standalone tools and webservers that bioscience students can successfully learn and adopt to obtain in-depth insights into biochemistry, biophysics, biotechnology, and bioengineering research work.
Keywords: Bioscience, Structural modeling, Visualizers, Classical simulations, Quantum mechanics
Introduction
The worldwide impact of COVID-19 has raised several challenges for research scholars [1]. The common problems include restricted access to lab workspace, delayed transportation of materials, and reduced technical assistance for experimental troubleshoot [2]. Though the online platforms have rescued to a certain level with one-on-one discussion, educational sessions, and virtual conferences, the problem still persists among the graduate students and forming the cloud of worries—“how to make progress in the research project?” (Fig. 1). While the laboratory experiments are not at full speed, it is still possible to use computational power. This communication aims to highlight a few useful computational tools that can help in making research progress, amid restricted laboratory access.
Fig. 1.
Illustration suggesting that exploring computational powers could be valuable for scholars to make progress in bioscience projects
Structural Modeling
Most of the bioscience problems deal with biological macromolecules such as proteins, nucleic acids, and small molecules. Though the structural coordinates are available from the protein data bank (PDB) for some structures, there exists a large sequence-structure gap for many biomacromolecules [3]. Homology modeling tools are a valuable starting point to analyze the putative three-dimensional conformation, interacting residues, and active site arrangement in these structures. The available standalone tools include MODELLER [4], while webservers are Phyre2 [5], ROBETTA [6], and SWISS MODEL [7]. The structural validation of such models for the interface analysis, surface assemblies, and Ramachandran plot can be determined using PDBePISA server [8]. The solvent accessibilities can also be analyzed using Naccess standalone program [9]. The contact map details can be analyzed using ConPlot [10] and DISTEVAL [11]. Relevant details of normal mode for predicting collective protein domain motions is achievable using iMODS [12], ElNémo [13], and WEBnm@ [14]. To determine the protein stability, especially for projects with protein mutants, one can use servers like SDM [15], MAESTROweb [16], PoPMuSic [17], DUET [18], and pPerturb [19]. These algorithms are accountable for predicting thermodynamics (free energy) properties based on input structural coordinates in a PDB format.
Bio-macromolecular Complex
The macromolecular interactions are an important area that helps in better understanding protein functioning, mechanism, and drug designing. The widely used server ClusPro [20] is one of such tools that predict protein-interface based on the energetic evaluation. Hex program offers interactive fast Fourier transform-based docking [21]. Several useful options are also available on HADDOCK [22] webserver that can help in blind or experimental constraint-guided protein docking. While protein-protein docking is more complicated due to the presence of several degrees of freedom, the algorithm for small molecule docking is relatively simpler. The widely used AutoDock [23] is suitable for such exercise, with AutoDock Vina [24] offering high-throughput screening. Notably, these tools must be benchmarked for the system under consideration, such as by docking a known complex and comparing the root-mean-squared deviation (docked vs original complex). Additionally, the docking outcomes can be benchmarked against a reliable set of decoy molecular sets, available from DUD-E [25] and DEKOIS 2.0 [26] database. Schematic diagrams for protein-ligand contacts can be plotted using Ligplot [27]. More advanced tools such as for pKa prediction include Protein-Sol server [28], DelPhiPKa [29], and PDB2PQR [30]. However, these tools come with limitation to process non-standard residues such as small organic and drug-like molecules.
Visualizers
The visualization tools add fun while depicting the spatial arrangement of amino acids and nucleobases, secondary structures, and water molecules in a three-dimensional network. Nevertheless, they can render publication-quality images (Fig. 2). One can also perform structural modeling with visualizers like PyMol [31], VMD [32], and Chimera [33].
Fig. 2.
Representation of static structures from homology modeling and protein-protein docking, and dynamic structures from molecular dynamics simulation
Classical Simulations
The collaboration between classical simulation with biophysics is more common these days. Some of the relevant free tools to perform molecular simulation include GROMACS [34] and Desmond [35]. However, they require advanced hardware to compute physics-based time-dependent motions of biomolecules. A Perl-based toolset for structure preparation and analysis is available with MMTSB [36]. While CHARMM-GUI server [37] provides numerous options to prepare solvated structures, membrane bilayer, and coarse-grain systems, one can also use this server to prepare input complement to other simulation engines. Equilibrated trajectories for a short time-scale are possible to be calculated using MDWeb server [38] and ChemCompute server [39]. Furthermore, scholars having a basic knowledge of python can make use of MDAnalysis [40] to extract information. Both VMD and Chimera offer capabilities to visualize, and varieties of plugins to analyze the simulation trajectory.
Quantum Chemical Calculations
Calculations based on quantum mechanics (QM) are more accurate and reliable compared to the classical simulations; however, they come with their own set of limitations. Accurate modeling of protein-ligand interaction and elucidation of spectroscopic properties are some valuable insights that might be crucial to investigate biochemical and biophysical problems. Notably, rigorous quantum mechanics methods are mandated to achieve their transferability and robustness in connection to experimental corroboration. Orca [41] and GAMESS [42] are open-source quantum chemistry package that offer a wide range of capabilities, including geometry optimization, calculation of UV/Vis excitation energy, CD spectra, and vibrational frequencies. Avogadro [43] is user-friendly freeware that not only enables chemical editing but also to visualize frontier molecular orbitals and vibrational modes. Psi4 open-source package [44] is useful in analyzing the wavefunctions and Hartree-Fock energy decomposition. While these programs are accountable for high-computing time, one can run calculations with a smaller basis set on desktop and laptop with Windows or Linux operating system. Thankfully, it is also possible to perform QM calculations with a high basis set using the webserver ChemCompute [39], which offers computing time on cluster nodes for registered users with an academic electronic address.
Conclusion
In summary, this communication highlights various (not limited to) computational tools relevant to obtain atomistic-scale analysis of biomacromolecules. While having restricted access to laboratory space and equipment, learning computational methods could mark a substantial advancement. Most of the standalone tool and web servers listed are freely accessible, with academic registration. Nevertheless, before using these computational tools and corroborating with the experimental measurements, it is essential to understand their theoretical principles.
Acknowledgements
The author is thankful to Prof. Dr. Anne-Frances Miller for discussing this topic in a regular group meeting and for all the encouragements.
Funding
Open Access funding enabled and organized by Projekt DEAL. R.K.K. thank the Einstein Foundation of Berlin for support.
Declarations
Ethics Approval
Not applicable.
Consent to Participate
Not applicable.s
Consent for Publication
Not applicable.
Competing Interests
The author declares no competing interests.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Schiermeier BQ, Else H, Mega ER, Padma TV, Gaind N. What it’s really like to do science. Nature. 2020;586(7830):486–487. doi: 10.1038/d41586-020-02815-2. [DOI] [PubMed] [Google Scholar]
- 2.Myers KR, Tham WY, Yin Y, Cohodes N, Thursby JG, Thursby MC, Schiffer P, Walsh JT, Lakhani KR, Wang D. Unequal effects of the COVID-19 pandemic on. Nature Human Behaviour. 2020;4(9):880–883. doi: 10.1038/s41562-020-0921-y. [DOI] [PubMed] [Google Scholar]
- 3.Burley SK, Bhikadiya C, Bi C, Bittrich S, Chen L, Crichlow GV, Christie CH, Dalenberg K, di Costanzo L, Duarte JM, Dutta S, Feng Z, Ganesan S, Goodsell DS, Ghosh S, Green RK, Guranović V, Guzenko D, Hudson BP, Lawson CL, Liang Y, Lowe R, Namkoong H, Peisach E, Persikova I, Randle C, Rose A, Rose Y, Sali A, Segura J, Sekharan M, Shao C, Tao YP, Voigt M, Westbrook JD, Young JY, Zardecki C, Zhuravleva M. RCSB Protein Data Bank: Powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences. Nucleic Acids Research. 2021;49(November 2020):437–451. doi: 10.1093/nar/gkaa1038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Webb B, Sali A. Protein structure modeling with MODELLER. Methods in Molecular Biology. 2014;1137(1137):1–15. doi: 10.1007/978-1-4939-0366-5. [DOI] [PubMed] [Google Scholar]
- 5.Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJE. The Phyre2 web portal for protein modeling , prediction and analysis. Nature Protocols. 2015;10(6):845–858. doi: 10.1038/nprot.2015-053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Yang J, Anishchenko I, Park H, Peng Z, Ovchinnikov S. Improved protein structure prediction using predicted interresidue orientations. Proceedings of the National Academy of Sciences of the United States of America. 2020;117(3):1496–1503. doi: 10.1073/pnas.1914677117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, Heer FT, de Beer TAP, Rempfer C, Bordoli L, Lepore R, Schwede T. SWISS-MODEL: Homology modelling of protein structures and complexes. Nucleic Acids Research. 2018;46(W1):W296–W303. doi: 10.1093/nar/gky427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. Journal of Molecular Biology. 2007;372(3):774–797. doi: 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]
- 9.Hubbard SJ, Thornton JM. NACCESS. 1993. [Google Scholar]
- 10.Sa, F., Mesdaghi, S., Simpkin, A. J., Burgos-ma, J. J., Murphy, D. L., Uski, V., … Rigden, D. J. (2021). Structural bioinformatics ConPlot: Web-based application for the visualization of protein contact maps integrated with other data. Bioinformatics, (January), 1–3. 10.1093/bioinformatics/btab049 [DOI] [PMC free article] [PubMed]
- 11.Adhikari B, Shrestha B, Bernardini M, Hou J, Lea J. DISTEVAL: A web server for evaluating predicted protein distances. BMC Bioinformatics. 2021;22(8):1–9. doi: 10.1186/s12859-020-03938-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Aliaga I, Quintana-ort ES, Chac P. iMODS: Internal coordinates normal mode analysis server. Nucleic Acids Research. 2014;42(April):271–276. doi: 10.1093/nar/gku339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Suhre K, Sanejouand Y. ElNemo: A normal mode web server for protein ElNe movement analysis and the generation of templates for molecular replacement. Nucleic Acids Research. 2004;32:610–614. doi: 10.1093/nar/gkh368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Tiwari SP, Fuglebakk E, Hollup SM, Skjærven L, Cragnolini T, Grindhaug SH, Tekle KM, Reuter N. WEBnm @ v2.0: Web server and services for comparing protein flexibility. BMC. 2015;15(427):1–12. doi: 10.1186/s12859-014-0427-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ascher DB, Pandurangan AP, Ochoa-monta B, Blundell L. SDM: A server for predicting effects of mutations on protein stability. Nucleic Acids Research. 2017;45(May):229–235. doi: 10.1093/nar/gkx439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Laimer J, Hiebl-flach J, Lengauer D, Lackner P. MAESTROweb: A web server for structure- based protein stability prediction. Bioinformatics. 2016;32(9):11414–11416. doi: 10.1093/bioinformatics/btv769. [DOI] [PubMed] [Google Scholar]
- 17.Dehouck, Y., Kwasigroch, J. M., Gilis, D., & Rooman, M. (2011). PoPMuSiC 2.1: A web server for the estimation of protein stability changes upon mutation and sequence optimality. BMC Bioinformatics, 12(151), 1–12. [DOI] [PMC free article] [PubMed]
- 18.Pires DEV, Ascher DB, Blundell TL. DUET: A server for predicting effects of mutations on protein stability using an integrated computational approach. Nucleic Acids Research. 2014;42(May):314–319. doi: 10.1093/nar/gku411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Gopi S, Devanshu D, Rajasekaran N, Anantakrishnan S, Naganathan AN. pPerturb: A server for predicting long-distance energetic couplings and mutation-induced stability changes in proteins via perturbations. ACS Omega. 2020;5(2):1142–1146. doi: 10.1021/acsomega.9b03371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Desta IT, Porter KA, Xia B, Kozakov D, Vajda S, Desta IT, et al. Resource performance and its limits in rigid body protein- protein docking ll. Structure. 2020;28(9):1071–1081.e3. doi: 10.1016/j.str.2020.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ritchie DW. Protein docking using case-based reasoning. Proteins: Structure, Function, and Bioinformatics. 2013;81(12):2150–2158. doi: 10.1002/prot.24433. [DOI] [PubMed] [Google Scholar]
- 22.Van Zundert GCP, Rodrigues JPGLM, Trellet M, Schmitz C. The HADDOCK2.2 Web Server: User-friendly integrative modeling of biomolecular complexes. Journal of Molecular Biology. 2016;428(4):720–725. doi: 10.1016/j.jmb.2015.09.014. [DOI] [PubMed] [Google Scholar]
- 23.Forli S, Huey R, Pique ME, Sanner MF, Goodsell DS, Olson AJ. Computational protein – ligand docking and virtual drug screening with the AutoDock suite. Nature Protocols. 2016;11(5):905–919. doi: 10.1038/pj.2016.37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Trott O, Olson AJ. AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of Computational Chemistry. 2009;31(2):455–461. doi: 10.1002/jcc. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Mysinger MM, Carchia M, Irwin JJ, Shoichet BK. Directory of useful decoys, enhanced (DUD-E): Better ligands and decoys for better benchmarking. Journal of Medicinal Chemistry. 2012;55(14):6582–6594. doi: 10.1021/jm300687e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Boeckler FM, Bauer MR, Ibrahim TM, Vogel SM. Use of DEKOIS 2.0 to gain insights for virtual screening. Journal of Cheminformatics. 2014;6(Suppl 1):2014. doi: 10.1186/1758-2946-6-S1-O24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wallace AC, Laskowski RA, Thornton JM. LIGPLOT: A program to generate schematic diagrams of protein-ligand interactions Clean up structure. Protein Engineering. 1995;8(2):127–134. doi: 10.1093/protein/8.2.127. [DOI] [PubMed] [Google Scholar]
- 28.Hebditch M, Carballo-amador MA, Charonis S, Curtis R, Warwicker J. Sequence analysis Protein – Sol: A web tool for predicting protein solubility from sequence. Bioinformatics. 2017;33(19):3098–3100. doi: 10.1093/bioinformatics/btx345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Pahari S, Sun L, Basu S, Alexov E. DelPhiPKa: Including salt in the calculations and enabling polar residues to titrate. Proteins: Structure, Function, and Bioinformatics. 2018;86(12):1277–1283. doi: 10.1002/prot.25608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Unni S, Huang Y, Hanson RM, Tobias M, Krishnan S, Li WW, et al. Web servers and services for electrostatics calculations with APBS and PDB2PQR. Journal of Computational Chemistry. 2011;32:5–8. doi: 10.1002/jcc. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.The PyMOL Molecular Graphics System, Version 2.0 Schrödinger, LLC. (n.d.).
- 32.Humphrey W, Dalke A, Schulten K. VMD: Visual molecular dynamics. Journal of Molecular Graphics. 1996;14(1):33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
- 33.Huang CC, Meng EC, Morris JH, Pettersen EF, Ferrin TE. Enhancing UCSF Chimera through web services. Nucleic Acids Research. 2014;42(May):478–484. doi: 10.1093/nar/gku377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.James M, Murtola T, Schulz R, Smith JC, Hess B, Lindahl E. GROMACS : High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX. 2015;2:19–25. doi: 10.1016/j.softx.2015.06.001. [DOI] [Google Scholar]
- 35.Gregersen, B. A., Klepeis, J. L., Kolossvary, I., & Mark, A. (2006). Scalable algorithms for molecular dynamics simulations on commodity clusters. In Proceedings of the ACM/IEEE Conference on Supercomputing (SC06), Tampa, Florida , 2006,.
- 36.Feig M, Karanicolas J, Iii CLB. MMTSB Tool Set: Enhanced sampling and multiscale modeling methods for applications in structural biology. Journal of Molecular Graphics and Modelling. 2004;22(5):377–395. doi: 10.1016/j.jmgm.2003.12.005. [DOI] [PubMed] [Google Scholar]
- 37.Qi Y, Lee J, Singharoy A, Mcgreevy R, Schulten K, Im W. CHARMM-GUI MDFF/xMDFF utilizer for molecular dynamics flexible fitting simulations in various environments. Journal of Physical Chemistry B. 2017;121(15):3718–3723. doi: 10.1021/acs.jpcb.6b10568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Hospital A, Andrio P, Fenollosa C, Cicin-sain D, Orozco M, Gelpí JL. MDWeb and MDMoby : An integrated web-based platform for molecular dynamics simulations. Bioinformatics. 2012;28(9):1278–1279. doi: 10.1093/bioinformatics/bts139. [DOI] [PubMed] [Google Scholar]
- 39.Perri MJ, Weber SH. Web-based job submission interface for the GAMESS computational chemistry program. Journal of Chemical Education. 2014;91(12):2206–2208. doi: 10.1021/ed5004228. [DOI] [Google Scholar]
- 40.Michaud-agrawal N, Denning EJ, Woolf TB, Beckstein O. MDAnalysis : A toolkit for the analysis of molecular dynamics simulations. Journal of Computational Chemistry. 2011;32:2319–2327. doi: 10.1002/jcc. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Neese F. Software update: The ORCA program system, version 4.0. Wiley Interdisciplinary Reviews: Computational Molecular Science. 2018;8(1):4–9. doi: 10.1002/wcms.1327. [DOI] [Google Scholar]
- 42.Phys JC, Barca GMJ, Bertoni C, Carrington L, Fedorov DG, Gour JR, et al. Recent developments in the general atomic and molecular electronic structure system Recent developments in the general atomic and molecular electronic structure system. Journal of Chemical Physics. 2020;154102(February):154102. doi: 10.1063/5.0005188. [DOI] [PubMed] [Google Scholar]
- 43.Hanwell MD, Curtis DE, Lonie DC, Vandermeersch T, Zurek E, Hutchison GR. Avogadro : An advanced semantic chemical editor , visualization , and analysis platform. Journal of Cheminformatics. 2012;4(17):1–17. doi: 10.1186/1758-2946-4-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Phys JC, Smith DGA, Galvelis R, James M, Schriber B, Burns LA, et al. Throughput quantum chemistry P SI4 1.4: Open-source software for high-throughput quantum chemistry. Journal of Chemical Physics. 2020;184108(February):184108. doi: 10.1063/5.0006002. [DOI] [PMC free article] [PubMed] [Google Scholar]


