Abstract
The Centre for Molecular Informatics, formerly Unilever Centre for Molecular Science Informatics (UCMSI), at the University of Cambridge is a world-leading driving force in the field of cheminformatics. Since its opening in 2000 more than 300 scientific articles have fundamentally changed the field of molecular informatics. The Centre has been a key player in promoting open chemical data and semantic access. Though mainly focussing on basic research, close collaborations with industrial partners ensured real world feedback and access to high quality molecular data. A variety of tools and standard protocols have been developed and are ubiquitous in the daily practice of cheminformatics. Here, we present a retrospective of cheminformatics research performed at the UCMSI, thereby highlighting historical and recent trends in the field as well as indicating future directions.
Keywords: Centre for Molecular Science Informatics, Retrospective analysis, History, Trends, Directions
1 Introduction
In December 2000 the Unilever Centre for Molecular Science Informatics (UCMSI) was opened at the Department of Chemistry of University of Cambridge. Based on an investment by the industrial partner Unilever, a new world-leading research group in the emerging field of molecular informatics was established. The investment included a new building, an established chair in Molecular Science Informatics and three lectureships as well as set up costs (equipment, networking, and software). The Unilever research grants were renewed in year five and year ten. In addition, over the period of the UCMSI’s existence, significant additional grants from a variety of industrial, charitable and research council sources were obtained to support the objectives of the UCMSI.
The research centre is located in central Cambridge, thus profiting from a stimulating and exciting research environment. Daily interactions with several other local institutes at the University of Cambridge, the EMBL European Bioinformatics Institute (EBI), and the Cambridge Crystallographic Data Centre (CCDC) create a world class research cluster. Furthermore, several major industrial partners are located in close proximity in Cambridge’s science parks and on corporate research sites.
Collaborations with more than twenty industrial partners and especially Unilever allowed access to high quality data sets and formed the basis for state-of-the-art computational modelling.[1] Further industrial partners have included Boehringer Ingelheim, AstraZeneca, BASF, Pfizer, Johnson&Johnson, GSK, Aboca and Eli Lilly, to name but a few. Additional third party funding was attracted at the national level from the UK Engineering and Physical Sciences Research Council, The Medical Research Council, The Wellcome Trust, and the Biotechnology and Biological Sciences Research Council, as well as The National Institutes of Health in the USA. On the European level, major grants from the European Chemical Industry Council (CEFIC) and the European Research Council (ERC), and the Framework-7 program were successfully obtained, funding a variety of international informatics research projects.
The research at the UCMSI covers broad areas of cheminformatics, which coupled with experiments and collaborations with industrial partners in bringing products to the market, ensures real life feedback in several inter-disciplinary research efforts. A main goal of the currently 40 scientists in the UCMSI is the integration of chemistry, biology and materials science through the development and application of molecular informatics. Robert Glen has directed the UCMSI since its opening. He heads an interdisciplinary research group using a broad set of computational methodologies to tackle basic scientific questions in the general area of molecular biosystems. Four additional research groups are focusing on relevant areas on cheminformatics research at the UCMSI. The group of Jonathan Goodman focuses on synthesis, computation and informatics and applies cheminformatics to tackle questions in chemical reactivity and catalysis. Peter Murray-Rust’s group focuses on semantic web technologies and develops methodologies and software for intelligent storage and retrieval of chemical data. Andreas Bender’s group drives the integration of new large scale data sources (gene expression, biological networks, phylogenetics) in the field of cheminformatics. Recently, Lucy Colwell has established a new group at the Centre, her research aims to identify structural features within large datasets using advanced statistical and data analytics methodologies. Two former group leaders have recently departed from the UCMSI to set up significant research groups. Peter J. Bond recently moved on to a principal investigator position at the Bioinformatics (BII) Institute A*STAR in Singapore and continues his research efforts on multiscale modelling and large scale simulations. Former group leader John Mitchell has moved to the University of St. Andrews as a Reader and continues broad research in cheminformatics from quantum chemistry to molecular simulation technologies. Former group leaders include David Lary, Guy Grant, Dmitry Nerukh, Maxim Federov and Hamse Mussa.
Over the years of the Centre’s existence 67 PhDs have been trained and 36 postdoctoral research associates performed high quality research. The UCMSI hosted 50 conferences/workshops and organized over 200 seminars. Scientists at the UCMSI have won several awards and prizes including the RSC Bader Award to Jonathan Goodman, the Hansch Award to Andreas Bender, and the Novartis Chemistry Lectureship to Robert Glen. In this article we describe the developments in cheminformatics research performed at the UCMSI aiming to identify challenges and opportunities for the field in the future.
2 Methods
2.1 Data Mining in Web of Science
We retrieved all entries related to the UCMSI in all Web of Science databases.[2] Therefore, we used the Web of Science web interface and extracted entries with at least one authors address containing all words “Cambridge”, “University”, “Molecular”, and “Unilever”. Furthermore, we discarded entries in the database earlier than the year 2001, to remove three false positive hits of earlier years. Unfortunately, some entries (e.g.[3,4]) are discarded as false negatives at this stage, as different abbreviations in author affiliations have been used. Citing articles and total citations were extracted for all articles and analyzed in terms of self- and non-self citations according to the tools provided in the Web of Science online mask. The Hirsch index (h-index)[5] was calculated to estimate a total impact of the science performed at the UCMSI.
2.2 Analysis of Publications Using Word Clouds
Furthermore, we extracted information on author names and abstracts (where available) from all publications. We identified the leading scientists and their central research topics using word cloud representations (“wordles”) generated using WordItOut.[6] We used author surnames only to ensure consistency between different data sources. Full abstracts were used for the generation of topic-related word clouds. To allow for identification of trends over time we split the complete data sets for authors and abstracts into sections of years. Thereby, we analyzed author and thematic contributions for the years 2001–2004, 2005–2007, 2008–2010, and 2011–2014 respectively.
3 Results
We identified in total 325 published items of the UCMSI in the Web of Science database. After an initial growth phase, the article output has reached a stable plateau of between twenty and 40 published items per year (see Figure 1A). The observed decrease for the year 2014 arises from incomplete data for the current year. Linear extrapolation to the complete year increases the number of published articles listed in Web of Science to the average level of approximately 20. Due to lagging indexing in Web of Science linear extrapolation is expected to underestimate values for year 2014. When setting aside the incomplete year 2014, Pearson correlation between years and published items is 0.78, indicating a steadily growing output of research performed at the Centre.
Figure 1.

Scientific publications at the UCMSI: A) Published items at the UCMSI are shown over the years of its existence since 2000. An increase to a stable plateau of around two publications per month is visible. B) Citations related to items published at the UCMSI. The increasing importance of research performed in the field of cheminformatics is reflected by steadily growing citations per year.
In addition to the number of published items, citations were counted on articles published by the Centre. Soon after publication of the first two articles in 2001, first citations are recorded (see Figure 1B). The amount of citations continues to grow steadily over the years and breaks the barrier of 100 citations in the year 2005. Results are shown on a per-year basis, not as a cumulative count. Nevertheless, an almost linear increase of citations with Pearson correlation coefficient of 0.97 between year and citations is observed when excluding the incomplete year 2014. With 988 citations in the year 2013 only, research from the UCMSI is referenced almost three times per day. Linear extrapolation for year 2014 indicates that the barrier of 1000 citations is likely to be broken for the first time.
In total 5508 citations on the 325 articles were recorded via Web of Science. Only nine percent of these represent self-citations (506 citations), thus reflecting a broad audience and considerable impact within the cheminformatics community. 4109 unique indexed articles reference scientific reports from the UCMSI with less than five percent of self-citations (177 articles). On average, articles from the Centre are cited 16.95 times.
No single article accounts for the large average citation count. By contrast, eight articles are individually cited over 100 times, all of them have been published in 2004 or later.[7–14] This broad impact is reflected by an h-index of 39 after 14 years of existence of the Centre. Eight articles are indicated as “highly cited articles” within Web of Science,[9,11–13,15–18] thus representing the top one percent publications in the respective field. These articles only partially overlap with articles already cited more than 100 times. This indicates that they will most likely be the next ones crossing the barrier of 100 citations.
By creating word clouds from author lists of all published items we identified key scientists at the UCMSI in Cambridge (see Figure 2A). Robert C. Glen has published 83 items over the years, closely followed by the group leaders Peter Murray-Rust, Andreas Bender, and Jonathan Goodman, each contributing between 60 and 70 articles. Splitting author contributions according to intervals of years allows the development of the UCMSI to be followed. In earlier years, lecturer John Mitchell was a highly active researcher at the UCMSI, as was Henry Rzepa in collaboration with Peter Murray-Rust (see Figure 2B). In later years the name “Andreas Bender” emerges more and more with a short break during his time at Novartis and the University of Leiden (see Figure 2C and Figure 2D). In recent years, the group of Peter Bond focusing on simulations added additional output to the UCMSI (see Figure 2E).
Figure 2.

Word clouds of co-authors listed on publications of the UCMSI: A) Word cloud over all years of existence. B–E) Author occurrences are split according to years 2004–2004, 2005–2007, 2008–2010, and 2011–2014 respectively.
Wordles created on the basis of published abstracts identify key topics and methods applied at the UCMSI (see Figure 3A). “Data” is the most abundant single phrase and appears mostly in connection with the second most occurring words “molecular” and “chemical”. Therefore, chemical data form the foundation of all research performed at the Centre. “Model” and “models” appear prominent in the list, giving hints how chemical data is utilized in the generation of computational models for a variety of mostly chemical and biological properties.
Figure 3.

Wordles generated from indexed abstracts of publications from the UCMSI: A) Word cloud over all years of existence of the Centre. B-E) Word occurrences are analysed in terms of chronological development over the years 2001–2004, 2005–2007, 2008–2010, and 20011–2014.
Splitting abstracts according to publication years does not reveal any consistency in the research efforts on-going at the UCMSI (see Figures 3B–3E). Contributions of “molecular” and “data” are constantly high. Some words linked to the observables predicted in established models appear transiently, e.g. “target” for recently established target prediction tools. Nevertheless, the low frequency of individual modelled molecular properties reflects the broadness of topics covered. Furthermore, structure-based modelling approaches[19] appear more prominent in recent years as indicated by the increasing occurrence of the keywords “protein”, “water”, and “dynamics”.
4 Discussion
Analysis of citations has allowed the identification of key topics and methods applied at the UCMSI. Science is often centred around method development which is reflected by several publications in leading journals in the field of cheminformatics and modelling. Additionally, application of the newly developed methods ensures real life feedback and enabled publications in fields ranging from drug discovery to synthesis, materials science to electrochemistry. In the following paragraphs we will select particular fields of research where major advances have been made at the Centre.
4.1 Linking Cheminformatics and Biology
Starting from innovative ways to encode and compare chemical environments in molecules,[7,8,10] several steps have been taken to link chemical with biological properties of molecules[20] based on statistical modelling techniques.[21,22] Thereby, the fields of cheminformatics and bioinformatics increasingly overlap and even fuse, leading to approaches like proteochemometric modelling (PCM).[23,24] With the establishment of target prediction tools based on statistical modelling,[25,26] novel approaches to cluster molecules using predicted biological effect can be implemented.[27] Furthermore, these novel methodologies proved to be helpful in multi target drug design.[28] Recent directions in the area comprise the inclusion of further biological data sources, e.g. from biological networks and phylogenetics[29,30] as well as gene expression data.[31] Several successful applications of target prediction algorithms[32–34] underline the increasing accuracy of such data-driven approaches and point towards a bright future given the increase in available data sources.[35,36] In a recent success story cheminformatic tools were applied to identify potential targets of a series of synthetic biscoumarins showing anti-cancer activity in vitro and in vivo.[37] After in depth computational characterization of predicted protein-ligand interactions (see Figure 4A), the in silico predicted protein target tumour necrosis factor α was verified by experimental techniques. An emerging future direction in this field is the prediction of biological effects of compound combinations that is financed via an ERC Starting Grant to Andreas Bender. Further stimuli in this area expected from statistical analyses of genomic sequence data.[38,39]
Figure 4.

Examples of cheminformatics research currently performed at the UCMSI: A) Ensemble of protein-ligand poses extracted from a molecular dynamics simulation of tumor necrosis factor α (TNFα, green cartoon) in complex with a biscoumarin (cyan sticks). The native trimeric structure of TNFα (semi-transparent yellow cartoon, PDB: 1TNF) is disrupted by ligand binding. A zoom on the binding site highlights the hydrophobic environment of the ligand, allowing for several binding modes. B) Model of the apelin receptor (green cartoon) in complex with the apelin-13 peptide (cyan sticks). Starting coordinates of a molecular dynamics simulation box are depicted including a model lipid bi-layer (yellow lines and semi-transparent surface) along with explicit water molecules and counter ions (both not shown). The predicted binding mode of the apelin-13 peptide and surrounding residues are highlighted in the included zoom on the binding cavity. C) Illustration of the simulation approach followed to investigate selectivity of Schiff base forming covalent IRE-1 inhibitors. IRE-1 is shown as cartoon with the reactants Lys-907 and inhibitor 4µ8C highlighted as sticks.
4.2 Data Semantics and Accessibility
Growth of the available (“Big”) data brings new challenges to the field of cheminformatics. Millions of chemical substances have been characterized, many hundreds of thousands of three-dimensional structures have been deposited in databases, and associated biological data is spread over millions of publications and patents.[40] Therefore, the development of standards for chemical identification, indexing and storage is crucial for future successful data retrieval. Usage of the IUPAC standard International Chemical Identifiers (InChIs) allows the encoding of chemical information using a unique text descriptor, e.g. for database searches.[41] Based on InChIs several tools have been developed, e.g. to convert structures to chemical names in a fully automated way[42] or chemistry-aware text mining.[43] Recently, InChIs have been extended to RinChIs to depict chemical reactions unambiguously.[44] Chemical Markup Language (CML) introduces and specifies particular data fields to efficiently store and retrieve chemical data.[45] Based on CML the World-Wide Molecular Matrix (WWMM) has been introduced to collect and connect chemical information of various sources.[46] Chem4Word has been created in a collaboration between Microsoft Research and the UCMSI to facilitate the handling of chemical information within text processing software. To date, the plug-in has been downloaded more than 400 000 times. Over the last decade, the UCMSI has been a key player in setting standards for open data and data quality standards in chemistry.[47–49] In recent years, technologies for data semantics and natural language processing have been employed attempting to directly extract the scientific context of published chemical data.[50]
4.3 Modelling of Physicochemical Properties and ADMET Parameters
The increase in data sources over recent years allowed the establishment of more accurate computational models, even for complex biological phenomena, e.g. absorption, distribution, metabolism, excretion and toxicity (ADMET) of xenobiotics.[51] Dozens of computational methods for prediction of metabolic reactions on different levels of complexity have been published in the literature.[16] Thorough classification of annotated biotransformations[52] facilitated the development of novel predictive models for general metabolic reactivity (Metaprint2D),[53,54] P-glycoprotein transport,[55] solubility,[56] and cytochrome P450-catalyzed metabolic reactions.[57–59] Thereby, innovative computational methods making use of recent advances in graphics processing unit (GPU) based computing have been employed. Using these approaches, new levels of throughput and thus modelling accuracy are in range. Furthermore, the UCMSI ran a solubility competition (with over 100 entries) for the cheminformatics community and provided high quality experimental data to facilitate further method developments in the area.[60–62]
4.4 Bioactive Compound Discovery
The UCMSI has been a driving force of computer-aided drug design over the past decades. In addition to support of external drug design efforts, local lead discovery projects have been guided by computational technologies. Novel ligands for the G-protein coupled receptor (GPCR) apelin have been successfully identified and biologically characterized.[63] Based on modelled structures of the apelin receptor (see Figure 4B), optimization of the peptide-derived compounds is on-going and shows enormous potential for further development and recently the first human study of apelin biased agonists has been completed in Addenbrooke's hospital in Cambridge. (Design, characterization and first-in-human study of the vascular actions of a novel ‘biased’ apelin receptor agonist. Anthony Davenport, Aimee Brame, Janet Maguire, Peiran Yang, Alex Dyson, Rubben Torella, Joseph Cheriyan, Mervyn Singer, Robert Glen, Ian Wilkinson. Hypertension, in press, 2015). Furthermore, small molecule antagonists of the 5-HT1B GPCR have been identified and optimized for development as a potential treatment for Pulmonary Hypertension.[64] Several newly designed and synthesized compounds are currently in clinical studies in Addenbrooke’s Hosptial, Cambridge. In another compound discovery effort large scale simulation approaches (see Figure 4C) have been used to identify covalent binders of the endonuclease domain of IRE-1.[65,66] In a follow-up study selectivity of the Schiff base forming compounds has been investigated by advanced computational techniques.[67] Research at the UCMSI also lead to joint patents with Unilever on the CB1/2 receptors and NCKX ion channels showing benefits in skin for use in home and personal care products. To assist drug discovery several analyses of molecular diversity in the context of diversity-oriented chemical synthesis have been performed in collaboration with experimental groups.[68–70]
In addition to these four areas the UCMSI has been particularly active in development of innovative simulation and analysis methodologies[71,72] and cheminformatic support for organic synthesis.[73,74] Industrial partners emphasize the productivity of collaborative research efforts with the Centre for Molecular Informatics. Ola Engkvist, team leader of Computational Chemistry at AstraZeneca and involved in several joint research projects, highlights: “The collaboration with the UCMSI provides AstraZeneca with the opportunity to work closely with one of the world leading groups in cheminformatics. The combination of the UCMSI’s outstanding scientific knowledge with AstraZeneca’s industrial experience provides a platform for state-of-the-art research in cheminformatics. The proximity to one of the AstraZeneca science hubs is an additional plus that facilitates smooth collaborations.” Jim Crilly, senior vice-president of the Strategic Science Group at Unilever, adds: “Unilever is very proud to have instigated in partnership with the University of Cambridge the Centre for Molecular Informatics under the Leadership of Professor Robert Glen and pleased that it has developed into a global centre of excellence in a hugely important field of research. Collaboration with the centre has brought a new way of working into our own research endeavours and accelerated our discovery process.”
Following a clear mission statement published in 2002[75] the UCMSI is developing tools and standards in molecular informatics. With the increase in computer power and data accessibility, molecular modelling allows chemical and biological effects of increasing complexity and size to be captured. Available data sources range from chemistry and related bioactivity (PubChem,[76] ChEMBL,[35] DrugBank[77]) via protein sequence (UniProt[78]) and structure (Protein Data Bank,[79] Pfam[80]), to cellular pathways (KEGG[81]) and responses (LINCS[82]). A collection of online molecular biology databases has recently been published with the database issue of Nucleic Acids Research.[83]
With the advent of microsecond simulations of biological macromolecules[84] and sophisticated enhanced sampling methods,[85] dynamic processes underlying macromolecular recognition processes can be modelled with a new level of accuracy.[86] Protein-ligand binding processes may be studied at atomistic resolution, reports include full sampling of binding and unbinding of both fragments[87] and small molecules.[88] With an increasingly accurate description of protein dynamics, its role in biomolecular recognition processes can be studied, thereby allowing pharmaceutically relevant properties like binding specificity[89] or binding kinetics[90] including allosteric mechanisms[91] to be probed. On the other hand, integration of innovative data sources from emerging “omics” fields such as proteomics,[92] metabonomics/metabolomics,[93] or lipidomics[94] will allow the capture of novel biological properties which have limited or no direct chemical leads to their mechanism and function.
5 Conclusions
The data presented underline the central role in the cheminformatics world that is occupied by the UCMSI. The research institute sets world-wide standards and publishes technologies and software that is widely used and cited. The broad range of topics covered at the UCMSI ensures its broad impact ranging from basic cheminformatics, data mining and machine learning techniques to complex simulations, to chemical reactivity and synthesis. The connection of all these areas in a single research centre offers unique opportunities for scientists involved, industrial partners, as well as the whole cheminformatics community.
Acknowledgments
J. E. F thanks the Medical Research Council for funding (Grant Number MR/K020919/1). Furthermore, the UCMSI acknowledges all funding sources for continuous support over the past 15 years. We thank Ola Engkvist, AstraZeneca, and Jim Crilly, Unilever, for their encouraging feedback on joint research efforts and current and former members of the UCMSI for their valuable input on the manuscript.
References
- 1.Nigsch F, Macaluso NJ, Mitchell JB, Zmuidinavicius D. Expert Opin. Drug Metab. Toxicol. 2009;5:1–14. doi: 10.1517/17425250802660467. [DOI] [PubMed] [Google Scholar]
- 2. http://www.isiknowledge.com, database access 20.08.2014.
- 3.Adams SE, Goodman JM, Kidd RJ, McNaught AD, Murray-Rust P, Norton FR, Townsend JA, Waudby CA. Org. Biomol. Chem. 2004;2:3067–3070. doi: 10.1039/B411699M. [DOI] [PubMed] [Google Scholar]
- 4.Townsend JA, Adams SE, Waudby CA, de Souza VK, Goodman JM, Murray-Rust P. Org. Biomol. Chem. 2004;2:3294–3300. doi: 10.1039/b411033a. [DOI] [PubMed] [Google Scholar]
- 5.Hirsch JE. Proc. Natl. Acad. Sci. USA. 2005;102:16569–16572. doi: 10.1073/pnas.0507655102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. http://www.worditout.com, accessed 20.08.2014.
- 7.Bender A, Glen RC. Org. Biomol. Chem. 2004;2:3204–3218. doi: 10.1039/B409813G. [DOI] [PubMed] [Google Scholar]
- 8.Bender A, Mussa HY, Glen RC, Reiling S. J. Chem. Inf. Comput. Sci. 2004;44:1708–1718. doi: 10.1021/ci0498719. [DOI] [PubMed] [Google Scholar]
- 9.Adams N, Schubert US. Adv. Drug Deliv. Rev. 2007;59:1504–1520. doi: 10.1016/j.addr.2007.08.018. [DOI] [PubMed] [Google Scholar]
- 10.Bender A, Mussa HY, Glen RC, Reiling S. J. Chem. Inf. Comput. Sci. 2004;44:170–178. doi: 10.1021/ci034207y. [DOI] [PubMed] [Google Scholar]
- 11.Fedorov MV, Kornishev AA. J. Phys. Chem. B. 2008;112:11868–11872. doi: 10.1021/jp803440q. [DOI] [PubMed] [Google Scholar]
- 12.Fedorov MV, Kornishev AA. Electrochim. Acta. 2008;53:6835–6840. [Google Scholar]
- 13.Demir E, Cary MP, Paley S, Fukuda K, Lemer C, Vastrik I, Wu G, D’Eustachio P, Schaefer C, Luciano J, Schacherer F, Martinez-Flores I, Hu Z, Jimenez-Jacinto V, Joshi-Tope G, Kandasamy K, Lopez-Fuentes AC, Mi H, Pichler E, Rodchenkov I, Splendiani A, Tkachev S, Zucker J, Gopinath G, Rajasimha H, Ramakrishnan R, Shah I, Syed M, Anwar N, Babur O, Blinov M, Brauner E, Corwin D, Donaldson S, Gibbons F, Goldberg R, Hornbeck P, Luna A, Murray-Rust P, Neumann E, Ruebenacker O, Samwald M, van Iersel M, Wimalaratne S, Allen K, Braun B, Whirl-Carrillo M, Cheung KH, Dahlquist K, Finney A, Gillespie M, Glass E, Gong L, Haw R, Honig M, Hubaut O, Kane D, Krupa S, Kutmon M, Leonard J, Marks D, Merberg D, Petri V, Pico A, Ravenscroft D, Ren L, Shah N, Sunshine M, Tang R, Whaley R, Letovksy S, Buetow KH, Rzhetsky A, Schachter V, Sobral BS, Dogrusoz U, McWeeney S, Aladjem M, Birney E, Collado-Vides J, Goto S, Hucka M, Le Novère N, Maltsev N, Pandey A, Thomas P, Wingender E, Karp PD, Sander C, Bader GD. Nat. Biotechnol. 2010;28:935–942. doi: 10.1038/nbt.1666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Simon L, Goodman JM. J. Am. Chem. Soc. 2008;130:8741–8747. doi: 10.1021/ja800793t. [DOI] [PubMed] [Google Scholar]
- 15.Scior T, Bender A, Tresadern G, Medina-Franco JL, Martinez-Mayorga K, Langer T, Cuanalo-Contreras K, Agrafiotis DK. J. Chem. Inf. Model. 2012;52:867–881. doi: 10.1021/ci200528d. [DOI] [PubMed] [Google Scholar]
- 16.Kirchmair J, Williamson MJ, Tyzack JD, Tan L, Bond PJ, Bender A, Glen RC. J. Chem. Inf. Model. 2012;52:617–648. doi: 10.1021/ci200542m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Ballester PJ, Mitchell JBO. Bioinformatics. 2010;26:1169–1175. doi: 10.1093/bioinformatics/btq112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Grienke U, Schmidtke M, von Grafenstein S, Kirchmair J, Liedl KR, Rollinger JM. Nat. Prod. Rep. 2012;29:11–36. doi: 10.1039/c1np00053e. [DOI] [PubMed] [Google Scholar]
- 19.Glen RC, Allen SC. Curr. Med. Chem. 2003;10:763–767. doi: 10.2174/0929867033457809. [DOI] [PubMed] [Google Scholar]
- 20.Fauzi FM, Koutsoukas A, Lowe R, Joshi K, Fan TP, Glen RC, Bender A. J. Chem. Inf. Model. 2013;53:661–673. doi: 10.1021/ci3005513. [DOI] [PubMed] [Google Scholar]
- 21.Mussa HY, Hawizy L, Nigsch F, Glen RC. J. Chem. Inf. Model. 2011;51:4–14. doi: 10.1021/ci100022u. [DOI] [PubMed] [Google Scholar]
- 22.Mussa HY, Mitchell JB, Glen RC. J. Cheminf. 2013;5:37. doi: 10.1186/1758-2946-5-37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.van Westen GJP, Swier RF, Wegner JK, IJzerman AP, van Vlijmen HWT, Bender A. J. Cheminf. 2013;5:41. doi: 10.1186/1758-2946-5-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.van Westen GJP, Swier RF, Cortes-Ciriano I, Wegner JK, Overington JP, IJzerman AP, van Vlijmen HWT, Bender A. J. Cheminf. 2013;5:42. doi: 10.1186/1758-2946-5-42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Koutsoukas A, Lowe R, Kalantar-Motamedi Y, Mussa HY, Klaffke W, Mitchell JBO, Glen RC, Bender A. J. Chem. Inf. Model. 2013;53:1957–1966. doi: 10.1021/ci300435j. [DOI] [PubMed] [Google Scholar]
- 26. J. C. Costello, L. M. Heiser, E. Georgii, M. Gönen, M. P. Menden, N. J. Wang, M. Bansal, M. Ammad-ud-din, P. Hintsanen, S. A. Khan, J. P. Mpindi, O. Kallioniemi, A. Honkela, T. Aittokallio, K. Wennerberg, NCI DREAM Community, J. J. Collins, D. Gallahan, D. Singer, J. Saez-Rodriguez, S. Kaski, J.W. Gray, G. Stolovitzky, Nat. Biotechnol201432(12), 1202–1212. [DOI] [PMC free article] [PubMed]
- 27.Nguyen HP, Koutsoukas A, Fauzi MF, Drakakis G, Maciejewski M, Glen RC, Bender A. Chem. Biol. Drug Des. 2013;82:252–266. doi: 10.1111/cbdd.12155. [DOI] [PubMed] [Google Scholar]
- 28.Koutsoukas A, Simms B, Kirchmair J, Bond PJ, Whitmore AV, Simmer S, Young MP, Jenkins JL, Glick M, Glen RC, Bender A. J. Proteomics. 2011;74:2554–2574. doi: 10.1016/j.jprot.2011.05.011. [DOI] [PubMed] [Google Scholar]
- 29.Liggi S, Koutsoukas A, Motamedi YK, Glen RC, Bender A. J. Cheminf. 2013:5. 15. [Google Scholar]
- 30.Paricharak S, Klenka T, Augustin M, Patel UA, Bender A. J. Cheminf. 2013;5:49. doi: 10.1186/1758-2946-5-49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ravindranath AC, Perualila-Tan N, Kasim A, Drakakis G, Liggi S, Brewerton SC, Mason D, Bodkin MJ, Evans DA, Bhagwat A, Talloen W, Goehlmann HW, Consortium Qstar, Shkedy Z, Bender A. Mol. Biosyst. 2015:86–96. doi: 10.1039/c4mb00328d. 11. [DOI] [PubMed] [Google Scholar]
- 32.Drakakis G, Hendry AE, Hanson K, Brewerton SC, Bodkin MJ, Evans DA, Wheeler GN, Bender A. Med. Chem. Commun. 2014;5:386–396. [Google Scholar]
- 33.Keerthy HK, Mohan CD, Siveen KS, Fuchs JE, Rangappa S, Sundaram MS, Li F, Girish KS, Sethi G, Basappa B, Bender A, Rangappa KS. J. Biol. Chem. 2014;289:31879–31890. doi: 10.1074/jbc.M114.593855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Mohan CD, Bharathkumar H, Bulusu KC, Pandey V, Rangappa S, Fuchs JE, Shanmugam MK, Dai X, Li F, Deivasigamani A, Hui KM, Kumar AP, Lobie PE, Bender A, Basappa B, Sethi G, Rangappa KS. J. Biol. Chem. 2014;289:34296–34307. doi: 10.1074/jbc.M114.601104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bento AP, Gaulton A, Hersey A, Bellis LJ, Chambers J, Davies M, Kruger FA, Light Y, Mak L, McGlinchey S, Nowotka M, Papadatos G, Santos R, Overington JP. Nucleic Acids Res. 2014;42:D1083–D1090. doi: 10.1093/nar/gkt1031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Dossetter AG, Ecker G, Laverty H, Overington J. Future Med. Chem. 2014;6:857–864. doi: 10.4155/fmc.14.45. [DOI] [PubMed] [Google Scholar]
- 37.Bharathkumar H, Paricharak S, Dinesh KR, Siveen KS, Fuchs JE, Rangappa S, Mohan CD, Mohandas N, Kumar AP, Sethi G, Bender A, Basappa B, Rangappa KS. RSC Adv. 2014;4:45143–45146. [Google Scholar]
- 38.Hopf TA, Colwell LJ, Sheridan R, Rost B, Sander C, Marks DS. Cell. 2012;149:1607–1621. doi: 10.1016/j.cell.2012.04.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Dwyer RS, Ricci DP, Colwell LJ, Silhavy TJ, Wilgreen NS. Genetics. 2013;195:443–455. doi: 10.1534/genetics.113.155861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Glen RC. J. Comput. Aided Mol. Des. 2012;26:47–49. doi: 10.1007/s10822-011-9501-6. [DOI] [PubMed] [Google Scholar]
- 41.Murray-Rust P, Rzepa HS, Stewart JJ, Zhang Y. J. Mol. Model. 2005;11:532–541. doi: 10.1007/s00894-005-0278-1. [DOI] [PubMed] [Google Scholar]
- 42.Lowe DM, Corbett PT, Murray-Rust P, Glen RC. J. Chem. Inf. Model. 2011;51:739–753. doi: 10.1021/ci100384d. [DOI] [PubMed] [Google Scholar]
- 43.Jessop DM, Adams SE, Willighagen EL, Hawizy L, Murray-Rust P. J. Cheminf. 2011;3:41. doi: 10.1186/1758-2946-3-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Grethe G, Goodman JM, Allen CH. J. Cheminform. 2013;5:45. doi: 10.1186/1758-2946-5-45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Murray-Rust P, Townsend JA, Adams SE, Phandungsukanan W, Thomas J. J. Cheminf. 2011;3:43. doi: 10.1186/1758-2946-3-43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Murray-Rust P, Adams SE, Downing J, Townsend JA, Zhang Y. J. Cheminf. 2011;3:42. doi: 10.1186/1758-2946-3-42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Murray-Rust P. Nature. 2008;451:648–651. doi: 10.1038/451648a. [DOI] [PubMed] [Google Scholar]
- 48.O’Boyle NM, Guha R, Willighagen EL, Adams SE, Alvarsson J, Bradley JC, Filippov IV, Hanson RM, Hanwell MD, Hutchison GR, James CA, Jeliazkova N, Lang AS, Langner KM, Lonie DC, Lowe DM, Pansanel J, Pavlov D, Spjuth O, Steinbeck C, Tenderholt AL, Theisen KJ, Murray-Rust P. J. Cheminf. 2011;3:37. doi: 10.1186/1758-2946-3-37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Orchard S, Al-Lazikani B, Bryant S, Clark D, Calder E, Dix I, Engkvist O, Forster M, Gaulton A, Gilson M, Glen R, Grigorov M, Hammond-Kosack K, Harland L, Hopkins A, Larminie C, Lynch N, Mann RK, Murray-Rust P, Lo Piparo E, Southan C, Steinbeck C, Wishart D, Hermjakob H, Overington J, Thornton J. Nat. Rev. Drug Discov. 2011;10:661–669. doi: 10.1038/nrd3503. [DOI] [PubMed] [Google Scholar]
- 50.Hawizy L, Jessop DM, Adams N, Murray-Rust P. J. Cheminf. 2011;3:17. doi: 10.1186/1758-2946-3-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.van de Waterbeemd H, Gifford E. Nat. Rev. Drug Discov. 2003;2:192–204. doi: 10.1038/nrd1032. [DOI] [PubMed] [Google Scholar]
- 52.Kirchmair J, Howlett A, Peironcely JE, Murrell DS, Williamson MJ, Adams SE, Hankemeier T, van Buren L, Duchateau G, Klaffke W, Glen RC. J. Chem. Inf. Model. 2013;53:354–367. doi: 10.1021/ci300487z. [DOI] [PubMed] [Google Scholar]
- 53.Boyer S, Arnby CHasselgren, Carlsson L, Smith J, Stein V, Glen RC. J. Chem. Inf. Model. 2007;47:583–590. doi: 10.1021/ci600376q. [DOI] [PubMed] [Google Scholar]
- 54.Carlsson L, Spjuth O, Adams S, Glen RC, Boyer S. BMC Bioinformatics. 2010;11:362. doi: 10.1186/1471-2105-11-362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Wang Z, Chen Y, Liang H, Bender A, Glen RC, Yan A. J. Chem. Inf. Model. 2011;51:1447–1456. doi: 10.1021/ci2001583. [DOI] [PubMed] [Google Scholar]
- 56.Comer J, Judge S, Matthews D, Towes L, Falcone B, Goodman J, Dearden J. ADMET & DMPK. 2014;2:18–32. [Google Scholar]
- 57.Kirchmair J, Williamson MJ, Afzal AM, Tyzack JD, Choy AP, Howlett A, Rydberg P, Glen RC. J. Chem. Inf. Model. 2013;53:2896–2907. doi: 10.1021/ci400503s. [DOI] [PubMed] [Google Scholar]
- 58.Tyzack JD, Williamson MJ, Torella R, Glen RC. J. Chem. Inf. Model. 2013;53:1294–1305. doi: 10.1021/ci400058s. [DOI] [PubMed] [Google Scholar]
- 59.Tyzack JD, Mussa HY, Williamson MJ, Kirchmair J, Glen RC. J. Cheminf. 2014;6:29. doi: 10.1186/1758-2946-6-29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Llinas A, Glen RC, Goodman JM. J. Chem. Inf. Model. 2008;48:1289–1303. doi: 10.1021/ci800058v. [DOI] [PubMed] [Google Scholar]
- 61.Hopfinger AJ, Esposito EX, Llinas A, Glen RC, Goodman JM. J. Chem. Inf. Model. 2009;49:1–5. doi: 10.1021/ci800436c. [DOI] [PubMed] [Google Scholar]
- 62.Palmer DS, O’Boyle NM, Glen RC, Mitchell JB. J. Chem. Inf. Model. 2007;47:150–158. doi: 10.1021/ci060164k. [DOI] [PubMed] [Google Scholar]
- 63.Macaluso NJ, Pitkin SL, Maguire JJ, Davenport AP, Glen RC. ChemMedChem. 2011;6:1017–1023. doi: 10.1002/cmdc.201100069. [DOI] [PubMed] [Google Scholar]
- 64.Moloney GP, Garavelas A, Martin GR, Maxwell M, Glen RC. Eur. J. Med. Chem. 2004;39:305–321. doi: 10.1016/j.ejmech.2003.12.008. [DOI] [PubMed] [Google Scholar]
- 65.Cross BCS, Bond PJ, Sadowski PG, Jha BK, Zak J, Goodman JM, Silverman RH, Neubert TA, Baxendale IR, Ron D, Harding HP. Proc. Natl. Acad. Sci. 2012;109:5559–5560. doi: 10.1073/pnas.1115623109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Cross BCS, Bond PJ, Sadowski PG, Jha BK, Zak J, Goodman JM, Silverman RH, Neubert TA, Baxendale IR, Ron D, Harding HP. Proc. Natl. Acad. Sci. 2012;109:E869–E878. doi: 10.1073/pnas.1115623109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Tomasio SM, Harding HP, Ron D, Cross BCS, Bond PJ. Mol. BioSyst. 2013;9:2408–2416. doi: 10.1039/c3mb70234k. [DOI] [PubMed] [Google Scholar]
- 68.Fergus S, Bender A, Spring DR. Curr. Opin. Chem. Biol. 2005;9:304–309. doi: 10.1016/j.cbpa.2005.03.004. [DOI] [PubMed] [Google Scholar]
- 69.Isidro-Llobet A, Murillo T, Bello P, Cillibrizzi A, Hodgkinson JT, Galloway WR, Bender A, Welch M, Spring DR. Proc. Natl. Acad. Sci. 2011;108:6793–6798. doi: 10.1073/pnas.1015267108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Koutsoukas A, Paricharak S, Galloway WR, Spring DR, IJzerman AP, Glen RC, Marcus D, Bender A. J. Chem. Inf. Model. 2014;54:230–242. doi: 10.1021/ci400469u. [DOI] [PubMed] [Google Scholar]
- 71.Khalid S, Bond PJ. Methods Mol. Biol. 2012;924:635–657. doi: 10.1007/978-1-62703-017-5_25. [DOI] [PubMed] [Google Scholar]
- 72.Paramo T, East A, Garzon D, Ulmschneider MB, Bond PJ. J. Chem. Theory Comput. 2014;10:2151–2164. doi: 10.1021/ct401098b. [DOI] [PubMed] [Google Scholar]
- 73.Currie RH, Goodman JM. Angew. Chem. Int. Ed. Engl. 2012;51:4695–4697. doi: 10.1002/anie.201109080. [DOI] [PubMed] [Google Scholar]
- 74.Goodman JM, Socorro IM. J. Comput. Aided Mol. Des. 2007;21:351–357. doi: 10.1007/s10822-007-9120-4. [DOI] [PubMed] [Google Scholar]
- 75.Glen RC. Chem. Commun. 2002;23:2745–2747. doi: 10.1039/b207793k. [DOI] [PubMed] [Google Scholar]
- 76.Wang Y, Suzek T, Zhang J, Wang J, He S, Cheng T, Shoemaker BA, Gindulyte A, Bryant SH. Nucleic Acids Res. 2014;42:D1075–D1082. doi: 10.1093/nar/gkt978. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Law V, Knox C, Djoumbou Y, Jewison T, Guo AC, Liu Y, Maciejewski A, Arndt D, Wilson M, Neveu V, Tang A, Gabriel G, Ly C, Adamjee S, Dame ZT, Han B, Zhou Y, Wishart DS. Nucleic Acids Res. 2014;42:D478–D484. doi: 10.1093/nar/gkt1067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.UniProt Consortium. Nucleic Acids Res. 2014;42:D191–D198. doi: 10.1093/nar/gkt1140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Rose PW, Prlic A, Bi C, Bluhm WF, Christie CH, Dutta S, Green RK, Goodsell DS, Westbrook JD, Woo J, Young J, Zardecki C, Berman HM, Bourne PE, Burley SK. Nucleic Acids Res. 2015;43:D345–D356. doi: 10.1093/nar/gku1214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, Heger A, Hetherington K, Holm L, Mistry J, Sonnhammer EL, Tate J, Punta M. Nucleic Acids Res. 2014;42:D222–D230. doi: 10.1093/nar/gkt1223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Kanehisa M, Goto S, Sato Y, Kawashima M, Furumichi M, Tanabe M. Nucleic Acids Res. 2014;42:D199–D205. doi: 10.1093/nar/gkt1076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Duan Q, Flynn C, Niepel M, Hafner M, Muhlich JL, Fernandez NF, Rouillard AD, Tan CM, Chen EY, Golub TR, Sorger PK, Subramanaian A, Ma’ayan A. Nucleic Acids Res. 2014;42:W449–W460. doi: 10.1093/nar/gku476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Fernandez-Suarez XM, Rigden DJ, Galperin MY. Nucleic Acids Res. 2014;42:D1–D6. doi: 10.1093/nar/gkt1282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Salomon-Ferrer R, Goetz AW, Poole D, Le Grand S, Walker RC. J. Chem. Theory Comput. 2013;9:3878–3888. doi: 10.1021/ct400314y. [DOI] [PubMed] [Google Scholar]
- 85.Pierce LCT, Salomon-Ferrer R, de Oliveira CAF, McCammon JA, Walker RC. J. Chem. Theory Comput. 2012;8:2997–3002. doi: 10.1021/ct300284c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Shaw DE, Maragakis P, Lindorff-Larsen K, Piana S, Dror RO, Eastwood MP, Bank JA, Jumper JM, Salmon JK, Shan Y, Wriggers W. Science. 2010;330:341–346. doi: 10.1126/science.1187409. [DOI] [PubMed] [Google Scholar]
- 87.Bisignano P, Doerr S, Harvey MJ, Favia AD, Cavalli A, De Fabritiis G. J. Chem. Inf. Model. 2014;54:362–366. doi: 10.1021/ci4006063. [DOI] [PubMed] [Google Scholar]
- 88.Buch I, Giorgino T, De Fabritiis G. Proc. Natl. Acad. Sci. USA. 2011;108:10184–10189. doi: 10.1073/pnas.1103547108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Fuchs JE, von Grafenstein S, Huber RG, Wallnoefer HG, Liedl KR. Proteins. 2014;82:546–555. doi: 10.1002/prot.24417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Pan AC, Borhani DW, Dror RO, Shaw DE. Drug Discov. Today. 2013;18:667–673. doi: 10.1016/j.drudis.2013.02.007. [DOI] [PubMed] [Google Scholar]
- 91.Dror RO, Green HF, Valant C, Borhani DW, Valcourt JR, Pan AC, Arlow DH, Canals M, Lane JR, Rahmani R, Baell JB, Sexton PM, Christopoulos A, Shaw DE. Nature. 2013;503:295–299. doi: 10.1038/nature12595. [DOI] [PubMed] [Google Scholar]
- 92.Fuchs JE, von Grafenstein S, Huber RG, Margreiter MA, Spitzer GM, Wallnoefer HG, Liedl KR. PLoS Comput. Biol. 2013;9:e1003007. doi: 10.1371/journal.pcbi.1003007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Haug K, Salek RM, Conesa P, Hastings J, de Matos P, Rijnbeek M, Mahendraker T, Williams M, Neumann S, Rocca-Serra P, Maguire E, Gonzalez-Beltran A, Sansone SA, Griffin JL, Steinbeck C. Nucleic Acids Res. 2013;41:781–796. doi: 10.1093/nar/gks1004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Foster JM, Moreno P, Fabregat A, Hermjakob H, Steinbeck C, Apweiler R, Wakelam MJ, Vizcaino JA. PLoS One. 2013;8:e61951. doi: 10.1371/journal.pone.0061951. [DOI] [PMC free article] [PubMed] [Google Scholar]
