Skip to main content
Bioinformatics logoLink to Bioinformatics
. 2022 Jul 27;38(18):4454–4456. doi: 10.1093/bioinformatics/btac507

TargetMine 2022: a new vision into drug target analysis

Yi-An Chen 1,✉,b, Rodolfo S Allendes Osorio 2,b, Kenji Mizuguchi 3,4
Editor: Peter Robinson
PMCID: PMC9477527  PMID: 35894632

Abstract

Summary

We introduce the newest version of TargetMine, which includes the addition of new visualization options; integration of previously disaggregated functionality; and the migration of the front-end to the newly available Bluegenes service.

Availability and Implementation

TargeteMine is accessible online at https://targetmine.mizuguchilab.org/bluegenes. Users do not need to register to use the software. Source code for the different components listed in the article is available from TargetMine’s organizational account at http://github.com/targetmine.

Supplementary information

Supplementary data are available at Bioinformatics online.

1 Introduction

The last decade has seen a steady increase in the number of studies related to multi-omics analysis (Krassowski et al., 2020; Tarazona et al., 2021). References for ‘Multi-omics Analysis’ reviews listed on PubMed increases from 4 in 2012 to 145 in 2021 (https://pubmed.ncbi.nlm.nih.gov. Last accessed, 1 February 2022). Multi-omics analysis can not only be used to improve the classification of biological data but also for the prediction of variables (such as clinical outcomes), and it might even have the potential to elucidate regulatory mechanisms that include several molecular layers (Tarazona et al., 2021).

A main challenge in multi-omics analysis lies in data integration (Canzler et al., 2020; Krassowski et al., 2020; Tarazona et al., 2021). Approaches on data integration include early integration—data are concatenated into a single matrix; intermediate integration—jointly analyze different omics layers together; and late integration—integrate the analysis results (Adossa et al., 2021). This categorization has been extended by Picard et al. (2021) to also consider mixed and hierarchical integration strategies. At the same time, the development of platforms for the storage of multi-omics data also remains a strong research focus, with (Eloe-Fadrosh et al., 2021; Tang et al., 2022; Zhou et al., 2022) being only a few examples across different domains, all of them reported in this year’s Nucleic Acid Research’s special issue on Databases (Rigden and Fernández, 2022).

In this context, the TargetMine Data Warehouse has evolved into an integrative data analysis platform. TargetMine incorporates various types of omics data, sourced from a variety of data sources and models to provide a deep coverage of the biological data space, with a focus on target prioritization and broad-based biological knowledge discovery (Chen et al., 2011, 2019). Consolidated as a useful resource for the drug discovery scientific community (as suggested by the number of citations of the original paper as recorded by PubMed), through the integration of new data from different, heterogeneous sources, and by providing new widgets for its analysis, TargetMine continues to strive in becoming an integral solution to multi-omics data analysis, especially in terms of data storage and biological interpretation (Tarazona et al., 2021).

2 New in TargetMine

2.1 Integration and new visualization tools

Up until now, TargetMine also included an Auxiliary Toolkit (Chen et al., 2016), accessible through a separated user interface. This has now been integrated into a single-user experience. The display of a Composite Network Graph, added to report pages of gene lists, allows interactive visualization of gene-to-gene interactions among the list members, together with their relation to other genes, microRNA, chemical compounds and/or transcription factors found within TargetMine. Similarly, the Enrichment Display Graph, also included in the gene list report page, shows through bar graphs and heatmaps, the proportion of genes with a given annotation compared with the annotation of the whole genome, or how the individual genes in the list are matched to the corresponding enriched elements, respectively.

Completely new display widgets have also been added to TargetMine. The gene report page now includes a Gene Expression Graph (see Fig. 1); and the report page for chemical compounds has now a Bio-activity Graph. As suggested by their names, both these graphs allow to dynamically inspect either the expression or bio-activity levels of individual genes or chemical compounds, respectively. Whilst the first includes controls to handle the display at different levels of detail where the gene expression is measured; the second provides controls to clearly identify different assays.

Fig. 1.

Fig. 1.

Sample image of TargetMine’s new interface for a list of genes. Elements have been slightly adjusted for better display here. MicroRNA associations to original genes are shown in graph format. New MGeND enrichment for the list of genes is also shown

Details and user guides for all the aforementioned visualization tools are provided as Supplementary Material to this article.

2.2 Bluegenes migration

TargetMine is based on InterMine (Kalderimis et al., 2014; Smith et al., 2012), a data warehousing system that provides easy query and analysis of various heterogeneous data sources. Paired to InterMine, a new front-end named Bluegenes (https://github.com/intermine/bluegenes), meant to replace the old Java Server Pages (JSP)-based interface has been released.

As several customly implemented elements of TargetMine were implemented as components of the JSP-based interface, they all needed to be refactored into new Bluegenes tools. Figure 1 shows an example of the new interface used for TargetMine, in particular, the one used to report information of a list of genes. Users familiar with the application will notice the new, modern feel and look achieved with the new front-end.

One major advantage of this approach is that each element can be implemented as its own project, and thus can be individually maintained (i.e. is kept on its own GitHub repository). An extensive list of all the migrated tools and their corresponding repositories is provided as Supplementary Material.

2.3 New data sources

In order to continuously improve the coverage of the biological data space, some new data types and sources were added. These new data sources include protein binding pockets from PoSSuM (Ito et al., 2015), genomic variant with clinical annotation from MGeND (see Fig. 1) (Kamada et al., 2019), clinical trial data from WHO (https://trialsearch.who.int/. Last accessed, 11 March 2022) and also genome annotations from NCBI (https://www.ncbi.nlm.nih.gov/genome. Last accessed, 11 March 2022). New data are accommodated by extending the data model currently used by TargetMine, which can be generally described as an Object Oriented definition, transpilled into a Relational database for storage purposes. More details on how this is implemented can be found in (Chen et al., 2011). Applications of the new additions will be reported elsewhere.

3 Discussion

We believe TargetMine to be a highly valued data warehouse within the drug discovery research community, as proved by the continuous access that it has on a daily basis, from countries across five continents. As a response to the support shown by the community, we constantly strive to improve the service, with monthly data updates and constant software updates being a proof of our commitment toward this end.

Here, we introduced some of the major updates made to TargetMine over the past couple of years, namely, its migration to a new front-end and the development of new visualization widgets, customly targeted to specific data elements within the repository.

Supplementary Material

btac507_Supplementary_Data

Acknowledgements

The authors would like to thank Dr Lokesh P. Thripathi, at the RIKEN Center for Integrative Medical Sciences, and members of the ArCHER for their continuous support in the development of TargetMine.

Funding

This study was supported by the Ministry of Health, Labour and Welfare and the Cabinet Office of Japan Government for the Public/Private R&D Investment Strategic Expansion PrograM—PRISM (grant number JPMH20AC5001 and grant number 19AC5001).

Conflict of Interest: none declared.

Contributor Information

Yi-An Chen, Artificial Intelligence Center for Health and Biomedical Research (ArCHER), National Institutes of Biomedical Innovation, Health and Nutrition, Osaka 567-0085, Japan.

Rodolfo S Allendes Osorio, Artificial Intelligence Center for Health and Biomedical Research (ArCHER), National Institutes of Biomedical Innovation, Health and Nutrition, Osaka 567-0085, Japan.

Kenji Mizuguchi, Artificial Intelligence Center for Health and Biomedical Research (ArCHER), National Institutes of Biomedical Innovation, Health and Nutrition, Osaka 567-0085, Japan; Institute for Protein Research, Osaka University, Osaka 565-0871, Japan.

Data Availability

The source code for TargetMine is available under an MIT License at http://github.com/targetmine/targetmine-gradle. Source code for different the components listed in the manuscript are available from the following repositories under TargetMine’s organizational GitHub account: bluegenes-tool-compositenetworkd3-graph, bluegenes-list-enrichment-visualizer, bluegenes-tool-gene expression-graph and bluegenes-tool-bioactivity-graph.

References

  1. Adossa N.  et al. (2021) Computational strategies for single-cell multi-omics integration. Comput. Struct. Biotechnol. J., 19, 2588–2596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Canzler S.  et al. (2020) Prospects and challenges of multi-omics data integration in toxicology. Arch. Toxicol., 94, 371–388. [DOI] [PubMed] [Google Scholar]
  3. Chen Y.-A.  et al. (2011) TargetMine, an integrated data warehouse for candidate gene prioritisation and target discovery. PLoS One, 6, e17844. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Chen Y.-A.  et al. (2016) An integrative data analysis platform for gene set analysis and knowledge discovery in a data warehouse framework. Database, 2016, baw009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Chen Y.-A.  et al. (2019) The TargetMine data warehouse: enhancement and updates. Front. Genet., 10, 934. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Eloe-Fadrosh E.A.  et al. (2021) The national microbiome data collaborative data portal: an integrated multi-omics microbiome data resource. Nucleic Acids Res., 50, D828–D836. [Google Scholar]
  7. Ito J-i.  et al. (2015) PoSSuM v.2.0: data update and a new function for investigating ligand analogs and target proteins of small-molecule drugs. Nucleic Acids Res., 43, D392–D398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Kalderimis A.  et al. (2014) InterMine: extensive web services for modern biology. Nucleic Acids Res., 42, W468–W472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Kamada M.  et al. (2019) Mgend: an integrated database for Japanese clinical and genomic information. Hum. Genome Var., 6, 53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Krassowski M.  et al. (2020) State of the field in multi-omics research: from computational needs to data mining and sharing. Front. Genet., 11, 610798. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Picard M.  et al. (2021) Integration strategies of multi-omics data for machine learning analysis. Comput. Struct. Biotechnol. J., 19, 3735–3746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Rigden D.J., Fernández X.M. (2022) The 2022 nucleic acids research database issue and the online molecular biology database collection. Nucleic Acids Res., 50, D1–D10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Smith R.N.  et al. (2012) InterMine: a flexible data warehouse system for the integration and analysis of heterogeneous biological data. Bioinformatics, 28, 3163–3165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Tang Z.  et al. (2022) MVIP: multi-omics portal of viral infection. Nucleic Acids Res., 50, D817–D827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Tarazona S.  et al. (2021) Undisclosed, unmet and neglected challenges in multi-omics studies. Nat. Comput. Sci., 1, 395–402. [DOI] [PubMed] [Google Scholar]
  16. Zhou P.  et al. (2022) CyanoOmicsDB: an integrated omics database for functional genomic analysis of cyanobacteria. Nucleic Acids Res., 50, D758–D764. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

btac507_Supplementary_Data

Data Availability Statement

The source code for TargetMine is available under an MIT License at http://github.com/targetmine/targetmine-gradle. Source code for different the components listed in the manuscript are available from the following repositories under TargetMine’s organizational GitHub account: bluegenes-tool-compositenetworkd3-graph, bluegenes-list-enrichment-visualizer, bluegenes-tool-gene expression-graph and bluegenes-tool-bioactivity-graph.


Articles from Bioinformatics are provided here courtesy of Oxford University Press

RESOURCES