Skip to main content
Plant and Cell Physiology logoLink to Plant and Cell Physiology
. 2021 Dec 17;63(3):433–440. doi: 10.1093/pcp/pcab173

Development of RIKEN Plant Metabolome MetaDatabase

Atsushi Fukushima 1,*,7, Mikiko Takahashi 2, Hideki Nagasaki 3,8, Yusuke Aono 4, Makoto Kobayashi 5, Miyako Kusano 6,7,8, Kazuki Saito 9, Norio Kobayashi 10,11, Masanori Arita 12,13
PMCID: PMC8917833  PMID: 34918130

Abstract

The advancement of metabolomics in terms of techniques for measuring small molecules has enabled the rapid detection and quantification of numerous cellular metabolites. Metabolomic data provide new opportunities to gain a deeper understanding of plant metabolism that can improve the health of both plants and humans that consume them. Although major public repositories for general metabolomic data have been established, the community still has shortcomings related to data sharing, especially in terms of data reanalysis, reusability and reproducibility. To address these issues, we developed the RIKEN Plant Metabolome MetaDatabase (RIKEN PMM, http://metabobank.riken.jp/pmm/db/plantMetabolomics), which stores mass spectrometry-based (e.g. gas chromatography–MS-based) metabolite profiling data of plants together with their detailed, structured experimental metadata, including sampling and experimental procedures. Our metadata are described as Linked Open Data based on the Resource Description Framework using standardized and controlled vocabularies, such as the Metabolomics Standards Initiative Ontology, which are to be integrated with various life and biomedical science data using the World Wide Web. RIKEN PMM implements intuitive and interactive operations for plant metabolome data, including raw data (netCDF format), mass spectra (NIST MSP format) and metabolite annotations. The feature is suitable not only for biologists who are interested in metabolomic phenotypes, but also for researchers who would like to investigate life science in general through plant metabolomic approaches.

Keywords:  Data sharing,  Metabolite profiling,  Metabolomics,  Plant metabolism,  Semantic web

Introduction

The metabolome is the whole set of low-molecular-weight metabolites (<1,500 Da) within a cell (Oliver et al. 1998). Recent analytical platforms, such as mass spectrometry (MS) and nuclear magnetic resonance spectroscopy, have enabled simultaneous measurement of the steady-state levels of metabolites. The growing methodologies and fields are called metabolomics. The literature suggests that the number of metabolites produced in the plant kingdom exceeds 100,000 (Fiehn 2002, Dixon and Strack 2003, Afendi et al. 2012). For example, it is estimated that the model plant Arabidopsis thaliana produces approximately 5,000 metabolites (Fernie et al. 2004, Saito and Matsuda 2010). Given that plants and crops serve as rich resources for food and drug development (Putri et al. 2013, Rai and Saito 2016), metabolomic data sharing and standardization are extremely important to reuse and reanalyze such data and to increase transparency of the study process.

A number of databases containing information related to metabolome analysis have been established in recent decades (Horai et al. 2010, Fukushima and Kusano 2013, Gurevich et al. 2018, Wishart et al. 2018, 2020). Since the establishment of the Metabolomics Standards Initiative (MSI), substantial efforts have been made in the field to overcome challenges related to the appropriate reporting of metabolomics studies (Fiehn et al. 2007a). As general-purpose metabolomics data repositories, two major databases have been established: MetaboLights (Haug et al. 2020) and Metabolomics Workbench (Sud et al. 2016). MetabolomeXchange (http://www.metabolomexchange.org/) indicates that, as of 1 July 2021, more than 800 metabolomic datasets are publicly available in MetaboLights and more than 1,400 in Metabolomics Workbench. Nevertheless, Spicer et al. (2017a) revealed that, based on an analysis of publicly available metabolomics (meta-)data, the majority of the shared metadata have substantial limitations.

Semantic web technologies facilitate the provision of Findable, Accessible, Interoperable and Reusable data (Wilkinson et al. 2016) on the World Wide Web (WWW). Of these technologies, the keys are Resource Description Framework (RDF) and SPARQL Protocol and RDF Query Language (SPARQL), which are global standards formulated by the WWW Consortium. These are powerful tools for realizing low-cost metadata management and the integration of distributed global data including omics data. Examples include KEGG/GenomeNet LinkDB RDF (Kanehisa et al. 2021) and UniProt RDF in genomics (UniProt 2019), Expression Atlas (Papatheodorou et al. 2020) and RefEx (Ono et al. 2017) in transcriptomics, jPOST in proteomics (Watanabe et al. 2021b), glycoPOST in glycomics (Yamada et al. 2020, Watanabe et al. 2021a), PubChem RDF in chemical information (Fu et al. 2015), SPARQLing biochemical reaction data (Rhea) in biochemical information (Lombardot et al. 2019), and RIKEN MetaDatabase (http://metadb.riken.jp) for a wide range of healthcare and life sciences (Kobayashi et al. 2018).

Although major public repositories for general metabolomic data have been launched, the metabolomics community still needs to address the shortcomings of data sharing, especially in terms of reanalysis, reusability and reproducibility of data. In this study, we have developed the RIKEN Plant Metabolome MetaDatabase (RIKEN PMM, http://metabobank.riken.jp/pmm/db/plantMetabolomics), which mainly provides MS-based metabolite profiling data of plants together with their detailed and structured metadata as Linked Open Data (LOD) based on the semantic web. To introduce our reanalysis approach with RIKEN PMM, we have shared our gas chromatography (GC)–MS data reanalysis workflow.

Results

Database overview and content

The RIKEN PMM is implemented on top of the RIKEN MetaDatabase, which provides a biologist-friendly graphical user interface, including tabular and card forms that are familiar to biologists to show classes and instances of graph-based RDF datasets simultaneously as well as SPARQL endpoint functions as the application programming interface (Fig. 1). Briefly, the key features are as follows:

Fig. 1.

Fig. 1

Specialized views in RIKEN PMM. (A) Database view shows the data statistics, including the number of classes like Project, Experiment and Dataset, and other statistical data. (B) SPARQL search view allows us to input a SPARQL query and returns query results in various formats such as HTML and a spreadsheet style. The example query can retrieve all of the projects related to plant ontology ‘shoot system’ (PO:0009006).

  1. Unlike a typical relational database, it is easy to extend and revise data schema due to RDF.

  2. RIKEN PMM promotes the distribution of plant metabolome data as LOD and efficient data integration across scattered databases.

In the current database, we archived public plant metabolome datasets containing a total of 151 projects (Supplementary Table S1), encompassing >9K biological samples and >8.6K raw data files (Fig. 1). It also spans over 40 different plant species including Arabidopsis, rice, tomato, soybean and lettuce (Supplementary Material 1). We compared the share of samples from Plantae in major metabolome databases. The total number of Plantae samples in the RIKEN PMM was 8,809 (by 13 July 2021). However, the shares of species included 42% Brassicales, 26% Poales and 15% Solanales. This suggests that there is no major bias. In the case of MetaboLights, the total number of samples was 49,496 (as of 13 July 2021), but 65% of them were from studies on the Lolium family (Subbaraj et al. 2019). The total number of Plantae samples was 28 (as of 13 July 2021) in Metabolomics Workbench. Thus, compared with other databases, our datasets were well balanced in terms of samples from specific plants.

We also opened almost all of the datasets in open standard file formats, including netCDF and Analysis Base Framework (ABF). The latter ABF file was converted using the freely available AbfConverter (http://www.reifycs.com/AbfConverter/index.html). Users can access all raw metabolite profile data files, such as those in the netCDF format, at least for GC–MS-based studies from RIKEN but with the exception of metadata from Kazusa DNA Research Institute (https://www.kazusa.or.jp/). As an example, you can examine such files at the following URL: http://metabobank.riken.jp/pmm/db/plantMetabolomics/http://metadb.riken.jp/db/plantMetabolomics/0.1/RawDataSet/RPMM0026_root_02_Polar_1.

A raw data file in netCDF format can be accessed at http://metabobank.riken.jp/data/RPMM0026/PolarMetabolites/RawDataset/root_02_Polar_1.cdf. This file is associated with the accession number RPMM0026 (Ichihashi et al. 2018) and corresponds to a root sample.

To facilitate the sharing of our experimental design (called phenodata), metabolite annotation and data matrix processed in each project, we also provide them as simple text and/or in CSV file format. Among 151 datasets, 90 datasets contain plant samples from Kazusa DNA Research Institute, linking the raw data with MassBase (Ara et al. 2021). Users can distinguish these sample names with the prefixes ‘RPMM’ and ‘MN’ (Metabolonote) (Ara et al. 2015). Excluding Kazusa’s dataset (MN#####), only three datasets from RIKEN (RPMM####), i.e. RPMM0001, RPMM0002 and RPMM0006, contain direct URL links to MetaboLights. This is due to sharing the three datasets and their metadata in RIKEN PMM, MetaboLights and Metabolonote.

Implementation and design

We have developed a novel ontology, called the Plant Metabolomics Ontology, to describe our metabolome data (https://github.com/afukushima/rpmm-metadata). This ontology takes over the DNA Data Bank of Japan (DDBJ) data structure (Ogasawara et al. 2020), including the BioSample database, and is extended with additional classes, such as experimental condition and data analysis. These additional classes describe the concepts of MS-based metabolomics (e.g. GC–MS) and statistical data analysis. The metabolomic part of the Plant Metabolomics Ontology is designed to realize metadata interoperability according to the recommendation of the MSI (http://www.metabolomics-msi.org/) (Fiehn et al. 2007a, 2007b) by introducing ontology terms (classes) defined in the Metabolomics Standards Initiative Ontology (MSIO, https://github.com/ISA-tools/MSIO) for example.

Browsing and searching metabolomic data

Users can browse our projects in RIKEN PMM, providing a set of all of the public studies currently available. Almost all studies have been reported in peer-reviewed journals. Fig. 2 shows an example of the detailed information of genotype-dependent metabolome data in A. thaliana leaves (Kusano et al. 2007). The page contains the project’s title, its unique identifier, description of the project, information about contributors to the project, links to the corresponding publication, and links to other information and databases. A user can walk through an instance linked via a triplet to show further triplets with the selected instance. RIKEN PMM provides a user-friendly web interface to extract metabolome-related information such as ‘Sample’, ‘Experiment’, ‘Measurement’ and ‘Data Analysis’. Fig. 3 shows an example of the search and download functions in RIKEN PMM. For example, users can retrieve related projects with the univariate analysis method ‘LIMMA’ as a keyword query (Fig. 3A). Users can also download the raw data of each biosample in each project (Fig. 3B).

Fig. 2.

Fig. 2

An example of the detailed information of Arabidopsis genotype-dependent metabolome data. (A) represents the project’s title and its unique identifier, while (B) explains this project, including a description of the goals and aims of this study. Typically, the abstract from the associated publication was set, (C) contains creator, contact person, principal investigator and submitter names, (D) is the links to the corresponding literature (e.g. PubMed and DOI) and (E) the links to other information and databases. This can be used to view an instance and its triplets linked to other instances or reverse-linked from other instances. A user can walk through an instance linked via a triplet to show further triplets with the selected instance.

Fig. 3.

Fig. 3

An example of search and download functions in RIKEN PMM. (A) A user can retrieve related projects with the univariate analysis method ‘LIMMA’ as a keyword query. LIMMA represents ‘Linear Models for Microarray Data’ (Ritchie et al. 2015). (B) Users can also access the raw data of each bio sample (e.g. L_01_1.cdf) in RPMM0001 (Kusano et al. 2007).

Access, privacy policy and license

All of the data in RIKEN PMM are available under a CC-BY-4.0 license as open data, which grants free access and reuse of our public data. We also developed the R package ‘rRPMM’ (https://github.com/afukushima/rRPMM) to download and parse our metadata from the RIKEN PMM (Supplementary Materials 2 and 3). An accessor for the RIKEN PMM converts the downloaded metadata to an R list. For example, users can visualize the species distribution in our database (Supplementary Material 1).

Sharing our GC–MS-data-reanalysis workflow

To introduce our reanalysis approach, we have shared our GC–MS-data-reanalysis workflow with our ‘rRPMM’ (https://github.com/afukushima/rRPMM) and ‘eRah’ packages (Domingo-Almenara et al. 2016) (Supplementary Material 4). This R-based workflow consists of a process to (i) obtain all raw data (e.g. netCDF files) from a Project, (ii) preprocess the data (e.g. peak deconvolution and alignment) and (iii) identify and annotate metabolites using a public mass-spectral library such as GMD (Kopka et al. 2005).

Discussion

We have developed a semantic web-based metadata database known as the RIKEN PMM that enables efficient sharing, spreading and retrieving of plant metabolome data using native RDF technologies. We have also developed easy-to-use spreadsheet software to rapidly generate RDF data, provided an accessor R package rRPMM and shared our data reanalysis workflow. There are domain-specific metabolome databases that adhere to the guidelines of the MSI (Fiehn et al. 2007a), including MetabolomeExpress (Carroll et al. 2010, 2015), Mery-B (Ferry-Dumazet et al. 2011) and MeKO (Fukushima et al. 2014). An important point is that the minimum reporting standards in plant science are complied with the most (Spicer et al. 2017a). RIKEN PMM also encourages minimum reporting guidelines for metabolome data analysis, including data transformation, scaling and normalization methods with existing and/or new ontologies (Considine and Salek 2019). As suggested by Spicer et al. (2017b) and in agreement with recently proposed reporting standards (Alseekh et al. 2021), it is time to discuss the revision of the MSI guidelines.

Despite the proposal of minimum reporting guidelines for data analysis in metabolomics for over a decade (Goodacre et al. 2007), the need to improve metadata completeness has been under discussion for some time (Considine and Salek 2019). It is also important to develop an efficient e-infrastructure and data analysis workflow for metabolomics, examples of which include XCMSonline (Forsberg et al. 2018), PhenoMeNal (Phenome and Metabolome aNalysis) (Peters et al. 2019), MetabolomeExpress (Carroll et al. 2010, 2015) and WebSpecmine (Cardoso et al. 2019). Other collaborative approaches also exist with Jupyter notebooks (e.g. see https://www.metabolomicsworkbench.org/data/AnalyzeUsingJupyterNotebooks.php) and R markdown files for metabolomics (Considine and Salek 2019, Mendez et al. 2019). Along with enhancing the metadata of metabolomic data analysis, it is preferable to recognize the importance and prospects of good experimental designs and to be aware of problems related to sample size.

The Plant Metabolomics Ontology is a graph representation of the meaning of terms or concepts in plant metabolomics, which we developed (see also our GitHub page at https://github.com/afukushima/rpmm-metadata). It was implemented as LOD, which aims to progress beyond the conventional Web and provide public accessibility to interlinked datasets from different resources. We, the plant metabolomics researchers at RIKEN, Kazusa DNA Research Institute, and DDBJ, have discussed the domain of plant metabolomics, aims of the ontology and community use case of the ontology. The ontology structure is defined as minimum sets of our data model. Integration of other datasets, including genomics and/or meta-genomics data, now represents an ongoing and worldwide challenge in the field of open life science. We are sure that our work will contribute to future collaborations consisting of different nodes such as MetaboLights, Metabolomics Workbench, MetaboBank in DDBJ and MetabolomeXchange (Fig. 4).

Fig. 4.

Fig. 4

An overview of metabolomics data resources with RIKEN PMM. An open-access general-purpose repository for metabolomics studies across the world including MetaboLights, Metabolomics Workbench and a newly developed database named MetaboBank in DDBJ. Other major metabolome data resources existing in Japan are MassBank, KNApSAcK and other resources related to computational mass spectrometry (Comp MS), such as MS-DIAL in RIKEN (http://prime.psc.riken.jp/compms/index.html). RIKEN PMM was established in 2018 with the aim of providing opportunities to work with and develop applications for plant metabolomics data based on FAIR principles. The upcoming MetaboBank resource will collect and assemble (meta)data from RIKEN and Kazusa DNA Research Institute (KDRI) as initial data. Acceptable (meta)data in MetaboBank include not only mass spectrometry-based data but also all platforms associated with metabolomics studies (e.g. NMR). Other resouces, such as mass spectra data in MassBank, compounds, species–metabolite relationships, and pathway information in KNApSAcK, will contribute to the future development of MetaboBank.

In summary, our framework can provide all metadata required for reanalyzing and reusing metabolomic data and will contribute to the development of another general-purpose metabolomics repository, called MetaboBank developed by DDBJ (Ogasawara et al. 2020). Our approach also enables the archiving of data integrated among different disciplines. The data-sharing aspect discussed in this paper will pave the way for discoverable, reproducible and reusable metabolomic data as well as the robust interpretation of plant metabolomic data.

Supplementary Material

pcab173_Supp

Acknowledgements

The authors thank all of the collaborators for their contributions to data openness, Hiroshi Tsugawa and Yutaka Yamada (RIKEN Center for Sustainable Resource Science) for computational assistance, and Enago (www.enago.jp) for the English edit.

Contributor Information

Atsushi Fukushima, Metabolome Informatics Research Team, RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan.

Mikiko Takahashi, Metabolome Informatics Research Team, RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan.

Hideki Nagasaki, Metabolome Informatics Research Team, RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan.

Yusuke Aono, Degree Programs in Life and Earth Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki 305-8572, Japan.

Makoto Kobayashi, Metabolome Informatics Research Team, RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan.

Miyako Kusano, Metabolome Informatics Research Team, RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan; Faculty of Life and Environmental Science, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki 305-8572, Japan; Tsukuba Plant Innovation Research Center, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki 305-8572, Japan.

Kazuki Saito, Metabolome Informatics Research Team, RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan.

Norio Kobayashi, Metabolome Informatics Research Team, RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan; Data Knowledge Organization Unit, RIKEN Information R&D and Strategy Headquarters, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan.

Masanori Arita, Metabolome Informatics Research Team, RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan; Bioinformation and DDBJ Center, National Institute of Genetics, Yata 1111, Mishima, Shizuoka 411-8540, Japan.

Supplementary Data

Supplementary data are available at PCP online.

Data Availability

The raw data underlying this article are available in RIKEN Plant Metabolome MetaDatabase (PMM) (http://metabobank.riken.jp/pmm/db/plantMetabolomics), DropMet (http://prime.psc.riken.jp/menta.cgi/prime/drop_index) and MassBase (http://webs2.kazusa.or.jp/massbase/).

Funding

National Bioscience Database Center (NBDC) of the Japan Science and Technology Agency (JST), Cabinet Office, Government of Japan; Cross ministerial Moonshot Agriculture, Forestry and Fisheries Research and Development Program, ‘Technologies for Smart Bio industry and Agriculture’ (funding agency Bio oriented Technology Research Advancement Institution), Cabinet Office, Government of Japan; Cross-ministerial Strategic Innovation Promotion Program (SIP), ‘Technologies for Smart Bio-industry and Agriculture’ (funding agency: Bio-oriented Technology Research Advancement Institution, NARO); Japan Society for the Promotion of Science (JSPS) KAKENHI (grant numbers 20K06043, 19K05711, 19H05652).

Disclosures

The authors have no conflicts of interest to declare.

References

  1. Afendi F.M., Okada T., Yamazaki M., Hirai-Morita A., Nakamura Y., Nakamura K., et al. (2012) KNApSAcK family databases: integrated metabolite-plant species databases for multifaceted plant research. Plant Cell Physiol. 53: e1. [DOI] [PubMed] [Google Scholar]
  2. Alseekh S., Aharoni A., Brotman Y., Contrepois K., D’Auria J., Ewald J., et al. (2021) Mass spectrometry-based metabolomics: a guide for annotation, quantification and best reporting practices. Nat. Methods 18: 747–756. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ara T., Enomoto M., Arita M., Ikeda C., Kera K., Yamada M., et al. (2015) Metabolonote: a wiki-based database for managing hierarchical metadata of metabolome analyses. Front. Bioeng. Biotechnol. 3: 38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Ara T., Sakurai N., Suzuki H., Aoki K., Saito K. and Shibata D. (2021) MassBase: a large-scaled depository of mass spectrometry datasets for metabolome analysis. Plant Biotechnol. (Tokyo) 38: 167–171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Cardoso S., Afonso T., Maraschin M. and Rocha M. (2019) WebSpecmine: a website for metabolomics data analysis and mining. Metabolites 9: 237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Carroll A.J., Badger M.R. and Harvey Millar A. (2010) The MetabolomeExpress Project: enabling web-based processing, analysis and transparent dissemination of GC/MS metabolomics datasets. BMC Bioinform. 11: 376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Carroll A.J., Zhang P., Whitehead L., Kaines S., Tcherkez G. and Badger M.R. (2015) PhenoMeter: a metabolome database search tool using statistical similarity matching of metabolic phenotypes for high-confidence detection of functional links. Front. Bioeng. Biotechnol. 3: 106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Considine E.C. and Salek R.M. (2019) A tool to encourage minimum reporting guideline uptake for data analysis in metabolomics. Metabolites 9: 43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Dixon R.A. and Strack D. (2003) Phytochemistry meets genome analysis, and beyond. Phytochemistry 62: 815–816. [DOI] [PubMed] [Google Scholar]
  10. Domingo-Almenara X., Brezmes J., Vinaixa M., Samino S., Ramirez N., Ramon-Krauel M., et al. (2016) eRah: a computational tool integrating spectral deconvolution and alignment with quantification and identification of metabolites in GC/MS-based metabolomics. Anal. Chem. 88: 9821–9829. [DOI] [PubMed] [Google Scholar]
  11. Fernie A.R., Trethewey R.N., Krotzky A.J. and Willmitzer L. (2004) Metabolite profiling: from diagnostics to systems biology. Nat. Rev. Mol. Cell Biol. 5: 763–769. [DOI] [PubMed] [Google Scholar]
  12. Ferry-Dumazet H., Gil L., Deborde C., Moing A., Bernillon S., Rolin D., et al. (2011) MeRy-B: a web knowledgebase for the storage, visualization, analysis and annotation of plant NMR metabolomic profiles. BMC Plant Biol. 11: 104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Fiehn O. (2002) Metabolomics—the link between genotypes and phenotypes. Plant Mol. Biol. 48: 155–171. [PubMed] [Google Scholar]
  14. Fiehn O., Robertson D., Griffin J., van der Werf M., Nikolau B., Morrison N., et al. (2007a) The metabolomics standards initiative (MSI). Metabolomics 3: 175–178. [Google Scholar]
  15. Fiehn O., Sumner L.W., Rhee S.Y., Ward J., Dickerson J., Lange B.M., et al. (2007b) Minimum reporting standards for plant biology context information in metabolomic studies. Metabolomics 3: 195–201. [Google Scholar]
  16. Forsberg E.M., Huan T., Rinehart D., Benton H.P., Warth B., Hilmers B., et al. (2018) Data processing, multi-omic pathway mapping, and metabolite activity analysis using XCMS online. Nat. Protoc. 13: 633–651. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fu G., Batchelor C., Dumontier M., Hastings J., Willighagen E. and Bolton E. (2015) PubChemRDF: towards the semantic annotation of PubChem compound and substance databases. J. Cheminform. 7: 34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fukushima A. and Kusano M. (2013) Recent progress in the development of metabolome databases for plant systems biology. Front. Plant Sci. 4: 73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Fukushima A., Kusano M., Mejia R.F., Iwasa M., Kobayashi M., Hayashi N., et al. (2014) Metabolomic characterization of knockout mutants in Arabidopsis: development of a metabolite profiling database for knockout mutants in Arabidopsis. Plant Physiol. 165: 948–961. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Goodacre R., Broadhurst D., Smilde A.K., Kristal B.S., Baker J.D., Beger R., et al. (2007) Proposed minimum reporting standards for data analysis in metabolomics. Metabolomics 3: 231–241. [Google Scholar]
  21. Gurevich A., Mikheenko A., Shlemov A., Korobeynikov A., Mohimani H. and Pevzner P.A. (2018) Increased diversity of peptidic natural products revealed by modification-tolerant database search of mass spectra. Nat. Microbiol. 3: 319–327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Haug K., Cochrane K., Nainala V.C., Williams M., Chang J., Jayaseelan K.V., et al. (2020) MetaboLights: a resource evolving in response to the needs of its scientific community. Nucleic Acids Res. 48: D440–D444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Horai H., Arita M., Kanaya S., Nihei Y., Ikeda T., Suwa K., et al. (2010) MassBank: a public repository for sharing mass spectral data for life sciences. J. Mass Spectrom. 45: 703–714. [DOI] [PubMed] [Google Scholar]
  24. Ichihashi Y., Kusano M., Kobayashi M., Suetsugu K., Yoshida S., Wakatake T., et al. (2018) Transcriptomic and metabolomic reprogramming from roots to haustoria in the parasitic plant, Thesium chinense. Plant Cell Physiol. 59: 724–733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kanehisa M., Furumichi M., Sato Y., Ishiguro-Watanabe M. and Tanabe M. (2021) KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 49: D545–D551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kobayashi N., Kume S., Lenz K. and Masuya H. (2018) RIKEN MetaDatabase: a database platform for health care and life sciences as a microcosm of linked open data cloud. Int. J. Semant. Web Inf. Syst. 14: 140–164. [Google Scholar]
  27. Kopka J., Schauer N., Krueger S., Birkemeyer C., Usadel B., Bergmuller E., et al. (2005) GMD@CSB.DB: the Golm Metabolome Database. Bioinformatics 21: 1635–1638. [DOI] [PubMed] [Google Scholar]
  28. Kusano M., Fukushima A., Arita M., Jonsson P., Moritz T., Kobayashi M., et al. (2007) Unbiased characterization of genotype-dependent metabolic regulations by metabolomic approach in Arabidopsis thaliana. BMC Syst. Biol. 1: 53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Lombardot T., Morgat A., Axelsen K.B., Aimo L., Hyka-Nouspikel N., Niknejad A., et al. (2019) Updates in Rhea: SPARQLing biochemical reaction data. Nucleic Acids Res. 47: D596–D600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Mendez K.M., Pritchard L., Reinke S.N. and Broadhurst D.I. (2019) Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing. Metabolomics 15: 125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Ogasawara O., Kodama Y., Mashima J., Kosuge T. and Fujisawa T. (2020) DDBJ database updates and computational infrastructure enhancement. Nucleic Acids Res. 48: D45–D50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Oliver S.G., Winson M.K., Kell D.B. and Baganz F. (1998) Systematic functional analysis of the yeast genome. Trends Biotechnol. 16: 373–378. [DOI] [PubMed] [Google Scholar]
  33. Ono H., Ogasawara O., Okubo K. and Bono H. (2017) RefEx, a reference gene expression dataset as a web tool for the functional analysis of genes. Sci. Data 4: 170105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Papatheodorou I., Moreno P., Manning J., Fuentes A.M., George N., Fexova S., et al. (2020) Expression Atlas update: from tissues to single cells. Nucleic Acids Res. 48: D77–D83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Peters K., Bradbury J., Bergmann S., Capuccini M., Cascante M., de Atauri P., et al. (2019) PhenoMeNal: processing and analysis of metabolomics data in the cloud. Gigascience 8: giy149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Putri S.P., Nakayama Y., Matsuda F., Uchikata T., Kobayashi S., Matsubara A., et al. (2013) Current metabolomics: practical applications. J. Biosci. Bioeng. 115: 579–589. [DOI] [PubMed] [Google Scholar]
  37. Rai A. and Saito K. (2016) Omics data input for metabolic modeling. Curr. Opin. Biotechnol. 37: 127–134. [DOI] [PubMed] [Google Scholar]
  38. Ritchie M.E., Phipson B., Wu D., Hu Y.F., Law C.W., Shi W., et al. (2015) limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43: e47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Saito K. and Matsuda F. (2010) Metabolomics for functional genomics, systems biology, and biotechnology. Ann. Rev. Plant Biol. 61: 463–489. [DOI] [PubMed] [Google Scholar]
  40. Spicer R.A., Salek R. and Steinbeck C. (2017a) Compliance with minimum information guidelines in public metabolomics repositories. Sci. Data 4: 170137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Spicer R.A., Salek R. and Steinbeck C. (2017b) A decade after the metabolomics standards initiative it’s time for a revision. Sci. Data 4: 170138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Subbaraj A.K., Huege J., Fraser K., Cao M., Rasmussen S., Faville M., et al. (2019) A large-scale metabolomics study to harness chemical diversity and explore biochemical mechanisms in ryegrass. Commun. Biol. 2: 87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Sud M., Fahy E., Cotter D., Azam K., Vadivelu I., Burant C., et al. (2016) Metabolomics Workbench: an international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools. Nucleic Acids Res. 44: D463–470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. UniProt C. (2019) UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 47: D506–D515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Watanabe Y., Aoki-Kinoshita K.F., Ishihama Y. and Okuda S. (2021a) GlycoPOST realizes FAIR principles for glycomics mass spectrometry data. Nucleic Acids Res. 49: D1523–D1528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Watanabe Y., Yoshizawa A.C., Ishihama Y. and Okuda S. (2021b) The jPOST repository as a public data repository for shotgun proteomics. Methods Mol. Biol. 2259: 309–322. [DOI] [PubMed] [Google Scholar]
  47. Wilkinson M.D., Dumontier M., Aalbersberg I.J., Appleton G., Axton M., Baak A., et al. (2016) The FAIR guiding principles for scientific data management and stewardship. Sci. Data 3: 160018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Wishart D.S., Feunang Y.D., Marcu A., Guo A.C., Liang K., Vazquez-Fresno R., et al. (2018) HMDB 4.0: the human metabolome database for 2018. Nucleic Acids Res. 46: D608–D617. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Wishart D.S., Li C., Marcu A., Badran H., Pon A., Budinski Z., et al. (2020) PathBank: a comprehensive pathway database for model organisms. Nucleic Acids Res. 48: D470–D478. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Yamada I., Shiota M., Shinmachi D., Ono T., Tsuchiya S., Hosoda M., et al. (2020) The GlyCosmos portal: a unified and comprehensive web resource for the glycosciences. Nat. Methods 17: 649–650. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

pcab173_Supp

Data Availability Statement

The raw data underlying this article are available in RIKEN Plant Metabolome MetaDatabase (PMM) (http://metabobank.riken.jp/pmm/db/plantMetabolomics), DropMet (http://prime.psc.riken.jp/menta.cgi/prime/drop_index) and MassBase (http://webs2.kazusa.or.jp/massbase/).


Articles from Plant and Cell Physiology are provided here courtesy of Oxford University Press

RESOURCES