Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2024 Oct 31;53(D1):D1716–D1723. doi: 10.1093/nar/gkae860

SSBD: an ecosystem for enhanced sharing and reuse of bioimaging data

Koji Kyoda 1, Hiroya Itoga 2, Yuki Yamagata 3,4, Emi Fujisawa 5, Fangfang Wang 6,7, Miguel Miranda-Miranda 8, Haruna Yamamoto 9, Yasue Nakano 10, Yukako Tohsato 11,12, Shuichi Onami 13,14,
PMCID: PMC11701685  PMID: 39479781

Abstract

SSBD (https://ssbd.riken.jp) is a platform for the sharing and reuse of bioimaging data. As part of efforts to build a bioimaging data ecosystem, SSBD has recently been updated to a two-tiered data resource comprising SSBD:repository, a public repository for the sharing of all types of bioimaging data reported in journals, and SSBD:database, an added-value database for the sharing of curated, highly reusable, metadata-rich data. This update addresses the conflicting demands of rapid data publication and sharing of richly annotated data, thereby promoting bioimaging data sharing and reuse. With this update, SSBD is now positioned as a core repository and database within the foundingGIDE, an international consortium working to establish a global image data ecosystem. Harmonizing metadata between data resources enables cross-searching and data exchange with data resources from other countries and regions.

Graphical Abstract

Graphical Abstract.

Graphical Abstract

Introduction

Bioimaging technologies allow us to capture biological phenomena as they are, both inside and outside cells. For this reason, alongside molecular biology technologies, these imaging technologies have become some of the most effective tools for understanding the mechanisms that underlie biological phenomena. In recent years, the importance of sharing bioimaging data according to the FAIR (Findable, Accessible, Interoperable and Reusable) principles (1), similar to nucleotide sequences, protein structures and gene expression data, has been recognized (2–4), leading to the establishment of several public data repositories and databases (5–8). In Europe, the BioImage Archive has been established as a public repository of imaging data, and the Image Data Resource and the Electron Microscopy Public Image Archive have been developed as added-value databases (AVDBs) of optical and electron microscopy data. In the United States, the establishment of similar public data resources has been extensively discussed (9).

SSBD is a platform for the sharing and reuse of bioimaging data that we develop with support from the National Bioscience Database Center of the Japan Science and Technology Agency (5). At its launch in 2013, SSBD was a public database designed primarily for the sharing of datasets containing original image data together with extracted biosystems dynamics data (e.g. segmentation and tracking data). However, in 2015, as part of efforts to advance image analysis technology and promote access to state-of-the-art imaging technologies through data sharing, the database was expanded to include stand-alone image data. Then, in 2016, in response to requests from authors, SSBD launched a repository service for the deposition of bioimaging data before article publication. These experiences laid the groundwork for the subsequent development of the bioimaging data ecosystem concept.

Recently, the international community has expressed the need for a bioimaging data ecosystem (2,3). This ecosystem would comprise two tiers: a repository for easy storage of datasets with limited metadata and an AVDB containing highly reusable, metadata-rich data. Such a two-tiered data resource would allow for rapid data publication, while also promoting the sharing and reuse of bioimaging data. In 2024, the foundingGIDE consortium was established to build a global image data ecosystem (GIDE) that facilitates the exchange of bioimaging data across the various data resources (https://founding-gide.eurobioimaging.eu).

Bioimaging data ecosystem

To build a bioimaging data ecosystem, we updated SSBD in 2019 to a two-tiered data resource comprising SSBD:repository and SSBD:database (Figure 1). SSBD:repository is a public repository that accepts and contains all types of bioimaging data reported in peer-reviewed articles. To enable rapid data publication, SSBD:repository allows authors to share bioimaging data by providing limited metadata such as contact information, licenses and species. Since the 2019 update, SSBD has also issued digital object identifiers that ensure persistent access to the deposited datasets.

Figure 1.

Figure 1.

SSBD ecosystem for the sharing and reuse of bioimaging data. The system comprises two tiers: a repository tier (SSBD:repository) and an AVDB tier (SSBD:database). SSBD serves as a foundation for the sharing and reuse of bioimaging data, including image and analysis data, and metadata.

SSBD:database is a public AVDB that contains highly reusable, metadata-rich data (Figure 2). In this context, ‘highly reusable’ refers to data that include images recorded by using state-of-the-art imaging technologies as well as data obtained through systematic experiments. Currently, expert curators select the data that are included in SSBD:database in two ways: by selecting image data that fit the above criteria from data already deposited in SSBD:repository or by identifying image data that fit the criteria from published papers. In both approaches, the curators assign metadata by using various ontologies (Table 1). After the curation process, the authors are asked to check the metadata before their data are published. In the future, the goal is to use SSBD:repository to gather all types of bioimaging data collected in, but not limited to, Japan and published in peer-reviewed journals; from these data, the curators will select those with the most extensive metadata to be shared in SSBD:database.

Figure 2.

Figure 2.

Representative screenshots of SSBD:database. (A) Example of bioimaging datasets deposited in SSBD:database. (B) Each dataset includes project-related metadata such as license information, biosample details, Gene Ontology (GO) annotations and imaging metadata.

Table 1.

Ontologies currently used or planned for future SSBD updates

Topic Controlled vocabulary/ontology RRID Current Planned
Biological processes, Cellular components, Molecular functions Gene Ontology (GO) (26,27) RRID:SCR_002811
Cell types Cell Ontology (CL) (28) RRID:SCR_004251  
Cell lines Cell Line Ontology (CLO) (29) RRID:SCR_005840
Anatomy Uberon Multi-species Anatomy Ontology (UBERON) (30) RRID:SCR_010668  
Imaging Biological Imaging Methods Ontology (FBbi) RRID:SCR_010235
Chemical compounds Chemical Entities of Biological Interest Ontology (ChEBI) (31) RRID:SCR_002088  
Organisms NCBI Organismal Classification (NCBITaxon) (32) RRID:SCR_000479
Experimental conditions Ontology for Biomedical Investigations (OBI) (33) RRID:SCR_006266  
Experimental conditions Experimental Factor Ontology (EFO) (34) RRID:SCR_003574  
Units Units of Measurement Ontology (UO) (35) RRID:SCR_010442  
Genes Ensembl (ENSG) (36) RRID:SCR_002344  
Proteins Universal Protein Resource (UniProt) (37) RRID:SCR_002380  
Controlled vocabulary Medical Subject Headings (MeSH) (38) RRID:SCR_004750

RRID: Research Resource Identifier (https://rrid.site/).

Recent advances in imaging technology have been rapid, with new techniques and equipment being developed in waves. As a result, the criteria for selecting highly reusable data are expected to evolve quickly. To address this issue, we are currently establishing an advisory board by inviting experts recommended by biological and biomedical societies in Japan. With insights from experts well-versed in bioimaging technologies across various fields, the SSBD:database will continuously update its inclusion criteria.

Global sharing of bioimaging data

The foundingGIDE consortium was established in 2024 to build a GIDE. Currently, the consortium consists of seven organizations from Europe, Japan and Australia. The plan is to build an ecosystem for the sharing of bioimaging and preclinical imaging data. SSBD is positioned within this ecosystem as a core repository and database. In addition, efforts are underway to harmonize the use of metadata among SSBD and the BioImage Archive and Image Data Resource in Europe. This will facilitate the exchange of bioimaging data across these different resources (Figure 3).

Figure 3.

Figure 3.

Overview of the proposed GIDE. Harmonizing the metadata used in each repository and AVDB enables the interoperability of shared bioimaging data across the various data resources.

In the life sciences, global data sharing has already been achieved in several domains, such as the sharing of nucleotide sequences through the International Nucleotide Sequence Database Collaboration (10) and of protein structures through the Worldwide Protein Data Bank (11). Unlike the data shared in these fields, bioimaging data present a challenge because of their very large file sizes, making it impractical to mirror the data across multiple locations. Therefore, the global sharing for bioimaging data will be facilitated by harmonizing metadata, including information about the storage location of the bioimaging data. To achieve this harmonization, metadata must be standardized across the various bioimaging data resources through the use of controlled vocabularies and ontologies.

Adoption of community guidelines and recommendations

In recent years, there has been an accelerated movement toward the standardization of metadata, with initiatives such as REMBI (Recommended Metadata for Biological Images) (12), which provides guidelines for metadata related to bioimaging data, and QUAREP-LiMi (the Consortium for Quality Assessment and Reproducibility for Instruments and Images in Light Microscopy) (13), which aims to improve the quality control and assurance of optical microscopy data. SSBD is undergoing major updates to align with the guidelines that have been published by these initiatives by adjusting the metadata structure and the components it uses and by using controlled vocabularies through various ontologies (https://doi.org/10.6084/m9.figshare.27023533). In addition to basic metadata such as contact information and licensing, we have been expanding the metadata to include items directly related to the data, such as species, biological processes, cellular components and molecular functions, by using various ontologies (Table 1). We have also added metadata related to the samples from which the data were collected, such as strain and cell line names, and metadata related to the imaging methods that were used. The overall intent of this metadata model update with controlled vocabularies is to ensure the consistency and accuracy of metadata.

Next-generation file format for storing bioimaging data has been proposed (14). In SSBD, we plan to provide all bioimaging data in OME-Zarr format (15). We have already shared 12 datasets (https://ssbd.riken.jp/ssbd-ome-ngff-samples) on S3-compatible object storage, enabling parallel data access; these datasets are intended to be used to develop and validate data visualization and analysis tools compatible with the OME-Zarr format and are expected to contribute to the widespread use of this format.

Dataset update and reuse

The amounts of bioimaging data contained in SSBD have increased rapidly (Figure 4). Examples of datasets recently contained in SSBD include image data exceeding 100 million pixels captured with the trans-scale scope AMATERAS (16) and whole-brain imaging data of Mus musculus captured by using tissue clearing with light-sheet microscopy (17). Such datasets, captured by using state-of-the-art imaging technologies and covering a wide range of biological phenomena and tissues, are expected to lead to the development of new image analysis techniques and the discovery of new knowledge through data reuse. Similarly, images and morphological data concerning about 6000 gene mutants of the budding yeast Saccharomyces cerevisiae and phenotypic analysis results obtained through statistical analysis have also been contained in SSBD (18,19). These data, which comprise comprehensive image and morphological data for each of the genetic mutations, will be useful as reference data for future studies. Additionally, the comprehensive functional analysis of genes could be achieved through the development of new image analysis techniques.

Figure 4.

Figure 4.

SSBD statistics. (A) Annual trend in the number of projects registered in SSBD:repository. (B) Annual trend in the number of projects registered in SSBD:database. (C) Annual trend in the number of datasets shared in SSBD:database. ‘Analysis data’ include segmentation and tracking data. (D) Total size of the data stored in SSBD.

A primary goal of SSBD is to facilitate the reuse of the data contained in its database. For example, SSBD can be used to build databases tailored to specific species, organs or phenomena. Two such databases have been built by using bioimaging data contained in SSBD. One is NeuroGT (20), a mouse-brain image database that enables the classification and manipulation of neuron subsets on the basis of neuronal birthdate. The other is AZEBEX (21), a database of brain gene expression in adult zebrafish. Therefore, by using SSBD, it is possible to build databases that combine metadata tailored to specific species, organs and phenomena, and to integrate bioimaging data with other data types.

We are also tracking the development of analytical techniques that have been developed by using data contained in SSBD. For example, a deep-learning-based method to extract anatomical regions from mouse brain image data has been developed, and the analysis data have been shared on SSBD (22). Also, a method has been created to detect phenotypic abnormalities by using variational auto-encoders from image segmentation data for nuclear regions in Caenorhabditis elegans embryos (23). Such analytical techniques developed by using data contained in SSBD will likely provide further data that will also be shared on SSBD.

Dataset features

One of the features of SSBD is that it contains large amounts of biosystems dynamics data measured through image analysis. This is because SSBD has historically prioritized the collection of this type of data. Recently, numerous image segmentation methods have been developed with the rapid development of artificial intelligence technology, and the biosystems dynamics data contained in SSBD can be used to develop or evaluate the performance of these new image segmentation methods. SSBD shares its biosystems dynamics data in an open unified format, BD5 (24), that can be visualized by using QtBD5Viewer (https://doi.org/10.6084/m9.figshare.27023530). Another feature of SSBD is that it contains large amounts of time-series image data. This is because SSBD was originally developed for research focusing on the dynamics of biological phenomena. These data can contribute to the development of tracking methods.

We are also developing a new viewer to visualize image data synchronized with image segmentation and tracking data stored in SSBD. The new viewer will load image data stored in the OME-Zarr format by using the Viv library, along with biosystems dynamics data in BD5 format via the HDF5 Web API for visualization (Movie 1). We are also developing the BD-Zarr format, which will use OME-Zarr’s table extension to describe tracking data and the features of segmented biological objects (https://doi.org/10.6084/m9.figshare.27023527). In the near future, we plan to achieve more unified data access to further enable the visualization and analysis of image and biosystems dynamics data.

Movie 1.

Movie 1.

SSBD viewer (currently under development) will be able to synchronize and visualize image and biosystems dynamics data.

Training and social media

To expand the number of users, we have been working to introduce researchers to the methods of depositing and using data in SSBD through seminars, webinars and training programs organized in collaboration with several domestic academic societies. Our ultimate goal, however, is to expand the use of SSBD by collaborating with bioimaging communities in other regions in the Asia–Pacific. Through foundingGIDE, collaborations with Australia’s preclinical imaging data resources have already begun. Through the holding of training programs and other initiatives, bioimaging data should be appropriately shared across the relevant data resources.

As part of our public-relations efforts, we actively use social media networks such as X and Instagram. On X (https://x.com/ssbd_en), we help maximize the impact of research from which data have been deposited in SSBD by posting links to data and introductions to the papers at publication. In addition, we are starting outreach activities through our Instagram account (https://www.instagram.com/bioimagram) to raise awareness of the importance of bioimaging technology research activities among a broader audience, including the general public. SSBD supports mobile phone and tablet access to facilitate direct access via these social media networks platforms.

Outlook and challenges

We are currently planning to expand the range of data contained within SSBD. Recently, the development of techniques to measure spatial omics data has become increasingly active, with some of these methods being based on imaging technologies. SSBD is an ideal platform for sharing spatial omics data measured by these imaging techniques. The OME-Zarr community is also discussing how to store spatial omics data and share them in OME-Zarr format.

The aim of SSBD is not only to facilitate data sharing but also to promote data reuse. To achieve this, it will be necessary to share image analysis workflows and provide execution environments for the associated workflows. To support workflow sharing, we are currently collecting not only bioimaging data but also information about the storage locations of workflows on platforms such as GitHub from published papers. By establishing and introducing execution environments for these workflows, including external environments such as Google Colab and MyBinder, researchers can reproduce analyses and apply these workflows to their bioimaging data. These activities will allow SSBD to become a platform that covers the sharing and reuse of image and biosystems dynamics data and the reuse of workflows.

We are also working on the development of an image search service that uses image queries. By using methods such as WndChrm (25) and deep learning, we plan to classify the image data stored in SSBD and display images similar to the query image in ranked order. If the ranked image data contain links to workflows, it is expected that these workflows can also be applied to the query image data. In this way, SSBD can promote the reuse of workflows.

Finally, several tasks are underway to ensure sustainable database management. One of these tasks is web form-based data registration in SSBD:repository. Currently, data registration involves many manual steps to go from receiving an author’s request to data deposition. In the near future, with the implementation of government policies and prerequisites for data deposition before journal publication, data registration in SSBD will likely dramatically increase. By delegating the task of data registration to authors through web forms, we aim to reduce human error and improve efficiency. Another task that is underway is the automated annotation of metadata by using large language models. Currently, annotation in SSBD is performed manually, thus limiting the number of papers that can be handled each year (Figure 4B). Automated metadata annotation can assign predictive metadata to the data and assist curators in metadata assignment. By leveraging technologies such as large language models, we aim to reduce human- and time-related costs and thus establish a sustainable database management approach that can cope with the rapid growth of open data.

Supplementary Material

gkae860_Supplemental_File
Download video file (1.2MB, mp4)

Acknowledgements

We would like to express our gratitude to the members of the BioImage Archive, the Image Data Resource and the Open Microscopy Environment for their advice and support. We are also grateful for tools such as DeepL and Grammarly, which helped us to refine our writing while we carefully reviewed and edited their outputs during manuscript preparation. We would like to thank the staff of the Onami laboratory for their advice and help.

Author contributions: K.K.: metadata modeling, data selection, metadata curation, data conversion, writing—original draft; H.I.: system development, data receipt, management and import, writing—review and editing; Y.Y.: metadata modeling, writing—review and editing; E.F.: data selection, metadata curation; F.W.: data selection, metadata curation; M.M.-M.: visualization, search system; H.Y.: data receipt and management; Y.N.: data receipt and management; Y.T.: writing—review and editing; and S.O.: conceptualization, writing—review and editing.

Contributor Information

Koji Kyoda, Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.

Hiroya Itoga, Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.

Yuki Yamagata, Life Science Data Sharing Unit, RIKEN Information R&D and Strategy Headquarters, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Integrated Bioresource Information Division, RIKEN Bioresource Research Center, 3-1-1 Koyadai, Tsukuba, Ibaraki 350-0074, Japan.

Emi Fujisawa, Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.

Fangfang Wang, Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Life Science Data Sharing Unit, RIKEN Information R&D and Strategy Headquarters, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.

Miguel Miranda-Miranda, Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.

Haruna Yamamoto, Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.

Yasue Nakano, Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.

Yukako Tohsato, Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Faculty of Information Science and Engineering, Ritsumeikan University, 2-150 Iwakura-cho, Ibaraki, Osaka 567-8570, Japan.

Shuichi Onami, Laboratory for Developmental Dynamics, RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Life Science Data Sharing Unit, RIKEN Information R&D and Strategy Headquarters, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.

Data availability

All data and source codes are available on the SSBD (https://ssbd.riken.jp) and Figshare (https://doi.org/10.6084/m9.figshare.27023533, https://doi.org/10.6084/m9.figshare.27023530 and https://doi.org/10.6084/m9.figshare.27023527).

Supplementary data

Supplementary Data are available at NAR Online.

Funding

National Bioscience Database Center [JPMJND2201]; Core Research for Evolutional Science and Technology [JPMJCR1511 and JPMJCR1926]; Japan Society for the Promotion of Science [JP18H05412 and JP22H04926]; European Union [101130216]. Funding for open access charge: RIKEN Center for Biosystems Dynamics Research.

Conflict of interest statement. None declared.

References

  • 1. Wilkinson M.D., Dumontier M., Aalbersberg I.J., Appleton G., Axton M., Baak A., Blomberg N., Boiten J.W., da Silva Santos L.B., Bourne P.E.et al.. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data. 2016; 3:160018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Ellenberg J., Swedlow J.R., Barlow M., Cook C.E., Sarkans U., Patwardhan A., Brazma A., Birney E.. A call for public archives for biological image data. Nat. Methods. 2018; 15:849–854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Swedlow J.R., Kankaanpää P., Sarkans U., Goscinski W., Galloway G., Malacrida L., Sullivan R.P., Härtel S., Brown C.M., Wood C.et al.. A global view of standards for open image data formats and repositories. Nat. Methods. 2021; 18:1440–1446. [DOI] [PubMed] [Google Scholar]
  • 4. Hartley M., Iudin A., Padwardhan A., Sarkans U., Yoldaş A.K., Kleywegt G.J.. Providing open imaging data at scale: an EMBL-EBI perspective. Histochem. Cell Biol. 2023; 160:211–221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Tohsato Y., Ho K.H., Kyoda K., Onami S.. SSBD: a database of quantitative data of spatiotemporal dynamics of biological phenomena. Bioinformatics. 2016; 32:3471–3479. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Williams E., Moore J., Li S.W., Rustici G., Tarkowska A., Chessel A., Leo S., Antal B., Ferguson R.K., Sarkans U.et al.. The Image Data Resource: a bioimage data integration and publication platform. Nat. Methods. 2017; 14:775–781. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Hartley M., Kleywegt G.J., Patwardhan A., Sarkans U., Swedlow J.R., Brazma A.. The BioImage Archive—building a home for life-sciences microscopy data. J. Mol. Biol. 2022; 434:167505. [DOI] [PubMed] [Google Scholar]
  • 8. Iudin A., Korir P.K., Somasundharam S., Weyand S., Cattavitello C., Fonseca N., Salih O., Kleywegt G.J., Patwardhan A.. EMPIAR: the Electron Microscopy Public Image Archive. Nucleic Acids Res. 2023; 51:D1503–D1511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Bajcsy P., Bhattiprolu S., Borner K., Cimini B., Collinson L., Ellenberg J., Fiolka R., Giger M., Goscinski W., Hartley M.et al.. Enabling global image data sharing in the life sciences. 2024; arXiv doi:23 January 2024, preprint: not peer reviewedhttps://arxiv.org/abs/2401.13023.
  • 10. International Nucleotide Sequence Database Collaboration Karsch-Mizrachi I., Nakamura Y., Cochrane G.. The International Nucleotide Sequence Database Collaboration. Nucleic Acids Res. 2012; 40:D33–D37. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Berman H., Henrick K., Nakamura H.. Announcing the worldwide Protein Data Bank. Nat. Struct. Biol. 2003; 10:980. [DOI] [PubMed] [Google Scholar]
  • 12. Sarkans U., Chiu W., Collinson L., Darrow M.C., Ellenberg J., Grunwald D., Hériché J.K., Iudin A., Martins G.G., Meehan T.et al.. REMBI: Recommended Metadata for Biological Images—enabling reuse of microscopy data in biology. Nat. Methods. 2021; 18:1418–1422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Nelson G., Boehm U., Bagley S., Bajcsy P., Bischof J., Brown C.M., Dauphin A., Dobbie I.M., Eriksson J.E., Faklaris O.et al.. QUAREP-LiMi: a community-driven initiative to establish guidelines for quality assessment and reproducibility for instruments and images in light microscopy. J. Microsc. 2021; 284:56–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Moore J., Allan C., Besson S., Burel J.M., Diel E., Gault D., Kozlowski K., Lindner D., Linkert M., Manz T.et al.. OME-NGFF: a next-generation file format for expanding bioimaging data-access strategies. Nat. Methods. 2021; 18:1496–1498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Moore J., Basurto-Lozada D., Besson S., Bogovic J., Bragantini J., Brown E.M., Burel J.M., Moreno X.C., de Medeiros G., Diel E.E.et al.. OME-Zarr: a cloud-optimized bioimaging file format with international community support. Histochem. Cell Biol. 2023; 160:223–251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Ichimura T., Kakizuka T., Horikawa K., Seiriki K., Kasai A., Hashimoto H., Fujita K., Watanabe T.M., Nagai T.. Exploring rare cellular activity in more than one million cells by a transscale scope. Sci. Rep. 2021; 11:16539. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Susaki E.A., Tainaka K., Perrin D., Kishino F., Tawara T., Watanabe T.M., Yokoyama C., Onoe H., Eguchi M., Yamaguchi S.et al.. Whole-brain imaging with single-cell resolution using chemical cocktails and computational analysis. Cell. 2014; 157:726–739. [DOI] [PubMed] [Google Scholar]
  • 18. Ohya Y., Sese J., Yukawa M., Sano F., Nakatani Y., Saito T.L., Saka A., Fukuda T., Ishihara S., Oka S.et al.. High-dimensional and large-scale phenotyping of yeast mutants. Proc. Natl Acad. Sci. U.S.A. 2015; 102:19015–19020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Ohnuki S., Ohya Y.. High-dimensional single-cell phenotyping reveals extensive haploinsufficiency. PLoS Biol. 2018; 16:e2005130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Hirata T., Tohsato Y., Itoga H., Shioi G., Kiyonari H., Oka S., Fujimori T., Onami S.. NeuroGT: a brain atlas of neurogenic tagging CreER drivers for birthdate-based classification and manipulation of mouse neurons. Cell Rep. Methods. 2021; 1:100012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Hiraki-Kajiyama T., Miyasaka N., Ando R., Wakisaka N., Itoga H., Onami S., Yoshihara Y.. An atlas and database of neuropeptide gene expression in the adult zebrafish forebrain. J. Comp. Neurol. 2024; 532:e25619. [DOI] [PubMed] [Google Scholar]
  • 22. Shimojo Y., Suehara K., Hirata T., Tohsato Y.. Segmentation of mouse brain slices with unsupervised domain adaptation considering cross-sectional locations. IPSJ Trans. Bioinform. 2024; 17:33–39. [Google Scholar]
  • 23. Ito E., Ueda T., Takano R., Tohsato Y., Kyoda K., Onami S., Nishikawa I.. Pimenidis E., Angelov P., Jayne C., Papaleonidas A., Aydin M.. Phenotype anomaly detection for biological dynamics data using a deep generative model. Artificial Neural Networks and Machine Learning—ICANN 2022. Lecture Notes in Computer Science. 2022; 13530:Cham: Springer; 432–444. [Google Scholar]
  • 24. Kyoda K., Ho K.H.L., Itoga H., Tohsato Y., Onami S.. BD5: an open HDF5-based data format to represent quantitative biological dynamics data. PLoS One. 2020; 15:e0237468. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Shamir L., Orlov N., Eckley D.M., Macura T., Johnston J., Goldberg I.G.. Wndchrm—an open source utility for biological image analysis. Source Code Biol. Med. 2008; 3:13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T.et al.. Gene Ontology: tool for the unification of biology. Nat. Genet. 2000; 25:25–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Gene Ontology Consortium The Gene Ontology knowledgebase in 2023. Genetics. 2023; 224:iyad031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Diehl A.D., Meehan T.F., Bradford Y.M., Brush M.H., Dahdul W.M., Dougall D.S., He Y., Osumi-Sutherland D., Ruttenberg A., Sarntivijai S.et al.. The Cell Ontology 2016: enhanced content, modularization, and ontology interoperability. J. Biomed. Semantics. 2016; 7:44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Sarntivijai S., Lin Y., Xiang Z., Meehan T.F., Diehl A.D., Vempati U.D., Schürer S.C., Pang C., Malone J., Parkinson H.et al.. CLO: the Cell Line Ontology. J. Biomed. Semantics. 2014; 5:37. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Mungall C.J., Torniai C., Gkoutos G.V., Lewis S.E., Haendel M.A.. Uberon, an integrative multi-species anatomy ontology. Genome Biol. 2012; 13:R5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Hastings J., Owen G., Dekker A., Ennis M., Kale N., Muthukrishnan V., Turner S., Swainston N., Mendes P., Steinbeck C.. ChEBI in 2016: improved services and an expanding collection of metabolites. Nucleic Acids Res. 2016; 44:D1214–D1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Schoch C.L., Ciufo S., Domrachev M., Hotton C.L., Kannan S., Khovanskaya R., Leipe D., Mcveigh R., O’Neill K., Robbertse B.et al.. NCBI Taxonomy: a comprehensive update on curation, resources and tools. Database (Oxford). 2020; 2020:baaa062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Bandrowski A., Brinkman R., Brochhausen M., Brush M.H., Bug B., Chibucos M.C., Clancy K., Courtot M., Derom D., Dumontier M.et al.. The ontology for biomedical investigations. PLoS One. 2016; 11:e0154556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Malone J., Holloway E., Adamusiak T., Kapushesky M., Zheng J., Kolesnikov N., Zhukova A., Brazma A., Parkinson H.. Modeling sample variables with an experimental factor ontology. Bioinformatics. 2010; 26:1112–1118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Gkoutos G.V., Schofield P.N., Hoehndorf R.. The Units Ontology: a tool for integrating units of measurement in science. Database (Oxford). 2012; 2012:bas033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Martin F.J., Amode M.R., Aneja A., Austine-Orimoloye O., Azov A.G., Barnes I., Becker A., Bennett R., Berry A., Bhai J.et al.. Ensembl 2023. Nucleic Acids Res. 2023; 51:D933–D941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. UniProt Consortium UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 2023; 51:D523–D531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Sewell W. Medical subject headings in MEDLARS. Bull. Med. Libr. Assoc. 1964; 52:164–170. [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkae860_Supplemental_File
Download video file (1.2MB, mp4)

Data Availability Statement

All data and source codes are available on the SSBD (https://ssbd.riken.jp) and Figshare (https://doi.org/10.6084/m9.figshare.27023533, https://doi.org/10.6084/m9.figshare.27023530 and https://doi.org/10.6084/m9.figshare.27023527).


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES