Skip to main content
Cell Genomics logoLink to Cell Genomics
. 2023 Aug 9;3(8):100374. doi: 10.1016/j.xgen.2023.100374

A standard for sharing spatial transcriptomics data

Kayla C Jackson 1,2, Lior Pachter 2,3,
PMCID: PMC10435375  PMID: 37601972

Abstract

Spatial transcriptomic technologies have the potential to reveal critical relationships between the function of genes and cells and their spatial organization. Here, we provide a sharing model for spatial transcriptomics data with the aim of establishing a set of primary data and metadata needed to reproduce analyses and facilitate computational methods development.


Spatial transcriptomic technologies have the potential to reveal critical relationships between the function of genes and cells and their spatial organization. Here, we provide a sharing model for spatial transcriptomics data with the aim of establishing a set of primary data and metadata needed to reproduce analyses and facilitate computational methods development.

Main text

Introduction

Methods for spatial transcriptomic profiling are developing rapidly,1 with technologies able to capture the expression of hundreds2,3 to thousands4,5 of genes at increasingly high resolution. The realization that these developments depend on the dissemination of high-quality data is underscored by policies from major funding agencies that recommend unrestricted release of data.6,7 These recommendations complement the work of organizations that create standards to ensure persistent access to data8,9,10 and journals that require data availability statements in their publications. Unfortunately, meeting these expectations is challenging for researchers working in spatial transcriptomics, as image data can be unwieldy, and there are many intermediate data types that could be beneficial for analysis. As a result, there is no currently accepted standard for spatial transcriptomics data sharing or even agreement on what constitutes sufficient sharing to meet journal and funding agency expectations.

Spatial transcriptomics technologies can be broadly divided into imaging-based and sequencing-based technologies. Imaging-based technologies rely on hybridization of a fluorescently labeled probe that is complementary to mRNA, followed by image-based quantification. These technologies encompass in situ hybridization (ISH) and in situ sequencing (ISS) methods and capture spatial information over sequential rounds of hybridization and imaging. These methods vary in the number of hybridizations, probes and gene targets, and instrumentation used to generate the microscopy images. The complexity of the imaging parameters alone has led to development of community guidelines that suggest the minimum metadata required to reproduce an imaging experiment.11,12 Alternatively, technologies based on next-generation sequencing (NGS) are broadly based on region selection followed by NGS or positional barcoding of regular (e.g., 10X Visium) or irregular arrays (e.g., Slide-seq13). Sequencing-based technologies vary in their resolution and can have associated images that represent features that are not captured at the same resolution as the image. Images that require region selection are accompanied by careful annotation of the location within a tissue and the features of the tissue space. These technologies can ostensibly generate coordinates corresponding to specific locations in a histological region, microscopy images, a count matrix, and various metadata, and these disparate forms of data have their own guidelines for dissemination.14,15

Here, we provide recommendations for the dissemination of spatial transcriptomic data based on successful models for data sharing in imaging,11,12,16,17 high-throughput sequencing,14,18 and single-cell RNA sequencing (scRNA-seq) experiments.15,19,20 Certainly, previous frameworks have modeled FAIR principles,10 ensuring findability, accessibility, interoperability, and reusability of experimental data. While researchers cannot anticipate and accommodate all the ways in which their data will be used, we believe that adherence to a handful of principles we outline can be beneficial. These recommendations are based on existing data types and infrastructure for data storage and transmission but are extensible for larger and more complex datasets. We recognize that even when researchers know what data to share, the question of where to share it is not trivial to answer, and we discuss current solutions to ensure persistent and broad access to their data.

Sharing images

Images generated from fluorescent microscopy are the basis for several current spatial transcriptomic technologies and, as a standard, should be shared freely. Researchers must decide at which step in the processing pipeline images should be shared. On one hand, sharing raw images offers end users the greatest flexibility for reanalysis and the ability to develop statistical methods. In contrast, sharing pre-processed images means that end users are not required to have expertise in the software and analysis tools for image processing. The tradeoffs for these sharing paradigms are discussed in Table 1.

Table 1.

Advantages and disadvantages of image-sharing paradigms

Raw images
Pre-processed images
Advantages Disadvantages Advantages Disadvantages
Allows for the complete reanalysis of the data Requires image analysis expertise Does not require image analysis expertise Images are biased by choices of pre-processing methods made by the authors
Can compare upstream image analysis pipeline for different technologies (e.g., segmentation, background reduction, dot detection) Potentially in proprietary formats; cross-platform incompatibility Fewer analytical demands May not be compatible with other datasets
Allow data from multiple studies to be processed uniformly Requires extensive storage Useful for end users focusing on re-analysis of downstream results Requires extensive storage; but potentially less than raw images if many images are stitched together
Can improve reproducibility — others can verify that the results have not been “cherry-picked” May not be easy to import into common analysis software Will not get different results from original authors because of minor differences in pre-processing Potential for wrong “version" to be shared. Authors may not share processed images that generated results in paper
Allow analysis of features that may be called as “background” May require conversion to other formats before they can be uploaded to public repositories Common starting point when discussing differences in downstream results
End-user can experiment with less conservative processing methods Lack of standardized metadata may make usability challenging

Both raw and processed microscopy images can feasibly consume gigabytes to terabytes of memory. It is also likely that storage requirements will grow in the coming years. Repositories for hosting image datasets must have both the capacity and flexibility to accommodate growth and potentially more complex images in the future. Spatial transcriptomics datasets published by prominent consortia and private companies exhibit sizes spanning from hundreds of gigabytes to a few terabytes, offering insight into the typical scale of datasets. Several of the broad-spectrum repositories noted in Table 2 can accommodate datasets on this scale and would probably serve the purposes of most researchers using current technologies.

Table 2.

Storage capacities and fees associated with image data repositories

Host Scope Individual file size limit (GB) Dataset size limit Additional storage cost and fees Closed-access options? Comment
Figshare Broad spectrum 5 20 GB for free accounts; 5 TB for paid accounts $585 per 250 GB;
$160 processing fee
Yes
Zenodo Broad spectrum None stated 50 GB for free accounts; larger datasets accepted on case-by-case basis Accepts donations Yes
Dryad Broad spectrum 10 300 GB $50 per 10 GB;
$120 Publishing charge
No PPI is not allowed
Image Data Resource Cells, tissue None stated None stated None stated Limited Contact if dataset >10TB
Cell Image Library Organisms, cell types, and cellular processes None stated None stated None stated No
BioImage Archive Bioimaging data from molecule to organism scale None stated None stated None stated No File upload recommendations change based on size of project
EMPIAR Electron microscopy 1,000 None stated None stated No Only justifiable changes are permitted after release
GigaDB Broad spectrum None stated None stated None stated No Only hosts data from manuscripts accepted in Gigascience
Kaggle Broad spectrum 2 100 GB None stated Yes
ODAP/Harvard DataVerse Broad spectrum 2.5 1 TB None stated Yes Contact for datasets >1TB
IEEE Data Portal Broad spectrum None stated 2 TB; Institutional accounts have access up to 10 TB/dataset $1950 open access fee Limited to paid accounts

The community guidelines on sharing and reporting imaging data11 have underscored the importance of metadata standardization with respect to data reuse, and many of these recommendations are relevant to spatial transcriptomics experiments (Table S1). Efforts have been made to encourage use of the open microscopy environment (OME)-TIFF file format,16,21 and tools22 have been developed to convert between proprietary image and OME file types. The OME-TIFF file format is near ubiquitous in the bioimaging community and enjoys robust support in bioimaging software packages. Though OME-TIFF can support multi-dimensional data, it is most performant on 2D TIFF tiles and is not optimized for cloud-based storage infrastructure. The OME next-generation file format (NGFF) is emerging as a scalable image format to address these limitations. Importantly, the OME-NGFF23 is necessarily extensible and will flexibly support multimodal and multidimensional images in the future. These efforts are arguably necessary to maximize the utility of spatial transcriptomics data.

Sharing segmentation data

Image segmentation involves annotating images to distinguish relevant objects from background. Segmentation techniques employ techniques that include random forests,24 convolutional neural networks,25 or manual outlining26 to define the boundaries of regions that contain transcripts. In many contemporary methods, segmentation data fundamentally enable single-cell analysis and encode the spatial context of the tissue under study. This information holds significant utility, positioning it as a crucial component within the sharing paradigm.

Segmentation data are typically conveyed as an image mask where pixels associated with objects and background are populated with distinct values. In the binary case, the mask assigns a value of 0 to pixels that are considered background and a non-zero value (usually 1) to pixels considered part of a feature in the image. Researchers can use standard image processing software like ImageJ26 to extract the coordinates for each object. Notably, segmentation masks are sensitive to the specific tools used to process the input image. In cases where segmentation data are not shared, end users looking to reproduce segmentation results may be unable to deduce heuristic choices made by the original researcher. Sharing segmentation data either as an image mask or as a series of coordinates easily circumvents this challenge. Additionally, since segmentation is not typically performed on every image, researchers should unambiguously indicate which images were used as input to segmentation algorithms.

The utility of the segmentation mask is diminished without identifiers that relate the segmented objects to other analysis data. In the segmentation mask, pixels associated with cells can be assigned a value corresponding to a unique cell identifier. Alternatively, researchers can supply object coordinates in a tabular format and trivially add a column for an identifier. Many NGS-based methods use spatially indexed sequencing barcodes rather than image segmentation to encode spatial context. Here, a spatial barcode represents a “spot” on a microscopy slide and introduces the possibility that sequenced barcodes overlap with several segmented features. Researchers should provide a mapping file that describes how these spatial encodings intersect (Table S1), particularly in cases where a spatial barcode captures more than one segmented object.

Sharing sequencing data

Many journals require that sequencing data be deposited in repositories that adhere to MIAME/MINSEQE guidelines14,18 or that are part of the International Nucleotide Sequence Collaboration (INSDC).27 Furthermore, efforts to describe additional metadata that should be shared with scRNA-seq experiments15 are relevant to spatial transcriptomics experiments (Table S1). Among the most popular repositories are the EMBL-EBI ArrayExpress,28 European Genome Phenome Archive (EGA),29 NCBI Gene Expression Omnibus (GEO),30 and NCBI Database for Genotypes and Phenotypes (dbGaP).31 The GEO and ArrayExpress provide unrestricted access, while the dbGaP and EGA are suitable for researchers that require controlled access to data containing protected information. These resources all provide archival storage along with an identifier that facilitates later access. Repositories that are part of INSDC are committed to FAIR principles and are in support of a framework that ensures that sequencing data are permanently available to the scientific community.

Sharing expression matrices and metadata

The expression matrix serves as the fundamental analytical unit for almost all spatial transcriptomic analyses. It represents the raw or processed count of the number of RNA molecules corresponding to each gene detected in each cell. While sharing the count matrix alone obscures the upstream processing steps involved in its generation, their portability in a variety of formats makes them practical to share. The unprocessed count matrix, prior to any filtering or normalization, and raw sequencing data would provide the greatest benefit to end users. Nevertheless, it remains important for researchers to share a thorough description of the downstream processing steps that were employed to generate the matrix. Importantly, the identifiers in the count matrix should be linked to those submitted to sequencing and imaging repositories. Models have been proposed for the types of metadata that should be shared with count matrices generated from scRNA-seq technologies,15 but complex metadata for spatial experiments are not easily transferable and can require custom infrastructure for their conveyance.

Two examples of metadata with special relevance include the geometries or shapes of segmented objects and the locations of individual transcripts, with the latter being available in technologies that use ISH for expression profiling. The geometry information is typically conveyed as coordinates, in pixel or physical space, defining the boundaries and center of the object. The coordinates of transcripts are typically located within these boundaries. Sharing these data in a memory-efficient manner that supports indexing across the dimensions of the expression matrix can be challenging, especially as the number of cells and genes profiled in a single experiment continues to grow. Various representations, such as SpatialFeatureExperiment32 in R, have been proposed, drawing inspiration from geospatial data analysis and integrating standards for storage and access of spatial geometries into objects familiar to biologists. By utilizing methods developed for geospatial analysis, biologists can take advantage of existing tools to explore spatial relationships. Ongoing efforts are focused on adapting these methods to take advantage of on-disk operations and on-demand memory loading to accommodate large datasets. The new SpatialData33 library is based on the OME-NGFF framework and addresses many of these memory challenges, but its adoption requires end users to have familiarity with Python. In keeping with FAIR principles, the ideal implementation should support interoperability across platforms and embed metadata alongside the count matrix.

Conclusion

The issue of ensuring FAIR access to data is not unique to the spatial transcriptomics field and has been considered elsewhere,34 but the challenges associated with sharing a combination of imaging, sequencing, and tabular data demand particular attention. The ideal data sharing paradigm should balance the difficulty imparted on the researchers generating the data and the utility to end users.

We have developed recommendations for sharing of spatial transcriptomics data based on the data types produced by existing technologies and storage capacities of widely used data repositories. However, spatial transcriptomics is a rapidly evolving field and advances in sequencing and imaging technologies will necessarily require that sharing standards be updated to accommodate new components that are not considered here. For example, efforts to combine spatial transcriptomics with three-dimensional imaging techniques will reasonably lead to the generation of multidimensional expression matrices and annotations. The exact amount of additional memory that will be required solely for storing data generated by these advancements poses a challenge. It is reasonable to anticipate that previous estimates of a typical dataset size ranging from several hundred gigabytes to a few terabytes may soon become outdated. Addressing these challenges will require improved infrastructure and the utilization of cloud-based solutions. We also anticipate that overcoming these challenges will emphasize collaboration and open data-sharing initiatives to support further progress.

The increasing volume of spatial transcriptomics datasets combined with the complexity of human sequencing datasets underscores the need to implement robust restrictions to safeguard genetic privacy. Achieving a balance between data utility and privacy protection can be accomplished by reinforcing data access controls and establishing comprehensive data use agreements. Additionally, as more data are being migrated to cloud-based servers, it is crucial to regularly monitor repositories containing human data to ensure secure storage protocols are in place both during transit and archiving. Furthermore, as policy changes are adapted to include evolving technologies and emerging privacy risks, it is important to develop protocols that will retroactively and automatically secure existing datasets to align with policy changes. These considerations are essential for facilitating scientific advancements and retaining the trust of research participants.

The challenges faced by spatial transcriptomics researchers cannot be understated, but, given that scientific progress rests on the ability of researchers to use existing data to pose questions not yet considered, the difficulties in sharing data must be balanced against the needs for transparency, reproducibility, and usability. We encourage researchers to examine their goals for data sharing at each step of the analysis pipeline. By taking these considerations into account, the broader scientific community can fulfill its promise to ensure that research data are accessible and that the results derived from them are reproducible.

Acknowledgments

Declaration of interests

The authors declare no competing interests.

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.xgen.2023.100374.

Supplemental information

Table S1. Description of attributes related to spatial transcriptomics datasets
mmc1.xlsx (11.9KB, xlsx)
Document S1. Transparent peer review records for Jackson et al
mmc2.pdf (393.5KB, pdf)

References

  • 1.Moses L., Pachter L. Museum of spatial transcriptomics. Nat. Methods. 2022;19:534–546. doi: 10.1038/s41592-022-01409-2. [DOI] [PubMed] [Google Scholar]
  • 2.Shah S., Lubeck E., Zhou W., Cai L. seqFISH Accurately Detects Transcripts in Single Cells and Reveals Robust Spatial Organization in the Hippocampus. Neuron. 2017;94:752–758.e1. doi: 10.1016/j.neuron.2017.05.008. [DOI] [PubMed] [Google Scholar]
  • 3.Chen K.H., Boettiger A.N., Moffitt J.R., Wang S., Zhuang X. RNA imaging. Spatially resolved, highly multiplexed RNA profiling in single cells. Science. 2015;348:aaa6090. doi: 10.1126/science.aaa6090. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Ståhl P.L., Salmén F., Vickovic S., Lundmark A., Navarro J.F., Magnusson J., Giacomello S., Asp M., Westholm J.O., Huss M., et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science. 2016;353:78–82. doi: 10.1126/science.aaf2403. [DOI] [PubMed] [Google Scholar]
  • 5.Merritt C.R., Ong G.T., Church S.E., Barker K., Danaher P., Geiss G., Hoang M., Jung J., Liang Y., McKay-Fleisch J., et al. Multiplex digital spatial profiling of proteins and RNA in fixed tissue. Nat. Biotechnol. 2020;38:586–599. doi: 10.1038/s41587-020-0472-9. [DOI] [PubMed] [Google Scholar]
  • 6.NSF Public Access Initiative. https://beta.nsf.gov/public-access.
  • 7.Data Management & Sharing Policy Overview | Data Sharing https://sharing.nih.gov/data-management-and-sharing-policy/about-data-management-and-sharing-policy/data-management-and-sharing-policy-overview.
  • 8.Center for Open Science. https://www.cos.io/about/mission.
  • 9.OpenAIRE. https://www.openaire.eu/.
  • 10.Wilkinson M.D., Dumontier M., Aalbersberg I.J.J., Appleton G., Axton M., Baak A., Blomberg N., Boiten J.-W., da Silva Santos L.B., Bourne P.E., et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data. 2016;3 doi: 10.1038/sdata.2016.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Sarkans U., Chiu W., Collinson L., Darrow M.C., Ellenberg J., Grunwald D., Hériché J.K., Iudin A., Martins G.G., Meehan T., et al. REMBI: Recommended Metadata for Biological Images-enabling reuse of microscopy data in biology. Nat. Methods. 2021;18:1418–1422. doi: 10.1038/s41592-021-01166-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Schapiro D., Yapp C., Sokolov A., Reynolds S.M., Chen Y.-A., Sudar D., Xie Y., Muhlich J., Arias-Camison R., Arena S., et al. MITI minimum information guidelines for highly multiplexed tissue images. Nat. Methods. 2022;19:262–267. doi: 10.1038/s41592-022-01415-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Stickels R.R., Murray E., Kumar P., Li J., Marshall J.L., Di Bella D.J., Arlotta P., Macosko E.Z., Chen F. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nat. Biotechnol. 2021;39:313–319. doi: 10.1038/s41587-020-0739-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Brazma A., Ball C., Bumgarner R., Furlanello C., Miller M., Quackenbush J., Reich M., Rustici G., Stoeckert C., Trutane S.C., et al. 2012. MINSEQE: Minimum Information about a High-Throughput Nucleotide SeQuencing Experiment - a Proposal for Standards in Functional Genomic Data Reporting. [DOI] [Google Scholar]
  • 15.Füllgrabe A., George N., Green M., Nejad P., Aronow B., Fexova S.K., Fischer C., Freeberg M.A., Huerta L., Morrison N., et al. Guidelines for reporting single-cell RNA-seq experiments. Nat. Biotechnol. 2020;38:1384–1386. doi: 10.1038/s41587-020-00744-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Besson S., Leigh R., Linkert M., Allan C., Burel J.-M., Carroll M., Gault D., Gozim R., Li S., Lindner D., et al. Bringing Open Data to Whole Slide Imaging. Digit Pathol. 2019:3–10. doi: 10.1007/978-3-030-23937-4_1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Deutsch E.W., Ball C.A., Berman J.J., Bova G.S., Brazma A., Bumgarner R.E., Campbell D., Causton H.C., Christiansen J.H., Daian F., et al. Minimum information specification for in situ hybridization and immunohistochemistry experiments (MISFISHIE) Nat. Biotechnol. 2008;26:305–312. doi: 10.1038/nbt1391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Brazma A., Hingamp P., Quackenbush J., Sherlock G., Spellman P., Stoeckert C., Aach J., Ansorge W., Ball C.A., Causton H.C., et al. Minimum information about a microarray experiment (MIAME)-toward standards for microarray data. Nat. Genet. 2001;29:365–371. doi: 10.1038/ng1201-365. [DOI] [PubMed] [Google Scholar]
  • 19.Rehm H.L., Page A.J.H., Smith L., Adams J.B., Alterovitz G., Babb L.J., Barkley M.P., Baudis M., Beauvais M.J.S., Beck T., et al. GA4GH: International policies and standards for data sharing across genomic research and healthcare. Cell Genom. 2021;1 doi: 10.1016/j.xgen.2021.100029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.NHGRI GDS Policy Data Standards . 2019. Genome.gov.https://www.genome.gov/about-nhgri/Policies-Guidance/Genomic-Data-Sharing/data-standards [Google Scholar]
  • 21.Swedlow J.R., Goldberg I., Brauner E., Sorger P.K. Informatics and quantitative analysis in biological imaging. Science. 2003;300:100–102. doi: 10.1126/science.1082602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Linkert M., Rueden C.T., Allan C., Burel J.-M., Moore W., Patterson A., Loranger B., Moore J., Neves C., MacDonald D., et al. Metadata matters: access to image data in the real world. J. Cell Biol. 2010;189:777–782. doi: 10.1083/jcb.201004104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Moore J., Allan C., Besson S., Burel J.-M., Diel E., Gault D., Kozlowski K., Lindner D., Linkert M., Manz T., et al. OME-NGFF: a next-generation file format for expanding bioimaging data-access strategies. Nat. Methods. 2021;18:1496–1498. doi: 10.1038/s41592-021-01326-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Berg S., Kutra D., Kroeger T., Straehle C.N., Kausler B.X., Haubold C., Schiegg M., Ales J., Beier T., Rudy M., et al. ilastik: interactive machine learning for (bio)image analysis. Nat. Methods. 2019;16:1226–1232. doi: 10.1038/s41592-019-0582-9. [DOI] [PubMed] [Google Scholar]
  • 25.Stirling D.R., Swain-Bowden M.J., Lucas A.M., Carpenter A.E., Cimini B.A., Goodman A. CellProfiler 4: improvements in speed, utility and usability. BMC Bioinf. 2021;22:433. doi: 10.1186/s12859-021-04344-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Schneider C., Rasband W., Eliceiri K. NIH Image to ImageJ: 25 years of image analysis. Nat. Methods. 2012;9:671–675. doi: 10.1038/nmeth.2089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Arita M., Karsch-Mizrachi I., Cochrane G. The international nucleotide sequence database collaboration. Nucleic Acids Res. 2021;49:D121–D124. doi: 10.1093/nar/gkaa967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Athar A., Füllgrabe A., George N., Iqbal H., Huerta L., Ali A., Snow C., Fonseca N.A., Petryszak R., Papatheodorou I., et al. ArrayExpress update - from bulk to single-cell expression data. Nucleic Acids Res. 2019;47:D711–D715. doi: 10.1093/nar/gky964. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Freeberg M.A., Fromont L.A., D’Altri T., Romero A.F., Ciges J.I., Jene A., Kerry G., Moldes M., Ariosa R., Bahena S., et al. The European Genome-phenome Archive in 2021. Nucleic Acids Res. 2022;50:D980–D987. doi: 10.1093/nar/gkab1059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Barrett T., Wilhite S.E., Ledoux P., Evangelista C., Kim I.F., Tomashevsky M., Marshall K.A., Phillippy K.H., Sherman P.M., Holko M., et al. NCBI GEO: archive for functional genomics data sets—update. Nucleic Acids Res. 2013;41:D991–D995. doi: 10.1093/nar/gks1193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Mailman M.D., Feolo M., Jin Y., Kimura M., Tryka K., Bagoutdinov R., Hao L., Kiang A., Paschall J., Phan L., et al. The NCBI dbGaP database of genotypes and phenotypes. Nat. Genet. 2007;39:1181–1186. doi: 10.1038/ng1007-1181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.SpatialFeatureExperiment: Extension of SpatialExperiment with sf (Github). https://github.com/pachterlab/SpatialFeatureExperiment.
  • 33.Marconato L., Palla G., Yamauchi K.A., Virshup I., Heidari E., Treis T., Toth M., Shrestha R.B., Vöhringer H., Huber W., et al. SpatialData: an open and universal data framework for spatial omics. bioRxiv. 2023 doi: 10.1101/2023.05.05.539647. 2023.05.05.539647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Tedersoo L., Küngas R., Oras E., Köster K., Eenmaa H., Leijen Ä., Pedaste M., Raju M., Astapova A., Lukner H., et al. Data sharing practices and data availability upon request differ across scientific disciplines. Sci. Data. 2021;8:192. doi: 10.1038/s41597-021-00981-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S1. Description of attributes related to spatial transcriptomics datasets
mmc1.xlsx (11.9KB, xlsx)
Document S1. Transparent peer review records for Jackson et al
mmc2.pdf (393.5KB, pdf)

Articles from Cell Genomics are provided here courtesy of Elsevier

RESOURCES