Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2023 Jul 30:2023.07.27.550727. [Version 1] doi: 10.1101/2023.07.27.550727

Nextflow Pipeline for Visium and H&E Data from Patient-Derived Xenograft Samples

Sergii Domanskyi, Anuj Srivastava, Jessica Kaster, Haiyin Li, Meenhard Herlyn, Jill C Rubinstein, Jeffrey H Chuang
PMCID: PMC10402090  PMID: 37546876

Abstract

Highlights

We have developed an automated data processing pipeline to quantify mouse and human data from patient-derived xenograft samples assayed by Visium spatial transcriptomics with matched hematoxylin and eosin (H&E) stained image. We enable deconvolution of reads with Xenome, quantification of spatial gene expression from host and graft species with Space Ranger, extraction of B-allele frequencies, and splicing quantification with Velocyto. In the H&E image processing sub-workflow, we generate morphometric and deep learning-derived feature quantifications complementary to the Visium spots, enabling multi-modal H&E/expression comparisons. We have wrapped the pipeline into Nextflow DSL2 in a scalable, portable, and easy-to-use framework.

Summary

We designed a Nextflow DSL2-based pipeline, Spatial Transcriptomics Quantification (STQ), for simultaneous processing of 10x Genomics Visium spatial transcriptomics data and a matched hematoxylin and eosin (H&E)-stained whole slide image (WSI), optimized for Patient-Derived Xenograft (PDX) cancer specimens. Our pipeline enables the classification of sequenced transcripts for deconvolving the mouse and human species and mapping the transcripts to reference transcriptomes. We align the H&E WSI with the spatial layout of the Visium slide and generate imaging and quantitative morphology features for each Visium spot. The pipeline design enables multiple analysis workflows, including single or dual reference genomes input and stand-alone image analysis. We showed the utility of our pipeline on a dataset from Visium profiling of four melanoma PDX samples. The clustering of Visium spots and clustering of imaging features of H&E data reveal similar patterns arising from the two data modalities.

Full Text Availability

The license terms selected by the author(s) for this preprint version do not permit archiving in PMC. The full text is available from the preprint server.


Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES