Skip to main content
. 2022 Feb 12;16(8):1306–1320. doi: 10.1093/ecco-jcc/jjac027

Table 4.

Best practices for IBD omic data generation and interpretation. Many decision points are influenced by clinical feasibility and translational application potential.

Phase of research Decision points Recommendations Examples
Cohort design and sample collection Inclusion / exclusion criterion • Fine tune based on research question and record the clinical data
• Think about the control population/samples.
• Consider enriching clinical data with environmental data.
• Include or exclude inflamed/non-inflamed samples, treatment-naïve patients, postoperative setting etc
• Should it be healthy controls, patients with other IMIDs, or non-diseased sites from the same IBD patients?
• Smoking status, dietary factors, BMI, exercise etc
Sampling dynamics • Using low-throughput biomarkers, determine the relationship between flares and patterns in high-throughout -omic datasets measured from longitudinal cohorts.
• Generate the same set of -omic datasets at multiple time-points from the corresponding samples (subject to technical limitations, available biomaterial and any possible time-lags between the -omic datasets) to enable sample-to-sample comparability.
• Incorporate additional measurements after starting a new treatment
• Markers like fecal calprotectin and CRP
• For example, host transcriptomics + microbial metatranscriptomics from biopsies at different time-points over treatment course or disease course.
• For example, pharmacodynamics
Sampling site • Sample from the primary disease site but consider paired samples from other sites. • Primary disease site (ileum, colon etc), other sites (such as PBMCs)
Sampling for |microbiota • Microbiota profiles from primary disease site generally preferred over faecal samples. Nonetheless, the latter might prove to be valuable given the ease of sampling.
• Additional host and microbial community based omics datasets can bridge the environmental and host aspects.
• Primary disease site - luminal contents (from biopsies directly)
• Metabolomics, metabonomics, metatranscriptomics, metaproteomics, viromics, mycobiomics etc
Sampling mass • Pre-determine the number of samples (or biopsies) and/or amount of biomass required to generate the different -omic datasets from the same sample. • Depends on the specification of the kit/ in-house protocols used for the extraction of the different molecular fractions
Sample treatment • Follow standardized protocols for sampling, processing and storage specific for each -omic dataset where applicable. • Kit-based or in-house protocols
Sample storage and documentation • Use of registered biobanks recommended
• Samples to be systematically indexed and barcoded during storage.
• UK Biobank
• -
Data generation Cellular resolution (bulk or single cell or purified cell-types/ fractions) • Measurements from single cell technologies highly recommended. • -
Sequencing strategy (short or long read sequencing) • Sequencing type. Budget, biological question(s) etc dictate the choice • Long-read or short-read sequencing?
-Omic data type (genomics, transcriptomics, proteomics etc) • Dependent on biological question and various other factors like budget, research question, clinical and translational feasibility.
• Include novel molecules for -omic measurements.
• For example, bulk transcriptomics provide a relatively cheap option for high-throughput genome-wide profiling of biological state. However, proteomics is closer to phenotype than transcriptomics but lags behind on coverage.
• Circular RNAs, long non-coding RNA
Microbiota specific data resolution (16S or WGS) • 16S for preliminary studies
• WGS preferred for making in depth assessments due to its advantage of being able to make strain-level inferences
• -
• -

IMID, Immune Mediated Inflammatory Disease; PBMC, Peripheral Blood Mononuclear Cells; BMI , Body Mass Index; CRP, C-reactive protein; WGS, Whole Genome Sequencing.