Abstract
Genome-wide association studies (GWAS) have identified a vast number of variants associated with various complex human diseases and traits. However, most of these GWAS variants reside in non-coding regions producing no proteins, making the interpretation of these variants a daunting challenge. Prior evidence indicates that a subset of non-coding variants detected within or near cis-regulatory elements (e.g., promoters, enhancers, silencers, and insulators) might play a key role in disease etiology by regulating gene expression. Advanced sequencing- and imaging-based technologies, together with powerful computational methods, enabling comprehensive characterization of regulatory DNA interactions, have substantially improved our understanding of the three-dimensional (3D) genome architecture. Recent literature witnesses plenty of examples where using chromosome conformation capture (3C)-based technologies successfully links non-coding variants to their target genes and prioritizes relevant tissues or cell types. These examples illustrate the critical capability of 3D genome organization in annotating non-coding GWAS variants. This review discusses how 3D genome organization information contributes to elucidating the potential roles of non-coding GWAS variants in disease etiology.
Keywords: 3D genome organization, GWAS variants, non-coding DNA variation, Hi-C, TADs, FIREs, chromatin interactions
Introduction
Genome-wide association studies (GWAS) have achieved great success during the last two decades, reproducibly identifying hundreds of thousands of genetic variants associated with complex human diseases and traits (Buniello et al., 2019). However, only a small proportion (<10%) of GWAS variants alter the coding sequence of the human genome, where relatively straightforward hypotheses can be formed to link these variants to organism-level phenotypes directly. The remaining vast majority (i.e.,>90%) of GWAS variants reside in non-coding regions, making the interpretation of these variants a daunting challenge in the post-GWAS era (Hindorff et al., 2009; Sun et al., 2022).
To better understand the functional roles of non-coding GWAS variants, it is essential to annotate the non-coding regions, which account for ∼97% of the human genome. In recent years, the ENCODE consortium (ENCODE Project Consortium, 20l2; ENCODE Project Consortium et al., 2020) and the Roadmap Epigenomics Consortium (Roadmap Epigenomics Consortium et al., 2015) have identified millions of cis-regulatory elements (CREs) (including enhancers, promoters, silencers, and insulators) across a large number of human tissues and cell types. These CREs play critical roles in regulating the expression of their target genes in a cell-type-specific manner. Intriguingly, many studies have demonstrated significant enrichment of non-coding GWAS variants within CREs (Degner et al., 2012; Trynka et al., 2013; Zhang and Lupski, 2015), suggesting an indirect yet crucial role of these non-coding GWAS variants. Instead of directly changing the protein-coding DNA sequences, these non-coding variants may disrupt the functional roles of CREs, resulting in dysregulation of relevant genes.
The comprehensive annotation of CREs is a substantial step forward in understanding the non-coding GWAS variants. However, it remains challenging to assign non-coding GWAS variants-overlapped CREs to their target genes in disease-relevant tissues and cell types. How CREs regulate the expression of their target genes is still an open question in the genomics field. The difficulties lie in at least four aspects. First of all, the same CRE, such as a super-enhancer, may regulate multiple genes simultaneously. In addition, genes with critical functional roles, such as cell-type-marker genes, may be regulated by multiple CREs simultaneously to allow for some buffer in the presence of disrupted CREs. Along the line, we have recently reported super interactive promoters (SIPs) that interact with a more significant number of CREs than non-SIPs (Wen et al., 2022). Moreover, both the function of CREs and the relationship between CREs and their target gene(s) are highly tissue- or cell-type-specific. Last but not least, the majority of genes are not regulated merely by CREs in a close one-dimensional (1D) vicinity. Instead, CREs can form DNA loops with the promoter of their target gene(s) and regulate the expression of gene(s) from hundreds of kilobase (Kb) away (Dekker et al., 2013) or even over 1 Mb away (Fulco et al., 2016). Thus, a deep understanding of chromatin spatial organization can shed novel insights on gene regulation mechanisms and disease etiology.
Recently developed genomics and high-resolution imaging technologies (Jerkovic and Cavalli, 2021) provide revolutionary tools to map the nucleus’s three-dimensional (3D) genome. Coupling with powerful genome or epigenome editing tools such as CRISPR/Cas9, CRISPRi, and CRISPRa (Yin et al., 2017; Nakamura et al., 2021), researchers can not only measure the spatial proximity between non-coding GWAS variants-overlapped CREs and their putative target gene(s) but also functionally validate the role of CREs in disease-relevant cell types. For example, recent studies have shown that non-coding GWAS variants can alter the 3D chromatin structure and contribute to the risk of various disorders, including cancer, asthma, thalassemia, sex reversal, and limb malformation (Benko et al., 2011; Lupiáñez et al., 2015; Lupiáñez et al., 2016; Franke et al., 2016; Krijger and de Laat, 2016; Schmiedel et al., 2016; Schmitt et al., 2016b; Yu and Ren, 2017; Li et al., 2018; Liu et al., 2022b). Thus, characterizing 3D chromatin structure has the potential to prioritize disease causal genes, particularly those spatially close but far away in 1D genomic distance from their CREs, and reveal mechanistic insights underlying non-coding GWAS variants.
This review paper will describe the state-of-the-art experimental technologies, including sequencing-based and imaging-based approaches, to map chromatin spatial organization. In addition, we will summarize advanced computational methods to integrate transcriptome, epigenome, and 3D genome data to achieve a deep understanding of the functional roles of non-coding GWAS variants. We highlight recent breakthroughs in predicting and validating disease causal genes of non-coding GWAS variants and discuss challenges and opportunities for future endeavors.
Experimental methods for detecting regulatory DNA interactions
There are three major approaches for examining 3D genome structure: microscopy (imaging)-based techniques, sequencing-based approaches, and integrative methods (Figure 1). Microscopy-based approaches quantify cell-to-cell variability in chromatin architecture at certain genomic regions by visualizing the relative placement of these genomic regions in single cells (Jerkovic and Cavalli, 2021). In contrast, sequencing-based approaches measure chromatin contacts by crosslinking spatially close DNA segments and then applying deep sequencing to these crosslinked segments (Jerkovic and Cavalli, 2021). Integrative methods simultaneously leverage both sequencing- and microscopy-based methods, applying these two techniques to the same cell (Boninsegna et al., 2022).
Microscopy-based methods, including fluorescence in situ hybridization (FISH) and more advanced FISH-based techniques, estimate the relative distance by hybridizing DNA probes of specific genomic regions and then using a microscope for visualization (Su et al., 2020; Jerkovic and Cavalli, 2021; Zhuang, 2021). The earlier FISH-based methods were limited by the resolution and coverage of the genome. In terms of resolution, FISH-based methods have been significantly improved via the super-resolution microscopy technology that has increased spatial resolution. Regarding genome coverage, oligopaints-based FISH methods have been developed, where the oligopaints are fluorescently-labeled DNA oligonucleotides designed for imaging genomic regions (Beliveau et al., 2012, 2014). These methods include multiplex FISH (Zhuang, 2021) and OligoSTORM (Beliveau et al., 2017). Multiplex FISH can detect a larger number of loci by running multiple rounds of imaging fluorophore-labeled oligo probes—within each round, using different fluorophores for different regions to detect chromatin interactions. OligoSTORM is coupled with the STORM imaging technology for super-resolution imaging and can be further combined with other methods to increase coverage. Oligopaint barcode-based methods have been developed to increase further the efficiency of detecting chromatin interactions. These methods include the FISH-based ORCA method (Mateo et al., 2019) and OligoFISSEQ (Nguyen et al., 2020). ORCA partitions the target region into consecutive short regions with unique barcodes, where the barcodes are connected to probes carrying a common fluorophore-labeled oligo for imaging, avoiding the use of different fluorophores. OligoFISSEQ uses the FISSEQ technology (Lee et al., 2015) to read the oligopaints barcode for imaging and image multiple target regions for thousands of cells to estimate cell-to-cell variability. OligoFISSEQ can also be combined with OligoSTORM to image hundreds of target regions (Nguyen et al., 2020).
Sequencing-based methods can be categorized based on whether they can estimate chromatin interactions across the whole genome and implement proximity ligation to process crosslinked segments (Jerkovic and Cavalli, 2021). Under the former classification, methods covering the entire genome are non-enrichment methods, while methods covering specific types of interactions are enrichment methods. With the latter taxonomy, proximity ligation methods are C-based and otherwise non-C-based. Among non-enrichment methods, C-based methods such as Hi-C (Lieberman-Aiden et al., 2009) and its variants [e.g., Micro-C (Hsieh et al., 2016)] can generate all possible pairwise interactions of the whole genome. Unbiased Hi-C approaches require ultra deep sequencing depth for high resolution inference, which can be cost prohibitive. For example, we typically need several billion raw reads to detect chromatin interactions at Kb resolution. Non-C-based methods such as SPRITE (Quinodoz et al., 2018) and GAM (Beagrie et al., 2017) were developed using ligation-free technologies that allow for multi-way interactions. SPRITE quantifies higher-order chromatin interactions by adopting a split-pool approach to barcode the crosslinked DNA segments. In contrast, GAM maps spatial proximity of multiple DNA segment by determining the extent of co-segregation in the same cryo-sectioned and laser-microdissected compartment. While non-enrichment-based methods provide an unbiased view of the entire genome, enrichment methods have been proposed to empower closer and finer-resolution interrogation at interactions enriched in specific genomic regions or associated with particular proteins or epigenomic marks. The most commonly used C-based enrichment methods that do not involve probe design include ChIA-PET (Fullwood et al., 2009), HiChIP (Mumbach et al., 2016) and PLAC-seq (Fang et al., 2016). ChIA-PET (chromatin interaction analysis by paired-end tag) estimates interactions mediated by a protein of interest by first applying immuno-precipitation to enrich fragments with the protein of interest, and then the regular Hi-C proximity ligation before sequencing ligation products. In contrast, HiChIP and PLAC-seq technologies apply segmentation and proximity ligation first and then use protein immunoprecipitation for the enrichment of the desired ligation products. Capture-C (Hughes et al., 2014; Davies et al., 2016) and capture Hi-C (Mifsud et al., 2015) are also C-based enrichment methods. Compared to HiChIP and PLAC-seq, Capture-C and capture Hi-C require designing probes for a given set of sequences of interest (e.g., promoters or GWAS loci) to enrich ligation products in local regions. Among non-C-based enrichment methods, adapted-DamID (Cléard et al., 2006) first tethers DNA adenine methyltransferase (Dam) to a specific region and then detects DNA methylation patterns for this region and distant regions to identify chromatin interactions (Aughey et al., 2019).
Imaging-based and sequencing-based methods, as two orthogonal types of experimental approaches, have their own unique strengths and weaknesses. The key advantage of the imaging-based methods is to record 3D coordinates of each genomic locus, and directly measure spatial distance among genomic loci. In addition, imaging-based methods can achieve single cell resolution, facilitating the characterization of cell-to-cell variability in chromatin spatial organization. However, currently available imaging-based methods cannot yet simultaneously achieve Kb resolution and genome-wide coverage: existing methods either measure the whole genome at megabase (Mb) resolution (Takei et al., 2021a; 2021b), an entire chromosome or several Mb regions at 25–50 Kb resolution (Su et al., 2020; Takei et al., 2021a, 2021b), or a small region (∼210 Kb containing TSS of a gene of interest and its interacting enhancers) at 5 Kb resolution (Huang et al., 2021). It is still technically challenging to image the whole genome at Kb resolution, limiting its utility for genome-wide high resolution mapping of enhancer-promoter interactions in mammalian genomes.
In contrast, sequencing-based methods can generate Kb (Rao et al., 2014a; Bonev et al., 2017) or even nucleosome resolution (Krietenstein et al., 2020) map of mammalian 3D genomes, as long as the sequencing depth is sufficiently high. They usually enjoy higher sensitivity than imaging-based methods in terms of detecting genome-wide regulatory DNA interactions. One key weakness of sequencing-based methods is that they do not directly measure the spatial distance between genomic loci of interest, but rather gauge the frequency of the loci coming in spatial proximity, which is an indirect measure of 3D distance. Moreover, most widely used sequencing-based methods are designed for bulk samples containing 105–106 cells. Single-cell-based sequencing methods, including single-cell Hi-C (scHi-C) (Nagano et al., 2013), sci-Hi-C (Kim et al., 2020), sc-m3c-seq (Lee et al., 2019) and Dip-C (Tan et al., 2018), all suffer from limited capture efficiency per cell, making the quantification of cell-to-cell variability extremely challenging.
Taken together, investigators need to balance the pros and cons of different experimental methods, based on their specific scientific questions. For example, we would recommend imaging-based methods when the primary interest is to understand cell-to-cell variability in regulatory DNA interactions near a specific gene or element of interest. While for another example, when the primary goal is to comprehensively characterize genome-wide enhancer-promoter interactions, sequencing-based methods would be a better choice.
Integrative approaches have been developed to combine the advantages of imaging- and sequencing-based methods for better genome coverage and higher resolution. For example, in situ genome sequencing (IGS) was designed to jointly conduct sequencing and imaging simultaneously for intact genomes and directly link DNA sequence to 3D spatial proximity (Payne et al., 2021). However, IGS does not allow an adequate evaluation of enhancer-promoter interactions due to the limited resolution. Other integrative methods are comprehensively reviewed by Boninsegna et al. (2022).
Utilizing 3D genome architecture to interpret disease-related genetic variants
Advanced technologies for studying 3D genome organization have generated an increasing amount of useful data. Accompanying advances in computational methods have enabled detection and quantification of various layers of chromatin spatial organization, including topologically associating domains (TADs) (Dixon et al., 2012; Crane et al., 2015; Rocha et al., 2015; Dali and Blanchette, 2017; Forcato et al., 2017; Zufferey et al., 2018; Liu et al., 2022a; Sefer, 2022), frequently interacting regions (FIREs) (Schmitt et al., 2016a; Crowley et al., 2021), and chromatin interactions (Ay et al., 2014; Rao et al., 2014b; Xu et al., 2016a, 2016b; Carty et al., 2017; Forcato et al., 2017; Cao et al., 2020; Kaul et al., 2020; Roayaei Ardakany et al., 2020; Rowley et al., 2020; Lagler et al., 2021; Sahin et al., 2021; Yu et al., 2021) (Table 1). These valuable pieces of 3D genome architecture information have been widely used to identify candidate risk genes for non-coding GWAS variants associated with complex diseases (Smemo et al., 2014; Giorgio et al., 2015; Schmitt et al., 2016a; Lupiáñez et al., 2016; Won et al., 2016; Martin et al., 2017; Fulco et al., 2019; Crowley et al., 2021; Yu et al., 2021). For instance, disarrangement of TAD boundaries can disrupt normal regulatory architecture and possibly form new loops, resulting in gene dysregulation, eventually leading to phenotypic aberrations (Lupiáñez et al., 2015; Krijger and de Laat, 2016). At the FIRE level, overlapping GWAS variants with FIREs can help to prioritize causal variants among many of their linkage disequilibrium (LD) tags (Huang et al., 2022a) and subsequently prioritize the putative effector genes in the neighborhood of FIREs in a tissue- or cell-type-specific manner (Schmitt et al., 2016a). At the most refined chromatin loop/interaction level, interruption of enhancer-promoter interactions can alter gene expression to cause diseases (Smemo et al., 2014; Krijger and de Laat, 2016). Finally, integrative approaches combine data from multiple resources to interpret non-coding variants, such as integrating chromatin structure information with other omics data to identify significant chromatin interactions, ensembling sequencing- and imaging-based data to simulate 3D genome structures, as reviewed in Liu et al. (2022b) and Boninsegna et al. (2022).
TABLE 1.
Title | Category | Description | Year | References |
---|---|---|---|---|
A critical assessment of topologically associating domain prediction tools | TADs | Compared seven TAD calling methods | 2017 | Dali & Blanchette, (2017) |
Comparison of computational methods for Hi-C data analysis | TADs and chromatin interactions | Compared seven TAD calling methods and six chromatin interaction callers | 2017 | Forcato et al. (2017) |
Comparison of computational methods for the identification of topologically associating domains | TADs | Compared 20 TAD calling methods | 2018 | Zufferey et al. (2018) |
Computational methods for analyzing genome-wide chromosome conformation capture data | General pipeline | Reviewed pipelines and methods for 3C-based data | 2018 | Nicoletti et al. (2018) |
Computational methods for assessing chromatin hierarchy | General pipeline | Reviewed computational tools for assessing chromatin hierarchy | 2018 | Chang et al. (2018) |
Computational methods for analyzing and modeling genome structure and organization | General pipeline | Reviewed analytic and modeling techniques for 3C-based methods | 2018 | Lin et al. (2019) |
Hi-C analysis: from data generation to integration | General pipeline | Reviewed methods for Hi-C data analysis | 2019 | Pal et al. (2019) |
Comparison of computational methods for 3D genome analysis at single-cell Hi-C level | General pipeline | Compared the performance of Hi-C methods on ultra-sparse Hi-C data | 2020 | Li et al. (2020) |
Computational methods for the prediction of chromatin interaction and organization using sequence and epigenomic profiles | Prediction | Summarized 48 computational methods for predicting chromatin interactions and spatial organization features | 2021 | Tao et al. (2021) |
A comparison of topologically associating domain callers over mammals at high resolution | TADs | Compared 27 TAD calling methods | 2022 | Sefer (2022) |
A comparison of topologically associating domain callers based on Hi-C data | TADs | Compared 26 TAD calling methods | 2022 | Liu et al. (2022a) |
Bacon: a comprehensive computational benchmarking framework for evaluating targeted chromatin conformation capture-specific methodologies | Chromatin interactions | Benchmarked 12 computational pipelines for HiChIP/PLAC-seq and/or ChIA-PET data | 2022 | Tang et al. (2022) |
Hi-C data analysis tools and papers | General pipeline | A collection of Hi-C tools and papers | Accessed on 05/27/2022 | https://github.com/mdozmorov/HiC_tools |
4DN Software | General pipeline | A collection of data analysis and visualization tools for studying the 3D genome | Accessed on 05/27/2022 | https://www.4dnucleome.org/software.html |
We first review some examples using chromatin interactions to prioritize putative target genes. One of the earliest and most renowned examples was reported by Smemo et al. (2014), where the authors elegantly elucidated molecular mechanisms underlying the noncoding obesity-associated GWAS variants at the FTO locus with chromatin interactions identified from 4C-seq (van de Werken et al., 2012), a C-based method that quantifies chromatin spatial proximity between a specific region of interest and all genomic loci in its neighborhood. Specifically, long-range chromatin interactions link FTO intronic variants to their target gene IRX3 (Smemo et al., 2014). Simultaneously considering long-range chromatin interactions, epigenetic annotations, and eQTL data, we can identify and prioritize causal variants and target genes for various human diseases and traits. Studies have shown that the majority of noncoding variants interact with distal genes based on Hi-C (Song et al., 2019; Sey et al., 2020), highlighting the importance of chromatin 3D organization in prioritization and functional follow-up of GWAS variants.
As the number and size of GWAS continue to grow rapidly, increasing evidence shows that regulatory variants function in a tissue- or cell-type-specific manner (Schmitt et al., 2016a; Barbeira et al., 2018; Gallagher and Chen-Plotkin, 2018; Sun et al., 2022). Literature in the past decade has accumulated many examples where long-range chromatin interactions have aided the prioritization and establishment of target genes for GWAS variants in disease-relevant tissues and cell types. For example, SnapHiC (Yu et al., 2021), the first computational method developed to identify chromatin interactions from single cell Hi-C data, reported long-range chromatin interactions between two GWAS variants (rs112481437 and rs138137383) associated with Alzheimer’s disease and APOE, specifically in astrocytes but not in other brain cell types. Other examples include a schizophrenia (SCZ) GWAS variant (rs1191551) forming a long-range (∼760 Kb away) interaction with the promoter of FOXG1 revealed by fetal brain Hi-C data (Won et al., 2016); a long-range (>500 Kb away) interaction in liver between the promoter of FST and a type 2 diabetes (T2D)-associated SNP rs6450176, which is an intronic variant in ARL15 (Martin et al., 2017); an interaction between the promoter of BACH2 and rs72928038 (∼30 Kb away), an intronic variant in BACH2 associated with various diseases including multiple sclerosis and type 1 diabetes, detected using promoter capture Hi-C data in naive CD4+ T cells (Kundu et al., 2022), and an interaction between the promoter of GATA3 and rs3824662 (∼7 Kb), a GATA3 intronic variant associated with Philadelphia chromosome-like childhood acute lymphoblastic leukemia (Yang et al., 2022). Such tissue- or cell-type-specific long-range chromatin interactions will greatly facilitate functional experiments, accelerating the uncovery of molecular mechanisms and new therapeutic targets.
Next, we will review examples where TAD boundaries are disrupted by non-coding variations, which result in enhancer-promoter interaction changes. Specifically, impacts of non-coding variants on TADs include removing, inverting, and duplicating TAD boundaries. These changes can break regular links between enhancers and promoters present in wild type and create new links that do not exist otherwise (Figure 2A) (Yu and Ren, 2017). One example is at the LMNB1 locus, where the deletion of a TAD boundary leads to an autosomal dominant, slowly progressive, and yet fatal adult-onset demyelinating leukodystrophy (ADLD) disorder. Specifically, the LMNB1 gene becomes highly expressed due to the missing boundary leading to new chromatin interactions between the promoter of the LMNB1 gene and several other enhancers (Giorgio et al., 2015; Yu and Ren, 2017). In another example, duplication and inversion of TAD boundaries near EPHA4 and WNT6 genes cause limb malformation. Specifically, as illustrated in Figure 2, disrupted TAD boundaries lead to significantly increased WNT6 gene expression and decreased EPHA4 gene expression (Figures 2B,C), resulting in syndactyly (Lupiáñez et al., 2015; Angier, 2017). Yu and Ren (2017) provide an excellent review, covering multiple examples where aberrations in TAD boundaries lead to phenotypic abnormalities. These studies demonstrate that genetic variations around TAD boundaries can modify expression patterns of nearby genes and illustrate the importance of studying alternations in the regulatory landscape through 3D genome structure (Figures 2B,C).
Furthermore, we will introduce several examples using FIREs to prioritize causal variants and the tissues or cell types where the causal variants exert their effects. For instance, when overlapping triglycerides-GWAS variants (Willer et al., 2013) on chromosome 11 with FIREs across 14 human primary tissues and seven cell types, a liver-specific FIRE overlapped the region harboring GWAS variants (Figure 3) (Schmitt et al., 2016a). This observation suggested that liver is likely the tissue where the GWAS variants play functional roles. Although in this case liver was known to be highly relevant for lipid metabolism, this finding serves as a successful proof-of-concept where tissue- or cell-type-specific FIREs can help prioritize the most pertinent tissues or cell types. Other examples include an asthma-GWAS variant (rs755023315) (Han et al., 2020) residing in a GM12878-specific FIRE that overlaps with an immune-related gene CD70 (Schmitt et al., 2016a) and a SCZ-GWAS variant (rs9960767) residing in a hippocampus super-FIRE overlapping with the neurodevelopment related gene TCF4 (Crowley et al., 2021). Although more recently developed, FIREs have been recognized for their roles in annotating functions of non-coding variants due to their high tissue- or cell-type specificity.
In addition, target genes for GWAS variants can also be predicted from integrative analysis. For example, the Activity-by-Contact (ABC) model, combining chromatin activity and interaction information, assigns rs12740374, a GWAS variant associated with low-density lipoprotein cholesterol (LDL) to the SORT1 gene. The authors additionally reported that this variant is a liver-eQTL for SORT1 and further validated its impact on SORT1 gene expression via CRISPR genome editing in primary hepatocytes (Fulco et al., 2019). We visualize the example in Figure 4A. Consistent with predictions by the ABC model, this chromatin interaction is also detected from liver Hi-C data (Schmitt et al., 2016a) with a significant interaction between the anchor bin (the gray highlighted region) including the GWAS variant rs12740374 and the bin containing the promoter of the SORT1 gene (green highlight) (Figure 4B). The ABC model shows the possibility of using non-liver Hi-C data (K562 Hi-C data) with liver enhancer activity data (H3K27ac ChIP-seq data in liver tissue) to prioritize enhancer-promoter interactions in the liver (Fulco et al., 2019).
In addition to the specific examples we described, many other studies have been conducted to understand whether and how non-coding variations exert their functions. For example, Figure 4B shows a virtual 4C plot using the HUGIn tool (Martin et al., 2017), which was developed to visualize chromatin interactions anchored at GWAS variants or regulatory regions of interest based on a compendium of Hi-C data from 14 primary human tissues and seven human cell lines. HUGIn tool also visualizes gene expression and epigenomic data, which can further facilitate researchers to prioritize target genes at GWAS loci. For another example, the E + G + Methyl (Wu and Pan, 2019) method performs a gene-based aggregation association test by integrating enhancer-promoter interactions and methylation QTLs with GWAS summary statistics. E + G + Methyl gains statistical power to detect target genes for GWAS variants by jointly modeling these two pieces of complementary information but the availability of both (i.e., enhancer–promoter interaction data and methylation QTL data) would limit the application of E + G + Methyl. In addition, because single-variant GWAS summary statistics are used for integration, rare variants would be under-represented in E + G + Methyl analysis. Applying E + G + Methyl to study SCZ, the authors identified several novel genes associated with SCZ, which standard GWAS missed. Along the same line, Yang et al. present the eSCAN method (Yang et al., 2022) (illustrated in Figure 5), an aggregation-based association testing framework that integrates various functional annotations, including chromatin accessibility, histone marks, and chromatin spatial organization. eSCAN uses these functional annotations to define “enhancers”, or more precisely, putative regulatory elements, and performs scanning across these putative enhancer regions. The scanning approach adopted by eSCAN allows simultaneous search and refinement of associated regions within the putative enhancers, using both genotype and phenotype data, rather than testing on a priori defined genes or region units. The eSCAN method focuses on variants residing in putative enhancer regions, which can increase statistical power by reducing the search space among non-coding regions. Furthermore, with its scan feature, eSCAN tends to identify associated regions that are shorter in size, effectively achieving fine-mapping of causal variants and regions. Integration with chromatin conformation data also makes easier biological interpretation of detected regions. As an aggregation method that tests a set of variants, eSCAN may not be able to narrow down to single variant level. With higher resolution (Kb or finer) chromatin conformation data, eSCAN can potentially pinpoint individual variants. When applied to hematological traits, eSCAN pinpointed multiplied regulatory regions associated with various blood cell indices. These regions were either missed by alternative methods or in much coarser resolution. Among them, a regulator region (chr6:90, 423, 754–90,425,200) was associated with platelet count, a signal missed by standard GWAS. The gene it regulates, the BACH2 gene, is an essential immune cell regulatory factor and plays a critical role in maintaining regulatory T-cell function and B-cell maturation (Afzali et al., 2017). These methods, integrating epigenomic information, including chromatin conformation data in genetic association testing, allow discovery, refinement, and interpretation of regulatory regions associated with complex diseases and traits. We anticipate that these methods will lead to more exciting findings in the near future, particularly given chromatin conformation data accumulated in more tissues and cell types relevant to various diseases and traits.
Discussion
Knowledge of genome-wide chromatin spatial organization has been significantly advanced, particularly since 2009, with the advent of Hi-C (Lieberman-Aiden et al., 2009) and Hi-C-derived technologies. We anticipate more rapid advancement and increasingly diverse data generated with the constantly evolving sequencing- and imaging-based technologies to study 3D chromatin structure (Liu et al., 2022b). These technologies enhance our understanding of chromatin 3D organization in general and arrive timely to help interpret GWAS findings, which have successfully identified hundreds of thousands of genetic variants associated with various diseases and traits (Buniello et al., 2019). These GWAS variants, easily reaching millions when including variants that are in LD (Huang et al., 2022a) with the index variants initially detected, reside predominantly in non-coding regions of the genome (Zhang and Lupski, 2015; Martin et al., 2017) with functional mechanisms remaining elusive. There is a pressing need to link GWAS variants to their target genes in disease-relevant tissues or cell types to advance these GWAS findings from variants to function (Sullivan and Susztak, 2020; Rowland et al., 2022b; Sun et al., 2022), to improved understanding of disease etiology, to the development of new drugs, and ultimately to personalized medicine.
Despite tremendous advances in both experimental technologies and computational methods to study chromatin spatial organization, multiple challenges and gaps remain before we can fully leverage DNA 3D organization information for the interpretation of GWAS results.
First, multiple layers of biases are buried in data generated from Hi-C and other C-based technologies. For Hi-C data, both explicit and implicit normalization methods have been developed to mitigate such biases. Explicit normalization assumes that systematic biases, due to restriction enzyme cutting frequency, GC content or sequence uniqueness (Yaffe and Tanay, 2011), are known a priori, and can be removed by explicit model-based approaches (Yaffe and Tanay, 2011; Hu et al., 2012). In contrast, implicit normalization methods such as ICE, VC and KRnorm (Imakaev et al., 2012; Rao et al., 2014b) assume the presence of unknown biases and perform normalization based on equal visibility assumption (Imakaev et al., 2012). Data generated from other C-based methods suffer from additional biases. For example, capture Hi-C data suffers from probe capture efficiency bias, while HiChIP and PLAC-seq data contain bias from immunoprecipitation efficiency. Reducing or removing biases from C-based as well as imaging data remains an active research area.
Second, we still need efficient and innovative methods to integrate chromatin interaction information with complementary pieces of information. Although we review multiple approaches and methods that leverage chromatin conformation data with various other sources of data (e.g., methylation QTL for E + G + Methyl, chromatin accessibility and histone marks for eSCAN), methods that integrate additional omics data at either bulk tissue or single cell level will further enhance power to prioritize and pinpoint important functional variants, regions and genes, and potentially in tissue- or cell-type-specific manner.
Third, studying of chromatin spatial organization can further benefit from advanced machine learning or deep learning methods. Deep learning-based methods have been used for chromatin interaction prediction or Hi-C and alike data enhancement. For example, Akita (Fudenberg et al., 2020) adopts a convolutional neural network (CNN) to predict chromatin interactions using DNA sequences alone, which can be leveraged to predict the regulatory potential of GWAS variants by assessing their impact on chromatin spatial organization. For another example, HiCPlus (Zhang et al., 2018) and HiCNN (Liu and Wang, 2019), both also CNN-based, have been proposed for the enhancement of Hi-C data and show promising results when applied to enhance HiChIP and PLAC-seq data (Huang et al., 2022b). With increasing scale and complexity of the data, we anticipate deep learning-based methods can further manifest their advantages to extract non-linear and complex relationships among high-dimensional features.
Finally, as a community, we need to generate high quality, high resolution data from complementary technologies in diverse biosamples. First, we need more comprehensive compendia of chromatin conformation data. Such data holds and has been delivering on the promise of helping to fulfill the crucial variant-to-function task. Future efforts should encompass diverse tissues and cell types across developmental stages, multiple disease progression time-points, and under various natural and perturbed conditions, as provided by recent publications (Schmitt et al., 2016a; Jung et al., 2019; Song et al., 2019, 2020) and efforts within the 4D Nucleome Project (Dekker et al., 2017). Second, we need more single-cell data. Recent single-cell technologies (Zhou et al., 2021; Yu et al., 2022) have further enhanced our capabilities to characterize cell-type-specific profiles as well as to potentially reveal cell-to-cell variability, which will additionally facilitate our interpretation and understanding of GWAS results (Yu et al., 2021; Li et al., 2022). In addition, chromatin interactome profiles in population samples will also be essential to understanding the variation across individuals, the genetics behind the variation (Gorkin et al., 2019), and the consequence of such variation for the inference of the molecular causal paths via causal inference or mediation analysis (Zhong et al., 2019, 2022). Such multi-sample chromatin conformation data have emerged at the bulk level encompassing many cells (Gorkin et al., 2019; Chandra et al., 2021). Cell type deconvolution can be essential when analyzing multi-sample data from tissue samples to ensure valid inference and gain insights in a cell-type-specific manner (Figure 6) (Sefer et al., 2016; Rowland et al., 2022a). We anticipate future studies involving single-cell data, similar to multi-sample single-cell RNA-sequencing data (Ren et al., 2021; Zheng et al., 2021), which can provide insights into disease etiology at an even more refined resolution (van Buren et al., 2021, 2022; Zhang et al., 2022).
Interpretation of GWAS results has received extensive attention in the past two decades, with many alternative approaches proposed and employed to achieve the variant-to-function goal. For example, eQTL and co-localization with GWAS signals (GTEx Consortium, 2020; Kundu et al., 2022), transcriptome-wide association studies (Gamazon et al., 2015; Zhou et al., 2020; Wen et al., 2021; Tapia et al., 2022), and correlation between the epigenetic profile and expression of nearby gene(s) (Sheffield et al., 2013) are among the commonly adopted methods to identify target genes and relevant tissues and cell types for GWAS variants. Chromatin conformation data offers complementary information and has been found to enhance our capabilities in generating and prioritizing potential functional mechanisms when integrated with alternative approaches (Fulco et al., 2019; Marsha Wheeler et al., 2021; Sun et al., 2022). In addition, DNA 3D organization help us gain insights in the orchestration of different regulatory elements, revealing enhancer-enhancer networks (Beytebiere et al., 2019; di Giammartino et al., 2019), super enhancers that regulate multiple genes (Huang et al., 2018; Zhang et al., 2021), and super interactive promoters (Song et al., 2020; Wen et al., 2022) that tend to have higher extent of enhancer redundancy. We urge future studies to increasingly generate and leverage relevant chromatin 3D organization information, which will significantly facilitate advancing GWAS findings to ultimate clinical transformation.
Acknowledgments
Figures 1, 2, 5, 6 are created with BioRender (https://biorender.com/). Figure 3A is generated with LocusZoom (Pruim et al., 2010) (http://locuszoom.org/). Figure 3B is created with the R package ggplot2 (https://ggplot2.tidyverse.org/). Figure 4A is created with WashU Epigenome Browser (Li et al., 2019). Figure 4B is created with the HUGIn tool (Martin et al., 2017).
Author contributions
Conceptualization and supervision: YL and MH. Writing—Original draft preparation: WZ, WL, MH, and YL. Writing—Review and editing: WZ, WL, JC, QS, MH, and YL. Visualization: WZ, JC, and QS.
Funding
YL was partially supported by NIH grants U01DA052713, U01HG011720, R01MH125236, and R01HL146500. WL was supported by the NIH grant R01NR019245. MH was partially supported by the NIH grant R35HG011922.
Conflict of interest
Author WZ was employed by Merck Sharp & Dohme LLC, a subsidiary of Merck & Co., Inc., Rahway, NJ, USA.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
- Afzali B., Grönholm J., Vandrovcova J., O’Brien C., Sun H.-W., Vanderleyden I., et al. (2017). BACH2 immunodeficiency illustrates an association between super-enhancers and haploinsufficiency. Nat. Immunol. 18, 813–823. 10.1038/ni.3753 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Angier N. (2017). A family’s shared defect sheds light on the human genome. Available at: https://www.nytimes.com/2017/01/09/science/dna-tads.html (Accessed May 28, 2022).
- Aughey G. N., Cheetham S. W., Southall T. D. (2019). DamID as a versatile tool for understanding gene regulation. Development 146, dev173666. 10.1242/dev.173666 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ay F., Bailey T. L., Noble W. S. (2014). Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts. Genome Res. 24, 999–1011. 10.1101/gr.160374.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barbeira A. N., Dickinson S. P., Bonazzola R., Zheng J., Wheeler H. E., Torres J. M., et al. (2018). Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat. Commun. 9, 1825. 10.1038/s41467-018-03621-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beagrie R. A., Scialdone A., Schueler M., Kraemer D. C. A., Chotalia M., Xie S. Q., et al. (2017). Complex multi-enhancer contacts captured by genome architecture mapping. Nature 543, 519–524. 10.1038/nature21411 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beliveau B. J., Apostolopoulos N., Wu C. (2014). Visualizing genomes with Oligopaint FISH probes. Curr. Protoc. Mol. Biol. 105, 14–23. 10.1002/0471142727.mb1423s105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beliveau B. J., Boettiger A. N., Nir G., Bintu B., Yin P., Zhuang X., et al. (2017). In situ super-resolution imaging of genomic DNA with OligoSTORM and OligoDNA-PAINT. Methods Mol. Biol. 1663, 231–252. 10.1007/978-1-4939-7265-4_19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beliveau B. J., Joyce E. F., Apostolopoulos N., Yilmaz F., Fonseka C. Y., McCole R. B., et al. (2012). Versatile design and synthesis platform for visualizing genomes with Oligopaint FISH probes. Proc. Natl. Acad. Sci. U. S. A. 109, 21301–21306. 10.1073/pnas.1213818110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benko S., Gordon C. T., Mallet D., Sreenivasan R., Thauvin-Robinet C., Brendehaug A., et al. (2011). Disruption of a long distance regulatory region upstream of SOX9 in isolated disorders of sex development. J. Med. Genet. 48, 825–830. 10.1136/jmedgenet-2011-100255 [DOI] [PubMed] [Google Scholar]
- Beytebiere J. R., Trott A. J., Greenwell B. J., Osborne C. A., Vitet H., Spence J., et al. (2019). Tissue-specific BMAL1 cistromes reveal that rhythmic transcription is associated with rhythmic enhancer-enhancer interactions. Genes Dev. 33, 294–309. 10.1101/gad.322198.118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonev B., Mendelson Cohen N., Szabo Q., Fritsch L., Papadopoulos G. L., Lubling Y., et al. (2017). Multiscale 3D genome rewiring during mouse neural development. Cell 171, 557–572. e24. 10.1016/j.cell.2017.09.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boninsegna L., Yildirim A., Zhan Y., Alber F. (2022). Integrative approaches in genome structure analysis. Structure 30, 24–36. 10.1016/j.str.2021.12.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buniello A., MacArthur J. A. L., Cerezo M., Harris L. W., Hayhurst J., Malangone C., et al. (2019). The NHGRI-EBI gwas catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012. 10.1093/nar/gky1120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cao Y., Chen Z., Chen X., Ai D., Chen G., McDermott J., et al. (2020). Accurate loop calling for 3D genomic data with cLoops. Bioinformatics 36, 666–675. 10.1093/bioinformatics/btz651 [DOI] [PubMed] [Google Scholar]
- Carty M., Zamparo L., Sahin M., González A., Pelossof R., Elemento O., et al. (2017). An integrated model for detecting significant chromatin interactions from high-resolution Hi-C data. Nat. Commun. 8, 15454. 10.1038/ncomms15454 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chandra V., Bhattacharyya S., Schmiedel B. J., Madrigal A., Gonzalez-Colin C., Fotsing S., et al. (2021). Promoter-interacting expression quantitative trait loci are enriched for functional genetic variants. Nat. Genet. 53, 110–119. 10.1038/s41588-020-00745-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chang P., Gohain M., Yen M.-R., Chen P.-Y. (2018). Computational methods for assessing chromatin hierarchy. Comput. Struct. Biotechnol. J. 16, 43–53. 10.1016/j.csbj.2018.02.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cléard F., Moshkin Y., Karch F., Maeda R. K. (2006). Probing long-distance regulatory interactions in the Drosophila melanogaster bithorax complex using Dam identification. Nat. Genet. 38, 931–935. 10.1038/ng1833 [DOI] [PubMed] [Google Scholar]
- Crane E., Bian Q., McCord R. P., Lajoie B. R., Wheeler B. S., Ralston E. J., et al. (2015). Condensin-driven remodelling of X chromosome topology during dosage compensation. Nature 523, 240–244. 10.1038/nature14450 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crowley C., Yang Y., Qiu Y., Hu B., Abnousi A., Lipiński J., et al. (2021). FIREcaller: Detecting frequently interacting regions from Hi-C data. Comput. Struct. Biotechnol. J. 19, 355–362. 10.1016/j.csbj.2020.12.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dali R., Blanchette M. (2017). A critical assessment of topologically associating domain prediction tools. Nucleic Acids Res. 45, 2994–3005. 10.1093/nar/gkx145 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Davies J. O. J., Telenius J. M., McGowan S. J., Roberts N. A., Taylor S., Higgs D. R., et al. (2016). Multiplexed analysis of chromosome conformation at vastly improved sensitivity. Nat. Methods 13, 74–80. 10.1038/nmeth.3664 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Degner J. F., Pai A. A., Pique-Regi R., Veyrieras J.-B., Gaffney D. J., Pickrell J. K., et al. (2012). DNase I sensitivity QTLs are a major determinant of human expression variation. Nature 482, 390–394. 10.1038/nature10808 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dekker J., Belmont A. S., Guttman M., Leshyk V. O., Lis J. T., Lomvardas S., et al. (2017). The 4D nucleome project. Nature 549, 219–226. 10.1038/nature23884 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dekker J., Marti-Renom M. A., Mirny L. A. (2013). Exploring the three-dimensional organization of genomes: Interpreting chromatin interaction data. Nat. Rev. Genet. 14, 390–403. 10.1038/nrg3454 [DOI] [PMC free article] [PubMed] [Google Scholar]
- di Giammartino D. C., Kloetgen A., Polyzos A., Liu Y., Kim D., Murphy D., et al. (2019). KLF4 is involved in the organization and regulation of pluripotency-associated three-dimensional enhancer networks. Nat. Cell Biol. 21, 1179–1190. 10.1038/s41556-019-0390-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dixon J. R., Selvaraj S., Yue F., Kim A., Li Y., Shen Y., et al. (2012). Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380. 10.1038/nature11082 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Epigenomics Consortium R., Kundaje A., Meuleman W., Ernst J., Bilenky M., Yen A., et al. (2015). Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330. 10.1038/nature14248 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fang R., Yu M., Li G., Chee S., Liu T., Schmitt A. D., et al. (2016). Mapping of long-range chromatin interactions by proximity ligation-assisted ChIP-seq. Cell Res. 26, 1345–1348. 10.1038/cr.2016.137 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Forcato M., Nicoletti C., Pal K., Livi C. M., Ferrari F., Bicciato S. (2017). Comparison of computational methods for Hi-C data analysis. Nat. Methods 14, 679–685. 10.1038/nmeth.4325 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Franke M., Ibrahim D. M., Andrey G., Schwarzer W., Heinrich V., Schöpflin R., et al. (2016). Formation of new chromatin domains determines pathogenicity of genomic duplications. Nature 538, 265–269. 10.1038/nature19800 [DOI] [PubMed] [Google Scholar]
- Fudenberg G., Kelley D. R., Pollard K. S. (2020). Predicting 3D genome folding from DNA sequence with Akita. Nat. Methods 17, 1111–1117. 10.1038/s41592-020-0958-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fulco C. P., Munschauer M., Anyoha R., Munson G., Grossman S. R., Perez E. M., et al. (2016). Systematic mapping of functional enhancer-promoter connections with CRISPR interference. Science 354, 769–773. 10.1126/science.aag2445 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fulco C. P., Nasser J., Jones T. R., Munson G., Bergman D. T., Subramanian V., et al. (2019). Activity-by-contact model of enhancer-promoter regulation from thousands of CRISPR perturbations. Nat. Genet. 51, 1664–1669. 10.1038/s41588-019-0538-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fullwood M. J., Liu M. H., Pan Y. F., Liu J., Xu H., Mohamed Y., et al. (2009). An oestrogen-receptor-alpha-bound human chromatin interactome. Nature 462, 58–64. 10.1038/nature08497 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gallagher M. D., Chen-Plotkin A. S. (2018). The post-GWAS era: From association to function. Am. J. Hum. Genet. 102, 717–730. 10.1016/j.ajhg.2018.04.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gamazon E. R., Wheeler H. E., Shah K. P., Mozaffari S., Aquino-Michaels K., Carroll R., et al. (2015). A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 47, 1091–1098. 10.1038/ng.3367 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giorgio E., Robyr D., Spielmann M., Ferrero E., di Gregorio E., Imperiale D., et al. (2015). A large genomic deletion leads to enhancer adoption by the lamin B1 gene: A second path to autosomal dominant adult-onset demyelinating leukodystrophy (ADLD). Hum. Mol. Genet. 24, 3143–3154. 10.1093/hmg/ddv065 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gorkin D. U., Qiu Y., Hu M., Fletez-Brant K., Liu T., Schmitt A. D., et al. (2019). Common DNA sequence variation influences 3-dimensional conformation of the human genome. Genome Biol. 20, 255. 10.1186/s13059-019-1855-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- GTEx Consortium (2020). The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330. 10.1126/science.aaz1776 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Han Y., Jia Q., Jahani P. S., Hurrell B. P., Pan C., Huang P., et al. (2020). Genome-wide analysis highlights contribution of immune system pathways to the genetic architecture of asthma. Nat. Commun. 11, 1776. 10.1038/s41467-020-15649-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hindorff L. A., Sethupathy P., Junkins H. A., Ramos E. M., Mehta J. P., Collins F. S., et al. (2009). Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc. Natl. Acad. Sci. U. S. A. 106, 9362–9367. 10.1073/pnas.0903103106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hsieh T.-H. S., Fudenberg G., Goloborodko A., Rando O. J. (2016). Micro-C XL: Assaying chromosome conformation from the nucleosome to the entire genome. Nat. Methods 13, 1009–1011. 10.1038/nmeth.4025 [DOI] [PubMed] [Google Scholar]
- Hu M., Deng K., Selvaraj S., Qin Z., Ren B., Liu J. S. (2012). HiCNorm: Removing biases in Hi-C data via Poisson regression. Bioinformatics 28, 3131–3133. 10.1093/bioinformatics/bts570 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang H., Zhu Q., Jussila A., Han Y., Bintu B., Kern C., et al. (2021). CTCF mediates dosage- and sequence-context-dependent transcriptional insulation by forming local chromatin domains. Nat. Genet. 53, 1064–1074. 10.1038/s41588-021-00863-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang J., Li K., Cai W., Liu X., Zhang Y., Orkin S. H., et al. (2018). Dissecting super-enhancer hierarchy based on chromatin interactions. Nat. Commun. 9, 943. 10.1038/s41467-018-03279-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang L., Rosen J. D., Sun Q., Chen J., Wheeler M. M., Zhou Y., et al. (2022a). TOP-LD: A tool to explore linkage disequilibrium with TOPMed whole-genome sequence data. Am. J. Hum. Genet. 109, 1175–1181. 10.1016/j.ajhg.2022.04.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang L., Yang Y., Li G., Jiang M., Wen J., Abnousi A., et al. (2022b). A systematic evaluation of Hi-C data enhancement methods for enhancing PLAC-seq and HiChIP data. Brief. Bioinform. 23, bbac145. 10.1093/bib/bbac145 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hughes J. R., Roberts N., McGowan S., Hay D., Giannoulatou E., Lynch M., et al. (2014). Analysis of hundreds of cis-regulatory landscapes at high resolution in a single, high-throughput experiment. Nat. Genet. 46, 205–212. 10.1038/ng.2871 [DOI] [PubMed] [Google Scholar]
- Imakaev M., Fudenberg G., McCord R. P., Naumova N., Goloborodko A., Lajoie B. R., et al. (2012). Iterative correction of Hi-C data reveals hallmarks of chromosome organization. Nat. Methods 9, 999–1003. 10.1038/nmeth.2148 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jerkovic I., Cavalli G. (2021). Understanding 3D genome organization by multidisciplinary methods. Nat. Rev. Mol. Cell Biol. 22, 511–528. 10.1038/s41580-021-00362-w [DOI] [PubMed] [Google Scholar]
- Jung I., Schmitt A., Diao Y., Lee A. J., Liu T., Yang D., et al. (2019). A compendium of promoter-centered long-range chromatin interactions in the human genome. Nat. Genet. 51, 1442–1449. 10.1038/s41588-019-0494-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaul A., Bhattacharyya S., Ay F. (2020). Identifying statistically significant chromatin contacts from Hi-C data with FitHiC2. Nat. Protoc. 15, 991–1012. 10.1038/s41596-019-0273-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim H.-J., Yardımcı G. G., Bonora G., Ramani V., Liu J., Qiu R., et al. (2020). Capturing cell type-specific chromatin compartment patterns by applying topic modeling to single-cell Hi-C data. PLoS Comput. Biol. 16, e1008173. 10.1371/journal.pcbi.1008173 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krietenstein N., Abraham S., Venev S., Abdennur N., Gibcus J., Hsieh T.-H., et al. (2020). Ultrastructural details of mammalian chromosome architecture. Mol. Cell 78, 554–565. e7. 10.1016/j.molcel.2020.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krijger P. H. L., de Laat W. (2016). Regulation of disease-associated gene expression in the 3D genome. Nat. Rev. Mol. Cell Biol. 17, 771–782. 10.1038/nrm.2016.138 [DOI] [PubMed] [Google Scholar]
- Kundu K., Tardaguila M., Mann A. L., Watt S., Ponstingl H., Vasquez L., et al. (2022). Genetic associations at regulatory phenotypes improve fine-mapping of causal variants for 12 immune-mediated diseases. Nat. Genet. 54, 251–262. 10.1038/s41588-022-01025-y [DOI] [PubMed] [Google Scholar]
- Lagler T. M., Abnousi A., Hu M., Yang Y., Li Y. (2021). HiC-ACT: Improved detection of chromatin interactions from Hi-C data via aggregated cauchy test. Am. J. Hum. Genet. 108, 257–268. 10.1016/j.ajhg.2021.01.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee D.-S., Luo C., Zhou J., Chandran S., Rivkin A., Bartlett A., et al. (2019). Simultaneous profiling of 3D genome structure and DNA methylation in single human cells. Nat. Methods 16, 999–1006. 10.1038/s41592-019-0547-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee J. H., Daugharthy E. R., Scheiman J., Kalhor R., Ferrante T. C., Terry R., et al. (2015). Fluorescent in situ sequencing (FISSEQ) of RNA for gene expression profiling in intact cells and tissues. Nat. Protoc. 10, 442–458. 10.1038/nprot.2014.191 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li D., Hsu S., Purushotham D., Sears R. L., Wang T. (2019). WashU epigenome browser update 2019. Nucleic Acids Res. 47, W158–W165. 10.1093/nar/gkz348 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li X., An Z., Zhang Z. (2020). Comparison of computational methods for 3D genome analysis at single-cell Hi-C level. Methods 182, 52–61. 10.1016/j.ymeth.2019.08.005 [DOI] [PubMed] [Google Scholar]
- Li X., Lee L., Abnousi A., Yu M., Liu W., Huang L., et al. (2022). SnapHiC2: A computationally efficient loop caller for single cell Hi-C data. Comput. Struct. Biotechnol. J. 20, 2778–2783. In press. 10.1016/j.csbj.2022.05.046 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y., Hu M., Shen Y. (2018). Gene regulation in the 3D genome. Hum. Mol. Genet. 27, R228–R233. 10.1093/hmg/ddy164 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lieberman-Aiden E., van Berkum N. L., Williams L., Imakaev M., Ragoczy T., Telling A., et al. (2009). Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293. 10.1126/science.1181369 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin D., Bonora G., Yardımcı G. G., Noble W. S. (2019). Computational methods for analyzing and modeling genome structure and organization. Wiley Interdiscip. Rev. Syst. Biol. Med. 11, e1435. 10.1002/wsbm.1435 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu K., Li H., Li Y., Wang J., Wang J. (2022a). A comparison of topologically associating domain callers based on Hi-C data. IEEE/ACM Trans. Comput. Biol. Bioinform. 23, 1. 10.1109/TCBB.2022.3147805 [DOI] [PubMed] [Google Scholar]
- Liu T., Wang Z. (2019). HiCNN: A very deep convolutional neural network to better enhance the resolution of Hi-C data. Bioinformatics 35, 4222–4228. 10.1093/bioinformatics/btz251 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu W., Zhong W., Chen J., Huang B., Hu M., Li Y. (2022b). Understanding regulatory mechanisms of brain function and disease through 3D genome organization. Genes (Basel) 13, 586. 10.3390/genes13040586 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lupiáñez D. G., Kraft K., Heinrich V., Krawitz P., Brancati F., Klopocki E., et al. (2015). Disruptions of topological chromatin domains cause pathogenic rewiring of gene-enhancer interactions. Cell 161, 1012–1025. 10.1016/j.cell.2015.04.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lupiáñez D. G., Spielmann M., Mundlos S. (2016). Breaking TADs: How alterations of chromatin domains result in disease. Trends Genet. 32, 225–237. 10.1016/j.tig.2016.01.003 [DOI] [PubMed] [Google Scholar]
- Marsha Wheeler A. M., Stilp A., Rao S., Halldórsson B., Beyter D., Wen J., et al. (2021). Whole genome sequencing identifies common and rare structural variants contributing to hematologic traits in the NHLBI TOPMed program. Am. J. Hum. Genet. 108, 1836. 10.1101/2021.12.16.21267871 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin J. S., Xu Z., Reiner A. P., Mohlke K. L., Sullivan P., Ren B., et al. (2017). HUGIn: Hi-C unifying genomic interrogator. Bioinformatics 33, 3793–3795. 10.1093/bioinformatics/btx359 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mateo L. J., Murphy S. E., Hafner A., Cinquini I. S., Walker C. A., Boettiger A. N. (2019). Visualizing DNA folding and RNA in embryos at single-cell resolution. Nature 568, 49–54. 10.1038/s41586-019-1035-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mifsud B., Tavares-Cadete F., Young A. N., Sugar R., Schoenfelder S., Ferreira L., et al. (2015). Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C. Nat. Genet. 47, 598–606. 10.1038/ng.3286 [DOI] [PubMed] [Google Scholar]
- Mumbach M. R., Rubin A. J., Flynn R. A., Dai C., Khavari P. A., Greenleaf W. J., et al. (2016). HiChIP: Efficient and sensitive analysis of protein-directed genome architecture. Nat. Methods 13, 919–922. 10.1038/nmeth.3999 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nagano T., Lubling Y., Stevens T. J., Schoenfelder S., Yaffe E., Dean W., et al. (2013). Single-cell Hi-C reveals cell-to-cell variability in chromosome structure. Nature 502, 59–64. 10.1038/nature12593 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nakamura M., Gao Y., Dominguez A. A., Qi L. S. (2021). CRISPR technologies for precise epigenome editing. Nat. Cell Biol. 23, 11–22. 10.1038/s41556-020-00620-7 [DOI] [PubMed] [Google Scholar]
- Nguyen H. Q., Chattoraj S., Castillo D., Nguyen S. C., Nir G., Lioutas A., et al. (2020). 3D mapping and accelerated super-resolution imaging of the human genome using in situ sequencing. Nat. Methods 17, 822–832. 10.1038/s41592-020-0890-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nicoletti C., Forcato M., Bicciato S. (2018). Computational methods for analyzing genome-wide chromosome conformation capture data. Curr. Opin. Biotechnol. 54, 98–105. 10.1016/j.copbio.2018.01.023 [DOI] [PubMed] [Google Scholar]
- Pal K., Forcato M., Ferrari F. (2019). Hi-C analysis: From data generation to integration. Biophys. Rev. 11, 67–78. 10.1007/s12551-018-0489-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Payne A. C., Chiang Z. D., Reginato P. L., Mangiameli S. M., Murray E. M., Yao C.-C., et al. (2021). In situ genome sequencing resolves DNA sequence and structure in intact biological samples. Science 371, eaay3446. 10.1126/science.aay3446 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Project Consortium E. N. C. O. D. E. (2012). An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74. 10.1038/nature11247 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Project Consortium E. N. C. O. D. E., Moore J. E., Purcaro M. J., Pratt H. E., Epstein C. B., Shoresh N., et al. (2020). Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 583, 699–710. 10.1038/s41586-020-2493-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pruim R. J., Welch R. P., Sanna S., Teslovich T. M., Chines P. S., Gliedt T. P., et al. (2010). LocusZoom: Regional visualization of genome-wide association scan results. Bioinformatics 26, 2336–2337. 10.1093/bioinformatics/btq419 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quinodoz S. A., Ollikainen N., Tabak B., Palla A., Schmidt J. M., Detmar E., et al. (2018). Higher-order inter-chromosomal hubs shape 3D genome organization in the nucleus. Cell 174, 744–757. e24. 10.1016/j.cell.2018.05.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rao S. S. P., Huntley M. H., Durand N. C., Stamenova E. K., Bochkov I. D., Robinson J. T., et al. (2014a). A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680. 10.1016/j.cell.2014.11.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rao S. S. P., Huntley M. H., Durand N. C., Stamenova E. K., Bochkov I. D., Robinson J. T., et al. (2014b). A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680. 10.1016/j.cell.2014.11.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ren X., Wen W., Fan X., Hou W., Su B., Cai P., et al. (2021). COVID-19 immune features revealed by a large-scale single-cell transcriptome atlas. Cell 184, 1895–1913.e19. e19. 10.1016/j.cell.2021.01.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roayaei Ardakany A., Gezer H. T., Lonardi S., Ay F. (2020). Mustache: Multi-scale detection of chromatin loops from Hi-C and Micro-C maps using scale-space representation. Genome Biol. 21, 256. 10.1186/s13059-020-02167-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rocha P. P., Raviram R., Bonneau R., Skok J. A. (2015). Breaking TADs: Insights into hierarchical genome organization. Epigenomics 7, 523–526. 10.2217/epi.15.25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rowland B., Huh R., Hou Z., Crowley C., Wen J., Shen Y., et al. (2022a). Thunder: A reference-free deconvolution method to infer cell type proportions from bulk Hi-C data. PLoS Genet. 18, e1010102. 10.1371/journal.pgen.1010102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rowland B., Venkatesh S., Tardaguila M., Wen J., Rosen J. D., Tapia A. L., et al. (2022b). Transcriptome-wide association study in UK biobank Europeans identifies associations with blood cell traits. Hum. Mol. Genet. 31, ddac011. 10.1093/hmg/ddac011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rowley M. J., Poulet A., Nichols M. H., Bixler B. J., Sanborn A. L., Brouhard E. A., et al. (2020). Analysis of Hi-C data using SIP effectively identifies loops in organisms from C. elegans to mammals. Genome Res. 30, 447–458. 10.1101/gr.257832.119 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sahin M., Wong W., Zhan Y., van Deynze K., Koche R., Leslie C. S. (2021). HiC-DC+ enables systematic 3D interaction calls and differential analysis for Hi-C and HiChIP. Nat. Commun. 12, 3366. 10.1038/s41467-021-23749-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmiedel B. J., Seumois G., Samaniego-Castruita D., Cayford J., Schulten V., Chavez L., et al. (2016). 17q21 asthma-risk variants switch CTCF binding and regulate IL-2 production by T cells. Nat. Commun. 7, 13426. 10.1038/ncomms13426 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmitt A. D., Hu M., Jung I., Xu Z., Qiu Y., Tan C. L., et al. (2016a). A compendium of chromatin contact maps reveals spatially active regions in the human genome. Cell Rep. 17, 2042–2059. 10.1016/j.celrep.2016.10.061 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmitt A. D., Hu M., Ren B. (2016b). Genome-wide mapping and analysis of chromosome architecture. Nat. Rev. Mol. Cell Biol. 17, 743–755. 10.1038/nrm.2016.104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sefer E. (2022). A comparison of topologically associating domain callers over mammals at high resolution. BMC Bioinforma. 23, 127. 10.1186/s12859-022-04674-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sefer E., Duggal G., Kingsford C. (2016). Deconvolution of ensemble chromatin interaction data reveals the latent mixing structures in cell subpopulations. J. Comput. Biol. 23, 425–438. 10.1089/cmb.2015.0210 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sey N. Y. A., Hu B., Mah W., Fauni H., McAfee J. C., Rajarajan P., et al. (2020). A computational tool (H-MAGMA) for improved prediction of brain-disorder risk genes by incorporating brain chromatin interaction profiles. Nat. Neurosci. 23, 583–593. 10.1038/s41593-020-0603-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sheffield N. C., Thurman R. E., Song L., Safi A., Stamatoyannopoulos J. A., Lenhard B., et al. (2013). Patterns of regulatory activity across diverse human cell types predict tissue identity, transcription factor binding, and long-range interactions. Genome Res. 23, 777–788. 10.1101/gr.152140.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smemo S., Tena J. J., Kim K.-H., Gamazon E. R., Sakabe N. J., Gómez-Marín C., et al. (2014). Obesity-associated variants within FTO form long-range functional connections with IRX3. Nature 507, 371–375. 10.1038/nature13138 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Song M., Pebworth M.-P., Yang X., Abnousi A., Fan C., Wen J., et al. (2020). Cell-type-specific 3D epigenomes in the developing human cortex. Nature 587, 644–649. 10.1038/s41586-020-2825-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Song M., Yang X., Ren X., Maliskova L., Li B., Jones I. R., et al. (2019). Mapping cis-regulatory chromatin contacts in neural cells links neuropsychiatric disorder risk variants to target genes. Nat. Genet. 51, 1252–1262. 10.1038/s41588-019-0472-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Su J.-H., Zheng P., Kinrot S. S., Bintu B., Zhuang X. (2020). Genome-scale imaging of the 3D organization and transcriptional activity of chromatin. Cell 182, 1641–1659. e26. 10.1016/j.cell.2020.07.032 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sullivan K. M., Susztak K. (2020). Unravelling the complex genetics of common kidney diseases: From variants to mechanisms. Nat. Rev. Nephrol. 16, 628–640. 10.1038/s41581-020-0298-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun Q., Crowley C. A., Huang L., Wen J., Chen J., Bao E. L., et al. (2022). From GWAS variant to function: A study of ∼148, 000 variants for blood cell traits. HGG Adv. 3, 100063. 10.1016/j.xhgg.2021.100063 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takei Y., Yun J., Zheng S., Ollikainen N., Pierson N., White J., et al. (2021a). Integrated spatial genomics reveals global architecture of single nuclei. Nature 590, 344–350. 10.1038/s41586-020-03126-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takei Y., Zheng S., Yun J., Shah S., Pierson N., White J., et al. (2021b). Single-cell nuclear architecture across cell types in the mouse brain. Science 374, 586–594. 10.1126/science.abj1966 [DOI] [PubMed] [Google Scholar]
- Tan L., Xing D., Chang C.-H., Li H., Xie X. S. (2018). Three-dimensional genome structures of single diploid human cells. Science 361, 924–928. 10.1126/science.aat5641 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tang L., Hill M. C., Ellinor P. T., Li M. (2022). Bacon: A comprehensive computational benchmarking framework for evaluating targeted chromatin conformation capture-specific methodologies. Genome Biol. 23, 30. 10.1186/s13059-021-02597-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tao H., Li H., Xu K., Hong H., Jiang S., Du G., et al. (2021). Computational methods for the prediction of chromatin interaction and organization using sequence and epigenomic profiles. Brief. Bioinform. 22, bbaa405. 10.1093/bib/bbaa405 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tapia A. L., Rowland B. T., Rosen J. D., Preuss M., Young K., Graff M., et al. (2022). A large-scale transcriptome-wide association study (TWAS) of 10 blood cell phenotypes reveals complexities of TWAS fine-mapping. Genet. Epidemiol. 46, 3–16. 10.1002/gepi.22436 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trynka G., Sandor C., Han B., Xu H., Stranger B. E., Liu X. S., et al. (2013). Chromatin marks identify critical cell types for fine mapping complex trait variants. Nat. Genet. 45, 124–130. 10.1038/ng.2504 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Buren E., Hu M., Cheng L., Wrobel J., Wilhelmsen K., Su L., et al. (2022). TWO-SIGMA-G: A new competitive gene set testing framework for scRNA-seq data accounting for inter-gene and cell-cell correlation. Brief. Bioinform. 23, bbac084. 10.1093/bib/bbac084 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Buren E., Hu M., Weng C., Jin F., Li Y., Wu D., et al. (2021). TWO-SIGMA: A novel two-component single cell model-based association method for single-cell RNA-seq data. Genet. Epidemiol. 45, 142–153. 10.1002/gepi.22361 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van de Werken H. J. G., Landan G., Holwerda S. J. B., Hoichman M., Klous P., Chachik R., et al. (2012). Robust 4C-seq data analysis to screen for regulatory DNA interactions. Nat. Methods 9, 969–972. 10.1038/nmeth.2173 [DOI] [PubMed] [Google Scholar]
- Wen J., Lagler T. M., Sun Q., Yang Y., Chen J., Harigaya Y., et al. (2022). Super interactive promoters provide insight into cell type-specific regulatory networks in blood lineage cell types. PLoS Genet. 18, e1009984. 10.1371/journal.pgen.1009984 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wen J., Xie M., Rowland B., Rosen J. D., Sun Q., Chen J., et al. (2021). Transcriptome-wide association study of blood cell traits in african ancestry and hispanic/latino populations. Genes (Basel) 12, 1049. 10.3390/genes12071049 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Willer C. J., Schmidt E. M., Sengupta S., Peloso G. M., Gustafsson S., Kanoni S., et al. (2013). Discovery and refinement of loci associated with lipid levels. Nat. Genet. 45, 1274–1283. 10.1038/ng.2797 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Won H., de la Torre-Ubieta L., Stein J. L., Parikshak N. N., Huang J., Opland C. K., et al. (2016). Chromosome conformation elucidates regulatory relationships in developing human brain. Nature 538, 523–527. 10.1038/nature19847 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu C., Pan W. (2019). Integration of methylation QTL and enhancer-target gene maps with schizophrenia GWAS summary results identifies novel genes. Bioinformatics 35, 3576–3583. 10.1093/bioinformatics/btz161 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu Z., Zhang G., Jin F., Chen M., Furey T. S., Sullivan P. F., et al. (2016a). A hidden Markov random field-based Bayesian method for the detection of long-range chromosomal interactions in Hi-C data. Bioinformatics 32, 650–656. 10.1093/bioinformatics/btv650 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu Z., Zhang G., Wu C., Li Y., Hu M. (2016b). FastHiC: A fast and accurate algorithm to detect long-range chromosomal interactions from Hi-C data. Bioinformatics 32, 2692–2695. 10.1093/bioinformatics/btw240 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yaffe E., Tanay A. (2011). Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture. Nat. Genet. 43, 1059–1065. 10.1038/ng.947 [DOI] [PubMed] [Google Scholar]
- Yang Y., Sun Q., Huang L., Broome J. G., Correa A., Reiner A., et al. (2022). eSCAN: scan regulatory regions for aggregate association testing using whole-genome sequencing data. Brief. Bioinform. 23, bbab497. 10.1093/bib/bbab497 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yin H., Kauffman K. J., Anderson D. G. (2017). Delivery technologies for genome editing. Nat. Rev. Drug Discov. 16, 387–399. 10.1038/nrd.2016.280 [DOI] [PubMed] [Google Scholar]
- Yu M., Abnousi A., Zhang Y., Li G., Lee L., Chen Z., et al. (2021). SnapHiC: A computational pipeline to identify chromatin loops from single-cell Hi-C data. Nat. Methods 18, 1056–1059. 10.1038/s41592-021-01231-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu M., Li Y., Hu M. (2022). Mapping chromatin loops in single cells. Trends Genet. 38, 637–640. 10.1016/j.tig.2022.03.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu M., Ren B. (2017). The three-dimensional organization of mammalian genomes. Annu. Rev. Cell Dev. Biol. 33, 265–289. 10.1146/annurev-cellbio-100616-060531 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang F., Lupski J. R. (2015). Non-coding genetic variants in human disease. Hum. Mol. Genet. 24, R102–R110. 10.1093/hmg/ddv259 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang J., Yue W., Zhou Y., Liao M., Chen X., Hua J. (2021). Super enhancers-Functional cores under the 3D genome. Cell Prolif. 54, e12970. 10.1111/cpr.12970 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang M., Liu S., Miao Z., Han F., Gottardo R., Sun W. (2022). Ideas: Individual level differential expression analysis for single-cell RNA-seq data. Genome Biol. 23, 33. 10.1186/s13059-022-02605-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Y., An L., Xu J., Zhang B., Zheng W. J., Hu M., et al. (2018). Enhancing Hi-C data resolution with deep convolutional neural network HiCPlus. Nat. Commun. 9, 750. 10.1038/s41467-018-03113-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zheng L., Qin S., Si W., Wang A., Xing B., Gao R., et al. (2021). Pan-cancer single-cell landscape of tumor-infiltrating T cells. Science 374, abe6474. 10.1126/science.abe6474 [DOI] [PubMed] [Google Scholar]
- Zhong W., Darville T., Zheng X., Fine J., Li Y. (2022). Generalized multi-SNP mediation intersection-union test. Biometrics 78, 364–375. 10.1111/biom.13418 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhong W., Spracklen C. N., Mohlke K. L., Zheng X., Fine J., Li Y. (2019). Multi-SNP mediation intersection-union test. Bioinformatics 35, 4724–4729. 10.1093/bioinformatics/btz285 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou D., Jiang Y., Zhong X., Cox N. J., Liu C., Gamazon E. R. (2020). A unified framework for joint-tissue transcriptome-wide association and Mendelian randomization analysis. Nat. Genet. 52, 1239–1246. 10.1038/s41588-020-0706-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou T., Zhang R., Ma J. (2021). The 3D genome structure of single cells. Annu. Rev. Biomed. Data Sci. 4, 21–41. 10.1146/annurev-biodatasci-020121-084709 [DOI] [PubMed] [Google Scholar]
- Zhuang X. (2021). Spatially resolved single-cell genomics and transcriptomics by imaging. Nat. Methods 18, 18–22. 10.1038/s41592-020-01037-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zufferey M., Tavernari D., Oricchio E., Ciriello G. (2018). Comparison of computational methods for the identification of topologically associating domains. Genome Biol. 19, 217. 10.1186/s13059-018-1596-9 [DOI] [PMC free article] [PubMed] [Google Scholar]