Abstract
The three major anatomic regions of the human kidney include the cortex, medulla and papilla, with different functions and vulnerabilities to kidney diseases. Epigenetic mechanisms underlying these anatomic structures are incompletely understood. Here, we performed chromatin conformation capture with Hi-C and histone modification H3K4me3/H3K27me3 Cleavage Under Targets and Release Using Nuclease (CUT&RUN) sequencing on the kidney cortex, medulla and papilla dissected from one individual donor. Nuclear suspensions were generated from each region and split subjected to paired Hi-C and CUT&RUN sequencing. We evaluated the quality of next-generation sequencing data, Hi-C chromatin contact matrices and CUT&RUN peak calling. H3K4me3 and H3K27me3 histone modifications represent active and repressive gene transcription, respectively, and differences in chromatin conformation between kidney regions can be analyzed with this dataset. All raw and processed data files are publicly available, allowing researchers to survey the epigenetic landscape across regional human kidney anatomy.
Subject terms: Epigenetics, Genetics, DNA methylation
Background & Summary
The human kidneys play an essential role in waste removal, fluid balance, blood pressure regulation and multiple endocrine functions. Chronic kidney disease is characterized by kidney function decline and affects over 800 million individuals worldwide1–4. The cortex, medulla and papilla are three major anatomic structures of the human kidney. The three regions contain different populations of kidney cell types and present distinct vulnerability to various kidney diseases. For example, proximal tubular cell, predominantly present in the kidney cortex, is the major target of diabetic kidney disease and drug responses5–8. The kidney medulla and papilla, on the other hand, are more susceptible to cystic kidney disease, autosomal-dominant tubulointerstitial kidney disease and renal hypodysplasia9. Therefore, understanding the multiomic signatures of these different kidney anatomic regions is crucial to identify new therapeutic targets.
Gene expression in mammalian cells is precisely controlled by epigenetic mechanisms such as chromatin modifications and regulation of 3-dimensional (3D) chromatin architectures10–15. We recently profiled transcriptomics, open chromatin accessibility and metabolomics profiles from cells of cortex, medulla and papilla in human kidneys16,17, but other epigenomic modalities, including chromatin conformation18 and histone modification19 profiles, across these kidney anatomic regions, remain unexplored. Although the 3D chromatin architecture or histone modifications such as H3K4me3 and H3K27Ac have been profiled in primary human kidney tissues20–24, these studies only included the cortex in their sample cohorts. A recent study successfully performed Hi-C (a technology of chromatin conformation capture) on human kidney cortex and medulla samples25, but histone modification profiling was still lacking.
A major challenge in profiling both chromatin conformation and histone modifications from the same sample is that sequencing each modality requires a large number of cells or nuclei as input in library preparation. Here, we leveraged our recently described high-yield nuclei extraction method26,27 and performed Hi-C and Cleavage Under Targets and Release Using Nuclease (CUT&RUN) sequencing of histone modifications (H3K4me3 and H3K27me3) in parallel on the kidney cortex, medulla and papilla samples dissected from one individual donor (Fig. 1). This dataset of simultaneous profiling of chromatin conformation and histone modifications is publicly available28,29, allowing investigation of the epigenetic landscape across regional human kidney anatomy.
Fig. 1.
Workflow schematic. Kidney cortex, medulla and papilla tissues were obtained from a healthy donor. Nuclei were extracted from these tissues and processed for two epigenomic profiling assays, Hi-C and CUT&RUN, in parallel.
Methods
Donor description
This research complies with all relevant ethical regulations and has been approved by the Washington University Institutional Review Board. Human kidney cortex, medulla and papilla tissue samples were obtained from the Washington University Kidney Translational Research Core Biobank with consent managed by the core. The kidney sample was discarded for transplantation and was from a deceased organ donor (67-year-old female). These studies were approved by the Washington University in St. Louis Institutional Review Board, protocol #201601020. This approval included consent to publish genomic data. The donor has a normal kidney function, with creatinine level at sampling as 0.6 mg/dL and with mild interstitial fibrosis. All tissues samples were frozen with liquid nitrogen and stored at −80 °C before use. To be noted, the aim of this study is to provide a high-quality paired Hi-C and CUT&RUN sequencing dataset on a healthy human control, instead of including a large cohort of samples to generate new biological hypotheses.
Nuclear suspension generation
Generation of nuclear suspension from frozen kidney samples was performed as previously described with minor modifications26,27. Briefly, Nuclei EZ Lysis Buffer (NUC101, Sigma) was supplemented with EDTA-free protease inhibitor tablets (5892791001, Roche). Tissues were exposed to the ice-cold lysis buffer, minced with a razor blade and homogenized with Dounce Tissue Grinders (885303-0002, Kimble) with the large pestle. The homogenate was filtered through a 200-µm Mini Strainer (43-10200-60, pluriSelect) and cell homogenization was performed with the Dounce Tissue Grinder again with the small pestle. The homogenate was incubated in the buffer for 3 minutes and then filtered through a 40-µm Mini Strainer (43-10040-60, pluriSelect). The homogenate was centrifuged at 500 × g for 4 minutes at 4 °C and the pellet was resuspended with the lysis buffer. After 5-minute incubation, the suspension was centrifuged at 500 × g for 4 minutes at 4 °C and the nuclei pellet was resuspended with 2 mL nuclei buffer (10 mM Tris-HCl pH 7.5, 10 mM NaCl, 3 mM MgCl2) supplemented with 3% BSA (B9000S, NEB) by pipetting 10 times. The concentration of nuclear suspension was counted with a Countess Automated Cell Counter.
Hi-C library generation
Hi-C was performed with the Arima-HiC kit (A510008, Arima Genomics) following the manufacture’s manual with minor changes. Briefly, we transferred 5–7 million fresh nuclei and reconstituted the volume to 5 mL with nuclei buffer (10 mM Tris-HCl pH 7.5, 10 mM NaCl, 3 mM MgCl2) supplemented with 3% BSA (B9000S, NEB). Then, crosslinking was performed according to the manufacture’s protocol (“Crosslinking – Standard Input” section in A160134 v01) by adding 286 µL 37% formaldehyde (final concentration 2%). The BSA-supplemented nuclei buffer was consistently used during nuclei washes. Nuclei were aliquoted with 1 million nuclei per aliquot and stored at −80 °C for subsequent reactions. Each aliquot containing 1 million nuclei is considered as a replicate and we included two replicates for each kidney anatomic region in this study.
The 1-million nuclei pellet was resuspended with 25 µL nuclease-free water and 20 µL was proceeded with the manufacture’s protocol (“Arima-HiC Protocol” section in A160134 v01) for restriction enzyme digestion, end filling, ligation and purification. Hi-C library preparation was performed with the Arima Library Prep Kit v2 (A303011, Arima Genomics) with minor changes. Briefly, DNA fragmentation was performed with a Diagenode Bioruptor Pico device (6-7 cycles of 15 seconds’ on and 90 seconds’ off on 100 µL DNA products). DNA size selection, biotin enrichment and library amplification were performed according to the manufacture’s protocol (A160432 v02). A total of 8 cycles were used for library amplification and the final library was eluted with 25 µL nuclease-free water. The final libraries were visualized and quantified with a TapeStation instrument, with a representative Hi-C library trace shown in Fig. 2a. The i7 and i5 index sequences of each library are presented in Table 1.
Fig. 2.
TapeStation profiles of representative Hi-C and CUT&RUN libraries. (a) TapeStation HSD1000 gel image of the ladder and a Hi-C library (left) and the electropherogram trace of the Hi-C library (right). (b) TapeStation HSD5000 gel image of the ladder and an H3K4me3 CUT&RUN library (left) and the electropherogram trace of the CUT&RUN library (right).
Table 1.
Index sequences of 6 Hi-C libraries presented in this study.
| Region | Replicate | i7 | i5 |
|---|---|---|---|
| Cortex | Replicate1 | AGTCGCGA | TAGACCAA |
| Replicate2 | CGGTAGAG | AGTCGCGA | |
| Medulla | Replicate1 | TCAGCATC | AAGGAGCG |
| Replicate2 | AGAAGCAA | TCAGCATC | |
| Papilla | Replicate1 | GCAGGTTC | AGAAGCAA |
| Replicate2 | AAGTGTCT | GCAGGTTC |
CUT&RUN library generation
CUT&RUN was performed with the CUTANA™ ChIC / CUT&RUN Kit v3 (Cat# 14-1048, EpiCypher) with minor modifications. Briefly, after counting the concentration of the nuclear suspension, the suspension was spined down at 600 × g for 4 minutes at 4 °C and reconstituted to 5 million/mL with nuclei buffer (10 mM Tris-HCl pH 7.5, 10 mM NaCl, 3 mM MgCl2). Then, 100 µL suspension containing 500,000 nuclei was mixed with 10 µL activated EpiCypher ConA Beads. Each 100 µL aliquot containing 500,000 nuclei is considered as a replicate and we included two replicates for each reaction in this study.
For antibody binding, 0.5 µg H3K4me3 (Cat # 13-0041k, EpiCypher) and H3K27me3 (Cat # 13-0055, EpiCypher) antibodies were used and 0.5 µg rabbit IgG antibody (Cat # 13-0042k, EpiCypher) was used as a negative control. As recommended by the manufacture’s manual, a SNAP-CUTANA™ K-MetStat Panel (Cat # 19-1002k, EpiCypher), which contains 16 different DNA-barcoded histone modifications, was added into the H3K4me3 and IgG samples as spike-in controls before antibody binding. pAG-MNase binding, chromatin digestion and DNA purification were performed following the manufacture’s protocol (User Manual v3.3). 0.5 ng E. coli Spike-in DNA (Cat# 18-1401, EpiCypher) was added to each reaction for downstream data normalization.
Library preparation was performed with the NEBNext® Ultra™ II DNA Library Prep Kit (E7645S, NEB), with PCR amplification parameters as [98 °C for 45 seconds, 13 cycles of (98 °C for 15 seconds, 60 °C for 10 seconds), 72 °C for 60 seconds and 4 °C hold]. NEBNext® Multiplex Oligos (E6440S, NEB) were used for library indexing, with the i7 and i5 index sequences of each library presented in Table 2. The final libraries were visualized and quantified with a TapeStation instrument, with a representative library trace shown in Fig. 2b, which successfully indicates the periodicity of chromatin structures.
Table 2.
Index sequences of 18 CUT&RUN libraries presented in this study.
| Region | Modification | Replicate | i7 | i5 |
|---|---|---|---|---|
| Cortex | H3K27me3 | Replicate1 | TCCACGTT | GGTTGAAC |
| Replicate2 | AACCAGAG | CTTCGGTT | ||
| H3K4me3 | Replicate1 | GTCAGTCA | CGGCATTA | |
| Replicate2 | CCTTCCAT | CACGCAAT | ||
| IgG | Replicate1 | AGGAACAC | GGAATGTC | |
| Replicate2 | CTTACAGC | TGGTGAAG | ||
| Medulla | H3K27me3 | Replicate1 | CAGTGCTT | TTCCAGGT |
| Replicate2 | TCCATTGC | TACGGTCT | ||
| H3K4me3 | Replicate1 | GTCGATTG | AAGACCGT | |
| Replicate2 | ATAACGCC | CAGGTTCA | ||
| IgG | Replicate1 | GCCTTAAC | TAGGAGCT | |
| Replicate2 | GGTATAGG | TACTCCAG | ||
| Papilla | H3K27me3 | Replicate1 | GTAAGGTG | CGAATTGC |
| Replicate2 | CGAGAGAA | GGAAGAGA | ||
| H3K4me3 | Replicate1 | CGCAACTA | TCGGATTC | |
| Replicate2 | CACAGACT | CTGTACCA | ||
| IgG | Replicate1 | TGGAAGCA | GAGAGTAC | |
| Replicate2 | CAATAGCC | TCTACGCA |
Next-generation sequencing
All 6 dual-indexed Hi-C libraries and 18 dual-indexed CUT&RUN libraries were sequenced on a 300-cycle NovaSeq X Plus platform (Illumina). Sequencing depths of all Hi-C and CUT&RUN libraries are summarized in Tables 3 and 4, respectively. Hi-C libraries were sequenced with an average depth of 578 million reads and CUT&RUN libraries were sequenced with an average depth of 19 million reads.
Table 3.
Sequencing depths of Hi-C libraries presented in this work.
| Region | Replicate | Total Reads |
|---|---|---|
| Cortex | Replicate1 | 552,451,162 |
| Replicate2 | 573,392,643 | |
| Medulla | Replicate1 | 482,596,289 |
| Replicate2 | 623,193,489 | |
| Papilla | Replicate1 | 677,546,608 |
| Replicate2 | 559,329,612 |
Table 4.
Sequencing depths of CUT&RUN libraries presented in this work.
| Region | Modification | Replicate | Total Reads | E. coli Reads (Bowtie2 alignment) |
|---|---|---|---|---|
| Cortex | H3K27me3 | Replicate1 | 17,939,046 | 57,656 |
| Replicate2 | 15,548,369 | 39,133 | ||
| H3K4me3 | Replicate1 | 29,670,340 | 165,311 | |
| Replicate2 | 20,971,242 | 114,477 | ||
| IgG | Replicate1 | 20,486,985 | 134,230 | |
| Replicate2 | 17,107,957 | 138,626 | ||
| Medulla | H3K27me3 | Replicate1 | 19,927,691 | 110,202 |
| Replicate2 | 25,289,963 | 126,381 | ||
| H3K4me3 | Replicate1 | 13,429,807 | 105,156 | |
| Replicate2 | 26,763,620 | 363,567 | ||
| IgG | Replicate1 | 16,126,257 | 105,183 | |
| Replicate2 | 21,147,369 | 170,977 | ||
| Papilla | H3K27me3 | Replicate1 | 24,922,313 | 152,521 |
| Replicate2 | 20,944,175 | 123,093 | ||
| H3K4me3 | Replicate1 | 23,117,763 | 304,089 | |
| Replicate2 | 4,119,522 | 46,513 | ||
| IgG | Replicate1 | 15,885,706 | 190,892 | |
| Replicate2 | 13,094,707 | 138,635 |
Hi-C data processing and analysis
Preprocessing of paired-end raw .fastq files of Hi-C sequencing data were performed with Juicer v1.630, including genome alignment, read sorting, duplicate removal and generation of .hic contact matrices, with resolutions of 5 K, 10 K, 25 K, 50 K, 100 K, 250 K, 500 K, 1000 K and 2500 K bases. Human hg19 (primary assembly of GRCh37) was used as the reference genome. The hg19 .fasta file was used to create an Arima Hi-C restriction enzyme site position file with the generate_site_positions.py script provided by Juicer. All other data preprocessing procedures were performed following the Arima-HiC bioinformatics manual (A160600 version 07/19/2021). Only .hic files with mapping quality over 30 (MAPQ >= 30) were used for downstream analysis in this study.
Finding chromatin contact domains was performed with Arrowhead (implemented in Juicer tools v1.6.2) as previously described at a resolution of 5000 bp31. This analytical pipeline was first performed on individual replicate .fastq files, and after quality check, was performed again on .fastq files merged from two replicates of the same kidney anatomic region. Data visualization was performed with Juicebox v1.8.8 and WashU Epigenome Browser32. Data files generated through hg38 alignment are also available at 10.5281/zenodo.1195517529.
CUT&RUN data processing and analysis
The paired-end raw.fastq files of CUT&RUN sequencing data were first trimmed with TrimGalore v0.6.4_dev (https://github.com/FelixKrueger/TrimGalore). The reads were aligned to the human hg19 (primary assembly of GRCh37) genome or the E. coli genome (E. coli strain K12, MG1655; available at https://support.illumina.com/sequencing/sequencing_software/igenome.html) for .sam file generation with Bowtie2 v2.3.5.133. The number of E. coli reads of each CUT&RUN library is summarized in Table 4. The .sam files post hg19 genome alignment were sorted and indexed with samtools v1.1034 to generate sorted.bam files. For each library, bigwig file were subsequently generated with bamCoverage v3.5.435. This analytical pipeline was first performed on individual replicate .fastq files, and after quality check, was performed again on .fastq files merged from two replicates of the same kidney anatomic region. Evaluation of the SNAP-CUTANA™ K-MetStat Panel was performed on H3K4me3 and IgG libraries with scripts provided by the EpiCypher manufacture.
The peak calling step was performed using the Sparse Enrichment Analysis for CUT&RUN (SEACR) workflow developed by Meers et al.36 Previously sorted .bam files were first converted to .bed files using the bamtobed function of bedtools. Then, the 5′ and 3′ coordinates of the read pairs were selected to generate a new .bed file, which was subsequently converted to .bedgraph format using the genomecov function of bedtools. The scale factor used in this step followed the Epicypher CUTANA manual, which measures the ratio of mapped reads between each histone marker and its corresponding E. coli Spike-in DNA. After generating the .bedgraph files, SEACR was used to call peaks between each histone modification marker and its region-specific IgG control. The called peaks were exported as .peaks.stringent.bed files for downstream analysis using R v4.2.2. R packages circlize37 and karyoploteR38 were utilized to generate the density and coverage plots of peaks on each chromosome. The R package ChIPseeker was employed for annotating and visualizing the genomic locations of each peak39,40. Data files generated through hg38 alignment are also available at 10.5281/zenodo.1195517529.
Data Records
All primary data have been deposited in NCBI’s Gene Expression Omnibus and are available through GEO Series accession number GSE25363428, including (1) raw fastq files of Hi-C sequencing data, (2) processed Hi-C contact matrices for all Hi-C libraries, (3) raw fastq files of CUT&RUN sequencing data and (4) processed bigwig files for all CUT&RUN libraries. Supporting data files, including donor clinical information and intermediate data that are not compliant with the GEO requirement, are available at 10.5281/zenodo.1195517529.
Technical Validation
Hi-C sequencing data evaluation
We analyzed the quality control metrics generated by Juicer30 during Hi-C data preprocessing. A total of 687, 797, 625, 675, 269, 740 and 808,998,165 Hi-C contacts were identified in the human kidney cortex, medulla and papilla, respectively. As indicated by the Arima Hi-C manual, high-quality Hi-C libraries are typically characterized by a ratio of alignable reads to total reads over 80% and a ratio of Hi-C contact reads to alignable reads over 60%, and we found the values of all 6 libraries generated in this work are above these thresholds (Fig. 3a,b). Our library quality was also validated by fraction of chimeric ambiguous reads and fraction of unmapped reads lower than their corresponding thresholds for all libraries (Fig. 3c,d).
Fig. 3.
Quality control metrics of Hi-C libraries. Ratios of the alignable read number to the total read number (a), the identified Hi-C contact number to the alignable read number (b), the chimeric ambiguous read number to the total read number (c), the unmapped read number to the total read number (d), the long-cis interaction number to the total Hi-C contact number (e) and the intra-chromosomal interaction number to the total Hi-C contact number (f). Libraries with values above the indicated threshold (a/b/e) or below the indicated threshold (c/d) are considered with high quality.
Since the Hi-C technology mostly captures long-range 3D interactions within chromosomes18, we examined ratios of long-cis interactions with a distance greater than 20kb to total Hi-C contact numbers, and identified that for all 6 libraries, over half of the Hi-C contacts present long-cis interactions, with ratios higher than the quality threshold (Fig. 3e). Over 70% of Hi-C contacts are enriched for intra-chromosomal interactions for all libraries (Fig. 3f), consistent with expectation.
Evaluation of Hi-C contact matrices
We projected all Hi-C contacts onto the hg19 genome for all kidney anatomic regions. As mentioned above, most Hi-C contacts are enriched for intra-chromosomal interactions (Fig. 4a) for all autosomes and the X chromosome. The Hi-C contact maps of kidney cortex, medulla and papilla share a similar pattern when we examined them across all chromosomes (Fig. 4a), as a result of conserved chromatin conformation in mammalian cells. On the other hand, distinct Hi-C contact profiles can be observed when examined within a chromosome or specific genomic regions (Fig. 4b), indicating different chromatin interaction networks across kidney anatomic regions.
Fig. 4.
Hi-C contact matrix analysis across the genome. (a) Visualization of Hi-C contact matrices of human kidney cortex, medulla and papilla across all chromosomes. Maximum contact score is set as 10,000 and contacts with score lower than 500 are filtered for the convenience of data visualization. (b) Visualization of differences of Hi-C contact matrices between kidney cortex and papilla (top), cortex and medulla (middle) and medulla and papilla (bottom) in chromosome 10. The red color indicates a positive value and the blue color indicates a negative value after subtraction. Matrix files generated from merged duplicates are used.
For example, chromatin contact domain calling analysis with Arrowhead31 identified a domain located nearby BIN1 promoter region in the kidney cortex, but not in kidney medulla and papilla, and we observed more Hi-C contacts within this region in the kidney cortex (Fig. 5a). This observation is concordant with the identification of high BIN1 gene expression in proximal tubular cells in recent single-cell studies6,41,42. We recently described increased RUNX1 gene expression and chromatin accessibility along the human kidney corticopapillary axis16, and consistently, our Hi-C sequencing data revealed a domain covering RUNX1 with more chromatin interactions in medulla and papilla than that in the kidney cortex (Fig. 5b).
Fig. 5.
Visualization of chromatin interactions for specific genomic loci. (a) Increased Hi-C contacts in kidney cortex than medulla and papilla, nearby BIN1 promoter region. (b) Increased Hi-C contacts in kidney medulla and papilla than cortex in a RUNX1-containing region. Data are visualized at a resolution of 5,000 bp and Square-Root Vanilla-Coverage normalization is used for visualization.
CUT&RUN sequencing data evaluation
For all 16 CUT&RUN libraries, an average of 97.8% of the total reads (from 97.09% to 98.62%) were mapped to the hg19 genome. As mentioned in Methods, E. coli Spike-in DNA was added to each library (Table 4) and we found an average of 0.8% of the total reads (from 0.26% to 1.39%) were mapped to the E. coli genome during Bowtie2 alignment, consistent with expectation. For H3K4me3 and IgG libraries in which the SNAP-CUTANA K-MetStat Panel was added as spike-in controls (see Methods), the ratio of reads containing exogenous histone modification barcodes to the total reads is consistently <0.6%. Analyzing the composition of the K-MetStat Panel revealed an enrichment for H3K4me3 barcodes in the H3K4me3 CUT&RUN libraries, indicating a strong on-target effect (Fig. 6a). Since H3K4me3 typically marks the transcription start sites (TSSs) and indicates active transcription, and H3K27me3 is a regressive histone marker20,43–45, we examined CUT&RUN signals of the two histone modification markers, and identified sharp H3K4me3 peaks localized to TSSs at gene promoter regions, as well as broad H3K27me3 peaks over repressive genomic regions, suggesting successful chromatin modification profiling (Fig. 6b).
Fig. 6.
CUT&RUN data evaluation. (a) Heatmap showing percentage of reads mapping to each histone modification in the K-MetStat Panel in the H3K4me3 and IgG libraries. (b) Visualization of CUT&RUN signals on a selected genomic region. Bigwig files generated from merged duplicates are used.
CUT&RUN data peak calling evaluation
To evaluate the efficacy of our CUT&RUN dataset in downstream analysis, we performed peak calling analysis to identify genome-wide CUT&RUN signal-enriched regions for each library with a computational pipeline previously described36, where the IgG CUT&RUN profile of each kidney region was subtracted as a negative control to eliminate background noise. A total of 8726, 2153 and 4955 H3K4me3 peaks, and a total of 10864, 5125 and 24184 H3K27me3 peaks, were identified in the human kidney cortex, medulla and papilla, respectively. Many genomic regions show different relative abundances of H3K4me3 and H3K27me3 peaks. For example, a chromosome 19 region exhibits low presence of H3K4me3 peaks while high abundance of H3K27me3 peaks across the three kidney anatomic regions (Fig. 7a,b; highlighted with arrows; Fig. 7c for a zoom-in view). Differences in peak distribution across three kidney regions are evident, with coverage plots of representative chromosomes shown in Fig. 7d,e. As another example, for kidney cortical genome-wide H3K4me3 peaks, there are 97.3% of medullary H3K4me3 peaks exhibiting genomic overlaps with them, while only 16.0% of cortical H3K27me3 peaks coincide with them, supporting existing knowledge that the two histone modifications typically mark distinct genomic regions.
Fig. 7.
Visualization of CUT&RUN peaks. (a,b) Circos plots for genome-wide H3K4me3 (a) and H3K27me3 (b) peak profiles across kidney cortex (outer layer), medulla (middle layer) and papilla (inner layer). Sizes of each chromosome are shown. A region of interest on chromosome 19 is highlighted with arrows and is shown with zoom-in (c). (d,e) Coverage plots of the called peaks over chromosome 3 (d) and chromosome 4 (e) for each CUT&RUN library (with duplicates merged).
Furthermore, annotation of genomic locations for the called peaks revealed that over 75% of the H3K4me3 peaks reside within the promoter regions, being less than or equal to 1 kb from the TSSs, for all three kidney regions (Fig. 8). Conversely, 34%–51% of the H3K27me3 peaks are located in distal intergenic regions, indicative of a preference for a cis-regulatory role. These observations are consistent with previous reports46,47.
Fig. 8.
Genomic location annotations of CUT&RUN peaks. Distribution of genomic locations of the called peaks for each library (with duplicates merged). Annotations was provided by ChIPseeker39,40.
Acknowledgements
These experiments were funded by NIH grants DK103740, UC2DK126024 and U54DK137332 to BDH. The other authors declare no funding support related to this work. The authors acknowledge Washington University Genome Technology Access Center for next-generation sequencing support. The authors also appreciate suggestions from personnel in Ting Wang’s laboratory at Washington University in St. Louis in Hi-C library preparation.
Author contributions
Haikuo Li: Conceptualization, Methodology, Data curation, Writing, Original draft preparation. Dian Li: Conceptualization, Methodology, Data curation, Writing, Original draft preparation. Benjamin D. Humphreys: Supervision, Writing – Reviewing and Editing.
Code availability
Scripts for Hi-C sequencing data preprocessing, CUT&RUN sequencing data preprocessing and analysis relevant to figures in this study are available at https://github.com/TheHumphreysLab/kidney_anatomy_chromatin. Versions of all required packages are described in the scripts and Methods.
Competing interests
The authors declare that they have no known competing financial or non-financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Haikuo Li, Dian Li.
References
- 1.Kovesdy, C. P. Epidemiology of chronic kidney disease: an update 2022. Kidney International Supplements vol. 12, 10.1016/j.kisu.2021.11.003 (2022). [DOI] [PMC free article] [PubMed]
- 2.Kalantar-Zadeh K, Jafar TH, Nitsch D, Neuen BL, Perkovic V. Chronic kidney disease. Lancet. 2021;398:786–802. doi: 10.1016/S0140-6736(21)00519-5. [DOI] [PubMed] [Google Scholar]
- 3.Coresh J, et al. Prevalence of chronic kidney disease in the United States. J. Am. Med. Assoc. 2007;298:2038–2047. doi: 10.1001/jama.298.17.2038. [DOI] [PubMed] [Google Scholar]
- 4.Li, H. & Humphreys, B. D. Targeting de novo lipogenesis to mitigate kidney disease. J. Clin. Invest. 134, (2024). [DOI] [PMC free article] [PubMed]
- 5.Wu H, et al. Mapping the single-cell transcriptomic response of murine diabetic kidney disease to therapies. Cell Metab. 2022;34:1064–1078.e6. doi: 10.1016/j.cmet.2022.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Wilson PC, et al. Multimodal single cell sequencing implicates chromatin accessibility and genetic background in diabetic kidney disease progression. Nat. Commun. 2022;13:1–20. doi: 10.1038/s41467-022-32972-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wilson PC, et al. The single-cell transcriptomic landscape of early human diabetic nephropathy. Proc. Natl. Acad. Sci. USA. 2019;116:19619–19625. doi: 10.1073/pnas.1908706116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Li H, Dixon EE, Wu H, Humphreys BD. Comprehensive single-cell transcriptional profiling defines shared and unique epithelial injury responses during kidney fibrosis. Cell Metab. 2022;34:1977–1998.e9. doi: 10.1016/j.cmet.2022.09.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Song R, Yosypiv IV. Development of the kidney medulla. Organogenesis. 2012;8:10–17. doi: 10.4161/org.19308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Jaenisch, R. & Bird, A. Epigenetic regulation of gene expression: How the genome integrates intrinsic and environmental signals. Nature Genetics vol. 33, 10.1038/ng1089 (2003). [DOI] [PubMed]
- 11.Ding H, Zhang L, Yang Q, Zhang X, Li X. Epigenetics in kidney diseases. Adv. Clin. Chem. 2021;104:233–297. doi: 10.1016/bs.acc.2020.09.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Gibney ER, Nolan CM. Epigenetics and gene expression. Hered. 2010;105:4–13. doi: 10.1038/hdy.2010.54. [DOI] [PubMed] [Google Scholar]
- 13.Dai Z, Ramesh V, Locasale JW. The evolving metabolic landscape of chromatin biology and epigenetics. Nat. Rev. Genet. 2020;21:737–753. doi: 10.1038/s41576-020-0270-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Li H, Humphreys BD. Single Cell Technologies: Beyond Microfluidics. Kidney360. 2021;2:1196–1204. doi: 10.34067/KID.0001822021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Muto, Y., Li, H. & Humphreys, B. D. Single Cell Transcriptomics. in Innovations in Nephrology: Breakthrough Technologies in Kidney Disease Care 87–102, 10.1007/978-3-031-11570-7_5 (Springer, Cham, 2022).
- 16.Li H, et al. Transcriptomic, epigenomic, and spatial metabolomic cell profiling redefines regional human kidney anatomy. Cell Metab. 2024;36:1105–1125.e10. doi: 10.1016/j.cmet.2024.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Li H, Humphreys BD. Spatially resolved metabolomic dataset of distinct human kidney anatomic regions. Data Br. 2024;54:110431. doi: 10.1016/j.dib.2024.110431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.McCord RP, Kaplan N, Giorgetti L. Chromosome Conformation Capture and Beyond: Toward an Integrative View of Chromosome Structure and Function. Mol. Cell. 2020;77:688–708. doi: 10.1016/j.molcel.2019.12.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Stillman B. Histone Modifications: Insights into Their Influence on Gene Expression. Cell. 2018;175:6–9. doi: 10.1016/j.cell.2018.08.032. [DOI] [PubMed] [Google Scholar]
- 20.Yoshimura Y, et al. A single-cell multiomic analysis of kidney organoid differentiation. Proc. Natl. Acad. Sci. 2023;120:e2219699120. doi: 10.1073/pnas.2219699120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Eun, M. et al. Chromatin accessibility analysis and architectural profiling of human kidneys reveal key cell types and a regulator of diabetic kidney disease. Kidney Int. 0, (2023). [DOI] [PubMed]
- 22.Mimura I, Kanki Y, Kodama T, Nangaku M. Revolution of nephrology research by deep sequencing: ChIP-seq and RNA-seq. Kidney Int. 2014;85:31–38. doi: 10.1038/ki.2013.321. [DOI] [PubMed] [Google Scholar]
- 23.Luo Y, et al. New developments on the Encyclopedia of DNA Elements (ENCODE) data portal. Nucleic Acids Res. 2020;48:D882–D889. doi: 10.1093/nar/gkz1062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Gisch DL, et al. The chromatin landscape of healthy and injured cell types in the human kidney. Nat. Commun. 2024;15:1–21. doi: 10.1038/s41467-023-44467-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Haug, S. et al. Multi-omic analysis of human kidney tissue identified medulla-specific gene expression patterns. Kidney Int. 0, (2023). [DOI] [PMC free article] [PubMed]
- 26.Li H, Humphreys BD. Mouse kidney nuclear isolation and library preparation for single-cell combinatorial indexing RNA sequencing. STAR Protoc. 2022;3:101904. doi: 10.1016/j.xpro.2022.101904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Li H, Humphreys BD. Protocol for multimodal profiling of human kidneys with simultaneous high-throughput ATAC and RNA expression with sequencing. STAR Protoc. 2024;5:103049. doi: 10.1016/j.xpro.2024.103049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Li H, Humphreys BD. 2024. GEO. //identifiers.org/geo/GSE253634
- 29.Li H. 2024. Supporting data for ‘Chromatin conformation and histone modification profiling across human kidney anatomic regions’. Zenodo. [DOI] [PMC free article] [PubMed]
- 30.Durand NC, et al. Juicer Provides a One-Click System for Analyzing Loop-Resolution Hi-C Experiments. Cell Syst. 2016;3:95–98. doi: 10.1016/j.cels.2016.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Rao SSP, et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159:1665–1680. doi: 10.1016/j.cell.2014.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Li D, Hsu S, Purushotham D, Sears RL, Wang T. WashU Epigenome Browser update 2019. Nucleic Acids Res. 2019;47:W158–W165. doi: 10.1093/nar/gkz348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Li H, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Ramírez F, et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Meers MP, Tenenbaum D, Henikoff S. Peak calling by Sparse Enrichment Analysis for CUT&RUN chromatin profiling. Epigenetics and Chromatin. 2019;12:1–11. doi: 10.1186/s13072-019-0287-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Gu Z, Gu L, Eils R, Schlesner M, Brors B. circlize Implements and enhances circular visualization in R. Bioinformatics. 2014;30:2811–2812. doi: 10.1093/bioinformatics/btu393. [DOI] [PubMed] [Google Scholar]
- 38.Gel B, Serra E. karyoploteR: an R/Bioconductor package to plot customizable genomes displaying arbitrary data. Bioinformatics. 2017;33:3088–3090. doi: 10.1093/bioinformatics/btx346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wang Q, et al. Exploring Epigenomic Datasets by ChIPseeker. Curr. Protoc. 2022;2:e585. doi: 10.1002/cpz1.585. [DOI] [PubMed] [Google Scholar]
- 40.Yu G, Wang LG, He QY. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics. 2015;31:2382–2383. doi: 10.1093/bioinformatics/btv145. [DOI] [PubMed] [Google Scholar]
- 41.Lake, B. B. et al. An atlas of healthy and injured cell states and niches in the human kidney. Nature 2021.07.28.454201, 10.1038/s41586-023-05769-3 (2023). [DOI] [PMC free article] [PubMed]
- 42.Abedini, A. et al. Spatially resolved human kidney multi-omics single cell atlas highlights the key role of the fibrotic microenvironment in kidney disease progression. bioRxiv 2022.10.24.513598, 10.1101/2022.10.24.513598 (2022).
- 43.Benayoun BA, et al. H3K4me3 breadth is linked to cell identity and transcriptional consistency. Cell. 2014;158:673. doi: 10.1016/j.cell.2014.06.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Liu X, et al. Distinct features of H3K4me3 and H3K27me3 chromatin domains in pre-implantation embryos. Nat. 2016;537:558–562. doi: 10.1038/nature19362. [DOI] [PubMed] [Google Scholar]
- 45.Bogliotti YS, Ross PJ. Mechanisms of histone H3 lysine 27 trimethylation remodeling during early mammalian development. Epigenetics. 2012;7:976. doi: 10.4161/epi.21615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Wei G, et al. Global mapping of H3K4me3 and H3K27me3 reveals specificity and plasticity in lineage fate determination of differentiating CD4+ T cells. Immunity. 2009;30:155–167. doi: 10.1016/j.immuni.2008.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zhang Y, et al. Canonical nucleosome organization at promoters forms during genome activation. Genome Res. 2014;24:260–266. doi: 10.1101/gr.157750.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Citations
- Li H, Humphreys BD. 2024. GEO. //identifiers.org/geo/GSE253634
- Li H. 2024. Supporting data for ‘Chromatin conformation and histone modification profiling across human kidney anatomic regions’. Zenodo. [DOI] [PMC free article] [PubMed]
Data Availability Statement
Scripts for Hi-C sequencing data preprocessing, CUT&RUN sequencing data preprocessing and analysis relevant to figures in this study are available at https://github.com/TheHumphreysLab/kidney_anatomy_chromatin. Versions of all required packages are described in the scripts and Methods.








