Skip to main content
Genomics Data logoLink to Genomics Data
. 2015 Jun 12;5:238–240. doi: 10.1016/j.gdata.2015.06.007

Dataset of transcriptional landscape of B cell early activation

Alexander S Garruss a,b, Trent Fowler c,
PMCID: PMC4583667  PMID: 26484262

Abstract

Signaling via B cell receptors (BCR) and Toll-like receptors (TLRs) result in activation of B cells with distinct physiological outcomes, but transcriptional regulatory mechanisms that drive activation and distinguish these pathways remain unknown. At early time points after BCR and TLR ligand exposure, 0.5 and 2 h, RNA-seq was performed allowing observations on rapid transcriptional changes. At 2 h, ChIP-seq was performed to allow observations on important regulatory mechanisms potentially driving transcriptional change. The dataset includes RNA-seq, ChIP-seq of control (Input), RNA Pol II, H3K4me3, H3K27me3, and a separate RNA-seq for miRNA expression, which can be found at Gene Expression Omnibus Dataset GSE61608. Here, we provide details on the experimental and analysis methods used to obtain and analyze this dataset and to examine the transcriptional landscape of B cell early activation.

Keywords: Transcriptional, Landscape, B cell, Activation


Specifications
Organism/cell line/tissue Mus musculus C57BL6/Spleen/resting B cells
Sex Male
Sequencer or array type Illumina HiSeq 2000 and Genome Analyzer
Data format Raw and analyzed
Experimental factors Unstimulated, stimulated with anti-IgM or LPS
Experimental features RNA-seq, ChIP-seq
Consent Not applicable
Sample source location Jackson Immunology, Bar Harbor, Maine, USA

1. Direct link to deposited data

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE61608.

2. Experimental design

Experiments were designed to capture the earliest time point of a recognizably coordinated change in RNA expression of murine resting splenic B cells. A “coordinated change” was defined by changes in gene expression that correlated with pathways relevant to the stimuli; BCR signaling for BCR stimulus and NFKB/TLR for LPS stimulus, and was found lacking at 0.5 h but significant at 2 h stimulation [1]. At this earliest time point, we captured regulatory element associated sequences (ChIP-seq) to describe the regulatory landscape.

3. Materials and methods

3.1. Cells and induction

Naïve resting splenic B cells from 8 week old male C57BL6 mice (Jackson Laboratory, Bar Harbor, ME, USA) were isolated with anti-CD43 beads (Miltenyi), and confirmed as ~ 95% CD19 + by flow cytometry (FACSCalibur). The cells were re-suspended in ice-cold media (RPMI 1640 GlutaMAX with 25 mM HEPES (Life Technologies)/10% Gemini FetalPlex sera/55 nM B-mercaptoethanol (Life Technologies)) with either 10 μg/ml anti-mouse IgM goat IgG Fab fragments (Jackson Immunology) or 25 μg/ml Salmonella typhimurium typhus LPS (Sigma)). Afterwards, it was necessary to rest the cells on ice for 30 min prior to removing to an incubator at 37 °C/5% CO2 for the experimental times. Otherwise, we found that sera stimulation obscured specific stimulus stimulation. Monitoring expression of the ratio of either the primary response gene c-fos or the early activation marker CD69 over Actin B by RT-PCR offers a sensitive indicator of the resting state: both will be several folds above 1 over the resting state if activation occurs. Note that resting B cells have very low amounts of RNA; 25 million were used for each RNA-seq time point, while 10 million were sufficient for ChIP-seq. Animal care and use in this study are covered under the “Assurance of Compliance with PHS Policy on Humane Care and Use of Laboratory animals by Awardee Institutions” and approved by the Institutional Animal Care and Use Committee (IACUC) (Animal Welfare Assurance Number A-3775-01).

3.2. Deep sequencing

RNA libraries were prepared with the NuGEN Ovation RNA-Seq v2 library preparation kit, which amplifies both mRNA and non-polyadenylated transcripts. The resulting cDNA was fractionated by sonication and prepared using the Illumina TruSeq library preparation kit which was then multiplexed across single-end, 100 bp (initial RNA), 50 bp (and miRNA), lanes of an Illumina HiSeq 2000 sequencer using the Illumina pipeline RTA version 1.12.4.2 and de-multiplexing with CASAVA v1.8.2. Sequence data was aligned with Illumina ELAND software. Secondary RNA-seq and ChIP-seq for RNA Pol II was performed by the lab of Ali Shilatifard at the Stowers Institute using typical protocols with the Santa Cruz antibody N-20, sc-816x and chip libraries prepared with a KAPA Biosystems BHTP kit. H3K4me3 (Abcam antibody ab8580) and H3K27me3 (Abcam antibody ab6002) ChIP-seq was performed in the lab of Ranjan Sen at NIH using an Illumina TruSeq ChIP library kit according to the manufacturer's protocol followed by cluster generation with a TruSeq Cluster generation kit v5 and libraries were sequenced on an Illumina Genome Analyzer (GA-II) using Illumina RTA version 1.8.

3.3. Read mapping

Fastqc profiles showed good quality data (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Non-manipulated reads were mapped against the mm9/ENSEMBL build 67 genome reference [2] using Tophat v2.0.10 [3] along with Samtools v0.1.19 [4] with general settings for RNA and bowtie 1.0.0 [5] for ChIP (options -n 1 -m 1 --best --strata). Mapped read numbers per million: RNA initial: rest 75.3, BCR30 20.9, BCR120 73.6, LPS30 44.7, and LPS120 40.5; RNA secondary: rest 77, BCR120 61, and LPS120 68.9; RNA Pol II: rest 18.1, BCR120 14.2, and LPS120 21.3; H3K4me3: rest 13.1, BCR120 16.4, LPS120 19.1; and K3K27me3: rest 19.0, BCR120 20.0, and LPS120 18.1.

3.4. Differential expression (DE) analysis

DE was identified by a minimal two-fold difference in log ratios of normalized reads generated with Cufflinks v1.3.1 using default settings [6]. Preferentially induced or reduced gene sets included genes that were identified by a change in either; a single response, or when affected by both responses with the preferred response changed at a ratio at least 2 fold more than the other non-preferred response. An XLSX spreadsheet of these results has been made available in the GSE61608 dataset.

3.5. MiRNA-seq analysis

After TRIzol isolation of RNA, Illumina's TruSeq Small RNA Sample Preparation Kits were used to produce material for generating 50 bp single end reads which were then analyzed with miRdeep2 [7] using the miRBase reference v14 with standard settings. Mapped miRNAs were confirmed by visual inspection of miRNA structure output from mirDeep2 and UCSC Genome Browser tracks [8], and inclusion in the Ensembl 67 data base [2]. Differential expression from the resting state was identified by a minimal two fold difference in miRdeep2 normalized reads. An XSLX spreadsheet of this analysis can be found in the GEO data set. Total miRNA dataset reads per million are the following: rest 20.2, BCR120 28.6, and LPS120 14.8. Total miRdeep2 miRNA reads (per thousand) are the following: rest 55.8, BCR120 16.2, and LPS120 27.6.

3.6. ChIP-seq analysis

To determine transcription start site (TSS) coverage, bedtools' “bamToBed” function and custom R scripts (bed2cov.R then cov2rpm.R) were used with R-2.11.1 to produce bedgraphs from bowtie mapped bam files. BedGraph file formats were then converted to BigWig files with bedgraphToBigwig for UCSC Genome Browser visualization (https://genome.ucsc.edu/goldenPath/help/bigWig.html). Reads-per-million-normalized coverage was computed for the gene sets and regions indicated with summary statistics, such as mean coverage, calculated at single base pair resolution. Histogram figures were made by converting the “cov2rpm.R” output using “R.prep_tss_matrix” followed by plot generation using “R.plot_tss_matrix”. The resulting TSS matrix is organized with transcript or gene names as the rows whereby the columns are positions flanking the start site of each transcript/gene. The matrix can easily be subsetted on transcript/gene names of interest, such as those found to be differentially expressed by another assay. Total normalized read coverage across other regions was created using the script “R.calculate_sums_and_max” and shown as boxplots in a manner similar to that shown in “R.plot_tss_matrix”. Standard R Wilcoxon Rank Sums Testing commands were used to determine the significance of differentially covered regions between biological samples. We have provided these scripts for reference. Note that cov2rpm.R and R.prep_tss_matrix require R-2.11.1 (http://cran.r-project.org/).

3.7. Quantitative PCR-RNA validation

Real-time PCR was performed with specific primers. Primer sequences have been made available in Supplement data from an earlier publication of the analysis of this dataset [1]. Target sequences are reported relative to Beta Actin and normalized to resting cells.

Acknowledgments

We would like to acknowledge the following people for their very helpful support including Jeff Johnston and Sam Meier for contributing portions of the ChIP-seq analysis code. For material support necessary to produce the dataset we would like to thank Ananda L Roy of Tufts University, Ali Shilatifard of Northwestern University (then at the Stowers Institute), and Ranjan Sen of NIH, Institute for Aging. This work was supported in part by the Intramural Research Program of the NIH, the National Institute on Aging to RS and by grants to STS (R01 GM086372) and ALR (AHA 12GRNT12180023).

Footnotes

Appendix A

Supplementary data to this article can be found online at http://dx.doi.org/10.1016/j.gdata.2015.06.007.

Appendix A. Supplementary data

Supplementary material 1.

mmc1.zip (996B, zip)

Supplementary material 2.

mmc2.zip (803B, zip)

Supplementary material 3.

mmc3.zip (1.2KB, zip)

Supplementary material 4.

mmc4.zip (895B, zip)

Supplementary material 5.

mmc5.zip (1.2KB, zip)

References

  • 1.Fowler T., Garruss A.S., Ghosh A., De S., Becker K.G., Wood W.H. Divergence of transcriptional landscape occurs early in B cell activation. Epigenetics Chromatin. 2015;8:20. doi: 10.1186/s13072-015-0012-x. (Epub 2015/05/20) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Flicek P., Ahmed I., Amode M.R., Barrell D., Beal K., Brent S. Ensembl 2013. Nucleic Acids Res. 2013;41(Database issue):D48–D55. doi: 10.1093/nar/gks1236. (Epub 2012/12/04) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Trapnell C., Pachter L., Salzberg S.L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25(9):1105–1111. doi: 10.1093/bioinformatics/btp120. (Epub 2009/03/18) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–2079. doi: 10.1093/bioinformatics/btp352. (Epub 2009/06/10) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Langmead B., Trapnell C., Pop M., Salzberg S.L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. (Epub 2009/03/06) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Trapnell C., Roberts A., Goff L., Pertea G., Kim D., Kelley D.R. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 2012;7(3):562–578. doi: 10.1038/nprot.2012.016. (Epub 2012/03/03) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Friedlander M.R., Mackowiak S.D., Li N., Chen W., Rajewsky N. miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades. Nucleic Acids Res. 2012;40(1):37–52. doi: 10.1093/nar/gkr688. (Epub 2011/09/14) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Raney B.J., Dreszer T.R., Barber G.P., Clawson H., Fujita P.A., Wang T. Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser. Bioinformatics. 2014;30(7):1003–1005. doi: 10.1093/bioinformatics/btt637. (Epub 2013/11/15) [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material 1.

mmc1.zip (996B, zip)

Supplementary material 2.

mmc2.zip (803B, zip)

Supplementary material 3.

mmc3.zip (1.2KB, zip)

Supplementary material 4.

mmc4.zip (895B, zip)

Supplementary material 5.

mmc5.zip (1.2KB, zip)

Articles from Genomics Data are provided here courtesy of Elsevier

RESOURCES