Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2018 Nov 8;47(Database issue):D752–D758. doi: 10.1093/nar/gky1099

Update of the FANTOM web resource: expansion to provide additional transcriptome atlases

Marina Lizio 1, Imad Abugessaisa 1, Shuhei Noguchi 1, Atsushi Kondo 1, Akira Hasegawa 1, Chung Chau Hon 1, Michiel de Hoon 1, Jessica Severin 1, Shinya Oki 2, Yoshihide Hayashizaki 3, Piero Carninci 1, Takeya Kasukawa 1,, Hideya Kawaji 1,3,
PMCID: PMC6323950  PMID: 30407557

Abstract

The FANTOM web resource (http://fantom.gsc.riken.jp/) was developed to provide easy access to the data produced by the FANTOM project. It contains the most complete and comprehensive sets of actively transcribed enhancers and promoters in the human and mouse genomes. We determined the transcription activities of these regulatory elements by CAGE (Cap Analysis of Gene Expression) for both steady and dynamic cellular states in all major and some rare cell types, consecutive stages of differentiation and responses to stimuli. We have expanded the resource by employing different assays, such as RNA-seq, short RNA-seq and a paired-end protocol for CAGE (CAGEscan), to provide new angles to study the transcriptome. That yielded additional atlases of long noncoding RNAs, miRNAs and their promoters. We have also expanded the CAGE analysis to cover rat, dog, chicken, and macaque species for a limited number of cell types. The CAGE data obtained from human and mouse were reprocessed to make them available on the latest genome assemblies. Here, we report the recent updates of both data and interfaces in the FANTOM web resource.

INTRODUCTION

Transcription is the first process in the synthesis of molecules that carry out functions encoded on the genome. Promoters and enhancers are key classes of cis-regulatory elements controlling the transcription of target genes from proximal upstream regions and distal regions, respectively. Elements of both of these classes are transcribed (1).

FANTOM (functional annotation of mammalian genomes) is an international collaborative effort that aims to study genomes from the perspective of the transcriptome (2–6). Its fifth iteration (FANTOM5) drafted atlases of promoters and enhancers and profiled dynamic changes in their activities during differentiation and responses to stimuli (4,7,8). These studies were based on transcription initiation profiles at nucleotide resolution detected using CAGE (cap analysis of gene expression) (9), the most accurate experimental approach to identify 5′-ends of capped RNAs (10). cis-regulatory elements were monitored in more than 1800 samples from human and 1000 samples from mouse (11), then the resulting data were compiled and made available via the FANTOM web resource (12,13).

The FANTOM5 project recently provided atlases of long noncoding RNAs and micro RNAs and their promoters, with accompanying RNA-seq and short RNA transcriptome data (Figure 1). The project also provided additional transcriptome data, such as CAGE profiles from rhesus macaque (Macaca mulatta), chicken (Gallus gallus), dog (Canis lupus familiaris), and rat (Rattus norvegicus) and CAGEscan (14) profiles that can associate promoters with their cognate exons with paired-end sequencing (Figure 1). Here, we introduce the recent updates of the FANTOM web resource.

Figure 1.

Figure 1.

Schematic view of the FANTOM5 atlases and data. The atlases are indicated by book icons, and the data sets are indicated by gray boxes. New data sets and atlases are indicated in the top panel.

RESULTS

Long noncoding RNA atlas with matched RNA-seq

The human genome is pervasively transcribed and produces a large number of long noncoding RNAs (lncRNAs) (15,16). However, the current annotation of lncRNAs is far from complete (17). We attempted to comprehensively annotate human lncRNAs by integrating CAGE and RNA-seq on a large scale. We performed RNA-seq on a subset of FANTOM5 human samples (n = 70) and assessed the compatibility of the resulting transcriptome assembly with CAGE-defined transcription start sites (TSSs). Based on the matched CAGE and RNA-seq datasets, we formulated a custom metric, the transcription initiation evidence score (TIEScore), to quantitatively integrate CAGE-defined TSSs and transcriptome assemblies (18). We then applied the TIEScore to our own RNA-seq assembly and to other transcriptome assemblies and gene models (Human Body Map 2.0 (19), miTranscriptome (20), and GENCODE (21)), to build a meta-assembly of transcripts with accurate 5′-ends, called FANTOM CAT (CAGE-Associated Transcriptome). At a robust TIEScore cutoff, FANTOM CAT consists of 59 110 genes, including 27 919 lncRNAs (18). All genes in FANTOM CAT were classified based on their genomic (e.g. intergenic or antisense) and epigenomic (e.g. promoter- or enhancer-derived) contexts. They were then annotated with associations to genetic traits (using known GWAS loci) and to sample ontologies (using FANTOM5 expression profiles and sample ontologies). In addition, their dynamic expression patterns among FANTOM5 samples were annotated based on differential expression analyses. Finally, we integrated the GTEx expression quantitative trait loci (eQTL) data (22) with the FANTOM5 expression profiles to define a set of lncRNAs with potential roles in transcriptional regulation. All these annotations, as well as the FANTOM CAT meta-assemblies at various TIEScore cutoffs, are available at http://fantom.gsc.riken.jp/cat. The FANTOM CAT browser (Figure 2) allows interactive visualization of genes and their annotations (http://fantom.gsc.riken.jp/cat/v1/#/). The raw reads and alignments are available from the FANTOM5 data files repository (http://fantom.gsc.riken.jp/5/datafiles/phase2.5/).

Figure 2.

Figure 2.

Using the FANTOM CAT browser to explore lncRNAs associated with classical monocytes. (A) Browse the ontology list page by clicking on ‘Ontologies’ on the landing page; (B) filter for ‘classical monocytes’; (C) list of genes associated with ontology CL:0000860 classical monocytes; (D) filter for gene classes of ‘lncRNA’; (E) sort genes by their level of association with classical monocytes; (F) summary page of the lncRNA gene CATG00000102578.1, with highest level of association to classical monocytes (http://fantom.gsc.riken.jp/cat/v1/#/genes/CATG00000102578.1); (G) annotation summary of CATG00000102578.1; (H) expression level of CATG00000102578.1 in FANTOM5 samples, with samples of classical monocytes highlighted; (I) dynamic expression pattern of CATG00000102578.1 in FANTOM5 samples.

Atlas of miRNAs and their promoters with matched short RNA-seq

In addition to the synthesis of RNA during transcription, gene expression also depends on post-transcriptional processes such as RNA stabilization, translation efficiency, and degradation. Micro RNAs (miRNAs) are a major class of small noncoding RNAs involved in post-transcriptional regulation by repressing target genes via RNA degradation or inhibition of translation. As part of FANTOM5, we created an atlas of miRNA expression by sequencing short RNA libraries produced from a subset of the human and mouse RNA samples that were used for the promoter atlas (23). The atlas comprises 422 libraries of human samples and 78 libraries of mouse samples; most samples were derived from human primary cells. Expression levels of all annotated miRNAs, as well as candidate novel miRNAs discovered using miRDeep2 (24), were calculated for each sample. The enrichment or depletion of particular miRNAs in specific human primary cell types was then evaluated (Figure 3). Promoters were systematically identified for human and mouse miRNAs, using the FANTOM5 CAGE data to identify transcription start sites at the single-nucleotide resolution. Expression levels of mature miRNAs, as measured by short RNA library sequencing, were correlated with the expression level of the corresponding primary miRNA transcript in the matching CAGE library. Thus we can use the CAGE expression signal at the miRNA promoter as a proxy for the expression level of the mature miRNA, and to extend the miRNA expression atlas to all CAGE samples in FANTOM5. The FANTOM5 miRNA atlas with the annotated promoter for each miRNA, the expression profiles of the mature and primary miRNAs, and the cell-type enrichment analysis is available at http://fantom.gsc.riken.jp/5/suppl/De_Rie_et_al_2017/ (Figure 3). A clickable heatmap of miRNA expression levels in human primary cell samples is available at http://fantom.gsc.riken.jp/5/suppl/De_Rie_et_al_2017/vis_viewer/#/heatmap. The raw reads and alignments are available as part of the FANTOM5 data files (http://fantom.gsc.riken.jp/5/datafiles/phase2.5/).

Figure 3.

Figure 3.

The FANTOM5 miRNA expression atlas. The annotated miRNA promoter, expression profile and results of the cell-type enrichment analysis are shown by using miRNA hsa-miR-9-5p as an example (http://fantom.gsc.riken.jp/5/suppl/De_Rie_et_al_2017/vis_viewer/#/human).

CAGEscan to link orphan promoters to transcriptional unit

While the application of CAGE to ∼1800 human samples enabled us to identify nearly 200 000 promoter regions, only half of them were associated with existing gene models (4) and characterizing the rest remains a challenge. One approach to improve genome annotation is to perform RNA-seq from the matched samples, as described above; however, the efficiency of this method in capturing the terminal ends of transcripts is limited. CAGEscan is an experimental protocol that links transcription initiation sites to cognate exons by paired-end sequencing of random-primed sites of cDNAs (14). We applied CAGEscan to a subset of human samples (n = 56) from the promoter atlas (25). Aligned pairs starting in the same region were assembled into transcript models called CAGEscan clusters and added to the FANTOM5 data files together with the raw data (http://fantom.gsc.riken.jp/5/datafiles/phase2.5/).

Complementary CAGE profiles for macaque, chicken, rat and dog

To complement the promoter atlases created for human and mouse, we obtained CAGE profiles from rhesus macaque, chicken, dog and rat. The macaque samples cover 15 brain regions that match the brain regions samples profiled in human (26). The chicken samples cover consecutive embryonic stages, providing snapshots of promoter activation and usage during development (27). Additional samples from chicken include adult primary mesenchymal stem cells, aortic smooth muscle cells, and hepatocytes. These cell types match the cell types in the other two species (rat and dog) (28), as well as in human and mouse. CAGE profiles from these four species were processed as performed for human and mouse, including the identification of CAGE peaks (4). Although the coverage of samples is far more limited than in human and mouse, our experimental design of matched samples, application of the same CAGE library preparation protocol, and identical data processing pipeline enables us to study transcriptional regulation across species. All the processed data are integrated into the FANTOM5 data files (http://fantom.gsc.riken.jp/5/datafiles/phase2.5/).

Up-to-date promoter atlases based on GRCh38 and GRCm38

All of the transcriptome atlases discussed above were built based on GRCh37 for human and GRCm37 for mouse, which were the latest reference genome assemblies at the time of the launch of the FANTOM5 project. Since the improved GRCh38 and GRCm38 reference genome assemblies were released for human and mouse, thanks to extensive efforts by the Genome Reference Consortium (29), we updated the promoter atlases accordingly by realigning the CAGE reads to the new reference genomes and recalculating the TSS frequencies. We further converted the genomic coordinates of the CAGE peaks from GRCh37 to GRCh38 and from GRCm37 to GRCm38 via UCSC’s liftOver tool, and used the same identifiers to ensure compatibility between genome assemblies. We carefully assessed the results of the conversion and found that >99% of the CAGE peaks were successfully converted. A handful of regions with issues after liftOver were curated manually (30). Finally, the converted CAGE peaks were merged with those called de novo to remedy the absence of converted peaks in the latest genomes due to differences between the two assemblies. The annotations of the peaks, such as names and gene associations, were also updated based on the most recent gene models (30). The resulting data are available from the web resource at http://fantom.gsc.riken.jp/5/datafiles/reprocessed/.

Update of the FANTOM5 interfaces

To increase accessibility to the datasets added, we loaded them into the FANTOM5 interfaces and databases (12,13). In addition to the expansion of data sets in the data archive, the ZENBU genome browser (31) now provides visualizations for the new data sets of CAGE and RNA-seq, as well as annotation data on the latest genome assemblies. The SSTAR browser (32) has been updated to include sample information and CAGE peaks for the dog, rat, macaque, and chicken samples. The sample information is of particular importance in this context, not only because of the matching of RNA sources across species, but also because the same RNAs were profiled by multiple protocols (CAGE, RNA-seq, short RNA-seq and CAGEscan). SSTAR provides a complete set of metadata to connect all of them based on common information. Semantic MediaWiki—the backend system of SSTAR—has been upgraded to improve performance. TET, the FANTOM5 table extraction tool, now provides CAGE peaks and expression levels for the four new species, enabling researchers to obtain subsets of data of their interest from the large data files.

We have updated the configuration of trackHub (33) to reflect the latest data, visible in both the UCSC Genome Browser (34) and the ENSEMBL Genome Browser (35). Because the epigenetic machinery plays crucial roles in controlling transcriptional regulation, we have further extended the contents of the FANTOM web resource by including epigenetic data from the ChIP-Atlas (Figure 4), a comprehensive database of public ChIP-seq and DNase-seq data available via the Sequence Read Archive (SRA). The data it contains have been uniformly reprocessed and annotated (36). We have manually curated epigenetics data of cell lines corresponding to those profiled by CAGE and made the data available via the trackHub configuration tool (bioRxiv: https://doi.org/10.1101/314807). ChIP-Atlas integration with transcription initiation activities will facilitate the study of transcriptional regulation. The collected data files with the configuration file (hub.txt) are accessible at http://fantom.gsc.riken.jp/5prim/external/CellLines-CAGE-with-epigenome/.

Figure 4.

Figure 4.

MCF-7 cells: integrated view of transcriptome and epigenetic marks. The data were obtained from ChIP-Atlas, which covers all ChIP-seq experiments deposited in SRA, in addition to FANTOM5. The view of EGR-1 locus is displayed by the UCSC Genome Browser via the trackHub framework(https://genome-asia.ucsc.edu/cgi-bin/hgTracks?db=hg19&hubUrl=http://fantom.gsc.riken.jp/5prim/external/CellLines-CAGE-with-epigenome/hub.txt).

DISCUSSION

With this latest update of the FANTOM web resource, we focused mainly on diversifying the types of assays rather than increasing the number of samples for the same assay because CAGE technology allows us to identify the 5′-ends of capped RNAs but not the internal structure of the RNAs. We sought to overcome this limitation by integrating data sets targeting different portions of the RNA, and thus to understand the transcriptome from a more comprehensive perspective. We expanded our resource to encompass lncRNAs and miRNAs, whose atlases further our understanding cellular regulatory mechanisms; however, the functional roles of these molecules—in particular lncRNAs—largely remain to be elucidated. The sixth iteration of FANTOM is thus aimed at specifically assessing lncRNA functions in multiple cell types (37).

We have also expanded the number of samples by covering four additional species—rat, dog, chicken and macaque — for a limited amount of cell types. As the comparison of multiple genome sequences enables us to study the evolutionary processes of genetic inheritance, cross-species transcriptome comparisons make it possible in turn to assess the functional consequences of evolution at the molecular level. The expanded data set presented here, and future data releases, will further facilitate these cross-species studies.

ACKNOWLEDGEMENTS

The authors would like to thank Dr A.R.R. Forrest for scientific coordination of the FANTOM5 collaboration, Dr C. Plessy for helping the manuscript preparation, J. Harshbarger for development of several FANTOM5 interfaces, and all of the members in the FANTOM consortium for their fruitful collaboration.

Authors’ contributions: M.L., I.A., S.N., A.K., A.H., C.C.H., M.D.H, J.S., S.O., T.K. and H.K. updated the database contents. Y.H., P.C., T.K. and H.K. coordinated maintenance and update of the FANTOM web resource. M.L., C.C.H., M.D.H, T.K. and H.K wrote the manuscript. All authors read and approved the manuscript.

FUNDING

Research grants to RIKEN Center for Life Science Technologies, RIKEN Preventive Medicine and Diagnosis Innovation Program, RIKEN Center for Integrative Medical Sciences from the Japan Ministry of Education, Culture, Sports, Science and Technology (MEXT); Japan Society for the Promotion of Science KAKENHI Grants [16H02902, 16K12529 to H.K.]; Research grant from National Bioscience Database Center of the Japan Science and Technology Agency. Funding for open access charge: MEXT.

Conflict of interest statement. None declared.

REFERENCES

  • 1. De Santa F., Barozzi I., Mietton F., Ghisletti S., Polletti S., Tusi B.K., Muller H., Ragoussis J., Wei C.L., Natoli G.. A large fraction of extragenic RNA pol II transcription sites overlap enhancers. PLoS Biol. 2010; 8:e1000384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Carninci P., Sandelin A., Lenhard B., Katayama S., Shimokawa K., Ponjavic J., Semple C.A., Taylor M.S., Engstrom P.G., Frith M.C. et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nat. Genet. 2006; 38:626–635. [DOI] [PubMed] [Google Scholar]
  • 3. FANTOM Consortium Suzuki H., Forrest A.R., van Nimwegen E., Daub C.O., Balwierz P.J., Irvine K.M., Lassmann T., Ravasi T., Hasegawa Y. et al. The transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line. Nat. Genet. 2009; 41:553–562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Forrest A.R., Kawaji H., Rehli M., Baillie J.K., de Hoon M.J., Haberle V., Lassmann T., Kulakovskiy I.V., Lizio M., Itoh M. et al. A promoter-level mammalian expression atlas. Nature. 2014; 507:462–470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Kawai J., Shinagawa A., Shibata K., Yoshino M., Itoh M., Ishii Y., Arakawa T., Hara A., Fukunishi Y., Konno H. et al. Functional annotation of a full-length mouse cDNA collection. Nature. 2001; 409:685–690. [DOI] [PubMed] [Google Scholar]
  • 6. Okazaki Y., Furuno M., Kasukawa T., Adachi J., Bono H., Kondo S., Nikaido I., Osato N., Saito R., Suzuki H. et al. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature. 2002; 420:563–573. [DOI] [PubMed] [Google Scholar]
  • 7. Andersson R., Gebhard C., Miguel-Escalada I., Hoof I., Bornholdt J., Boyd M., Chen Y., Zhao X., Schmidl C., Suzuki T. et al. An atlas of active enhancers across human cell types and tissues. Nature. 2014; 507:455–461. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Arner E., Daub C.O., Vitting-Seerup K., Andersson R., Lilje B., Drablos F., Lennartsson A., Ronnerblad M., Hrydziuszko O., Vitezic M. et al. Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells. Science. 2015; 347:1010–1014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Kanamori-Katayama M., Itoh M., Kawaji H., Lassmann T., Katayama S., Kojima M., Bertin N., Kaiho A., Ninomiya N., Daub C.O. et al. Unamplified cap analysis of gene expression on a single-molecule sequencer. Genome Res. 2011; 21:1150–1159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Adiconis X., Haber A.L., Simmons S.K., Levy Moonshine A., Ji Z., Busby M.A., Shi X., Jacques J., Lancaster M.A., Pan J.Q. et al. Comprehensive comparative analysis of 5′-end RNA-sequencing methods. Nat. Methods. 2018; 15:505–511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Noguchi S., Arakawa T., Fukuda S., Furuno M., Hasegawa A., Hori F., Ishikawa-Kato S., Kaida K., Kaiho A., Kanamori-Katayama M. et al. FANTOM5 CAGE profiles of human and mouse samples. Sci Data. 2017; 4:170112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Lizio M., Harshbarger J., Abugessaisa I., Noguchi S., Kondo A., Severin J., Mungall C., Arenillas D., Mathelier A., Medvedeva Y.A. et al. Update of the FANTOM web resource: high resolution transcriptome of diverse cell types in mammals. Nucleic Acids Res. 2017; 45:D737–D743. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Lizio M., Harshbarger J., Shimoji H., Severin J., Kasukawa T., Sahin S., Abugessaisa I., Fukuda S., Hori F., Ishikawa-Kato S. et al. Gateways to the FANTOM5 promoter level mammalian expression atlas. Genome Biol. 2015; 16:22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Plessy C., Bertin N., Takahashi H., Simone R., Salimullah M., Lassmann T., Vitezic M., Severin J., Olivarius S., Lazarevic D. et al. Linking promoters to functional transcripts in small samples with nanoCAGE and CAGEscan. Nat. Methods. 2010; 7:528–534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C. et al. The transcriptional landscape of the mammalian genome. Science. 2005; 309:1559–1563. [DOI] [PubMed] [Google Scholar]
  • 16. Djebali S., Davis C.A., Merkel A., Dobin A., Lassmann T., Mortazavi A., Tanzer A., Lagarde J., Lin W., Schlesinger F. et al. Landscape of transcription in human cells. Nature. 2012; 489:101–108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Uszczynska-Ratajczak B., Lagarde J., Frankish A., Guigo R., Johnson R.. Towards a complete map of the human long non-coding RNA transcriptome. Nat. Rev. Genet. 2018; 19:535–548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Hon C.C., Ramilowski J.A., Harshbarger J., Bertin N., Rackham O.J., Gough J., Denisenko E., Schmeier S., Poulsen T.M., Severin J. et al. An atlas of human long non-coding RNAs with accurate 5′ ends. Nature. 2017; 543:199–204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Cabili M.N., Trapnell C., Goff L., Koziol M., Tazon-Vega B., Regev A., Rinn J.L.. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev. 2011; 25:1915–1927. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Iyer M.K., Niknafs Y.S., Malik R., Singhal U., Sahu A., Hosono Y., Barrette T.R., Prensner J.R., Evans J.R., Zhao S. et al. The landscape of long noncoding RNAs in the human transcriptome. Nat. Genet. 2015; 47:199–208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Harrow J., Frankish A., Gonzalez J.M., Tapanari E., Diekhans M., Kokocinski F., Aken B.L., Barrell D., Zadissa A., Searle S. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 2012; 22:1760–1774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Consortium G.T. Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science. 2015; 348:648–660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. de Rie D., Abugessaisa I., Alam T., Arner E., Arner P., Ashoor H., Astrom G., Babina M., Bertin N., Burroughs A.M. et al. An integrated expression atlas of miRNAs and their promoters in human and mouse. Nat. Biotechnol. 2017; 35:872–878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Friedlander M.R., Mackowiak S.D., Li N., Chen W., Rajewsky N.. miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades. Nucleic Acids Res. 2012; 40:37–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Bertin N., Mendez M., Hasegawa A., Lizio M., Abugessaisa I., Severin J., Sakai-Ohno M., Lassmann T., Kasukawa T., Kawaji H. et al. Linking FANTOM5 CAGE peaks to annotations with CAGEscan. Sci Data. 2017; 4:170147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Francescatto M., Lizio M., Philippens I., Bontrop R., Sakai M., Watanabe S., Itoh M., Hasegawa A., Lassmann T., Severin J. et al. Transcription start site profiling of 15 anatomical regions of the Macaca mulatta central nervous system. Sci Data. 2017; 4:170163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Lizio M., Deviatiiarov R., Nagai H., Galan L., Arner E., Itoh M., Lassmann T., Kasukawa T., Hasegawa A., Ros M.A. et al. Systematic analysis of transcription start sites in avian development. PLoS Biol. 2017; 15:e2002887. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Lizio M., Mukarram A.K., Ohno M., Watanabe S., Itoh M., Hasegawa A., Lassmann T., Severin J., Harshbarger J., Abugessaisa I. et al. Monitoring transcription initiation activities in rat and dog. Sci. Data. 2017; 4:170173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Church D.M., Schneider V.A., Graves T., Auger K., Cunningham F., Bouk N., Chen H.C., Agarwala R., McLaren W.M., Ritchie G.R. et al. Modernizing reference genome assemblies. PLoS Biol. 2011; 9:e1001091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Abugessaisa I., Noguchi S., Hasegawa A., Harshbarger J., Kondo A., Lizio M., Severin J., Carninci P., Kawaji H., Kasukawa T.. FANTOM5 CAGE profiles of human and mouse reprocessed for GRCh38 and GRCm38 genome assemblies. Sci Data. 2017; 4:170107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Severin J., Lizio M., Harshbarger J., Kawaji H., Daub C.O., Hayashizaki Y., Consortium F., Bertin N., Forrest A.R.. Interactive visualization and analysis of large-scale sequencing datasets using ZENBU. Nat. Biotechnol. 2014; 32:217–219. [DOI] [PubMed] [Google Scholar]
  • 32. Abugessaisa I., Shimoji H., Sahin S., Kondo A., Harshbarger J., Lizio M., Hayashizaki Y., Carninci P., consortium F., Forrest A. et al. FANTOM5 transcriptome catalog of cellular states based on Semantic MediaWiki. Database (Oxford). 2016; 2016:baw105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Raney B.J., Dreszer T.R., Barber G.P., Clawson H., Fujita P.A., Wang T., Nguyen N., Paten B., Zweig A.S., Karolchik D. et al. Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser. Bioinformatics. 2014; 30:1003–1005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Speir M.L., Zweig A.S., Rosenbloom K.R., Raney B.J., Paten B., Nejad P., Lee B.T., Learned K., Karolchik D., Hinrichs A.S. et al. The UCSC Genome Browser database: 2016 update. Nucleic Acids Res. 2016; 44:D717–D725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Kersey P.J., Allen J.E., Allot A., Barba M., Boddu S., Bolt B.J., Carvalho-Silva D., Christensen M., Davis P., Grabmueller C. et al. Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species. Nucleic Acids Res. 2018; 46:D802–D808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Oki S., Ohta T., Shioi G., Hatanaka H., Ogasawara O., Okuda Y., Kawaji H., Nakaki R., Sese J., Meno C.. ChIP-Atlas: a data-mining suite powered by full integration of public ChIP-seq data. EMBO Rep. 2018; 19:e46255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. de Hoon M., Shin J.W., Carninci P.. Paradigm shifts in genomics through the FANTOM projects. Mamm. Genome. 2015; 26:391–402. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES