Abstract
The National BioResource Project (NBRP) is a Japanese project that aims to establish a system for collecting, preserving and providing bioresources for use as experimental materials for life science research. It is promoted by 27 core resource facilities, each concerned with a particular group of organisms, and by one information center. The NBRP database is a product of this project. Thirty databases and an integrated database-retrieval system (BioResource World: BRW) have been created and made available through the NBRP home page (http://www.nbrp.jp). The 30 independent databases have individual features which directly reflect the data maintained by each resource facility. The BRW is designed for users who need to search across several resources without moving from one database to another. BRW provides access to a collection of 4.5-million records on bioresources including wild species, inbred lines, mutants, genetically engineered lines, DNA clones and so on. BRW supports summary browsing, keyword searching, and searching by DNA sequences or gene ontology. The results of searches provide links to online requests for distribution of research materials. A circulation system allows users to submit details of papers published on research conducted using NBRP resources.
INTRODUCTION
Japanese bioresources for life science research have an 80 year history. Although some unique and precious collections of resources have been accumulated during this time, some of these have recently became dispersed and lost as a result of the increasing age of their providers or through shortages of funds, so that systems for providing useful collections of resources became inadequate. To improve this situation, it was necessary to establish a sustainable environment in which researchers could readily obtain bioresources, so the National BioResource Project (NBRP) began in 2002. The species included in the project were selected on the basis that they were indigenous to Japan or that they were model organisms currently studied or expected to be studied in the future by large numbers of researchers.
A major feature of this project is that it promotes the centralization of resources and information to ensure continuity. Resources are organized by species or groups of organisms, and a system of interaction between the resource and the information center [which belongs to the National Institute of Genetics (NIG)] was created to centralize the information. All the information in the databases is publicly available and several enhancements in the contents of the databases and in the services offered are being introduced.
Here, we outline some details of the BioResourceWorld (BRW) integrated database and several representative component databases. All the information described is available at http://www.nbrp.jp/.
NBRP Information Site is a gateway to access all the information of NBRP. The menu on the left-hand side beneath the heading ‘Resource Center’ provides links to all 30 individual databases, and gives the user direct access to the appropriate database for the organism of interest. The text box ‘Keyword’ near the upper center of the page is a query box for entering keyword searches of the BRW integrated database. Users can access the home page of BRW directly by clicking on ‘Resource Integrated Search Site’ in the menu bar near the top of the page. This menu bar also provides links to other pages: ‘Journal’ provides a link to the Research Resource Circulation (RRC) Web site for browsing and submission of papers reporting research using resources obtained through NBRP; ‘Japan Genetic Resources’ provides a link to a list of URLs for resources in Japan, but outside the NBRP group; and ‘Worldwide Genetic Resources’ provides a link to a list of URLs for bioresources worldwide. The total number of records currently available in the database for all bioresources and the subtotals for three individual classifications (animals, plants and microbes) are shown in the turquoise-colored box near the top left of the page.
NBRP DATABASES
An integrated NBRP database retrieval system: BioResourceWorld
BRW is an integrated database retrieval system that allows users to retrieve resources by using the body of information held by NBRP on a number of organisms. All the resources (∼4.5 million items) are available for distribution. Keyword searching of BRW is directly available from the NBRP home page. A user performing a search can specify an organism and a resource category from pull-down menus. (The default values are ‘All organisms’ and ‘All categories’.) When a user enters keywords and performs a search with the default settings, the total number of results is shown together with a breakdown of the number of strains and the number of DNA clones. This is followed by the list of resources broken down by organism. By clicking on a desired resource on the list, the user can access a detailed information page that provides links to the distribution-request site for each database (where this exists).
The BRW home page, which is linked from the top menu bar of the NBRP home page, contains the search functions described above and four tabs (ALL, DNA, BLAST and Gene Ontology). These four tabs provide, respectively, a summary display of all resources by organism groups, a summary display of DNA clones by organism groups, a BLAST search and a Gene Ontology (GO) search.
The BLAST search allows the user to search all organism groups and also to specify particular organisms or categories, such as genomic clones, cDNA clones, or libraries. The search results are shown as diagrams of alignments in which abbreviations of names of organisms or groups of organisms appear in a color-coded manner on the left-hand side of the diagram to help users recognize particular organisms. Individual sequences in hits have links to NCBI (1), DDBJ (2) and BRW, which gives the user access to the web sites from which the appropriate resources can be ordered.
The latest service of BRW is a search of resources on the basis of GO. The user can examine the hierarchical structure of a GO term and query it by using GO ID, GO term, or GO Gene. For example, if a user enters ‘0000038’ (or ‘very-long-chain fatty acid metabolic process’) into the GO-ID (or GO Term), a resource list containing ‘mouse (5/5), drosophila (2/4), arabidopsis (10/10)’ is shown. The numbers in parenthesis indicate the number of resources associated with this GO-ID (left) and its descendants (right). Thus this service allows users a more semantic search and provides them with a wider range of resources.
The GO search is currently at a testing stage, because not all resources contain GO information. The total number of resources mapped to a GO Term is displayed, and a list of relevant resources can be obtained by clicking the figures. Like other search modes, the GO allows the ordering of resources from search results. We plan to enhance the search of resources across organism by the addition of supporting bio-ontology data, such as phenotype, anatomy and development.
The information contained in BRW is updated concurrently with that in the individual databases.
Resource research circulation
If a researcher obtains good results with NBRP resources, such a result may be useful for later researchers who use the same resources. We therefore ask researchers to feed back information on papers that report the results of their studies, and we also collect other papers in which NBRP resources are used. This information is used to create an open database (RRC) of papers related to the NBRP resources. RRC also provides an online registration system through which a paper can be submitted merely by entering its PubMed identification number and the name of the resource used in the paper. We have asked many researchers to use this system to feed back on the papers that they have published. It would be ideal for experimental researchers if, as in the case of the accession numbers of the DNA Data Bank, we could establish a system in which detailed information on the bioresources described in ‘Materials and Methods’ sections of papers could be obtained from public databases and, furthermore, the materials themselves could be easily acquired.
The NBRP databases
Table 1 shows the names and features of the NBRP component databases, organized by organism group. As shown in Table 1, the NBRP resource collection covers a wide range of taxa with indications of the presence or absence of the following collections: (i) naturally mated lines (including wild species, cultivated species, inbred lines and spontaneous mutants); (ii) genetically engineered organisms (including induced mutants, transgenic strains, transposon-inserted strains, deletion strains, consomic lines, genome-wide knockout strains, and enhancer trap lines); and (iii) DNA (plasmids, vectors and genomic/cDNA clones). The numbers of records of resources vary according to the group of organisms, and the resources in classification (3) account for 95% of the total. The External DB column in Table 1 indicates databases that have one-way links or cross links to external comprehensive databases for the model organisms. The Collaborating DB inside the parenthesis indicates an external database with which the NBRP database collaborates.
Table 1.
Database name | Taxon | Wild/ inbred/ landrace | Mutants | Plasmid/ vector/ clones | Map (Physical or Linkage) | Gene (mutated genes/all genes) | Phenotype | SNPs | Blast service | Images | External DB (collaborating DB)[reference] | Featured contents/ services | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | NBRP-Macaque | Vertebrata-mammalia-primate | o | o | |||||||||
2 | NBRP-Mouse(Riken BRC) | Vertebrata-mammalia- murinae-mus | o | o | o | P | M | o | o | o | o | MGI[3], (IMSR, JMSR;http://www.shigen.nig.ac.jp/mouse/jmsr/) | |
3 | NBRP-Rat | Mammalia-murinae-rattus | o | o | o | P | M | o | o | o | o | RGD[4], (JMSR) | graphical display of the phenotype data |
4 | NBRP-Xenopus | Vertebrata-amphibia | o | o | o | ||||||||
5 | NBRP-Zebrafish | Vertebrata-actinopterygii-cypriniformes-danio | o | M | o | o | ZFIN[5] | ||||||
6 | NBRP-Medaka | Vertebrata-actinopterygii-beloniformes-oryzias | o | o | o | P | M | o | o | o | atlas, phylogenetic tree | ||
7 | NBRP-Ciona | Ascidiacea-ciona | o | o | M | o | o | (Ghost,[6] CIPRO;http://cipro.ibio.jp/new/) | |||||
8 | NBRP-Drosophila (DGRC) | Arthropoda-insecta-diptera-drosophilidae | o | o | o | P | A | o | o | o | o | FlyBase[7] | |
9 | NBRP-Silkworm (SilkwormBase) | Arthropoda-insecta-lepidoptera-bombyx | o | L | M | o | o | laeva period time | |||||
10 | NBRP-C. elegans | Pseudocoelomata-nematoda | o | M | Wormbase[8] | ||||||||
11 | NBRP-Rice (Oryzabase) | Viridiplantae-poaceae-oryzeae | o | o | P,L | A | o | o | o | Gramene[9], IRRI; http://beta.irri.org/index.php/Home/Welcome/Frontpage.html | developmental stage | ||
12 | NBRP-Barley | Viridiplantae-poaceae-triticeae-hordeum | o | o | L | o | o | o | phenotype data | ||||
13 | NBRP-Wheat (KOMUGI) | Viridiplantae-poaceae-triticeae | o | o | o | L | A | o | o | o | (TriFLDB[10]) | gene catalogue | |
14 | NBRP-Arabidopsis (Riken BRC) | Viridiplantae-brassicales-arabidopsis | o | o | o | o | o | ||||||
15 | NBRP-Chrysanthemum | Viridiplantae-asterids-chrysanthemum | o | o | o | ||||||||
16 | NBRP-Morning glory | Viridiplantae-solanales-ipomoeeae | o | o | L | M | o | o | |||||
17 | NBRP-Lotus/Glycine (LegumeBase) | Fabales-fabaceae | o | o | o | L | M | o | o | (miyakogusa.jp[11], Soybean Full-length cDNA database[12]) | |||
18 | NBRP-Tomato | Viridiplantae-solanales-lycopersicon | o | (KafTom;http://www.pgb.kazusa.or.jp/kaftom/, MiBASE[13]) | |||||||||
19 | NBRP-Algae | 14 phyla (eukaryota and bacteria) | o | o | o | photograph, phylogenetic tree | |||||||
20 | NBRP-Yeast | Fungi-ascomycota | o | o | P | A | SGD[14], geneDB[15] | ||||||
21 | NBRP-Cellular slime mold | Mycetozoa-dictyostelliida | o | dictyBase[16] | |||||||||
22 | NBRP-Prokaryote E. coli | Bacteria-proteobacteria | o | o | P | A | o | PEC[17], EcoGene[18], EcoCyc[19], COG[20], NCBI[1], SwissProt[21], GTOP[22], KEGG[23], InterPro[24] | |||||
23 | NBRP-Prokaryote B. subtilis | Bacteria-firmicutes-bacilli-bacillaceae | o | M | |||||||||
24 | NBRP-Pathogenic microbes | Bacteria and protozoa | o | o | o | ||||||||
25 | NBRP-General microbes (Riken BRC, JCM) | Bacteria | o |
Types and names (identifiers) of the resources, and distribution/deposition methods (such as MTA) are essential information for all resources. For DNA clones, the accession numbers in the DNA Data Bank, together with homologous sequences, are the items of information common to all organism groups. NBRP also includes some activities in which the distribution of resources requires a prescribed review procedure (such as the Human ES-cells and Macaque). Activities where resources cannot be obtained by proliferation, but require non-invasive sampling or sampling from dead animals, are provided only for research that meets prescribed standards [such as the Great Ape Information Network (GAIN)]. The Global Biodiversity Information Facility (GBIF; http://www.gbif.org/) is an activity that forms the Japanese node of an international project that is involved in collecting and creating a database on specimens and observation data held in museums. Information from the GBIF and resources from the GAIN and Macaque databases are generally not distributed, so that they are not included in BRW.
As illustrative examples of the features described above, the databases for rice, Escherichia coli, Caenorhabditi elegans and the rat are described in more detail below, and some details of the RIKEN BioResource Center databases are also given.
NBRP-rice (Oryzabase) (25)
Oryzabase, a comprehensive database that is currently used by many researchers, succeeded the Database of Resources and Trait Genes, which was launched in 1995. Genomic information has been added to the database on resources and genes so that it has become comprehensive and now functions as the NBRP database. The characteristic wild-rice collection consists of three different core collections. Core collection Rank 1 relates to 18 species from nine genomes and contains detailed phenotype data and many photographs. The collection of unique mutant strains includes strains for which trait genes have been identified, as well as strains that have phenotypes for which the responsible genes are not known. The former set of strains can be retrieved on the basis of genetic information, and the latter set of strains can be retrieved on the basis of the phenotype. The data on both sets of strains include photographs.
We constantly update the dictionary of trait genes with data extracted from published papers, and we have also been promoting the establishment of correspondences between trait genes and the accession numbers of DNA sequences, ORF numbers of genome projects and displays of chromosomes by using physical maps. Oryzabase also provides access to a Web site for online submission of new rice genes, which is based on the Gene Nomenclature System for Rice as determined by the Committee on Gene Symbolization, Nomenclature and Linkage of the Rice Genetics Cooperative (CGSNL) (26). Users can give feedback on an individual gene through the detailed pages for that gene.
Basic information on rice, including the definitions of the tissue-specific developmental stages (such as embryo, inflorescence, leaf, root, anther, ovule, pollen mother cell, stoma or vascular bundle) and information on tissue-specific or developmental stage-specific gene expression are also available through this web site.
A version of Textpresso, a text-mining system for scientific papers developed by the Generic Model Organism Database and adapted for use with rice, has been constructed and is available through Oryzabase. It provides access to a total of ∼20 000 rice-related papers (abstracts and titles) in PubMed, and is linked from the home page of Textpresso.
Oryzabase has links to two external databases: Gramene and IRRI. Genetic information is linked to the former database, and wild-strain information is linked to the latter. Oryzabase also allows BLAST searches to be made through all genomes or by chromosomes, provides a tool for extracting the specified region of genome sequences, and can provide downloadable text files of almost all information.
NBRP-E. coli
Of all the organisms in the NBRP collection, E. coli was the first to have its genome sequence determined. A feature of NBRP-E. coli is its genome-wide genetically engineered strain collections. At the web site of NBRP-E. coli (http://www.shigen.nig.ac.jp/ecoli/strain/), users can browse outlines and lists of collections and search full data or perform queries by specifying details. Although online requests are available, these often take time because of the complexity of the MTA process. For the convenience of users, a tracking system has been introduced that allows users to check the progress of their requests in real time.
The information center maintains profiling of E. coli chromosomes (PEC), an information site of a project on essential E. coli genes (27). Through PEC, the information center has made genome maps and genetic information available. The resource information contained in NBRP-E. coli is cross linked with PEC to permit access to resources through the maps. For example, by clicking the deletion regions of extensive deletion mutants, or by clicking the gene parts of mutants mutated by gene units, the user can access detailed pages that are linked to request pages for the resources. PEC has many links to external databases, including NCBI, UniProtKB, COG, EcoCyc, GTOP, EcoGene and KEGG. We also perform similarity searches on gene sequences to add domain information from Pfam and PROSITE. In addition, PEC provides BLAST search through two different strains, MG1655 and W3110; all ORF, and essential genes. The tool for specifying the desired fragments of genome sequences is also available. Almost all the information in the database can be downloaded in the form of text files.
NBRP-C. elegans
NBRP-C. elegans (http://www.shigen.nig.ac.jp/c.elegans/) is a smaller database than the two databases discussed above. Each record consists only of the allele of the deletion strain, a systematic identification tag for the gene (the CGC name), information on the position of the gene on the chromosome, information on the positions of deletion regions, and primer information. However, the CGC name, allele and sequence are cross linked to the corresponding page of WormBase, a comprehensive database on C. elegans, providing an easy access to that database. In the case of this resource, requested mutants are isolated from the pool after the request. Users can check on the state of progress of isolation online. Mutants that have been isolated once will be listed as available mutants (isolated), so that other users can request the same mutants. NBRP-C. elegans is unique in that information on researchers who have received mutants from this project is made public online. Because almost all papers in which the resources are used contain the names of these resources, and the information is fed back from paper-registration sites, the C. elegans database automatically reflects information in the RRC. This is a model case of good circulation between resources and researches.
NBRP-rat
Because the database of rat resources (http://www.anim.med.kyoto-u.ac.jp/NBR/) contains substantial characterization data from individual resources and has many tools for browsing this data, it is efficient in allowing researchers to find the resources best suited to their research from a range of trait information. For example, the top page displays a pie chart that shows a breakdown of the research fields in which the resources are used, and by clicking a research field of interest, the user can obtain a list of resources related to that particular research field. Another example, ‘Phenome Project’ provides 109 items of physiological, behavioral and anatomical phenome data in nine tables and in strain-distribution maps with the two items selected by the user as the abscissa and ordinate axes. Users can access detailed pages for strains by clicking data points in the maps. ‘Genome’ provides access to polymorphism data for 357 simple sequence-length polymorphism (SSLP) markers, obtained from investigations on more than 150 strains. A phylogenetic tree, constructed from polymorphism data, is also available, through which users can access detailed pages on resources.
Besides characterization data, the database also contains detailed pages on resources, including the preservation status, genetic status, research category, origin, genotype, references and a link to the Rat Genome Database (RGD). Recent enhancements include a BAC browser for F344 and LE BAC end sequences, and the addition of functional polymorphism data obtained by comparing 16 disease-associated gene mutations among multiple strains. The ENU-induced mutant archive (28) is also provided at this site.
NBRP-RIKEN BRC
The RIKEN BioResource Center (BRC) was established in 2001 with the aim of becoming the finest core bioresource facility in the world. Since then, it has been engaged in collecting bioresources developed mainly by Japanese scientists. These bioresources include living strains of mice and Arabidopsis, human and animal cells, DNA materials and various microbes for which the RIKEN BRC has been designated the national core facility by the NBRP. The RIKEN BRC preserves these bioresources under conditions of strict quality control for provision to the scientific community. The RIKEN BRC also collects information on the whereabouts and characteristics of the bioresources, constructs databases, and offers the bioresource information to the research community within and outside Japan.
The Animal Search System (http://www2.brc.riken.jp/lab/animal/search.php) allows keyword searches to be performed on >2000 strains of mice available from the RIKEN BRC, including transgenic, knockout, inbred, wild-derived, ENU mutant and congenic strains. The system also provides detailed information on each strain, such as the strain name, description, gene details, references, availability status, health report, depositor, specific terms and conditions for distribution, and image(s), which capture the characteristics of the strain. Some gene symbols have links to the Mouse Genome Informatics (MGI) database, which leads to further detailed information on the particular gene. As well as being publicly available, the up-to-date information on mice is also sent to the International Mouse Strain Resource (IMSR, http://www.findmice.org/) database, to which the RIKEN BRC is a contributing repository. The RIKEN BRC Mouse Phenome Database (RMPD) (http://www.brc.riken.jp/rmpd/mouse_phenome_top.html) contains phenotypic data on the physiology, biochemistry, hematology and morphology of inbred, mutant, wild-derived and recombinant inbred strains of mice, and enables biomedical researchers to find appropriate strains for their researches.
The Arabidopsis transposon tagged lines can be searched in a web-based catalogue (http://www.brc.riken.jp/lab/epd/catalog/transposon.html), where information on insert positions of transposons and adjacent genes can be obtained for more than 15 000 lines. The SENDAI Arabidopsis Seed Stock Center (SASSC) database (http://www.brc.riken.jp/lab/epd/SASSC/) provides information on its collection (wild type and mutant), such as strain name, region of collection and phenotypic remarks. The RIKEN Arabidopsis full-length cDNA (RAFL) clone database (http://www.brc.riken.jp/lab/epd/catalog/cdnaclone.html) contains more than 250 000 clones and the users can retrieve clones by NCBI accession number, AGI code, clone name or sequence homology. The Systematic Consolidation of Arabidopsis and other Botanical REsources (SABRE) database (http://saber.epd.brc.riken.jp/sabre/SABRE0101.cgi) offers searches of BRC plant DNA resources across the species that it contains.
The database of human and animal cells (http://www2.brc.riken.jp/lab/cell/search.php) has its origins in the database of the RIKEN Cell Bank, which began collecting and distributing cell resources in 1987. Starting from a stand-alone database for internal use, it has been transformed to allow searching of cell resources and the provision of the associated information through the Internet. Because the Cell Bank began its activity as a division of the BRC, its stock list has grown with the addition of various new kinds of cell resources, such as Epstein–Barr virus-transformed B cell lines, human somatic stem cells, embryonic stem cell lines and induced pluripotent stem cell lines. The function of the database has accordingly been enhanced to allow the presentation of other items of information that differ from resource to resource. The information available therefore depends on the resource; for example, information on the origin, morphology, culture conditions, restrictions on distribution, results of short tandem repeat analyses, images, etc. can be obtained for a conventional cell line resources.
Before the BRC was established, DNA materials were collected and distributed as an activity of the RIKEN DNA Bank. A keyword search system (http://www.brc.riken.jp/lab/dna/search/index.html) is available for DNA clones, vectors and recombinant adenoviruses. Resources are retrieved by plasmid name, by gene name or symbol, or by accession number. In the Geneset Bank database (http://www.brc.riken.jp/lab/dna/en/GENESETBANK/index.html), more than 20 illustrations for principal gene pathways are implemented, and DNA materials can be easily found according to the gene pathways to which they belong.
The microbial resource collection in RIKEN was founded in 1980 as the Japan Collection of Microorganisms (JCM), and the construction of a database for the collection began at its inception. The Web-based online catalog database (http://www.jcm.riken.jp/JCM/catalogue.shtml) was launched in 1995, and it has since been improved and updated. It now provides access to information on more than 11 000 available strains, which are searchable by their accession number, scientific name and keywords on strain data. It is also possible to search for JCM strains that are equivalent to those in other culture collections. The database contains various useful items of information about strains, such as culture media and conditions, history (including isolation source), taxonomic data and references. The DNA sequence data, critical for the phylogeny and the genome data are linked to the DDBJ database for each strain.
Future directions
We will continue to improve the content of individual databases and upgrade the functions available at the integrated database-retrieval site. We also hope to expand external access to the databases and expand collaboration with other databases to permit access to our resources by a wider range of users. In particular, we will examine a new possibility for interconnecting reference data and resource databases with the aim of construction a virtual international network.
FUNDING
Ministry of Education, Culture, Sports, Science and Technology (MEXT) (to National Bio Resource Project). Funding for open access charge: The Management Expenses Grant for National University Cooperation, MEXT.
Conflict of interest statement. None declared.
REFERENCES
- 1.Benson A, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW. GenBank. Nucleic Acids Res. 2009;37:D26–D31. doi: 10.1093/nar/gkn723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Sugawara H, Ikeo K, Fukuchi S, Gojobori T, Tateno Y. DDBJ dealing with mass data produced by the second generation sequencer. Nucleic Acids Res. 2009;37:D16–D18. doi: 10.1093/nar/gkn724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Blake JA, Bult CJ, Eppig JT, Kadin JA, Richardson JE the Mouse Genome Database Group. The Mouse Genome Database genotypes:phenotypes. Nucleic Acids Res. 2009;37:D712–D719. doi: 10.1093/nar/gkn886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Dwinell MR, Worthey EA, Shimoyama M, Bakir-Gungor B, DePons J, Laulederkind S, Lowry T, Nigram R, Petri V, Smith J, et al. The Rat Genome Database 2009: variation, ontologies and pathways. Nucleic Acids Res. 2009;37:D744–D749. doi: 10.1093/nar/gkn842. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Sprague J, Bayraktaroglu L, Bradford Y, Conlin T, Dunn N, Fashena D, Frazer K, Haendel M, Howe DG, Knight J, et al. The Zebrafish Information Network: the zebrafish model organism database provides expanded support for genotypes and phenotypes. Nucleic Acids Res. 2008;36:D768–D772. doi: 10.1093/nar/gkm956. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Satou Y, Kawashima T, Shoguchi E, Nakayama A, Satoh N. An integrated database of the ascidian, Ciona intestinalis: towards functional genomics. Zoolog Sci. 2005;22:837–843. doi: 10.2108/zsj.22.837. [DOI] [PubMed] [Google Scholar]
- 7.Tweedie S, Ashburner M, Falls K, Leyland P, McQuilton P, Marygold S, Millburn G, Osumi-Sutherland D, Schroeder A, Seal R, et al. FlyBase: enhancing Drosophila Gene Ontology annotations. Nucleic Acids Res. 2009;37:D555–D559. doi: 10.1093/nar/gkn788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Rogers A, Antoshechkin I, Bieri T, Blasiar D, Bastiani C, Canaran P, Chan J, Chen WJ, Davis P, Fernandes J, et al. WormBase2007. Nucleic Acids Res. 2008;36:D612–D617. doi: 10.1093/nar/gkm975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Liang C, Jaiswal P, Hebbard C, Avraham S, Buckler ES, Casstevens T, Hurwitz B, McCouch S, Ni J, Pujar A, et al. Gramene: a growing plant comparative genomics resource. Nucleic Acids Res. 2008;36:D947–D953. doi: 10.1093/nar/gkm968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Mochida K, Yoshida T, Sakurai T, Ogihara Y, Shinozaki K. TriFLDB: a database of clustered full-length coding sequences from Triticeae with applications to comparative grass genomics. Plant Physiol. 2009;150:1135–1146. doi: 10.1104/pp.109.138214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Sato S, Nakamura Y, Kaneko T, Asamizu E, Kato T, Nakao M, Sasamoto S, Watanabe A, Ono A, Kawashima K, et al. Genome structure of the legume, Lotus japonicus. DNA Res. 2008;15:227–239. doi: 10.1093/dnares/dsn008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Umezawa T, Sakurai T, Totoki Y, Toyoda A, Seki M, Ishiwata A, Akiyama K, Kurotani A, Yoshida T, Mochida K, et al. Sequencing and analysis of approximately 40,000 soybean cDNA clones from a full-length-enriched cDNA library. DNA Res. 2008;15:333–346. doi: 10.1093/dnares/dsn024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Yano K, Watanabe M, Yamamoto N, Tsugane T, Aoki K, Sakurai N, Shibata D. MiBASE: a database of a miniature tomato cultivar Micro-Tom. Plant Biotechnol. 2006;23:195–198. [Google Scholar]
- 14.Hong EL, Balakrishnan R, Dong Q, Christie KR, Park J, Binkley G, Costanzo MC, Dwight SS, Engel SR, Fisk DG, et al. Gene Ontology annotations at SGD: new data sources and annotation methods. Nucleic Acids Res. 2008;36:D577–D581. doi: 10.1093/nar/gkm909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hertz-Fowler C, Peacock CS, Wood V, Aslett M, Kerhornou A, Mooney P, Tivey A, Berriman M, Hall N, Rutherford K, et al. GeneDB: a resource for prokaryotic and eukaryotic organisms. Nucleic Acids Res. 2004;32:D339–D343. doi: 10.1093/nar/gkh007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Fey P, Gaudet P, Curk T, Zupan B, Just EM, Basu S, Merchant SN, Bushmanova YA, Shaulsky G, Kibbe WA, et al. dictyBase—a Dictyostelium bioinformatics resource update. Nucleic Acids Res. 2009;37:D515–D519. doi: 10.1093/nar/gkn844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Yamazaki Y, Niki H, Kato J. Profiling of Escherichia coli Chromosome Database. Methods Mol. Biol. 2008;416:385–389. doi: 10.1007/978-1-59745-321-9_26. [DOI] [PubMed] [Google Scholar]
- 18.Rudd KE. EcoGene: a genome sequence database for Escherichia coli K-12. Nucleic Acids Res. 2000;28:60–64. doi: 10.1093/nar/28.1.60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Keseler IM, Bonavides-Martínez C, Collado-Vides J, Gama-Castro S, Gunsalus RP, Johnson DA, Krummenacker M, Nolan LM, Paley S, Paulsen IT, et al. EcoCyc: a comprehensive view of Escherichia coli biology. Nucleic Acids Res. 2009;37:D464–D470. doi: 10.1093/nar/gkn751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedovm SI, Nikolskaya AN, et al. The COG database: an updated version includes eukaryotes. BMC Bioinformatics. 2003;4:41. doi: 10.1186/1471-2105-4-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.UniProt Consortium. The Universal Protein Resource (UniProt) 2009. Nucleic Acids Res. 2009;37:D169–D174. doi: 10.1093/nar/gkn664. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Fukuchi S, Homma K, Sakamoto S, Sugawara H, Tateno Y, Gojobori T, Nishikawa K. The GTOP database in 2009: updated content and novel features to expand and deepen insights into protein structures and functions. Nucleic Acids Res. 2009;37:D333–D337. doi: 10.1093/nar/gkn855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kanehisa M, Ataki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T, et al. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2008;36:D480–D484. doi: 10.1093/nar/gkm882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Hunter S, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Das U, Daugherty L, Duquenne L, et al. InterPro: the integrative protein signature database (2009) Nucleic Acids Res. 2009;37:D224–D228. doi: 10.1093/nar/gkn785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Yamazaki Y, Kurata N. Oryzabase, An Integrated Biological and Genome Information Database for Rice. Plant Physiol. 2006;140:12–17. doi: 10.1104/pp.105.063008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Susan RM CGSNL (Committee on Gene Symbolization, Nomenclature and Linkage, Rice Genetics Cooperative. Gene nomenclature system for rice. Rice. 2008;1:72–84. [Google Scholar]
- 27.Kato J, Hashimoto M. Construction of long chromosomal deletion mutants of Escherichia coli and minimization of the genome. Methods Mol. Biol. 2008:279–293. doi: 10.1007/978-1-59745-321-9_18. [DOI] [PubMed] [Google Scholar]
- 28.Mashimo T, Yanagihara K, Tokuda S, Voigt B, Takizawa A, Nakajima R, Kato M, Hirabayashi M, Kuramoto T, Serikawa T. An ENU-iuduced mutant archive for gene targeting in rats. Nat. Genet. 2008;40:514–515. doi: 10.1038/ng0508-514. [DOI] [PubMed] [Google Scholar]