Abstract
Background
Plant genome information is fundamental to plant research and development. Along with the increase in the number of published plant genomes, there is a need for an efficient system to retrieve various kinds of genome-related information from many plant species across plant kingdoms. Various plant databases have been developed, but no public database covers both genomic and genetic resources over a wide range of plant species.
Main body
We have developed a plant genome portal site, Plant GARDEN (Genome And Resource Database Entry: https://plantgarden.jp/en/index), to provide diverse information related to plant genomics and genetics in divergent plant species. Elasticsearch is used as a search engine, and cross-keyword search across species is available. Web-based user interfaces (WUI) for PCs and tablet computers were independently developed to make data searches more convenient. Several types of data are stored in Plant GARDEN: reference genomes, gene sequences, PCR-based DNA markers, trait-linked DNA markers identified in genetic studies, SNPs, and in/dels on publicly available sequence read archives (SRAs). The data registered in Plant GARDEN as of March 2023 included 304 assembled genome sequences, 11,331,614 gene sequences, 419,132 DNA markers, 8,225 QTLs, and 5,934 SNP lists (gvcf files). In addition, we have re-annotated all the genes registered in Plant GARDEN by using a functional annotation tool, Hayai-Annotation, to compare the orthologous relationships among genes.
Conclusion
The aim of Plant GARDEN is to provide plant genome information for use in the fields of plant science as well as for plant-based industries, education, and other relevant areas. Therefore, we have designed a WUI that allows a diverse range of users to access such information in an easy-to-understand manner. Plant GARDEN will eventually include a wide range of plant species for which genome sequences are assembled, and thus the number of plant species in the database will continue to expand. We anticipate that Plant GARDEN will promote the understanding of genomes and gene diversity by facilitating comparisons of the registered sequences.
Supplementary Information
The online version contains supplementary material available at 10.1186/s12870-023-04392-8.
Keywords: Database, Plant, Genome, Gene, Marker, SNPs, Cross-search
Background
The growing area of plant research and development (R&D) is characterized by two major types of diversity: a diversity of species to be studied and a diversity of research communities, ranging from groups performing basic research to those involved in industrial applications. In particular, the fact that the number of plant species used for industrial purposes—including as food and energy raw materials—is much larger than the corresponding number of animal species used in industry, is a major factor in the great diversity of both species and communities targeted for R&D.
Plant genome information is fundamental to plant R&D. Next-generation sequencing (NGS) technologies, which were first used in the research field around 2010, have accelerated the amount of plant genome information generated worldwide. Especially in recent years, de novo assembly of genome sequences and resequencing of numerous varieties and lines have been conducted in many plant species [1]. For example, as of March 2023, the NCBI genome database (https://www.ncbi.nlm.nih.gov/genome/) had entries for 3,089 genomes for 1360 Viridiplantae species, 43 genomes for 20 Rhodophyta species, and eight genomes for 11 Glaucophyta species. This high availability of data has led many biologists and industries to accelerate the introduction of genome information in their research.
In conjunction with the development of NGS technologies, several plant genome databases covering a broad diversity of plant species have been developed, such as Phytozome [2], Ensembl Plants [3], GreenPhylDB [4], and PlabiPD (https://www.plabipd.de/). For example, in Phytozome, 147 species and 295 genomes were available in March 2023, with lists of genes showing high homology to the selected gene. On the other hand, at the same time, Ensembl Plants has registered 132 genomes derived from 108 species. Ensembl Plants also provides comparative genomic analyses that allow researchers to compare genetic features across different plant species. PlabiPD is a unique database of plant genomic information, providing graphical representations of the status of plant genome decoding using timelines or phylogenetic positions. It offers a visual representation of plant genome information that is not found in other plant genome databases. These databases provide comprehensive phylogenetic relationships of plant genes and species based on evolutionary information such as orthologs and paralogs. However, they do not provide information related to genetic studies, such as genome-wide SNP lists and DNA markers.
Comprehensive databases that provide the data needed for both genomic and genetic analyses, including genes, DNA markers, and trait-related loci along with the reference genome sequence, have been developed as family- or species-specific databases, such as Gramene for Poaceae [5], SGN (Sol Genomics Network) for Solanaceae [6], and LIS (Legume Information System) for Fabaceae [7]. However, these databases are only for plant species belonging to specific taxa and do not allow for sequence comparisons across species in the plant kingdom.
As the number of published plant genomes continues to increase, there is a need for an efficient system to retrieve various genome-related data from many plant species across plant kingdoms. The Plant GARDEN database (Genome And Resource Database Entry: https://plantgarden.jp) presented in this paper was developed as a portal website to provide genome-related information such as genomes and DNA markers of various plant species covering many families, as well as tools for comparative analysis of the information both within and between species. In addition, we have developed a web-based user interface (WUI) that enables a diverse range of users to access information in an easy-to-understand manner, in the anticipation that genome analysis will be routinely performed not only by R&D specialists but also in an educational context and other relevant areas.
Construction and content
Implementation
The web address of Plant GARDEN is https://plantgarden.jp. Plant GARDEN was developed in Apache (https://db.apache.org/), MySQL (https://www.mysql.com/), and PHP (https://www.php.net/) and built by XAMPP (https://www.apachefriends.org/) on a Linux server with RHEL (Rad Hat Enterprise Linux). The database for the contents was built using MariaDB (https://mariadb.org/), a MySQL-compatible database. A PHP framework, Laravel (https://laravel.com/), was used to construct the WUI. Different types of WUI were developed for display on personal computers (PCs) and tablet computers (hereinafter tablets) to make data searches more convenient (Fig. 1). The system automatically selects the appropriate version (PC or tablet) of the display based on the type of operating system used. Some pages, such as the browser, were developed only for the PC version but can also be accessed from the tablet version. The available languages are English and Japanese.
The data structure was designed in the REST (REpresentational State Transfer) style, and restful API is used to display the data in the tablet version. Since the PC version was developed early in the creation of Plant GARDEN, the data are displayed by direct query to the SQL database. A keyword search system named pges was developed for rapid full-text search in Plant GARDEN (Supplementary Fig. S1). Elasticsearch (https://www.elastic.co/) was used as the search engine, and an API server, crawler, and indexer were developed with perl scripts.
Outline of the contents
Plant GARDEN will eventually include a wide range of plant species for which genome sequences are assembled, and thus the number of plant species in the database will continue to expand (Supplementary Fig. S2). At the end of each year, we report the number of data added that year on the News pages of Plant GARDEN. The database contains the following types of data: 1) assembled genome sequences, 2) gene sequences, 3) base variants (SNPs and in/dels) detected on SRAs (Sequence Read Archives) against an assembled genome, 4) PCR-based DNA markers, 5) trait-related loci identified by genetic studies, and 6) linkage maps (Fig. 2). The sequence and marker information were collected in Plant GARDEN by curating public data except for base variants on SRAs. The genus, family, and common name to which each stored plant species belongs are also provided as general information.
As of March 28, 2023, the available information included 304 genome sequences from 234 species, 11,331,614 gene sequences from 162 species, 419,132 DNA markers from 72 species, 8,255 trait-related loci from 28 species, linkage maps from 1 species, and variant information (gvcf) from 105 species, consisting of 5,934 data points (Table 1). The stored data are also listed in the Data List, which is accessible at the top page of Plant GARDEN. The files obtained from original sources, including genome and gene sequences and gene annotation files, can be downloaded from the download site on Plant GARDEN.
Table 1.
Data type | Number of species | Number of data |
---|---|---|
Genome | 234 | 304 |
Gene | 162 | 11,331,614 |
Base variants on SRA (VCF) | 106 | 108 |
Base variants on SRA (gVCF) | 105 | 8,934 |
DNA marker | 72 | 419,132 |
Trait-linked DNA marker (QTL) | 28 | 8,255 |
Assembled genome sequences
Most of the genome sequences in Plant GARDEN are collected from public databases available on the condition that they were assembled at the chromosome level, but several scaffold-level sequences were also registered in Plant GARDEN upon request from users. The source name and URL of the site providing the sequence data are shown on each genome sequence page, with the statuses of the assembled genomes and information on the source article along with contact information to reach the corresponding author (Supplementary Fig. S3). In cases in which the genome sequences were directly submitted to NCBI without being published in an article, the entry was designated as a "Direct submission" in the article information field. The quality evaluation of assembled genomes and gene sequences was performed by our team using BUSCOs [8].
Functional re-annotation of genes associated with OrthoDB
In general, inferred genes on a genome are functionally annotated by the publisher of the genome sequence. The annotation criteria, databases used, and analysis items differ among publishers, which makes it difficult to compare orthologous relationships of genes based on the provided annotation information. Thus, we have re-annotated functions of all the genes registered in Plant GARDEN by using a functional annotation tool, Hayai-Annotation [9], based on KusakiDB (https://github.com/aghelfi/kusakiDB). KusakiDB is a database of orthologous genes in plants, consisting of sequence information derived from OrthoDB (https://www.orthodb.org/), UniProt (https://www.uniprot.org/), and RefSeq (https://www.ncbi.nlm.nih.gov/refseq/). Re-annotation of gene function is carried out on the gene sequence file (pep file) created by the publisher on the genome, and the gene sequence names are used as defined by the publisher. The re-annotated genes are associated with OG (Ortholog Group) IDs on OrthoDB to find ortholog gene sequences in other species (Supplementary Fig. S4). In addition, Hayai-Annotation detects the most similar gene from among the genes registered in UniProtKB (https://www.uniprot.org/help/uniprotkb) via KusakiDB, and estimates the GO, KEGG, Pfam, Interpro ID, EC, and TF families. If there are no similar gene sequences in UniProtKB, it refers to RefSeq. When referring to RefSeq, functional annotation is carried out using InterProscan (https://www.ebi.ac.uk/interpro/) for GO, Pfam, and Interpro ID.
Base variants on SRA
Base variants on SRAs are unique contents generated by the Plant GARDEN project. We collected a maximum of 100 SRA sequences from either the NCBI database (https://www.ncbi.nlm.nih.gov/sra) or DDBJ (https://www.ddbj.nig.ac.jp/dra/index-e.html), prioritizing those with higher genome coverage and a greater use of common plant materials. The variants were identified by a combination of Bowtie2 [10] and SAMTools [11] for mapping and variant calls, respectively, or by an Illumina DRAGEN pipeline (Illumina, San Diego, CA, USA). When multiple reference genome sequence files were available for a species at the time of analysis, we selected a single sequence file based on assembly quality and the file's significance in the research community. A gVCF (genomic variant call format) file was created for each SRA file, and a VCF (variant call format) file was subsequently created by integrating all the gVCF files created in a species (Supplementary Fig. S5). VCF files provided the variant information in a specific format, making it difficult to understand the descriptions, especially for users who were not specialists in genetics. Therefore, we converted the polymorphic allele information in the vcf file to the bases with the letters A, G, T, and C and provided this information as a text table. Although we created most of the gvcf and vcf files in Plant GARDEN, Plant GARDEN also accepts the registration of publicly available vcf and gvcf files.
PCR-based DNA markers
PCR-based DNA markers, categorized as SSR (simple sequence repeats), CAPS/dCAPS (cleaved amplified polymorphic sequences), SCARs (sequence characterized amplified regions), and others were collected by manual curation of published articles. The information on markers available in PGDBj [12] and the Kazusa Marker DataBase [13] were also registered in Plant GARDEN. Each reference genome in each species was selected on the basis of assembly quality and importance to the research community. Marker positions on the selected reference genomes were estimated by BLAST analysis of primer or target sequences of the markers with the default parameters and the '-task blastn-short' option. The following criteria were also used to determine the positions: 1) The maximum mismatch is equal to or less than 2 bp. 2) The distances between forward and reverse primers were within 5 Kb. If multiple hits were observed, the top hit result was selected, and the number of the hit positions was given in the comment column. Marker names, types, positions on the reference genome, and the name of the reference genome are shown in a marker list and marker details pages (Supplementary Fig. S6). Primer or target sequences and source manuscript information are also available on the marker details page.
Trait-related loci
Trait-related loci identified by genetic studies, such as interval mapping or genome-wide association studies (GWAS), were also registered in Plant GARDEN by collecting the information with manual curation of the published articles. The trait types were classified as stress, fertility, yield, morphology/growth, quality/components, and others. The positions of the registered loci on the reference genome are graphically shown on the trait-related loci list page, and a locus list is shown under the graphical view (Supplementary Fig. S7A and B). The locus details page shows the locus name, trait name, trait category, analysis method, population name, population type, position on the genome estimated via nearest markers, position on the linkage map (if available in the source manuscript), and source manuscript (Supplementary Fig. S7C).
Utility
Search by species
Several processes are available for searching the contents in Plant GARDEN. We considered that searches by species would be the most frequently performed, because much plant R&D targets specific species. Therefore, various types of registered data are organized and aggregated on the species page set for each plant species.
A list of plant species registered in Plant GARDEN is displayed by clicking on the “Search by species” button on the top page (Figs. 1 and 3A). The target species can be searched by scrolling through the list or using the keyword search window. An alphabetized search by scientific or common name can also be performed by selecting the "Sorting” arrow on the top row of the species list. Once a target species is selected from the list, a species page is shown (Fig. 3B). On the species pages, all contents described above in the “Construction and content” section are accessible. Links to BLAST searches against the genomes listed on the pages and major DBs for the species are also shown.
Search by genes
The “search by genes” function is accessible on the top page or species pages. In cases in which users perform gene searches from the top page, all gene files registered in Plant GARDEN are targeted for search. Meanwhile, gene files in a specific species are targeted in cases where a gene search is performed using the “All” button in a gene search column on a species page. A target gene file can be chosen by selecting the genome sequence name.
Two types of gene searches are available: keyword searches and refined searches. The gene keyword search targets words displayed on the gene pages (Fig. 4A). For example, all gene sequences identified on targeted genome sequences are listed when the genome sequence name is used as a keyword, because every gene sequence page displays the name of the genome sequence (e.g., Lj3.0 in Supplementary Fig. S4A). However, if the specificity of the character strings in the genome sequence name is low, gene sequences other than those on the target genome sequence may be hit, because the keyword search simply searches pages containing a target key word. Gene searches by assembled genome version, gene ontology, and KEGG ontology are available in the “advanced search” function shown under the keyword search (Fig. 4B).
A sample data comparison by JBrowse
In Plant GARDEN, we use two genome browsers: JBrowse [14] and TASUKE + [15]. JBrowse is an excellent system for displaying and comparing diverse information and is used to visualize assembled genome sequences, gene models, DNA markers, and variants on SRA identified on the selected genome. TASUKE + is also a well-designed browser that displays multiple genomic and genetic resources, and is particularly suited to displaying variants. Therefore, TASUKE + is used to display vcf files. The data items currently displayed in JBrowse are as follows: reference sequence, gene models, markers, and base variants. At present, when multiple genomes are registered within a single species, marker and variant information is displayed only on the single genome that we have selected. For other genome sequences, only the genome sequence and gene model can be viewed on JBrowse. This is because the location information of markers or variants on the genome is determined for only the selected genomes as described in the section entitled “Base variants on SRA”.
An example of the utility of JBrowse for genetic study is shown in Fig. 5. Leal-Bertioli et al. (2009) identified a QTL, cp4.2, related to late leaf spot resistance by using an F2 mapping population derived from a cross between Arachis duranensis and A. stenosperma [16]. The cp4.2 information is registered in Plant GARDEN, along with information on the nearest markers of RN5H02 and TC9E08 (Fig. 5A). The position of RN5H02 (the Marker ID in Plant GARDEN is t3818.M014442.1) on the A. hypogaea genome (Tifrunner.gnm1.KYV3) was determined to be around 102,722,262 bp of chromosome Arahy.04. Then genome sequences and genes near RN5H02 were confirmed on JBrowse. The nearest gene located on RN5H02, arahy.J4ZAEP.1, was annotated as an “uncharacterized protein”. However, the gene arahy.DVK8Y6.1, whose function was annotated as NAD-dependent epimerase/dehydratase in OrthDB, was found on a 41 kb upper region of RN5H02 by looking for genes on the JBrowse screen (Fig. 5B). NAD-dependent epimerase/dehydratase is considered to affect cell-surface properties [17] and was reported as an enzyme related to disease resistance in Lens culinaris, which belongs to Fabaceae, the same family to which genus Arachis belongs [18].
In 2009, when Leal-Bertioli et al. published their manuscript, no chromosome-level genome sequences were available. Therefore, they searched for the nearest candidate disease-resistance genes on the QTLs by mapping resistance gene analog (RGA) markers, which were designed on the nucleotide binding site domain (NBS). However, no RGA markers were determined near the cp4.2 QTL. On the other hand, Plant GARDEN makes the search for candidate genes much simpler than when Leal-Bertioli et al. (2009) published their paper. For further experiments to determine the causative mutations of the traits, the variants identified on SRA would provide useful information for identifying candidate causal variants (Fig. 5C).
BLAST search
BLAST search pages can be accessed from the top and species pages. All sequences files registered in Plant GARDEN are listed on the BLAST search page accessed from the top page. This BLAST page can be used for cross-species searches. Meanwhile, species-specific files are listed on the BLAST search pages accessed from the species pages. The available types of searches are BLASTN, BLASTP, BLASTX, TBLASTN, and TBLASTX. The input query sequences are submitted to a server at the Kazusa DNA Research Institute, and results are returned to a Plant GARDEN page (Supplementary Fig. S8A).
In most BLAST searches, users input query sequences on a submission column by copying and pasting a query sequence from a source text. However, this operation is difficult to implement on the tablet version of the website. Therefore, the tablet version converts the text characters of a query shown in an image into text data using an OCR (Optical Character Reader). The OCR engine was developed based on Google Cloud Vision API (https://cloud.google.com/vision/). We implemented our own API that sends the acquired image data to the Google cloud and displays the obtained results on the Plant GARDEN page (Supplementary Fig. S8B).
Tool list
Along with the widespread use of NGS technologies, the demand for self-analysis of NGS data by nonbioinformatic specialists has increased. However, many of those nonbioinformatic specialists have difficulties in setting up an analysis environment and in learning commands for analysis. Therefore, Plant GARDEN provides several analysis tools operating on its GUI (https://plantgarden.jp/en/list/tool). Two types of tools are available: those for the cloud and those for local environments. We expect that Plant GARDEN will contribute to a deeper understanding of genomic and genetic studies in various fields of expertise.
Conclusion
Plant GARDEN is currently being developed and maintained through research funds from Kazusa DNA Research Institute as of 2023. The aim of Plant GARDEN is to provide plant genome information to the fields of plant science as well as to plant-based industries, education, and other relevant areas. Therefore, we have designed a web-based user interface (WUI) that allows a diverse range of users to access such information in an easy-to-understand manner. Plant GARDEN contains genome and gene information for 235 species as ofMarch 2023. At present, orthologous genes across species can be searched for on OrthoDB using the Ortho group unique ID associated with each gene, or by performing a BLAST search specifying multiple species sequences from the top page. Additionally, we plan to soon enable searches for orthologous genes within Plant GARDEN using the Ortho group unique IDs.
We aim to store as much genomic information in Plant GARDEN as possible. However, because of the limited amount of data we can work with, we are prioritizing data in two metrics: high-quality assembly, such as scaffolding at the chromosome level, or importance in the research community. For the latter metric, we consider two factors: high-frequency use in research communities and diversity in registered species. Because our knowledge is limited, we place importance on direct requests and opinions from users. We receive requests via e-mail (plantgarden@kazusa.or.jp) or through the questionnaire form at the Plant GARDEN site. We expect that Plant GARDEN will promote the understanding of genomes and gene diversity by facilitating comparisons of sequences registered in the DB.
Supplementary Information
Acknowledgements
We thank Keisuke Bunya, Nyok Sean Lau, Padmalatha Koilkonda, Akihiro Miwa, Marina Iwasa, Mashiro Okada, Yuka Ikezaki, Youko Shima, Ikuhiro Shimamoto, and Masaomi Hatakeyama for curating the contents registered in Plant GARDEN. We also thank Sho Torii, Kana Takagi and Seki Nakayama of Bascule Inc. and Komei Sato of Kappuku for their assistance with designing and coding during Plant GARDEN construction.
Abbreviations
- PC
Personal Computer
- SNP
Single Nucleotide Polymorphism
- QTL
Quantitative Trait Locus
- DB
Data Base
- PHP
Hypertext Preprocessor
- API
Application Programming Interface
- SQL
Structured Query Language
- URL
Uniform Resource Locator
- BUSCO
Benchmarking Universal Single-Copy Orthologs
- GO
Gene Ontology
- KEGG
Kyoto Encyclopedia of Genes and Genomes
- EC
Enzyme Commission
- TF
Transcription Factor
- DDBJ
DNA Data Bank of Japan
- VCF
Variant Call Format
- BLAST
Basic Local Alignment Search Tool
- GUI
Graphical User Interface
Authors' contributions
HI, DH, JF, SY, YN and SS contributed to the DB design and content curation. MY and TF contributed to the DB construction and server management. MK contributed to the DB construction, data curation, and creation of vcf files. HH contributed to the server management and content curation. AG contributed to the functional gene annotation. TT and ES contributed to the development of the keyword search. AN contributed to the ortholog gene search. AK contributed to the creation of vcf files. YT contributed to the implementation of Blast search. ST contributed to the organization of the project. SNI was the project organizer and the corresponding author.
Funding
This work was supported by research funding from the Kazusa DNA Research Institute for the DB development, maintenance, and manuscript writing. Additionally, the DB development from 2017 to 2021 was funded by the Database Integration Coordination Program (DICP) of The National Bioscience Database Center (NBDC) at the Japan Science and Technology Agency (JST).
Availability of data and materials
The database and web interface are available at https://plantgarden.jp/.
Declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no conflict of interest.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Marks RA, Hotaling S, Frandsen PB, VanBuren R. Representation and participation across 20 years of plant genome sequencing. Nature Plants. 2021;7:1571–1578. doi: 10.1038/s41477-021-01031-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012; 40 Database issue:D1178–D1186. doi: 10.1093/nar/gkr944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bolser DM, Staines DM, Perry E, Kersey PJ. Ensembl Plants: Integrating Tools for Visualizing, Mining, and Analyzing Plant Genomic Data. Methods Mol Biol. 2017;1533:1–31. doi: 10.1007/978-1-4939-6658-5_1. [DOI] [PubMed] [Google Scholar]
- 4.Conte MG, Gaillard S, Lanau N, Rouard M, Périn C. GreenPhy1DB: a database for plant comparative genomics. Nucleic Acids Res. 2008;36:D991–D998. doi: 10.1093/nar/gkm934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ware DH, Jaiswal P, Ni J, Yap IV, Pan X, Clark KY, et al. Gramene, a tool for grass genomics. Plant Physiol. 2002;130:1606–1613. doi: 10.1104/pp.015248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Fernandez-Pozo N, Menda N, Edwards JD, Saha S, Tecle IY, Strickler SR, et al. The Sol Genomics Network (SGN)—from genotype to phenotype to breeding. Nucleic Acids Res. 2014;43:D1036–D1041. doi: 10.1093/nar/gku1195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Dash S, Campbell JD, Cannon EKS, Cleary AM, Huang W, Kalberer SR, et al. Legume information system (Legumeinfo.org): a key component of a set of federated data resources for the legume family. Nucleic Acids Res. 2016;44:D1181–D1188. doi: 10.1093/nar/gkv1159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
- 9.Ghelfi A, Shirasawa K, Hirakawa H, Isobe S. Hayai-Annotation Plants: an ultra-fast and comprehensive functional gene annotation system in plants. Bioinformatics. 2019;35:4427–4429. doi: 10.1093/bioinformatics/btz380. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 10.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Asamizu E, Ichihara H, Nakaya A, Nakamura Y, Hirakawa H, Ishii T, et al. Plant Genome DataBase Japan (PGDBj): a portal website for the integration of plant genome-related databases. Plant Cell Physiol. 2014;55:e8. doi: 10.1093/pcp/pct189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Shirasawa K, Isobe S, Tabata S, Hirakawa H. Kazusa Marker DataBase: a database for genomics, genetics, and molecular breeding in plants. Breed Sci. 2014;64:264–271. doi: 10.1270/jsbbs.64.264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Buels R, Yao E, Diesh CM, Hayes RD, Munoz-Torres M, Helt G, et al. JBrowse: a dynamic web platform for genome visualization and analysis. Genome Biol. 2016;17:66. doi: 10.1186/s13059-016-0924-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kumagai M, Nishikawa D, Kawahara Y, Wakimoto H, Itoh R, Tabei N, et al. TASUKE+: a web-based platform for exploring genome-wide association studies results and large-scale resequencing data. DNA Res. 2019;26:445–452. doi: 10.1093/dnares/dsz022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Leal-Bertioli SCM, José ACVF, Alves-Freitas DMT, Moretzsohn MC, Guimarães PM, Nielen S, et al. Identification of candidate genome regions controlling disease resistance in Arachis. BMC Plant Biol. 2009;9:112. doi: 10.1186/1471-2229-9-112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Islam R, Brown S, Taheri A, Dumenyo CK. The Gene Encoding NAD-Dependent Epimerase/Dehydratase, wcaG, Affects Cell Surface Properties, Virulence, and Extracellular Enzyme Production in the Soft Rot Phytopathogen, Pectobacterium carotovorum. Microorganisms. 2019;7:172. [DOI] [PMC free article] [PubMed]
- 18.Bhadauria V, Vijayan P, Wei Y, Banniza S. Transcriptome analysis reveals a complex interplay between resistance and effector genes during the compatible lentil-Colletotrichum lentis interaction. Sci Rep. 2017;7:1–13. doi: 10.1038/srep42338. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The database and web interface are available at https://plantgarden.jp/.