Skip to main content
Frontiers in Plant Science logoLink to Frontiers in Plant Science
. 2021 Dec 23;12:791219. doi: 10.3389/fpls.2021.791219

Review on the Development and Applications of Medicinal Plant Genomes

Qi-Qing Cheng 1, Yue Ouyang 1, Zi-Yu Tang 1, Chi-Chou Lao 1, Yan-Yu Zhang 1, Chun-Song Cheng 1,2, Hua Zhou 1,3,*
PMCID: PMC8732986  PMID: 35003182

Abstract

With the development of sequencing technology, the research on medicinal plants is no longer limited to the aspects of chemistry, pharmacology, and pharmacodynamics, but reveals them from the genetic level. As the price of next-generation sequencing technology becomes affordable, and the long-read sequencing technology is established, the medicinal plant genomes with large sizes have been sequenced and assembled more easily. Although the review of plant genomes has been reported several times, there is no review giving a systematic and comprehensive introduction about the development and application of medicinal plant genomes that have been reported until now. Here, we provide a historical perspective on the current situation of genomes in medicinal plant biology, highlight the use of the rapidly developing sequencing technologies, and conduct a comprehensive summary on how the genomes apply to solve the practical problems in medicinal plants, like genomics-assisted herb breeding, evolution history revelation, herbal synthetic biology study, and geoherbal research, which are important for effective utilization, rational use and sustainable protection of medicinal plants.

Keywords: medicinal plant, genome, sequencing, long-read sequencing technology, application

Introduction

Medicinal plants, in the simple definition, are plants that can be used for medicinal purposes; in the detailed definition, are plants that have been verified and used for a long time as traditional medicines, have been found to have medicinal value in modern research, or contain medicinal ingredients in them. And they can provide the essential resources for human life, such as drugs, nourishment, condiments, and medicinal oil. They also uncovered and promoted the evolution of nature, animals, and humans. The foundation of all life is the genetic code. Therefore, access to the primary DNA sequence and how genes are encoded within the genome has become a basic resource in biology (Hamilton and Robin Buell, 2012). The genomics study of medicinal plants is to elucidate their molecular mechanism to prevent human diseases, by utilizing the genetic information and regulatory network of the species and the omics technologies, accordingly, to reveal their effect on the human body from the level of the genome. Now the process of genome sequencing in plants lags behind that in microorganisms and animals. Due to the lack of genomic information, there is a lack of communication between medicinal plants and modern life sciences, and the new frontier life science technology is hardly be applied to their research. Over the years, the works of research on medicinal plant medicines mainly focus on chemistry and pharmacology, the studies to uncover the biological nature of medicinal plants need to be strengthened.

Regarding plant genome sequencing methods and strategies, radical changes have taken place in the past 5 years, and medicinal plant genome sequencing is no exception. Previous reviews summarized the status of sequenced plant genomes before 2012 (Hamilton and Robin Buell, 2012), the status of sequenced angiosperm genomes before 2018 (Chen et al., 2018), and the impact of third generation genomic technologies on plant genome assembly (Jiao and Schneeberger, 2017). In addition, there were also Chinese reviews that proposed and introduced the Herb Genome Program (Chen et al., 2010) and 1,000 genome projects of medicinal plants (Chen et al., 2019). As sequencing cost reduces drastically and long-read sequencing technology develops quickly in recent years, it is certain that the genome continues to be improved, while more and more large and complicated medicinal plant genomes are reported. The future of revealing the secret of medicinal plant biology is bright. However, there is still not a review covering the medicinal plant genomes that have been released so far and introducing the development of sequencing strategies and applications.

In this manuscript, we conducted a systematic review of medicinal plants genome research. Moreover, the genome situation, sequencing technology development, and application of medicinal plant genomes are discussed. This review provides a historical perspective on the current situation of genomes in medicinal plant biology and highlights the use of rapidly developing sequencing technologies in plant biology. Challenges in genomics for medicinal plants are improved to some extent by long-read sequencing technologies regarding the current limitations. Multiple omics methods are integrated to make better use of medicinal plant genome data and to solve practical problems meeting in the breeding and medical fields. We also conduct a comprehensive summary of the application of medicinal plant genomes, to promote the studies of important questions in plant biology, like genomics-assisted herb breeding, herbal synthetic biology, and geoherbal research, which are significant for securing the future of medicinal plants and their active compounds.

Literature Search Methods and Results

The systematic literature search was performed by the following PRISMA guidelines (Moher et al., 2009). Firstly, it was performed through electronic databases, including PubMed (National Library of Medicine, United States), EMBASE (Elsevier, Netherlands), and Web of Science (Clarivate Analytics, United States) databases published until June 4, 2021. Studies were selected using the term “medicinal plant genome.” Additionally, we also searched the studies from the plaBiPD (Forschungszentrum Jülich GmbH, Germany) database and identified the medicinal plants from all the plants which have reported genomes. About the medicinal plant genomes, a total of 5,064 articles were identified initially by retrieving the electronic database comprehensively. Among these, 1,678 articles were from PubMed, 1,982 articles were from EMBASE, and 1,404 articles were from Web of Science, 173 articles were from the plaBiPD database, 831 articles were excluded for duplicates. A total of 4,189 articles were excluded by scanning the titles and the abstracts, and the exclusion reasons included irrelevant articles, not studies, and so on. Fifty-nine articles were excluded by reading the full-text manuscripts, with the exclusion reasons of reviews, not for medicinal plants and not for whole-genome sequencing and no mention of medicinal related content. Finally, a total of 158 articles were included in this meta-analysis. A flowchart of articles search and selection is shown in Figure 1. According to our statistical result, there were at least 161 reference genomes reported in 158 articles belonging to 126 medicinal plants published. We counted the number of journals that have published medicinal plant genomes, there were a totally of 40 journals, and the corresponding journal name and article number are provided in Supplementary Table 1. Since 2010, articles about medicinal plant genomes have appeared in journals almost every year. Since 2017, the number of medicinal plant genome articles has increased significantly.

FIGURE 1.

FIGURE 1

A flowchart of literature search and selection for a systematic review about medicinal plant genomes.

General Introduction of Medicinal Plant Genomes Research

History and General Characteristics of Medicinal Plant Genome Research

The medicinal plant genomes are more complex than animal genomes, so the process of sequencing the medicinal plant genomes has been hindered, and it entered a period of rapid development from 2016. This may be due to the decline in sequencing price and the development of long-read sequencing technologies. The status of medicinal plant genomes articles reported each year is shown in Figure 2A. In 2020, the number of published medicinal plant genomes has reached up to 53. In 2021, 33 medicinal plant genome articles have been published until June 4th, and the total article number is inferred to be more than 60. As more and more medicinal plant genomes have been revealed, several plants have been sequenced twice or multiple times for genomes. Among these repeatedly sequenced medicinal plant genomes, some are because of sequencing at the same time, some are due to improved level and quality, and some are genomes of different varieties from the same species. Among those 53 medicinal plant genomes reported in 2020, 18 genomes were reported repeatedly, accounting for 34%. This tells us that sequencing technology is continuously developing and progressing, bringing us to a completer and more accurate genome. Take Panax notoginseng (Chinese name: Sanqi) as an example, five versions of its genomes have been reported, the first two versions published in 2017 were sequenced by the next-generation sequencing (NGS) technology of Illumina platform (Chen et al., 2017; Zhang D. et al., 2017), and the recent three versions published in 2020 and 2021 were sequenced by the third-generation sequencing technologies of Pacific Biosciences (PacBio) and Oxford Nanopore (ONT) (Fan G. et al., 2020; Jiang et al., 2021; Yang et al., 2021b). The latest two versions of the genome were assembled to the chromosome level, the length of the assembled sequences was hundreds of times longer than the first two versions, and the accuracy and credibility of annotation have also been greatly improved. The statistical results of detailed information about medicinal plant genomes were shown in Table 1.

FIGURE 2.

FIGURE 2

Publication history (A) and general information (B) of medicinal plant genomes. (A) The total number and repeated sequencing number of medicinal plant genomes are increasing year by year, proving that it has received more and more scientific research attention. (B) The figure shows published medicinal genome assemblies analyzed for genome-wide repeat levels and genome size. The repetitiveness of most medicinal plant genomes is generally high and correlated to genome size. The sequenced medicinal plants are divided into five groups based on phylogeny, including lycopodiophyta, gymnosperms, eudicots, monocots, and magnoliids, and eudicot accounts for the majority of them.

TABLE 1.

The statistical results of medicinal plant genome published journals.

# Name Platform Class Esti-size Assem-size Repeat Contig N50 Scaffold N50 Gene References
Mb Mb % kb kb #
1 Acer truncatum I, P eudi 739 633 61.8 773.2 kb 46.36 Mb 28438 Ma Q. et al., 2020
2 Akebia trifoliatasubsp.australis I, P eudi 670 682 71.2 6.2 Mb 43.11 Mb 25598 Huang et al., 2021
3 Allium sativum I, P, O mono 16900 16243 91.3 194 kb 1691 Mb 57561 Sun et al., 2020
4 Aloe vera I, O mono 16040 12930 78.7 NA 14.6 kb 86177 Jaiswal et al., 2021
5–1 Andrographis paniculata I, P eudi 280 269 53.3 388 kb NA 25428 Sun et al., 2019
5–2 Andrographis paniculata I, P eudi 310 284 57.4 5.14 Mb NA 24015 Liang et al., 2020
6–1 Apium graveolens I eudi 3180 2210 68.9 13.1 kb 35.6 kb 34277 Li M.-Y. et al., 2020
6–2 Apium graveolens I, P eudi 3470 3330 87.1 790.6 kb 289.78 Mb 31326 Song et al., 2021
7–1 Aquilaria sinensis I, O eudi 773 727 59.1 1.1 Mb 88.78 Mb 29203 Ding et al., 2020
7–2 Aquilaria sinensis I eudi 784 784 61.2 NA 87.6 Mb 35965 Nong et al., 2020
8 Aquilegia oxysepala var. kansuensis I, P eudi 312 293 45.7 2.2 Mb 40.9 Mb 25571 Xie et al., 2020
9 Artemisia annua I, P, R eudi 1740 1740 61.6 18.95 kb 104.86 kb 63226 Shen et al., 2018
10 Asparagus setaceus I, O mono 720 710 64.4 1.36 Mb 2.19 Mb 28410 Li S.-F. et al., 2020
11–1 Averrhoa carambola I, O eudi 358 335 61.3 4.22 Mb 31.25 Mb 25419 Wu S. et al., 2020
11–2 Averrhoa carambola I eudi 475 471 68.2 44.84 kb 2.76 Mb 24726 Fan Y. et al., 2020
12 Azadirachta indica I eudi 364 364 13.0 740 bp 452 kb 20169 Krishnan et al., 2012
13 Betula platyphylla I, P eudi 430 430 43.0 751 kb NA 31253 Chen S. et al., 2020
14 Brassica oleracea I, R, Sa eudi 630 540 38.8 26.8 kb 1.46 Mb 45758 Liu S. et al., 2014
15 Broussonetia papyrifera I, P eudi 380 387 49.2 171.2 kb 29.48 Mb 30512 Peng et al., 2019
16 Calotropis gigantea I eudi 225 157 28.3 48.6 kb 806.0 kb 18197 Hoopes et al., 2018
17 Camellia sinensis I eudi 3000 3020 80.9 20.0 kb 449.5 kb 36951 Xia et al., 2017
18 Camptotheca acuminata I eudi 503 403 35.6 108 kb 1752 kb 31825 Zhao et al., 2017
19–1 Cannabis sativa I, R eudi 820 534 NA NA 16.2 kb 30000 vanBakel et al., 2011
19–2 Cannabis sativa I, P eudi 843 808 74.8 513.6 kb 83 Mb 38828 Gao et al., 2020
20–1 Capsicum annuum I eudi 3260 3349 80.9 55.4 kb 1226.8 kb 35336 Qin et al., 2014
20–2 Capsicum annuum I eudi 3480 3060 76.4 30 kb 2.47 Mb 34903 Kim S. et al., 2014
21 Carthamus tinctorius I, P eudi 1170 1060 60.1 21.23 Mb 88.21 Mb 33343 Wu et al., 2021
22 Catharanthus roseus I eudi 738 523 NA NA 26.2 kb 33829 Kellner et al., 2015
23 Centella asiatica I eudi 430 430 56.4 NA 15.7 Mb 25226 Pootakham et al., 2021a
24 Cerasus humilis I, P eudi 228 223 43.1 1.45 Mb 26.23 Mb 26821 Wang et al., 2020
25 Chimonanthus praecox I, P magno 779 695 47.5 2.19 Mb 65.35 Mb 23591 Shang et al., 2020
26 Chimonanthus salicifolius I, P magno 836 820 57.7 2.3 Mb NA 36651 Lv Q. et al., 2020
27 Chiococca alba I eudi 567 558 NA NA 2.35 Mb 28707 Lau et al., 2020
28 Chrysanthemum nankingense I, O eudi 3240 2530 69.6 130.7 kb NA 56870 Song et al., 2018
29 Cinnamomum kanehirae I, P magno 824 731 48.0 0.9 Mb 50.4 Mb 27899 Chaw et al., 2019
30 Citrus medica I eudi 407 405 43.8 46.5 kb 367 kb 32579 Wang et al., 2017
31 Citrus reticulata I eudi 334 334 50.1 24.7 kb 1.7 Mb 28820 Wang L. et al., 2018
32 Coix aquatica I, P mono 1680 1620 75.4 2.24 Mb NA 39629 Guo et al., 2020
33–1 Coix lacryma-jobi I, P mono 1800 1730 77.7 3.19 Mb 13.98 Mb 44485 Liu H. et al., 2020
33–2 Coix lacryma-jobi I, P mono 1560 1280 77.0 NA 594.3 kb 39574 Kang et al., 2020b
34 Colocasia esculenta I, P, O mono 2390 2405 88.4 400 kb 159.4 Mb 28695 Yin et al., 2021
35–1 Coptis chinensis I, P eudi 1047 958 62.2 1.58 Mb 4.53 Mb 34109 Chen D. et al., 2021
35–2 Coptis chinensis I, O eudi 1150 937 62.5 806.6 kb NA 41004 Liu et al., 2021
36 Coriandrum sativum I, P eudi 2130 2119 80.6 604.1 kb 160.99 Mb 40747 Song X. et al., 2020
37 Cuscuta australis P, I eudi 273 265 58.0 3.63 Mb 5.95 Mb 19671 Sun et al., 2018
38 Dalbergia odorifera I, P eudi 653 638 54.2 5.92 Mb 56.16 Mb 30310 Hong et al., 2020
39 Datura stramonium I, O eudi 2000 2100 61.0 13.1 kb 164.1 kb 52149 Rajewski et al., 2021
40 Daucus carota I eudi 473 422 46.0 31.2 kb 12.7 Mb 32113 Iorizzo et al., 2016
41 Dendrobium catenatum I eudi 1110 1010 78.1 33.1 kb 391 kb 28910 Zhang G. Q. et al., 2016
42–1 Dendrobium officinale I, P mono 1270 1350 63.3 25.1 kb 76.4 kb 35567 Yan et al., 2015
42–2 Dendrobium officinale I, P mono 1210 1230 64.4 1.44 Mb 63.07 Mb 27631 Niu et al., 2021
43 Dimocarpus longan I eudi 445 472 52.9 26.0 kb 566.6 kb 31007 Lin et al., 2017
44 Dioscorea zingiberensis I mono 851 800 42.8 1.08 kb 1.96 kb 27057 Zhou et al., 2018
45 Dracaena cambodiana I mono 1120 1064 53.5 1.87 kb 3.19 kb 53700 Ding et al., 2018
46 Eleutherococcus senticosus I, P eudi 1260 1300 73.6 309.4 kb 50.79 Mb 36372 Yang et al., 2021a
47–1 Erigeron breviscapus I, P eudi 1520 1200 54.6 18.8 kb 31.5 kb 37504 Yang et al., 2017
47–2 Erigeron breviscapus I, P eudi 1520 1430 67.4 140.95 kb 156.82 Mb 43514 He et al., 2021
48 Eriobotrya japonica I, P eudi 803 761 85.9 3.98 Mb 43.16 Mb 43996 Su et al., 2021
49–1 Eucommia ulmoides I, P eudi 1100 1180 61.2 17.06 kb 1.03 Mb 26723 Wuyun et al., 2018
49–2 Eucommia ulmoides P eudi 1020 948 62.5 13.16 Mb 53.15 Mb 26001 Li Y. et al., 2020
50 Fagopyrum tataricum I, P eudi 489 489 51.0 550.7 kb NA 33366 Zhang L. et al., 2017
51 Forsythia suspensa I, O eudi 701 737 54.5 7.3 Mb 7.3 Mb 33062 Li L.-F. et al., 2020
52 Gardenia jasminoides I, O eudi 551 535 62.2 1.0 Mb 44 Mb 35967 Xu et al., 2020b
53–1 Gastrodia elata I mono 1180 1061 66.2 68.9 kb 4.9 Mb 18969 Yuan Y. et al., 2018
53–2 Gastrodia elata I mono 1378 1120 69.8 110 kb 1.64 Mb 24484 Chen S. et al., 2020
54 Gelsemium elegans I, O eudi 338 335 43.2 10.23 Mb 40.47 Mb 26768 Liu Y. et al., 2020
55 Gelsemium sempervirens I eudi 219 244 NA NA 411 kb 22617 Franke et al., 2019
56 Ginkgo biloba I gymno 11750 10610 76.6 48.2 kb 1.36 Mb 41840 Guan et al., 2016
57 Glycyrrhiza uralensis I, P eudi 401 379 36.5 7.3 kb 109.3 kb 34445 Mochida et al., 2017
58 Hemerocallis citrina I, P mono 3800 3770 78.9 2.09 Mb NA 54295 Qing et al., 2021
59 Hypericum perforatum I, P eudi 400 373 46.9 1.41 Mb 2.31 Mb 29150 Zhou et al., 2021
60 Isatis indigotica I, P eudi 305 294 53.3 1.18 Mb 36.17 Mb 30323 Kang et al., 2020a
61 Jacaranda mimosifolia I, O eudi 739 707 56.8 16.77 Mb 39.98 Mb 30507 Wang M. et al., 2021
62–1 Juglans regia I, P eudi 606 667 51.2 46.1 kb 465.0 kb 32498 Martínez-García et al., 2016
62–2 Juglans regia O eudi 620 574 58.4 1.1 Mb 37 Mb 37554 Marrano et al., 2020
63 Lagenaria siceraria I eudi 334 313 46.9 28.3 kb 8.7 Mb 18534 Wu et al., 2017
64 Lavandula angustifolia I, P eudi 1095 895 58.3 1.22 Mb 36.2 Mb 65905 Li et al., 2021
65 Lepidium meyenii I eudi 751 743 47.7 81.8 kb 2.4 Mb 96417 Zhang J. et al., 2016
66 Linum usitatissimum I eudi 373 302 50.0 20.1 kb 693.5 kb 43384 Wang et al., 2012
67 Lithospermum erythrorhizon I, O eudi 369 367 51.8 314.3 kb NA 27720 Auber et al., 2020
68 Litsea cubeba I, P magno 1370 1326 55.5 607.3 1760.0 31329 Chen Y.-C. et al., 2020
69 Lonicera japonica I, O eudi 887 843 58.2 2.1 Mb 84.4 Mb 33939 Pu et al., 2020
70 Luffa acutangula I, P eudi 760 735 62.2 NA 786.1 kb 32233 Pootakham et al., 2021b
71–1 Luffa cylindrica I, P eudi 737 669 62.2 5 Mb 53 Mb 31661 Zhang et al., 2020
71–2 Luffa cylindrica I, P eudi 720 656 63.8 8.8 Mb 48.76 Mb 25508 Wu H. et al., 2020
71–3 Luffa cylindrica I, P eudi 773 690 56.8 NA 578.6 kb 43828 Pootakham et al., 2021b
72 Macleaya cordata I eudi 541 378 43.5 25.0 kb 308.0 kb 22328 Liu et al., 2017
73 Magnolia biondii I, P magno 2240 2220 66.5 269.1 kb 92.86 Mb 47547 Dong et al., 2021
74–1 Medicago sativa/autotetraploid I, P, O eudi 3150 2738 55.0 459.0 kb NA 164632 Chen H. et al., 2020
74–2 Medicago sativaZhongmu No.1 /haploid I, P eudi 800 816 57.0 3.92 Mb NA 49165 Shen et al., 2020
74–3 Medicago sativaspp. caerulea/diploid I, O eudi 802 793 55.6 3.86 Mb NA 47202 Li A. et al., 2020
75 Mentha longifolia I, P eudi 400 353 NA 4.5 kb NA 35597 Vining et al., 2017
76 Mesua ferrea I eudi 685 614 NA 251.7 kb 392.8 kb 46540 Patil et al., 2021
77 Mitragyna speciosa I eudi 1123 1123 44.2 70.4 kb 1.02 Mb 55746 Brose et al., 2021
78–1 Momordica charantia I eudi 339 286 15.3 NA 1.1 Mb 45859 Urasaki et al., 2017
78–2 Momordica charantia I, P eudi 303 303 52.5 9.9 Mb 25.37 Mb NA Matsumura et al., 2020
79 Morinda officinalis I, O eudi 485 485 58.0 4.2 Mb 40.97 Mb 27102 Wang J. et al., 2021
80 Moringa oleifera I eudi 278 217 40.6 45.3 kb 957.2 kb 18451 Chang et al., 2019
81 Morus notabilis I eudi 357 331 47.0 34.5 kb 390.1 kb 29338 He et al., 2013
82 Myrica rubra I eudi 313 290 45.6 68.6 kb 2164.2 kb 26325 Ren et al., 2019
83–1 Nelumbo nucifera I, R eudi 929 804 57.0 38.8 kb 3.4 Mb 26685 Ming et al., 2013
83–2 Nelumbo nucifera I eudi 879 792 49.5 39.3 kb 986.5 kb 36385 Wang et al., 2013
84–1 Ocimum basilicum I eudi 2360 2068 61.6 48.3 kb 1.5 Mb 78990 Bornowski et al., 2020
84–2 Ocimum basilicum I eudi 2320 2130 76.0 45.7 kb 19.3 Mb 62067 Gonda et al., 2020
85 Ocimum tenuiflorum I eudi 612 374 42.9 2.6 kb 27.1 kb 36768 Upadhyay et al., 2015
86 Ophiorrhiza pumila I, P eudi 440 440 58.2 18.49 Mb 40.06 Mb 32389 Rai et al., 2021
87 Osmanthus fragrans I, P eudi 741 727 49.4 1.59 Mb NA 45542 Yang et al., 2018
88 Paeonia suffruticosa I, P eudi 15760 13790 80.2 49.9 kb NA 34854 Lv S. et al., 2020
89–1 Panax ginseng I eudi 3500 3430 62.2 22.0 kb 108.7 kb 42006 Xu et al., 2017
89–2 Panax ginseng I eudi 3600 2980 79.5 22.5 kb 569.0 kb 59352 Kim et al., 2018
90–1 Panax notoginseng I eudi 2310 2390 75.9 16.0 kb 96.0 kb 36790 Chen et al., 2017
90–2 Panax notoginseng I eudi 2002 1850 61.3 13.2 kb 158.0 kb 34369 Zhang D. et al., 2017
90–3 Panax notoginseng O, P eudi 2310 2240 79.1 220.9 kb NA 39452 Fan G. et al., 2020
90–4 Panax notoginseng I, P eudi 2380 2660 85.9 1.12 Mb 216.47 Mb 37606 Jiang et al., 2021
90–5 Panax notoginseng I, P eudi 2310 2410 88.2 1.45 Mb 196.33 Mb 47870 Yang et al., 2021b
91–1 Papaver somniferum I, O eudi 2870 2720 70.9 1.77 Mb 2.04 Mb 51213 Guo et al., 2018
91–2 Papaver somniferum I eudi 3370 2620 65.8 86.0 kb 6.86 Mb 79668 Pei et al., 2021
92–1 Passiflora edulis I, O eudi 1396 1341 NA 3.1 Mb NA 23171 Xia et al., 2021
92–2 Passiflora edulis I, P eudi 1410 1280 86.3 70.2 kb 126.4 Mb 39309 Ma et al., 2021
93 Phytolacca americana I eudi 1260 930 NA 35.2 kb 42.5 kb 29773 Neller et al., 2019
94 Piper nigrum I, P magno 762 761 54.9 NA 29.8 Mb 63466 Hu et al., 2019
95 Platycodon grandiflorus I eudi 683 680 36.2 15 kb 277.1 kb 40017 Kim et al., 2020
96 Pogostemon cablin /diploid I eudi 1576 1150 58.6 0.4 kb 1.1 kb 45020 He et al., 2016
97 Pogostemon cablin /octaploid I eudi 2380 1916 43.7 34.7 kb 699.0 kb 110850 He et al., 2018
98 Polygonum cuspidatum I eudi 2600 2560 71.5 2.8 kb 3.2 kb 55075 Zhang Y. et al., 2019
99 Poncirus trifoliata I, P eudi 265 265 42.6 842.8 kb 27.7 Mb 25538 Peng et al., 2020
100–1 Punica granatum I eudi 357 328 46.1 67.0 kb 1.89 Mb 29229 Qin et al., 2017
100–2 Punica granatum I eudi 336 274 51.2 97.0 kb 1.7 Mb 30903 Yuan Z. et al., 2018
100–3 Punica granatum I, P eudi 313 320 50.9 4.49 Mb 39.96 Mb 33594 Luo et al., 2020
101 Raphanus sativus I eudi 529 402 26.7 NA 46.3 kb 61572 Kitashiba et al., 2014
102 Rhodiola crenulata I eudi 420 345 66.2 25.4 kb 144.7 kb 31517 Fu et al., 2017
103 Ricinus communis Sa eudi 320 351 50.3 21.1 kb 496.5 kb 31237 Chan et al., 2010
104–1 Rosa chinensis I, P eudi 560 560 67.9 NA 24 Mb 36377 Raymond et al., 2018
104–2 Rosa chinensis I, P eudi 568 512 63.2 3.4 Mb NA 39669 Hibrand Saint-Oyant et al., 2018
105 Rosa roxburghii I eudi 481 409 47.6 1.5 kb 3.6 kb 22721 Lu et al., 2016
106 Rosmarinus officinalis I eudi 1180 1014 54.7 21.8 kb 368.7 kb 51389 Bornowski et al., 2020
107 Salvia bowleyana I, P eudi 462 462 58.7 1.18 Mb 57.96 Mb 44044 Zheng et al., 2021
108–1 Salvia miltiorrhiza I, P, R eudi 615 538 54.4 12.4 kb 51.0 kb 30478 Xu et al., 2016
108–2 Salvia miltiorrhiza I, P eudi 572 595 64.8 2.7 Mb 69.8 Mb 32483 Song Z. et al., 2020
109 Santalum album I eudi 203 221 27.4 460.7 kb 460.7 kb 38119 Mahesh et al., 2018
110–1 Scutellaria baicalensis I, P eudi 408 387 55.2 880.6 kb 1.34 Mb 28524 Zhao Q. et al., 2019
110–2 Scutellaria baicalensis I, O eudi 442 377 55.2 2.1 Mb NA 33414 Xu et al., 2020a
111 Scutellaria barbata I, P eudi 405 353 53.5 2.5 Mb NA 41697 Xu et al., 2020a
112 Selaginella tamariscina I, P lycopo 301 301 60.6 201.2 kb 407.7 kb 27761 Xu et al., 2018
113 Senna tora I, P eudi 547 526 53.9 4.03 Mb 41.7 Mb 45268 Kang et al., 2020c
114–1 Sesamum indicum I eudi 357 274 28.5 52.2 kb 2.1 Mb 27148 Wang et al., 2014
114–2 Sesamum indicum P eudi 337 292 NA 1.06 Mb 20.5 Mb 28406 Li C. et al., 2020
115 Sinapis alba I eudi 553 459 NA 1.7 kb NA 34012 Kumari et al., 2020
116–1 Siraitia grosvenorii I eudi 420 470 NA 34.2 kb 101.1 kb 43856 Itkin et al., 2016
116–2 Siraitia grosvenorii I, P eudi 420 470 51.1 432.4 kb NA 30565 Xia et al., 2018
117 Spatholobus suberectus I, P eudi 793 798 47.8 2.1 Mb 86.99 Mb 31634 Qin et al., 2019
118 Stevia rebaudiana I, O eudi 1160 1416 80.1 616.9 kb 106.55 Mb 44143 Xu et al., 2021
119 Taxus wallichiana I, O gymno 10600 10900 85.0 8.6 Mb 987 Mb 44008 Cheng et al., 2021
120 Toona sinensis I, O eudi 559 596 64.6 1.5 Mb 21.5 Mb 34345 Ji et al., 2021
121 Trichopus zeylanicus I, P mono 860 714 47.4 289.5 kb 430.0 kb 34452 Chellappan et al., 2019
122 Trichosanthes anguina I, O eudi 1030 920 80.0 20.11 Mb 82.12 Mb 22874 Ma L. et al., 2020
123 Tripterygium wilfordii P eudi 366 348 52.4 4.36 Mb 13.52 Mb 28321 Tu et al., 2020
124–1 Vernicia fordii I eudi 1200 1176 58.7 NA 474.9 kb 46829 Cui et al., 2018
124–2 Vernicia fordii I, P eudi 1310 1120 73.3 NA 87.15 Mb 28422 Zhang L. et al., 2019
125 Xanthoceras sorbifolium I, P eudi 442 440 56.4 642.3 kb 29.43 Mb 21059 Liang et al., 2019
126 Ziziphus jujuba I eudi 444 438 46.8 34.0 kb 301.0 kb 32808 Liu M. J. et al., 2014

#, number; Esti-size, estimated genome size; Assem-size, assembled genome size; mono, monocots; eudi, eudicots; magno, magnoliids; lycopo, lycopodiophyta; gymno, gymnosperm; Sa, Sanger; R, Roche/454; I, Illumina; P, PacBio; O, Oxford Nanopore; NA, not reported.

Research, Protection, and Utilization of Geoherbal Resources

With the widespread application of NGS technology, genome sequencing of medicinal plants has become more feasible due to the greatly reduced cost and time required to complete the project. According to the whole genome sequence, the basic information of biology and biomedical functions can be well understood.

We have made statistics on the medicinal plant genome articles over the years, and have a basic understanding of the general characteristics of the reported medicinal plant genomes. The comparison of size and repetitiveness ratio of these published medicinal plant genomes and their evolution relationship is shown in Figure 2B. Among them, the genomes of five medicinal plants are much larger than other medicinal plants, they are Allium sativum, Paeonia suffruticosa, Aloe vera, Taxus wallichiana, and Ginkgo biloba. In the plants whose genomes have been sequenced, there are 123 angiosperms (including 12 monocots, 105 eudicots, and 6 magnoliids), two gymnosperms, and one lycopodiophyta plant. The simplified phylogeny of the major clades of sequenced medicinal plants is also shown in Figure 2B. Angiosperms account for the vast majority of sequenced medicinal plants, and eudicots make up the majority of angiosperms. Genome size has a positive correlation with the ratio of repetitive elements, when the genome size is larger, the proportion of repetitive elements also tends to be correspondingly larger. Most of the genome size is concentrated within 4 Gb, and the repetitiveness ratio sequences are concentrated between 30 and 90%.

It has been said that plant genome reports were formulaic and lack biology significance, their descriptions mainly included the assembly, protein-coding genes, repeats, evolution analysis, some aspects of biology, usually with a focus on transcription factors and active compounds biosynthesis pathway (Michael and Jackson, 2013). According to these published medicinal plant genomes, most of them have not yet been used to solve specific application problems, such as discovering new medicinal mechanisms, cultivating new resistant varieties, explaining evolutionary events, and so on. But the assembly of the genomes provides us with the guarantee of the database. Once we need the support of genetic information, the genome is the solid foundation and reference.

Implications and Hallmark of Medicinal Plant Genome

Medicinal plants are the main sources of medicine, and their records for medicinal usage can be traced back to almost 5,000 years ago in China, India, and Egypt (Moss and Yuan, 2006; Jamshidi-Kia et al., 2018). They are also the precious resource libraries for many chemical drugs, currently, more than one-third of clinical medications are derived from plant extracts or their derivatives (Chen and Song, 2016). The sequencing and demystification of the genome can give us a better understanding of the biosynthesis and regulation of bioactive compounds. Artemisinin-derived plant named Artemisia annua is one of the most famous medicinal plants, while the discovery of artemisinin has won the 2015 Nobel Prize in Physiology or Medicine (Su and Miller, 2015). A semi-synthetic system has been used to improve the production of artemisinin greatly (Paddon et al., 2013). Further revealing the genome of A. annua provides a comprehensive understanding of artemisinin biosynthesis and leads to improvement in artemisinin production. Before A. annua genome revelation, studies manipulating artemisinin biosynthesis focused on either upstream (Nafis et al., 2011) or downstream (Yuan et al., 2015) genes on the artemisinin biosynthesis pathway. Then the combined study and analysis of A. annua genomic and associated transcriptomic data proposed other efficient strategies to increase the production of artemisinin, one was to simultaneously enhance the expression of enzyme genes in different steps in the biosynthesis pathway including the upstream (HMGR), midstream (FPS) and downstream (DBR2), and the other was to overexpress the expression of transcription factors like AaMYB2 that could regulate the expression of ADS, CYP71AV1, DBR2, and ALDH1 in artemisinin biosynthesis pathway, which could significantly improve artemisinin and dihydroartemisinic acid content, providing a new insight for increasing the supply of artemisinin from plant sources (Shen et al., 2018).

In addition to improving the content of active compounds, it is also necessary to ensure the agronomic traits and enhance the resistance ability to stresses of medicinal plants. Genome sequencing can help identify the genes associated with agronomic and disease resistance traits, and can target control of the genes to cultivate new varieties of medicinal plants with highly effective ingredients, excellent agronomic features, and high resistance abilities. P. notoginseng, a well-known medicinal plant, is susceptible to a wide range of pathogens, so its cultivation faces several challenges (Ou et al., 2011). The sequencing of the P. notoginseng chromosome level genome, combining a genome-wide association study on 240 cultivated individuals, successfully identified 63 genes associated with dry root weight (included genes encoding cysteine/histidine-rich C1 domain proteins), 168 genes associated with stem thickness (included APC6, WRKY71, and RWA3, etc.) and 33 genes associated with disease resistance trait (included genes encoding LRR receptor-like serine/threonine-protein kinases) (Fan G. et al., 2020). These valuable resources of P. notoginseng can provide new opportunities to harness the full potential of its economic and medicinal values.

Moreover, some medicinal plants also play an important role in evolution, and the discovery of their genomes can help to understand the evolutionary relationship of plants. Ginkgo biloba is a living fossil without living relatives, which represents one of the four extant gymnosperm lineages (cycads, ginkgo, conifers, and gnetophytes). Its genome showed that LTR-RT insertions and two whole-genome duplications (WGD) events in evolution history contribute to the large genome size and long introns. In angiosperms, chromosomal breakages and fusions, as well as uneven gene loss, might occur to prevent a continuous growth in genome size (Schnable et al., 2009), and this mechanism for removing transposable elements (TEs) might lack and lead to enormous genome size in gymnosperms like ginkgo. The outstanding defense ability of ginkgo resulted from the remarkable duplication of resistance genes and enrichment of relevant pathways. The ginkgo genome sheds light on sequencing large plant genomes and helps to know the genetic and evolutionary process of land plants in natural evolution (Guan et al., 2016).

Quality and Integrity Improvement of Medicinal Plant Genomes

The quality of genome assembly directly affects the quality of the whole genome. Contig N50 and scaffold N50 are the primary indicators for evaluating genome assembly results. Generally, the longer the contig N50 and scaffold N50 are, the better the assembly result is. As shown in Table 1, in 2017 and before, most of the reported medicinal plant genomes used the NGS technologies, such as Illumina and Roche/454, and the length of contig N50 ranged from a few kilobases to dozens of kilobases. In 2018, half of the published genomes used a combination of next- and third-generation sequencing technologies, such as Illumina + PacBio and Illumina + Oxford Nanopore. In 2019 and beyond, the sequencing strategy of combining next- and third-generation has been applied to the majority of the reported genomes. It can be seen from Figure 3 that the length of contig N50 became long since 2018, and then increased year by year. By 2020, the length has been greatly improved, the length of contig N50 was generally increased to the range between a few hundred kilobases and several megabases. The length of contig N50 was similar in the medicinal plant genomes published in 2020 and 2021. And the longest length was as long as 21.23 Mb (Cheng et al., 2021). It shows that the popularization and application of third-generation sequencing have brought convenience to scientific research, and at the same time have greatly improved the quality and integrity of the genome.

FIGURE 3.

FIGURE 3

Distribution of contig N50 length in published medicinal plant genomes. Before 2016 represents the published years before 2016, and 2021.06 represents the time between 2021 January and June.

Sequencing Strategy Development

The development process of sequencing strategy on medicinal plant genomes has experienced three stages, germination stage, development stage, and expansion stage (Figure 4).

FIGURE 4.

FIGURE 4

The development process of medicinal plant genome sequencing.

Germination Stage of Medicinal Plant Genome Sequencing

The start of genomics is from the early 1990s, and automated sequencing methods using dideoxy chain termination with fluorescent molecules developed, which is known as Sanger sequencing. The effectiveness of the Sanger platform for large eukaryotic genomes was first reported in 2000 for Drosophila melanogaster, ushering in a new era of genomics (Adams et al., 2000). This method was also applied in plant biology, like sequencing ESTs in Arabidopsis thaliana (Newman et al., 1994), and then sequencing the whole genome of various plants, like Oryza sativa (Yu et al., 2002), Populus trichocarpa (Tuskan et al., 2006), Carica papaya (Ming et al., 2008) and Brachypodium distachyon (International Brachypodium Initiative, 2010). However, there are still gaps and errors in the assembly of these genomes, so they are not completely “finished,” because the process of “finishing” needs inspection and experimental resolution of inconsistencies, and it’s a time-consuming, tough, and expensive work (Hamilton and Robin Buell, 2012). In the germination stage of the development process about medicinal plant genome sequencing, considering this and cost, the Sanger sequencing method is only used to sequence the genome of major economic crops that are also regarded as medicinal plants, like Ricinus communis, to provide references and templates for subsequent research.

The Development Stage of Medicinal Plant Genome Sequencing

After 2011, the NGS technology develops rapidly and occupies the position of the mainstream sequencing platform, becoming the preferred technology for sequencing the medicinal plant genomes. The widely and mainly used NGS platforms are Roche 454 platform and Illumina platform.

Roche 454 platform is the first commercially successful NGS system. This sequencing method uses a high-throughput pyrosequencing technology (Margulies et al., 2005). This platform utilizes emulsion PCR to detect the pyrophosphate released during nucleotide incorporation. In 2005, the read length of Roche 454 was only 100–150 bp with 20 Mb output data per run (Mardis, 2008). In 2008, the 454 GS FLX Titanium system appeared, with a reading length up to 700 bp and 0.7 G output data per run within 24 h. In late 2009, Roche simplified the library preparation and data processing and improved the output to 14 G per run (Liu et al., 2012). In 2012, the platform upgraded to the FLX+ and could generate 1 million reads, with a reading length up to 1,000 bp.

Illumina platform is a high-throughput technology of sequencing by synthesis using reversible dye terminators developed by Solexa and then purchased by Illumina in 2008 (Bentley et al., 2008). The mechanism of the Illumina platform is bridge PCR, which is different from the Roche/454 platform. The library DNA with fixed adaptors is denatured to single strands and linked on the flow cell, followed by bridge amplification to synthesize clusters of clonal DNA fragments. The library splices into single strands by linearization enzyme (Mardis, 2008), and then four kinds of fluorescently labeled nucleotides which have been modified with a terminator complement the template one base at a time, the signal is captured, then the terminator and fluorescent dye are cleaved, and a new round of synthesis repeats until coming up to the desired read length. In late 2011, the paired-end mode of the Hi-Seq2000 Illumina platform could generate more than 250 million reads total sequences of one lane.

Because the throughput of Hi-Seq 2000 is higher, the price is lower, and the application range is wider than Roche/454, the application of the Illumina platform in the medicinal plant genome sequencing occupies the mainstream position. The Illumina platform is widely applied for expression profiling, de novo sequencing, and re-sequencing in plant sequencing, like Thellungiella parvula (Dassanayake et al., 2011) and Arabidopsis thaliana (Cao et al., 2011). As more and more medicinal plant genomes have been reported, the medicinal plant genome sequencing has begun to enter the development stage, many large size medicinal plant genomes were successfully sequenced. However, another difficulty of plant genomes is the high repetition in the genome, so it is difficult to accurately assemble them by the NGS technologies.

Expansion Stage of Medicinal Plant Genome Sequencing

The development of third-generation sequencing has overcome this problem. The most widely applied long-read sequencing platform is Single-Molecule Real-Time (SMRT) sequencing of Pacific Biosciences company. SMRT sequencing is run on cells, which have tiny wells called zero-mode waveguides (ZMWs). In each ZMWs, a DNA polymerase/template complex gets immobilized, and synthesizes a new DNA strand (Jiao and Schneeberger, 2017). Each incorporation generates a light pulse that can be recognized for differently labeled nucleotides (Eid et al., 2009). PacBio systems can sequence reads with an average size of about 20 kb and a maximum length of over 60 kb (Kim K. E. et al., 2014; Vanburen et al., 2015). Although the sequencing error rate of raw reads is up to 15%, self-correction by adequate coverage sequencing data (Chin et al., 2013) or correction with NGS data (Bashir et al., 2012; Koren et al., 2012) enables genome assemblies with the accuracy of over 99.999% simply by running bioinformatics analysis software (Chin et al., 2016). Besides the PacBio SMRT platform, there is also another long-read sequencing platform introduced by ONT Technologies, which provided access to their first sequencing system in 2014 (Quick et al., 2014; Deamer et al., 2016). Single DNA molecules are run through nanopores, and individual nucleotides create characteristic disruptions in them, which reveal the sequence of the nucleotides. The reads length and sequencing accuracy are similar with PacBio reads, and the longest reads can reach up to 200 kb. First, whole-genome assemblies using ONT data have reached N50 values of multiple hundred kb for fungal genomes, and bacterial genomes could be fully assembled with a nucleotide accuracy of over 99% (Goodwin et al., 2015; Loman et al., 2015).

The emergence of third-generation sequencing technology has made a great leap in sequencing read length and brought medicinal plant genome sequencing into a stage of rapid development. The strategy used in this stage is a combination of second- and third-generation sequencing technologies, which can ensure long read length, high throughput, and reasonable sequencing price at the same time. Medicinal plant genomes are large and have high-ratio repetitive elements, the frequently-used strategy is combining high coverage Illumina and low coverage PacBio SMRT or ONT data. Because third-generation sequencing can provide long-read sequences to increase the assembly accuracy and genome draft quality, but the price is relatively high, so Illumina platform is used to guarantee enough sequencing data. And this can make it possible to assemble large and complex medicinal plant genomes to the chromosome level. After these years of sequencing development, the medicinal plants not only can obtain draft genome relevant information and dig out target protein-coding genes, but also recognize the chromosome-level of the genome to discover the evolution, gene cluster’s function, repetitive elements effect, and so on.

Genomes of Species Have Been Repeatedly Sequenced

We found that not only does the number of medicinal plant genomes sequenced continue to increase, but the number of medicinal plant genomes sequenced repeatedly is also increasing. Why? First of all, because the genomes of many medicinal plants have not been revealed yet, many teams are performing de novo sequencing of the genomes at the same time, and accordingly publish them at the same time. Then, with the continuous development of gene sequencing technology, we can obtain longer sequencing read lengths, so as to assemble more complete and accurate high-level genomes. Genomes assembled to the chromosome level are the current trend. The information that the genome gives us is no longer a contig or scaffold, but the chromosome and the position of a gene on the chromosome.

There are 25 medicinal plants with two reported genomes, three medicinal plants with three reported genomes, and one plant with five reported genomes. Representative medicinal plants include Momordica charantia (bitter gourd), Salvia miltiorrhiza (Danshen), Punica granatum (pomegranate), Panax notoginseng (Sanqi), Panax ginseng (Asian ginseng), etc. Bitter gourd and danshen have two reported versions of the genome. Bitter gourd completed the de novo assembly of the genome draft in 2017, as well as basic annotation and evolutionary analysis (Urasaki et al., 2017). In 2020, using PacBio long-read sequencing technology, the Momordica charantia genome was assembled to the chromosome level, and further investigate the genomic changes under domestication (Matsumura et al., 2020). The genome of Salvia miltiorrhiza was also assembled to eight chromosomes, the assembled genome size increased from 538 to 594.75 Mb, and the proportion of repetitive elements also increased from 54.44 to 64.84% (Xu et al., 2016; Song Z. et al., 2020). Punica granatum (pomegranate), which is a popular and nutritious fruit with medicinal properties, has three published genome versions (Qin et al., 2017; Yuan Z. et al., 2018; Luo et al., 2020). The third version of the genome is assembled to the chromosome level, and it is a high-quality genome map of the soft-seed pomegranate, which helps to clarify the genetic divergence between soft- and hard-seeded varieties and provides insights into the genetic diversity and population structure of pomegranates (Luo et al., 2020). Panax notoginseng (Sanqi) is a well-known TCM whose genome research is sought after by scientists, and a total of five versions have been reported. The three recent versions are assembled to the chromosome level (Fan G. et al., 2020; Jiang et al., 2021; Yang et al., 2021b), which are more complete than the previously available genome assemblies (Chen et al., 2017; Zhang D. et al., 2017), further reveal the biosynthesis pathways of ginsenosides and dencichine, as well as provide a resource for further exploration of the saponin biosynthesis, cultivation, and breeding of P. notoginseng. Panax ginseng (Asian ginseng), reputed as the king of medicinal herbs, belongs to the same genus Panax, which also has two versions of reported genomes (Xu et al., 2017; Kim et al., 2018). Both of these two genomes provide a comprehensive understanding for functional and evolutionary analysis as well as ginsenoside biosynthesis. Additionally, Kim et al. (2018) identified fatty acid desaturases that can increase freezing tolerance and chlorophyll a/b binding protein genes which enable efficient photosynthesis under low light. However, the read length of both genomes is not long enough by the current standards, and there is still space for further improvement in the integrity and accuracy of the ginseng genome.

Application of Medicinal Plant Genomes

Genomics-Assisted Herb Breeding

The genes related to medicinal plant growth and development, disease resistance, important genetic traits, and germplasm characters which are the important functional genes in medicinal plants, taking advantage of genome annotation information, discovering good genes, using genetic engineering methods to break the reproductive isolation, and cultivating the new species with excellent agronomic characters and high content of active ingredients, so that it can lay the foundation for the large-amount extraction of active ingredients and extensive clinical application. By combing transcriptome and resequencing of individual species within or between species, the large-scale molecular markers can be identified rapidly and accurately, and genetic linkage study of molecular markers and qualified characters can also be accelerated, the phenotypes of medicinal plants and the relationship of physical characteristics and genotypes are discovered quickly so that efficiency of breeding are improved obviously.

The study of Scutellaria baicalensis (Huangqin) genome sequencing revealed that a specialized metabolic pathway for the synthesis of 40-deoxyflavonebioactives evolved in the genus Scutellaria and found that the gene encoding a specific cinnamate coenzyme A ligase likely obtained its new function following recent mutations and that four genes encoding enzymes in the 40-deoxyflavone pathway are present as tandem repeats in the genome of Huangqin. Further analysis discovered that gene duplications, segmental duplication, gene amplification, and point mutations coupled to gene neo- and subfunctionalizations were involved in the evolution of 40-deoxyflavone synthesis in Scutellaria. These results not only provide significant insight into the evolution of specific flavone biosynthetic pathways in the mint family Lamiaceae but also facilitate the development of tools for enhancing bioactive productivity by molecular breeding in plants (Zhao Q. et al., 2019).

Evolution History Revealing

Whole-genome sequencing cannot only elucidate the biosynthesis pathways of natural products but also give insight into their evolution. The evolution will bring the whole genome change, like WGD and whole-genome triplication (WGT), to adapt to the environment alteration and explain the characters of plants. We summarized the WGD and WGT events of some representative species reported in the medicinal plant genome articles, and these situations are shown in Figure 5. These WGD and WGT events are summarized and introduced into three types of plants, which are eudicots, monocots, and magnoliids.

FIGURE 5.

FIGURE 5

The whole genome duplication (WGD) and whole genome triplication (WGT) events in representative medicinal plants.

In the eudicots part, we select five representative branches to demonstrate the situation. The representative medicinal plants of Araliaceae and Apiaceae are clustered together. P. ginseng, P. notoginseng, and E. senticosus belong to Araliaceae, and P. notoginseng is diploid, while P. ginseng and E. senticosus are tetraploid. Two rounds of WGD were discovered in these Araliaceae plants, the first round occurred around 29.6 Mya, P. ginseng, and E. senticosus both had the second round of WGD, which were found almost 2.2 Mya in P. ginseng and 13 Mya in E. senticosus, respectively. Additionally, these recent WGDs were discovered to contribute to the ability of P. ginseng to overwinter and E. senticosus to adapt to cold environment, enabling them to live and spread broadly through the cold area (Kim et al., 2018; Jiang et al., 2021; Yang et al., 2021a). These two rounds of WGD occurred in the family Araliaceae after divergence with the Apiaceae, which may be one of the reasons why its genome was bigger than other medicinal plants. In the D. carota and A. graveolens that belonged to Apiaceae, one shared WGD occurred in about 43 Mya, and one recent WGD only existed in A. graveolens in approximately 1.9 Mya, and this duplication contributed to the expansion of terpene synthase gene families (Song X. et al., 2020). The second branch in the eudicots part includes six plants belonging to Lamiales, one shared WGD (almost 60.7 Mya) was identified in S. baicalensis, S. barbata, S. miltiorrhiza, and S. indicum, which might be responsible for chromosomal expansion and rearrangement (Xu et al., 2020a), and two rounds of WGD were found in S. splendens and L. angustifolia, which could result in the gene families expansion related to terpenoid biosynthesis (Li et al., 2021). In P. cuspidatum, it experienced current lineage-specific WGD at 6.6 Mya after the divergence with F. tataricum from the ancestor, and it shared the ancient and common WGD with F. tataricum at 65 Mya (Zhang Y. et al., 2019), after this WGD, the genome of F. tataricum experienced dramatic chromosomal rearrangements, resulting in very fragmented intra-genome collinear blocks (Zhang L. et al., 2017). There is also a WGT event identified and reported in the medicinal plant genome articles. T. wilfordii was found to have a WGT event in approximately 21 Mya, which enabled it to cope better with and adapt to the markedly changed environment, and the duplication of the triptolide biosynthesis genes were almost generated by this WGT event, suggesting this WGT event was important to the evolution of triptolide biosynthesis (Tu et al., 2020).

In the monocots part, A. sativum and H. citrina are the representatives. A. sativum has undergone two rounds of WGD, suggesting WGD can be the important driving force of the proliferation of TEs and genome expansion in garlic (Sun et al., 2020). Otherwise, H. citrina experienced a recent WGD event at about 15.73 Mya, which was the main factor resulting in multiple copies of the orthologous genes (Qing et al., 2021). In the magnoliids part, C. salicifolius and P. nigrum are the representatives. Two rounds of ancient WGD were inferred in the C. salicifolious genome, one was shared by Calycanthaceae at ∼87 Mya after its divergence with Lauraceae, and the other was dating back to approximately 142 Mya in the ancestry of Magnoliales and Laurales (Lv Q. et al., 2020). Meanwhile, the P. nigrum genome was speculated to have a WGD event at ∼17.9 Mya, which brought genetic changes that were responsible for the particular biosynthesis of piperine (Hu et al., 2019).

Domestication Process Understanding

Domestication is a complex evolutionary process, which is one of the most important technological innovations in human history, humans use plants to change their morphology and physiology traits, distinguishing them from wild ancestors, and ultimately giving rise to the current human cultures (Diamond, 2002; Hancock, 2005). Some of the domesticated plants are medicinal plants. The timing and geographical origins of domesticated traits, as well as the genes that lead to changes in traits, can be sent to find clues from genomic information (Purugganan and Fuller, 2009).

Coix is a widely cultivated grass crop with high nutritional and medicinal value, which has been domesticated as early as the Neolithic era. However, its genetic research and breeding were hampered by the lack of a sequenced genome. Two chromosome-level genomes of coix have been reported simultaneously, which belong to elite cultivar Beijing (Liu H. et al., 2020) and wild relative Coix aquatica Daheishan (Guo et al., 2020), respectively. They both find that hull thickness is an important domestication trait between the wild relatives and cultivars, and selection of papery hull from the stony hull in wild progenitors was a key step in coix domestication. Combining resequencing analysis and comparative analysis, several domesticated loci or genes (like loci in the ∼2 to 150 kb region upstream of ub3) and two major quantitative trait loci associated with hull thickness and color (Ccph1 and Ccph2), were discovered to be the potential identification loci for domestication. These findings will greatly facilitate and benefit the molecular breeding of coix and provide a powerful reference for the domestication and evolution of medicinal plants.

Herbal Synthetic Biology

The active components of medicinal plants with complex and diverse structures are the material basis for their medicinal effect, and it’s also an important source of new drug discovery. However, many medicinal plant materials often face a series of problems in the process of development and utilization, for example, the growth of many medicinal materials is greatly affected by environmental factors; some rare herbs grow slowly and are difficult to grow by artificial cultivation; most of the active ingredients are low in content, complex in chemical structure and difficult in chemical synthesis; traditional methods of natural extraction or artificial chemical synthesis cannot meet the needs of scientific research and new drug development. Synthetic biology will be an effective way to resolve these problems.

As high-throughput sequencing technology for genome and transcriptome studies have developed rapidly, using bioinformatics method and functional genomics approach to screen and identify enzyme-coding genes on specific secondary biosynthesis pathway from a large number of the original species of medicinal plants, which will greatly accelerate the analysis process of secondary biosynthesis pathway and lay a solid foundation for herbal synthetic biology research of medicinal plants.

Tripterygium wilfordii genome is one of the typical examples. Because of the extremely low yield of triptolide extracted from T. wilfordii, its original plant cannot be grown on a large scale, and the current chemical synthesis route is limited to a yield of less than 1.64%. A more promising method to obtain more triptolide could be metabolic engineering, which can be realized via a synthetic biology strategy. However, it required elucidation of the triptolide biosynthesis pathway. Therefore, the sequencing of the T. wilfordii genome was completed, and cytochrome P450 TwCYP728B70 involved in triptolide biosynthesis was identified, accordingly, the triptolide content in the CYP728B70 overexpression line increased obviously (Tu et al., 2020). It’s important to make full use of genomic resources to reveal the biosynthesis pathways of active compounds in medicinal plants and use candidate genes in these pathways for the heterologous bioproduction under synthetic biology strategy.

Geoherbal Research, Protection, and Utilization of Resources

Geoherbs, controlled by genetic factors and affected by environmental conditions, are representative of high-quality medicinal materials. The utilization of sequencing technology and data can provide useful tools to elucidate the molecular mechanism of geoherbs. For the same medicinal plants in different areas, epigenomic studies of medicinal plants can be carried out to clarify the genetic variation of different production areas, especially the modification effect of different environments on the epigenome of medicinal material, including DNA methylation modification, small RNA sequencing analysis, chromatin immunoprecipitation analysis, and so on. In addition, microorganisms in soil are also important factors in the growth environment of geoherbs. Metagenomic analysis of soil microbial community can be sequenced to provide the basis for revealing the interaction between soil microorganisms and the growth of medicinal plants.

Recently, 545 genomes of ginkgo trees sampled from 51 populations across the world were sequenced to identify three refugia in China and detect multiple cycles of population expansion and reduction along with glacial admixture between relict populations in the southwestern and southern refugia, and multiple anthropogenic introductions of ginkgo were proved to occur from eastern China into different continents. This study provides insight into the evolutionary history of ginkgo and helps to provide protection and utilization way for its valuable genomic resources (Zhao Y. P. et al., 2019).

Improving the Synthesis Efficiency of Bioactive Compounds Within Species

Because of the rapid development and progress of sequencing technology, more and more biosynthesis pathways of active ingredients from medicinal plants have been revealed. The early-stage was based on the mining from transcriptome data, and the later stage was based on the combined mining from genome and transcriptome data. Although transcriptome sequencing has so far occupied a major position in the research of biosynthesis pathways of medicinal ingredients, genome data can provide more important information, for example, it can reveal the evolution process of biosynthesis pathway genes, thereby efficiently synthesizing secondary metabolites with medicinal activity. In the opium poppy genome, a great discovery about a gene cluster including 15 genes was reported. Meanwhile, in its evolution process, the events like gene duplication, rearrangement, and fusion, could lead to the aggregation and co-expression of genes in the two metabolic pathways of noscapine and morphinan, so that it resulted in the formation of this supergene cluster, which could synergistically synthesize the medicinal ingredients in opium poppy (Guo et al., 2018). Therefore, the opium poppy genome helps to decipher the mystery of the synthesis of secondary metabolites. It is not only beneficial to the development of molecular plant breeding tools and cultivating new varieties, but also has great guiding significance for the selective improvement of the production of alkaloids with different efficacy in future artificial synthesis.

It also provides new ideas for the application of medicinal plant genomes. Through the evolution process, gene duplications and neofunctionalization can generate gene clusters, which may relate to specialized metabolites, and this phenomenon has already been observed in several model plants, like A. thaliana, Zea mays, and Solanum lycopersicum (Bharadwaj et al., 2021). In medicinal plants, we can refer to the research strategy of the opium poppy (Guo et al., 2018), which can help us understand the formation process of gene clusters related to medicinal active ingredients and improve their biosynthesis efficiency.

Comparative Genomic Analysis Among Different Species or Different Populations in the Same Species

The continuous emergence of high-quality genomes has made the application of comparative genomics analysis more and more extensive and in-depth, and it is also a powerful tool for researchers to dig out biological problems and explain biological phenomena (Nobrega and Pennacchio, 2004). Comparative genomics, based on genome mapping and sequencing technology, are generally referred to as comparative analysis of the structural and functional gene regions of the genomes among multiple species or multiple individuals (populations) from one species. Specifically, it is to compare the similarities and differences in the structural characteristics, study the contraction and expansion of gene families, discover the differentiation time and evolution relationship, analyze the generation and evolution of new genes, etc.

One representative example of comparative genomics among different medicinal plant genome species can be Scutellaria baicalensis and Scutellaria barbata. The comparative genomic analysis of them showed the recent LTR may result in chromosomal rearrangement and expansion, and tandem duplication of paralogs after their speciation might contribute to the divergent evolution of flavonoid biosynthesis gene families, which provided a significant foundation for the evolution and chemodiversity studies in the Lamiaceae (Xu et al., 2020a).

Moreover, a representative of comparative genomics among different populations in the same species can be Forsythia suspense. Genome-wide comparative analysis was then conducted for the 15 natural populations across its current distribution range. The results revealed that candidate genes associated with local adaptation were functionally correlated with heterogeneous environmental factors, and supported the hypothesis that adaptive differentiation should be highly obvious in the genes of signal crosstalk between different environmental variables, which gave insights into the fundamental genetic mechanisms of the local adaptation to climatic gradients in plant species (Li L.-F. et al., 2020).

Outlook and Challenges of Medicinal Plant Genome Sequencing

The use of medicinal plants has a long history and diverse application methods. Related works of research mainly focus on the discovery of chemical basis and the analysis of pharmacodynamic effect, but the understanding of medicinal plant genetic resources is relatively weak. Therefore, the research on the genome of medicinal plants should make use of the latest technologies and achievements of genomics, and integrate the studies of structural genome, functional genome, transcriptome, proteome, epigenome, metagenome, synthetic biology, metabolome, bioinformatics, and other relevant databases. Therefore, the essence of medicinal plants can be revealed, the relationship among genetic resources, chemical quality, and drug efficacy can be recognized.

We are most concerned about the medicinal value of medicinal plants. The medicinal value is not only reflected in the content of their medicinal ingredients, but also the stability of the quality of their medicinal materials. Now medicinal plant genomes can be annotated to obtain protein-coding genes, especially biosynthesis genes of active ingredients, analyze their evolutionary history and domestication process, and discover genes that respond to environmental stresses to help improve their resistance and ability. However, the powerful ability of the medicinal plant group has not yet been manifested, and its ability to solve the difficulties in practical applications remains to be developed. How to use the information of the medicinal plant genome to transform and obtain excellent medicinal plant varieties has not yet been realized. Determining suitable model medicinal plants is of great significance to the research on the practical application of medicinal plant genomes. The determination of appropriate model medicinal plants is of great significance to the study of the genomics of medicinal plants. From the perspective of general biological characteristics, it usually should have the traits of a short age cycle, many offspring, and stable phenotype. As for genetic resources, the genome should be relatively small, easy to sequence, and genetic transformation is relatively easy. As for medicinal characteristics, it should be suitable for secondary metabolite biosynthesis and production research. Therefore, the establishment and improvement of a suitable model medicinal plant platform will greatly enhance the application value of medicinal plant genomes.

The assembly of plant genomes is a challenging problem because of their high repetitiveness due to TEs, extreme genome sizes, and polyploid nature. With the development and emergence of long-read sequencing (Eid et al., 2009; Deamer et al., 2016) and long-range scaffolding methods such as optical mapping (Schwartz et al., 1993), chromosome conformation capture (Burton et al., 2013), and DNA dilution-based technologies (Amini et al., 2014; Zheng et al., 2016), the medicinal plant genome sequencing overcomes weaknesses of short-read assemblies and becomes possible to assemble to the chromosome-level (Jiao and Schneeberger, 2017). Although there have been medicinal plants that enable the assembly of entire chromosomes, most medicinal plants just still obtained long scaffolds or super-scaffolds. And now we have got a large amount of sequencing data from medicinal plants, how to effectively explore and apply them to dig deeper information is still facing problems and challenges.

Moreover, thanks to the advancement and development of sequencing technology and bioinformatics algorithms, at least one hundred medicinal plant genomes have been obtained. How to use them thoroughly and effectively has attracted the attention of many institutions and researchers. In recent years, several databases of medicinal plant genomes have already been built, such as the Herbal Medicine Omics Database1 (Wang X. et al., 2018), 1K Medicinal Plant Genome Database,2 and Database of 10,000 Medicinal Plants.3 These databases summarize the medicinal plant genomes that have been reported at this stage or aim to build a biological big data platform for medicinal plants, linking the omics data, active ingredients, disease information, and other information to promote their modernization. All of the above indicate that the medicinal plant genome has entered the stage of big data association research from the stage of exploring the unknown. Moreover, because of the limitations of previous technologies and methods, the disclosed medicinal plant genome information is limited. If the obtained medicinal plant genome information is aggregated and shared through the database, this should be a huge treasure to be unearthed, which will prompt the research efficiency of medicinal plants.

Conclusion

Thanks to the invention of the long-read sequencing technology, the research on medicinal plant genomes has developed rapidly and is no longer limited by their huge genome size and high repetitive sequences. The number of genomes reported in the past 2 years has increased significantly, and the quality of genomes has also been greatly improved, most of which have been assembled to the chromosome level. Correspondingly, the sequencing strategy they adopted has also been continuously updated, making them more and more widely used, answering and solving many problems in scientific researches and practical applications, including herb breeding assistance, evolutionary history revealing, domestication process understanding, herb synthetic biology study, geoherbal research and comparative genome analysis, these are of great significance to the effective use and sustainable protection of medicinal plants, which can improve their research efficiency and promote their modern development.

Author Contributions

Q-QC planned the manuscript outline, wrote the draft, and created the figures and tables. YO, Z-YT, C-CL, Y-YZ, and C-SC proofread the manuscript. HZ supervised the study and revised the manuscript. All authors contributed to the article and approved the submitted version.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

Funding

This project was funded by the Science and Technology Development Fund, Macao SAR (Project Nos. 0001/2020/AKP, 0061/2019/AGJ, 0027/2017/AMJ, and 062/2017/A2) and the National Key Research and Development Program of China (Project No. 2017YFE0119900).

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2021.791219/full#supplementary-material

References

  1. Adams M. D., Celniker S. E., Holt R. A., Evans C. A., Gocayne J. D., Amanatides P. G., et al. (2000). The genome sequence of Drosophila melanogaster. Science 287 2185–2195. 10.1126/science.287.5461.2185 [DOI] [PubMed] [Google Scholar]
  2. Amini S., Pushkarev D., Christiansen L., Kostem E., Royce T., Turk C., et al. (2014). Haplotype-resolved whole-genome sequencing by contiguity-preserving transposition and combinatorial indexing. Nat. Genet. 46 1343–1349. 10.1038/ng.3119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Auber R. P., Suttiyut T., McCoy R. M., Ghaste M., Crook J. W., Pendleton A. L., et al. (2020). Hybrid de novo genome assembly of red gromwell (Lithospermum erythrorhizon) reveals evolutionary insight into shikonin biosynthesis. Hortic. Res. 7:82. 10.1038/s41438-020-0301-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bashir A., Klammer A. A., Robins W. P., Chin C., Webster D., Paxinos E., et al. (2012). A hybrid approach for the automated finishing of bacterial genomes. Nat. Biotechnol. 30 701–707. 10.1038/nbt.2288 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bentley D. R., Balasubramanian S., Swerdlow H. P., Smith G. P., Milton J., Brown C. G., et al. (2008). Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456 53–59. 10.1038/nature07517 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bharadwaj R., Kumar S. R., Sharma A., Sathishkumar R. (2021). Plant metabolic gene clusters: evolution, organization, and their applications in synthetic biology. Front. Plant Sci. 12:697318. 10.3389/fpls.2021.697318 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bornowski N., Hamilton J. P., Liao P., Wood J. C., Dudareva N., Buell C. R. (2020). Genome sequencing of four culinary herbs reveals terpenoid genes underlying chemodiversity in the Nepetoideae. DNA Res. 27 1–12. 10.1093/dnares/dsaa016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Brose J., Lau K. H., Dang T. T. T., Hamilton J. P., do Vale Martins L., Hamberger B., et al. (2021). The Mitragyna speciosa (Kratom) Genome: a resource for data-mining potent pharmaceuticals that impact human health. G3 (Bethesda) 11:jkab058. 10.1093/g3journal/jkab058 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Burton J. N., Adey A., Patwardhan R. P., Qiu R., Kitzman J. O., Shendure J. (2013). Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nat. Biotechnol. 31 1119–1125. 10.1038/nbt.2727 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cao J., Schneeberger K., Ossowski S., Günther T., Bender S., Fitz J., et al. (2011). Whole-genome sequencing of multiple Arabidopsis thaliana populations. Nat. Genet. 43 956–963. 10.1038/ng.911 [DOI] [PubMed] [Google Scholar]
  11. Chan A. P., Crabtree J., Zhao Q., Lorenzi H., Orvis J., Puiu D., et al. (2010). Draft genome sequence of the oilseed species Ricinus communis. Nat. Biotechnol. 28 951–956. 10.1038/nbt.1674 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chang Y., Liu H., Liu M., Liao X., Sahu S. K., Fu Y., et al. (2019). The draft genomes of five agriculturally important African orphan crops. Gigascience 8 1–16. 10.1093/gigascience/giy152 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chaw S.-M., Liu Y.-C., Wu Y.-W., Wang H.-Y., Lin C.-Y. I., Wu C.-S., et al. (2019). Stout camphor tree genome fills gaps in understanding of flowering plant genome evolution. Nat. Plants 5 63–73. 10.1038/s41477-018-0337-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Chellappan B. V., Shidhi P. R., Vijayan S., Rajan V. S., Sasi A., Nair A. S., et al. (2019). High quality draft genome of arogyapacha (Trichopus zeylanicus), an important medicinal plant endemic to Western Ghats of India. G3 9 2395–2404. 10.1534/g3.119.400164 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chen D., Pan Y., Wang Y., Cui Y.-Z., Zhang Y.-J., Mo R., et al. (2021). The chromosome-level reference genome of Coptis chinensis provides insights into genomic evolution and berberine biosynthesis. Hortic. Res. 8:121. 10.1038/s41438-021-00559-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Chen F., Dong W., Zhang J., Guo X., Chen J., Wang Z., et al. (2018). The sequenced angiosperm genomes and genome databases. Front. Plant Sci. 9:418. 10.3389/fpls.2018.00418 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Chen H., Zeng Y., Yang Y., Huang L., Tang B., Zhang H., et al. (2020). Allele-aware chromosome-level genome assembly and efficient transgene-free genome editing for the autotetraploid cultivated alfalfa. Nat. Commun. 11:2494. 10.1038/s41467-020-16338-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Chen S. L., Song J.-Y. (2016). [Herbgenomics]. Zhongguo Zhong Yao Za Zhi 41 3881–3889. 10.4268/cjcmm20162101 [DOI] [PubMed] [Google Scholar]
  19. Chen S. L., Sun Y. Z., Xu J., Luo H. M., Sun C., He L., et al. (2010). Strategies of the study on Herb genome program. Yaoxue Xuebao 45 807–812. [PubMed] [Google Scholar]
  20. Chen S. L., Wu W.-G., Wang C.-X., Xiang L., Shi Y.-H., Zhang D., et al. (2019). [Molecular genetics research of medicinal plants]. Zhongguo Zhong Yao Za Zhi 44 2421–2432. 10.19540/j.cnki.cjcmm.20190514.102 [DOI] [PubMed] [Google Scholar]
  21. Chen S., Wang X., Wang Y., Zhang G., Song W., Dong X., et al. (2020). Improved de novo assembly of the achlorophyllous orchid Gastrodia elata. Front. Genet. 11:580568. 10.3389/fgene.2020.580568 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Chen S., Wang Y., Yu L., Zheng T., Wang S., Yue Z., et al. (2021). Genome sequence and evolution of Betula platyphylla. Hortic. Res. 8:37. 10.1038/s41438-021-00481-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Chen W., Kui L., Zhang G., Zhu S., Zhang J., Wang X., et al. (2017). Whole-Genome sequencing and analysis of the Chinese herbal plant Panax notoginseng. Mol. Plant 10 899–902. 10.1016/j.molp.2017.02.010 [DOI] [PubMed] [Google Scholar]
  24. Chen Y.-C., Li Z., Zhao Y.-X., Gao M., Wang J.-Y., Liu K.-W., et al. (2020). The Litsea genome and the evolution of the laurel family. Nat. Commun. 11:1675. 10.1038/s41467-020-15493-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Cheng J., Wang X., Liu X., Zhu X., Li Z., Chu H., et al. (2021). Chromosome-level genome of Himalayan yew provides insights into the origin and evolution of the paclitaxel biosynthetic pathway. Mol. Plant 14 1199–1209. 10.1016/j.molp.2021.04.015 [DOI] [PubMed] [Google Scholar]
  26. Chin C. S., Alexander D. H., Marks P., Klammer A. A., Drake J., Heiner C., et al. (2013). Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 10 563–569. 10.1038/nmeth.2474 [DOI] [PubMed] [Google Scholar]
  27. Chin C.-S., Peluso P., Sedlazeck F. J., Nattestad M., Concepcion G. T., Clum A., et al. (2016). Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods 13 1050–1054. 10.1038/nmeth.4035 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Cui P., Lin Q., Fang D., Zhang L., Li R., Cheng J., et al. (2018). Tung Tree (Vernicia fordii, Hemsl.) Genome and transcriptome sequencing reveals co-ordinate up-regulation of fatty acid β-oxidation and triacylglycerol biosynthesis pathways during Eleostearic acid accumulation in seeds. Plant Cell Physiol. 59 1990–2003. 10.1093/pcp/pcy117 [DOI] [PubMed] [Google Scholar]
  29. Dassanayake M., Oh D.-H., Haas J. S., Hernandez A., Hong H., Ali S., et al. (2011). The genome of the extremophile crucifer Thellungiella parvula. Nat. Genet. 43 913–918. 10.1038/ng.889 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Deamer D., Akeson M., Branton D. (2016). Three decades of nanopore sequencing. Nat. Biotechnol. 34 518–524. 10.1038/nbt.3423 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Diamond J. (2002). Evolution, consequences and future of plant and animal domestication. Nature 418 700–707. 10.1038/nature01019 [DOI] [PubMed] [Google Scholar]
  32. Ding X., Mei W., Huang S., Wang H., Zhu J., Hu W., et al. (2018). Genome survey sequencing for the characterization of genetic background of Dracaena cambodiana and its defense response during dragon’s blood formation. PLoS One 13:e0209258. 10.1371/journal.pone.0209258 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Ding X., Mei W., Lin Q., Wang H., Wang J., Peng S., et al. (2020). Genome sequence of the agarwood tree Aquilaria sinensis (Lour.) Spreng: the first chromosome-level draft genome in the Thymelaeceae family. Gigascience 9:giaa013. 10.1093/gigascience/giaa013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Dong S., Liu M., Liu Y., Chen F., Yang T., Chen L., et al. (2021). The genome of Magnolia biondii Pamp. provides insights into the evolution of Magnoliales and biosynthesis of terpenoids. Hortic. Res. 8 38. 10.1038/s41438-021-00471-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Eid J., Fehr A., Gray J., Luong K., Lyle J., Otto G., et al. (2009). Real-time DNA sequencing from single polymerase molecules. Science 323 133–138. 10.1126/science.1162986 [DOI] [PubMed] [Google Scholar]
  36. Fan G., Liu X., Sun S., Shi C., Du X., Han K., et al. (2020). The chromosome level genome and genome-wide association study for the agronomic traits of Panax notoginseng. iScience 23 101538. 10.1016/j.isci.2020.101538 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Fan Y., Sahu S. K., Yang T., Mu W., Wei J., Cheng L., et al. (2020). Dissecting the genome of star fruit (Averrhoa carambola L.). Hortic. Res. 7:94. 10.1038/s41438-020-0306-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Franke J., Kim J., Hamilton J. P., Zhao D., Pham G. M., Wiegert-Rininger K., et al. (2019). Gene discovery in gelsemium highlights conserved gene clusters in monoterpene indole alkaloid biosynthesis. ChemBioChem 20 83–87. 10.1002/cbic.201800592 [DOI] [PubMed] [Google Scholar]
  39. Fu Y., Li L., Hao S., Guan R., Fan G., Shi C., et al. (2017). Draft genome sequence of the Tibetan medicinal herb Rhodiola crenulata. Gigascience 6 1–5. 10.1093/gigascience/gix033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Gao S., Wang B., Xie S., Xu X., Zhang J., Pei L., et al. (2020). A high-quality reference genome of wild Cannabis sativa. Hortic. Res. 7:73. 10.1038/s41438-020-0295-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Gonda I., Faigenboim A., Adler C., Milavski R., Karp M.-J., Shachter A., et al. (2020). The genome sequence of tetraploid sweet basil, Ocimum basilicum L., provides tools for advanced genome editing and molecular breeding. DNA Res. 27 1–10. 10.1093/dnares/dsaa027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Goodwin S., Gurtowski J., Ethe-Sayers S., Deshpande P., Schatz M. C., McCombie W. R. (2015). Oxford Nanopore sequencing, hybrid error correction, and de novo assembly of a eukaryotic genome. Genome Res. 25 1750–1756. 10.1101/gr.191395.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Guan R., Zhao Y., Zhang H., Fan G., Liu X., Zhou W., et al. (2016). Draft genome of the living fossil Ginkgo biloba. Gigascience 5:49. 10.1186/s13742-016-0154-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Guo C., Wang Y., Yang A., He J., Xiao C., Lv S., et al. (2020). The coix genome provides insights into Panicoideae evolution and papery hull domestication. Mol. Plant 13 309–320. 10.1016/j.molp.2019.11.008 [DOI] [PubMed] [Google Scholar]
  45. Guo L., Winzer T., Yang X., Li Y., Ning Z., He Z., et al. (2018). The opium poppy genome and morphinan production. Science 362 343–347. 10.1126/science.aat4096 [DOI] [PubMed] [Google Scholar]
  46. Hamilton J. P., Robin Buell C. (2012). Advances in plant genome sequencing. Plant J. 70 177–190. 10.1111/j.1365-313X.2012.04894.x [DOI] [PubMed] [Google Scholar]
  47. Hancock J. F. (2005). Contributions of domesticated plant studies to our understanding of plant evolution. Ann. Bot. 96 953–963. 10.1093/aob/mci259 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. He N., Zhang C., Qi X., Zhao S., Tao Y., Yang G., et al. (2013). Draft genome sequence of the mulberry tree Morus notabilis. Nat. Commun. 4:2445. 10.1038/ncomms3445 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. He S., Dong X., Zhang G., Fan W., Duan S., Shi H., et al. (2021). High quality genome of Erigeron breviscapus provides a reference for herbal plants in Asteraceae. Mol. Ecol. Resour. 21 153–169. 10.1111/1755-0998.13257 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. He Y., Peng F., Deng C., Xiong L., Huang Z., Zhang R., et al. (2018). Building an octaploid genome and transcriptome of the medicinal plant Pogostemon cablin from Lamiales. Sci. Data 5:180274. 10.1038/sdata.2018.274 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. He Y., Xiao H., Deng C., Xiong L., Nie H., Peng C. (2016). Survey of the genome of Pogostemon cablin provides insights into its evolutionary history and sesquiterpenoid biosynthesis. Sci. Rep. 6:26405. 10.1038/srep26405 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Hibrand Saint-Oyant L., Ruttink T., Hamama L., Kirov I., Lakhwani D., Zhou N. N., et al. (2018). A high-quality genome sequence of Rosa chinensis to elucidate ornamental traits. Nat. Plants 4 473–484. 10.1038/s41477-018-0166-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Hong Z., Li J., Liu X., Lian J., Zhang N., Yang Z., et al. (2020). The chromosome-level draft genome of Dalbergia odorifera. Gigascience 9 1–8. 10.1093/gigascience/giaa084 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Hoopes G. M., Hamilton J. P., Kim J., Zhao D., Wiegert-Rininger K., Crisovan E., et al. (2018). Genome assembly and annotation of the medicinal plant Calotropis gigantea, a producer of anticancer and antimalarial cardenolides. G3 Genes Genomes Genetics 8 385–391. 10.1534/g3.117.300331 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Hu L., Xu Z., Wang M., Fan R., Yuan D., Wu B., et al. (2019). The chromosome-scale reference genome of black pepper provides insight into piperine biosynthesis. Nat. Commun. 10:4702. 10.1038/s41467-019-12607-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Huang H., Liang J., Tan Q., Ou L., Li X., Zhong C., et al. (2021). Insights into triterpene synthesis and unsaturated fatty-acid accumulation provided by chromosomal-level genome analysis of Akebia trifoliata subsp. australis. Hortic. Res. 8:33. 10.1038/s41438-020-00458-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. International Brachypodium Initiative (2010). Genome sequencing and analysis of the model grass Brachypodium distachyon. Nature 463 763–768. 10.1038/nature08747 [DOI] [PubMed] [Google Scholar]
  58. Iorizzo M., Ellison S., Senalik D., Zeng P., Satapoomin P., Huang J., et al. (2016). A high-quality carrot genome assembly provides new insights into carotenoid accumulation and asterid genome evolution. Nat. Genet. 48 657–666. 10.1038/ng.3565 [DOI] [PubMed] [Google Scholar]
  59. Itkin M., Davidovich-Rikanati R., Cohen S., Portnoy V., Doron-Faigenboim A., Oren E., et al. (2016). The biosynthetic pathway of the nonsugar, high-intensity sweetener mogroside V from Siraitia grosvenorii. Proc. Natl. Acad. Sci. U.S.A. 113 E7619–E7628. 10.1073/pnas.1604828113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Jaiswal S. K., Mahajan S., Chakraborty A., Kumar S., Sharma V. K. (2021). The genome sequence of Aloe vera reveals adaptive evolution of drought tolerance mechanisms. iScience 24:102079. 10.1016/j.isci.2021.102079 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Jamshidi-Kia F., Lorigooini Z., Amini-Khoei H. (2018). Medicinal plants: past history and future perspective. J. Herbmed Pharmacol. 7 1–7. 10.15171/jhp.2018.01 [DOI] [Google Scholar]
  62. Ji Y., Xiu Z., Chen C., Wang Y., Yang J., Sui J., et al. (2021). Long read sequencing of Toona sinensis (A. Juss) Roem: a chromosome-level reference genome for the family Meliaceae. Mol. Ecol. Resour. 21 1243–1255. 10.1111/1755-0998.13318 [DOI] [PubMed] [Google Scholar]
  63. Jiang Z., Tu L., Yang W., Zhang Y., Hu T., Ma B., et al. (2021). The chromosome-level reference genome assembly for Panax notoginseng and insights into ginsenoside biosynthesis. Plant Commun. 2:100113. 10.1016/j.xplc.2020.100113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Jiao W.-B., Schneeberger K. (2017). The impact of third generation genomic technologies on plant genome assembly. Curr. Opin. Plant Biol. 36 64–70. 10.1016/j.pbi.2017.02.002 [DOI] [PubMed] [Google Scholar]
  65. Kang S.-H., Kim B., Choi B.-S., Lee H. O., Kim N.-H., Lee S. J., et al. (2020b). Genome assembly and annotation of soft-shelled adlay (Coix lacryma-jobi Variety ma-yuen), a cereal and medicinal crop in the poaceae family. Front. Plant Sci. 11:630. 10.3389/fpls.2020.00630 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Kang M., Wu H., Yang Q., Huang L., Hu Q., Ma T., et al. (2020a). A chromosome-scale genome assembly of Isatis indigotica, an important medicinal plant used in traditional Chinese medicine. Hortic. Res. 7:18. 10.1038/s41438-020-0240-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Kang S.-H., Pandey R. P., Lee C.-M., Sim J.-S., Jeong J.-T., Choi B.-S., et al. (2020c). Genome-enabled discovery of anthraquinone biosynthesis in Senna tora. Nat. Commun. 11:5875. 10.1038/s41467-020-19681-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Kellner F., Kim J., Clavijo B. J., Hamilton J. P., Childs K. L., Vaillancourt B., et al. (2015). Genome-guided investigation of plant natural product biosynthesis. Plant J. 82 680–692. 10.1111/tpj.12827 [DOI] [PubMed] [Google Scholar]
  69. Kim J., Kang S.-H., Park S.-G., Yang T.-J., Lee Y., Kim O. T., et al. (2020). Whole-genome, transcriptome, and methylome analyses provide insights into the evolution of platycoside biosynthesis in Platycodon grandiflorus, a medicinal plant. Hortic. Res. 7:112. 10.1038/s41438-020-0329-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Kim K. E., Peluso P., Babayan P., Yeadon P. J., Yu C., Fisher W. W., et al. (2014). Long-read, whole-genome shotgun sequence data for five model organisms. Sci. Data 1 1–10. 10.1038/sdata.2014.45 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Kim N.-H., Jayakodi M., Lee S.-C., Choi B.-S., Jang W., Lee J., et al. (2018). Genome and evolution of the shade-requiring medicinal herb Panax ginseng. Plant Biotechnol. J. 16 1904–1917. 10.1111/pbi.12926 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Kim S., Park M., Yeom S.-I., Kim Y.-M., Lee J. M., Lee H.-A., et al. (2014). Genome sequence of the hot pepper provides insights into the evolution of pungency in Capsicum species. Nat. Genet. 46 270–278. 10.1038/ng.2877 [DOI] [PubMed] [Google Scholar]
  73. Kitashiba H., Li F., Hirakawa H., Kawanabe T., Zou Z., Hasegawa Y., et al. (2014). Draft Sequences of the Radish (Raphanus sativus L.) Genome. DNA Res. 21 481–490. 10.1093/dnares/dsu014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Koren S., Schatz M. C., Walenz B. P., Martin J., Howard J. T., Ganapathy G., et al. (2012). Hybrid error correction and de novo assembly of single-molecule sequencing reads. Nat. Biotechnol. 30 693–700. 10.1038/nbt.2280 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Krishnan N. M., Pattnaik S., Jain P., Gaur P., Choudhary R., Vaidyanathan S., et al. (2012). A draft of the genome and four transcriptomes of a medicinal and pesticidal angiosperm Azadirachta indica. BMC Genomics 13:464. 10.1186/1471-2164-13-464 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Kumari P., Singh K. P., Rai P. K. (2020). Draft genome of multiple resistance donor plant Sinapis alba: an insight into SSRs, annotations and phylogenetics. PLoS One 15:e0231002. 10.1371/journal.pone.0231002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Lau K. H., Bhat W. W., Hamilton J. P., Wood J. C., Vaillancourt B., Wiegert-Rininger K., et al. (2020). Genome assembly of Chiococca alba uncovers key enzymes involved in the biosynthesis of unusual terpenoids. DNA Res. 27 1–12. 10.1093/dnares/dsaa013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Li A., Liu A., Du X., Chen J.-Y., Yin M., Hu H.-Y., et al. (2020). A chromosome-scale genome assembly of a diploid alfalfa, the progenitor of autotetraploid alfalfa. Hortic. Res. 7:194. 10.1038/s41438-020-00417-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Li C., Li X., Liu H., Wang X., Li W., Chen M.-S., et al. (2020). Chromatin architectures are associated with response to dark treatment in the oil crop Sesamum indicum, based on a high-quality genome assembly. Plant Cell Physiol. 61 978–987. 10.1093/pcp/pcaa026 [DOI] [PubMed] [Google Scholar]
  80. Li J., Wang Y., Dong Y., Zhang W., Wang D., Bai H., et al. (2021). The chromosome-based lavender genome provides new insights into Lamiaceae evolution and terpenoid biosynthesis. Hortic. Res. 8:53. 10.1038/s41438-021-00490-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Li L.-F., Cushman S. A., He Y.-X., Li Y. (2020). Genome sequencing and population genomics modeling provide insights into the local adaptation of weeping forsythia. Hortic. Res. 7:130. 10.1038/s41438-020-00352-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Li M.-Y., Feng K., Hou X.-L., Jiang Q., Xu Z.-S., Wang G.-L., et al. (2020). The genome sequence of celery (Apium graveolens L.), an important leaf vegetable crop rich in apigenin in the Apiaceae family. Hortic. Res. 7:9. 10.1038/s41438-019-0235-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Li S.-F., Wang J., Dong R., Zhu H.-W., Lan L.-N., Zhang Y.-L., et al. (2020). Chromosome-level genome assembly, annotation and evolutionary analysis of the ornamental plant Asparagus setaceus. Hortic. Res. 7:48. 10.1038/s41438-020-0271-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Li Y., Wei H., Yang J., Du K., Li J., Zhang Y., et al. (2020). High-quality de novo assembly of the Eucommia ulmoides haploid genome provides new insights into evolution and rubber biosynthesis. Hortic. Res. 7:183. 10.1038/s41438-020-00406-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Liang Q., Li H., Li S., Yuan F., Sun J., Duan Q., et al. (2019). The genome assembly and annotation of yellowhorn (Xanthoceras sorbifolium Bunge). Gigascience 8 1–15. 10.1093/gigascience/giz071 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Liang Y., Chen S., Wei K., Yang Z., Duan S., Du Y., et al. (2020). Chromosome level genome assembly of Andrographis paniculata. Front. Genet. 11:701. 10.3389/fgene.2020.00701 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Lin Y., Min J., Lai R., Wu Z., Chen Y., Yu L., et al. (2017). Genome-wide sequencing of longan (Dimocarpus longan Lour.) provides insights into molecular basis of its polyphenol-rich characteristics. Gigascience 6 1–14. 10.1093/gigascience/gix023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Liu H., Shi J., Cai Z., Huang Y., Lv M., Du H., et al. (2020). Evolution and domestication footprints uncovered from the genomes of Coix. Mol. Plant 13 295–308. 10.1016/j.molp.2019.11.009 [DOI] [PubMed] [Google Scholar]
  89. Liu L., Li Y., Li S., Hu N., He Y., Pong R., et al. (2012). Comparison of next-generation sequencing systems. J. Biomed. Biotechnol. 2012:251364. 10.1155/2012/251364 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Liu M. J., Zhao J., Cai Q.-L., Liu G.-C., Wang J.-R., Zhao Z.-H., et al. (2014). The complex jujube genome provides insights into fruit tree biology. Nat. Commun. 5:5315. 10.1038/ncomms6315 [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Liu S., Liu Y., Yang X., Tong C., Edwards D., Parkin I. A. P., et al. (2014). The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes. Nat. Commun. 5:3930. 10.1038/ncomms4930 [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Liu X., Liu Y., Huang P., Ma Y., Qing Z., Tang Q., et al. (2017). The genome of medicinal plant Macleaya cordata provides new insights into benzylisoquinoline alkaloids metabolism. Mol. Plant 10 975–989. 10.1016/j.molp.2017.05.007 [DOI] [PubMed] [Google Scholar]
  93. Liu Y., Tang Q., Cheng P., Zhu M., Zhang H., Liu J., et al. (2020). Whole-genome sequencing and analysis of the Chinese herbal plant Gelsemium elegans. Acta Pharm. Sin. B 10 374–382. 10.1016/j.apsb.2019.08.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Liu Y., Wang B., Shu S., Li Z., Song C., Liu D., et al. (2021). Analysis of the Coptis chinensis genome reveals the diversification of protoberberine-type alkaloids. Nat. Commun. 12:3276. 10.1038/s41467-021-23611-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Loman N. J., Quick J., Simpson J. T. (2015). A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat. Methods 12 733–735. 10.1038/nmeth.3444 [DOI] [PubMed] [Google Scholar]
  96. Lu M., An H., Li L. (2016). Genome survey sequencing for the characterization of the genetic background of Rosa roxburghii tratt and leaf ascorbate metabolism genes. PLoS One 11:e0147530. 10.1371/journal.pone.0147530 [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Luo X., Li H., Wu Z., Yao W., Zhao P., Cao D., et al. (2020). The pomegranate (Punica granatum L.) draft genome dissects genetic divergence between soft- and hard-seeded cultivars. Plant Biotechnol. J. 18 955–968. 10.1111/pbi.13260 [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Lv Q., Qiu J., Liu J., Li Z., Zhang W., Wang Q., et al. (2020). The Chimonanthus salicifolius genome provides insight into magnoliid evolution and flavonoid biosynthesis. Plant J. 103 1910–1923. 10.1111/tpj.14874 [DOI] [PubMed] [Google Scholar]
  99. Lv S., Cheng S., Wang Z., Li S., Jin X., Lan L., et al. (2020). Draft genome of the famous ornamental plant Paeonia suffruticosa. Ecol. Evol. 10 4518–4530. 10.1002/ece3.5965 [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Ma D., Dong S., Zhang S., Wei X., Xie Q., Ding Q., et al. (2021). Chromosome-level reference genome assembly provides insights into aroma biosynthesis in passion fruit (Passiflora edulis). Mol. Ecol. Resour. 21 955–968. 10.1111/1755-0998.13310 [DOI] [PubMed] [Google Scholar]
  101. Ma L., Wang Q., Mu J., Fu A., Wen C., Zhao X., et al. (2020). The genome and transcriptome analysis of snake gourd provide insights into its evolution and fruit development and ripening. Hortic. Res. 7:199. 10.1038/s41438-020-00423-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Ma Q., Sun T., Li S., Wen J., Zhu L., Yin T., et al. (2020). The Acer truncatum genome provides insights into nervonic acid biosynthesis. Plant J. 104 662–678. 10.1111/tpj.14954 [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Mahesh H. B., Subba P., Advani J., Shirke M. D., Loganathan R. M., Chandana S. L., et al. (2018). Multi-Omics driven assembly and annotation of the sandalwood (Santalum album) Genome. Plant Physiol. 176 2772–2788. 10.1104/pp.17.01764 [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Mardis E. R. (2008). The impact of next-generation sequencing technology on genetics. Trends Genet. 24 133–141. 10.1016/j.tig.2007.12.007 [DOI] [PubMed] [Google Scholar]
  105. Margulies M., Egholm M., Altman W. E., Attiya S., Bader J. S., Bemben L. A., et al. (2005). Genome sequencing in microfabricated high-density picolitre reactors. Nature 437 376–380. 10.1038/nature03959 [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Marrano A., Britton M., Zaini P. A., Zimin A. V., Workman R. E., Puiu D., et al. (2020). High-quality chromosome-scale assembly of the walnut (Juglans regia L.) reference genome. Gigascience 9 1–16. 10.1093/gigascience/giaa050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Martínez-García P. J., Crepeau M. W., Puiu D., Gonzalez-Ibeas D., Whalen J., Stevens K. A., et al. (2016). The walnut (Juglans regia) genome sequence reveals diversity in genes coding for the biosynthesis of non-structural polyphenols. Plant J. 87 507–532. 10.1111/tpj.13207 [DOI] [PubMed] [Google Scholar]
  108. Matsumura H., Hsiao M.-C., Lin Y.-P., Toyoda A., Taniai N., Tarora K., et al. (2020). Long-read bitter gourd (Momordica charantia) genome and the genomic architecture of nonclassic domestication. Proc. Natl. Acad. Sci. U.S.A. 117 14543–14551. 10.1073/pnas.1921016117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Michael T. P., Jackson S. (2013). The first 50 plant genomes. Plant Genome 6 1–7. 10.3835/plantgenome2013.03.0001in [DOI] [Google Scholar]
  110. Ming R., Hou S., Feng Y., Yu Q., Dionne-Laporte A., Saw J. H., et al. (2008). The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature 452 991–996. 10.1038/nature06856 [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Ming R., VanBuren R., Liu Y., Yang M., Han Y., Li L.-T., et al. (2013). Genome of the long-living sacred lotus (Nelumbo nucifera Gaertn.). Genome Biol. 14:R41. 10.1186/gb-2013-14-5-r41 [DOI] [PMC free article] [PubMed] [Google Scholar]
  112. Mochida K., Sakurai T., Seki H., Yoshida T., Takahagi K., Sawai S., et al. (2017). Draft genome assembly and annotation of Glycyrrhiza uralensis, a medicinal legume. Plant J. 89 181–194. 10.1111/tpj.13385 [DOI] [PubMed] [Google Scholar]
  113. Moher D., Liberati A., Tetzlaff J., Altman D. G. (2009). Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. 6:e1000097. 10.1371/journal.pmed.1000097 [DOI] [PMC free article] [PubMed] [Google Scholar]
  114. Moss J., Yuan C. S. (2006). Herbal medicines and perioperative care. Anesthesiology 105 441–442. 10.1097/00000542-200609000-00002 [DOI] [PubMed] [Google Scholar]
  115. Nafis T., Akmal M., Ram M., Alam P., Ahlawat S., Mohd A., et al. (2011). Enhancement of artemisinin content by constitutive expression of the HMG-CoA reductase gene in high-yielding strain of Artemisia annua L. Plant Biotechnol. Rep. 5 53–60. 10.1007/s11816-010-0156-x [DOI] [Google Scholar]
  116. Neller K. C. M., Diaz C. A., Platts A. E., Hudak K. A. (2019). De novo assembly of the pokeweed genome provides insight into pokeweed antiviral protein (PAP) gene expression. Front. Plant Sci. 10:1002. 10.3389/fpls.2019.01002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Newman T., deBruijn F. J., Green P., Keegstra K., Kende H., McIntosh L., et al. (1994). Genes galore: a summary of methods for accessing results from large-scale partial sequencing of anonymous Arabidopsis cDNA clones. Plant Physiol. 106 1241–1255. 10.1104/pp.106.4.1241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  118. Niu Z., Zhu F., Fan Y., Li C., Zhang B., Zhu S., et al. (2021). The chromosome-level reference genome assembly for Dendrobium officinale and its utility of functional genomics research and molecular breeding study. Acta Pharm. Sin. B 11 2080–2092. 10.1016/j.apsb.2021.01.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  119. Nobrega M. A., Pennacchio L. A. (2004). Comparative genomic analysis as a tool for biological discovery. J. Physiol. 554 31–39. 10.1113/jphysiol.2003.050948 [DOI] [PMC free article] [PubMed] [Google Scholar]
  120. Nong W., Law S. T. S., Wong A. Y. P., Baril T., Swale T., Chu L. M., et al. (2020). Chromosomal-level reference genome of the incense tree Aquilaria sinensis. Mol. Ecol. Resour. 20 971–979. 10.1111/1755-0998.13154 [DOI] [PMC free article] [PubMed] [Google Scholar]
  121. Ou X., Jin H., Guo L., Yang Y., Cui X., Xiao Y., et al. (2011). [Status and prospective on nutritional physiology and fertilization of Panax notoginseng]. Zhongguo Zhong Yao Za Zhi 36 2620–2624. 10.4268/cjcmm20111904 [DOI] [PubMed] [Google Scholar]
  122. Paddon C. J., Westfall P. J., Pitera D. J., Benjamin K., Fisher K., McPhee D., et al. (2013). High-level semi-synthetic production of the potent antimalarial artemisinin. Nature 496 528–532. 10.1038/nature12051 [DOI] [PubMed] [Google Scholar]
  123. Patil A. B., Shinde S. S., Raghavendra S., Satish B. N., Kushalappa C. G., Vijay N. (2021). The genome sequence of Mesua ferrea and comparative demographic histories of forest trees. Gene 769 145214. 10.1016/j.gene.2020.145214 [DOI] [PubMed] [Google Scholar]
  124. Pei L., Wang B., Ye J., Hu X., Fu L., Li K., et al. (2021). Genome and transcriptome of Papaver somniferum Chinese landrace CHM indicates that massive genome expansion contributes to high benzylisoquinoline alkaloid biosynthesis. Hortic. Res. 8:5. 10.1038/s41438-020-00435-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  125. Peng X., Liu H., Chen P., Tang F., Hu Y., Wang F., et al. (2019). A chromosome-scale genome assembly of paper mulberry (Broussonetia papyrifera) provides new insights into its forage and papermaking usage. Mol. Plant 12 661–677. 10.1016/j.molp.2019.01.021 [DOI] [PubMed] [Google Scholar]
  126. Peng Z., Bredeson J. V., Wu G. A., Shu S., Rawat N., Du D., et al. (2020). A chromosome-scale reference genome of trifoliate orange (Poncirus trifoliata) provides insights into disease resistance, cold tolerance and genome evolution in Citrus. Plant J. 104 1215–1232. 10.1111/tpj.14993 [DOI] [PMC free article] [PubMed] [Google Scholar]
  127. Pootakham W., Naktang C., Kongkachana W., Sonthirod C., Yoocha T., Sangsrakru D., et al. (2021a). De novo chromosome-level assembly of the Centella asiatica genome. Genomics 113 2221–2228. 10.1016/j.ygeno.2021.05.019 [DOI] [PubMed] [Google Scholar]
  128. Pootakham W., Sonthirod C., Naktang C., Nawae W., Yoocha T., Kongkachana W., et al. (2021b). De novo assemblies of Luffa acutangula and Luffa cylindrica genomes reveal an expansion associated with substantial accumulation of transposable elements. Mol. Ecol. Resour. 21 212–225. 10.1111/1755-0998.13240 [DOI] [PubMed] [Google Scholar]
  129. Pu X., Li Z., Tian Y., Gao R., Hao L., Hu Y., et al. (2020). The honeysuckle genome provides insight into the molecular mechanism of carotenoid metabolism underlying dynamic flower coloration. New Phytol. 227 930–943. 10.1111/nph.16552 [DOI] [PMC free article] [PubMed] [Google Scholar]
  130. Purugganan M. D., Fuller D. Q. (2009). The nature of selection during plant domestication. Nature 457 843–848. 10.1038/nature07895 [DOI] [PubMed] [Google Scholar]
  131. Qin C., Yu C., Shen Y., Fang X., Chen L., Min J., et al. (2014). Whole-genome sequencing of cultivated and wild peppers provides insights into Capsicum domestication and specialization. Proc. Natl. Acad. Sci. U.S.A. 111 5135–5140. 10.1073/pnas.1400975111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  132. Qin G., Xu C., Ming R., Tang H., Guyot R., Kramer E. M., et al. (2017). The pomegranate (Punica granatum L.) genome and the genomics of punicalagin biosynthesis. Plant J. 91 1108–1128. 10.1111/tpj.13625 [DOI] [PubMed] [Google Scholar]
  133. Qin S., Wu L., Wei K., Liang Y., Song Z., Zhou X., et al. (2019). A draft genome for Spatholobus suberectus. Sci. Data 6:113. 10.1038/s41597-019-0110-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  134. Qing Z., Liu J., Yi X., Liu X., Hu G., Lao J., et al. (2021). The chromosome-level Hemerocallis citrina Borani genome provides new insights into the rutin biosynthesis and the lack of colchicine. Hortic. Res. 8:89. 10.1038/s41438-021-00539-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  135. Quick J., Quinlan A. R., Loman N. J. (2014). A reference bacterial genome dataset generated on the MinIONTM portable single-molecule nanopore sequencer. Gigascience 3 1–6. 10.1186/2047-217X-3-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  136. Rai A., Hirakawa H., Nakabayashi R., Kikuchi S., Hayashi K., Rai M., et al. (2021). Chromosome-level genome assembly of Ophiorrhiza pumila reveals the evolution of camptothecin biosynthesis. Nat. Commun. 12:405. 10.1038/s41467-020-20508-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  137. Rajewski A., Carter-House D., Stajich J., Litt A. (2021). Datura genome reveals duplications of psychoactive alkaloid biosynthetic genes and high mutation rate following tissue culture. BMC Genomics 22:201. 10.1186/s12864-021-07489-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  138. Raymond O., Gouzy J., Just J., Badouin H., Verdenaud M., Lemainque A., et al. (2018). The Rosa genome provides new insights into the domestication of modern roses. Nat. Genet. 50 772–777. 10.1038/s41588-018-0110-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  139. Ren H., Yu H., Zhang S., Liang S., Zheng X., Zhang S., et al. (2019). Genome sequencing provides insights into the evolution and antioxidant activity of Chinese bayberry. BMC Genomics 20:458. 10.1186/s12864-019-5818-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  140. Schnable P. S., Ware D., Fulton R. S., Stein J. C., Wei F., Pasternak S., et al. (2009). The B73 maize genome: complexity, diversity, and dynamics. Science 326 1112–1115. 10.1126/science.1178534 [DOI] [PubMed] [Google Scholar]
  141. Schwartz D. C., Li X., Hernandez L. I., Ramnarain S. P., Huff E. J., Wang Y. K. (1993). Ordered restriction maps of Saccharomyces cerevisiae chromosomes constructed by optical mapping. Science 262 110–114. 10.1126/science.8211116 [DOI] [PubMed] [Google Scholar]
  142. Shang J., Tian J., Cheng H., Yan Q., Li L., Jamal A., et al. (2020). The chromosome-level wintersweet (Chimonanthus praecox) genome provides insights into floral scent biosynthesis and flowering in winter. Genome Biol. 21:200. 10.1186/s13059-020-02088-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  143. Shen C., Du H., Chen Z., Lu H., Zhu F., Chen H., et al. (2020). The chromosome-level genome sequence of the Autotetraploid alfalfa and Resequencing of core Germplasms provide genomic resources for alfalfa research. Mol. Plant 13 1250–1261. 10.1016/j.molp.2020.07.003 [DOI] [PubMed] [Google Scholar]
  144. Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., et al. (2018). The genome of Artemisia annua provides insight into the evolution of Asteraceae family and Artemisinin biosynthesis. Mol. Plant 11 776–788. 10.1016/j.molp.2018.03.015 [DOI] [PubMed] [Google Scholar]
  145. Song C., Liu Y., Song A., Dong G., Zhao H., Sun W., et al. (2018). The Chrysanthemum nankingense genome provides insights into the evolution and diversification of Chrysanthemum flowers and medicinal traits. Mol. Plant 11 1482–1491. 10.1016/j.molp.2018.10.003 [DOI] [PubMed] [Google Scholar]
  146. Song X., Sun P., Yuan J., Gong K., Li N., Meng F., et al. (2021). The celery genome sequence reveals sequential paleo-polyploidizations, karyotype evolution and resistance gene reduction in apiales. Plant Biotechnol. J. 19 731–744. 10.1111/pbi.13499 [DOI] [PMC free article] [PubMed] [Google Scholar]
  147. Song X., Wang J., Li N., Yu J., Meng F., Wei C., et al. (2020). Deciphering the high-quality genome sequence of coriander that causes controversial feelings. Plant Biotechnol. J. 18 1444–1456. 10.1111/pbi.13310 [DOI] [PMC free article] [PubMed] [Google Scholar]
  148. Song Z., Lin C., Xing P., Fen Y., Jin H., Zhou C., et al. (2020). A high-quality reference genome sequence of Salvia miltiorrhiza provides insights into tanshinone synthesis in its red rhizomes. Plant Genome 13:e20041. 10.1002/tpg2.20041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  149. Su W., Jing Y., Lin S., Yue Z., Yang X., Xu J., et al. (2021). Polyploidy underlies co-option and diversification of biosynthetic triterpene pathways in the apple tribe. Proc. Natl. Acad. Sci. U.S.A. 118 e2101767118. 10.1073/pnas.2101767118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  150. Su X. Z., Miller L. H. (2015). The discovery of artemisinin and the Nobel Prize in Physiology or Medicine. Sci. China Life Sci. 58 1175–1179. 10.1007/s11427-015-4948-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  151. Sun G., Xu Y., Liu H., Sun T., Zhang J., Hettenhausen C., et al. (2018). Large-scale gene losses underlie the genome evolution of parasitic plant Cuscuta australis. Nat. Commun. 9:2683. 10.1038/s41467-018-04721-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  152. Sun W., Leng L., Yin Q., Xu M., Huang M., Xu Z., et al. (2019). The genome of the medicinal plant Andrographis paniculata provides insight into the biosynthesis of the bioactive diterpenoid neoandrographolide. Plant J. 97 841–857. 10.1111/tpj.14162 [DOI] [PMC free article] [PubMed] [Google Scholar]
  153. Sun X., Zhu S., Li N., Cheng Y., Zhao J., Qiao X., et al. (2020). A chromosome-level genome assembly of garlic (Allium sativum) provides insights into genome evolution and allicin biosynthesis. Mol. Plant 13 1328–1339. 10.1016/j.molp.2020.07.019 [DOI] [PubMed] [Google Scholar]
  154. Tu L., Su P., Zhang Z., Gao L., Wang J., Hu T., et al. (2020). Genome of Tripterygium wilfordii and identification of cytochrome P450 involved in triptolide biosynthesis. Nat. Commun. 11:971. 10.1038/s41467-020-14776-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  155. Tuskan G. A., Difazio S., Jansson S., Bohlmann J., Grigoriev I., Hellsten U., et al. (2006). The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 313 1596–1604. 10.1126/science.1128691 [DOI] [PubMed] [Google Scholar]
  156. Upadhyay A. K., Chacko A. R., Gandhimathi A., Ghosh P., Harini K., Joseph A. P., et al. (2015). Genome sequencing of herb Tulsi (Ocimum tenuiflorum) unravels key genes behind its strong medicinal properties. BMC Plant Biol. 15:212. 10.1186/s12870-015-0562-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  157. Urasaki N., Takagi H., Natsume S., Uemura A., Taniai N., Miyagi N., et al. (2017). Draft genome sequence of bitter gourd (Momordica charantia), a vegetable and medicinal plant in tropical and subtropical regions. DNA Res. 24 51–58. 10.1093/dnares/dsw047 [DOI] [PMC free article] [PubMed] [Google Scholar]
  158. vanBakel H., Stout J. M., Cote A. G., Tallon C. M., Sharpe A. G., Hughes T. R., et al. (2011). The draft genome and transcriptome of Cannabis sativa. Genome Biol. 12:R102. 10.1186/gb-2011-12-10-r102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  159. Vanburen R., Bryant D., Edger P. P., Tang H., Burgess D., Challabathula D., et al. (2015). Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum. Nature 527 508–511. 10.1038/nature15714 [DOI] [PubMed] [Google Scholar]
  160. Vining K. J., Johnson S. R., Ahkami A., Lange I., Parrish A. N., Trapp S. C., et al. (2017). Draft genome sequence of Mentha longifolia and development of resources for mint cultivar improvement. Mol. Plant 10 323–339. 10.1016/j.molp.2016.10.018 [DOI] [PubMed] [Google Scholar]
  161. Wang J., Xu S., Mei Y., Cai S., Gu Y., Sun M., et al. (2021). A high-quality genome assembly of Morinda officinalis, a famous native southern herb in the Lingnan region of southern China. Hortic. Res. 8:135. 10.1038/s41438-021-00551-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  162. Wang L., He F., Huang Y., He J., Yang S., Zeng J., et al. (2018). Genome of wild mandarin and domestication history of mandarin. Mol. Plant 11 1024–1037. 10.1016/j.molp.2018.06.001 [DOI] [PubMed] [Google Scholar]
  163. Wang L., Yu S., Tong C., Zhao Y., Liu Y., Song C., et al. (2014). Genome sequencing of the high oil crop sesame provides insight into oil biosynthesis. Genome Biol. 15:R39. 10.1186/gb-2014-15-2-r39 [DOI] [PMC free article] [PubMed] [Google Scholar]
  164. Wang M., Zhang L., Wang Z. (2021). Chromosomal-Level reference genome of the neotropical tree Jacaranda mimosifolia D. Don. Genome Biol. Evol. 13 2–7. 10.1093/gbe/evab094 [DOI] [PMC free article] [PubMed] [Google Scholar]
  165. Wang P., Yi S., Mu X., Zhang J., Du J. (2020). Chromosome-Level genome assembly of Cerasus humilis using PacBio and Hi-C technologies. Front. Genet. 11:956. 10.3389/fgene.2020.00956 [DOI] [PMC free article] [PubMed] [Google Scholar]
  166. Wang X., Xu Y., Zhang S., Cao L., Huang Y., Cheng J., et al. (2017). Genomic analyses of primitive, wild and cultivated citrus provide insights into asexual reproduction. Nat. Genet. 49 765–772. 10.1038/ng.3839 [DOI] [PubMed] [Google Scholar]
  167. Wang X., Zhang J., He S., Gao Y., Ma X., Gao Y., et al. (2018). HMOD: an omics database for herbal medicine plants. Mol. Plant 11 757–759. 10.1016/j.molp.2018.03.002 [DOI] [PubMed] [Google Scholar]
  168. Wang Y., Fan G., Liu Y., Sun F., Shi C., Liu X., et al. (2013). The sacred lotus genome provides insights into the evolution of flowering plants. Plant J. 76 557–567. 10.1111/tpj.12313 [DOI] [PubMed] [Google Scholar]
  169. Wang Z., Hobson N., Galindo L., Zhu S., Shi D., McDill J., et al. (2012). The genome of flax (Linum usitatissimum) assembled de novo from short shotgun sequence reads. Plant J. 72 461–473. 10.1111/j.1365-313X.2012.05093.x [DOI] [PubMed] [Google Scholar]
  170. Wu H., Zhao G., Gong H., Li J., Luo C., He X., et al. (2020). A high-quality sponge gourd (Luffa cylindrica) genome. Hortic. Res. 7:128. 10.1038/s41438-020-00350-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  171. Wu S., Shamimuzzaman M., Sun H., Salse J., Sui X., Wilder A., et al. (2017). The bottle gourd genome provides insights into Cucurbitaceae evolution and facilitates mapping of a Papaya ring-spot virus resistance locus. Plant J. 92 963–975. 10.1111/tpj.13722 [DOI] [PubMed] [Google Scholar]
  172. Wu S., Sun W., Xu Z., Zhai J., Li X., Li C., et al. (2020). The genome sequence of star fruit (Averrhoa carambola). Hortic. Res. 7:95. 10.1038/s41438-020-0307-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  173. Wu Z., Liu H., Zhan W., Yu Z., Qin E., Liu S., et al. (2021). The chromosome-scale reference genome of safflower (Carthamus tinctorius) provides insights into linoleic acid and flavonoid biosynthesis. Plant Biotechnol. J. 19 1725–1742. 10.1111/pbi.13586 [DOI] [PMC free article] [PubMed] [Google Scholar]
  174. Wuyun T., Wang L., Liu H., Wang X., Zhang L., Bennetzen J. L., et al. (2018). The hardy rubber tree genome provides insights into the evolution of polyisoprene biosynthesis. Mol. Plant 11 429–442. 10.1016/j.molp.2017.11.014 [DOI] [PubMed] [Google Scholar]
  175. Xia E. H., Zhang H.-B., Sheng J., Li K., Zhang Q.-J., Kim C., et al. (2017). The tea tree genome provides insights into tea flavor and independent evolution of caffeine biosynthesis. Mol. Plant 10 866–877. 10.1016/j.molp.2017.04.002 [DOI] [PubMed] [Google Scholar]
  176. Xia M., Han X., He H., Yu R., Zhen G., Jia X., et al. (2018). Improved de novo genome assembly and analysis of the Chinese cucurbit Siraitia grosvenorii, also known as monk fruit or luo-han-guo. Gigascience 7 1–9. 10.1093/gigascience/giy067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  177. Xia Z., Huang D., Zhang S., Wang W., Ma F., Wu B., et al. (2021). Chromosome-scale genome assembly provides insights into the evolution and flavor synthesis of passion fruit (Passiflora edulis Sims). Hortic. Res. 8:14. 10.1038/s41438-020-00455-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  178. Xie J., Zhao H., Li K., Zhang R., Jiang Y., Wang M., et al. (2020). A chromosome-scale reference genome of Aquilegia oxysepala var. kansuensis. Hortic. Res. 7:113. 10.1038/s41438-020-0328-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  179. Xu H., Song J., Luo H., Zhang Y., Li Q., Zhu Y., et al. (2016). Analysis of the genome sequence of the medicinal plant Salvia miltiorrhiza. Mol. Plant 9 949–952. 10.1016/j.molp.2016.03.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  180. Xu J., Chu Y., Liao B., Xiao S., Yin Q., Bai R., et al. (2017). Panax ginseng genome examination for ginsenoside biosynthesis. Gigascience 6 1–15. 10.1093/gigascience/gix093 [DOI] [PMC free article] [PubMed] [Google Scholar]
  181. Xu X., Yuan H., Yu X., Huang S., Sun Y., Zhang T., et al. (2021). The chromosome-level Stevia genome provides insights into steviol glycoside biosynthesis. Hortic. Res. 8:129. 10.1038/s41438-021-00565-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  182. Xu Z., Pu X., Gao R., Demurtas O. C., Fleck S. J., Richter M., et al. (2020b). Tandem gene duplications drive divergent evolution of caffeine and crocin biosynthetic pathways in plants. BMC Biol. 18:63. 10.1186/s12915-020-00795-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  183. Xu Z., Gao R., Pu X., Xu R., Wang J., Zheng S., et al. (2020a). Comparative genome analysis of Scutellaria baicalensis and Scutellaria barbata reveals the evolution of active flavonoid biosynthesis. Genomics Proteomics Bioinformatics 18 230–240. 10.1016/j.gpb.2020.06.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  184. Xu Z., Xin T., Bartels D., Li Y., Gu W., Yao H., et al. (2018). Genome analysis of the ancient tracheophyte Selaginella tamariscina reveals evolutionary features relevant to the acquisition of desiccation tolerance. Mol. Plant 11 983–994. 10.1016/j.molp.2018.05.003 [DOI] [PubMed] [Google Scholar]
  185. Yan L., Wang X., Liu H., Tian Y., Lian J., Yang R., et al. (2015). The genome of Dendrobium officinale illuminates the biology of the important traditional Chinese orchid herb. Mol. Plant 8 922–934. 10.1016/j.molp.2014.12.011 [DOI] [PubMed] [Google Scholar]
  186. Yang J., Zhang G., Zhang J., Liu H., Chen W., Wang X., et al. (2017). Hybrid de novo genome assembly of the Chinese herbal fleabane Erigeron breviscapus. Gigascience 6 1–7. 10.1093/gigascience/gix028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  187. Yang X., Yue Y., Li H., Ding W., Chen G., Shi T., et al. (2018). The chromosome-level quality genome provides insights into the evolution of the biosynthesis genes for aroma compounds of Osmanthus fragrans. Hortic. Res. 5:72. 10.1038/s41438-018-0108-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  188. Yang Z., Chen S., Wang S., Hu Y., Zhang G., Dong Y., et al. (2021a). Chromosomal-scale genome assembly of Eleutherococcus senticosus provides insights into chromosome evolution in Araliaceae. Mol. Ecol. Resour. 21 2204–2220. 10.1111/1755-0998.13403 [DOI] [PubMed] [Google Scholar]
  189. Yang Z., Liu G., Zhang G., Yan J., Dong Y., Lu Y., et al. (2021b). The chromosome-scale high-quality genome assembly of Panax notoginseng provides insight into dencichine biosynthesis. Plant Biotechnol. J. 19 869–871. 10.1111/pbi.13558 [DOI] [PMC free article] [PubMed] [Google Scholar]
  190. Yin J., Jiang L., Wang L., Han X., Guo W., Li C., et al. (2021). A high-quality genome of taro (Colocasia esculenta (L.) Schott), one of the world’s oldest crops. Mol. Ecol. Resour. 21 68–77. 10.1111/1755-0998.13239 [DOI] [PubMed] [Google Scholar]
  191. Yu J., Hu S., Wang J., Wong G. K.-S., Li S., Liu B., et al. (2002). A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science 296 79–92. 10.1126/science.1068037 [DOI] [PubMed] [Google Scholar]
  192. Yuan Y., Jin X., Liu J., Zhao X., Zhou J., Wang X., et al. (2018). The Gastrodia elata genome provides insights into plant adaptation to heterotrophy. Nat. Commun. 9:1615. 10.1038/s41467-018-03423-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  193. Yuan Y., Liu W., Zhang Q., Xiang L., Liu X., Chen M., et al. (2015). Overexpression of artemisinic aldehyde Δ11 (13) reductase gene-enhanced artemisinin and its relative metabolite biosynthesis in transgenic Artemisia annua L. Biotechnol. Appl. Biochem. 62 17–23. 10.1002/bab.1234 [DOI] [PubMed] [Google Scholar]
  194. Yuan Z., Fang Y., Zhang T., Fei Z., Han F., Liu C., et al. (2018). The pomegranate (Punica granatum L.) genome provides insights into fruit quality and ovule developmental biology. Plant Biotechnol. J. 16 1363–1374. 10.1111/pbi.12875 [DOI] [PMC free article] [PubMed] [Google Scholar]
  195. Zhang D., Li W., Xia E., Zhang Q., Liu Y., Zhang Y., et al. (2017). The medicinal herb panax Notoginseng genome provides insights into Ginsenoside biosynthesis and genome evolution. Mol. Plant 10 903–907. 10.1016/j.molp.2017.02.011 [DOI] [PubMed] [Google Scholar]
  196. Zhang G. Q., Xu Q., Bian C., Tsai W.-C., Yeh C.-M., Liu K.-W., et al. (2016). The Dendrobium catenatum Lindl. genome sequence provides insights into polysaccharide synthase, floral development and adaptive evolution. Sci. Rep. 6:19029. 10.1038/srep19029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  197. Zhang J., Tian Y., Yan L., Zhang G., Wang X., Zeng Y., et al. (2016). Genome of plant Maca (Lepidium meyenii) illuminates genomic basis for high-altitude adaptation in the central andes. Mol. Plant 9 1066–1077. 10.1016/j.molp.2016.04.016 [DOI] [PubMed] [Google Scholar]
  198. Zhang L., Li X., Ma B., Gao Q., Du H., Han Y., et al. (2017). The tartary buckwheat genome provides insights into rutin biosynthesis and abiotic stress tolerance. Mol. Plant 10 1224–1237. 10.1016/j.molp.2017.08.013 [DOI] [PubMed] [Google Scholar]
  199. Zhang L., Liu M., Long H., Dong W., Pasha A., Esteban E., et al. (2019). Tung tree (Vernicia fordii) genome provides a resource for understanding genome evolution and improved oil production. Genomics Proteomics Bioinformatics 17 558–575. 10.1016/j.gpb.2019.03.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  200. Zhang T., Ren X., Zhang Z., Ming Y., Yang Z., Hu J., et al. (2020). Long-read sequencing and de novo assembly of the Luffa cylindrica (L.) Roem. genome. Mol. Ecol. Resour. 20 511–519. 10.1111/1755-0998.13129 [DOI] [PubMed] [Google Scholar]
  201. Zhang Y., Zheng L., Zheng Y., Zhou C., Huang P., Xiao X., et al. (2019). Assembly and annotation of a draft genome of the medicinal plant Polygonum cuspidatum. Front. Plant Sci. 10:1274. 10.3389/fpls.2019.01274 [DOI] [PMC free article] [PubMed] [Google Scholar]
  202. Zhao D., Hamilton J. P., Pham G. M., Crisovan E., Wiegert-Rininger K., Vaillancourt B., et al. (2017). De novo genome assembly of Camptotheca acuminata, a natural source of the anti-cancer compound camptothecin. Gigascience 6 1–7. 10.1093/gigascience/gix065 [DOI] [PMC free article] [PubMed] [Google Scholar]
  203. Zhao Q., Yang J., Cui M.-Y., Liu J., Fang Y., Yan M., et al. (2019). The reference genome sequence of Scutellaria baicalensis provides insights into the evolution of Wogonin biosynthesis. Mol. Plant 12 935–950. 10.1016/j.molp.2019.04.002 [DOI] [PubMed] [Google Scholar]
  204. Zhao Y. P., Fan G., Yin P. P., Sun S., Li N., Hong X., et al. (2019). Resequencing 545 ginkgo genomes across the world reveals the evolutionary history of the living fossil. Nat. Commun 10:4201. 10.1038/s41467-019-12133-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  205. Zheng G. X. Y., Lau B. T., Schnall-Levin M., Jarosz M., Bell J. M., Hindson C. M., et al. (2016). Haplotyping germline and cancer genomes with high-throughput linked-read sequencing. Nat. Biotechnol. 34 303–311. 10.1038/nbt.3432 [DOI] [PMC free article] [PubMed] [Google Scholar]
  206. Zheng X., Chen D., Chen B., Liang L., Huang Z., Fan W., et al. (2021). Insights into salvianolic acid B biosynthesis from chromosome-scale assembly of the Salvia bowleyana genome. J. Integr. Plant Biol. 63 1309–1323. 10.1111/jipb.13085 [DOI] [PubMed] [Google Scholar]
  207. Zhou W., Li B., Li L., Ma W., Liu Y., Feng S., et al. (2018). Genome survey sequencing of Dioscorea zingiberensis. Genome 61 567–574. 10.1139/gen-2018-0011 [DOI] [PubMed] [Google Scholar]
  208. Zhou W., Wang Y., Li B., Petijová L., Hu S., Zhang Q., et al. (2021). Whole-genome sequence data of Hypericum perforatum and functional characterization of melatonin biosynthesis by N-acetylserotonin O-methyltransferase. J. Pineal Res. 70:e12709. 10.1111/jpi.12709 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Frontiers in Plant Science are provided here courtesy of Frontiers Media SA

RESOURCES