A complete sequence of mitochondrial genome of Neolamarckia cadamba and its use for systematic analysis

Xi Wang; Ling-Ling Li; Yu Xiao; Xiao-Yang Chen; Jie-Hu Chen; Xin-Sheng Hu

doi:10.1038/s41598-021-01040-9

. 2021 Nov 2;11:21452. doi: 10.1038/s41598-021-01040-9

A complete sequence of mitochondrial genome of Neolamarckia cadamba and its use for systematic analysis

Xi Wang ^1,², Ling-Ling Li ^1,², Yu Xiao ^1,², Xiao-Yang Chen ^1,², Jie-Hu Chen ³, Xin-Sheng Hu ^1,^2,^✉

PMCID: PMC8564537 PMID: 34728739

Abstract

Neolamarckia cadamba is an important tropical and subtropical tree for timber industry in southern China and is also a medicinal plant because of the secondary product cadambine. N. cadamba belongs to Rubiaceae family and its taxonomic relationships with other species are not fully evaluated based on genome sequences. Here, we report the complete sequences of mitochondrial genome of N. cadamba, which is 414,980 bp in length and successfully assembled in two genome circles (109,836 bp and 305,144 bp). The mtDNA harbors 83 genes in total, including 40 protein-coding genes (PCGs), 31 transfer RNA genes, 6 ribosomal RNA genes, and 6 other genes. The base composition of the whole genome is estimated as 27.26% for base A, 22.63% for C, 22.53% for G, and 27.56% for T, with the A + T content of 54.82% (54.45% in the small circle and 54.79% in the large circle). Repetitive sequences account for ~ 0.14% of the whole genome. A maximum likelihood (ML) tree based on DNA sequences of 24 PCGs supports that N. cadamba belongs to order Gentianales. A ML tree based on rps3 gene of 60 species in family Rubiaceae shows that N. cadamba is more related to Cephalanthus accidentalis and Hymenodictyon parvifolium and belongs to the Cinchonoideae subfamily. The result indicates that N. cadamba is genetically distant from the species and genera of Rubiaceae in systematic position. As the first sequence of mitochondrial genome of N. cadamba, it will provide a useful resource to investigate genetic variation and develop molecular markers for genetic breeding in the future.

Subject terms: Next-generation sequencing, Phylogenetics

Introduction

Neolamarckia cadamba is one of two species (N. macrophylla for the other species) in genus Neolamarckia of Rubiaceae¹, one of the largest families in flowering plants. The species is naturally distributed in Vietnam, Malaysia, Myanmar, India and Sri Lanka, and mainly grows in Guangdong, Guangxi and Yunnan Provinces in China. It grows in the habitat of high temperature and humidity, with the average temperature of 20–24 °C and the annual precipitation of 1200–2400 mm, and also in the fertile, loose and humid soil or in the humid sandy soil. N. cadamba, aka a miraculous tree, is a fast-growing species² and commercially important materials. Its wood is good for building construction, wood board making, furniture, pulp and paper production³. In addition, the tree fruits can be used for nutraceutical enriched beverage⁴. Leaves are used as woody forage to feed livestock⁵ and have effects of antibacterial and anti-inflammatory to animals⁶. One particular value is that the species has enormous pharmacological implications due to its rich secondary metabolites (e.g., phenols and alkaloids)^6–9. The monoterpenoids, alkaloids and triterpenoids are potentially used for medicinal purposes^10,11. N. cadamba is exploited for antimicrobial, wound healing and antioxidant activities^12–14 and for traditionally curing a number of diseases, such as diabetes, anaemia and infectious diseases⁶. The species as a medicinal plant is appreciated in South Asia^15,16 and shows enormous medical implications.

Although N. cadamba is a miraculous tree, the absence of reference genome limits the molecular and evolutionary studies of this species. Current genetic studies of this species cover broad areas, including provenance trials^17,18, propagation through tissue culture^19,20, transcriptome analysis of gene expressions^21,22, single nucleotide polymorphisms (SNPs) and SNP-trait association^23,24, expressed sequence tags (ESTs) of xylem tissues²⁵ and gene discovery in the developing xylem tissue²⁶. Nevertheless, few studies with molecular markers have been reported on population genetic structure, phylogeography and molecular systematics²⁷. This necessitates determination of the genomic sequences to understand the genetic basis of these characters (rapid growth, quality timber, secondary metabolites, etc.), to develop appropriate molecular markers for breeding program, and to gain insights into the evolutionary history of this species.

To develop markers for population genetics and phylogenetic analysis, we here sequenced and reported the mitochondrial genome of this species. The well-known features of mitochondrial DNA (mtDNA) in plants include (i) maternal inheritance in angiosperms, (ii) haplotype per cell, (iii) intra-molecular recombination between repeats²⁸, and (iv) the number of females as its population size (N_f). These features differ from those of nuclear genomes, which correspondingly exhibits (i) biparental inheritance, (ii) diploid per cell, (iii) inter-chromosome recombination and relatively high mutation rates, and (iv) large effective population sizes (2N_e) of nuclear genes ( $2 N_{e} = 4 N_{f}$ under 1:1 sexual ratio)²⁹. Compared with chloroplast and nuclear DNAs, mtDNA has generally a lower mutation rate in plants³⁰. Thus, mtDNA sequences are useful for studying the long-term phylogenetic relationships at the level of species or higher order, and also for studying other perspectives of evolutionary relationships, such as lineage sorting, hybridization and cytonuclear interactions³¹.

Although three major subfamilies in Rubiaceae are delineated, including Rubioideae³², Ixoroideae³³ and Cinchonoideae³⁴, systematic position of N. cadamba remains to be evident. From the morphological characters, N. cadamba is classified into subfamily Cinchonoideae, tribe Naucleeae¹. Based on the cytogenetic study^35,36, N. cadamba has 44 chromosomes (2n) and belongs to subfamily Cinchonoideae, tribe Naucleeae and Subtribes Neolamarckinae³⁷. In this study, we determined the complete mtDNA sequence of N. cadamba and detailed its characteristics. Based on the mtDNA sequence, we then evaluated the phylogenetic relationships among families and genera of Rubiaceae to gain insights into the taxonomic position of N. cadamba.

Results and discussion

Assembly of mitochondrial genome

MtDNA sequence of N. cadamba was determined using PacBio sequencing technique and was successfully assembled in two genome circles. This probably reflects the feature of rapid evolution of structure of plant mitochondrial genomes^38–40. Figure 1 shows two parts of circular structure of the mitochondrial genome, designated as genomes 1 and 2. The genome 1 has 109,836 bp (GenBank Access No. MT320890). It contains 14 genes (Table 1), including 7 protein-coding genes (PCGs), 5 transfer RNA genes, and 2 other genes (ccmFc, ccmFn). The PCGs are 1 NADH dehydrogenase genes (nad7), 2 ATP synthase genes (atp6, atp9), 2 ribosomal proteins genes (rpl18, rps3), 1 maturases gene, and 1 ORF. Four PCGs (nad7, rpl16, rps3, atp9), 4 tRNA genes, and 1 other gene (ccmFc) are on the N-strand, and genes of 1 PCG atp6, 1 tRNA gene and 1 other gene (ccmFn) are on the J-strand. There is only one overlapping region (110 bp in length) between rpl16 and rps3 in genome 1.

Two circular maps of the mitochondrial genome of *Neolamarckia cadamba.*

Table 1.

Annotations and characteristics of mitochondrial genome 1 of Neolamarckia cadamba.

Gene	Strand*	Position	Length (bp)	GC content (%)	Initiation codon	Termination codon	Intergenic nucleotides (bp)
tRNA-Thr	N	4421–4492	72	51.39
orf309	J	7042–7092	309	0.45	ATG	TAA	2549
		10,274–10,531
matR	J	7756–9723	1968	51.78	ATG	TAG	663
ccmFc	N	12,975–13,523	1317	0.45	ATG	TGA	3251
		14,475–15,242
ccmFn	J	37,088–38,716	1629	45.30	ATG	TAA	21,845
nad7	N	47,657–47,917	1185	0.44	ATG	TAG	8940
		49,703–50,413
		51,678–51,746
		52,647–52,790
atp6	J	66,186–67,004	819	37.61	ACG	TAA	13,395
rpl16	N	72,277–72,792	516	44.57	ATG	TAA	5272
rps3	N	72,683–74,287	1674	0.42	ATG	TAG	−110
		76,033–76,101
tRNA-Tyr	N	91,413–91,496	84	50.00			15,311
tRNA-Asn	N	92,321–92,392	72	52.78			824
tRNA-Cys	N	94,407–94,478	72	51.39			2014
atp9	N	99,941–100,186	246	45.53	ATG	TAA	5462
tRNA-Arg	J	104,397–104,467	71	43.66			4210

Open in a new tab

*J stands for the direction of a gene from 5′ to 3′, and N for the direction of a gene from 3′ to 5′.

The mitochondrial genome 2 is 305,144 bp in length (GenBank Access No. MT364442). The genome 2 contains 69 genes (Table 2), including 33 PCGs, 26 transfer RNA genes, 6 ribosomal RNA genes, and 4 other genes (ccmC, mttB, ccmB, ccmC). The PCGs are 8 NADH dehydrogenase genes (nad4L, nad2, nad3, nad5, nad6, nad3, nad9, nad4), 1 succinate dehydrogenase genes (sdh4), 2 ubichinol cytochrome reductase genes, 4 cytochrome c oxidase genes, 5 ATP synthase genes (atp4, atp1, atp9, atp1, atp8), 10 ribosomal proteins genes (6 rps, 4 rpl), and 3 ORFs. The 15 PCGs (nad4L, nad2, nad3, nad9, atp4, atp1, cob, cox2, rpl10, rpl5, rps13, rps12, rps4, orf954, orf108), 10 tRNA genes, and 3 rRNA genes and 1 other gene (ccmC) are located on the N-strand, and the remaining 18 PCG genes, 16 tRNA genes, 3 rRNA genes and 3 other genes (mttB, ccmB, ccmC) are on the J-strand. There are two overlapping regions, with 73 bp overlapping between cox3 and sdh4 and 817 bp overlapping between rps4 and tRNA-Leu. There are certain intergenic sequences among adjacent genes in the remaining genes, indicating relatively low density of gene distribution along the genome. This is consistent with the patterns of other plants where non-coding regions are the important parts in consisting of mitochondrial genome^40–42.

Table 2.

Annotations and characteristics of mitochondrial genome 2 of Neolamarckia cadamba.

Gene	Strand*	Position	Length (bp)	GC content (%)	Initiation codon	Termination codon	Intergenic nucleotides (bp)
nad2	N	1–153	1467	38.72	ATG	TAA
		186,243–186,401
		188,825–189,397
		190,869–191,057
		303,715–304,107
tRNA-Thr	N	15,576–15,647	72	51.39			15,422
cox2	J	17,577–17,996	783	40.74	ATG	TAA	1929
		19,517–19,879
rrn26	N	31,879–35,307	3429	50.22			11,999
ccmC	N	41,488–42,240	753	43.69	ATG	TAA	6180
rrn5	N	42,414–42,529	116	53.45			173
rrn18	N	42,691–44,531	1841	53.56			161
mttB	J	57,829–58,668	840	41.79	ATG	TAG	13,297
tRNA-Ser	J	64,038–64,125	88	51.14			5369
atp4	N	67,536–68,123	588	42.01	ATG	TGA	3410
nad4L	N	68,311–68,613	303	35.97	ATG	TAA	187
tRNA-Lys	N	75,008–75,082	75	46.67			6394
ccmB	J	77,440–78,060	621	41.38	ATG	TGA	2357
rpl10	N	78,351–78,839	489	41.51	ATG	TAA	290
rps1	J	81,832–82,437	606	42.08	ATG	TAA	2992
tRNA-Met	J	89,045–89,121	77	44.16			6607
orf954	N	93,928–94,413	954	43.50	ATG	TAA	4806
		95,766–95,846
		140,865–141,251
rps13	N	96,776–97,126	351	39.32	ATG	TGA	929
tRNA-Gly	J	104,293–104,366	74	54.05			7166
tRNA-Gln	J	107,333–107,404	72	47.22			2966
tRNA-Ile	N	108,714–108,789	76	35.53			1309
atp1	J	108,877–110,406	1530	43.86	ATG	TAA	87
rpl5	J	112,970–113,524	555	42.16	ATG	TAA	2563
orf108	J	113,526–113,633	108	35.19	ATG	TAG	1
cob	J	115,168–116,349	1182	40.52	ATG	TGA	1534
rps12	N	117,371–117,748	378	43.65	ATG	TGA	1021
nad3	N	117,797–118,153	357	40.62	ATG	TAA	48
tRNA-Val	N	132,275–132,334	60	50.00			14,121
rrn18	J	133,925–135,765	1841	53.56			1590
rrn5	J	135,927–136,041	115	53.04			161
ccmC	J	136,215–136,967	753	43.69	ATG	GAA	173
rrn26	J	143,147–146,575	3429	50.22			6179
cox2	N	158,575–158,937	783	40.74	ATG	TAA	11,999
		160,458–160,877
tRNA-Thr	J	162,807–162,878	72	51.39			1929
tRNA-Arg	N	172,669–172,739	71	43.66			9790
atp9	J	176,950–177,195	246	45.53	ATG	TAA	4210
nad5	J	177,495–177,722	1986	40.89	ATG	TAA	299
		178,573–179,787
		261,418–261,810
		262,913–263,062
tRNA-Cys	J	182,658–182,729	72	51.39			2870
tRNA-Asn	J	184,744–184,815	72	52.78			2014
tRNA-Tyr	J	185,640–185,723	84	50.00			824
rps10	J	202,042–202,293	333	37.84	ACG	TGA	16,318
		203,144–203,224
cox1	J	203,424–205,007	1584	42.87	ATG	TAA	199
nad6	J	215,496–216,113	618	40.13	ATG	TAA	10,488
tRNA-Val	J	217,699–217,758	60	50.00			1585
nad3	J	231,877–232,233	357	40.62	ATG	TAA	14,118
rps12	J	232,282–232,659	378	43.65	ATG	TGA	48
cob	N	233,681–234,862	1182	40.52	ATG	TGA	1021
orf108	N	236,398–236,505	108	35.19	ATG	TAG	1535
rpl5	N	236,507–237,061	555	42.16	ATG	TAA	1
atp1	N	239,625–241,154	1530	43.86	ATG	TAA	2563
tRNA-Ile	J	241,242–241,317	76	35.53			87
tRNA-Gln	N	242,627–242,698	72	47.22			1309
tRNA-Gly	N	245,665–245,738	74	54.05			2966
atp8	J	252,332–252,811	480	41.04	ATG	TAA	6593
cox3	J	253,922–254,719	798	43.23	ATG	TGA	1110
sdh4	J	254,647–255,078	432	40.74	ATG	TGA	−73
rps4	N	256,045–256,875	831	39.47	ATG	TAA	966
tRNA-Leu	J	256,059–256,123	65	46.15			−817
rpl2	J	258,510–259,376	978	49.49	ATG	TAA	2386
		260,429–260,479
		260,484–260,543
tRNA-Ala	J	261,648–261,712	65	40.00			1104
tRNA-Trp	N	269,778–269,851	74	52.70			8065
nad9	N	274,043–274,615	573	41.19	ATG	TAA	4191
tRNA-His	N	278,687–278,761	75	60.00			4071
tRNA-Glu	J	280,888–280,959	72	50.00			2126
tRNA-Ser	J	283,072–283,159	88	44.32			2112
tRNA-Phe	J	283,411–283,484	74	47.30			251
tRNA-Pro	J	283,740–283,814	75	54.67			255
tRNA-Asp	N	289,908–289,981	74	63.51			6093
nad4	J	292,242–292,703	1488	40.59	ATG	TGA	2260
		294,113–294,628
		297,800–298,219
		300,710–300,799

Open in a new tab

*J stands for the direction of a gene from 5′ to 3′, and N for the direction of a gene from 3′ to 5′.

Characteristics of nucleotide composition

The two genome circles slightly differ in nucleotide composition (SI Table 1). Genome 1 has a high content of the T base but a low content of the G base. The AT content is 54.45% and the four types of bases are 29,521 bp of A (26.88%), 30,287 bp of T (27.57%), 25,616 bp (23.32%), and 24,412 bp of G (22.23%). Genome 2 has a high content of the T base but a low content of the C base. The AT content is 54.94%, and the four bases are 83,584 bp of A (27.39%), 84,075 bp of T (27.55%), 68,286 bp of C (22.38%), and 69,089 bp of G (22.64%). The AT content is slightly higher than the GC content. The relatively high AT content was also reported in other plant species⁴³ or animal species⁴⁴.

Besides the AT or GC content, the AT-and GC-skews are often used to assess the nucleotide-compositional differences in mitochondrial genomes⁴⁵. From SI Table 1, both AT- and GC-skews in genome 1 are negative (AT-skew = −0.0128 and GC-skew = −0.0241), indicating that genome 1 has a higher percentage of T and C than A and G, respectively. Both AT- and GC-skews are negative in PCG sequences (AT-skew = −0.0408 and GC-skew = −0.0501). However, the AT-skew in tRNAs is positive (0.0430), indicating that these genes have a higher percentage of A than T. The GC-skew in tRNAs is negative (−0.1135), indicating that these genes have a higher percentage of G than C.

In genome 2, the AT-skew (−0.0029) is negative but the GC-skew (0.0058) is positive (SI Table 1), indicating that genome 2 has a higher percentage of T and G than A and C, respectively. The GC-skews in both PCGs (−0.0115) and rRNAs (−0.1242) are negative, but positive in tRNAs (0.0449). The AT-skews are negative in PCGs (−0.0569), tRNA (−0.0289) and rRNAs (−0.0864). The extents of both AT- and GC-skews are greater in rRNAs than in PCGs and tRNAs. Generally, the extents of AT-and GC-skews in both genomes 1 and 2 are small, comparable to the pattern in mitochondrial genomes of Pyrus pyrifolia (AT-skew = 0.004, GC-skew = 0)⁴⁶ but different from that of animal species Ledra auditura ⁴⁴ (AT-skew = 0.22 and GC-skew = 0.12).

Protein-coding genes and codon usage

Codon usage bias is an important character of a genome since it is associated with gene expression^47,48, the base composition of genes⁴⁹, amino acid composition⁵⁰, GC content⁵¹, the length of a gene⁵² and tRNA richness^53,54. Large differences in the codon usage of genes often occur among different species and organisms⁵².

The mitochondrial genome of N. cadamba harbors a total of 83 coding genes and 45,639 bp in length, accounting for about 11% of the entire mitochondrial genome. This density is greater than those of watermelon (Citrullus lanatus; 10.3% of 379,236 bp), zucchini (Cucurbita pepo; 3.9% of 982,833 bp)⁵⁵ and neem (Azadirachta indica A. Juss; 7.7% of 266,430 bp)⁵⁶ mitochondrial genomes. The base composition of the whole mtDNA of N. cadamba is 27.26% for A, 22.63% for C, 22.53% for G and 27.56% for T, exhibiting a AT-biased pattern, with the A + T content of 54.82%. The AT-biased pattern is frequently observed in both plant and animal mitochondrial genomes⁵⁷.

The mitochondrial genomic protein-coding genes of N. cadamba are 37,521 bp in length, accounting for 83.03% of all coding genes. The 40 protein-coding genes encode a total of 12,507 codons. Figure 2 shows the frequencies of different amino acids in the protein-coding genes where the amino acid Leu is most frequently used, followed by Ser, Ile and Gly. From the values of relative synonymous codon usage (RSCU), there are 32 optimal codons (RSCU > 1): TAA, GCT, TAT, CAA, CAT, GGA, TTA, TCT, CCT, AGA, CGA, GAA, GAT, ACT, AAT, ATT, GGT, TGT, GTT, CTT, GTA, CGT, TTG, TCA, AAA, TTT, CCA, AGT, ACC, GCA, ATG, and TGG. The remaining 32 codons are non-optimal (RSCU < 1). The most frequently used codons are TTT (Phe), ATT (lle), GAA (Glu) and GCT (Ala). Reasons for the bias synonymous codon usage probably arise from different processes (e.g., distinct levels of gene expression, the base composition of genes, gene length and tRNA richness).

Amino acid frequency and RSCU value of protein-coding genes in mitochondrial genome of *Neolamarckia cadamba*.

According to the RSCU values, codons are classified into optimal codons (RSCU > 1) and non-optimal codon (RSCU < 1). From Fig. 2 and SI Table 3, each amino acid has its preferred codon, with exception of amino acids Met (ATG) and Trp (TGG) that have only one codon and no preference.

A universal genetic code is used for all mitochondrial genes in angiosperms, and the third codon tends to be A or T⁵⁸. A typical translation initiation codon is ATG, but alternative initiation codons occur in translation of rpl16⁵⁹, mttB⁵², and matB genes. The initiation codon of the protein-coding genes in the mitochondrial genome of N. cadamba is ATG, except for rps10 and rpl16 where ACG is the initiation codon.

Transfer RNA and ribosomal RNA genes

There are 5 tRNA genes in genome 1, with a total length of 371 bp (Table 1). The five tRNA genes range from 71 (tRNA-Arg) to 84 bp (tRNA-Tyr) in length, of which four genes are on the N-strand and one gene is on the J-strand. There are 26 tRNA genes in genome 2, with a total length of 1,909 bp (Table 1). These genes range from 60 bp (tRNA-Val) to 88 bp (tRNA-Ser) in length, of which ten genes are on the N-strand and sixteen genes are on the J-strand.

The secondary structure map of tRNA was predicted and generated using tRNAscan-SE 2.0 (http://lowelab.ucsc.edu/tRNAscan-SE/) ⁶⁰ and ARWEN (Version1.2, http://mbio-serv2.mbioekol.lu.se/ARWEN/) ⁶¹. Structurally, tRNA-Ser (GCT), tRNA-Ser (TGA) and tRNA-Tyr (GTA) have a group of stem-loop structure on the extra loop between the TψC loop and the anti-codon stem, but the remaining tRNA genes are the typical clover-type secondary structure (SI Fig. 1).

In the secondary structure of tRNA, besides three classic base matches (A-T, G-C and G-T), there are also mismatches, such as G-A, A-C, T-T, T-C and A-A. The T-C and A-A mismatch pairs are only in the anti-codon stems. Three G-A pairs are in the amino acid acceptor stems, and the other three G-A mismatch pairs are in the TψC stems. Two A-C pairs are in the TψC stems, and the other five A-C pairs are in the amino acid acceptor stems. One T-T pair is in the amino acid acceptor stems, and the other four T-T pairs are in the anti-codon stems.

The mitochondrial genome of N. cadamba has 3 rRNA genes in total (rrn18, rrn5, and rrn26), ranging from 116 bp to 3,429 bp in length, and all rRNA genes are on the N-strand.

Repetitive sequences

SI Table 2 indicates that both genome 1 (~ 0.08%) and genome 2(~ 0.16%) have small proportions of repetitive sequences, with the repeat length of 579 bp in total. Most repetitive sequences (microsatellites) consist of single- and di-nucleotide repeats, with more numbers of (A)_n and (T)_n (14) than (G)_n and (C)_n (2), and more numbers of (AT)_n and (TA)_n (8) than others (2 (GA)_n). Three minisatellites are present in genome 2. All these repeats are not located in protein-coding regions except (T)₁₀ in orf309 of genome 1 and (A)₁₀ in Atp1 of genome 2. The small proportion of repeats implies that repetitive sequences do not play an important role in contributing to mitochondrial genome size of N. cadamba, different from the patterns of Nymphaea colorata ⁴² and other plants ⁴⁰. However, these repetitive sequences could be used to develop molecular markers for population genetic structure analysis in the future.

Phylogenetic analyses

To assess the taxonomic position of N. cadamba, we analyzed the phylogenetic trees species based on complete mitochondrial genomes. Twenty-three species of the asterids-lamiids classification with complete mitochondrial genomes were selected, and Helianthus annuus of the non-lamiids classification of the asterids was selected as outgroup. This selection of 22 species of Astragalus was based on tandem sequences of 24 protein-coding genes. The 24 protein-coding genes were 3 adenosine triphosphate synthase genes (atp1, atp6, atp9), 3 cytochrome c oxidase genes (cox1, cox2, cox3), and 1 cytochrome b protein gene (cytB), 9 nicotinamide adenine dinucleotide (NADH) dehydrogenase protein genes (nad1, nad2, nad3, nad4, nad4L, nad5, nad6, nad7, nad9), 4 ribosomal proteins genes (rps12, rps13, rps3, rps4) and 4 other genes (ccmb, ccmc, ccmfc, ccmfn).

JModelTest2.1.7 was used to test the nucleic acid model of the selected sequence DNA⁶², and the best model was GTR + I + G. Maximum likelihood phylogenetic tree was constructed with RAxML8.1.5 software⁶³. The clade with N. cadamba in Gentianales has two families (Fig. 3): Rubiaceae and Apocynacceae. Rhazya stricta, Asclepias syriaca, Cynanchum auriculatum and C. wilfordii in the neighbor branches belong to Apocynacceae, and have closer genetic relationships. N. cadamba as the species in family Rubiaceae was earlier differentiated from Apocynacceae. This phylogenetic relationship among the 22 species is consistent with taxonomic groups based on morphological studies.

Maximum likelihood tree based on the sequences of 24 PCGs from the mitochondrial genomes of 23 species. The values on branch nodes represent the supporting rates (percentages) derived from 1000 bootstrapping analyses.

The rps3 gene sequence of 60 species of Rubiaceae was available from NCBI GenBank. Phylogenetic genetic relationships based on this single gene was constructed using the maximum likelihood method. SI Fig. 2 shows that N. cadamba is genetically close to Cephalanthus occidentalis and Hymenodictyon parvifolium. These three species together with Cubanola domingensis, Hillia triflora and Rondeletia odorata provide evidence that they belong to the Cinchonoideae subfamily although Deppea grandiflora (Ixoroideae subfamily) and Guettarda scabra (Rubioideae subfamily) were incompletely sorted in this clade. Using cpDNA segments (rbcL, rsp16 intron, nadhF, atpB–rbcL spacer) and nuclear ribosomal ITS, Rydin et al.⁶⁴ showed that five species (C. occidentalis, H. parvifolium, C. domingensis, H. triflora and R. odorata) belong to Cinchonoideae subfamily. The whole phylogenetic relationships indicate that large genetic divergence and incomplete linage sorting occurred among the three subfamilies of Rubiaceae in terms of the rps3 gene sequence.

Conclusions

In this study, we sequenced the mitochondrial genome of N. cadamba and successfully assembled the genome in two maps of circular molecule structure. Genome 1 has 109,836 bp and contains 14 genes. Genome 2 has 305,144 bp and contains 69 genes. The whole genome has slightly high AT content (54.82%). Genome 1 shows negative AT- and GC-skews, while genome 2 shows a negative AT-skew but a positive GC-skew. All protein-coding genes are initiated by the start codon ATG, except for a few genes initiated by alternative codons. The termination codes are TAA for most genes but TGA or TAG for a few genes. Each amino acid has its preferred codon except amino acids Met (ATG) and Trp (TGG) that have only one codon and no preference. The tRNA genes exhibit a typical clover-type secondary structure except tRNA-Ser (GCT), tRNA-Ser (TGA) and tRNA-Tyr (GTA) that have an extra loop between the TψC loop and the anti-codon stem. Tandem repeat sequences are minor, accounting for ~ 0.14% of the whole genome. Phylogenetic analysis with the DNA sequences of 24 PCGs confirms that N. cadamba belongs to order Gentianales. Analysis with a single gene rps3 of 60 species shows that N. cadamba is genetically closer to Cephalanthus accidentalis and Hymenodictyon parvifolium and belongs to the Cinchonoideae subfamily.

Methods

Sample collection and DNA extraction

The leaf sample used in this study was collected from a wild tree (Specimen ID: SCAUNC20190110) on January 10th, 2019. This tree grows on University Campus (23°16′N 113°35′E), South China Agricultural University (SCAU), Guangzhou, Guangdong Province, China. Figure 4 shows the sample tree growing in the fertile and humid soil. XW and XSH identified the voucher specimen and collected leaf samples. The specimen was stored for records in Guangdong Key Laboratory for Innovative Development and Utilization of Forest Plant Germplasm, SCAU, Guangdong Province, China. The use of plant leaves in this study complies with institutional guidelines. Collection of the plant specimen was permitted by the University. Total genomic DNA was extracted from fresh leaves using CTAB method⁶⁵. Then the quality of the extracted DNA samples was tested using (1) 0.8% agarose electrophoresis to detect DNA samples for degradation and impurities, and to estimate the DNA concentration; (2) Nanodrop spectrophotometer to detect the concentration and purity of samples; and (3) Qubit 2.0 Flurometer (Life Technologies, USA) to detect the concentration of samples.

The tree of *Neolamarckia cadamba* from which young leaves were sampled for mtDNA sequencing. The tree grows on campus of South China Agricultural University (23°16′N 113°35′E), Guangzhou, China. It is about 14.5 m in height and 49.04 cm in diameter at the breast height in eleven years.

Library construction and high-throughput sequencing

High-quality genomic DNA of 50 μg was used to generate a 40-kb SMRTbell library, with the size selection on the BluePippin (Sage Science, USA). The genomic DNA library was sequenced on the PacBio sequel platform (Pacific Biosciences, USA). SMRTbell DNA library preparation and sequencing were performed in accordance with the manufacturer’s protocols (Pacific Biosciences, USA), and totally 2 Gb subreads were generated. In order to check the correction of PacBio assembly, an insert size of 500 bp pair-end genomic DNA library for Illumina Hiseq 4000 (Illumina, USA), was constructed by Science Corporation of Gene according to the standard protocol of Illumina. DNA library was constructed after quality control with Agilent 2100 Bioanalyzer (Agilent Technologies, USA). Four gigabytes DNA data were sequenced by Illumina Hiseq 4000 (Illumina, USA).

Different sequencing methods were used in this study because lengths of PacBio sequencing reads were up to 40 kb, which was more suitable for complex genome assembly. However, the PacBio long reads potentially had much more sequencing errors, and the Illumina short reads were then used to fix the errors.

Genome assembly and annotations

The mitochondrial genome sequence was assembled using Canu (version 2.1, https://github.com/marbl/canu) ⁶⁶ with default parameters on PacBio CLR subreads, and mitochondrial genome sequences were identified with blastn (version 2.10.1 + , https://blast.ncbi.nlm.nih.gov/Blast.cgi) and NCBI nucleotide sequence database. To make improvements of assembly genome with Pilon (version 1.24, https://github.com/broadinstitute/pilon) ⁶⁷, the final PacBio CLR subreads and Illumina clean reads were remapped to mitochondrial genome with bwa (version 0.7.17-r1188, http://bio-bwa.sourceforge.net/) ⁶⁸ and IGV (version 2.9.4, https://igv.org/) ⁶⁹ to confirm. Genome was annotated using DOGMA (http://dogma.ccbb.utexas.edu/) ⁷⁰ and ORF Finder (https://www.ncbi.nlm.nih.gov/orffinder/). For the preliminary results of the annotations, the methods of Blastn and Blastp were used to compare the encoded proteins and rRNA of the reported mitochondrial genome of related species, verify the accuracy of the results and modify them. TRNA was annotated by tRNAscan-SE 2.0 (http://lowelab.ucsc.edu/tRNAscan-SE/) ⁷¹ and ARWEN (Version1.2, http://mbio-serv2.mbioekol.lu.se/ARWEN/) ⁶¹, leaving out the tRNA with unreasonable length and incomplete structure, and generating the tRNA secondary structure diagram. Microsatellite identification tool (MISA v2.1) ⁷² and tandem repeat finder (TRF) ⁷³ were used to search for repetitive sequences.

Comparative analysis of mitochondrial genomes

The use of mitochondrial codons had a preference, which would affect the expression of genes and reflect the evolutionary relationship of species to a certain extent. The calculation of relative synonymous codon usage was analyzed with a reference to the formula mentioned in Sharp and Li⁷⁴. The relative synonymous codon usage (RSCU) was calculated as the ratio of the frequency of a focal codon to the mean frequency of all synonymous codons in a given protein-coding sequence. The usage bias of one synonymous codon is indicated when RSCU is not equal to 1; no usage bias is present when RSCU is equal to 1.

In most bacterial genomes, mitochondrial and plastid genomes, there are significant differences in base composition between heavy and light chains, which are called AT-skew and GC-skew. Calculations of the AT- and GC-skews are as follows⁷⁵:

AT-skew = \frac{A % - T %}{A % + T %},

GC-skew = \frac{G % - C %}{G % + C %}

where A%, T%, G% and C% represent the percentages of A, T, G and C in a given sequence, respectively.

Phylogenetic analyses

MUSCLE v.3.8.31 (http://www.drive5.com/muscle/) software⁷⁶ was used to compare individual genes among multiple species, and then the genes of each species were aligned in a certain order. The protein-encoding gene sequence set of each species was generated by catenating 24 PCG sequences in the same gene order for further analysis. jModelTest2.1.7 (https://code.google.com/p/jmodeltest2/) was used to test the nucleic acid model of the selected sequence DNA⁶², and the best model has the minimum AIC (Akaike Information Criterion) value. Phylogeny tree was constructed with RAxML8.1.5 software (https://sco.h-its.org/exelixis/web/software/raxml/index.html)⁶³ using the maximum likelihood (ML) method for both the catenated sequences of 23 species and the rps3 gene sequences of 60 species. The bootstrap value was set to be 1000 for each phylogenetic tree analysis.

Supplementary Information

Supplementary Information.^{(847.5KB, pdf)}

Acknowledgements

We appreciate associate editor and two anonymous reviewers for helpful comments that substantially improved this article. This work is financially supported by the Central Finance Forestry Reform and Development Fund (2018-GDTK-08) and the funding from South China Agricultural University (4400-K16013).

Author contributions

X.S.H. and X.W. conceived and designed the study; X.W. conducted the experiment and drafted the manuscript; L.L.L. and Y.X. participated in experiment, X.Y.C. provided experimental supports, J.H.C. participated in DNA sequencing; X.S.H. revised and finalized the manuscript. All authors read and approved the final manuscript.

Data availability

mtDNA sequences of Neolamarckia cadamba in NCBI GenBank: Genome 1:https://www.ncbi.nlm.nih.gov/nuccore/MT320890.1. Genome 2: https://www.ncbi.nlm.nih.gov/nuccore/MT364442.1.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-021-01040-9.

References

1.Lo, H.S., Ko, W.C., Chen, W.C., Hsue, H.H. & Wu, H. Flora Reipublicae Popularis Sinicae: Tomus 71(1): Angiospermae Dictotyledoneae, Rubiaceae (1), 260–261. (Science Press, Beijing, 1999) (in Chinese).
2.Mojiol, AR, Lintangah, W, Maid, M. & Julius, K, Neolamarckia cadamba (Roxb.) Bosser, 1984, 1–12 (2014).
3.Ho, W.S., et al. Applications of genomics to plantation forestry with kelampayan in Sarawak in Sustaining Tropical Natural Resources Through Innovations, Technologies and Practices (eds. Wasli, M.E., et al.). 4th Regional Conference on Natural Resources in the Tropics, 103–111. (Universiti Malaysia Sarawak, 2012).
4.Pandey A, Chauhan AS, Haware DJ, Negi PS. Proximate and mineral composition of Kadamba (Neolamarckia cadamba) fruit and its use in the development of nutraceutical enriched beverage. J. Food Sci. Technol. 2018;55(10):4330–4336. doi: 10.1007/s13197-018-3382-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.He L, Zhou W, Wang Y, Wang C, Chen X, Zhang Q. Effect of applying lactic acid bacteria and cellulase on the fermentation quality, nutritive value, tannins profile and in vitro digestibility of Neolamarckia cadamba leaves silage. J. Anim. Physiol. Anim. Nutr. 2018;102:1429–1436. doi: 10.1111/jpn.12965. [DOI] [PubMed] [Google Scholar]
6.Pandey A, Negi PS. Traditional uses, phytochemistry and pharmacological properties of Neolamarckia cadamba: A review. J. Ethnopharmacol. 2016;181:118–135. doi: 10.1016/j.jep.2016.01.036. [DOI] [PubMed] [Google Scholar]
7.Santiarworn D, Liawruangrath S, Baramee A, Takayama H, Liawruangrath B. Bioactivity screening of crude alkaloidal extracts from some Rubiaceae. Chiang Mai Univ. J. 2005;4:59–64. [Google Scholar]
8.Dwevedi A, Sharma K, Sharma YK. Cadamba: A miraculous tree having enormous pharmacological implications. Pharmacogn. Rev. 2014;9(18):107. doi: 10.4103/0973-7847.162110. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Chandel M, et al. Isolation and characterization of flavanols from Anthocephalus cadamba and evaluation of their antioxidant, antigenotoxic, cytotoxic and COX-2 inhibitory activities. Rev. Bras. Farmacogn. 2016;26:474–483. doi: 10.1016/j.bjp.2016.02.007. [DOI] [Google Scholar]
10.Dubey A, Nayak S, Goupale D. A review on phytochemical, pharmacological and toxicological studies on Neolamarckia cadamba. Pharm. Lett. 2011;3:45–54. [Google Scholar]
11.Mishra DP, et al. Monoterpene indole alkaloids from Anthocephalus cadamba fruits exhibiting anticancer activity in human lung cancer cell line H1299. Chem. Sel. 2018;3:8468–8472. doi: 10.1002/slct.201801475. [DOI] [Google Scholar]
12.Umachigi SP, et al. Antimicrobial, wound healing and antioxidant activities of Anthocephalus cadamba. Afr. J. Tradit. Complement. Alternat. Med. 2007;4(4):481–487. doi: 10.4314/ajtcam.v4i4.31241. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Patel Divyakant A, Dirji V, Bariya A, Patel K, Sonpal R. Evaluation of antifungal activity of Neolamarckia cadamba (roxb.) bosser leaf and bark extract. Int. Res. J. Pharm. 2011;2:192–193. [Google Scholar]
14.Acharyya S, Rathore D, Kumar H, Panda N. Screening of Anthocephalus cadamba (Roxb.) Miq. root for antimicrobial and anthelmintic activities. Int. J. Res. Pharm. Biomed. Sci. 2011;2(1):297–300. [Google Scholar]
15.Dogra SC. Antimicrobial agents used in ancient India. Indian J. Hist. Sci. 1987;22(2):164–169. [PubMed] [Google Scholar]
16.Khare CP, Khare CP. Indian Herbal Remedies: Rational Western Therapy, Ayurvedic and Other Traditional Usage, Botany. Springer; 2004. [Google Scholar]
17.Que QM, et al. Genetic variation of young forest growth traits of Neolamarckia cadamba. Subtrop. Plant Sci. 2017;46:248–253. [Google Scholar]
18.Parthiban KT, Thirunirai-Selvan R, Palanikumaran B, Krishnakumar N. Variability and genetic diversity studies on Neolamarckia cadamba genetic resources. J. Trop. Res. Sci. 2019;31:90–98. doi: 10.26525/jtfs2019.31.1.090098. [DOI] [Google Scholar]
19.Li JJ, Zhang D, Ouyang KX, Chen XY. High frequency plant regeneration from leaf culture of Neolamarckia cadamba. Plant Biotechnol. 2019;36:13–19. doi: 10.5511/plantbiotechnology.18.1119a. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Mok PK, Ho WS. Rapid in vitro propagation and efficient acclimatisation protocols of Neolamarckia cadamba. Asian J. Plant Sci. 2019;18:153–163. doi: 10.3923/ajps.2019.153.163. [DOI] [Google Scholar]
21.Ouyang K, et al. Transcriptomic analysis of multipurpose timber yielding tree Neolamarckia cadamba during xylogenesis using RNA-seq. PLoS ONE. 2016;11:e159407. doi: 10.1371/journal.pone.0159407. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Huang T, et al. Selection and validation of reference genes for mRNA expression by quantitative real-time PCR analysis in Neolamarckia cadamba. Sci. Rep. 2018;8:9311. doi: 10.1038/s41598-018-27633-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Tchin BL, Ho WS, Pang SL, Ismail J. Association genetics of the cinnamyl alcohol dehydrogenase (CAD) and cinnamate 4-hydroxylase (C4H) genes with basic wood density in Neolamarckia cadamba. Biotechnology. 2012;11:307–317. doi: 10.3923/biotech.2012.307.317. [DOI] [Google Scholar]
24.Tiong SY, Ho WS, Pang SL, Ismail J. Nucleotide diversity and association genetics of xyloglucan endotransglycosylase/hydrolase (XTH) and cellulose synthase (CesA) genes in Neolamarckia cadamba. J. Biol. Sci. 2014;14:267–275. doi: 10.3923/jbs.2014.267.275. [DOI] [Google Scholar]
25.Ho W-S, Pang S-L, Abdullah J. Identification and analysis of expressed sequence tags present in xylem tissues of kelampayan (Neolamarckia cadamba (Roxb.) Bosser) Physiol. Mol. Biol. Plants. 2014;20:393–397. doi: 10.1007/s12298-014-0230-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Pang SL, Ho WS, Mat-Isa MN, Abdullah J. Gene discovery in the developing xylem tissue of a tropical timber tree species: Neolamarckia cadamba (Roxb.) Bosser (kelampayan) Tree Genet Genomes. 2015;11:47. doi: 10.1007/s11295-015-0873-y. [DOI] [Google Scholar]
27.Ying TS, Fu CS, Seng HW, Ling PS. Genetic diversity of Neolamarckia cadamba using dominant DNA markers based on inter-simple sequence repeats (ISSRs) in Sarawak. Adv. Appl. Sci. Res. 2014;5:458–463. [Google Scholar]
28.Morley SA, Nielsen BL. Plant mitochondrial DNA. Front. Biosci. 2017;22:1023–1032. doi: 10.2741/4531. [DOI] [PubMed] [Google Scholar]
29.Wright S. Evolution and the Genetics of Populations. The Theory of Gene Frequencies. Chicago: University of Chicago Press; 1969. [Google Scholar]
30.Wolfe KH, Li WH, Sharp PM. Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proc. Natl. Acad. Sci. USA. 1987;84:9054–9058. doi: 10.2307/30764. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Wang X, et al. Assessing the ecological and evolutionary processes underlying cytonuclear interactions. Sci. Sin. Vitae. 2019;49:951–964. doi: 10.1360/SSV-2019-0049. [DOI] [Google Scholar]
32.Bremer B, Manen JF. Phylogeny and classification of the subfamily Rubioideae (Rubiaceae) Plant Syst. Evol. 2000;225:43–72. doi: 10.1007/BF00985458. [DOI] [Google Scholar]
33.Andreasen K, Bremer B. Combined phylogenetic analysis in the Rubiaceae–Ixoroideae: morphology, nuclear and chloroplast DNA data. Am. J. Bot. 2000;87:1731–1748. doi: 10.2307/2656750. [DOI] [PubMed] [Google Scholar]
34.Andersson L, Antonelli A. Phylogeny of the tribe Cinchoneae (Rubiaceae), its position in Cinchonoideae, and description of a new genus, Ciliosemina. Taxon. 2005;54:17–28. doi: 10.2307/25065299. [DOI] [Google Scholar]
35.Mathew PM, Philip O. The distribution and systematic significance of pollen nuclear number in the Rubiaceae. Cytologia. 1986;51:117–124. doi: 10.1508/cytologia.51.117. [DOI] [Google Scholar]
36.Lee YS. Remarks on chromosome number in Rubiaceae. Korean J. Plant Taxon. 1979;9(1):57–66. doi: 10.11110/kjpt.1979.9.1.057. [DOI] [Google Scholar]
37.Eng WH, Ho WS, Ling KH. Cytogenetic, chromosome count optimization and automation of Neolamarckia cadamba (Rubiaceae) root tips derived from in vitro mutagenesis. Notulae Sci. Biol. 2021;13(3):10995. doi: 10.15835/nsb13310995. [DOI] [Google Scholar]
38.Palmer JD, Herbon LA. Plant mitochondrial-DNA evolves rapidly in structure, but slowly in sequence. J. Mol. Evol. 1988;28:87–97. doi: 10.1007/BF02143500. [DOI] [PubMed] [Google Scholar]
39.Morley SA, Ahmad N, Nielsen BL. Plant organelle genome replication. Plants. 2019;8:358. doi: 10.3390/plants8100358. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Mower JP, Sloan DB, Alverson AJ, et al. Plant mitochondrial genome diversity: the genomics revolution. In: Wendel JF, et al., editors. Plant Genome Diversity. Wien: Springer; 2012. pp. 123–144. [Google Scholar]
41.Guo W, Zhu A, Fan W, Mower JP. Complete mitochondrial genomes from the ferns Ophioglossum californicum and Psilotum nudum are highly repetitive with the largest organellar introns. New Phytol. 2017;213(1):391–403. doi: 10.1111/nph.14135. [DOI] [PubMed] [Google Scholar]
42.Dong S, et al. The complete mitochondrial genome of the early flowering plant Nymphaea colorata is highly repetitive with low recombination. BMC Genomics. 2018;19:614. doi: 10.1186/s12864-018-4991-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Shi Y, et al. Assembly and comparative analysis of the complete mitochondrial genome sequence of Sophora japonica ‘JinhuaiJ2’. PLoS ONE. 2018;13(8):0202. doi: 10.1371/journal.pone.0202485. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Wang JJ, Li DF, Li H, Yang MF, Dai RH. Structural and phylogenetic implications of the complete mitochondrial genome of Ledra auditura. Sci. Rep. 2019;9:15746. doi: 10.1038/s41598-019-52337-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Alexandre H, Nelly L, Jean D. Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of Metazoa, and consequences for phylogenetic inferences. Syst. Biol. 2005;54:277–298. doi: 10.1080/10635150590947843. [DOI] [PubMed] [Google Scholar]
46.Chung H, Won S, Kang S, Sohn S, Kim JS. The complete mitochondrial genome of Wonwhang (Pyrus pyrifolia) Mitochondrial DNA Part B Resources. 2017;2:902–903. doi: 10.1080/23802359.2017.1413300. [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Gupta SK, Ghosh TC. Gene expressivity is the main factor indicating the codon usage variation among the genes in Pseudomonas aeruginosa. Gene. 2001;273:63–70. doi: 10.1016/S0378-1119(01)00576-5. [DOI] [PubMed] [Google Scholar]
48.Carlini DB, Ying C, Stephan W. The relationship between third-codon position nucleotide content, codon bias, mRNA secondary structure and gene expression in the drosophilid alcohol dehydrogenase genes Adh and Adhr. Genetics. 2001;159:623–633. doi: 10.1103/PhysRevA.86.013626. [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Alexei F, Serge S, Walter G. Regularities of context-dependent codon bias in eukaryotic genes. Nucleic Acids Res. 2002;30:1192–1197. doi: 10.1093/nar/30.5.1192. [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Onofrio GD, Mouchiroud D, Aïssani B, Gautier C, Bernardi G. Correlations between the compositional properties of human genes, codon usage, and amino acid composition of proteins. J. Mol. Evol. 1991;32:504–510. doi: 10.1007/BF02102652. [DOI] [PubMed] [Google Scholar]
51.Knight RD, Freeland SJ, Landweber LF. A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes. Genome Biol. 2001;2:2001. doi: 10.1186/gb-2001-2-4-research0010. [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Marias G, Duret L. Synonymous codon usage, accuracy of translation and gene length in Caenorhabditis elegans. J. Mol. Evol. 2001;52:275–280. doi: 10.1007/s002390010155. [DOI] [PubMed] [Google Scholar]
53.Moriyama EN, Powell JR. Gene length and codon usage bias in Drosophila melanogaster, Saccharomyces cerevisiae and Escherichia coli. Nucleic Acids Res. 1998;26:3188–3193. doi: 10.1093/nar/26.13.3188. [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Moriyama EN, Powell JR. Codon usage bias and tRNA abundance in Drosophila. J. Mol. Evol. 1997;45:514–523. doi: 10.1007/PL00006256. [DOI] [PubMed] [Google Scholar]
55.Alverson JA, et al. Insights into the evolution of mitochondrial genome size from complete sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae) Mol. Biol. Evol. 2010;27:1436–1448. doi: 10.1093/molbev/msq029. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Kuravadi NA, Russiachand H, Loganathan RM, Lingu CS, Siddappa S, Ramamurthy A, Sathyanarayana BN, Gowda M. Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree. Peer J. 2015;3:341–345. doi: 10.7717/peerj.1066. [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Bi QX, et al. Complete mitochondrial genome of Quercus variabilis (Fagales, Fagaceae) Mitochond. DNA Part B. 2019;4:3927–3928. doi: 10.1080/23802359.2019.1687027. [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Sugiyama Y, et al. The complete nucleotide sequence and multipartite organization of the tobacco mitochondrial genome: comparative analysis of mitochondrial genomes in higher plants. Mol Genet Genomics. 2005;272:603–615. doi: 10.1007/s00438-004-1075-8. [DOI] [PubMed] [Google Scholar]
59.Rowen L, Mahairas G, Hood L. Sequencing the human genomes. Science. 1997;278:605–607. doi: 10.2165/00128413-199208300-00010. [DOI] [PubMed] [Google Scholar]
60.Lowe TM, Chan PP. tRNAscan-SE on-line: search and contextual analysis of transfer RNA genes. Nucleic Acids Res. 2016;44:54–57. doi: 10.1093/nar/gkw413. [DOI] [PMC free article] [PubMed] [Google Scholar]
61.Laslett D, Canbäck B. ARWEN, a program to detect tRNA genes in metazoan mitochondrial nucleotide sequences. Bioinformatics. 2008;24:172–175. doi: 10.1093/bioinformatics/btm573. [DOI] [PubMed] [Google Scholar]
62.Posada D. jModelTest: phylogenetic model averaging. Mol. Biol. Evol. 2008;25:1253–1256. doi: 10.1093/molbev/msn083. [DOI] [PubMed] [Google Scholar]
63.Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–2690. doi: 10.1093/bioinformatics/btl446. [DOI] [PubMed] [Google Scholar]
64.Rydin C, Kainulainen K, Razafimandimbison SG, Smedmark JEE, Bremer B. Deep divergences in the coffee family and the systematic position of Acranthera. Plant Syst. Evol. 2009;278:101–123. doi: 10.1007/s00606-008-0138-4. [DOI] [Google Scholar]
65.Doyle, J. J. & Doyle, J. L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull.19, 11–15. http://irc.igd.cornell.edu/Protocols/DoyleProtocol.pdf (1987).
66.Koren S, Walenz BP, Berlin K, Miller JR, Phillippy AM. Canu: Scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
67.Walker BJ, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
68.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
69.Robinson JT, et al. Integrative genomics viewer. Nat. Biotechnol. 2011;29:24–26. doi: 10.1038/nbt.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]
70.Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004;20:3252–3255. doi: 10.1093/bioinformatics/bth352. [DOI] [PubMed] [Google Scholar]
71.Lowe TM, Eddy SR. tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
72.Beier S, Thiel T, Münch T, Scholz U, Mascher M. MISA-web: A web server for microsatellite prediction. Bioinformatics. 2017;33:2583–2585. doi: 10.1093/bioinformatics/btx198. [DOI] [PMC free article] [PubMed] [Google Scholar]
73.Benson G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999;27:573–580. doi: 10.1093/nar/27.2.573. [DOI] [PMC free article] [PubMed] [Google Scholar]
74.Sharp PM, Li WH. The codon Adaptation Index—a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987;15:1281–1295. doi: 10.1093/nar/15.3.1281. [DOI] [PMC free article] [PubMed] [Google Scholar]
75.Hassanin A, Léger N, Deutsch J. Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of Metazoa, and consequences for phylogenetic inferences. Syst. Biol. 2005;54:277–298. doi: 10.1080/10635150590947843. [DOI] [PubMed] [Google Scholar]
76.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information.^{(847.5KB, pdf)}

Data Availability Statement

mtDNA sequences of Neolamarckia cadamba in NCBI GenBank: Genome 1:https://www.ncbi.nlm.nih.gov/nuccore/MT320890.1. Genome 2: https://www.ncbi.nlm.nih.gov/nuccore/MT364442.1.

[CR1] 1.Lo, H.S., Ko, W.C., Chen, W.C., Hsue, H.H. & Wu, H. Flora Reipublicae Popularis Sinicae: Tomus 71(1): Angiospermae Dictotyledoneae, Rubiaceae (1), 260–261. (Science Press, Beijing, 1999) (in Chinese).

[CR2] 2.Mojiol, AR, Lintangah, W, Maid, M. & Julius, K, Neolamarckia cadamba (Roxb.) Bosser, 1984, 1–12 (2014).

[CR3] 3.Ho, W.S., et al. Applications of genomics to plantation forestry with kelampayan in Sarawak in Sustaining Tropical Natural Resources Through Innovations, Technologies and Practices (eds. Wasli, M.E., et al.). 4th Regional Conference on Natural Resources in the Tropics, 103–111. (Universiti Malaysia Sarawak, 2012).

[CR4] 4.Pandey A, Chauhan AS, Haware DJ, Negi PS. Proximate and mineral composition of Kadamba (Neolamarckia cadamba) fruit and its use in the development of nutraceutical enriched beverage. J. Food Sci. Technol. 2018;55(10):4330–4336. doi: 10.1007/s13197-018-3382-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.He L, Zhou W, Wang Y, Wang C, Chen X, Zhang Q. Effect of applying lactic acid bacteria and cellulase on the fermentation quality, nutritive value, tannins profile and in vitro digestibility of Neolamarckia cadamba leaves silage. J. Anim. Physiol. Anim. Nutr. 2018;102:1429–1436. doi: 10.1111/jpn.12965. [DOI] [PubMed] [Google Scholar]

[CR6] 6.Pandey A, Negi PS. Traditional uses, phytochemistry and pharmacological properties of Neolamarckia cadamba: A review. J. Ethnopharmacol. 2016;181:118–135. doi: 10.1016/j.jep.2016.01.036. [DOI] [PubMed] [Google Scholar]

[CR7] 7.Santiarworn D, Liawruangrath S, Baramee A, Takayama H, Liawruangrath B. Bioactivity screening of crude alkaloidal extracts from some Rubiaceae. Chiang Mai Univ. J. 2005;4:59–64. [Google Scholar]

[CR8] 8.Dwevedi A, Sharma K, Sharma YK. Cadamba: A miraculous tree having enormous pharmacological implications. Pharmacogn. Rev. 2014;9(18):107. doi: 10.4103/0973-7847.162110. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Chandel M, et al. Isolation and characterization of flavanols from Anthocephalus cadamba and evaluation of their antioxidant, antigenotoxic, cytotoxic and COX-2 inhibitory activities. Rev. Bras. Farmacogn. 2016;26:474–483. doi: 10.1016/j.bjp.2016.02.007. [DOI] [Google Scholar]

[CR10] 10.Dubey A, Nayak S, Goupale D. A review on phytochemical, pharmacological and toxicological studies on Neolamarckia cadamba. Pharm. Lett. 2011;3:45–54. [Google Scholar]

[CR11] 11.Mishra DP, et al. Monoterpene indole alkaloids from Anthocephalus cadamba fruits exhibiting anticancer activity in human lung cancer cell line H1299. Chem. Sel. 2018;3:8468–8472. doi: 10.1002/slct.201801475. [DOI] [Google Scholar]

[CR12] 12.Umachigi SP, et al. Antimicrobial, wound healing and antioxidant activities of Anthocephalus cadamba. Afr. J. Tradit. Complement. Alternat. Med. 2007;4(4):481–487. doi: 10.4314/ajtcam.v4i4.31241. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Patel Divyakant A, Dirji V, Bariya A, Patel K, Sonpal R. Evaluation of antifungal activity of Neolamarckia cadamba (roxb.) bosser leaf and bark extract. Int. Res. J. Pharm. 2011;2:192–193. [Google Scholar]

[CR14] 14.Acharyya S, Rathore D, Kumar H, Panda N. Screening of Anthocephalus cadamba (Roxb.) Miq. root for antimicrobial and anthelmintic activities. Int. J. Res. Pharm. Biomed. Sci. 2011;2(1):297–300. [Google Scholar]

[CR15] 15.Dogra SC. Antimicrobial agents used in ancient India. Indian J. Hist. Sci. 1987;22(2):164–169. [PubMed] [Google Scholar]

[CR16] 16.Khare CP, Khare CP. Indian Herbal Remedies: Rational Western Therapy, Ayurvedic and Other Traditional Usage, Botany. Springer; 2004. [Google Scholar]

[CR17] 17.Que QM, et al. Genetic variation of young forest growth traits of Neolamarckia cadamba. Subtrop. Plant Sci. 2017;46:248–253. [Google Scholar]

[CR18] 18.Parthiban KT, Thirunirai-Selvan R, Palanikumaran B, Krishnakumar N. Variability and genetic diversity studies on Neolamarckia cadamba genetic resources. J. Trop. Res. Sci. 2019;31:90–98. doi: 10.26525/jtfs2019.31.1.090098. [DOI] [Google Scholar]

[CR19] 19.Li JJ, Zhang D, Ouyang KX, Chen XY. High frequency plant regeneration from leaf culture of Neolamarckia cadamba. Plant Biotechnol. 2019;36:13–19. doi: 10.5511/plantbiotechnology.18.1119a. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Mok PK, Ho WS. Rapid in vitro propagation and efficient acclimatisation protocols of Neolamarckia cadamba. Asian J. Plant Sci. 2019;18:153–163. doi: 10.3923/ajps.2019.153.163. [DOI] [Google Scholar]

[CR21] 21.Ouyang K, et al. Transcriptomic analysis of multipurpose timber yielding tree Neolamarckia cadamba during xylogenesis using RNA-seq. PLoS ONE. 2016;11:e159407. doi: 10.1371/journal.pone.0159407. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Huang T, et al. Selection and validation of reference genes for mRNA expression by quantitative real-time PCR analysis in Neolamarckia cadamba. Sci. Rep. 2018;8:9311. doi: 10.1038/s41598-018-27633-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Tchin BL, Ho WS, Pang SL, Ismail J. Association genetics of the cinnamyl alcohol dehydrogenase (CAD) and cinnamate 4-hydroxylase (C4H) genes with basic wood density in Neolamarckia cadamba. Biotechnology. 2012;11:307–317. doi: 10.3923/biotech.2012.307.317. [DOI] [Google Scholar]

[CR24] 24.Tiong SY, Ho WS, Pang SL, Ismail J. Nucleotide diversity and association genetics of xyloglucan endotransglycosylase/hydrolase (XTH) and cellulose synthase (CesA) genes in Neolamarckia cadamba. J. Biol. Sci. 2014;14:267–275. doi: 10.3923/jbs.2014.267.275. [DOI] [Google Scholar]

[CR25] 25.Ho W-S, Pang S-L, Abdullah J. Identification and analysis of expressed sequence tags present in xylem tissues of kelampayan (Neolamarckia cadamba (Roxb.) Bosser) Physiol. Mol. Biol. Plants. 2014;20:393–397. doi: 10.1007/s12298-014-0230-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Pang SL, Ho WS, Mat-Isa MN, Abdullah J. Gene discovery in the developing xylem tissue of a tropical timber tree species: Neolamarckia cadamba (Roxb.) Bosser (kelampayan) Tree Genet Genomes. 2015;11:47. doi: 10.1007/s11295-015-0873-y. [DOI] [Google Scholar]

[CR27] 27.Ying TS, Fu CS, Seng HW, Ling PS. Genetic diversity of Neolamarckia cadamba using dominant DNA markers based on inter-simple sequence repeats (ISSRs) in Sarawak. Adv. Appl. Sci. Res. 2014;5:458–463. [Google Scholar]

[CR28] 28.Morley SA, Nielsen BL. Plant mitochondrial DNA. Front. Biosci. 2017;22:1023–1032. doi: 10.2741/4531. [DOI] [PubMed] [Google Scholar]

[CR29] 29.Wright S. Evolution and the Genetics of Populations. The Theory of Gene Frequencies. Chicago: University of Chicago Press; 1969. [Google Scholar]

[CR30] 30.Wolfe KH, Li WH, Sharp PM. Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proc. Natl. Acad. Sci. USA. 1987;84:9054–9058. doi: 10.2307/30764. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR31] 31.Wang X, et al. Assessing the ecological and evolutionary processes underlying cytonuclear interactions. Sci. Sin. Vitae. 2019;49:951–964. doi: 10.1360/SSV-2019-0049. [DOI] [Google Scholar]

[CR32] 32.Bremer B, Manen JF. Phylogeny and classification of the subfamily Rubioideae (Rubiaceae) Plant Syst. Evol. 2000;225:43–72. doi: 10.1007/BF00985458. [DOI] [Google Scholar]

[CR33] 33.Andreasen K, Bremer B. Combined phylogenetic analysis in the Rubiaceae–Ixoroideae: morphology, nuclear and chloroplast DNA data. Am. J. Bot. 2000;87:1731–1748. doi: 10.2307/2656750. [DOI] [PubMed] [Google Scholar]

[CR34] 34.Andersson L, Antonelli A. Phylogeny of the tribe Cinchoneae (Rubiaceae), its position in Cinchonoideae, and description of a new genus, Ciliosemina. Taxon. 2005;54:17–28. doi: 10.2307/25065299. [DOI] [Google Scholar]

[CR35] 35.Mathew PM, Philip O. The distribution and systematic significance of pollen nuclear number in the Rubiaceae. Cytologia. 1986;51:117–124. doi: 10.1508/cytologia.51.117. [DOI] [Google Scholar]

[CR36] 36.Lee YS. Remarks on chromosome number in Rubiaceae. Korean J. Plant Taxon. 1979;9(1):57–66. doi: 10.11110/kjpt.1979.9.1.057. [DOI] [Google Scholar]

[CR37] 37.Eng WH, Ho WS, Ling KH. Cytogenetic, chromosome count optimization and automation of Neolamarckia cadamba (Rubiaceae) root tips derived from in vitro mutagenesis. Notulae Sci. Biol. 2021;13(3):10995. doi: 10.15835/nsb13310995. [DOI] [Google Scholar]

[CR38] 38.Palmer JD, Herbon LA. Plant mitochondrial-DNA evolves rapidly in structure, but slowly in sequence. J. Mol. Evol. 1988;28:87–97. doi: 10.1007/BF02143500. [DOI] [PubMed] [Google Scholar]

[CR39] 39.Morley SA, Ahmad N, Nielsen BL. Plant organelle genome replication. Plants. 2019;8:358. doi: 10.3390/plants8100358. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR40] 40.Mower JP, Sloan DB, Alverson AJ, et al. Plant mitochondrial genome diversity: the genomics revolution. In: Wendel JF, et al., editors. Plant Genome Diversity. Wien: Springer; 2012. pp. 123–144. [Google Scholar]

[CR41] 41.Guo W, Zhu A, Fan W, Mower JP. Complete mitochondrial genomes from the ferns Ophioglossum californicum and Psilotum nudum are highly repetitive with the largest organellar introns. New Phytol. 2017;213(1):391–403. doi: 10.1111/nph.14135. [DOI] [PubMed] [Google Scholar]

[CR42] 42.Dong S, et al. The complete mitochondrial genome of the early flowering plant Nymphaea colorata is highly repetitive with low recombination. BMC Genomics. 2018;19:614. doi: 10.1186/s12864-018-4991-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Shi Y, et al. Assembly and comparative analysis of the complete mitochondrial genome sequence of Sophora japonica ‘JinhuaiJ2’. PLoS ONE. 2018;13(8):0202. doi: 10.1371/journal.pone.0202485. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Wang JJ, Li DF, Li H, Yang MF, Dai RH. Structural and phylogenetic implications of the complete mitochondrial genome of Ledra auditura. Sci. Rep. 2019;9:15746. doi: 10.1038/s41598-019-52337-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR45] 45.Alexandre H, Nelly L, Jean D. Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of Metazoa, and consequences for phylogenetic inferences. Syst. Biol. 2005;54:277–298. doi: 10.1080/10635150590947843. [DOI] [PubMed] [Google Scholar]

[CR46] 46.Chung H, Won S, Kang S, Sohn S, Kim JS. The complete mitochondrial genome of Wonwhang (Pyrus pyrifolia) Mitochondrial DNA Part B Resources. 2017;2:902–903. doi: 10.1080/23802359.2017.1413300. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR47] 47.Gupta SK, Ghosh TC. Gene expressivity is the main factor indicating the codon usage variation among the genes in Pseudomonas aeruginosa. Gene. 2001;273:63–70. doi: 10.1016/S0378-1119(01)00576-5. [DOI] [PubMed] [Google Scholar]

[CR48] 48.Carlini DB, Ying C, Stephan W. The relationship between third-codon position nucleotide content, codon bias, mRNA secondary structure and gene expression in the drosophilid alcohol dehydrogenase genes Adh and Adhr. Genetics. 2001;159:623–633. doi: 10.1103/PhysRevA.86.013626. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR49] 49.Alexei F, Serge S, Walter G. Regularities of context-dependent codon bias in eukaryotic genes. Nucleic Acids Res. 2002;30:1192–1197. doi: 10.1093/nar/30.5.1192. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR50] 50.Onofrio GD, Mouchiroud D, Aïssani B, Gautier C, Bernardi G. Correlations between the compositional properties of human genes, codon usage, and amino acid composition of proteins. J. Mol. Evol. 1991;32:504–510. doi: 10.1007/BF02102652. [DOI] [PubMed] [Google Scholar]

[CR51] 51.Knight RD, Freeland SJ, Landweber LF. A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes. Genome Biol. 2001;2:2001. doi: 10.1186/gb-2001-2-4-research0010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR52] 52.Marias G, Duret L. Synonymous codon usage, accuracy of translation and gene length in Caenorhabditis elegans. J. Mol. Evol. 2001;52:275–280. doi: 10.1007/s002390010155. [DOI] [PubMed] [Google Scholar]

[CR53] 53.Moriyama EN, Powell JR. Gene length and codon usage bias in Drosophila melanogaster, Saccharomyces cerevisiae and Escherichia coli. Nucleic Acids Res. 1998;26:3188–3193. doi: 10.1093/nar/26.13.3188. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR54] 54.Moriyama EN, Powell JR. Codon usage bias and tRNA abundance in Drosophila. J. Mol. Evol. 1997;45:514–523. doi: 10.1007/PL00006256. [DOI] [PubMed] [Google Scholar]

[CR55] 55.Alverson JA, et al. Insights into the evolution of mitochondrial genome size from complete sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae) Mol. Biol. Evol. 2010;27:1436–1448. doi: 10.1093/molbev/msq029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR56] 56.Kuravadi NA, Russiachand H, Loganathan RM, Lingu CS, Siddappa S, Ramamurthy A, Sathyanarayana BN, Gowda M. Comprehensive analyses of genomes, transcriptomes and metabolites of neem tree. Peer J. 2015;3:341–345. doi: 10.7717/peerj.1066. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR57] 57.Bi QX, et al. Complete mitochondrial genome of Quercus variabilis (Fagales, Fagaceae) Mitochond. DNA Part B. 2019;4:3927–3928. doi: 10.1080/23802359.2019.1687027. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR58] 58.Sugiyama Y, et al. The complete nucleotide sequence and multipartite organization of the tobacco mitochondrial genome: comparative analysis of mitochondrial genomes in higher plants. Mol Genet Genomics. 2005;272:603–615. doi: 10.1007/s00438-004-1075-8. [DOI] [PubMed] [Google Scholar]

[CR59] 59.Rowen L, Mahairas G, Hood L. Sequencing the human genomes. Science. 1997;278:605–607. doi: 10.2165/00128413-199208300-00010. [DOI] [PubMed] [Google Scholar]

[CR60] 60.Lowe TM, Chan PP. tRNAscan-SE on-line: search and contextual analysis of transfer RNA genes. Nucleic Acids Res. 2016;44:54–57. doi: 10.1093/nar/gkw413. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR61] 61.Laslett D, Canbäck B. ARWEN, a program to detect tRNA genes in metazoan mitochondrial nucleotide sequences. Bioinformatics. 2008;24:172–175. doi: 10.1093/bioinformatics/btm573. [DOI] [PubMed] [Google Scholar]

[CR62] 62.Posada D. jModelTest: phylogenetic model averaging. Mol. Biol. Evol. 2008;25:1253–1256. doi: 10.1093/molbev/msn083. [DOI] [PubMed] [Google Scholar]

[CR63] 63.Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–2690. doi: 10.1093/bioinformatics/btl446. [DOI] [PubMed] [Google Scholar]

[CR64] 64.Rydin C, Kainulainen K, Razafimandimbison SG, Smedmark JEE, Bremer B. Deep divergences in the coffee family and the systematic position of Acranthera. Plant Syst. Evol. 2009;278:101–123. doi: 10.1007/s00606-008-0138-4. [DOI] [Google Scholar]

[CR65] 65.Doyle, J. J. & Doyle, J. L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull.19, 11–15. http://irc.igd.cornell.edu/Protocols/DoyleProtocol.pdf (1987).

[CR66] 66.Koren S, Walenz BP, Berlin K, Miller JR, Phillippy AM. Canu: Scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR67] 67.Walker BJ, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR68] 68.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR69] 69.Robinson JT, et al. Integrative genomics viewer. Nat. Biotechnol. 2011;29:24–26. doi: 10.1038/nbt.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR70] 70.Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004;20:3252–3255. doi: 10.1093/bioinformatics/bth352. [DOI] [PubMed] [Google Scholar]

[CR71] 71.Lowe TM, Eddy SR. tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR72] 72.Beier S, Thiel T, Münch T, Scholz U, Mascher M. MISA-web: A web server for microsatellite prediction. Bioinformatics. 2017;33:2583–2585. doi: 10.1093/bioinformatics/btx198. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR73] 73.Benson G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999;27:573–580. doi: 10.1093/nar/27.2.573. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR74] 74.Sharp PM, Li WH. The codon Adaptation Index—a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987;15:1281–1295. doi: 10.1093/nar/15.3.1281. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR75] 75.Hassanin A, Léger N, Deutsch J. Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of Metazoa, and consequences for phylogenetic inferences. Syst. Biol. 2005;54:277–298. doi: 10.1080/10635150590947843. [DOI] [PubMed] [Google Scholar]

[CR76] 76.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

A complete sequence of mitochondrial genome of Neolamarckia cadamba and its use for systematic analysis

Xi Wang

Ling-Ling Li

Yu Xiao

Xiao-Yang Chen

Jie-Hu Chen

Xin-Sheng Hu

Abstract

Introduction

Results and discussion

Assembly of mitochondrial genome

Figure 1.

Table 1.

Table 2.

Characteristics of nucleotide composition

Protein-coding genes and codon usage

Figure 2.

Transfer RNA and ribosomal RNA genes

Repetitive sequences

Phylogenetic analyses

Figure 3.

Conclusions

Methods

Sample collection and DNA extraction

Figure 4.

Library construction and high-throughput sequencing

Genome assembly and annotations

Comparative analysis of mitochondrial genomes

Phylogenetic analyses

Supplementary Information

Acknowledgements

Author contributions

Data availability

Competing interests

Footnotes

Supplementary Information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases