Genome assemblies for two Neotropical trees: Jacaranda copaia and Handroanthus guayacan

John T Burley; James R Kellner; Stephen P Hubbell; Brant C Faircloth

doi:10.1093/g3journal/jkab010

. 2021 Jan 28;11(2):jkab010. doi: 10.1093/g3journal/jkab010

Genome assemblies for two Neotropical trees: Jacaranda copaia and Handroanthus guayacan

John T Burley ^1,², James R Kellner ^1,², Stephen P Hubbell ³, Brant C Faircloth ^4,^✉

Editor: R K Dawe

PMCID: PMC8034707 PMID: 33693604

Abstract

The lack of genomic resources for tropical canopy trees is impeding several research avenues in tropical forest biology. We present genome assemblies for two Neotropical hardwood species, Jacaranda copaia and Handroanthus (formerly Tabebuia) guayacan, that are model systems for research on tropical tree demography and flowering phenology. For each species, we combined Illumina short-read data with in vitro proximity-ligation (Chicago) libraries to generate an assembly. For Jacaranda copaia, we obtained 104X physical coverage and produced an assembly with N50/N90 scaffold lengths of 1.020/0.277 Mbp. For H. guayacan, we obtained 129X coverage and produced an assembly with N50/N90 scaffold lengths of 0.795/0.165 Mbp. J. copaia and H. guayacan assemblies contained 95.8% and 87.9% of benchmarking orthologs, although they constituted only 77.1% and 66.7% of the estimated genome sizes of 799 and 512 Mbp, respectively. These differences were potentially due to high repetitive sequence content (>59.31% and 45.59%) and high heterozygosity (0.5% and 0.8%) in each species. Finally, we compared each new assembly to a previously sequenced genome for Handroanthus impetiginosus using whole-genome alignment. This analysis indicated extensive gene duplication in H. impetiginosus since its divergence from H. guayacan.

Keywords: genome assembly, tropical tree, Chicago library, dovetail, Jacaranda copaia, Handroanthus (Tabebuia) guayacan

Introduction

Although there is a fruitful history of population genetic and phylogenetic research on tropical forest trees using allozymes, microsatellite loci, and chloroplast loci (Hamrick and Murawski 1991; Lowe 2005; Lohmann 2006; Olmstead et al. 2009; Ragsac et al. 2019) modern genomic approaches have rarely been used to study these communities (c.f. Collevatti et al. 2019; Brousseau et al. 2020). As a result, tropical forest trees are under-represented in the universe of plant genomic studies (Plomion et al. 2016; Fetter et al. 2017), aside from a few commercially important species. However, these types of studies would advance our understanding of tropical forest communities—the ecological dynamics of which have been studied for decades (Hubbell 1979; Phillips et al. 1994; Laurance et al. 2018). For example, genome-enabled studies of tropical trees would allow us investigate: the origin and maintenance of tropical forest diversity (Dick and Pennington 2019), the genetic basis of tropical tree traits (Wright et al. 2010), or the potential for evolutionary responses of tree populations to climate change and deforestation (Hamrick 2004; Phillips et al. 2008; Alberto et al. 2013). These efforts, and more (Antonelli et al. 2018), will accelerate with the sequencing and assembly of additional tropical tree reference genomes by individual research groups and larger collaborative efforts like the 10KP project (Cheng et al. 2018).

We therefore developed reference genomes for Jacaranda copaia and Handroanthus guayacan (hereafter JACO and HAGU). Both species are insect-pollinated, wind-dispersed, predominately outcrossing canopy trees that are widely distributed throughout the Neotropics (Gentry 1974, 1976, 1992; Condit et al. 2011; Figure 1). Jacaranda is the sister clade to the core Bignoniaceae, a family that includes 827 recognized species of trees, shrubs, and lianas (Olmstead et al. 2009) with an estimated divergence of approximately 40–60 million years ago (Wikström et al. 2001; Magallón et al. 2015). JACO, HAGU, and their close relatives are important study systems for understanding tropical tree population dynamics (Jones and Hubbell 2006; Jones and Comita 2008; Kellner and Hubbell 2017, 2018), the diversification and proximate cues of flowering phenology (Gentry 1974; Reich and Borchert 1982; Lobo-Segura 2019), Neotropical phylogeography (Collevatti et al. 2012, 2019; Scotti-Saintagne et al. 2013; Vitorino et al. 2018), late-acting reproductive self-incompatibility (Bittencourt Júnior 2017), and genome size evolution (Collevatti and Dornelas 2016). In addition, many tree species in this group are economically important and simultaneously of conservation concern due to legal and illegal logging (Schulze et al. 2008; Brancalion et al. 2018), creating a need for the development of molecular markers for species identification and timber tracking (Meyer-Sand et al. 2018; Sebbenn et al. 2019). Although important for addressing these types of questions, genomic approaches have rarely featured in these investigations, with the exception of population genomic surveys of H. impetiginosus (hereafter HAIM) (Collevatti et al. 2019), which were facilitated by a reference assembly produced for that species (Silva-Junior et al. 2018).

Geographic distributions of herbarium records for species sequenced in this study (*J. copaia* and *H. guayacan*) as well as the previously sequenced *H. impetiginosus* (Silva-Junior *et al.* 2018). Data obtained from gbif.org.

Our primary reason for sequencing JACO and HAGU genomes was to create opportunities for combining population genomic analyses with estimates of demographic rates and phenotypic variation. This possibility results from highly conspicuous flowering in each species (Figure 2). Individual trees can be mapped and monitored using high-resolution satellite data, particularly among species in the genus Handroanthus (Kellner and Hubbell 2017, 2018; Kellner et al. 2019). Using satellite data to observe individuals while flowering also produces an unambiguous indicator of reproductive status and population variation in phenological traits across large areas.

Flowers of (A) *H. guayacan* (HAGU) and (B) *J. copaia* (JACO). While in bloom, individual tree crowns are highly conspicuous in (C) aerial and (D) satellite imagery of a lowland tropical forest in Central Panama. Image credits: (A and B) Steven Paton, Smithsonian Tropical Research Institute (STRI); (C) Jonathan Dandois, STRI.

In this study, we describe the sequencing and assembly of reference genomes for JACO and HAGU, we summarize the basic genomic characteristics of each species estimated using our assemblies, and we use whole genome alignment to compare both JACO and HAGU assemblies with the HAIM genome announced by Silva-Junior et al. (2018).

Materials and methods

Tissue collection from vouchered specimens

We sampled JACO tissue from one individual grown from seed in the Brown University Plant Environmental Center using seeds we purchased commercially that were collected in Huila, Colombia during November 2016. We germinated seeds in Sungro propagation mix in an irrigation tent at 27°C, and we transferred individuals to larger pots as they grew. After approximately 12 months, we collected fresh leaf tissue from one individual. Tissue was flash-frozen in liquid nitrogen and then transferred to –80°C storage within 30 min. All tissue was stored at −80°C until shipment. We archived a voucher specimen from the sampled individual in the Brown University Herbarium (accession P BRU 00008453).

For HAGU, we collected leaf tissue directly from a vouchered adult individual (accession HBG# 8421) at the Foster Botanical Garden in Honolulu, Hawai’i. Leaf tissue was flash-frozen in liquid nitrogen and then stored at −80°C until shipment. The individual we sampled was originally collected from a wild population of H. guayacan in the Canal Zone Experimental Gardens, Panama during 1939 and is now a large tree. Herbarium vouchers for this individual are stored in the Bishop Museum in Honolulu, Hawai’i (accession IDs BISH 43029, BISH 43030, BISH 43031).

Preliminary de novo assemblies using short-insert read libraries and the Meraculous assembler

We shipped frozen leaf tissue on dry ice to Dovetail Genomics, LLC (Scotts Valley, CA), for DNA extraction and library preparation. Dovetail staff extracted DNA using a standard CTAB DNA extraction, and extracts were sheared for Illumina library preparation using a Bioruptor Pico. For each species, Dovetail staff prepared two short-insert libraries with insert lengths of approximately 400 and 500 bp, following the Illumina TruSeq PCR-free protocol with no deviations. To validate these libraries, Dovetail staff used an Illumina MiSeq to generate several million paired-end (PE), 250 bp reads from each species.

Although both species had been identified by expert botanists prior to sequencing, Jacaranda and Handroanthus species can be difficult to identify using morphology. To validate the species identity of both extracts/libraries prior to sequencing them to depth, we assembled the reads generated for library validation using Spades (Bankevich et al. 2012) and a k-mer length of 55. Because chloroplast DNA was at high concentrations in both libraries, this enabled us to assemble large segments of the chloroplast genome (Straub et al. 2012). We used the NCBI BLAST (Altschul et al. 1990) search tool with no query limits to find the closest matches of our contigs to published sequences in GenBank. For JACO, the chloroplast DNA contigs matched more closely with J. copaia chloroplast sequences than for any other Jacaranda species available in GenBank. For HAGU, bioinformatic species validation was more complicated because few full cpDNA assemblies are available for Tabebuia and Handroanthus species. We overcame this difficulty by extracting standard barcoding genes (matK, rbcL, and psbA) (Kress et al. 2009), and BLASTing these against the GenBank dataset, representing 8 Handroanthus species. For matK and psbA, the top hits to our HAGU query were H. guayacan isolates (including one voucher specimen) with 100% identity over 100% length. For rbcL, however, isolates from H. albus and H. guayacan both matched with 100% identity over 100% length. Based on these three markers, given that H. albus does not occur where our sample was collected in Central Panama (Gentry 1992), and because our sample is from a vouchered specimen (identified by Alwyn Gentry), we are confident in the species identification for HAGU. After species validation, all libraries were sequenced using PE, 150 bp sequencing on an Illumina NextSeq 500.

Following sequencing, reads were trimmed to remove sequencing adapters and low-quality bases using Trimmomatic (Bolger et al. 2014). Dovetail staff profiled short-insert reads at a variety of k-mer values (19, 31, 49, 55, 79, and 109) to estimate genome size, heterozygosity, and repeat content. To optimize coverage and achieve a balance between repetitive and heterozygous fractions during assembly, Dovetail staff fit negative binomial models to the k-mer size distributions. These models suggested that a k-mer size of 55 for both species and homozygous peak depths of 78.0 (JACO) and 162.0 (HAGU) were optimal for the Meraculous assembler. Dovetail staff generated de novo assemblies by inputting paired-end libraries (described above) for each species to Meraculous v2 (Chapman et al. 2011) with a k-mer size of 55 and minimum k-mer frequencies of 15 (JACO) and 29 (HAGU). Based on the level of heterozygosity in both genomes, Dovetail staff used the pseudo-haploid mode in Meraculous.

Assembly improvement using Chicago read libraries and the HiRise scaffolding pipeline

Following de novo assembly with Meraculous, Dovetail staff prepared proprietary “Chicago” libraries following the methods described in Putnam et al. (2016). Briefly, they reconstituted ∼500 ng of HMW genomic DNA [mean fragment length = 60 kb (HAGU)], into chromatin in vitro and fixed the reconstituted DNA with formaldehyde. Then, they digested fixed chromatin with DpnII, filled in 5’ overhangs with biotinylated nucleotides, and ligated free, blunt ends. After ligation, they reversed crosslinks and purified the DNA from protein. Dovetail staff treated purified DNA to remove biotin that was not internal to ligated fragments and sheared the resulting DNA to ∼350 bp mean fragment size using a Bioruptor Pico. Dovetail staff then prepared sequencing libraries from these sheared DNA using NEBNext Ultra enzymes (New England Biolabs, Inc.) and Illumina-compatible adapters. They isolated biotin-containing fragments using streptavidin beads before PCR enrichment of each library. Dovetail Staff then sequenced amplified libraries on an Illumina HiSeq X Ten platform using PE 150 reads. Dovetail staff combined preliminary de novo Meraculous assemblies along with shotgun reads and Chicago read libraries and used the HiRise scaffolding software (Putnam et al. 2016) to iteratively break and reassemble contigs into scaffolds based on long-range linkage information provided by the Chicago reads.

Assessment of assembly contiguity and completeness

We used NX and LX statistics to evaluate assembly contiguity, and we estimated assembly completeness using BUSCO v3.0.2 (Simão et al. 2015) to search each assembly for a set of 2121 eudicot Benchmarking Universal Single-Copy Orthologs (BUSCOs) (odb10). We performed these analyses (and analyses described below) for our JACO and HAGU assemblies, as well as the previous published genome assembly of HAIM (GCA_002762385.1) (Silva-Junior et al. 2018). Finally, we used Pseudohaploid (https://github.com/schatzlab/pseudohaploid) to identify redundant sequence that may exist in HiRise assemblies due to uncollapsed heterozygous haplotypes.

Identification and classification of repetitive elements

We used the RepeatModeler v.1.0.11 (Smit and Hubley 2008) and RepeatMasker v.4.0.7 (Smit et al. 2013) to identify and classify repetitive DNA in the genome assemblies. RepeatModeler first builds a database of putative repeats from an NCBI database search guided by the input assembly, then employs two de novo repeat finding programs [RECON v.1.08 (Bao and Eddy 2002) and RepeatScout v.1.0.5 (Price et al. 2005)] to refine and classify models of putative interspersed repeats. RepeatMasker then screens the input assembly for putative repeat elements and provides summaries of genome-wide repeat content.

Genome alignment

To compare our JACO and HAGU assemblies to the HAIM assembly (Silva-Junior et al. 2018), we performed whole genome alignments using the NUCmer module of MUMmer v.4.0.0beta2 (Marçais et al. 2018) with default parameters, and we summarized the number and fraction of aligned scaffolds and bases, as well as the lengths and identity score of alignments, using the dnadiff module of MUMmer. In each species comparison, we used the assembly with the highest N50 and N90 as the reference sequence and the other assembly as the query. We also performed an alignment of JACO and the high-quality assembly of Olea europeae (Cruz et al. 2016), which is another species within the taxonomic order (Lamiales) that includes JACO and HAGU.

Data availability

All sequencing data and assemblies are available through NCBI BioProject PRJNA614908. The Whole Genome Shotgun assemblies have been deposited at DDBJ/ENA/GenBank under the accession JACGDI000000000 (JACO) & JACFYI000000000 (HAGU). The versions described in this manuscript are JACGDI010000000 and JACFYI010000000. Supplemental material is available at figshare: https://doi.org/10.25387/g3.12964592.

Results and discussion

The Jacaranda copaia genome

Sequencing of two paired-end libraries with mean inserts of length 413 and 519 bp produced a total of 369.8 million read pairs (Supplementary Table S1). Dovetail staff estimated a genome size of 799 Mbp based on k-mer analysis of these reads using the optimal k-mer length of 55 bp; as well as heterozygosity of 0.5% and repeat content of 18.6% (Supplementary Table S2). This genome size estimate is within the range of estimates for three other Jacaranda species obtained via flow cytometry (558.6–1274.0 Mbp) (Collevatti and Dornelas 2016). The preliminary assembly produced by Meraculous contained 53,440 contigs with N50 of 36.3 Kbp (Table 1). These contigs were joined by Meraculous into 29,490 scaffolds with a total length of 613.9 Mbp, an N50 of 56.7 Kbp, and L90 of 12,202 scaffolds (Table 1).

Table 1.

Basic genome assembly metrics and quality statistics

Species	J. copaia		H. guayacan
Assembly method	Meraculous	Dovetail HiRise assembly	Meraculous	Dovetail HiRise assembly
Total length	613.86 Mbp	616.19 Mbp	336.39 Mbp	339.77 Mbp
Scaffold N50	0.057 Mbp	1.020 Mbp	0.017 Mbp	0.795 Mbp
Scaffold N90	0.010 Mbp	0.277 Mbp	0.004 Mbp	0.165 Mbp
Scaffold L50	2,974 scaffolds	184 scaffolds	5,519 scaffolds	113 scaffolds
Scaffold L90	12,202 scaffolds	624 scaffolds	21,878 scaffolds	452 scaffolds
Longest scaffold	506,509 bp	5,337,482 bp	148,601 bp	4,933,827 bp
Number of scaffolds	29,490	5,869	38,393	3,857
Number of scaffolds >1 kb	29,490	5,869	38,393	3,856
Number of scaffolds >10 kb	12,474	1,086	10,322	825
Proportion of assembly in scaffolds >10 kb	0.90	0.98	0.69	0.98
Number of contigs	53,440	53,041	70,867	70,146
Contig N50	36.27 Kbp	36.74 Kbp	12.25 Kbp	12.47 Kbp
GC content	—	32.99%	—	33.24%
Number of gaps	23,950	47,236	32,474	66,288
Percent of genome in gaps	0.44%	0.81%	1.32%	2.30%

Open in a new tab

We then sequenced two Chicago libraries, which yielded 421 million read pairs (Supplementary Table S1) or 104.00X physical coverage of the Meraculous assembly. HiRise made 23,645 joins and 24 breaks to the Meraculous assembly, closing 359 gaps to form the Chicago assembly. The improved JACO HiRise assembly contained 5869 scaffolds with a total length of 616.19 Mbp, or 77.1% of the k-mer estimated genome size of 799 Mbp (Table 1, Supplementary Table S2); and it had an N50 of 1.020 Mbp, an L90 of 624 scaffolds, and a longest scaffold of 5.34 Mbp (Table 1).

The Handroanthus guayacan genome

Sequencing of two paired-end libraries with mean inserts of length 414 and 518 bp produced a total of 558.78 million read pairs (Supplementary Table S1). Dovetail staff estimated a genome size of 512 Mbp based on k-mer analysis of these reads using the optimal k-mer length of 55 bp; as well as heterozygosity of 0.8% and repeat content of 19.24% (Supplementary Table S2). The HAGU genome size estimate is within the range of estimates for seven other Handroanthus species obtained via flow cytometry (Collevatti and Dornelas 2016). The preliminary assembly produced by Meraculous contained 70,867 contigs with N50 of 12.2 Kbp (Table 1). These contigs were joined by Meraculous into 38,393 scaffolds with a total length of 613.9 Mbp, an N50 of 17.1 Kbp, and an L90 of 21,878 scaffolds (Table 1).

We then sequenced one Chicago library, which yielded 414 million read pairs (Supplementary Table S1), or 128.99X physical coverage of the Meraculous assembly. HiRise made 34,560 joins and 24 breaks to the Meraculous assembly, closing 746 gaps to form the Chicago assembly. The JACO HiRise assembly contained 3857 scaffolds with a total length of 339.77 Mbp, or 66.4% of the k-mer estimated genome size of 512 Mbp (Table 1, Supplementary Table S2); and it had an N50 of 0.795 Mbp, an L90 of 452 scaffolds, and a longest scaffold of 4.93 Mbp (Table 1).

Assembly quality assessments

Chicago scaffolding of the Meraculous assembly improved assembly contiguity for both species, illustrated by 17.9-fold (JACO) and 46.8-fold (HAGU) increase in scaffold N50 (Table 1). For both species, 98% of HiRise assemblies were contained in scaffolds longer than 10 Kbp (Table 1).

Improvements to the Meraculous assembly by Chicago scaffolding were also evident by fewer missing and fragmented BUSCOs for both species. Complete and unfragmented BUSCOs increased from 92.3% to 95.8% (JACO) and 82.8% to 87.9% (HAGU) as a result of HiRise scaffolding (Table 2). For comparison to our BUSCO results, we also analyzed the HAIM assembly, which showed a similar level of BUSCO completeness to HAGU despite different assembly methods and contiguity statistics (Table 2).

Table 2.

Genome completeness assessed using 2,121 eudicot Benchmarking Universal Single-Copy Orthologs (BUSCOs)

Species:	J. copaia		H. guayacan		H. impetiginosus
Assembly:	Meraculous	HiRise	Meraculous	HiRise	Himp0.1
Complete	1,957	2,032	1,754	1,864	1,847
Single copy	1,844	1,911	1,676	1,775	1,574
Duplicated	113	121	78	89	273
Fragmented	82	34	179	99	103
Missing	82	55	188	158	171
% Complete	92.3	95.8	82.8	87.9	87.1

Open in a new tab

Despite the high level of genic content represented in each assembly, both comprised a relatively low fraction of their estimated genome size, compared to other draft genome assemblies of trees (Silva-Junior et al. 2018; Supplementary Table S3). This may be a product of the assembly algorithm, Meraculous, which will not make ambiguous joins and, therefore, is unlikely to resolve repeat-dense regions (Chapman et al. 2011). If this is the case, the relatively high BUSCO scores for both species suggest that the difference between estimated and realized genome size may result from the extensive presence of repetitive sequence.

Repetitive sequences

Although Meraculous will not resolve some repeat-dense regions, we used standard procedures to identify and classify the repetitive elements in both JACO and HAGU assemblies, and we compared these results to estimates for HAIM (Silva-Junior et al. 2018). We identified 59.31% (JACO) and 45.59% (HAGU) total repetitive sequence content in each of our assemblies (Table 3). The vast majority of repeats were interspersed, of which approximately one third were not further classified (Table 3). Of the interspersed repeats that were classified, <1% were long interspersed nuclear elements (LINEs), 24.93% in JACO and 10.85% in HAGU were long terminal repeats (LTRs), and 2.97% in JACO and 1.53% in HAGU were DNA elements (Table 3). HAGU and HAIM contained similar repetitive sequence fractions in almost all categories, relative to JACO (Table 3).

Table 3.

Predicted repetitive sequence content in assemblies

	J. copaia			H. guayacan			H. impetiginosus
	Number of elements	Length (Mb)	Fraction of assembly (%)	Number of elements	Length (Mb)	Fraction of assembly (%)	Number of elements	Length (Mb)	Fraction of assembly (%)
Total interspersed repeats	619,981	356.7	57.89	451,277	149.8	44.08	634,428	231.8	45.06
LINEs	3,759	3.3	0.54	3,081	2.1	0.63	3,974	3.3	0.66
LTR elements	105,619	153.6	24.93	43,361	36.9	10.85	52,006	56.7	11.26
DNA elements	27,586	17.2	2.97	9,425	5.2	1.53	18,240	10.1	2.01
Unclassified	483,017	182.6	29.63	395,410	105.6	31.07	560,208	161.7	32.13
Simple repeats	151,890	9.0	1.47	125,684	5.1	1.51	182,721	7.8	1.54
Low complexity	26,279	1.3	0.21	23,734	1.1	0.33	33,613	1.6	0.33
Total	798,150	367.0	59.57	600,695	156.0	45.92	850,762	241.2	46.93

Open in a new tab

Although repeat fractions estimated by RepeatMasker are more reliable than those estimated using k-mer analysis, the proportions of genome-wide repeat content and individual repeat classes we computed are likely underestimated. This is because (1) both assemblies are incomplete with respect to estimated genome sizes (see above) and (2) the assembly technology used is systematically biased to perform poorly for genome regions that contain long-repeats (Chapman et al. 2011) or that are highly repetitive, such as heterochromatic regions (Altemose et al. 2014). Given that LTR elements typically comprise the majority of repetitive sequence in plant genomes (Vitte and Bennetzen 2006; Tenaillon et al. 2011), the high-estimated fraction of LTR repeats helps explain the difference we observed between estimated genome size and realized assembly length.

Genome alignments

Because we were interested in: (1) obtaining a broad comparison of sequence divergence among the taxa, (2) gauging the extent of possible chromosomal rearrangements (specifically and duplications), and (3) identifying potential assembly errors, we performed whole-genome sequence alignments between the JACO, HAGU, and HAIM genome assemblies.

Assemblies for HAGU and HAIM shared a high fraction of aligned scaffolds: 3655 (94.76%) for HAGU and 12,398 (93.90%) for HAIM (Table 4). This alignment includes 233 Mbp in 149,208 1-to-1 best alignments—locations where HAGU aligned to its best hit in HAIM, and vice versa—with an average identity of 90.34%, which represents approximately 68.6% of the HAGU assembly (total length of 339.77 Mbp) and 46.3% of HAIM (total length of 503.29 Mbp) (Table 4; Figure 3). In contrast to the HAGU-HAIM comparison, JACO and HAGU have a far less extensive alignment, with only 22 Mbp of both genomes in 68,689 1-to-1 best alignments, although the average identity of those alignments is high (88.73%); and a similar result was obtained in the JACO-HAIM alignment (Table 4; Figure 3). This low degree of alignment likely reflects divergence of almost all but the most highly conserved sequence between Handroanthus and Jacaranda, which is unsurprising given that they shared a common ancestor >40 million years ago (Lohmann et al. 2013; Magallón et al. 2015).

Table 4.

Genome alignment summary statistics

Species comparison	H. guayacan and H. impetiginosus		J. copaia and H. impetiginosus		J. copaia and H. guayacan
	Reference	Query	Reference	Query	Reference	Query
	HAGU	HIMP	JACO	HIMP	JACO	HAGU
Sequences	—	—	—	—	—	—
Total	3,857	13,204	5,869	13,204	5,869	3,857
Aligned	3,655 (94.76%)	12,398 (93.90%)	2,172 (37.01%)	8,150 (61.72%)	2,257 (38.46%)	1,937 (50.22%)
Unaligned	202 (5.24%)	806 (6.10%)	3,697 (62.99%)	5,054 (38.28%)	3,612 (61.54%)	1,920 (49.78%)
Bases	—	—	—	—	—	—
Total (Mb)	339.77	503.29	616.19	503.29	616.19	339.77
Aligned (Mb)	266.8 (78.52%)	374.8 (74.46%)	21.8 (3.54%)	32.8 (6.52%)	23.8 (3.86%)	26.8 (7.87%)
Unaligned (Mb)	72.9 (21.48%)	128.5 (25.54%)	594.4 (96.46%)	470.5 (93.48%)	592.4 (96.14%)	313.0 (92.13%)
Alignments	—	—	—	—	—	—
1-to-1	149,208	149,208	44,485	44,485	68,689	68,689
Total length (Mb)	232.8	232.7	20.2	20.1	22.2	22.2
Av. length (Bp)	1561	1560	454	453	324	323
Av. identity (%)	90.34	90.34	87.27	87.27	88.73	88.73
Many-to-Many	447,736	447,736	91,126	91,126	174,083	174,083
Total length (Mb)	479.4	479.1	40.8	40.7	38.3	38.2
Av. length (Bp)	1071	1070	448	447	220	220
Av. identity (%)	89.94	89.94	91.87	91.87	92.64	92.64

Open in a new tab

Depiction of relationships among the three currently available Bignoniaceae genomes and a high-quality assembly of the European olive tree (*Olea europeae*) derived from relationships of *Jacaranda* and *Handroanthus* resolved in Grose and Olmstead (2007a) and Olmstead *et al.* (2009). Node values depict whole genome alignment summary metrics: total length of 1-to-1 alignments (black) and average percentage identify of 1:1 alignments (gray).

Further inspection of HAGU-HAIM alignments suggests the possibility of extensive gene duplication since their divergence. The length of many-to-many alignments between HAGU and HAIM represents 141% of the HAGU assembly and 95% of HAIM (Table 4). Therefore, the length of the HAGU-HAIM many-to-many alignments exceeds the HAGU assembly by 139 Mbp. One possibility to explain this excess is that many gene duplications occurred in HAIM since its divergence from a common ancestor shared with HAGU such that multiple gene copies in HAIM are homologous with, and relatively undifferentiated from, single-copy regions in HAGU. Another possibility is that the HAIM assembly contains a relatively high proportion of uncollapsed heterozygous sequence. This can occur during genome assembly if highly heterozygous segments in a diploid genome are mistakenly characterized as two separate sequences rather than homologous alleles on opposite chromosome arms, thereby erroneously extending the length of a genome assembly. To investigate these two possibilities, we first generated a Venn diagram of BUSCOs obtained in each assembly. HAIM has 203 unique duplicated BUSCOs compared to 24 in HAGU (Figure 4); and the total number of duplicated BUSCOs is threefold higher in HAIM than HAGU (Table 2). Most of the duplicated BUSCOs in HAIM appear as single copies in HAGU (71%); but the opposite is not true with only 42% of HAGU duplicated BUSCOs appearing as single copies in HAIM. While this result is consistent with the hypothesis that HAIM accumulated numerous duplications since its divergence from the HAIM-HAGU common ancestor, it does not exclude the possibility of assembly artifact. However, Pseudohaploid identified only 27.5 Mbp of probable uncollapsed heterozygous sequence in HAIM (and none in HAGU or JACO). In addition, the HAIM assembly length (503 Mbp) is close to the genome size estimated through flow cytometry for that species (499.8 Mbp) (Collevatti and Dornelas 2016; Silva-Junior et al. 2018). It therefore seems unlikely that uncollapsed heterozygous sequences account for the excess of many-to-many alignments between HAGU-HAIM. In sum, this analysis suggests gene duplications may have occurred extensively in HAIM since its divergence from a common ancestor shared with HAGU, although we cannot rule out the possibility of gene-loss in HAGU during the same period.

Shared and unique duplicated BUSCOs in genome assemblies for JACO, HAGU, and HAIM.

Outlook

The genome assemblies for JACO and HAGU presented here meet standards of the field for use in studies involving genetic marker discovery, gene and gene family evolution, comparative genomics, phylogenomics and ecological genomics (Sork et al. 2016; Plomion et al. 2018; Tuskan et al. 2018) (Supplementary Table S3). These assemblies add to a rapidly expanding collection of publicly available genomic resources for the Bignoniaceae including, to date, six transcriptomes (Leebens-Mack et al. 2019; Zhang et al. 2020) and the HAIM genome assembly (Silva-Junior et al. 2018). Each assembly may be further improved by the incorporation of long-reads for gap-filling (English et al. 2012; Rhie et al. 2020), linkage maps created by population resequencing (The Heliconius Genome Consortium 2012; Kawakami et al. 2017), or syntenic information from alignments of multiple draft Bignoniaceae genomes (Kolmogorov et al. 2018). Chromosome-scale assemblies of additional Bignoniaceae will aid further investigations of the remarkable variation in genome size and chromosome number documented in these taxa (Collevatti and Dornelas 2016; Mendes et al. 2018). Furthermore, phylogenomic investigations of Handroanthus and relatives in the Tabebuia alliance will increase resolution of unclear speciation histories (Grose and Olmstead 2007a, 2007b) that may involve hybridization among sympatric species (Gentry 1992; Pinheiro et al. 2018). Finally, reference genomes for JACO and HAGU will facilitate ongoing research efforts to investigate landscape-scale interactions between population genetic processes and population dynamics of tropical canopy trees where genome resequencing data are combined with high throughput surveys of individual-level traits, such as flowering time, and population demographic parameters such as density, recruitment and mortality (Kellner and Hubbell 2017, 2018).

Acknowledgments

We thank Naomi Hoffman, Brenden Holland, Barbara Kennedy, Fred Jackson, Sherry Warner, Rebecca Kartzinel, Timothy Whitfeld, Martha Cooper, Laura Lagomarsino, Jennifer Kluse, Joaquin Nunez, Orzenil da Silva-Junior, Rosane Collevatti, Jasmine Haimovitz, and Sierra McWilson for their help collecting, preparing, sequencing, and assembling data from exemplar samples. We also thank Richard Olmstead and two anonymous reviewers for their comments that improved the manuscript.

Funding

This work was supported by National Science Foundation Grant DEB-1358915 (to J.R.K) and DEB-1146440 (to S.P.H. and B.C.F.).

Conflicts of interest: None declared.

Literature cited

Alberto FJ, Aitken SN, Alía R, González-Martínez SC, Hänninen H, et al. 2013. Potential for evolutionary responses to climate change—evidence from tree populations. Glob Change Biol. 19:1645–1661. [DOI] [PMC free article] [PubMed] [Google Scholar]
Altemose N, Miga KH, Maggioni M, Willard HF. 2014. Genomic characterization of large heterochromatic gaps in the human genome assembly. PLoS Comput Biol. 10:e1003628. [DOI] [PMC free article] [PubMed] [Google Scholar]
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol. 215:403–410. [DOI] [PubMed] [Google Scholar]
Antonelli A, Ariza M, Albert J, Andermann T, Azevedo J, et al. 2018. Conceptual and empirical advances in Neotropical biodiversity research. PeerJ. 6:e5644. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, et al. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comp Biol. 19:455–477. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bao Z, Eddy SR. 2002. Automated de novo identification of repeat sequence families in sequenced genomes. Genome Res. 12:1269–1276. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bittencourt Júnior NS. 2017. Evidence for post-zygotic self-incompatibility in Handroanthus impetiginosus (Bignoniaceae). Plant Reprod. 30:69–79. [DOI] [PubMed] [Google Scholar]
Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30:2114–2120. [DOI] [PMC free article] [PubMed] [Google Scholar]
Brancalion PHS, de Almeida DRA, Vidal E, Molin PG, Sontag VE, et al. 2018. Fake legal logging in the Brazilian Amazon. Sci Adv. 4:eaat1192. [DOI] [PMC free article] [PubMed] [Google Scholar]
Brousseau L, Fine PVA, Dreyer E, Vendramin GG, Scotti I. 2020. Genomic and phenotypic divergence unveil microgeographic adaptation in the Amazonian hyperdominant tree Eperua falcata Aubl (Fabaceae). Mol Ecol. 10.1111/mec.15595. [DOI] [PubMed] [Google Scholar]
Chapman JA, Ho I, Sunkara S, Luo S, Schroth GP, et al. 2011. Meraculous: de novo genome assembly with short paired-end reads. PLoS One. 6:e23501. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cheng S, Melkonian M, Smith SA, Brockington S, Archibald JM, et al. 2018. 10KP: a phylodiverse genome sequencing plan. GigaScience. 7:1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Collevatti RG, Dornelas MC. 2016. Clues to the evolution of genome size and chromosome number in Tabebuia alliance (Bignoniaceae). Plant Syst Evol. 302:601–607. [Google Scholar]
Collevatti RG, Novaes E, Silva-Junior OB, Vieira LD, Lima-Ribeiro MS, et al. 2019. A genome-wide scan shows evidence for local adaptation in a widespread keystone Neotropical forest tree. Heredity. 123:117–137. [DOI] [PMC free article] [PubMed] [Google Scholar]
Collevatti RG, Terribile LC, Lima‐Ribeiro MS, Nabout JC, de Oliveira G, et al. 2012. A coupled phylogeographical and species distribution modelling approach recovers the demographical history of a Neotropical seasonally dry forest tree species. Mol Ecol. 21:5845–5863. [DOI] [PubMed] [Google Scholar]
Condit R, Perez R. 2011. Trees of Panama and Costa Rica. Princeton, NJ: Princeton University Press. [Google Scholar]
Cruz F, Julca I, Gómez-Garrido J, Loska D, Marcet-Houben M, et al. 2016. Genome sequence of the olive tree, Olea europaea. GigaScience. 5:29. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dick CW, Pennington RT. 2019. History and geography of Neotropical tree diversity. Annu Rev Ecol Evol Syst. 50:279–301. [Google Scholar]
English AC, Richards S, Han Y, Wang M, Vee V, et al. 2012. Mind the gap: upgrading genomes with pacific biosciences RS long-read sequencing technology. PLoS One 7:e47768. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fetter KC, Gugger PF, 2017. Landscape genomics of angiosperm trees: from historic roots to discovering new branches of adaptive evolution. In: Keller SR, Groover A, Cronk Q, editors. Comparative and Evolutionary Genomics of Angiosperm Trees. Cham: Springer International Publishing. p. 303–333. [Google Scholar]
Gentry AH. 1974. Flowering phenology and diversity in tropical bignoniaceae. Biotropica. 6:64–68. [Google Scholar]
Gentry AH. 1976. Bignoniaceae of southern central America: distribution and ecological specificity. Biotropica. 8:117–131. [Google Scholar]
Gentry AH. 1992. Bignoniaceae: Part II (Tribe Tecomeae). Flora Neotropica. 25:1–370. [Google Scholar]
Grose SO, Olmstead RG. 2007a. Evolution of a charismatic neotropical clade: molecular phylogeny of tabebuia s. l., crescentieae, and allied genera (Bignoniaceae). Syst Botany. 32:650–659. [Google Scholar]
Grose SO, Olmstead RG. 2007b. Taxonomic revisions in the polyphyletic genus tabebuia s. l. (Bignoniaceae). Syst Botany. 32:660–670. [Google Scholar]
Hamrick JL. 2004. Response of forest trees to global environmental changes. Forest Ecol Manage. 197:323–335. [Google Scholar]
Hamrick JL, Murawski DA. 1991. Levels of allozyme diversity in populations of uncommon neotropical tree species. J Trop Ecol. 7:395–399. [Google Scholar]
Hubbell SP. 1979. Tree dispersion, abundance, and diversity in a tropical dry forest. Science. 203:1299–1309. [DOI] [PubMed] [Google Scholar]
Jones FA, Comita LS. 2008. Neighbourhood density and genetic relatedness interact to determine fruit set and abortion rates in a continuous tropical tree population. Proc Biol Sci. 275:2759–2767. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jones FA, Hubbell SP. 2006. Demographic spatial genetic structure of the neotropical tree, Jacaranda copaia: demographic genetics of Jacaranda copaia. Mol Ecol. 15:3205–3217. [DOI] [PubMed] [Google Scholar]
Kawakami T, Mugal CF, Suh A, Nater A, Burri R, et al. 2017. Whole-genome patterns of linkage disequilibrium across flycatcher populations clarify the causes and consequences of fine-scale recombination rate variation in birds. Mol Ecol. 26:4158–4172. [DOI] [PubMed] [Google Scholar]
Kellner JR, Albert LP, Burley JT, Cushman KC. 2019. The case for remote sensing of individual plants. Am J Bot. 106:1139–1142. [DOI] [PubMed] [Google Scholar]
Kellner JR, Hubbell SP. 2017. Adult mortality in a low-density tree population using high-resolution remote sensing. Ecology. 98:1700–1709. [DOI] [PubMed] [Google Scholar]
Kellner JR, Hubbell SP. 2018. Density-dependent adult recruitment in a low-density tropical tree. Proc Natl Acad Sci U S A. 115:11268–11273. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kolmogorov M, Armstrong J, Raney BJ, Streeter I, Dunn M, et al. 2018. Chromosome assembly of large and complex genomes using multiple references. Genome Res. 28:1720–1713. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kress WJ, Erickson DL, Jones FA, Swenson NG, Perez R, et al. 2009. Plant DNA barcodes and a community phylogeny of a tropical forest dynamics plot in Panama. Proc Natl Acad Sci U S A. 106:18621–18626. [DOI] [PMC free article] [PubMed] [Google Scholar]
Laurance WF, Camargo JLC, Fearnside PM, Lovejoy TE, Williamson GB, et al. 2018. An Amazonian rainforest and its fragments as a laboratory of global change. Biol Rev. 93:223–247. [DOI] [PubMed] [Google Scholar]
Leebens-Mack JH, Barker MS, Carpenter EJ, Deyholos MK, Gitzendanner MA, et al. 2019. One thousand plant transcriptomes and the phylogenomics of green plants. Nature. 574:679–685. [10.1038/s41586-019-1693-2] [DOI] [PMC free article] [PubMed] [Google Scholar]
Lobo-Segura J-A. 2019. Diversity of phenological patterns of Handroanthus ochraceus (Bignoniaceae) in Costa Rica. Rev Biol Trop. 67:S149–S158. [Google Scholar]
Lohmann LG. 2006. Untangling the phylogeny of neotropical lianas (Bignonieae, Bignoniaceae). Am J Bot. 93:304–318. [DOI] [PubMed] [Google Scholar]
Lohmann LG, Bell CD, Calió MF, Winkworth RC. 2013. Pattern and timing of biogeographical history in the Neotropical tribe Bignonieae (Bignoniaceae): Bignonieae Biogeography. Bot J Linn Soc. 171:154–170. [Google Scholar]
Lowe A. 2005. Population genetics of neotropical trees focus issue. Heredity. 95:243–245. [DOI] [PubMed] [Google Scholar]
Magallón S, Gómez‐Acevedo S, Sánchez‐Reyes LL, Hernández‐Hernández T. 2015. A metacalibrated time-tree documents the early rise of flowering plant phylogenetic diversity. New Phytol. 207:437–453. [DOI] [PubMed] [Google Scholar]
Marçais G, Delcher AL, Phillippy AM, Coston R, Salzberg SL, et al. 2018. MUMmer4: a fast and versatile genome alignment system. PLoS Comput Biol. 14:e1005944. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mendes MG, de Oliveira AP, Oliveira PE, Bonetti AM, Sampaio DS. 2018. Sexual, apomictic and mixed populations in Handroanthus ochraceus (Bignoniaceae) polyploid complex. Plant Syst Evol. 304:817–829. [Google Scholar]
Meyer-Sand BRV, Blanc-Jolivet C, Mader M, Paredes-Villanueva K, Tysklind N, et al. 2018. Development of a set of SNP markers for population genetics studies of Ipe (Handroanthus sp.), a valuable tree genus from Latin America. Conservation Genet Res. 10:779–781. [Google Scholar]
Olmstead RG, Zjhra ML, Lohmann LG, Grose SO, Eckert AJ. 2009. A molecular phylogeny and classification of Bignoniaceae. Am J Botany. 96:1731–1743. [DOI] [PubMed] [Google Scholar]
Phillips OL, Hall P, Gentry AH, Sawyer SA, Vasquez R. 1994. Dynamics and species richness of tropical rain forests. Proc Natl Acad Sci U S A. 91:2805–2809. [DOI] [PMC free article] [PubMed] [Google Scholar]
Phillips OL, Lewis SL, Baker TR, Chao K-J, Higuchi N. 2008. The changing Amazon forest. Philos Trans R Soc B. 363:1819–1827. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pinheiro F, Dantas-Queiroz MV, Palma-Silva C. 2018. Plant species complexes as models to understand speciation and evolution: a review of south American studies. Crit Rev Plant Sci. 37:54–80. [Google Scholar]
Plomion C, Aury J-M, Amselem J, Leroy T, Murat F, et al. 2018. Oak genome reveals facets of long lifespan. Nat Plants. 4:440–452. [DOI] [PMC free article] [PubMed] [Google Scholar]
Plomion C, Bastien C, Bogeat-Triboulot M-B, Bouffier L, Déjardin A, et al. 2016. Forest tree genomics: 10 achievements from the past 10 years and future prospects. Ann Forest Sci. 73:77–103. [Google Scholar]
Price AL, Jones NC, Pevzner PA. 2005. De novo identification of repeat families in large genomes. Bioinformatics. 21:i351–i358. [DOI] [PubMed] [Google Scholar]
Putnam NH, O'Connell BL, Stites JC, Rice BJ, Blanchette M, et al. 2016. Chromosome-scale shotgun assembly using an in vitro method for long-range linkage. Genome Res. 26:342–350. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ragsac AC, Farias‐Singer R, Freitas LB, Lohmann LG, Olmstead RG. 2019. Phylogeny of the Neotropical tribe Jacarandeae (Bignoniaceae). Am J Bot. 106:1589–1601. [DOI] [PubMed] [Google Scholar]
Reich PB, Borchert R. 1982. Phenology and ecophysiology of the tropical tree, tabebuia neochrysantha (Bignoniaceae). Ecology. 63:294–299. [Google Scholar]
Rhie A, McCarthy SA, Fedrigo O, Damas J, Formenti G, et al. 2020. Towards complete and error-free genome assemblies of all vertebrate species. bioRxiv 05.22.110833. [DOI] [PMC free article] [PubMed]
Schulze M, Grogan J, Uhl C, Lentini M, Vidal E. 2008. Evaluating ipê (Tabebuia, Bignoniaceae) logging in Amazonia: sustainable management or catalyst for forest degradation? Bio Conserv. 141:2071–2085. [Google Scholar]
Scotti-Saintagne C, Dick CW, Caron H, Vendramin GG, Troispoux V, et al. 2013. Amazon diversification and cross-Andean dispersal of the widespread neotropical tree species Jacaranda copaia (Bignoniaceae). J Biogeography. 40:707–719. [Google Scholar]
Sebbenn AM, Blanc-Jolivet C, Mader M, Meyer-Sand BRV, Paredes-Villanueva K, et al. 2019. Nuclear and plastidial SNP and INDEL markers for genetic tracking studies of Jacaranda copaia. Conserv Genet Res. 11:341–343. [Google Scholar]
Silva-Junior OB, Grattapaglia D, Novaes E, Collevatti RG. 2018. Genome assembly of the Pink Ipê (Handroanthus impetiginosus, Bignoniaceae), a highly valued, ecologically keystone Neotropical timber forest tree. GigaScience. 7:16. [DOI] [PMC free article] [PubMed] [Google Scholar]
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. 2015. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 31:3210–3212. [DOI] [PubMed] [Google Scholar]
Smit AFA, Hubley R. 2008-2015. RepeatModeler Open-1.0 < http://www.repeatmasker.org>.
Smit AFA, Hubley R, Green P. 2013-2015. RepeatMasker Open-4.0 < http://www.repeatmasker.org>.
Sork VL, Fitz-Gibbon ST, Puiu D, Crepeau M, Gugger PF, et al. 2016. First draft assembly and annotation of the genome of a California endemic oak Quercus lobata nee (Fagaceae). G3 (Bethesda). 6:3485–3495. [DOI] [PMC free article] [PubMed] [Google Scholar]
Straub SCK, Parks M, Weitemier K, Fishbein M, Cronn RC, et al. 2012. Navigating the tip of the genomic iceberg: next-generation sequencing for plant systematics. Am J Bot. 99:349–364. [DOI] [PubMed] [Google Scholar]
Tenaillon MI, Hufford MB, Gaut BS, Ross-Ibarra J. 2011. Genome size and transposable element content as determined by high-throughput sequencing in maize and Zea luxurians. Genome Biol Evol. 3:219–229. [DOI] [PMC free article] [PubMed] [Google Scholar]
The Heliconius Genome Consortium. 2012. Butterfly genome reveals promiscuous exchange of mimicry adaptations among species. Nature. 487:94–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tuskan GA, Groover AT, Schmutz J, DiFazio SP, Myburg A, et al. 2018. Hardwood tree genomics: unlocking woody plant biology. Front Plant Sci. 9:1799. [DOI] [PMC free article] [PubMed] [Google Scholar]
Vitorino LC, Lima-Ribeiro MS, Terribile LC, Collevatti RG. 2018. Demographical expansion of Handroanthus ochraceus in the Cerrado during the Quaternary: implications for the genetic diversity of neotropical trees. Bio J Linn Soc. 123:561–577. [Google Scholar]
Vitte C, Bennetzen JL. 2006. Analysis of retrotransposon structural diversity uncovers properties and propensities in angiosperm genome evolution. Proc Natl Acad Sci U S A. 103:17638–17643. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wikström N, Savolainen V, Chase MW. 2001. Evolution of the angiosperms: calibrating the family tree. Proc Biol Sci. 268:2211–2220. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wright SJ, Kitajima K, Kraft NJB, Reich PB, Wright IJ, et al. 2010. Functional traits and the growth–mortality trade-off in tropical trees. Ecology. 91:3664–3674. [DOI] [PubMed] [Google Scholar]
Zhang C, Zhang T, Luebert F, Xiang Y, Huang C-H, et al. 2020. Asterid phylogenomics/phylotranscriptomics uncover morphological evolutionary histories and support phylogenetic placement for numerous whole-genome duplications. Mol Bio Evol. 37:3188–3210. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

[jkab010-B1] Alberto FJ, Aitken SN, Alía R, González-Martínez SC, Hänninen H, et al. 2013. Potential for evolutionary responses to climate change—evidence from tree populations. Glob Change Biol. 19:1645–1661. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B2] Altemose N, Miga KH, Maggioni M, Willard HF. 2014. Genomic characterization of large heterochromatic gaps in the human genome assembly. PLoS Comput Biol. 10:e1003628. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B3] Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol. 215:403–410. [DOI] [PubMed] [Google Scholar]

[jkab010-B4] Antonelli A, Ariza M, Albert J, Andermann T, Azevedo J, et al. 2018. Conceptual and empirical advances in Neotropical biodiversity research. PeerJ. 6:e5644. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B5] Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, et al. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comp Biol. 19:455–477. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B6] Bao Z, Eddy SR. 2002. Automated de novo identification of repeat sequence families in sequenced genomes. Genome Res. 12:1269–1276. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B7] Bittencourt Júnior NS. 2017. Evidence for post-zygotic self-incompatibility in Handroanthus impetiginosus (Bignoniaceae). Plant Reprod. 30:69–79. [DOI] [PubMed] [Google Scholar]

[jkab010-B8] Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30:2114–2120. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B9] Brancalion PHS, de Almeida DRA, Vidal E, Molin PG, Sontag VE, et al. 2018. Fake legal logging in the Brazilian Amazon. Sci Adv. 4:eaat1192. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B10] Brousseau L, Fine PVA, Dreyer E, Vendramin GG, Scotti I. 2020. Genomic and phenotypic divergence unveil microgeographic adaptation in the Amazonian hyperdominant tree Eperua falcata Aubl (Fabaceae). Mol Ecol. 10.1111/mec.15595. [DOI] [PubMed] [Google Scholar]

[jkab010-B11] Chapman JA, Ho I, Sunkara S, Luo S, Schroth GP, et al. 2011. Meraculous: de novo genome assembly with short paired-end reads. PLoS One. 6:e23501. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B12] Cheng S, Melkonian M, Smith SA, Brockington S, Archibald JM, et al. 2018. 10KP: a phylodiverse genome sequencing plan. GigaScience. 7:1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B13] Collevatti RG, Dornelas MC. 2016. Clues to the evolution of genome size and chromosome number in Tabebuia alliance (Bignoniaceae). Plant Syst Evol. 302:601–607. [Google Scholar]

[jkab010-B14] Collevatti RG, Novaes E, Silva-Junior OB, Vieira LD, Lima-Ribeiro MS, et al. 2019. A genome-wide scan shows evidence for local adaptation in a widespread keystone Neotropical forest tree. Heredity. 123:117–137. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B15] Collevatti RG, Terribile LC, Lima‐Ribeiro MS, Nabout JC, de Oliveira G, et al. 2012. A coupled phylogeographical and species distribution modelling approach recovers the demographical history of a Neotropical seasonally dry forest tree species. Mol Ecol. 21:5845–5863. [DOI] [PubMed] [Google Scholar]

[jkab010-B16] Condit R, Perez R. 2011. Trees of Panama and Costa Rica. Princeton, NJ: Princeton University Press. [Google Scholar]

[jkab010-B17] Cruz F, Julca I, Gómez-Garrido J, Loska D, Marcet-Houben M, et al. 2016. Genome sequence of the olive tree, Olea europaea. GigaScience. 5:29. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B18] Dick CW, Pennington RT. 2019. History and geography of Neotropical tree diversity. Annu Rev Ecol Evol Syst. 50:279–301. [Google Scholar]

[jkab010-B19] English AC, Richards S, Han Y, Wang M, Vee V, et al. 2012. Mind the gap: upgrading genomes with pacific biosciences RS long-read sequencing technology. PLoS One 7:e47768. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B20] Fetter KC, Gugger PF, 2017. Landscape genomics of angiosperm trees: from historic roots to discovering new branches of adaptive evolution. In: Keller SR, Groover A, Cronk Q, editors. Comparative and Evolutionary Genomics of Angiosperm Trees. Cham: Springer International Publishing. p. 303–333. [Google Scholar]

[jkab010-B21] Gentry AH. 1974. Flowering phenology and diversity in tropical bignoniaceae. Biotropica. 6:64–68. [Google Scholar]

[jkab010-B22] Gentry AH. 1976. Bignoniaceae of southern central America: distribution and ecological specificity. Biotropica. 8:117–131. [Google Scholar]

[jkab010-B23] Gentry AH. 1992. Bignoniaceae: Part II (Tribe Tecomeae). Flora Neotropica. 25:1–370. [Google Scholar]

[jkab010-B24] Grose SO, Olmstead RG. 2007a. Evolution of a charismatic neotropical clade: molecular phylogeny of tabebuia s. l., crescentieae, and allied genera (Bignoniaceae). Syst Botany. 32:650–659. [Google Scholar]

[jkab010-B25] Grose SO, Olmstead RG. 2007b. Taxonomic revisions in the polyphyletic genus tabebuia s. l. (Bignoniaceae). Syst Botany. 32:660–670. [Google Scholar]

[jkab010-B26] Hamrick JL. 2004. Response of forest trees to global environmental changes. Forest Ecol Manage. 197:323–335. [Google Scholar]

[jkab010-B27] Hamrick JL, Murawski DA. 1991. Levels of allozyme diversity in populations of uncommon neotropical tree species. J Trop Ecol. 7:395–399. [Google Scholar]

[jkab010-B28] Hubbell SP. 1979. Tree dispersion, abundance, and diversity in a tropical dry forest. Science. 203:1299–1309. [DOI] [PubMed] [Google Scholar]

[jkab010-B29] Jones FA, Comita LS. 2008. Neighbourhood density and genetic relatedness interact to determine fruit set and abortion rates in a continuous tropical tree population. Proc Biol Sci. 275:2759–2767. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B30] Jones FA, Hubbell SP. 2006. Demographic spatial genetic structure of the neotropical tree, Jacaranda copaia: demographic genetics of Jacaranda copaia. Mol Ecol. 15:3205–3217. [DOI] [PubMed] [Google Scholar]

[jkab010-B31] Kawakami T, Mugal CF, Suh A, Nater A, Burri R, et al. 2017. Whole-genome patterns of linkage disequilibrium across flycatcher populations clarify the causes and consequences of fine-scale recombination rate variation in birds. Mol Ecol. 26:4158–4172. [DOI] [PubMed] [Google Scholar]

[jkab010-B32] Kellner JR, Albert LP, Burley JT, Cushman KC. 2019. The case for remote sensing of individual plants. Am J Bot. 106:1139–1142. [DOI] [PubMed] [Google Scholar]

[jkab010-B33] Kellner JR, Hubbell SP. 2017. Adult mortality in a low-density tree population using high-resolution remote sensing. Ecology. 98:1700–1709. [DOI] [PubMed] [Google Scholar]

[jkab010-B34] Kellner JR, Hubbell SP. 2018. Density-dependent adult recruitment in a low-density tropical tree. Proc Natl Acad Sci U S A. 115:11268–11273. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B35] Kolmogorov M, Armstrong J, Raney BJ, Streeter I, Dunn M, et al. 2018. Chromosome assembly of large and complex genomes using multiple references. Genome Res. 28:1720–1713. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B36] Kress WJ, Erickson DL, Jones FA, Swenson NG, Perez R, et al. 2009. Plant DNA barcodes and a community phylogeny of a tropical forest dynamics plot in Panama. Proc Natl Acad Sci U S A. 106:18621–18626. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B37] Laurance WF, Camargo JLC, Fearnside PM, Lovejoy TE, Williamson GB, et al. 2018. An Amazonian rainforest and its fragments as a laboratory of global change. Biol Rev. 93:223–247. [DOI] [PubMed] [Google Scholar]

[jkab010-B38] Leebens-Mack JH, Barker MS, Carpenter EJ, Deyholos MK, Gitzendanner MA, et al. 2019. One thousand plant transcriptomes and the phylogenomics of green plants. Nature. 574:679–685. [10.1038/s41586-019-1693-2] [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B39] Lobo-Segura J-A. 2019. Diversity of phenological patterns of Handroanthus ochraceus (Bignoniaceae) in Costa Rica. Rev Biol Trop. 67:S149–S158. [Google Scholar]

[jkab010-B40] Lohmann LG. 2006. Untangling the phylogeny of neotropical lianas (Bignonieae, Bignoniaceae). Am J Bot. 93:304–318. [DOI] [PubMed] [Google Scholar]

[jkab010-B41] Lohmann LG, Bell CD, Calió MF, Winkworth RC. 2013. Pattern and timing of biogeographical history in the Neotropical tribe Bignonieae (Bignoniaceae): Bignonieae Biogeography. Bot J Linn Soc. 171:154–170. [Google Scholar]

[jkab010-B42] Lowe A. 2005. Population genetics of neotropical trees focus issue. Heredity. 95:243–245. [DOI] [PubMed] [Google Scholar]

[jkab010-B43] Magallón S, Gómez‐Acevedo S, Sánchez‐Reyes LL, Hernández‐Hernández T. 2015. A metacalibrated time-tree documents the early rise of flowering plant phylogenetic diversity. New Phytol. 207:437–453. [DOI] [PubMed] [Google Scholar]

[jkab010-B44] Marçais G, Delcher AL, Phillippy AM, Coston R, Salzberg SL, et al. 2018. MUMmer4: a fast and versatile genome alignment system. PLoS Comput Biol. 14:e1005944. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B45] Mendes MG, de Oliveira AP, Oliveira PE, Bonetti AM, Sampaio DS. 2018. Sexual, apomictic and mixed populations in Handroanthus ochraceus (Bignoniaceae) polyploid complex. Plant Syst Evol. 304:817–829. [Google Scholar]

[jkab010-B46] Meyer-Sand BRV, Blanc-Jolivet C, Mader M, Paredes-Villanueva K, Tysklind N, et al. 2018. Development of a set of SNP markers for population genetics studies of Ipe (Handroanthus sp.), a valuable tree genus from Latin America. Conservation Genet Res. 10:779–781. [Google Scholar]

[jkab010-B47] Olmstead RG, Zjhra ML, Lohmann LG, Grose SO, Eckert AJ. 2009. A molecular phylogeny and classification of Bignoniaceae. Am J Botany. 96:1731–1743. [DOI] [PubMed] [Google Scholar]

[jkab010-B48] Phillips OL, Hall P, Gentry AH, Sawyer SA, Vasquez R. 1994. Dynamics and species richness of tropical rain forests. Proc Natl Acad Sci U S A. 91:2805–2809. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B49] Phillips OL, Lewis SL, Baker TR, Chao K-J, Higuchi N. 2008. The changing Amazon forest. Philos Trans R Soc B. 363:1819–1827. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B50] Pinheiro F, Dantas-Queiroz MV, Palma-Silva C. 2018. Plant species complexes as models to understand speciation and evolution: a review of south American studies. Crit Rev Plant Sci. 37:54–80. [Google Scholar]

[jkab010-B51] Plomion C, Aury J-M, Amselem J, Leroy T, Murat F, et al. 2018. Oak genome reveals facets of long lifespan. Nat Plants. 4:440–452. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B52] Plomion C, Bastien C, Bogeat-Triboulot M-B, Bouffier L, Déjardin A, et al. 2016. Forest tree genomics: 10 achievements from the past 10 years and future prospects. Ann Forest Sci. 73:77–103. [Google Scholar]

[jkab010-B53] Price AL, Jones NC, Pevzner PA. 2005. De novo identification of repeat families in large genomes. Bioinformatics. 21:i351–i358. [DOI] [PubMed] [Google Scholar]

[jkab010-B54] Putnam NH, O'Connell BL, Stites JC, Rice BJ, Blanchette M, et al. 2016. Chromosome-scale shotgun assembly using an in vitro method for long-range linkage. Genome Res. 26:342–350. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B55] Ragsac AC, Farias‐Singer R, Freitas LB, Lohmann LG, Olmstead RG. 2019. Phylogeny of the Neotropical tribe Jacarandeae (Bignoniaceae). Am J Bot. 106:1589–1601. [DOI] [PubMed] [Google Scholar]

[jkab010-B56] Reich PB, Borchert R. 1982. Phenology and ecophysiology of the tropical tree, tabebuia neochrysantha (Bignoniaceae). Ecology. 63:294–299. [Google Scholar]

[jkab010-B57] Rhie A, McCarthy SA, Fedrigo O, Damas J, Formenti G, et al. 2020. Towards complete and error-free genome assemblies of all vertebrate species. bioRxiv 05.22.110833. [DOI] [PMC free article] [PubMed]

[jkab010-B58] Schulze M, Grogan J, Uhl C, Lentini M, Vidal E. 2008. Evaluating ipê (Tabebuia, Bignoniaceae) logging in Amazonia: sustainable management or catalyst for forest degradation? Bio Conserv. 141:2071–2085. [Google Scholar]

[jkab010-B59] Scotti-Saintagne C, Dick CW, Caron H, Vendramin GG, Troispoux V, et al. 2013. Amazon diversification and cross-Andean dispersal of the widespread neotropical tree species Jacaranda copaia (Bignoniaceae). J Biogeography. 40:707–719. [Google Scholar]

[jkab010-B60] Sebbenn AM, Blanc-Jolivet C, Mader M, Meyer-Sand BRV, Paredes-Villanueva K, et al. 2019. Nuclear and plastidial SNP and INDEL markers for genetic tracking studies of Jacaranda copaia. Conserv Genet Res. 11:341–343. [Google Scholar]

[jkab010-B61] Silva-Junior OB, Grattapaglia D, Novaes E, Collevatti RG. 2018. Genome assembly of the Pink Ipê (Handroanthus impetiginosus, Bignoniaceae), a highly valued, ecologically keystone Neotropical timber forest tree. GigaScience. 7:16. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B62] Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. 2015. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 31:3210–3212. [DOI] [PubMed] [Google Scholar]

[jkab010-B63] Smit AFA, Hubley R. 2008-2015. RepeatModeler Open-1.0 < http://www.repeatmasker.org>.

[jkab010-B64] Smit AFA, Hubley R, Green P. 2013-2015. RepeatMasker Open-4.0 < http://www.repeatmasker.org>.

[jkab010-B65] Sork VL, Fitz-Gibbon ST, Puiu D, Crepeau M, Gugger PF, et al. 2016. First draft assembly and annotation of the genome of a California endemic oak Quercus lobata nee (Fagaceae). G3 (Bethesda). 6:3485–3495. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B66] Straub SCK, Parks M, Weitemier K, Fishbein M, Cronn RC, et al. 2012. Navigating the tip of the genomic iceberg: next-generation sequencing for plant systematics. Am J Bot. 99:349–364. [DOI] [PubMed] [Google Scholar]

[jkab010-B67] Tenaillon MI, Hufford MB, Gaut BS, Ross-Ibarra J. 2011. Genome size and transposable element content as determined by high-throughput sequencing in maize and Zea luxurians. Genome Biol Evol. 3:219–229. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B68] The Heliconius Genome Consortium. 2012. Butterfly genome reveals promiscuous exchange of mimicry adaptations among species. Nature. 487:94–98. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B69] Tuskan GA, Groover AT, Schmutz J, DiFazio SP, Myburg A, et al. 2018. Hardwood tree genomics: unlocking woody plant biology. Front Plant Sci. 9:1799. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B70] Vitorino LC, Lima-Ribeiro MS, Terribile LC, Collevatti RG. 2018. Demographical expansion of Handroanthus ochraceus in the Cerrado during the Quaternary: implications for the genetic diversity of neotropical trees. Bio J Linn Soc. 123:561–577. [Google Scholar]

[jkab010-B71] Vitte C, Bennetzen JL. 2006. Analysis of retrotransposon structural diversity uncovers properties and propensities in angiosperm genome evolution. Proc Natl Acad Sci U S A. 103:17638–17643. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B72] Wikström N, Savolainen V, Chase MW. 2001. Evolution of the angiosperms: calibrating the family tree. Proc Biol Sci. 268:2211–2220. [DOI] [PMC free article] [PubMed] [Google Scholar]

[jkab010-B73] Wright SJ, Kitajima K, Kraft NJB, Reich PB, Wright IJ, et al. 2010. Functional traits and the growth–mortality trade-off in tropical trees. Ecology. 91:3664–3674. [DOI] [PubMed] [Google Scholar]

[jkab010-B74] Zhang C, Zhang T, Luebert F, Xiang Y, Huang C-H, et al. 2020. Asterid phylogenomics/phylotranscriptomics uncover morphological evolutionary histories and support phylogenetic placement for numerous whole-genome duplications. Mol Bio Evol. 37:3188–3210. [DOI] [PubMed] [Google Scholar]

PERMALINK

Genome assemblies for two Neotropical trees: Jacaranda copaia and Handroanthus guayacan

John T Burley

James R Kellner

Stephen P Hubbell

Brant C Faircloth

Roles

Abstract

Introduction

Figure 1.

Figure 2.

Materials and methods

Tissue collection from vouchered specimens

Preliminary de novo assemblies using short-insert read libraries and the Meraculous assembler

Assembly improvement using Chicago read libraries and the HiRise scaffolding pipeline

Assessment of assembly contiguity and completeness

Identification and classification of repetitive elements

Genome alignment

Data availability

Results and discussion

The Jacaranda copaia genome

Table 1.

The Handroanthus guayacan genome

Assembly quality assessments

Table 2.

Repetitive sequences

Table 3.

Genome alignments

Table 4.

Figure 3.

Figure 4.

Outlook

Acknowledgments

Funding

Literature cited

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases