Abstract
Pyrosequencing was used to identify 133 isolates of clinically relevant non-dematiaceous yeasts. These included 97 ATCC strains (42 type strains), 7 UAMH strains, and 29 clinical isolates. Isolates belonged to the following genera: Candida (18 species), Trichosporon (10), Cryptococcus (7), Malassezia (3), Rhodotorula (2), Geotrichum (1), Blastoschizomyces (1) and Kodamaea (1). Amplicons were obtained of a hypervariable ITS region and analyzed using Pyrosequencing technology. The data were evaluated by a BLAST search against the GenBank database and correlated with data obtained by conventional cycle sequencing of the ITS1-5.8S-ITS2 region. Cycle sequencing identified (78.9%) if the isolates to the species level. Pyrosequencing technology identified 69.1%. In 90.1% of all the strains tested, identification results of both sequencing methods were identical. Most Candida isolates can be identified to the species level by pyrosequencing. Trichosporon species and some Cryptococcus cannot be differentiated at the species level. Pyrosequencing can be used for the reliable identification of most commonly isolated non-dematiaceous yeasts, with a reduction of cost per identification compared to conventional sequencing.
Keywords: Yeast, Candida, Pyrosequencing, Identification, ITS
Introduction
The increase in the population of severely immunocompromised patients has been associated with an increase of invasive fungal infections [1]. These infections remain a significant cause of morbidity and mortality in patients receiving chemotherapy for cancer or organ transplantation [1, 2]. Early detection of infection and identification of the etiologic agent permits the initiation of appropriate antifungal therapy, with reduced morbidity and increased possibility for survival [2, 3].
Currently, more than 300 fungal species have been recognized as potential human pathogens [4]. Although most episodes of invasive fungal infection are still caused by Candica albicans, the proportion of infections caused by non-C. albicans species has risen substantially [5]. The widespread use of antifungal agents may have contributed to a shift in species distribution via the emergence of inherently resistant species as significant pathogens [3, 5]. The increasing importance of these isolates as causes of life-threatening invasive fungal infections in high-risk patients has important diagnostic implications for the clinical laboratory. Timely identification of this increasing diversity of pathogens may be hampered by the limitation of current diagnostic methods. Conventional identification of yeasts relies upon a combination of morphological and biochemical features using tests that may require relatively long periods of incubation, are labor intensive and/or require subjective interpretation. Popular biochemical identification panels such as API 20C AUX or VITEK ID-YST (BioMerieux, Durham N.C) may have limited databases resulting in misidentification of some isolates [6–9]. Therefore, molecular methods offer an attractive, accurate and rapid alternative for the identification of pathogenic yeast.
Several platforms have been evaluated for identification of yeasts in which the rRNA genes are frequently the selected target [10, 11]. The availability of conserved and highly variable regions within the rRNA and its presence in multiple copies in fungal cells (Figure 1) allows high sensitivity as well as accurate molecular identification for most yeasts [10, 12, 13].
Figure 1. Diagram of the ribosomal DNA cluster of fungi and location of the primers used for this study.
The ribosomal DNA (rDNA) gene is a tandam array of at least 50–100 copies in the haploid genome of all fungi (7). Diagram based on one single gene repeat in the genome sequence of C . albicans SC5314(7,20).(LSU)Large ribosomal subunit,(SSU) Small ribosomal subunit, Intergenic transcribed spacer region 1 (ITSI), Intergenic transcribed spacer region 2 (ITS2). Small bars (ITS1,ITS4) refers to approximate location of the universal fungal primers(38).
Pyrosequencing® technology is a relatively inexpensive, easy to use method that employs coupled enzymatic reactions to determine the nucleotide composition of a target DNA during the polymerization reaction, transforming the concomitant release of pyrophosphate into visible light [14]. In comparison to other methods, Pyrosequencing technology is limited to short fragments but if the target is properly selected it may provide enough information for accurate identification of most clinically relevant yeasts. In this study the conventional cycle sequencing (Sanger) method was compared to Pyrosequencing® analysis for identification of yeasts associated with human disease.
Materials and Methods
Reference strains and clinical isolates
In the present study 133 isolates were used, Table 1 and 2 present the reference strains tested. Table 3 includes the clinical isolates tested. Lyophilized isolates, were cultured on Sabouraud plates and incubated at 30°C for 3–5 days before genomic DNA extraction. For the remaining strains genomic DNA was extracted directly from growth on Sabouraud agar slants. Biochemical assimilation tests were performed on some of the clinical isolates and reference strains that showed discrepant results between the ATCC identification and cycle sequencing results. All biochemical tests were performed using the API 20C AUX clinical yeast system, following the manufacturer’s recommendations (BioMerieux Durham N.C.).
Table 1.
Reference strains (83) showing concordant Pyrosequencing and cycle sequencing results
Identification (Synonyms) | ATCC/UAMH number |
---|---|
Blastoschizomyces capitatus (Dipodascus capitatus, Geotrichum capitatum) | ATCC 10663T, ATCC 62964 |
Candida albicans (Torulopsis albicans) | ATCC 18804T |
Candida catenulata (Candida ravautii) | ATCC 10565T, ATCC 20117, ATCC 22872, ATCC 42214, ATCC 200552 |
Candida dubliniensis | ATCC MYA-646T, ATCC MYA-180, ATCC MYA-579, ATCC MYA-582, ATCC MYA-583 |
Candida glabrata (Torulopsis glabrata) | ATCC 2001T |
Candida guilliermondii (Pichia guilliermondii) | ATCC 6260T |
Candida haemulonii (Torulopsis haemulonii) | ATCC 22991T |
Candida kefyr (Kluyveromyces marxianus) | ATCC 42265T, ATCC 90902, ATCC 200533, ATCC 20493, ATCC 64885 |
Candida krusei (Issatchenkia orientalis) | ATCC 6258T |
Candida lusitaniae (Clavispora lusitaniae) | ATCC 34449T |
Candida parapsilosis | ATCC 22019T |
Candida pelliculosa (Pichia anomala) | ATCC 8168T |
C. periphelosum (Debaryomyces hansenii, Candida famata) | ATCC 36239T |
Candida pintolopesii(Kazachstania/Arxiozyma telluris complex) | ATCC 22998T, ATCC 22987, ATCC 36231, ATCC 64177, ATCC 66058 |
Candida rugosa | ATCC 10571T, ATCC 200555, ATCC 38772, ATCC 58964, |
Candida tropicalis | ATCC 750T |
Candida zeylanoides (Candida krissii) | ATCC 7351T, ATCC 96278, ATCC 26899 |
Cryptococcus flavescens | ATCC 10668T, ATCC 26024. |
Cryptococcus gattii (C. neoformans var. Gatti) | ATCC 32269T, ATCC 5699, ATCC 32609 |
Cryptococcus neoformans (Filobasidiella neoformans, C. neoformans var. neoformans, C. neoformans var. grubbi ) | ATCC 32045T, ATCC 14248, ATCC 32267, ATCC 2505, ATCC 32268, ATCC 208821 |
Cryptococcus laurentii | ATCC 18803T, ATCC 34142, ATCC 36833 |
Cryptococcus luteolus | ATCC 32044T |
Cryptococcus uniguttulatus (Filobasidium uniguttulatum) | ATCC 32047T, ATCC 66033, ATCC 38298, ATCC 32048 |
Cryptococcus terreus | ATCC 11799T, ATCC 32046, ATCC 32422, ATCC 34145 |
Malassezia pachydermatis (Pityrosposporum pachydermatis) | ATCC 14522T |
Malassezia sympodialis (Pityrosporum sympodialis) | ATCC 96803T |
Rhodotorula glutinis (Rhodosporium diobovatum) | ATCC 16726T |
Rhodotorula mucilaginosa var. mucilaginosa | ATCC 11799T |
Trichosporon asahii | ATCC 90039T, UAMH 7655, UAMH 10278 |
Trichosporon asteroides | ATCC 90043T, UAMH 4261 |
Trichosporon coremiiforme | ATCC 90042T |
Trichosporon faecale | ATCC 90041T |
Trichosporon inkin | ATCC 18020T, UAMH 7664 |
Trichosporon jiroveci | ATCC 34499T |
Trichosporon moniliforme | ATCC 90045T |
Trichosporon mucoides | ATCC 90046T, |
Trichosporon ovoides | ATCC 90040T, UAMH 563, UAMH 635, UAMH 672 |
Trichosporon pullulans(Guehomyces pullulans) | ATCC 10677T |
Identification as provided by ATCC (American Type Culture Collection ) or UAMH (University of Alberta Microfungus Collection and Herbarium).
C. pintolopesii ATCC 22987 corresponds to the type strain of Candida bovina, synonym of Arxiozyma telluris and C. pintolopesii [33].
C. zeylanoides ATCC 96278 corresponds to the type strain of Pichia dubia, synonym of C. zeylanoides [33].
F. neoformans ATCC 32609 corresponds to the type strain of Filobasidiella bacillispora, synonym of Cryptococcus gatti [33].
C. neoformans ATCC 14248 corresponds to the type strain of Cryptococcus hondurianus, synonym of C. neoformans var. neoformans [33].
C. neoformans ATCC 2505 corresponds to the type strain of Torula nasalis, synonym of C. neoformans var. neoformans [33].
Table 2.
Reference strains (21) showing discordant identification results
Identification (Synonyms) | Biochemical identification or tests. | Cycle Sequencing | Pyrosequencing Identification | Comments |
---|---|---|---|---|
Candida lambica (Pichia fermentans) ATCC 24750T | Not performed | C. lambica | No signal | PCR products on initial step of Pyrosequencing analysis were observed but repeatedly no Pyrogram reports were obtained. |
Candida lipolytica (Yarrowia lipolytica) ATCC 18943T, ATCC 18942, ATCC 20776, ATCC 20496, ATCC 201089 | Not Performed | C. lipolytica | No signal | PCR products on initial step of Pyrosequencing analysis were observed but repeatedly no Pyrogram reports were obtained |
C. periphelosum ATCC 20314 ATCC 20317 | C. guilliermondii | C. guilliermondii | C. guilliermondii | |
Candida rugosa ATCC 96275 | C. rugosa | Candida pararugosa | No result | C. pararugosa have been recognized as colonizer of human oral flora but not associated with human pathogenesis [34] |
C. zeylanoides ATCC 15585 | C. guilliermondii | C. guilliermondii | C. guilliermondii | |
C. zeylanoides ATCC 20393 | Urease positive | C. lipolytica | No result | Urease positive test is consistent with the identification as C. lipolytica [33]. |
C. luteolus ATCC 42279 | No growth on API 20C AUX System | C. podzolicus | C. humicola | Pyrosequencing identification match with 97% identity C. humicola, whereas, cycle sequencing provide a sequencing that is more similar to C. podzolicus (93% identity) |
C. luteolus ATCC 42280 | Not performed | Bullera sp. | Bullera sp. | Both sequencing methods provide a result that match the same sequence on GenBank. However, the Pyrosequencing method had 90%, whereas sequencing was 99% identical. |
C. luteolus ATCC 44440 | No Ballistoconidia produced. | C. luteolus/C. zeae | Bullera sinensis | Bullera sinensis and Cryptococcus luteolus belong to a highy similar group defined by its ITS sequencing [35] |
C. uniguttulatus ATCC 34143 | Urease negative. | C. glabrata | C. glabrata | This isolate corresponds to a small yeast that forms shiny white colonies on Sabouraud agar, a Urease negative test is consistent with the identification as C. glabrata. |
Geotrichum candidum (Galactomyces geotrichum) ATCC 22601T, ATCC 10834, ATCC 74169, ATCC 204307 | Morphological features consistent with G. candidum | G. candidum | No signal | Geotrichum candidum (Galactomyces geotrichum) |
Kodamaea ohmeri (Pichia ohmeri) ATCC 26861T | Not performed | K. ohmeri | No signal | PCR products on initial step of Pyrosequencing analysis were observed but repeatedly, no Pyrogram reports were obtained. |
M. furfur ATCC 96809 | Not performed | M. slooffliae | M. slooffiae | M. furfur ATCC 96809 corresponds to the strain CBS 7875, sequencing of the 26rDNA confirms identification as M. slooffiae [36, 37]. |
Identification as provided by ATCC (American Type Culture Collection ) or UAMH (University of Alberta Microfungus Collection and Herbarium).
Synonyms based on NCBI Taxonomy Browser and The Atlas of Clinical Fungi [33]
Table 3.
Clinical isolate identificationa
Biochemical Identificationb | Number of isolates | Cycle Sequencing Identification | Pyrosequencing Identification |
---|---|---|---|
C. albicans | 10 | C. albicans | C. albicans |
C. glabrata | 5 | C. glabrata | C. glabrata |
C. guilliermondii | 3 | C. guilliermondii | C. guilliermondii |
C. haemulonii | 1 | C. haemulonii | C. haemulonii |
C. lusitaniae | 3 | C. lusitaniae | C. lusitaniae |
C. parapsilosis | 3 | C. parapsilosis | C. parapsilosis species complexc |
C. parapsilosis | 2 | C. orthopsilosis | C. parapsilosis species complexc |
C. tropicalis | 2 | C. tropicalis | C. tropicalis |
Clinical isolates obtained from patients being treated at the Warren G. Magnuson Clinical Center, National Institutes of Health.
Biochemical carbohydrate assimilation tests performed with API 20C AUX kit (BioMerieux, Durham, NC).
C. parapsilosis species complex consists of a set of newly defined species C. parapsilosis, C. orthopsilosis and C. metapsilosis [17].
Cycle Sequencing
Fungal DNA was extracted using the UltraClean Microbial DNA Isolation kit (MoBio Laboratories, Solana Beach, CA) following manufacturer’s recommendations for yeast genomic DNA extraction. Amplification of the ITS region (ITS1-5.8S-ITS2) was performed by using 1 pmol of the ITS1 (5′-TCCGTAGGTGAACCTGCGG-3′) and ITS4 (5′-TCCTCCGCTTATTGATATGC-3′) primers [15], 0.1 ig of extracted DNA and ultrapure water to a final volume of 25 il with PuReTaq Ready-To-Go PCR beads (GE Healthcare, Fairfield, CT). The DNA was denatured at 95°C for 5 min and then subjected to 35 cycles of amplification (95°C for 1 min, 55°C for 1 min, and 72°C for 1 min), followed by a 10 min extension at 72°C. The presence of PCR products was confirmed by gel electrophoresis. Purification of the PCR product was performed using a Microcon YM-100 centrifugation filter device (Millipore Corporation, Bedford, MA.). Amplicons were then sequenced using the ITS1 and ITS4 primers and the Big Dye Terminator v 1.1 cycle sequencing kit (PerkinElmer Applied Biosystems, Foster City, CA). Excess dye terminators were removed using CleanSeq magnetic particles according to the guidelines of the manufacturer (Agencourt, Beverly MA). Fluorescence-based sequence analysis of the extension products was performed with the ABI 3100 genetic analyzer (Applied Biosystems/Hitachi, Foster City, CA). The resulting sequences were assembled using Lasergene SeqMan II software (DNA Star, Inc., Madison, WI), and compared to GenBank nucleotide sequences by using the BLASTN program (NCBI, Bethesda, MD). Sequences were aligned using the CLUSTAL W algorithm and phylogenetic trees were constructed using the Lasergene MegAlign software (DNASTAR Inc., Madison, WI).
Pyrosequencing analysis
PCR was performed using PuReTaq Ready-To-Go PCR beads (GE Healthcare, Fairfield, CT) and primers included in the PyroMark™ Fungi ASR (Biotage, Charlottesville, VA). The PCR cycle included a 5 min. initial denaturation and 35 cycles (95°C for 30 sec, 58°C for 30 sec, and 72°C for 30 sec), followed by a 5 min. extension at 72°C. PCR products were then confirmed by gel electrophoresis. Biotinylated amplicons were purified using streptavidin coated sepharose beads (GE Healthcare, Fairfield, CT) and purification was performed using the Vacuum Prep Workstation and PSQ® Sample preparation kit following a standard protocol (Biotage, Charlottesville, VA), with annealing at a final sequencing primer concentration of 0.4 μM. Pyro Gold reagents were loaded into the PSQ reagent cartridge in a PyroMark ID unit (Biotage, Charlottesville, VA) for sequencing using 15 dispensation cycles (CTGA). The primer Seq2Pyro (5′-TAGTCTTGCGTGATGTGAGG-3′), not included in the PyroMark Fungi ASR, was used for sequence identification of C. catenulata and B. capitatus. Regardless of the quality of the PyrogramTM assessed by the PyroMark ID software, all Pyrogram reports generated were submitted as BLAST queries through GenBank (NCBI, Bethesda, MD). One inherent limitation of the Pyrosequencing technology is the determination of the number of incorporated nucleotides in homopolymeric regions. A non-linear light output resulting from the sequential incorporation of more than five to six identical nucleotides may require adjustments of the software algorithms used for the integration of light intensities [16]. Therefore, visual interpretation of Pyrogram reports targeting these regions is recommended. Nonetheless, no “visual reading” adjustment was performed for any of the sequences obtained. Pyrogram reports that failed to provide an identifiable signal were annotated as “no signal” (Table 3).
Results
For all of the species successfully identified within our samples, sequencing of 20–30 bases of the selected ITS2 region (Figure 1) was sufficient for differentiation. A summary of the concordant and discordant identification results on all of the reference strains is presented in Table 1 and Table 2, respectively. Results of the identification of the clinical isolates included in this study are presented in Table 3. Cycle sequencing of two of the clinical isolates identified by biochemical methods as C. parapsilosis provided sequences homologous to C. parapsilosis and C. orthopsilosis entries in GenBank. C. orthopsilosis is a newly defined species within the C. parapsilosis species complex [17]. However, differentiation within this species complex may be clinically irrelevant [17, 18].
Of the 133 clinical isolates and reference strains used in this study (43 species), a total of 119 (38 species) resulted in interpretable Pyrogram reports. No signal was generated for 14 strains (6 species) and correspond to isolates of C. lambica, Geotrichum candidum, Kodamaea ohmeri, C. lipolytica, C. zeylanoides 20393, and C. rugosa ATCC 96275. With the exception of Geotrichum candidum, PCR amplicons were successfully generated for these 5 species without any apparent reduction in yield or additional bands.
Our results using Pyrosequencing technology are presented in Table 4 and include the first 30 base pairs of all clinical isolates or reference strains used in this study. A total of 54 sequences that represent 38 different species are presented. All of these species were identified at the genus level and 27 of them at the species level. Neither method targeting the ITS region was able to identify at the species level the remaining 11 species. These species consist of all Trichosporon species (except T. pullulans), all isolates from the C. neoformans species complex and some strains from the Cryptococcus luteolus/Bullera species group.
Discussion
In this work we present an evaluation of Pyrosequencing technology as a tool for the identification of the most commonly isolated non-dematiaceous yeasts [5, 19]. The aim of the study was to compare the performance of this technology with the conventional cycle sequencing method. The targeted regions selected for these methods differ greatly in size, ITS1-5.8S-ITS2 (400–900bp) for cycle sequencing versus a variable region of size limited by the Pyrosequencing technical constraints (<60 bp). A previous study showed that conventional sequencing of the ITS2 region of 373 reference and clinical yeast isolates (86 species, mostly Candida species) allowed the identification 98.8% of the species tested [13]. Here, the resolving power of this hyper-variable ITS2 region is further evaluated by selecting a small 30 bp region for identification of yeasts by Pyrosequencing technology.
Previous reports have shown the utility of Pyrosequencing analysis for the identification of clinically relevant yeasts [20, 21]. Those studies however, were limited to not more than 5 different species. In our study targeting a different region, we were able to extend this approach and identify 27 different yeast species to the species level and 11 more to the genus level (Tables 1, 2 and 3). Taking into consideration recent reports on global trends of species distribution of invasive candidiasis [5, 22]; we estimate that this test is potentially capable of identifying 95% of all Candida clinicalisolates to the species level. However, we found some limitations. Although our primer sets generate suitable PCR products from C. lipolytica, C. lambica, G. candidum, Kodamaea ohmeri, C. zeylanoides 20393, C. rugosa ATCC 96275, identification by the Pyrosequencing method was not possible. Problems of annealing between the primers used for sequencing and the target rDNA sequence of these strains may explain the absence of signal when Pyrosequencing analysis is performed.
As described previously, this assay is unable to identify to the species level members of the Cryptococcus neoformans species complex, Cryptococcus luteolus/Bullera species group, and most Trichosporon species previously grouped within Trichosporon beigelli [23]. The Cryptococcus neoformans species complex, is now considered to be conformed of two species, C. neoformans (serogroups A & D) and Cryptococcus gattii (serogroups B and C) [24, 25]. However the clinical relevance of this differentiation has not been firmly established [25, 26]. If differentiation within the C. neoformans sp. complex is warranted, Pyrosequencing analysis or cycle sequencing of the rDNA will not adequately resolve these species unless sequencing of a “fast-evolving” region (such as the intergenic spacer region-IGS) is used for this purpose [24, 27].
A similar problem was observed within Trichosporon, a group that has recently undergone an extensive taxonomic re-evaluation. Sequencing the ITS region distinguishes three clades [23]. As shown in Table 4, the Pyrosequencing results of species in Clade I ( T. jiroveci, T. moniliforme, and T. mucoides ) and Clade III (T. asahii, T. asteroides, T. coremiiforme, T. faecale, T. ovoides and T. inkin ) possesses unique sequence signatures [23]. Our strain collection did not include any member of Clade II (e.g. T. domesticum, T. montevideense). However, using bioinformatics methods and sequences publicly available in GenBank, we found that T. pullulans yielded a Pyrogram report that was distinguishable from all clades. However this species now renamed Guehomyces pullulans [28] should not be considered a significant infectious agent in humans [29].
Table 4.
Identification results using Pyrosequencing technology
N | Sequence (First 30 bp) | Instances | Identification | Matcha |
---|---|---|---|---|
2 | TGCAAATAGTTAAAGTAGTTAACTAAAAGA | 2 | Blastoschizomyces capitatus (Dipodascus capitatus) | 100% |
11 | GTCAAAGTTTGAAGATATACGTGGTGGACG | 8 | Candida albicans | 100% |
GTCAAAGTTTGAAGATATACGTGGTAGACG | 3 | Candida albicans | 100% | |
5 | GAAAAAGTATGGTTGTGAGTAAACTGTAAT | 3 | Candida catenulata | 100% |
GAAAAAGTATGGTTGTGAGTAAACTGATAA | 2 | Candida catenulata | 97% | |
5 | GTCAAAGTTTGAAGAATAAAATGGGCGACG | 5 | Candida dubliniensis | 100% |
6 | GTCAAACTTAAAGACGTCTGTCTGCCCAGC | 4 | Candida glabrata | 100% |
GTCAAAACTTAAAGACGTCTGTCTGCCCAG | 3 | Candida glabrata | 97% | |
6 | GTCAAACTTGTTTGGTTGTTGTAAGGCCCG | 4 | Candida guilliermondii | 100% |
GTCAAGCTTGTTTGGTTGTTGTAAGGGCCC | 3 | Candida guilliermondii | 97% | |
2 | GGGAAAAAGCAGTGAAAGCGTAGACTTCAC | 1 | Candida haemulonii | 100% |
GAAAATAGCATGGTGCAGTGAAAGCGTAGA | 1 | Candida haemulonii | 100% | |
5 | GTCAAACTTTGAGATGTTTTGGTTAAAGCC | 5 | Candida kefyr (Klyveromyces marxianus) | 100% |
1 | GTCGAGCTTTTTGTTGTCTCGCAACACTCG | 1 | Candida krusei (Issatchenkia orientalis) | 100% |
1 | No signal | 1 | Not available (Candida lambica) | (n/a) |
6 | No signal | 6 | Not available (Candida lipolytica) | (n/a) |
4 | GGCGAAATGTCGTGCTGTAAACGTGCTTGA | 3 | Candida lusitaniae (Clavispora lusitaniae) | 100% |
GGCGAAAATGTCGTGCTGTAAACGTGCTTG | 1 | Candida lusitaniae (Clavispora lusitaniae) | 97% | |
6 | GTCGAATTTGGAAGAAGTTTTGGAGTTTGT | 4 | Candida parapsilosis complex | 100% |
GTCGAATTTGGAAGAATTTTGGAGTTGTAC | 2 | Candida parapsilosis complex | 100% | |
1 | No signal | 1 | Not available (Candida pararugosa) | (n/a) |
5 | GTCAAACTTAAAAGAACACTGTTCGCCACG | 3 | Candida pintolopesii (Kazachstania telluris) | 100% |
GTCAAAACTTAAAAGAACACTGTTCGCCAC | 2 | Candida pintolopesii (Kazachstania telluris) | 97% | |
1 | GTCAAACTTGTTTGTTATATTGTAAGGCCG | 1 | Candida periphelosum (Debaryomyces hansenii) | 100% |
4 | AATAAGATCAAGAGTCTGTAACAAGCTTAA | 2 | Candida rugosa | 100% |
AATAAGATCGTGAGTCTGTAACAAGCTTAA | 2 | Candida rugosa | 100% | |
3 | GTCAAAGTTATGAAATAAATTGTGGTGGCC | 2 | Candida tropicalis | 100% |
GTCAAAGTTATGAAAATAAATTGTGGTGGC | 1 | Candida tropicalis | 97% | |
4 | GTCAAACTTTGTTTGTTGTTGTAAGGCCGA | 4 | Candida zeylanoides | 100% |
3 | GTCAGAGTCTAAAGGCGCGATGGCAGGTTA | 3 | Cryptococcus laurentii | 97% |
1 | GTCTAACATAAAGGTTGTTAGAGTGGAGGG | 1 | Cryptococcus luteolus | 100% |
2 | GTCAAAAGTCAAAAGGTGCGCGATGGCAGG | 2 | Cryptococcus flavescens | 97% |
9 | GTCAAACAAAAAGAGATGGTTGTTATCAGC | 1 | Cryptococcus neoformans sp. complex | 100% |
GTCAAAACAAAAAAGAGATGGTTGTTATCA | 1 | Cryptococcus neoformans sp. complex | 97% | |
GTCAAAACAAAAAGAGATGGTTGTTATCAG | 4 | Cryptococcus neoformans sp. complex | 97% | |
GTCAAACAAAAGAGATGGTTGTTATCAGCA | 2 | Cryptococcus neoformans sp. complex | 93% | |
GTCAAAACAAAAGAGATGGTTGTTATCAGC | 1 | Cryptococcus neoformans sp. complex | 93% | |
4 | GCCAGATAATAAAAAGTCATGTCCACGAGG | 2 | Cryptococcus terreus | 100% |
GCCAGATAATAAAAAAGTCATGTCCCACGA | 2 | Cryptococcus terreus | 97% | |
4 | GTCAGATGTCAAAGTATACACAGGAAGCAA | 2 | Cryptococcus uniguttulatus | 100% |
GTCAGATGTCAAAGTATCACAGGAAGCAAA | 2 | Cryptococcus uniguttulatus | 97% | |
4 | No signal | 4 | Not available (Geotrichum candidum) | (n/a) |
1 | No signal | 1 | Not available (Kodamea ohmeri) | (n/a) |
1 | GCCAGAAGTGAAGAGAGGCTGTGCGACTTC | 1 | M. slooffiae | 97% |
1 | ACCAGAATTGAAAAAGGAATGATACTTTTC | 1 | M. pachydermatis | 100% |
1 | ACCAGAATTAAAAAAGGGCTGGGTGAAAGA | 1 | M. sympodialis | 93% |
1 | GTCAAAACTTTTAGTTTATTGTTGTTAAGC | 1 | P. anomala | 100% |
1 | ATCTAATCTTAAAATGTAGACATTCTGATT | 1 | R. mucilaginosa | 100% |
1 | ATCTAATCAAAAATAGACTTTGAGGATTAG | 1 | R. diobovatum | 97% |
4 | GCCAGAGTCAAAGTAATTGTCCTTGCGGAC | 3 | Trichosporon group (ATCC: Trichosporon asahii) | 100% |
3 | GCCAGAGTCAAAGTAATTGTCCTTGCGGAC | 2 | Trichosporon group (ATCC: Trichosporon asteroides) | 100% |
2 | GCCAGAGTCAAAGTAATTGTCCTTGCGGAC | 1 | Trichosporon group (ATCC: Trichosporon coremiiforme) | 100% |
1 | GCCAGAGTCAAAGTAATTGTCCTTGCGGAC | 1 | Trichosporon group (ATCC: Trichosporon faecale) | 100% |
4 | GCCAGAGTCAAAGTAATTGTCCTTGCGGAC | 2 | Trichosporon group (ATCC: Trichosporon inkin) | 100% |
1 | GCCAGAGTCAAAGTAATTGTCCTTGCGGAC | 1 | Trichosporon group (ATCC: Trichosporon jirovecii) | 100% |
5 | GCCAGAGTCAAAGTAATTGTCCTTGCGGAC | 4 | Trichosporon group (ATCC: Trichosporon ovoides) | 100% |
1 | GCCAGATCATAAAAAATTGCCCAAGGGCGG | 1 | Trichosporon pullulans (Guehomyces pullulans) | 100% |
1 | ACCAGAGTTCAAGAATTGTCCGAAGACGAT | 1 | Trichosporon group (ATCC: Trichosporon moniliforme) | 100% |
1 | ACCAGAGTTCAAGAATTGTCCGAAGACGAT | 1 | Trichosporon group (ATCC: Trichosporon mucoides) | 100% |
Match in direct proportion of number of bases: Match of 100% indicates perfect agreement of the first 30 bp to a sequence previously submitted to GenBank.
Insertion or deletion of a base is considered as a single mismatch.
The Cryptococcus laurentii species complex (15 species) can be divided into two phylogenetically distinct groups [30]. Based on the sequences determined by Takashima et al. [30], our assay can differentiate among all the species currently included withinphylogenetic group I (e.g. C. flavescens, C. aureus, C. laurentii ). However, differences in the target annealing sequences of the sequencing primers (results not shown) prevents obtaining evaluable sequence data results from any member of phylogenetic group II (e.g. C. carnescens, C. peneaus, C. victoriae); These species do not appear to be clinically relevant and are primarily associated with marine or soil environments [30].
One of the unanticipated outcomes of our study is the finding of discrepancies between the species designations of some of the ATCC isolates and the biochemical and molecular identification obtained by the methods employed here (Table 2). These discrepancies are especially evident with regard to the identification of the reference strains of Cryptococcus luteolus. The type strain C. luteolus ATCC 32044 was identified by both sequencing methods as C. luteolus. However, all other “C. luteolus” strains (ATCC 42279, ATCC 42280, ATCC 44440) were not identified satisfactorily at the species level by either sequencing method (Table 3). It is unclear if these isolates belong to the species C. luteolus.
One of the technical limitations of Pyrosequencing technology is the accurate resolution of long homopolymers. Unfortunately, the region targeted in our assay included some of theses sequences. Examples of these homopolymers are shown for the sequences obtained from C. terreus, M. sympodialis, C. catenulata and C. neoformans (Table 4). The presence of homopolymers in the target region of C. neoformans increases the number of expected polymorphisms from two (according to cycle sequencing and GenBank databases) to five. Homopolymers are also present in the target sequence of Cryptococcus terreus, based on the current GenBank databases. Two polymorphisms are expected within this region. However, in two of the reference strains tested, the Pyrosequencing result obtained was different by one nucleotide deletion from either one of these polymorphisms.
It has been previously estimated that DNA polymerase in the presence of apyrase (the enzyme that eliminates excess deoxynucleotides during Pyrosequencing analysis), is capable of incorporating up to 10 identical adjacent nucleotides within a homopolymeric region [31]. However, in the experimental conditions employed, it seems that Pyrosequencing analysis may not reliably define the homopolymeric regions of more than 3 consecutive adenine bases in certain sequences (C. neoformans sequence, Table 4). However, adenine homopolymers may be resolved if they are located at the beginning of the region to be sequenced (C. catenulata sequence, Table 4). A condition that accentuates this problem is the excess of template (PCR product) that may enhance the non-synchronized extension of DNA fragments. Some references suggest that, upon re-sequencing, it is possible to use a dispensation cycle in which the problem homopolymeric nucleotide is added twice to ensure complete polymerization and reduction of these non-synchronized DNA fragments [16]. However, the utility of this approach has to be determined empirically based on the sequence of the most frequent clinical isolates.
In four cases (Table 4, isolates with 93% similarity), 2 unexpected nucleotide differences within the first 30 sequenced bases were observed in the following strains: M. sympodialis ATCC 96803, Cryptococcus gatti ATCC 32609, Cryptococcus neoformans ATCC 32045 and Cryptococcus neoformans ATCC 32268. In each case, BLASTN searches of the obtained sequences in GenBank resulted in correct identification with a 100% match. However, BLASTN obtained these results from portions of the submitted Pyrosequencing results excluding sections with problems of deletion or addition of bases within the homopolymeric regions.
It is important to emphasize that not all polymorphisms detected correspond to artifacts. In some instances, the intra-species variability observed was corroborated by conventional sequencing (e.g. C. haemulonii, C. parapsilosis, C. laurentii, C. luteolus, C. neoformans, C. terreus). Further optimization of the conditions tested for Pyrosequencing analysis (for instance, controlling the amount of template used per reaction) might reduce non-synchronized DNA polymerization associated with some of the sequence polymorphisms observed here.
The routine utilization of a molecular approach for the identification of clinically relevant yeasts will allow an accurate identification of the isolates and will provide a more reliable correlation of possible differences in clinical outcome virulence and epidemiology. The proposed identification approach provides an alternative tool for the differentiation between the closely related C. albicans and C. dubliniensis. C. dubliniensis was described as being more prevalent among immunocompromised and HIV infected populations [32].
In our experience, when compared to sequencing methods, the use of carbohydrate assimilation kits for identification of yeasts is less reliable. When compared to conventional cycle sequencing, the implementation of Pyrosequencing analysis in the molecular identification scheme for yeasts is associated with a reduction in processing time of at least 4 hours per batch. When compared to “in house” sequencing assays, reactive cost per reaction (including cost of PyroMark Fungi ASR kit) is reduced by 13%. However, if the sample preparation buffers are prepared in the laboratory, the reactive cost per reaction is reduced by 46%.
The present approach will allow the identification to the species level of the majority of clinical yeast isolates (>95%). However, similar to the limitations of cycle sequencing of the ITS region, the Pyrosequencing assay evaluated here will not differentiate among species included within the Cryptococcus neoformans species complex and the Trichosporon species complex. Additionally, our Pyrosequencing approach was not able to obtain any sequence information for five species. However, these species may probably represent less than 1% of all clinical yeast isolates in the majority of clinical laboratories [5, 22]. They account for not more than 0.06% of the clinical isolates identified in our institution since 2001 (data not shown).
The workflow, workload and financial resources available in each clinical laboratory will determine if DNA-based methods of identification could be used. Pyrosequencing technology represents a rapid and less expensive sequencing alternative that can be implemented in a clinical laboratory by technologists trained in conventional sequencing. The cost advantage of pyrosequencing is maximized when a high volume of samples is processed; therefore, standardization of targets and methodologies for the identification of different groups of bacteria, molds and yeasts will favor the decision to incorporate this technology into clinical laboratory settings.
The present study demonstrates an alternative and relatively inexpensive approach for sequencing-based identification of non-dematiaceous yeasts. A database of reference strains is provided (Table 4) that can be uploaded into the Pyrosequencing® identification software (IdentiFire™ Software) for the rapid implementation of this technology. The use of the database and the IdentiFire Software will allow the identification to the species or genus level of the vast majority of non-dematiaceous yeast of clinical significance.
Acknowledgments
We thank Patricia S. Conville, Department of Laboratory Medicine, Warren G. Magnuson Clinical Center, NIH, for her technical assistance. This research was supported by the Intramural Research Program of the NIH Clinical Center. The views expressed herein are those of the authors and should not be construed as those of the U.S. Department of Health and Human Services.
References
- 1.Walsh TJ, Groll A, Hiemenz J, Fleming R, Roilides E, Anaissie E. Infections due to emerging and uncommon medically important fungal pathogens. Clin Microbiol Infect. 2004;10(Suppl 1):48–66. doi: 10.1111/j.1470-9465.2004.00839.x. [DOI] [PubMed] [Google Scholar]
- 2.Vartivarian SE, Anaissie EJ, Bodey GP. Emerging fungal pathogens in immunocompromised patients: classification, diagnosis, and management. Clin Infect Dis. 1993;17(Suppl 2):S487–S491. doi: 10.1093/clinids/17.supplement_2.s487. [DOI] [PubMed] [Google Scholar]
- 3.Aperis G, Myriounis N, Spanakis EK, Mylonakis E. Developments in the treatment of candidiasis: more choices and new challenges. Expert Opin Investig Drugs. 2006;15:1319–1336. doi: 10.1517/13543784.15.11.1319. [DOI] [PubMed] [Google Scholar]
- 4.Taylor LH, Latham SM, Woolhouse ME. Risk factors for human disease emergence. Philos Trans R Soc Lond B Biol Sci. 2001;356:983–989. doi: 10.1098/rstb.2001.0888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pfaller MA, Diekema DJ. Epidemiology of invasive candidiasis: a persistent public health problem. Clin Microbiol Rev. 2007;20:133–163. doi: 10.1128/CMR.00029-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Bhally HS, Jain S, Shields C, Halsey N, Cristofalo E, Merz WG. Infection in a neonate caused by Pichia fabianii: importance of molecular identification. Med Mycol. 2006;44:185–187. doi: 10.1080/13693780500160793. [DOI] [PubMed] [Google Scholar]
- 7.Ramani R, Gromadzki S, Pincus DH, Salkin IF, Chaturvedi V. Efficacy of API 20C and ID 32C systems for identification of common and rare clinical yeast isolates. J Clin Microbiol. 1998;36:3396–3398. doi: 10.1128/jcm.36.11.3396-3398.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Coignard C, Hurst SF, Benjamin LE, Brandt ME, Warnock DW, Morrison CJ. Resolution of discrepant results for Candida species identification by using DNA probes. J Clin Microbiol. 2004;42:858–861. doi: 10.1128/JCM.42.2.858-861.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Massonet C, Van Eldere J, Vaneechoutte M, De Baere T, Verhaegen J, Lagrou K. Comparison of VITEK 2 with ITS2-fragment length polymorphism analysis for identification of yeast species. J Clin Microbiol. 2004;42:2209–2011. doi: 10.1128/JCM.42.5.2209-2211.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Chen SC, Halliday CL, Meyer W. A review of nucleic acid-based diagnostic tests for systemic mycoses with an emphasis on polymerase chain reaction-based assays. Med Mycol. 2002;40:333–357. doi: 10.1080/mmy.40.4.333.357. [DOI] [PubMed] [Google Scholar]
- 11.Summerbell RC, et al. Microcoding: the second step in DNA barcoding. Philos Trans R Soc Lond B Biol Sci. 2005;360:1897–1903. doi: 10.1098/rstb.2005.1721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Linton CJ, et al. Molecular identification of unusual pathogenic yeast isolates by large ribosomal subunit gene sequencing: 2 years of experience at the United Kingdom mycology reference laboratory. J Clin Microbiol. 2007;45:1152–1158. doi: 10.1128/JCM.02061-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Leaw SN, Chang HC, Sun HF, Barton R, Bouchara JP, Chang TC. Identification of medically important yeast species by sequence analysis of the internal transcribed spacer regions. J Clin Microbiol. 2006;44:693–699. doi: 10.1128/JCM.44.3.693-699.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ahmadian A, Ehn M, Hober S. Pyrosequencing: history, biochemistry and future. Clin Chim Acta. 2006;363:83–94. doi: 10.1016/j.cccn.2005.04.038. [DOI] [PubMed] [Google Scholar]
- 15.White T, Bruns T, Lee S, Taylor J. Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics. In: Innis MA, Gelfand DH, Sninsky JJ, White TJ, editors. PCR protocols: a guide to methods and applications. Academic Press San Diego; California: 1990. [Google Scholar]
- 16.Fakhrai-Rad H, Pourmand N, Ronaghi M. Pyrosequencing: an accurate detection platform for single nucleotide polymorphisms. Hum Mutat. 2002;19:479–485. doi: 10.1002/humu.10078. [DOI] [PubMed] [Google Scholar]
- 17.Tavanti A, Davidson AD, Gow NAR, Maiden MCJ, Odds FC. Candida orthopsilosis and Candida metapsilosis spp. nov to replace Candida parapsilosis groups II and III. Journal of Clinical Microbiology. 2005;43:284–292. doi: 10.1128/JCM.43.1.284-292.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Gacser A, Schafer W, Nosanchuk J, Salomon S, Nosanchuk J. Virulence of Candida parapsilosis, C. orthopsilosis and C. metapsilosis in reconstituted human tissue models. Fungal Genetics. 2007 doi: 10.1016/j.fgb.2007.02.002. Article in press. [DOI] [PubMed] [Google Scholar]
- 19.Hazen KC. New and emerging yeast pathogens. Clin Microbiol Rev. 1995;8:462–478. doi: 10.1128/cmr.8.4.462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Gharizadeh B, et al. Identification of medically important fungi by the Pyrosequencing technology. Mycoses. 2004;47:29–33. doi: 10.1046/j.1439-0507.2003.00949.x. [DOI] [PubMed] [Google Scholar]
- 21.Trama JP, Mordechai E, Adelson ME. Detection and identification of Candida species associated with Candida vaginitis by real-time PCR and pyrosequencing. Mol Cell Probes. 2005;19:145–152. doi: 10.1016/j.mcp.2004.10.004. [DOI] [PubMed] [Google Scholar]
- 22.Pfaller MA, et al. Results from the ARTEMIS DISK Global Antifungal Surveillance study, 1997 to 2005: an 8.5-year analysis of susceptibilities of Candida species and other yeast species to fluconazole and voriconazole determined by CLSI standardized disk diffusion testing. J Clin Microbiol. 2007;45:1735–1745. doi: 10.1128/JCM.00409-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Rodriguez-Tudela JL, et al. Susceptibility patterns and molecular identification of Trichosporon species. Antimicrob Agents Chemother. 2005;49:4026–4034. doi: 10.1128/AAC.49.10.4026-4034.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Diaz MR, Fell JW. Use of a suspension array for rapid identification of the varieties and genotypes of the Cryptococcus neoformans species complex. J Clin Microbiol. 2005;43:3662–3672. doi: 10.1128/JCM.43.8.3662-3672.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kwon-Chung KJ, Varma A. Do major species concepts support one, two or more species within Cryptococcus neoformans? FEMS Yeast Res. 2006;6:574–587. doi: 10.1111/j.1567-1364.2006.00088.x. [DOI] [PubMed] [Google Scholar]
- 26.Hazen KC, Howell SA. Candida, Cryptococcus and other yeasts of medical importance. In: Murray PR, Baron EJ, Jorgensen JH, Landry ML, Pfaller MA, editors. Manual of Clinical Microbiology. 9. ASM Press; Washington DC: 2007. pp. 1762–1788. [Google Scholar]
- 27.Bovers M, et al. Identification of genotypically diverse Cryptococcus neoformans and Cryptococcus gattii isolates by Luminex xMAP technology. J Clin Microbiol. 2007;45:1874–1883. doi: 10.1128/JCM.00223-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Fell JW, Scorzetti G. Reassignment of the basidiomycetous yeasts Trichosporon pullulans to Guehomyces pullulans gen. nov., comb. nov. and Hyalodendron lignicola to Trichosporon lignicola comb. nov. Int J Syst Evol Microbiol. 2004;54:995–998. doi: 10.1099/ijs.0.03017-0. [DOI] [PubMed] [Google Scholar]
- 29.Holland SM, Shea YR, Kwon-Chung J. Regarding Trichosporon pullulans infection in 2 patients with chronic granulomatous disease. Journal of Allergy and Clinical Immunology. 2004;114:206. doi: 10.1016/j.jaci.2004.01.781. [DOI] [PubMed] [Google Scholar]
- 30.Takashima M, Sugita T, Shinoda T, Nakase T. Three new combinations from the Cryptococcus laurentii complex: Cryptococcus aureus, Cryptococcus carnescens and Cryptococcus peneaus. International Journal of Systematic and Evolutionary Microbiology. 2003;53:1187–1194. doi: 10.1099/ijs.0.02498-0. [DOI] [PubMed] [Google Scholar]
- 31.Ronaghi M. Improved performance of pyrosequencing using single-stranded DNA-binding protein. Anal Biochem. 2000;286:282–288. doi: 10.1006/abio.2000.4808. [DOI] [PubMed] [Google Scholar]
- 32.Sullivan DJ, Moran GP, Coleman DC. Candida dubliniensis: ten years on. FEMS Microbiol Lett. 2005;253:9–17. doi: 10.1016/j.femsle.2005.09.015. [DOI] [PubMed] [Google Scholar]
- 33.de Hoog GS, Guarro J, Gene J, Figueras MJ. Atlas of Clinical Fungi. 2. Centraalbureau voor Schimmelcultures/Universitat Rovira i Virgili; Utrecht: 2000. [Google Scholar]
- 34.Giammanco GM, Melilli D, Pizzo G. Candida pararugosa isolation from the oral cavity of an Italian denture wearer. Res Microbiol. 2004;155:571–574. doi: 10.1016/j.resmic.2004.04.003. [DOI] [PubMed] [Google Scholar]
- 35.Bai FY, Takashima M, Nakase T. Description of Bullera kunmingensis sp. nov., and clarification of the taxonomic status of Bullera sinensis and its synonyms based on molecular phylogenetic analysis. FEMS Yeast Res. 2001;1:103–109. doi: 10.1111/j.1567-1364.2001.tb00021.x. [DOI] [PubMed] [Google Scholar]
- 36.Gueho E, Midgley G, Guillot J. The genus Malassezia with description of four new species. Antonie Van Leeuwenhoek. 1996;69:337–355. doi: 10.1007/BF00399623. [DOI] [PubMed] [Google Scholar]
- 37.CBS. Centraalbureau vor Schimmelcultures. Fungal Biodiversity Center Yeast Database. 2007 http://www.cbs.knaw.nl/yeast/BioloMICS.aspx.