Abstract
Multiple mechanisms collaborate for proper regulation of gene expression. One layer of this regulation is through the clustering of functionally related genes at discrete loci throughout the genome. This phenomenon occurs extensively throughout Ascomycota fungi and is an organizing principle for many gene families whose proteins participate in diverse molecular functions throughout the cell. Members of this phylum include organisms that serve as model systems and those of interest medically, pharmaceutically, and for industrial and biotechnological applications. In this review, we discuss the prevalence of functional clustering through a broad range of organisms within the phylum. Position effects on transcription, genomic locations of clusters, transcriptional regulation of clusters, and selective pressures contributing to the formation and maintenance of clusters are addressed, as are common methods to identify and characterize clusters.
Keywords: Biosynthetic gene clusters, Metabolic gene clusters, Secondary metabolite gene clusters, Ascomycota fungi, Genomics
1. Introduction
Transcriptional regulation is essential to ensure cellular and organismal survival. Cellular cues, both intracellularly initiated and extracellularly recognized, trigger comprehensive changes to the transcriptome. Such changes establish cellular identity and maintain homeostasis during stress [1], [2], [3], [4], [5], [6]. Transcriptional changes can also be much smaller in scope, allowing for fine-tuning expression of individual genes as needed. At the core of these transcriptional changes is the production of a transient messenger RNA (mRNA) that serves as a template for protein synthesis via the ribosome, commonly referred to as the Central Dogma of Molecular Biology [7], [8]. While the Central Dogma is a bit of an oversimplification, it remains a foundation for an understanding of gene expression [9], [10]. Extensive study over the decades has yielded a wealth of knowledge about the myriad processes that collaborate to regulate proper gene expression in countless organisms – from the simplest single celled to staggeringly complex [11], [12], [13], [14], [15]. Gene positioning and arrangement throughout the genome can profoundly influence transcription, with the cis regulatory ‘logic’ effecting expression across a locus to silence or activate neighboring genes [16], [17]. The influence of genomic arrangement on transcription has yielded significant insights, with implications for all organisms.
One of the major clades of living organisms is fungi, with estimates that there may be up to 1.5 million species within this group. There is incredible diversity within the fungal kingdom and throughout the phylum Ascomycota, with members exhibiting vast differences in morphology and lifestyles [18], [19]. Ascomycetes form a characteristic sac-like structure called an ascus, that forms around their meiotic spores and leads to their common name, sac fungus (Fig. 1) [20]. Representative members are ideal as model organisms for researchers, providing many insights into eukaryotic biology – with multiple Nobel prizes awarded to researchers studying the budding yeast and the fission yeast. It includes filamentous fungi, bread molds, and the causative pathogens for powdery mildew, black rot, and anthracnose [21], [22], [23], [24], [25]. There are also a number of opportunistic human pathogenic Ascomycete organisms, including several emerging pathogens [26].
Members of this phylum can have significant metabolic flexibility, which makes them useful for a variety of biotechnological applications, including the production of fatty alcohols, fatty acids, biofuels, reduction and degradation of chemicals and solvents [27]. This phylum includes a broad range of well-studied model organisms, plant pathogens, animal pathogens, and a number of organisms of interest to the pharmaceutical and biotechnology industries (Table 1). Due to the many applications for Ascomycete fungi in academia, industry, and medicine, a thorough understanding of genomic organization and the potential implications on gene expression is vital. There have been many advances in the identification and characterization of Ascomycetes on a genomic and transcriptional level. In this paper, we review these advances, with a focus on genomic organization and implications upon transcriptional regulation.
Table 1.
Species name | Note about species | |
---|---|---|
Model Organisms | Aspergillus nidulans | Filamentous fungus |
Neurospora crassa | Bread mold | |
Saccharomyces cerevisiae | Budding yeast | |
Schizzosaccharomyces japonicas | Fission yeast | |
Schizzosaccharomyces pombe | Fission yeast | |
Sordaria macrospora | Model of fruiting body development | |
Pathogenic and Medically Relevent | Aspergillus fumigatus | Aspergillosis |
Aspergillus terreus | Potato bilight | |
Aspergillus viridinutans | Aspergillosis | |
Blastomyces dermatitidis | Blastomycosis | |
Botrytis cinerea | Necrotrophic fungus of grapes | |
Candida albicans | Candidiasis | |
Candida auris | Candidiasis, high incidence of drug resistance | |
Candida glabrata | Candidiasis | |
Candida krusei | Oportunistic pathogen | |
Candida lusitaniae | Fungemia | |
Candida parapsilosis | Sepsis | |
Candida rugosa | Emerging human fungal pathogen | |
Candida tropicalis | Candidiasis | |
Cladophialphora bantianum | Cerebral pheohyphomycosis | |
Claviceps purpurea | Ergot fungus tthat infects rye and cereal and forage plants | |
Coccidioides immitis | Valley fever (coccidioidomycosis) | |
Coccidioides posadasii | Valley fever (coccidioidomycosis) | |
Cordyceps militaris | Entomopathogenic fungus | |
Cryphonectria parasitica | Chestnut blight (Stajich et al 2009) | |
Dothistroma septosporum | Red band needle blight | |
Exophiala dermatitidis | Phaeohyphomycosis | |
Exophiala jeanselmei | Skin infections | |
Fusarium fujikuroi | Bakanae disease | |
Fusarium oxysporum | Animal and plant pathogen | |
Fusarium oxysporum | Plant and animal pathogen | |
Fusarium solanum | Onychomycosis, keratomycosis, plant pathogen | |
Gibberella moniliformis | Maize ear and stalk rot | |
Histoplasma capsulatum | Pulmonary mycosis | |
Lacazia loboi | Lobo's disease | |
Leptosphaeria maculans | Blackleg disease | |
Ochroconis gallopava | Pathogen to fowls, turkeys, poults, and humans | |
Ophiostoma ulmi | Dutch elm disease (Stajich et al 2009) | |
Paecilomyces variotii | Common environmental mold | |
Paracoccidioides brasiliensis | Paracoccidioidomycosis | |
Paracoccidioides lutzii | Paracoccidioidomycosis | |
Penicillium expansum | Psychrophilic blue mold, infects apples | |
Pneumocystis jiroveci | Pneumocystis pneumonia (PCP) | |
Pseudoallescheria boydii | Eumycetoma; maduromycosis; pseudallescheriasis | |
Ramichloridium musae | Banana pathogen | |
Sporothrix schenckii | Sporotrichosis | |
Talaromyces marneffei | Talaromycosis (penicilliosis) | |
Biotechnological and Pharmaceutical Applications | Aureobasidium pullulans | Reductase |
Candida boidinii | Phenylalanine dehydrogenase | |
Candida cylindracea | Lipases (Lip) hydrolyze triglycerides into fatty acids and glycerol | |
Candida maltosa | Resolving D- and L-racemic mixtures of amino acids | |
Candida sorbophila | ω-oxidizing yeast | |
Debaryomyces hansenii | Oleaginous yeast; osmotolerant | |
Geotrichum candidum | Dehydrogenase | |
Kluyveromyces lactis | Assimilate lactose and convert it into lactic acid | |
Kluyveromyces marxianus | Aerobic yeast capable of respiro-fermentative metabolism | |
Komagataella pastoris | Glycolate oxidase; Phytase | |
Ogataea polymorpha | Protein expression; use methanol consumption | |
Pichia pastoris | Methanol metabolism | |
Podospora anserina | Pentaketides | |
Saccharomyces boulardii | Treatment of intestinal diseases | |
Scheffersomyces stipitis | Ferment xylose, converting it to ethanol | |
Schwanniomyces occidentalis | Mylolytic enzymes, including α-amylase and glucoamylase | |
Trichoderma reesei | Cellulolytic enzymes (cellulases and hemicellulases) | |
Trigonopsis variabilis | D-amino acid oxidase | |
Yarrowia lipolytica | Specialty lipids; lipolytic enzymes | |
Zygosaccharomyces rouxii | L-Glutaminase |
2. Position effects due to the proximity of heterochromatin
One of the advantages for the usage of members of Ascomycota for a wide range of applications is that they are amenable to genetic manipulations [22], [28]. Classical genetic approaches allow for the characterization of molecular functions and frequently utilize reporter genes, or selectable markers, typically utilizing nutritional or drug resistance for selection. The telomere proximal effect (TPE) is a phenomenon well characterized in the budding yeast, Saccharomyces cerevisiae, initially identified during the integration of the URA3, TRP1, and ADE2 reporter genes. Regardless of genomic orientation, when integrated adjacent to certain telomeres transcriptional repression of the reporter constructs is observed. The TPE is distance dependent; as the region between the telomere and a reporter increased, so does expression of the reporter gene [29]. Transcriptional repression by the TPE is due, in part, to the assembly of transcriptionally inactive heterochromatin that forms at the telomeres and spreads outwards in a continuous fashion mediated by the silent information regulator (SIR) histone deacetylase proteins and the structural maintenance of chromosome (SMC) protein complexes [30], [31].
Further study demonstrated that silencing was not universal at all native yeast telomeres equally, while there are a number of telomeric sites where integration of a reporter gene is heavily silenced there are other sites with little silencing observed [32]. The major indicator of silencing at telomeres appears to be the proximity of the telomere to an autonomously replicating sequence contained within one of the telomeric repetitive elements (the core X element) [32]. A recent study provides the most definitive understanding of TPE via global characterization of transcription utilizing a highly sensitive RNA sequencing approach. There is widely seen transcription at many endogenous yeast telomeres, although at lower absolute levels of transcription compared to non-telomeric regions. Likewise, the SIR proteins play a role in silencing genes at telomeric regions adjacent to known SIR protein binding sites. The majority of telomeric genes are un-effected by the loss of SIR protein function, indicating other mechanisms limiting their expression [33].
The overwhelming majority of eukaryotic organisms maintain heterochromatin at the telomeres. This is true for many Ascomycetes, including Schizosaccharomyces pombe, Aspergillus fumigatus, and Candida species [34], [35]. Heterochromatin is a feature of repetitive genomic regions, which include the telomeres, centromeres, and mating loci [36]. While oftentimes heterochromatin conjures images of a static, silent structure, it can be dynamic, exhibiting plasticity under specific growth conditions. S. pombe and C. albicans alters heterochromatin at the telomere during elevated temperatures in a SIR dependent manner, indicating that this is dynamic – undergoing significant changes depending on the growth conditions [37], [38]. This could result in the clustering of specific genes at specific telomeric regions that require activation under specific stimuli, such as limiting the co-factor necessary for SIR functioning, while maintaining low levels of expression under normal growth conditions [33]. This is particularly relevant for gene members of the toxin response regulon, which is upregulated in response to specific cellular toxins to facilitate cell survival. This family of genes exhibits a non-random distribution throughout the genome, including a number of members localized to the telomeric regions in S. cerevisiae [39].
3. Functional clustering of genes within complex biosynthetic pathways
Many prokaryotic organisms contain a streamlined genome, with functionally related, co-regulated genes organized in a linear arrangement under the transcriptional regulation of a single promoter region. This organization, called an operon, represents an efficient mechanism to balance production of multiple components within a metabolic process, playing a critical role in gene expression and organismal survival [40], [41], [42], [43]. Operon-like gene clusters are present in at least one eukaryote, although the canonical operon structure is largely absent in eukaryotes on the whole [44].
One characteristic feature of eukaryotic chromosomes is that they exhibit domains, or neighborhoods, of correlated gene expression throughout the genome [45], [46], [47], [48]. In S. cerevisiae, there is a broad incidence of locally correlated gene expression, which is distance dependent (the closer any two genes are located the higher their average transcriptional correlation throughout the genome) [49]. This, coupled with the fact that many functionally related genes cluster together non-randomly across the genome, supports the hypothesis that functional clustering represents fundamental layer of transcriptional regulation for many of these genes [39], [45].
Approximately 25% of functionally related gene families are organized into such clusters, and this arrangement increases the transcriptional similarity (as quantified by the Pearson’s correlation coefficient) of these clusters compared to the non-clustered members within the same co-regulated gene family [39]. This phenomenon is not unique to S. cerevisiae – clustering of gene families occurs throughout this phylum for gene families participating in a variety of different molecular processes. In this section, we discuss the types and the incidence of these clusters identified in and observed across Ascomycetes species.
3.1. Secondary metabolite gene clusters
Fungi produce a number of bioactive compounds collectively called secondary metabolites (SMs), molecules that are not required during normal growth. These molecules typically confer a survival advantage exhibiting properties that are antibiotic, anti-proliferative, and catabolic [50]. Many different Ascomycetes produce SMs that are mycotoxins, phytotoxins, and compounds that enhance virulence and pathogenesis [51], [52]. The production of SMs by pathogenic fungi can facilitate fungal cooption of a host’s cells, triggering apoptosis and the absorption of host nutrients [52]. Due to the bioactive effects of many SMs, these molecules are of broad interest pharmaceutically and medically for their potentially therapeutic effects [53], [54].
3.1.1. Pathogenic toxins and defense
Aspergillus species are non-specific pathogens that infect plants, animals, insects, and immunocompromised people. They cause diseases that cause significant economic impacts in the agricultural industry and produce the potent carcinogen aflatoxin [55]. In A. nidulans the pathway for production of sterigmatocystin, a highly toxic metabolite and a precursor to the aflatoxins, is located as a 23-gene cluster that spans a 54 kilobase genomic region. In A. fumigatus the six genes which are necessary conidial pigment biosynthesis – known to increase the virulence in this species – form a cluster spanning 19 kb (Fig. 2A) [56]. The biosynthetic pathway for the synthesis of the meroterpenoids austinol and dehydroaustinol in A. nidulans are found as a split cluster of four and 10-genes [57].
Gliotoxin is an ETP (epipolythiodioxopiperazine)-type fungal toxin produced by A. fumigatus that controls the immune response and induces apoptosis in specific cell types [58]. The biosynthesis of gliotoxin arises from a 16 gene cluster – which includes the synthetic enzymes and the proteins to detoxify and protect A. fugimatus from the effects of gliotoxin [59], [60]. This ETP cluster is conserved in many Aspergillus lineages. Although there are differences in the clusters between species, this cluster was most likely the result of an ancestral relationship. The current model is that following assembly of this gene cluster they diverged and diversified during the course of evolution of Ascomycetes [61].
The clustering of functionally related genes across the genome is widespread within this phylum for genes that are necessary for virulence. The fungal pine pathogen, Dothistroma septosporum, produces the potent phytotoxin dothistromin – which is chemically similar to aflatoxin. Biosynthesis of dothistromin requires 19 genes, found in six distinct clusters along the same chromosome [62]. This clustering appears to be the result of a dispersal of an ancestral cluster that exhibited a tighter clustering organizational relationship [62]. Gibberella moniliformis, a plant pathogen that causes ‘bakanae’ disease in rice, contains a cluster of 18 putative genes spanning a 75 kB region for the production of the toxin fumonisin [63].
The rye and grass pathogen, Claviceps purpurea, produces ergot alkaloids – potent bioactive mycotoxins – from a four-gene cluster [64]. The antibiotic viriditoxin exhibits anti-proliferative effects in bacteria (via interactions with FtsZ during division) and in cancer cells [65], [66]. This toxin is produced in Paecilomyces variotii and Aspergillus viridinutans via a nine- and eight- gene cluster respectively [67]. Penicillium expansum is a blue mold that causes apple decay and produces the cytotoxic SM patulin, which can be immunosuppressive and carcinogenic [68]. This biosynthetic pathway is encoded by a 15-gene cluster that includes the biosynthetic enzymes and detoxification proteins [69]. Aspergillus terreus has a gene cluster for lovastatin biosynthesis, which is used medically to treat high cholesterol and triglyceride levels in patients [70]. Bikaverin is a reddish pigment produced by Fusarium fungi with reported antibiotic and antitumoral properties. This molecule depends, in part, on a six-gene cluster as seen in Fusarium fujikuroi, where the genes for biosynthesis, regulation, and transport are found in a contiguous stretch [71].
Horizontal gene transfer (HGT) refers to the passage of genes between organisms by means other than parent to offspring transmission. While HGT is quite frequent in prokaryotic organisms, it is significantly less common in eukaryotic organisms. The production of toxins confers a growth and a survival advantage, making SM pathways excellent candidates for HGT in eukaryotes. The 23-gene cluster necessary for sterigmatocystin production in A. nidulans, was adopted by the filamentous fungus, Podospora anserine, via horizontal gene transfer from Aspergillus [72]. Likewise, the biosynthetic clusters for gliotoxin production were acquired by horizontal gene transfer in several species [61]. Interestingly, five of the clustered genes involved in the synthesis of bikaverin in Fusarium fungi are identified in the distantly related Botrytis cinerea. Although the regulatory genes maintained their functionality, the others did not. This presents an example of horizontal gene transfer of the entire cluster and suggests this mechanism might contribute to the incorporation of novel regulators in addition to metabolic pathways [73].
3.1.2. Detoxification clusters
Arsenic is a naturally occurring element found in a variety of compounds, many of which can be toxic. In budding yeast, S. cerevisiae, the three genes that confer resistance to arsenic containing compounds, ARR1, ARR2, and ARR3, are clustered together with a 4.2 kilobase region on chromosome XVI [74]. The expression of these genes results in the production of a basic helix-turn-helix transcription factor, arsenate reductase, and a metalloid-proton antiporter that collaborate to protect the cell from the toxin. Fusarium oxysporum contains a two-gene cluster that allows for the detoxification of cyanate (CNO–), a defense compound that is produced by a wide range of organisms that inhibits oxidative phosphorylation through the interference with cytochrome C function [75], [76].
In addition to protection from exogenous toxins, there is an inherent risk to produce SMs that are nonspecific toxins. A form of protection for the host cell can involve a gene (or genes) that offers protection from the toxin, termed the ‘resistance hypothesis’. This can include efflux pumps for export and modification or detoxification enzymes [77]. Leptosphaeria maculans contains a sirodesmin biosynthesis gene cluster, which is a nonspecific mycotoxin. This gene cluster is predicted to contain 18 genes, including several genes that code for P450 cytochrome proteins that serve to detoxify this compound [78]. The necrotrophic fungus, Botrytis cinerea, contains the five-gene cluster for the synthesis of the phytotoxin sequesterterpine and several members of the p450 monooxygenase gene family [79]. It also produces the plant hormone abscisic acid (ABA), coded by a cluster of four genes that include two members of the p450 monooxygenase gene family [80].
3.2. Primary metabolite gene clusters
Primary metabolite (PM) biosynthetic genes are those whose products participate in basic metabolic processes that are widespread in many organisms, whereas SMs biosynthetic pathways occur in a limited number of organisms. One of the best characterized functional clustering of a gene family are those of the galactose metabolism genes, which have been extensively characterized in S. cerevisiae. Three of the genes are co-localized together on chromosome II: GAL7-GAL10-GAL1 which encode the enzymes galactose-1-phosphate uridyl transferase, UDP-glucose-4-epimerase, and galactokinase, respectively [81]. Clustering of the galactose metabolism genes in the Saccharomycotina and Candida ancestors, with the S. pombe and S. japonicas acquiring the cluster via HGT from a common ancestor from the Candida cluster [82].
Proline catabolism depends on the activity of a permease, an oxidase, and a P5C (pyrroline-5-carboxylate) dehydrogenase to produce glutamate. In A. nidulans, proline metabolism is dependent on a four-gene cluster not conserved in S. cerevisiae. The biosynthesis of biotin is dependent on a three-gene cluster (Fig. 2B) [83], [84], [85]. Nicotinate metabolism is the result of a six-gene cluster that occurs over a 14.4kB region along chromosome VI in A. nidulans [86]. The utilization of alcohol as a carbon source depends on a five-gene cluster found on chromosome VII [87]. Interestingly, there are two additional genes seen in this cluster when compared to the cluster as seen in A. fumigatus, representing a species specific expansion of this cluster [87].
Pichia stipites contains the genes to catabolize the sugar L-rhamnose, of which four of the five genes are clustered [88]. There at least partial conservation of this grouping in members of the subphyla Pezizomycotina and Saccharomycotina [88]. Sulfate assimilation depends on the function of an ATP-sulfurase and PAPS (3-phosphoadenoine-5′phosphosulfate) reductase, which are found as a gene pair in A. terreus [89].
C. albicans catabolize phenolic compounds into acetyl coA via the 3-oxoadipate pathway. The genes involved in this are localized into two distinct clusters and are conserved in the Candida and ‘CTG’ lineages (species that translate CUG as serine), although with significant evolutionary divergence [90]. The three genes responsible for GlcNAc metabolism into fructose-6-phosphate are found clustered in C. albicans. This arrangement is conserved in Trichoderma reesei, along with the transcription factor, RON1 that regulates expression of the cluster [91].
In the case of PMs, there is an advantage to the clustering of genes; however, it is not a ubiquitous phenomenon. The production of SMs demonstrate an advantage to the maintenance of clusters to maintain integrity of the product and to deal with the potential ramifications of toxicity. This selection does not appear to be the case for PMs which exhibit more variation between species.
3.3. Superclusters
The descriptor of superclusters refers to the functional clustering of genes that code for the production of multiple SMs or PMs that have a complex relationship. COR (cordycepin, or 3′-deoxyadenosine) is produced by Cordyceps militaris from a four-gene cluster that also includes the enzyme to produce PTN (pentostatin, or 2′-deoxycoformycin), an inhibitor of adenosine deaminases (Fig. 2C). Both of these are bioactive molecules – COR has antibiotic and anti-inflammatory properties and PTN has chemotherapeutic effects [92], [93]. Interestingly, PTN production prevents the deamination of COR to 3′-deoxyinosine by endogenous enzymes in C. militaris [94]. COR and PTN biosynthesis is mediated by a single gene cluster and PTN prevents COR deamination, potentially through enzyme inhibition of endogenous adenosine deaminases present in C. militaris. Another example of a supercluster is seen in A. fumigatus, where the genes for the production of fumagillin and pseurotin are localized to the subtelomeric region of chromosome VIII. There are fifteen genes localized to this cluster, presenting a complex intertwined relationship for these two seemingly unrelated chemicals [95]. This genomic arrangement is conserved, albeit with significant rearrangements in related species [95].
3.4. Regulons
While operons are primarily a prokaryotic phenomenon, eukaryotic organisms contain regulons; functionally related gene families that are co-regulated and spread throughout the genome. Canonical examples include the genes involved with the biogenesis of the ribosome – both the ribosomal protein (RP) gene family and the rRNA and ribosome biosynthesis (RRB, or Ribi) gene family – are found clustered together throughout the genome in S. cerevisiae [96]. This distribution is non-random and highly unlikely to occur by chance [96], [97]. This distribution is not limited to budding yeast, the clustering of both gene families is conserved in both C. albicans, and the distantly related S. pombe. The identity of the individual paired genes differed among species, however the absolute numbers of clustered genes (e.g. the overall number of pairings that exist) is similar [96], [98]. Systematic analysis revealed that approximately 25% of all functionally related gene families exhibit a non-random genomic distribution as clustered pairings in S. cerevisiae. This genomic distribution results in a tighter transcriptional response during cellular changes when compared to the singleton (non-clustered) members within the same family [39].
4. Transcriptional co-regulation within a neighborhood may drive functional clustering
In addition to the repressive effects that can occur based on proximity to heterochromatic regions, certain genomic regions are more susceptible to transcriptional disruption during the course of genomic manipulations. The advent of genomic libraries allows for high throughput genetic screening in many organisms, and a wealth of resources exist for S. cerevisiae. Genetic manipulation alters expression of genes surrounding the site of manipulation at 7–15% of targeted regions. This led to the misannotation of over nine-thousand genetic interactions in systematic screens [99], [100]. Furthermore, genomic integration sites are responsible for 13-fold variation in protein levels [101]. The disruption of gene expression via the integration of highly expressed reporters may lead to this phenomena through transcriptional interference and repression of the adjacent gene(s) [102], [103].
Aside from transcriptional interference, there is evidence that the non-random genomic distribution of functionally related genes is essential to their proper transcriptional regulation via shared promoter elements. Using a RRB gene pair as a test, functional dissection of the MPP10-MRX12 pairing identified both genes share a common promoter region. What is notable about this pairing is that the genes are oriented in a convergent manner (→ ←), meaning that the canonical promoter elements, termed PAC and RRPE, exerts influence across a 4.0kB genomic region [104]. Both promoter elements are localized upstream of MPP10, there are no identifiable motifs in the promoter region of MRX12, and their mutation uncouples transcription of this pair from the rest of the regulon. Transcription of the pair is disrupted by their separation, and proper expression depends on chromatin remodeling and transcription factor binding to the promoter of MPP10 [105]. This phenomenon is termed ‘adjacent gene co-regulation’ (AGcoR).
There are several conformations for functionally related gene clusters, including a divergent orientation (Fig. 3A). This confirmation results in cis regulatory elements between the pair of genes, where it can act as a bidirectional promoter, which is quite common in yeast [106]. Much less common are the tandem (→ →) and convergent orientations (Fig. 3B and C). In these arrangements co-expression and transcriptional regulation must occur across a larger distance – which is not as common in budding yeast [107]. Interestingly, the convergent gene pair RRP15-NOC4 have the PAC and RRPE elements localized upstream of one member of the pair, as in the case with MPP10-MRX12. The pairing RPF1-GAR1 have the two elements split, with each gene containing either the PAC or the RRPE promoter motif. Though not functionally dissected, it would be quite interesting to observe the transcriptional regulation across a broad genomic distance once it is completed.
Transcriptional regulation at a distance depends, in part, on alterations to the chromatin underlying a genomic neighborhood extending across a broad region [108]. The constraints that these could impose would result in certain portions of the genome being more permissive to local influencing regulators. This is a particularly attractive model based on the diverse nature of gene families that are clustered and do not necessarily share the same promoter motifs. In support of this model, the ribosomal proteins, the nitrogen metabolism, and the toxin response gene families, cluster into genomic regions that are more susceptible to transcription at a distance during induction of the stress response [108]. Data was extracted and the pairwise Spearman’s correlation coefficient was plotted as a function of genomic distance (between the transcription start sites for each gene) for every member of each family. The members of each family were divided into plots for the singleton members (Fig. 4A) and the clustered members (Fig. 4B). Both plots reveal a positive correlation in expression regardless of genomic confirmation across the stress response. These data fit a logarithmic decay curve, which was overlaid as a separate plot (Fig. 4C). The transcriptional similarity is greater and spreads across a longer genomic distance in the clustered gene members – indicating that regardless of function, clusters are localizing to transcriptionally permissive regions of the genome (at least for these gene families). This would potentially explain how there are similar numbers of clusters within the RP and RRB gene families throughout Ascomycota, though the identities of the pairings differ greatly [96].
5. Computational approaches to identify and characterize clusters
The identification and characterization of co-regulated functional clusters as described throughout this work is currently an active area of research. This primarily involves two interrelated components – the first is the identification of co-expressed genes and the second is the characterization of the co-localization of two genes to the same genomic locus. In this section, we describe several tools employed to identify correlated expression from gene expression datasets.
5.1. Identification of functional clustering relationships using Euclidean distance from gene expression datasets
Identification of the complete membership of co-regulated gene families is a fundamental challenge for researchers, but can yield rewarding applications. One approach that successfully identified the membership of the ribosome biogenesis regulon is through the utilization of the function daisy from the S library [109]. Through the analysis of the budding yeast stress-response datasets, the Euclidean distance for all budding yeast genes (from the composite average gene expression response of the RRB gene family) was determined. This resulted in the expansion of the gene family to 188 members – many of which were clustered throughout the genome and had yet to be annotated [97]. The metric utilized is defined as:
where xik and xjk denote the expression levels of genes i and j at time point k [97]. This approach represents a straightforward method to mine gene expression data when the transcriptional behavior of a gene family is known beforehand.
5.2. Pearson’s correlation coefficient analysis of gene expression data
The transcriptional similarity between two genes can be calculated by the Pearson’s correlation coefficient (PCC). This approach is straightforward and represents a standard analysis that can be applied to gene expression data to identify linear relationships among genes, yet powerful enough determine correlated networks and similarities [39]. To calculate the PCC between two genes, X and Y, across a series of N conditions:
where:
where Goffset is set to the reference state in each data set.
5.3. Spearman’s correlation coefficient analysis of genomic neighborhoods
The Spearman’s correlation coefficient (SCC) is similar to the PCC, although without the magnitude component utilized in the PCC. The use of the SCC is ideal for characterizing gene expression throughout a broad region and uncovering more subtle transcriptional effects within a genomic locus or neighborhood [49]. The SCC is computed utilizing the formula:
where g1 and g2 are the corresponding genes for comparison, cov is their covariance, and σ is the standard deviation of expression. Such analysis can also give an accurate representation of the transcriptional effects that ripple outward as the result of transcriptional activation and that may be indirect as a function of proximity [16], [49]
5.4. The multivariate copula model for analysis of gene expression data
An addition to the computational toolkit that allows for modelling the directional dependence of two genes is through the application of a multivariate copula model (MCM) [110]. This represents a powerful computational approach that can help to construct the gene interaction networks and identify co-regulated gene families. This approach, when applied to gene expression datasets, allows for the reconstruction of regulatory relationships even when the data are non-linear [111]. The MCM is an enhancement over the use of the Pearson’s and Spearman’s correlation coefficients that can allow of the characterization of complex networks and relationships – as evidenced by the identification of network of chromosomal maintenance and cell-cycle regulating genes [112].
5.5. Determining the statistical significance of genomic arrangement through the with a hypergeometric distribution
Characterization of functional clustering observed in co-expressed gene families has been primarily done via manual curation. While this can be laborious at time, the systematic nomenclature adopted by many organisms allow for rapid searching and querying for identification of clusters. Once the genomic distribution of a gene set has been characterized, the probability of this distribution can be calculated. The statistical significance for the genomic distribution of a functionally related gene family is determined by calculating the binomial probability for the arrangement. The chance probability that there would be j adjacent genes within a regulon of size M genes is:
where N is the total number of genes present within S. cerevisiae (total number of genes after deduction of dubious open reading frames).
6. Summary and outlook
The physical linkage of genes into functional clusters throughout the genome can buffer the effects of stochastic noise in gene expression. Even in isogenic, clonal populations of cells there can be considerable variation in the levels of expression from similar genetic constructs [113], [114]. One model for selective pressures that favor clustering is that this arrangement would minimize the effects of stochastic noise. This is advantageous when dealing with biological pathways that produce potentially toxic intermediates, as seen with the galactose metabolism and the tyrosine biosynthetic pathways. The clustering of the genes within those pathways can limit variations in the levels of component enzymes – minimizing the risk of toxin buildup [115]. The risk of toxin production is significant, when the GAL genes are not clustered in S. cerevisiae there is significantly higher levels of the toxic galactose-1-phosphate and reduced cell viability [116]. Gene linkage via functional clustering would also minimize copy number variations that would occur during DNA replication, synchronizing the timing together.
A clustering arrangement would also be advantageous when dealing the production of large macromolecular complexes where stoichiometric levels of proteins are necessary. Some of these complexes, such as the ribosome, consume significant energetic reserves [117]. Transcription of the component genes to balance the production and abundance of mRNA as needed, allocates the limited cellular resources to minimize waste [118]. In addition to efficient energetic expenditures, the unbalanced production of the components to produce a ribosome results in orphan ribosomal proteins (RPs). The presence of these extra-ribosomal RPs perturbs cellular proteostasis, activates the expression of the heat shock activated transcription factor, HSF1, and reduces cellular fitness [119], [120].
There are thousands of species in the fungal kingdom – and in the phylum Ascomycota – yet to be discovered and are still being characterized. There will undoubtedly be many novel SMs identified. Characterization of biosynthetic gene clusters that produce these SMs will provide valuable insight to the target and mechanisms of these SMs, and the defense and protective genes present to ensure secondary metabolites do not harm the host. [121]. This could offer therapeutic targets for emerging pathogens, provide genetic modifications for the protection of food crops, and offer novel pathways for industrial and pharmaceutical biosynthesis. The farther the levels of transcription deviate from the optimal expression level for a gene, the more advantageous the tolerance for transcriptional noise [122]. This is particularly germane for researchers modifying organisms to recreate biosynthetic pathways in organisms that are easier to cultivate, including S. cerevisiae. The site of manipulation and modification should be chosen carefully in order to maximize organismal fitness, biosynthetic output, and metabolite yield [123].
The functional clustering of co-regulated genes – and the effects of positional expression – are not limited to Ascomycetes or to fungi. It has long been known that the effects of heterochromatin, as in the TPE, are conserved in more complex eukaryotic organisms, including humans [124]. There is conservation of the molecular mechanisms as well [125]. In addition to these repressive effects, there is a global correlation between proximity of two genes and transcriptional similarity [49]. Transcriptional activation of a gene can alter transcription across genomic ‘neighborhoods’, activating expression of the neighboring genes across a large distance [16]. Comprehensive analysis and further elucidation of the mechanisms underlying this phenomena are essential to fully understanding the relationship between the clustering of related genes and co-regulation – and a systems level understanding of transcription within the cell.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgements
The authors would like to acknowledge and thank Kelley Healey (William Paterson University, NJ) for providing the Penicillium images in Fig. 1. The authors would like to acknowledge David White (Denison University, OH) for his helpful discussions on the manuscript. The authors acknowledge support for this work from the Office of the Provost for Assigned Release Time (ART) and the Office of the Dean of the College of Science and Health’s Center for Research (CfR) at William Paterson University. The authors would also like to acknowledge the reviewers of this manuscript, who provided thoughtful suggestions that enhanced this work.
References
- 1.Sekhon RS, Lin H, Childs KL, Hansey CN, Buell CR, et al. Genome-wide atlas of transcription during maize development. Plant J 2011;66:553–63. [DOI] [PubMed]
- 2.Mohammed H., Hernando-Herraez I., Savino A., Scialdone A., Macaulay I., Mulas C., Chandra T., Voet T., Dean W., Nichols J., Marioni J.C., Reik W. Single-cell landscape of transcriptional heterogeneity and cell fate decisions during mouse early gastrulation. Cell Reports. 2017;20(5):1215–1228. doi: 10.1016/j.celrep.2017.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Vonk P.J., Ohm R.A. The role of homeodomain transcription factors in fungal development. Fungal Biol Rev. 2018;32(4):219–230. doi: 10.1016/j.fbr.2018.04.002. [DOI] [Google Scholar]
- 4.Gasch A.P., Spellman P.T., Kao C.M., Carmel-Harel O., Eisen M.B., Storz G., Botstein D., Brown P.O., Silver P.A. Genomic Expression Programs in the Response of Yeast Cells to Environmental Changes. MBoC. 2000;11(12):4241–4257. doi: 10.1091/mbc.11.12.4241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Storz G., Hengge R., editors. Bacterial Stress Responses. ASM Press; Washington, DC, USA: 2010. [Google Scholar]
- 6.Pakos‐Zebrucka K., Koryga I., Mnich K., Ljujic M., Samali A., Gorman A.M. The integrated stress response. EMBO Rep. 2016;17(10):1374–1395. doi: 10.15252/embr.201642195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Crick FH. On protein synthesis; 1958, p. 8. [PubMed]
- 8.Crick F. Central Dogma of Molecular Biology. Nature. 1970;227(5258):561–563. doi: 10.1038/227561a0. [DOI] [PubMed] [Google Scholar]
- 9.Koonin E.V. Does the central dogma still stand? Biol Direct. 2012;7(1):27. doi: 10.1186/1745-6150-7-27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Manjrekar J. Epigenetic inheritance, prions and evolution. J Genet. 2017;96(3):445–456. doi: 10.1007/s12041-017-0798-3. [DOI] [PubMed] [Google Scholar]
- 11.Brown J.B., Boley N., Eisman R., May G.E., Stoiber M.H., Duff M.O., Booth B.W., Wen J., Park S., Suzuki A.M., Wan K.H., Yu C., Zhang D., Carlson J.W., Cherbas L., Eads B.D., Miller D., Mockaitis K., Roberts J., Davis C.A., Frise E., Hammonds A.S., Olson S., Shenker S., Sturgill D., Samsonova A.A., Weiszmann R., Robinson G., Hernandez J., Andrews J., Bickel P.J., Carninci P., Cherbas P., Gingeras T.R., Hoskins R.A., Kaufman T.C., Lai E.C., Oliver B., Perrimon N., Graveley B.R., Celniker S.E. Diversity and dynamics of the Drosophila transcriptome. Nature. 2014;512(7515):393–399. doi: 10.1038/nature12962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Araya C.L., Kawli T., Kundaje A., Jiang L., Wu B., Vafeados D., Terrell R., Weissdepp P., Gevirtzman L., Mace D., Niu W., Boyle A.P., Xie D., Ma L., Murray J.I., Reinke V., Waterston R.H., Snyder M. Regulatory analysis of the C. elegans genome with spatiotemporal resolution. Nature. 2014;512(7515):400–405. doi: 10.1038/nature13497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Stamatoyannopoulos J.A., Snyder M., Hardison R., Ren B., Gingeras T. An encyclopedia of mouse DNA elements (Mouse ENCODE) Genome Biol. 2012;13:1–5. doi: 10.1186/gb-2012-13-8-418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Consortium EP An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Gehring A.M., Walker J.E., Santangelo T.J., Margolin W. Transcription Regulation in Archaea. J Bacteriol. 2016;198(14):1906–1917. doi: 10.1128/JB.00255-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Ebisuya M., Yamamoto T., Nakajima M., Nishida E. Ripples from neighbouring transcription. Nat Cell Biol. 2008;10(9):1106–1113. doi: 10.1038/ncb1771. [DOI] [PubMed] [Google Scholar]
- 17.Henikoff S. Position effect and related phenomena. Curr Opin Genet Dev. 1992;2(6):907–912. doi: 10.1016/S0959-437X(05)80114-5. [DOI] [PubMed] [Google Scholar]
- 18.Robbertse B., Reeves J.B., Schoch C.L., Spatafora J.W. A phylogenomic analysis of the Ascomycota. Fungal Genet Biol. 2006;43(10):715–725. doi: 10.1016/j.fgb.2006.05.001. [DOI] [PubMed] [Google Scholar]
- 19.Schoch CL, Sung G-H, López-Giráldez F, Townsend JP, Miadlikowska J, et al. The Ascomycota tree of life: a phylum-wide phylogeny clarifies the origin and evolution of fundamental reproductive and ecological traits. Systematic biology 2009;58:224–39. [DOI] [PubMed]
- 20.Wallen R.M., Perlin M.H. An overview of the function and maintenance of sexual reproduction in dikaryotic fungi. Front Microbiol. 2018;9:503. doi: 10.3389/fmicb.2018.00503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Pirrello C, Mizzotti C, Tomazetti TC, Colombo M, Bettinelli P, et al. Emergent Ascomycetes in viticulture: an interdisciplinary overview. Frontiers in Plant Science 2019;10:1394. [DOI] [PMC free article] [PubMed]
- 22.Duina AA, Miller ME, Keeney JB. Budding yeast for budding geneticists: a primer on the Saccharomyces cerevisiae model system. Genetics 2014;197:33–48. [DOI] [PMC free article] [PubMed]
- 23.Egel R. Fission yeast in general genetics. The Molecular Biology of Schizosaccharomyces pombe: Springer; 2004, p. 1–12.
- 24.Pontecorvo G, Roper J, Chemmons L, MacDonald K, Bufton A. The genetics of Aspergillus nidulans. Advances in genetics: Elsevier; 1953, p. 141–238. [DOI] [PubMed]
- 25.Davis R.H., Perkins D.D. Neurospora: a model of model microbes. Nat Rev Genet. 2002;3(5):397–403. doi: 10.1038/nrg797. [DOI] [PubMed] [Google Scholar]
- 26.Köhler JR, Casadevall A, Perfect J. The spectrum of fungi that infects humans. Cold Spring Harbor perspectives in medicine 2015;5:a019273. [DOI] [PMC free article] [PubMed]
- 27.Johnson E.A. Biotechnology of non-Saccharomyces yeasts—the ascomycetes. Appl Microbiol Biotechnol. 2013;97(2):503–517. doi: 10.1007/s00253-012-4497-y. [DOI] [PubMed] [Google Scholar]
- 28.Hoffman C.S., Wood V., Fantes P.A. An Ancient Yeast for Young Geneticists: A Primer on the Schizosaccharomyces pombe Model System. Genetics. 2015;201(2):403–423. doi: 10.1534/genetics.115.181503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Gottschling D.E., Aparicio O.M., Billington B.L., Zakian V.A. Position effect at S. cerevisiae telomeres: reversible repression of Pol II transcription. Cell. 1990;63(4):751–762. doi: 10.1016/0092-8674(90)90141-Z. [DOI] [PubMed] [Google Scholar]
- 30.Renauld H., Aparicio O.M., Zierath P.D., Billington B.L., Chhablani S.K., Gottschling D.E. Silent domains are assembled continuously from the telomere and are defined by promoter distance and strength, and by SIR3 dosage. Genes Dev. 1993;7(7a):1133–1145. doi: 10.1101/gad.7.7a.1133. [DOI] [PubMed] [Google Scholar]
- 31.Moradi-Fard S, Sarthi J, Tittel-Elmer M, Lalonde M, Cusanelli E, et al. Smc5/6 is a telomere-associated complex that regulates Sir4 binding and TPE. PLoS genetics 2016;12:e1006268. [DOI] [PMC free article] [PubMed]
- 32.Pryde FE, Louis EJ. Limitations of silencing at native yeast telomeres. EMBO J. 1999;18:2538–50. [DOI] [PMC free article] [PubMed]
- 33.Ellahi A., Thurtle D.M., Rine J. The Chromatin and Transcriptional Landscape of Native Saccharomyces cerevisiae Telomeres and Subtelomeric Domains. Genetics. 2015;200(2):505–521. doi: 10.1534/genetics.115.175711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.De ALP, Juárez-Cepeda J, López-Fuentes E, Briones-Martín-Del-Campo M, Gutiérrez-Escobedo G, et al. Local and regional chromatin silencing in Candida glabrata: consequences for adhesion and the response to stress. FEMS yeast research 2015:15. [DOI] [PubMed]
- 35.Buscaino A. Chromatin-Mediated Regulation of Genome Plasticity in Human Fungal Pathogens. Genes 2019;10:855. [DOI] [PMC free article] [PubMed]
- 36.Bühler M, Gasser SM. Silent chromatin at the middle and ends: lessons from yeasts. EMBO J. 2009;28:2149-61. [DOI] [PMC free article] [PubMed]
- 37.Freire-Benéitez V., Price R.J., Tarrant D., Berman J., Buscaino A. Candida albicans repetitive elements display epigenetic diversity and plasticity. Sci Rep. 2016;6(1) doi: 10.1038/srep22989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ayoub N., Goldshmidt I., Cohen A. Position effect variegation at the mating-type locus of fission yeast: a cis-acting element inhibits covariegated expression of genes in the silent and expressed domains. Genetics. 1999;152:495–508. doi: 10.1093/genetics/152.2.495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Eldabagh RS, Mejia NG, Barrett RL, Monzo CR, So MK, et al. Systematic identification, characterization, and conservation of adjacent-gene coregulation in the budding yeast Saccharomyces cerevisiae. Msphere 2018:3. [DOI] [PMC free article] [PubMed]
- 40.Jacob F, Monod J. Genetic mapping of the elements of the lactose region in Escherichia coli. Biochemical and Biophysical Research Communications 1965;18:693–701.
- 41.Kolter R., Yanofsky C. Attenuation in Amino Acid Biosynthetic Operons. Annu. Rev. Genet. 1982;16(1):113–134. doi: 10.1146/annurev.ge.16.120182.000553. [DOI] [PubMed] [Google Scholar]
- 42.Osborn A.M., Bruce K.D., Strike P., Ritchie D.A. Distribution, diversity and evolution of the bacterial mercury resistance (mer) operon. FEMS Microbiol Rev. 1997;19(4):239–262. doi: 10.1111/j.1574-6976.1997.tb00300.x. [DOI] [PubMed] [Google Scholar]
- 43.Ben Fekih I, Zhang C, Li YP, Zhao Y, Alwathnani HA, et al. Distribution of arsenic resistance genes in prokaryotes. Frontiers in microbiology 2018;9:2473. [DOI] [PMC free article] [PubMed]
- 44.Blumenthal T., Davis P., Garrido-Lecca A. Operon and non-operon gene clusters in the C. elegans genome. WormBook: The Online Review of C elegans Biology [Internet] WormBook. 2018 doi: 10.1895/wormbook.1.175.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Cohen B.A., Mitra R.D., Hughes J.D., Church G.M. A computational analysis of whole-genome expression data reveals chromosomal domains of gene expression. Nat Genet. 2000;26(2):183–186. doi: 10.1038/79896. [DOI] [PubMed] [Google Scholar]
- 46.Schmid M., Davison T.S., Henz S.R., Pape U.J., Demar M., Vingron M., Schölkopf B., Weigel D., Lohmann J.U. A gene expression map of Arabidopsis thaliana development. Nat Genet. 2005;37(5):501–506. doi: 10.1038/ng1543. [DOI] [PubMed] [Google Scholar]
- 47.Spellman P.T., Rubin G.M. Evidence for large domains of similarly expressed genes in the Drosophila genome. J Biol. 2002;1:1–8. doi: 10.1186/1475-4924-1-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Williams E.J., Bowles D.J. Coexpression of neighboring genes in the genome of Arabidopsis thaliana. Genome Res. 2004;14:1060–1067. doi: 10.1101/gr.2131104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Quintero-Cadena P., Sternberg P.W. Enhancer Sharing Promotes Neighborhoods of Transcriptional Regulation Across Eukaryotes. G3. 2016;6(12):4167–4174. doi: 10.1534/g3.116.036228. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Vining L.C. Functions of Secondary Metabolites. Annu Rev Microbiol. 1990;44(1):395–427. doi: 10.1146/annurev.mi.44.100190.002143. [DOI] [PubMed] [Google Scholar]
- 51.Fox E.M., Howlett B.J. Secondary metabolism: regulation and role in fungal biology. Curr Opin Microbiol. 2008;11(6):481–487. doi: 10.1016/j.mib.2008.10.007. [DOI] [PubMed] [Google Scholar]
- 52.Möbius N., Hertweck C. Fungal phytotoxins as mediators of virulence. Curr Opin Plant Biol. 2009;12(4):390–398. doi: 10.1016/j.pbi.2009.06.004. [DOI] [PubMed] [Google Scholar]
- 53.Jensen P.R. Natural Products and the Gene Cluster Revolution. Trends Microbiol. 2016;24(12):968–977. doi: 10.1016/j.tim.2016.07.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Stone M.J., Williams D.H. On the evolution of functional secondary metabolites (natural products) Mol Microbiol. 1992;6(1):29–34. doi: 10.1111/j.1365-2958.1992.tb00834.x. [DOI] [PubMed] [Google Scholar]
- 55.Klich M.A. Aspergillus flavus: the major producer of aflatoxin. Mol Plant Pathol. 2007;8(6):713–722. doi: 10.1111/j.1364-3703.2007.00436.x. [DOI] [PubMed] [Google Scholar]
- 56.Tsai H.-F., Wheeler M.H., Chang Y.C., Kwon-Chung K.J. A Developmentally Regulated Gene Cluster Involved in Conidial Pigment Biosynthesis in Aspergillus fumigatus. J Bacteriol. 1999;181(20):6469–6477. doi: 10.1128/JB.181.20.6469-6477.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Lo H.-C., Entwistle R., Guo C.-J., Ahuja M., Szewczyk E., Hung J.-H., Chiang Y.-M., Oakley B.R., Wang C.C.C. Two Separate Gene Clusters Encode the Biosynthetic Pathway for the Meroterpenoids Austinol and Dehydroaustinol in Aspergillus nidulans. J Am Chem Soc. 2012;134(10):4709–4720. doi: 10.1021/ja209809t. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Scharf D.H., Heinekamp T., Remme N., Hortschansky P., Brakhage A.A., Hertweck C. Biosynthesis and function of gliotoxin in Aspergillus fumigatus. Appl Microbiol Biotechnol. 2012;93(2):467–472. doi: 10.1007/s00253-011-3689-1. [DOI] [PubMed] [Google Scholar]
- 59.Gardiner DM, Howlett BJ. Bioinformatic and expression analysis of the putative gliotoxin biosynthetic gene cluster of Aspergillus fumigatus. FEMS Microbiol Lett. 2005;248:241–8. [DOI] [PubMed]
- 60.Schrettl M, Carberry S, Kavanagh K, Haas H, Jones GW, et al. Self-protection against gliotoxin—a component of the gliotoxin biosynthetic cluster, GliT, completely protects Aspergillus fumigatus against exogenous gliotoxin. PLoS Pathog 2010;6:e1000952. [DOI] [PMC free article] [PubMed]
- 61.Patron NJ, Waller RF, Cozijnsen AJ, Straney DC, Gardiner DM, et al. Origin and distribution of epipolythiodioxopiperazine (ETP) gene clusters in filamentous ascomycetes. BMC Evolutionary Biology 2007;7:174. [DOI] [PMC free article] [PubMed]
- 62.Bradshaw RE, Slot JC, Moore GG, Chettri P, de Wit PJ, et al. Fragmentation of an aflatoxin‐like gene cluster in a forest pathogen. New Phytologist 2013;198:525–35. [DOI] [PubMed]
- 63.Proctor R.H., Brown D.W., Plattner R.D., Desjardins A.E. Co-expression of 15 contiguous genes delineates a fumonisin biosynthetic gene cluster in Gibberella moniliformis. Fungal Genet Biol. 2003;38(2):237–249. doi: 10.1016/S1087-1845(02)00525-X. [DOI] [PubMed] [Google Scholar]
- 64.Tudzynski P., Hölter K., Correia T., Arntz C., Grammel N., Keller U. Evidence for an ergot alkaloid gene cluster in Claviceps purpurea. Mol Gen Genet. 1999;261(1):133–141. doi: 10.1007/s004380050950. [DOI] [PubMed] [Google Scholar]
- 65.Wang J., Galgoci A., Kodali S., Herath K.B., Jayasuriya H., Dorso K., Vicente F., González A., Cully D., Bramhill D., Singh S. Discovery of a Small Molecule That Inhibits Cell Division by Blocking FtsZ, a Novel Therapeutic Target of Antibiotics. J Biol Chem. 2003;278(45):44424–44428. doi: 10.1074/jbc.M307625200. [DOI] [PubMed] [Google Scholar]
- 66.Park J.H., Noh T.H., Wang H., Kim N.D., Jung J.H. Viriditoxin Induces G2/M Cell Cycle Arrest and Apoptosis in A549 Human Lung Cancer Cells. Nat Prod Sci. 2015;21(4):282. doi: 10.20307/nps.2015.21.4.282. [DOI] [Google Scholar]
- 67.Urquhart A.S., Hu J., Chooi Y.-H., Idnurm A. The fungal gene cluster for biosynthesis of the antibacterial agent viriditoxin. Fungal Biol Biotechnol. 2019;6(1) doi: 10.1186/s40694-019-0072-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Moake M.M., Padilla-Zakour O.I., Worobo R.W. Comprehensive Review of Patulin Control Methods in Foods. Comp Rev Food Sci Food Safety. 2005;4(1):8–21. doi: 10.1111/j.1541-4337.2005.tb00068.x. [DOI] [PubMed] [Google Scholar]
- 69.Artigot MP, Loiseau N, Laffitte J, Mas-Reguieg L, Tadrist S, et al. Molecular cloning and functional characterization of two CYP619 cytochrome P450s involved in biosynthesis of patulin in Aspergillus clavatus. Microbiology 2009;155:1738. [DOI] [PMC free article] [PubMed]
- 70.Kennedy J., Auclair K., Kendrew S., Park C., Vederas J. Modulation of polyketide synthase activity by accessory proteins during lovastatin biosynthesis. Science (New York, NY) 1999;284:1368–1372. doi: 10.1126/science.284.5418.1368. [DOI] [PubMed] [Google Scholar]
- 71.Limón M.C., Rodríguez-Ortiz R., Avalos J. Bikaverin production and applications. Appl Microbiol Biotechnol. 2010;87(1):21–29. doi: 10.1007/s00253-010-2551-1. [DOI] [PubMed] [Google Scholar]
- 72.Slot J.C., Rokas A. Horizontal Transfer of a Large and Highly Toxic Secondary Metabolic Gene Cluster between Fungi. Curr Biol. 2011;21(2):134–139. doi: 10.1016/j.cub.2010.12.020. [DOI] [PubMed] [Google Scholar]
- 73.Campbell MA, Rokas A, Slot JC. Horizontal transfer and death of a fungal secondary metabolic gene cluster. Genome biology and evolution 2012;4:289–93. [DOI] [PMC free article] [PubMed]
- 74.Bobrowicz P., Wysocki R., Owsianik G., Goffeau A., Ułaszewski S. Isolation of Three Contiguous Genes,ACR1,ACR2 andACR3, Involved in Resistance to Arsenic Compounds in the YeastSaccharomyces cerevisiae. Yeast. 1997;13(9):819–828. doi: 10.1002/(SICI)1097-0061(199707)13:9<819::AID-YEA142>3.0.CO;2-Y. [DOI] [PubMed] [Google Scholar]
- 75.Elmore MH, McGary KL, Wisecaver JH, Slot JC, Geiser DM, et al. Clustering of two genes putatively involved in cyanate detoxification evolved recently and independently in multiple fungal lineages. Genome Biol Evolution 2015;7:789–800. [DOI] [PMC free article] [PubMed]
- 76.Jain A., Kassner R. Cyanate binding to the ferric heme octapeptide from cytochrome c. A model for anion binding to high spin ferric hemoproteins. J Biol Chem. 1984;259:10309–10314. [PubMed] [Google Scholar]
- 77.Tran P.N., Yen M.-R., Chiang C.-Y., Lin H.-C., Chen P.-Y. Detecting and prioritizing biosynthetic gene clusters for bioactive compounds in bacteria and fungi. Appl Microbiol Biotechnol. 2019;103(8):3277–3287. doi: 10.1007/s00253-019-09708-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Gardiner DM, Cozijnsen AJ, Wilson LM, Pedras MSC, Howlett BJ. The sirodesmin biosynthetic gene cluster of the plant pathogenic fungus Leptosphaeria maculans. Molecular microbiology 2004;53:1307–18 [DOI] [PubMed]
- 79.Pinedo C., Wang C.-M., Pradier J.-M., Dalmais B., Choquer M., Le Pêcheur P., Morgant G., Collado I.G., Cane D.E., Viaud M. Sesquiterpene Synthase from the Botrydial Biosynthetic Gene Cluster of the Phytopathogen Botrytis cinerea. ACS Chem Biol. 2008;3(12):791–801. doi: 10.1021/cb800225v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Siewers V., Kokkelink L., Smedsgaard J., Tudzynski P. Identification of an Abscisic Acid Gene Cluster in the Grey Mold Botrytis cinerea. AEM. 2006;72(7):4619–4626. doi: 10.1128/AEM.02919-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Johnston M. A model fungal gene regulatory mechanism: the GAL genes of Saccharomyces cerevisiae. Microbiol Rev 1987;51:458. [DOI] [PMC free article] [PubMed]
- 82.Slot J.C., Rokas A. Multiple GAL pathway gene clusters evolved independently and by different mechanisms in fungi. Proc Natl Acad Sci. 2010;107(22):10136–10141. doi: 10.1073/pnas.0914418107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Jones S.A., Arst H.N., Jr., MacDonald D.W. Gene roles in the prn cluster of Aspergillus nidulans. Curr Genet. 1981;3(1):49–56. doi: 10.1007/BF00419580. [DOI] [PubMed] [Google Scholar]
- 84.Magliano P., Flipphi M., Sanglard D., Poirier Y. Characterization of the Aspergillus nidulans biotin biosynthetic gene cluster and use of the bioDA gene as a new transformation marker. Fungal Genet Biol. 2011;48(2):208–215. doi: 10.1016/j.fgb.2010.08.004. [DOI] [PubMed] [Google Scholar]
- 85.Hull E.P., Green P.M., Arst H.N., Scazzocchlo C. Cloning and physical characterization of the L-proline catabolism gene cluster of Aspergillus nidulans. Mol Microbiol. 1989;3(4):553–559. doi: 10.1111/j.1365-2958.1989.tb00201.x. [DOI] [PubMed] [Google Scholar]
- 86.Ámon J., Fernández-Martín R., Bokor E., Cultrone A., Kelly J.M., Flipphi M., Scazzocchio C., Hamari Z. A eukaryotic nicotinate-inducible gene cluster: convergent evolution in fungi and bacteria. Open Biol. 2017;7(12):170199. doi: 10.1098/rsob.170199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Flipphi M., Sun J., Robellet X., Karaffa L., Fekete E., Zeng A.-P., Kubicek C.P. Biodiversity and evolution of primary carbon metabolism in Aspergillus nidulans and other Aspergillus spp. Fungal Genet Biol. 2009;46(1):S19–S44. doi: 10.1016/j.fgb.2008.07.018. [DOI] [PubMed] [Google Scholar]
- 88.Koivistoinen O.M., Arvas M., Headman J.R., Andberg M., Penttilä M., Jeffries T.W., Richard P. Characterisation of the gene cluster for l-rhamnose catabolism in the yeast Scheffersomyces (Pichia) stipitis. Gene. 2012;492(1):177–185. doi: 10.1016/j.gene.2011.10.031. [DOI] [PubMed] [Google Scholar]
- 89.Schierová M., Linka M., Pažoutová S. Sulfate assimilation in Aspergillus terreus : analysis of genes encoding ATP-sulfurylase and PAPS-reductase. Curr Genet. 2000;38(3):126–131. doi: 10.1007/s002940000143. [DOI] [PubMed] [Google Scholar]
- 90.Gérecová G, Neboháčová M, Zeman I, Pryszcz LP, Tomáška Ľ, et al. Metabolic gene clusters encoding the enzymes of two branches of the 3-oxoadipate pathway in the pathogenic yeast Candida albicans. FEMS Yeast Research 2015;15. [DOI] [PubMed]
- 91.Kappel L., Gaderer R., Flipphi M., Seidl‐Seiboth V. The N ‐acetylglucosamine catabolic gene cluster in Trichoderma reesei is controlled by the Ndt80‐like transcription factor RON1. Mol Microbiol. 2016;99(4):640–657. doi: 10.1111/mmi.13256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Liao Y., Ling J., Zhang G., Liu F., Tao S., Han Z., Chen S., Chen Z., Le H. Cordycepin induces cell cycle arrest and apoptosis by inducing DNA damage and up-regulation of p53 in Leukemia cells. Cell Cycle. 2015;14(5):761–771. doi: 10.1080/15384101.2014.1000097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Tuli H.S., Sharma A.K., Sandhu S.S., Kashyap D. Cordycepin: A bioactive metabolite with therapeutic potential. Life Sci. 2013;93(23):863–869. doi: 10.1016/j.lfs.2013.09.030. [DOI] [PubMed] [Google Scholar]
- 94.Xia Y., Luo F., Shang Y., Chen P., Lu Y., Wang C. Fungal Cordycepin Biosynthesis Is Coupled with the Production of the Safeguard Molecule Pentostatin. Cell Chem Biol. 2017;24(12):1479–1489.e4. doi: 10.1016/j.chembiol.2017.09.001. [DOI] [PubMed] [Google Scholar]
- 95.Wiemann P., Guo C.-J., Palmer J.M., Sekonyela R., Wang C.C.C., Keller N.P. Prototype of an intertwined secondary-metabolite supercluster. Proc Natl Acad Sci. 2013;110(42):17065–17070. doi: 10.1073/pnas.1313258110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Arnone J.T., McAlear M.A. Adjacent Gene Pairing Plays a Role in the Coordinated Expression of Ribosome Biogenesis Genes MPP10 and YJR003C in Saccharomyces cerevisiae. Eukaryot Cell. 2011;10(1):43–53. doi: 10.1128/EC.00257-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Wade C.H., Umbarger M.A., McAlear M.A. The budding yeast rRNA and ribosome biosynthesis (RRB) regulon contains over 200 genes. Yeast. 2006;23(4):293–306. doi: 10.1002/yea.1353. [DOI] [PubMed] [Google Scholar]
- 98.Arnone J.T., Robbins-Pianka A., Arace J.R., Kass-Gergi S., McAlear M.A. The adjacent positioning of co-regulated gene pairs is widely conserved across eukaryotes. BMC Genomics. 2012;13(1):546. doi: 10.1186/1471-2164-13-546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Ben-Shitrit T., Yosef N., Shemesh K., Sharan R., Ruppin E., Kupiec M. Systematic identification of gene annotation errors in the widely used yeast mutation collections. Nat Methods. 2012;9(4):373–378. doi: 10.1038/nmeth.1890. [DOI] [PubMed] [Google Scholar]
- 100.Atias N, Kupiec M, Sharan R. Systematic identification and correction of annotation errors in the genetic interaction map of Saccharomyces cerevisiae. Nucleic Acids Res 2016;44:e50–e50. [DOI] [PMC free article] [PubMed]
- 101.Wu X.-L., Li B.-Z., Zhang W.-Z., Song K., Qi H. Genome-wide landscape of position effects on heterogeneous gene expression in Saccharomyces cerevisiae. Biotechnol Biofuels. 2017;10:1–10. doi: 10.1186/s13068-017-0872-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Martens J.A., Laprade L., Winston F. Intergenic transcription is required to repress the Saccharomyces cerevisiae SER3 gene. Nature. 2004;429:571–574. doi: 10.1038/nature02538. [DOI] [PubMed] [Google Scholar]
- 103.Martens J.A., Wu P.-Y.-J., Winston F. Regulation of an intergenic transcript controls adjacent gene transcription in Saccharomyces cerevisiae. Genes Dev. 2005;19:2695–2704. doi: 10.1101/gad.1367605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Bosio M.C., Negri R., Dieci G. Promoter architectures in the yeast ribosomal expression program. Transcription. 2011;2:71–77. doi: 10.4161/trns.2.2.14486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Arnone J.T., Arace J.R., Soorneedi A.R., Citino T.T., Kamitaki T.L. Dissecting the cis and trans elements that regulate adjacent-gene coregulation in Saccharomyces cerevisiae. Eukaryot Cell. 2014;13:738–748. doi: 10.1128/EC.00317-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Xu Z., Wei W., Gagneur J., Perocchi F., Clauder-Münster S. Bidirectional promoters generate pervasive transcription in yeast. Nature. 2009;457:1033–1037. doi: 10.1038/nature07728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Dobi K.C., Winston F. Analysis of transcriptional activation at a distance in Saccharomyces cerevisiae. Mol Cell Biol. 2007;27:5575–5586. doi: 10.1128/MCB.00459-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Cera A., Holganza M.K., Hardan A.A., Gamarra I., Eldabagh R.S. Functionally Related Genes Cluster into Genomic Regions That Coordinate Transcription at a Distance in Saccharomyces cerevisiae. Msphere. 2019;4 doi: 10.1128/mSphere.00063-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Venables W.N., Ripley B.D. Springer Science & Business Media; 2013. Modern applied statistics with S-PLUS. [Google Scholar]
- 110.Kim D., Kim J.-M. Analysis of directional dependence using asymmetric copula-based regression models. J Stat Comput Simul. 2014;84:1990–2010. [Google Scholar]
- 111.Kim J.-M., Jung Y.-S., Soderberg T. Directional dependence of genes using survival truncated FGM type modification copulas. Commun Statistics-Simulation Computation. 2009;38:1470–1484. [Google Scholar]
- 112.Kim J.-M., Jung Y.-S., Sungur E.A., Han K.-H., Park C. A copula method for modeling directional dependence of genes. BMC Bioinf. 2008;9:225. doi: 10.1186/1471-2105-9-225. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Elowitz M.B., Levine A.J., Siggia E.D., Swain P.S. Stochastic gene expression in a single cell. Science. 2002;297:1183–1186. doi: 10.1126/science.1070919. [DOI] [PubMed] [Google Scholar]
- 114.Blake W.J., Kærn M., Cantor C.R., Collins J.J. Noise in eukaryotic gene expression. Nature. 2003;422:633–637. doi: 10.1038/nature01546. [DOI] [PubMed] [Google Scholar]
- 115.McGary K.L., Slot J.C., Rokas A. Physical linkage of metabolic genes in fungi is an adaptation against the accumulation of toxic intermediate compounds. Proc Natl Acad Sci. 2013;110:11481–11486. doi: 10.1073/pnas.1304461110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Xu H., Liu J.-J., Liu Z., Li Y., Jin Y.-S. Synchronization of stochastic expressions drives the clustering of functionally related genes. Sci Adv. 2019;5:eaax6525. doi: 10.1126/sciadv.aax6525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Warner J.R. The economics of ribosome biosynthesis in yeast. Trends Biochem Sci. 1999;24:437–440. doi: 10.1016/s0968-0004(99)01460-7. [DOI] [PubMed] [Google Scholar]
- 118.Li G.-W., Burkhardt D., Gross C., Weissman J.S. Quantifying absolute protein synthesis rates reveals principles underlying allocation of cellular resources. Cell. 2014;157:624–635. doi: 10.1016/j.cell.2014.02.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Sorger P.K., Pelham H. Purification and characterization of a heat-shock element binding protein from yeast. EMBO J. 1987;6:3035–3041. doi: 10.1002/j.1460-2075.1987.tb02609.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Tye B.W., Commins N., Ryazanova L.V., Wühr M., Springer M. Proteotoxicity from aberrant ribosome biogenesis compromises cell fitness. Elife. 2019;8 doi: 10.7554/eLife.43002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Keller N.P. Translating biosynthetic gene clusters into fungal armor and weaponry. Nat Chem Biol. 2015;11:671–677. doi: 10.1038/nchembio.1897. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Duveau F., Hodgins-Davis A., Metzger B.P., Yang B., Tryban S. Fitness effects of altering gene expression noise in Saccharomyces cerevisiae. Elife. 2018;7 doi: 10.7554/eLife.37272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Arnone J.T. Genomic Considerations for the Modification of Saccharomyces cerevisiae for Biofuel and Metabolite Biosynthesis. Microorganisms. 2020;8:321. doi: 10.3390/microorganisms8030321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Baur J.A., Zou Y., Shay J.W., Wright W.E. Telomere position effect in human cells. Science. 2001;292:2075–2077. doi: 10.1126/science.1062329. [DOI] [PubMed] [Google Scholar]
- 125.Tennen R.I., Bua D.J., Wright W.E., Chua K.F. SIRT6 is required for maintenance of telomere position effect in human cells. Nat Commun. 2011;2:1–7. doi: 10.1038/ncomms1443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Kück U., Pöggeler S., Nowrousian M., Nolting N., Engh I. Springer; 2009. Sordaria macrospora, a model system for fungal development. Physiology and Genetics; pp. 17–39. [Google Scholar]