Abstract
G-rich nucleic acids can form non-canonical G-quadruplex structures (G4s) in which four guanines fold in a planar arrangement through Hoogsteen hydrogen bonds. Although many biochemical and structural studies have focused on DNA sequences containing successive, adjacent guanines that spontaneously fold into G4s, evidence for their in vivo relevance has recently begun to accumulate. Complete sequencing of the human genome highlighted the presence of ∼300 000 sequences that can potentially form G4s. Likewise, the presence of putative G4-sequences has been reported in various viruses genomes [e.g., Human immunodeficiency virus (HIV-1), Epstein–Barr virus (EBV), papillomavirus (HPV)]. Many studies have focused on telomeric G4s and how their dynamics are regulated to enable telomere synthesis. Moreover, a role for G4s has been proposed in cellular and viral replication, recombination and gene expression control. In parallel, DNA aptamers that form G4s have been described as inhibitors and diagnostic tools to detect viruses [e.g., hepatitis A virus (HAV), EBV, cauliflower mosaic virus (CaMV), severe acute respiratory syndrome virus (SARS), simian virus 40 (SV40)]. Here, special emphasis will be given to the possible role of these structures in a virus life cycle as well as the use of G4-forming oligonucleotides as potential antiviral agents and innovative tools.
FROM TETRADS OF GUANOSINE TO G-QUADRUPLEXES: DISCOVERY AND TOPOLOGY
Almost a century ago, the ability of guanosine, but not guanine, to form viscous gels was described (1). Fifty years later, X-ray diffraction data clearly showed that the guanosine moieties in these gels were arranged in a tetrameric organization linked by eight Hoogsteen hydrogen bonds (Figure 1) (2,3). These hydrogen bonds differ from the bonds observed in canonical Watson–Crick pairing and involve the interaction of the N7 group from one guanine with the exocyclic amino group from a neighboring base (Figure 1a). Therefore, a G-tetrad or a G-quartet results from planar association between four guanines that are held together by eight hydrogen bonds and coordinated with a central Na+ or K+ cation (4–8). In addition, nucleoside derivatives were also used to confirm the structural properties of G-quartets (9–14).
Conversely, a G-quadruplex or G4 is formed by nucleic acid sequences (DNA or RNA) containing G-tracts or G-blocks (adjacent runs of guanines) and composed of various numbers of guanines. Depending on the nucleotide sequence, the way G4s can be formed presents a high degree of diversity. The core of a G4 is based on stacking between two or more G-tetrads, wherein the guanines can adopt either a syn or an anti glycosidic bond angle conformation. Consequently, each of the four G-tracts that form the core of the structure can run in the same or opposite direction with respect to its two neighbors, forming parallel, anti-parallel or hybrid core conformations. Depending on these orientations, the G-blocks delimit four negatively charged grooves of different sizes: narrow, medium or wide (Figure 1b–e). For intra-molecular structures (Figure 1b and c), the four G-tracts belong to the same oligonucleotide and are attached by linkers with variable nucleotide sequences and lengths. These loops can adopt three different conformations: lateral, diagonal or propeller (Figure 1b–d). The bi- or tetra-molecular G4 structures (Figure 1d-e) are assembled from G-tracts belonging to two or four different strands. The G-blocks can also be interrupted by one to seven non-G nucleotides, which result in bulges that protrude from the G4 core (Figure 1e). In contrast to the almost mono-morphic canonical duplex, these variable structural parameters are directly related to the nucleotide primary sequence. This unique family of globular-shaped nucleic acid structures (Figure 1f–j) presents a high level of plasticity that enables various applications (see ‘G4s as antiviral agents’ section).
However, a potential physiological role for G4s has remained controversial for many years. Nonetheless, interest in G-quaduplexes has increased over time, with thousands of reports and reviews published on several aspects of G4s, including the biophysical and chemical characteristics as well as biological function in prokaryotes and eukaryotes (5,15–24). With an increasing volume of sequencing data, databases and algorithms have also been developed to enable search and mapping of G4 in a few mammalians as well as hundreds of bacterial species (25–28).
Additionally, G4s present an intrinsic resistance to ‘regular’ nucleases. However, G4 specific nucleases have been isolated, first in yeast with KEM1/SEP1 (29) and later in humans with GQN1 (30). Later, a DNA binding protein (G4R1/RHAU) that can unfold RNA/DNA quadruplexes was isolated in human cells (31,32), which further highlights the biological relevance of G4s. Finally, development of G4-specific probes such as monoclonal antibodies (33–35) and small chemical ligands (36–41) facilitated in vivo studies that support quadruplex formation in cells. Taken together, G4s are likely present in several significant genomic regions and may be a key component in important cellular processes (42,43).
BIOLOGICAL ROLES OF G4S IN THE GENOME
The sequencing of the human genome highlighted the presence of many sequences enriched in guanines that can potentially form quadruplexes [for a review, see (44)]. In theory, more than 350 000 putative G-tetrads may be formed with loops of 1–7 nucleotides, and over 700 000, with loops of up to 12 nucleotides. However, the genomic DNA is dynamic, and quadruplexes, duplexes or other structural forms at those sites will be influenced by chromatin and other DNA-binding proteins. Thus, G4 formation depends on the cell type and cell cycle, ultimately impacted by environmental conditions and stresses.
The telomeres
Without clear signaling and tight regulation, extremities present in linear chromosomes could be recognized as damage DNA and it would be deleterious for the cell if processed as such by repair mechanisms (45). Telomere length has also been linked to the lifespan of organisms. Telomeres are nucleoprotein structures found at the end of chromosomes protecting the genome from instability. This terminal region of chromosomes contains long tracts (several kilobases) of double stranded TTAGGG:CCCTAA repeats ending with a 3′ protrusion of single stranded TTAGGG repeats (10–50 repeats). Telomerase is a ribonucleoprotein enzyme that adds TTAGGG repeats to the 3′ end of DNA strands in the telomere regions. Human telomerase is a heterodimeric complex composed of (i) telomerase reverse transcriptase (TERT), a DNA-polymerase RNA-dependent, (ii) a single-stranded RNA template referred to as telomerase RNA component (TERC) and (iii) dyskerin [dyskeratosis congenita 1 (DKC1)], a pseudouridine synthase binding TERC through the H/ACA motif and stabilizing the telomerase complex (46,47).
Accumulated experimental data indicate the presence of G4s in telomeric DNA (6,48–52). For example, DNA2, a helicase/nuclease that cleaves G4s, is involved in maintaining telomere integrity (53,54). Altogether, these observations clearly establish that telomeric G4s are crucial structures for regulating telomere maintenance, thus providing a mechanism for controlling cell proliferation (55,56). At the 3′ ends of telomeres, if the G-rich overhang is longer than four TTAGGG repeats (>23 nucleotides), it can fold over itself and form secondary structures, including G4s (Figure 2a). Those structures prevent telomere elongation by the telomerase complex (57). Accordingly, small G4-ligands (e.g. telomestatin) and G4-binding proteins (e.g. TRF2) show anti-proliferative and potential anti-tumor activities through telomere interference (36,58–61). However, the exact mechanism is more complicated as telomestatin derivatives present telomerase independent activity, most likely through targeting G4s involved in tumor growth elsewhere in the genome (62–64).
DNA replication
DNA replication is highly regulated, which ensures faithful reproduction of genetic information through each cell cycle. In eukaryotes, this process is initiated at thousands of DNA replication origins (ORIs) distributed along each chromosome (65). Initially, a pre-replicative complex (Pre-RC) is assembled during late mitosis and the early G1 phase; it evolves during the S phase to a pre-initiation complex through initiator protein activity (e.g., cyclin‐dependent kinases, Cdc7, Cdc45 and minichromosome maintenance complex (MCM) proteins). This step involves unwinding the origin and DNA polymerase recruitment. A tissue-specific temporal system exists, and only a few origins undergo ‘firing’ (replication fork elongation). The remaining origins are dormant backups used under stress (e.g., DNA damage and collisions with transcription machinery). Studies on organisms ranging from drosophila to humans have highlighted that repeated elements are enriched with G residues (66). In vitro, origin G-rich repeated element (OGRE) sequences form G4s, and point mutations that affect the stability of these G4s also impair origin function. Even if a 200-bp cis-regulatory element is necessary for efficient initiation, G4 structure formation at ORIs might be the key to selecting the firing origins (67,68). Additionally, G4s formed in the lagging strand can lead to the stalling of the replication fork (Figure 2d). On the other side, formation of G4s in the leading strand can displaced it from the template and causes polymerase slippage (Figure 2d). Such events could participate to the unusual expansions of G4 forming mini-satellite sequences (69).
Gene expression
Altered gene expression levels are implicated in many human diseases. G4s have been found in the promoter region of many genes and implicated in transcription and protein translation regulation [for a review, see (70)]. Particularly interesting, the P1 region, which is upstream of the c-myc promoter, is highly sensitive to DNase I and S1 nucleases (nuclease-hypersensitive element or NHE). This purine-rich sequence can form an intramolecular G4 (71–74), which structure has been determined by nuclear magnetic resonance (NMR) (75). This NHEIII1 region controls up to 90% of gene transcription activation, and the G4 acts as a transcriptional repressor element (76,77). Because interactions between DNA-binding proteins and local DNA supercoiling can impact the equilibrium between duplex/single-stranded DNA (active transcription) and quadruplex DNA (silent), c-myc expression and, ultimately, cell proliferation may be modulated. Nucleolin (a nucleolar phosphoprotein) and ADAR1 (a Z-DNA binding/RNA editing protein) both bind the c-myc promoter in vivo and stabilize the G4 structure of the promoter (78,79).
Due to the high therapeutic potential of genomic G4s, studies have also been performed on promoters of other genes implicated in cancer biology. For example, KRAS is one of the most frequently mutated genes in human cancer. Its promoter contains a G-rich nuclease hypersensitive element that is critical for transcription (80). The polypurine strand forms G4 structures that bind crucial DNA repair proteins (e.g., PARP-1, Ku70) (81). Direct evidence for G4 formation was also reported for the proximal promoter region of the RET proto-oncogene, and targeting this region with a small molecule represses RET proto-oncogene transcription (82).
Although many G-rich sequences have been identified in the genome, they enable generation of corresponding G-rich RNAs when the sequences are located in transcription units. Ultimately, these RNAs can fold into quadruplex structures and impact gene expression (83). In 5′-untranslated regions (5′-UTRs), G4s can act as translation repressors (84–86). G4 structures have also been proposed to regulate translation driven by IRES domains in the absence of an mRNA cap [for a recent review, see (87) and the references therein]. Finally, it has recently been shown that G4 RNAs in coding regions can stimulate ribosomal frame-shifts in vitro and in cultured cells providing a role for G4 in translational regulation (88,89).
Genome stability
A major challenge for future studies is to better understand genome instability, cancer and severe neurological/neuromuscular defects origins. Repeated sequences, such as polynucleotide repeats (three and above), mini- and mega-satellites, are source of genome instability through either expansion or contraction (90). However, DNA secondary structures associated with those tandem repeats may be crucial elements in the expansion phenomenon. For progressive myoclonus epilepsy type-1 (EPM1), expansion of the dodecamer sequence (CGCG4CG4) within the cystatin B promoter is thought to depend on G4 formation (91).
In addition to this potential deleterious consequence, G4s have also been implicated in development of specific immune responses. In antigen-activated B cells, class switch recombination promotes deletion of a DNA fragment (several kilobases) that joins a new constant region to the variable immunoglobulin chain (92). These switch regions (S regions) are intronic, G-rich and repetitive sequences that form G-loops upon transcription (Figure 2e). In vivo, class switch recombination depends on the activity of AID (a cytidine deaminase) and MutSα (MSH2/MSH6, and conserved DNA repair factors involved in mismatch repair). AID promotes creation of U:G mismatches, and MutSα binds with high affinity the G4 DNA formed upon transcription of the S regions.
In parallel, pathogens have also developed counter-measures to evade the immune system. The role of G-quartets in this evasion is related to antigenic variation. For Plasmodium falciparum, G-rich sequences have been found in the upstream region of group B var genes that can form in vitro stable G4s (93). Similarly, a G4-forming sequence (5′-G3TG3TTG3TG3) is located upstream of the pilE expression locus in Neisseria gonorrhoeae (94). Formation of G4s is necessary for initiating this pilin antigenic variation through recombination, and interactions with RecA could facilitate this specialized reaction. Overall, immune evasion and immunoglobulin gene class switch recombination in vertebrates are clearly analogous mechanistically (95).
THE ROLE OF G4S IN THE VIRUS LIFE CYCLE
Human immunodeficiency virus
Searches for G4s and elucidation of their function in the viruses’ genome have mainly focused on the human immunodeficiency virus (HIV), which causes acquired immunodeficiency syndrome (AIDS). Currently, ∼35 million people are infected with HIV worldwide. With over 2 million new infections and ∼1.6 million deaths from AIDS per year, the pandemic continues to spread. Even without a vaccine, development of highly active anti-retroviral therapies have allowed people to live with HIV as a chronic disease (96). However, viruses within T cells remain fully capable of replicating and infecting other cells if the drug pressure is removed or when resistance emerges. Thus, new drugs must be developed to overcome the treatment's genetic barrier.
HIV-1 is an RNA virus in the Lentivirus genus and is part of the Retroviridae family. Lentiviruses are single-stranded, positive-sense, enveloped RNA viruses. HIV-1 particles contain two molecules of genomic RNA that are converted into double-stranded DNA by the viral reverse transcriptase (RT). The resulting viral DNA is then imported into the nucleus and insertion into the cellular DNA is catalyzed by the virally encoded integrase (IN). Once integrated, transcription from the viral promoter at the 5′ long terminal repeat generates mRNAs that code various viral proteins and genomic RNA. Alternatively, the provirus may become latent, which allows the virus and its host cell to avoid detection by the immune system. The presence of G4 structures has been highlighted at both RNA and DNA levels with implications throughout the viral life cycle.
HIV-1 RNA dimerization and recombination
Retroviral RNAs dimerize in the cytoplasm of an infected cell allowing two copies of the genome to be encapsidated in the newly produced virion (97). While a single copy of the genome is sufficient for viral replication, the second copy is also used during reverse transcription, and the viral RT switches multiple times between the two RNA molecules (98,99). The strand transfers are partially responsible for the viral variability through production of recombinant molecules. Therefore, understanding the mechanisms that drive dimerization and recombination is essential.
Dimerization is a two-step process that involves sequences upstream of the splice donor site (100,101). The sequences involved in initial dimerization and encapsidation partially overlap at the 5′ end of the viral genome. One of the sequences is a highly conserved dimer initiation site (DIS) that forms a stem loop. A concentration-dependent kissing–loop interaction is initiated from contacts between consecutive guanines (102); the interaction then spreads to the stems. However, this interaction does not seem sufficiently strong to keep the two copies together during reverse transcription.
Several studies have identified a G-rich sequence that form bi-molecular G4 structures in the gag region of the HIV-1 genome, near the DIS (103–105). Localization of these G4-forming sequences correlates with recombination hot spots and exhibits an increased rate of template switching that highlights a potential role for these structures (106). Supporting this hypothesis, recombination in the U3 domain is cation-dependent and is lower in the presence of Li+, which is a metal ion that fails to stabilize G4s (106).
Short RNA templates from the central region of the HIV-1 genome contain G-rich sequences near the central poly-purine tract (cPPT) at the 3′ end of the pol gene (IN coding sequence); this is a region where one of the two primers used for synthesizing the (−) strand DNA is produced during reverse transcription. These sequences can form both intramolecular and dimeric G4 structures. Moreover, reconstituted systems have confirmed that G4 structures near the cPPT facilitate strand transfer and promote template switching by the RT (107). Interestingly, certain elements from the cPPT region are involved in forming the cPPT flap, which is a region that plays an important role in nuclear entry of the double-stranded DNA (108,109). Taken together, the G-rich regions located at the 5′ end of the genome and in the central region are likely maintained in proximity through inter-RNA G4 formation with a crucial role in HIV-1 replication.
HIV-1 nucleocapsid: G4 chaperone properties
Retroviral nucleocapsid proteins (NCp) are multifunctional elements encoded in the gag gene. Notably, NCp participate in many retroviral cycle steps by remodeling nucleic acid structures to favor thermodynamically stable conformations. They are referred to as nucleic acid chaperones and interact with nucleic acid phosphodiester backbones through electrostatic interactions thanks to basic residues (especially residues in the N-terminus) [for reviews, see (110,111)]. Moreover, HIV-1 NCp (NCp7) exhibits sequence-specific binding to runs of Gs, UGs or TGs through interactions involving its two zinc fingers (CCHC motifs separated by a proline-rich linker). Although there is no doubt that NCp7 tightly binds G4 sequences, data in the literature shows that a hydrophobic interaction engaged by the C-terminal zinc finger of the protein may lead to G4 stabilization (112), while high concentration of NCp7 promotes G4 unfolding (113). A recent biophysical study using high-speed atomic force microscopy (HS-AFM) addressed direct and real-time investigations on the molecular chaperone activity at the single-molecule level (114). NCp7 can efficiently promote bimolecular G4 formation and is able to anneal the G4 structures. The G4 structure is induced by both unprocessed NCp15 and mature NCp7, which indicates that both proteins may participate in genome recognition, recombination, dimerization and packaging. NCp7 could act through a potential mechanism that involves synaptic G4 intermediates, as illustrated in Figure 3 (115).
Regulation of HIV-1 promoter activity
Transcription of the viral DNA is performed by the cellular RNA polymerase II from the viral promoter located in the 5′-Long terminal repeat (LTR) of the proviral genome. The U3 region (Figure 4a) contains a G-rich sequence 50 nucleotides upstream from the transcription-starting site (TSS) and close to the TATA box (Figure 4b). This sequence overlaps with the so-called minimum promoter, which is composed of three SP1 as well as two NF-kB binding sites and is crucial for transcription initiation. The presence of eight blocks of guanines suggests that this region is a good candidate for G4 formation.
Two independent biophysical studies based on DMS-mediated foot-printing assays or NMR recently evaluated the ability of this 45-nt G-rich sequence to form G4 structures (116,117). DNA fragments spanning this region (corresponding to two or three SP1 or NF-kB binding sites with oligonucleotides ranging from 19 to 32 bases) were all able to form stable parallel and anti-parallel G4 topologies with melting temperatures ranging from 43°C to 56°C in 100-mM KCl solutions. The three models are formed with rather long sequences (up to 32 nucleotides), allowing the formation of long loops of 5 to 12 nucleotides (Figure 4 d, e and f). Interestingly, a G4 structure was also observed in the NMR study with an anti-parallel G4 core composed of only two tetrads and an additional Watson–Crick base pair that stacks on top of the upper tetrad (Figure 4g). Notably, these four topologies are mutually exclusive, and forming one of these G4s in the promoter will prevent formation of the three alternative conformations. Thus, the equilibrium between these forms may play a role in regulating promoter activity.
Recently, the interaction between the SP1 protein and a fragment of the HIV-1 promoter sequence folded into a G4 was studied (107). Piekna-Przybylska et al. used an affinity-based selection approach using biotinylated G4s immobilized on streptavidin-coated magnetic beads. The pull down experiments followed by western blotting revealed that the SP1 protein can bind the HIV-1 promoter sequence when it adopts a G4 conformation. Perrone et al. analyzed the effect of point mutations that disrupt the G4 structures formed in the promoter (116). The wild-type (WT) and mutated LTR promoters were cloned upstream of a firefly gene in a promoter-free plasmid. In HEK 293T cells, the promoter activity of the mutated sequences (unable to form G4) was twice as high as the WT LTR. These data suggest that G4s act as repressor elements in the transcriptional activation of HIV-1. Therefore, G4 structures might be critical for HIV-1 fitness and represent novel targets for antiviral drug development (see below). A better understanding of the role for G4s in regulating the HIV-1 promoter activity would also shed light on HIV-1 latency and reactivation mechanisms, from which a new landscape may emerge in clinical research to eradicate HIV from reservoirs (118).
G4s and HIV-1 proteins expression
A G-rich sequence composed of three conserved clusters has been identified in the reading frame of the negative regulatory factor (Nef) (119). This coding sequence is located at the 3′-end of the genome and overlaps with the 3′-LTR (Figure 4a). Isolated nef G-stretches can form G4s in vitro. Moreover, their stabilization represses Nef expression and decreases viral replication. Nef has multiple roles during HIV infection and has been implicated in immune evasion by promoting CD4 and major histocompatibility complex (MHC) molecule down-regulation. Importantly, the absence of Nef seems to correlate with low viral load and inhibition of disease progression. Thus, targeting the G4s located in the Nef coding sequence is a new attractive therapeutic opportunity.
Simian virus 40 (SV40)
SV40 is a polyomavirus with a 5-kb, closed circular and double-stranded DNA genome that codes for six proteins and includes a non-coding regulatory region (NCRR). This latter regulatory region contains the ORI and the encapsidation sequence (ses) but also controls the transcription direction (early versus late transcription). Notably, six GC boxes (GGGCGG) are present in this region, which can form an unusual quadruplex structure containing a C-tetrad stacked between two G-tetrads as determined by NMR (120). These repeated sequences are binding motifs for SP1 and, therefore, play an important role in early transcription. On the other hand, SV40 genome replication requires the large T-antigen (TAg), which is a multifunctional protein that binds the ORI, has adenosine triphosphate-dependent helicase activity and interacts with cellular proteins such as p53 and Rb (121). Interestingly, TAg can unwind G4 DNA structures (122,123); thus, it might play a crucial role in regulating replication as well as early and late transcription. Perylene di-imide derivatives (PDI) stabilize G4 structures and inhibit both the G4 and TAg duplex DNA helicase activities. Hence, PDI provide tools for probing the role of the G4 helicase activity in SV40 replication (123,124) and introduces new insights into the link between helicases and tumorigenesis (125) or other human genetic diseases (126).
Human papillomaviruses
The human papillomaviruses (HPV) family consists of more than 120 viruses; approximately half are sexually transmissible and 15 are considered high risk due to their carcinogenic properties. These viruses are one of the most common sexually transmitted infections and induce cervical cancer. Notably, HPV16 and HPV18 contribute to 70% of cervical cancer induced by HPV infections. The HPV genome consists of 8 kb, circular, double-stranded DNA and integration into the infected cell induces dramatic genome instability (127). The open reading frames encode six ‘early’ proteins (E1, E2, E4, E5, E6 and E7) and two ‘late’ structural proteins (L1 and L2). The late proteins are major and minor capsids, respectively, and association with 72 L1/L2 heterodimers forms star-shaped capsomeres. However, L1 can self-assemble into 55–60 nanometers, virus-like particles, which can be used in prophylactic HPV vaccines to protect against an initial HPV infection.
G-rich regions have been found in HPV genomes, and their potential to fold into a G4 structure has been described (128). G-rich loci that fulfill the criteria for G4 formation have only been found in eight types of HPV. However, a strong argument for the relevance of G4s in HPV biology is that the viral protein E1 is a helicase that resembles SV40 TAg (121). Consequently, E1 may also present a G4 unwinding activity. For HPV52 and HPV58, potential G4-forming sequences are located in the long control region (LCR), which is a regulatory sequence composed of nearly 1 kb, suggesting a potential role in transcription and replication. The presence of G4s in the sequence coding for the L2 protein (HPV57), E1 (HPV32, HPV42) and E4 (HPV3, HPV9, HPV25) suggests that G4 formation may also alter alternative splicing necessary for producing viral proteins from the overlapping Open reading frame (ORFs). Targeting these G4s could potentially serve as a basis for novel antiviral therapies.
Epstein–Barr virus
Also referred to as human herpesvirus 4 (HHV-4), Epstein–Barr virus (EBV) is a virus of the herpes family that can induce infectious mononucleosis and cancers, such as Hodgkin's lymphoma, Burkitt's lymphoma and nasopharyngeal carcinoma (129). In the case of HIV-1 co-infection, EBV is also associated with hairy leukoplakia and central nervous system lymphoma. The virus is ∼120 to 180 nm in diameter and is composed of a 172 kb DNA double helix that circularizes upon entry into the nucleus and becomes a viral episome. Human herpes viruses are mostly asymptomatic due to latency.
The Epstein–Barr virus encodes a genome maintenance protein (nuclear antigen 1, EBNA1), expressed in all EBV-associated malignancies. EBNA1 binds G-rich sequences at the viral replication origin, recruits the replication complex and is involved in metaphase chromosome attachment, which insures maintenance throughout mitosis (130). Thus, formation of secondary structures, such as G4s, may play a role in regulating EBV replication.
The level of EBNA1 synthesized is tightly controlled; it is sufficiently high to maintain viral infection but sufficiently low to avoid immune recognition by the host's virus-specific T cells. Regulation occurs during translation due to secondary structures present in the mRNA (131). Destabilization of the G4-forming sequence through antisense oligonucleotide annealing increases the translation rate and, consequently, promotes antigen presentation. On the other hand, stabilization of the G4 with a G4 ligand (pyridostatin) decreases EBNA1 synthesis and allows immune evasion. Interestingly, G4s are similarly observed in mRNA for other maintenance genes in the gammaherpesvirus family. These findings suggest that this mode of translational regulation may be more general among proteins that self-regulate synthesis. In addition, one could imagine alternative therapeutic strategies focused on targeting RNA structures within viral ORFs to interfere with the virus cycle as well as to promote antigen presentation and to stimulate the host immune response.
Targeting viral G4s
Recently, a G4 ligand called quarfloxin (CX-3543) entered phase II clinical trials. It is capable to suppress MYC transcription by inhibiting the interaction between nucleolin and the c-myc G4 motifs (132). Even if CX-3543 failed trials because of bioavailability issues, original anti-cancer applications based on such ligands seem underway (133). Taken together, all these studies suggest that targeting these G4s could potentially serve as a basis for novel antiviral therapies. Hence, as described for cellular targets (134), small ligands that can stabilize the G4 structure may compose a potential new class of therapeutic agents to fight viral infections (62,135) [for reviews, see (136,137)]. Recent studies showed that G4 ligands, such as BRACO-19 and Tmpyp4, inhibited HIV-1 replication (116,119). Exhaustive viral assays demonstrated that BRACO-19 acts at both reverse-transcription and post-integration steps by targeting of the viral promoter (138).
Several laboratories are already working to identify G4 ligands that preferentially interact with the pathogen's G4 over cellular G4s. This new antiviral strategy presents many advantages: (i) these compounds target viral DNAs and RNAs, which are the source of the disease; (ii) high conservation of these targets across subspecies suggests that they are important for viruses and (iii) mutations will likely impact viral fitness, limiting the emergence of resistant strains.
G4S AS ANTIVIRAL AGENTS
Nucleic acids have already been validated as therapeutics (139). The first example is Formivirsen (Vitravene®), a phosphorothioate anti-sense oligonucleotide approved from 1998 to 2002 to treat retinitis caused by Cytomegalovirus (CMV) (140). The second is Pegaptanib (Macugen®), used in the clinic from 2004 to 2011. This 2′-fluoropyrimidine RNA-based aptamer targets VEGF to treat neovascular age-related macular degeneration (141). However, those two drugs have been withdrawn due to the insufficient benefice associated with their use. Regarding G4s, certain G4 oligonucleotides present therapeutic potential, including the thrombin-binding aptamer (TBA, G2T2G2TGTG2T2G2) (142,143). The most promising G4 to date is AS1411 (AGRO100), a 26 bases oligonucleotide targeting nucleolin with anti-proliferative properties (144). AS1411 recently completed phase II clinical trials as anti-cancer drug (NCT00740441) with low toxicity, highlighting the high therapeutic potential of G4 (145).
For most applications, G4-forming molecules have been selected through combinatorial methods (e.g. pegaptanib, anti-TBA). Two key protocols have been developed. First, polynucleotide arrays facilitated rapid detection of molecules with an affinity to the target. Then, the development of SELEX (systematic evolution of ligands by exponential enrichment) facilitates isolation of oligonucleotide sequences with the capacity to recognize virtually any class of target molecule with high affinity and specificity (146). These oligonucleotides, ‘aptamers’, are emerging as a class of molecules that rival antibodies in both therapeutic and diagnostic applications (147,148). In this section, we describe a few examples of aptamers specifically selected to interact with viral components and interfere with viral replication.
Hepatitis A virus
Hepatitis A virus (HAV) belongs to the picornaviridae family. Even without a specific treatment for HAV infection, a vaccine has been produced to protect against initial infection. However, only a limited number of countries recommend vaccination because HAV is rarely lethal. The HAV genome comprises a 7.5-kb single-stranded (+) RNA with a 5′-UTR and 3′ poly(A) tail. It encodes for a single polyprotein, which includes the 3C protease. This enzyme is crucial for the virus because it cleaves the polyprotein into several capsid proteins and non-structural proteins. Additionally, the 3C protease binds regulatory structural elements at the 5′-UTR, which control viral genome replication. Thus, the 3C protease is an attractive target for develop a specific antiviral treatment. Recently, hexadeoxyribonucleotides that specifically bind the HAV 3C protease were identified through a hexanucleotide array (149). In vitro experiments showed that the hexanucleotide GGGGGT (G5T) forms a tetramolecular G4, binds the C-terminal domain of HAV protease and is a potent protease inhibitor (150).
Influenza virus (the flu)
Every year in winter, influenza viruses cause seasonal respiratory disease epidemics (151). Three subtypes have been defined (A, B and C), which depend on the surface proteins (hemagglutinin and neuraminidase). Sixteen hemagglutinin (H) and nine neuraminidase (N) variants have been discovered, but only H 1, 2 and 3 as well as N 1 and 2 are commonly found in humans. These viruses belong to the Orthomyxoviridae family, which composes enveloped viruses with a single-stranded negative-sense RNA. The genome is 10 to 15 kb but is segmented into multiple (7 or 8) molecules. Each RNA is 0.9 to 2.3 kb and encodes for 1 or 2 proteins. The non-structural protein 1 (NS1) is a 26 kDa multifunctional protein and participates in protein–protein and protein–RNA interactions (152). During infection, NS1 interferes with cellular mRNA biology (splicing, maturation and translation). As a result, NS1 prevents interferon (IFN) production, which leads to inhibition of the host's innate immunity (153). After 15 cycles of SELEX, which employs a 45-base variable sequence, a G4-forming aptamer was selected that presents high affinity for the NS1 RNA binding domain (Kd ≈ 20 nM) (154). In a cellular context, this G4 can block NS1 and restore INF production, which results in antiviral activity without cellular toxicity.
Severe acute respiratory syndrome coronavirus (SARS-CoV)
Severe acute respiratory syndrome coronavirus (SARS-CoV) is an enveloped virus with a single-stranded RNA genome ∼30 kb long. It encodes 2 poly-cistronic ORFs, which code for 16 non-structural proteins (Nsp). Since the 2003 outbreak, the three-dimensional structures of several key proteins have been determined (e.g. RDRP, 3C-like protease). The non-structural protein 3 (Nsp3) is one component of the viral replicase complex and contains a domain referred to as SARS unique domain (SUD), which interacts with G4s. Thus, G4s are also relevant for the coronavirus and might be involved either in viral replication or host immune evasion (155). On the other hand, using a SELEX approach with oligonucleotides that harbor a 30 nucleotide variable sequence and following 20 rounds of selection, DNA aptamers against the SARS-CoV helicase were isolated (156). These aptamers compose two distinct classes, G4 and non-G4 forming sequences, which were determined through circular dichroism and gel electrophoresis. Their inhibitory effect on viral replication is being studied with encouraging results.
Human immunodeficiency virus
Combinatorial approaches have been used to identify several aptamers that target HIV-1. The first RNA aptamer isolated using the SELEX approach was an RNA pseudoknot inhibitor of HIV-1 RT (157). Using another procedure, the SURF (synthetic unrandomization of randomized fragments) (158), ISIS 5320 was selected and exhibited sub-micromolar inhibition of HIV-1. ISIS 5320 is a phosphorothioate containing DNA octamer (T2G4T2) that forms G4 with anti-HIV properties. More specifically, it inhibited cell-to-cell and virus-to-cell spread of the HIV-1 by interacting with the V3 loop located on the viral glycoprotein gp120 (159–161). Its G4 structure and phosphorothioate backbone were reported as essential to this inhibition. Later studies showed that phosphodiester oligonucleotides containing only G and T inhibit HIV-1 replication and the most potent molecule, GTG2TG3TG3TG3T (T30177, EC50 at ∼100 nM), formed a G4 in vitro (162–164). This oligonucleotide and related molecules, such as T30923 (G3TG3TG3TG3T, Figure 1j), are potent HIV-1 IN inhibitors in vitro. T30177 was the first IN inhibitor tested in clinical trials (Zintevir™ developed by Aronex Pharmaceuticals in 1996) (165). However, the action mechanism in cells is more complex because it also targets viral entry (166).
In attempts to obtain natural-type oligonucleotide, Hotoda et al. identified a hexamer (TG3AG) also targeting HIV-1 entry through gp120 binding (167). This ‘Hotoda's sequence’ adopts a tetramolecular G4 structure and submicromolar HIV-1 inhibition was described for derivatives with 5′-end substitutions. Once more, the antiviral activity of the molecule was directly linked to its capability to form G4s.
Later, SELEX approaches were developed to isolate DNA aptamers with high affinity for the RNase H domain of HIV-1 RT. Thus, the target protein used was either the isolated RNase H domain (p15) or the functional heterodimer p66/p51 with a counter-selection that used the p51/p51 form (truncated for the RNase H domain) (168,169). Interestingly, these selections mainly led to aptamers with G-rich sequences capable of forming G4s. Some, but not all, of these G4 inhibited the RNase H activity of HIV-1 RT in vitro with an IC50 in the nM range (168). Surprisingly, these G4 aptamers (93del and 112del) were also potent IN inhibitors in vitro with IC50 values in the range 10–40 nM (170). This dual inhibition can be explained by the structural similarities between the IN active site and RT RNase H domain (171). The aptamer 93del can form an original dimeric interlocked G4 (Figure 1i), which is stable even at temperatures over 90°C (172). Through comparing the structures of the G4-forming aptamers that inhibit IN, IN inhibition likely requires a stack of 6 G-tetrads (164).
Similar to T30177, 93del is a potent antiviral agent with multimodal inhibition and, in cells, targets the viral entry step, reverse transcription and integration (173). Additional in vitro studies indicated that free 93del and T30923 can enter human cells, including epithelial (HeLa), hepatic (Huh7) and lymphocytes (H9) cells (174). However, striking differences were observed in the presence of viral particles; HIV-1 strongly stimulates cellular uptake of aptamer (174). This latter observation opens an opportunity for specific drug delivery to cells that are infected, which may prevent intracellular side effects from G4 off-targeting.
G4S AS DETECTION TOOLS
Given the remarkable physical properties of G4s, scientists have engineered innovative tools using G4 folding to control ribozyme activity and constructed probes for state-of-the-art imagery as well as quantification techniques, among other approaches. As an example, the aforementioned G4 93del was engineered into aptamer beacons to visualize endogenous protein HIV-1 reverse transcriptase in living cells (175). The following section describes two such innovations related to viruses.
Hepatitis delta virus (HDV)
HDV is a small, enveloped virus that depends on hepatitis B virus (HBV) for propagation (176). Its genome is a circular, single-stranded (−) RNA molecule of 1679 nucleotides. However, due to self-complementarity over ∼70% of its length, it folds into a partially double-stranded molecule (rod-like RNA structure). Because HDV does not encode for a polymerase, replication of its genome relies on cellular enzymes. Upon polymerization following the ‘double rolling circle’ mechanism, genomic and anti-genomic circular RNA is produced with linear polyadenylated mRNA that only codes for the HDV protein, delta antigen (HDAg). However, HDV also encompasses an 85 nucleotide ribozyme with self-cleavage activity. Molecular engineering led to development of a ‘G-quartzyme’, which is a ribozyme controlled by a G4 structure (177,178). Stabilization of the G4 at the 3′ end by monovalent ions (K+) activates the ribozyme.
Cauliflower mosaic virus (CaMV)
Cauliflower mosaic virus (CaMV) is a pararetrovirus that infects plants (179). Its genome is an 8-kb, circular, double-stranded DNA molecule produced through reverse transcription of a pre-genomic mRNA. Therefore, transcription of the genome is a crucial step for two reasons: (i) it is necessary to code for several viral proteins and (ii) it generates 35S RNA that is slightly longer than the genome (terminally redundant), which serves as a matrix to replicate the DNA genome. Because the unique CaMV promoter, the 35S promoter, is a strong and constitutive promoter, it has been widely used to develop genetically modified organisms (GMOs). Consequently, there is a growing need to detect GMOs in the food industry, and many methods have been developed based on ELISA, polymerase chain reaction and other techniques. Notably, a fluorescent assay has been engineered based on G4 formation. It involves two specific primers that recognize the 35S promoter. Each primer is linked to a double repeat of the human telomeric sequence TTAGGG. As a consequence, binding the two primers facilitates inter-strand quadruplex formation. Upon adding berberine (a selective G4-binder), a strong fluorescent signal is produced (180), which facilitates promoter detection.
CONCLUSION
The X-ray structure of G-tetrads was determined in the early 60s; in the 90s, G4s remained an intriguing deviation from the Watson–Crick canonical structure. However, a remarkable quantity of information has been obtained over the past two decades, which has allowed this field to grow from basic science to clinical application (181,182). Thus, the presence of G4s in telomeres and oncogenic promoters has opened broad opportunities for understanding and generating new treatments against cancer.
The viral world is extensive and includes viruses with replication schemes based on RNA (e.g., flaviviruses), DNA (e.g., papillomavirus) or intermediates thereof (e.g., retroviruses) and G4s are part of their life cycle. First, they must address the presence of G4s in their hosts, and second, they must also contain G4-forming sequences that regulate important replication steps. Several challenges remain to better define the features that provide specific G4 motifs with the ability to function as structural elements. Nevertheless, G4 sequences and G4-binders have been identified in some of the most pathogenic viruses; thus, they are attractive targets for controlling viral infections.
ACCESSION NUMBERS
PDB IDs: 2KF8, 2LPW, 2M4P, 1Y8D and 2LE6.
FUNDING
Agence Nationale de Recherche sur le SIDA et les Hépatites virales (ANRS) [AO 2013 to M.M. and AO 2012 to S.A.]; National Center for Scientific Research; University of Bordeaux. Funding for open access charge: ANRS [AO 2013].
This survey is dedicated to the memory of Dr Wilhelm Guschlbauer, a dear friend, distinguished biophysicist and pioneer in the field of G-quadruplexes, who passed away on 01 December 2012.
Conflict of interest statement. None declared.
REFERENCES
- 1.Bang I. Untersuchungen über die Guanylsäure. Biochem. Z. 1910;26:293–311. [Google Scholar]
- 2.Gellert M., Lipsett M.N., Davies D.R. Helix formation by guanylic acid. Proc. Natl. Acad. Sci. U.S.A. 1962;48:2013–2018. doi: 10.1073/pnas.48.12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Sasisekharan V., Zimmerman S., Davies D.R. The structure of helical 5′-guanosine monophosphate. J. Mol. Biol. 1975;92:171–179. doi: 10.1016/0022-2836(75)90221-1. [DOI] [PubMed] [Google Scholar]
- 4.Howard F.B., Miles H.T. Poly(inosinic acid) helices: essential chelation of alkali metal ions in the axial channel. Biochemistry. 1982;21:6736–6745. doi: 10.1021/bi00269a019. [DOI] [PubMed] [Google Scholar]
- 5.Williamson J.R. Guanine quartets. Curr. Opin. Struct. Biol. 1993;3:357–362. [Google Scholar]
- 6.Sundquist W.I., Klug A. Telomeric DNA dimerizes by formation of guanine tetrads between hairpin loops. Nature. 1989;342:825–829. doi: 10.1038/342825a0. [DOI] [PubMed] [Google Scholar]
- 7.Miura T., Benevides J.M., Thomas G.J., Jr A phase diagram for sodium and potassium ion control of polymorphism in telomeric DNA. J. Mol. Biol. 1995;248:233–238. doi: 10.1016/s0022-2836(95)80046-8. [DOI] [PubMed] [Google Scholar]
- 8.Bouaziz S., Kettani A., Patel D.J. A K cation-induced conformational switch within a loop spanning segment of a DNA quadruplex containing G-G-G-C repeats. J. Mol. Biol. 1998;282:637–652. doi: 10.1006/jmbi.1998.2031. [DOI] [PubMed] [Google Scholar]
- 9.Chantot J., Guschlbauer W. Physicochemical properties of nucleosides 3. Gel formation by 8-bromoguanosine. FEBS Lett. 1969;4:173–176. doi: 10.1016/0014-5793(69)80227-9. [DOI] [PubMed] [Google Scholar]
- 10.Chantot J.F., Sarocchi M.T., Guschlbauer W. Physico-chemical properties of nucleosides. 4. Gel formation by quanosine and its analogues. Biochimie. 1971;53:347–354. doi: 10.1016/s0300-9084(71)80101-3. [DOI] [PubMed] [Google Scholar]
- 11.Tougard P., Chantot J.F., Guschlbauer W. Nucleoside conformations. X. An X-ray fiber diffraction study of the gels of guanine nucleosides. Biochim. Biophys. Acta. 1973;308:9–16. [PubMed] [Google Scholar]
- 12.Delabar J.M., Guschlbauer W. Nucleoside conformations. XI. Solvent effects on optical properties of guanosine and its derivatives in dilute solutions. J. Am. Chem. Soc. 1973;95:5729–5735. doi: 10.1021/ja00798a047. [DOI] [PubMed] [Google Scholar]
- 13.Leicknam J.P., Chauvelier C., Chantot J.F., Guschlbauer W. Nucleoside conformations. 12. An infrared study of the polymorphism of guanine nucleosides in the solid state. Biophys. Chem. 1973;1:134–139. doi: 10.1016/0301-4622(73)80011-0. [DOI] [PubMed] [Google Scholar]
- 14.Chantot J.F., Haertle T., Guschlbauer W. Nucleoside conformations. XIII. Circular dichroism of guanosine gels and the conformation of GpG and poly (G) Biochimie. 1974;56:501–507. doi: 10.1016/s0300-9084(74)80065-9. [DOI] [PubMed] [Google Scholar]
- 15.Guschlbauer W., Chantot J.F., Thiele D. Four-stranded nucleic acid structures 25 years later: from guanosine gels to telomer DNA. J. Biomol. Struct. Dyn. 1990;8:491–511. doi: 10.1080/07391102.1990.10507825. [DOI] [PubMed] [Google Scholar]
- 16.Williamson J.R. G-quartet structures in telomeric DNA. Annu. Rev. Biophys. Biomol. Struct. 1994;23:703–730. doi: 10.1146/annurev.bb.23.060194.003415. [DOI] [PubMed] [Google Scholar]
- 17.Gottarelli G., Spada G.P., Garbesi A. In: Comprehensive Supramolecular Chemistry. Lehn J-M, Atwood JL, MacNicol DD, Davies JAD, Vögtle F, Sauvage J-P, Hosseini MW, editors. Vol. 9. Oxford: Pergamon; 1996. pp. 483–506. [Google Scholar]
- 18.Gilbert D.E., Feigon J. Multistranded DNA structures. Curr. Opin. Struct. Biol. 1999;9:305–314. doi: 10.1016/S0959-440X(99)80041-4. [DOI] [PubMed] [Google Scholar]
- 19.Keniry M.A. Quadruplex structures in nucleic acids. Biopolymers. 2000;56:123–146. doi: 10.1002/1097-0282(2000/2001)56:3<123::AID-BIP10010>3.0.CO;2-3. [DOI] [PubMed] [Google Scholar]
- 20.Hardin C.C., Perry A.G., White K. Thermodynamic and kinetic characterization of the dissociation and assembly of quadruplex nucleic acids. Biopolymers. 2000;56:147–194. doi: 10.1002/1097-0282(2000/2001)56:3<147::AID-BIP10011>3.0.CO;2-N. [DOI] [PubMed] [Google Scholar]
- 21.Shafer R.H., Smirnov I. Biological aspects of DNA/RNA quadruplexes. Biopolymers. 2000;56:209–227. doi: 10.1002/1097-0282(2000/2001)56:3<209::AID-BIP10018>3.0.CO;2-Y. [DOI] [PubMed] [Google Scholar]
- 22.Suhnel J. Beyond nucleic acid base pairs: from triads to heptads. Biopolymers. 2001;61:32–51. doi: 10.1002/1097-0282(2001)61:1<32::AID-BIP10063>3.0.CO;2-B. [DOI] [PubMed] [Google Scholar]
- 23.Simonsson T. G-quadruplex DNA structures–variations on a theme. Biol. Chem. 2001;382:621–628. doi: 10.1515/BC.2001.073. [DOI] [PubMed] [Google Scholar]
- 24.Rawal P., Kummarasetti V.B., Ravindran J., Kumar N., Halder K., Sharma R., Mukerji M., Das S.K., Chowdhury S. Genome-wide prediction of G4 DNA as regulatory motifs: role in Escherichia coli global regulation. Genome Res. 2006;16:644–655. doi: 10.1101/gr.4508806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kikin O., D'Antonio L., Bagga P.S. QGRS Mapper: a web-based server for predicting G-quadruplexes in nucleotide sequences. Nucleic Acids Res. 2006;34:W676–W682. doi: 10.1093/nar/gkl253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kostadinov R., Malhotra N., Viotti M., Shine R., D'Antonio L., Bagga P. GRSDB: a database of quadruplex forming G-rich sequences in alternatively processed mammalian pre-mRNA sequences. Nucleic Acids Res. 2006;34:D119–D124. doi: 10.1093/nar/gkj073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Yadav V.K., Abraham J.K., Mani P., Kulshrestha R., Chowdhury S. QuadBase: genome-wide database of G4 DNA–occurrence and conservation in human, chimpanzee, mouse and rat promoters and 146 microbes. Nucleic Acids Res. 2008;36:D381–D385. doi: 10.1093/nar/gkm781. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Zhao Y., Du Z., Li N. Extensive selection for the enrichment of G4 DNA motifs in transcriptional regulatory regions of warm blooded animals. FEBS Lett. 2007;581:1951–1956. doi: 10.1016/j.febslet.2007.04.017. [DOI] [PubMed] [Google Scholar]
- 29.Liu Z., Gilbert W. The yeast KEM1 gene encodes a nuclease specific for G4 tetraplex DNA: implication of in vivo functions for this novel DNA structure. Cell. 1994;77:1083–1092. doi: 10.1016/0092-8674(94)90447-2. [DOI] [PubMed] [Google Scholar]
- 30.Sun H., Yabuki A., Maizels N. A human nuclease specific for G4 DNA. Proc. Natl. Acad. Sci. U.S.A. 2001;98:12444–12449. doi: 10.1073/pnas.231479198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Creacy S.D., Routh E.D., Iwamoto F., Nagamine Y., Akman S.A., Vaughn J.P. G4 resolvase 1 binds both DNA and RNA tetramolecular quadruplex with high affinity and is the major source of tetramolecular quadruplex G4-DNA and G4-RNA resolving activity in HeLa cell lysates. J. Biol. Chem. 2008;283:34626–34634. doi: 10.1074/jbc.M806277200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Vaughn J.P., Creacy S.D., Routh E.D., Joyner-Butt C., Jenkins G.S., Pauli S., Nagamine Y., Akman S.A. The DEXH protein product of the DHX36 gene is the major source of tetramolecular quadruplex G4-DNA resolving activity in HeLa cell lysates. J. Biol. Chem. 2005;280:38117–38120. doi: 10.1074/jbc.C500348200. [DOI] [PubMed] [Google Scholar]
- 33.Fernando H., Rodriguez R., Balasubramanian S. Selective recognition of a DNA G-quadruplex by an engineered antibody. Biochemistry. 2008;47:9365–9371. doi: 10.1021/bi800983u. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Henderson A., Wu Y., Huang Y.C., Chavez E.A., Platt J., Johnson F.B., Brosh R.M., Jr, Sen D., Lansdorp P.M. Detection of G-quadruplex DNA in mammalian cells. Nucleic Acids Res. 2014;42:860–869. doi: 10.1093/nar/gkt957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Schaffitzel C., Berger I., Postberg J., Hanes J., Lipps H.J., Pluckthun A. In vitro generated antibodies specific for telomeric guanine-quadruplex DNA react with Stylonychia lemnae macronuclei. Proc. Natl. Acad. Sci. U.S.A. 2001;98:8572–8577. doi: 10.1073/pnas.141229498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Biffi G., Di Antonio M., Tannahill D., Balasubramanian S. Visualization and selective chemical targeting of RNA G-quadruplex structures in the cytoplasm of human cells. Nat. Chem. 2014;6:75–80. doi: 10.1038/nchem.1805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Biffi G., Tannahill D., McCafferty J., Balasubramanian S. Quantitative visualization of DNA G-quadruplex structures in human cells. Nat. Chem. 2013;5:182–186. doi: 10.1038/nchem.1548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Rizzo A., Salvati E., Porru M., D'Angelo C., Stevens M.F., D'Incalci M., Leonetti C., Gilson E., Zupi G., Biroccio A. Stabilization of quadruplex DNA perturbs telomere replication leading to the activation of an ATR-dependent ATM signaling pathway. Nucleic Acids Res. 2009;37:5353–5364. doi: 10.1093/nar/gkp582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Rodriguez R., Miller K.M., Forment J.V., Bradshaw C.R., Nikan M., Britton S., Oelschlaegel T., Xhemalce B., Balasubramanian S., Jackson S.P. Small-molecule-induced DNA damage identifies alternative DNA structures in human genes. Nat. Chem. Biol. 2012;8:301–310. doi: 10.1038/nchembio.780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Shin-ya K., Wierzba K., Matsuo K., Ohtani T., Yamada Y., Furihata K., Hayakawa Y., Seto H. Telomestatin, a novel telomerase inhibitor from Streptomyces anulatus. J. Am. Chem. Soc. 2001;123:1262–1263. doi: 10.1021/ja005780q. [DOI] [PubMed] [Google Scholar]
- 41.Tauchi T., Shin-ya K., Sashida G., Sumi M., Okabe S., Ohyashiki J.H., Ohyashiki K. Telomerase inhibition with a novel G-quadruplex-interactive agent, telomestatin: in vitro and in vivo studies in acute leukemia. Oncogene. 2006;25:5719–5725. doi: 10.1038/sj.onc.1209577. [DOI] [PubMed] [Google Scholar]
- 42.Huppert J.L., Balasubramanian S. Prevalence of quadruplexes in the human genome. Nucleic Acids Res. 2005;33:2908–2916. doi: 10.1093/nar/gki609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Yang D., Okamoto K. Structural insights into G-quadruplexes: towards new anticancer drugs. Future Med. Chem. 2010;2:619–646. doi: 10.4155/fmc.09.172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Murat P., Balasubramanian S. Existence and consequences of G-quadruplex structures in DNA. Curr. Opin. Genet. Dev. 2014;25:22–29. doi: 10.1016/j.gde.2013.10.012. [DOI] [PubMed] [Google Scholar]
- 45.O'Sullivan R.J., Karlseder J. Telomeres: protecting chromosomes against genome instability. Nat. Rev. Mol. Cell Biol. 2010;11:171–181. doi: 10.1038/nrm2848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Mitchell J.R., Wood E., Collins K. A telomerase component is defective in the human disease dyskeratosis congenita. Nature. 1999;402:551–555. doi: 10.1038/990141. [DOI] [PubMed] [Google Scholar]
- 47.Angrisani A., Vicidomini R., Turano M., Furia M. Human dyskerin: beyond telomeres. Biol. Chem. 2014;395:593–610. doi: 10.1515/hsz-2013-0287. [DOI] [PubMed] [Google Scholar]
- 48.Klobutcher L.A., Swanton M.T., Donini P., Prescott D.M. All gene-sized DNA molecules in four species of hypotrichs have the same terminal sequence and an unusual 3′ terminus. Proc. Natl. Acad. Sci. U.S.A. 1981;78:3015–3019. doi: 10.1073/pnas.78.5.3015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Blackburn E.H., Karrer K.M. Genomic reorganization in ciliated protozoans. Annu. Rev. Genet. 1986;20:501–521. doi: 10.1146/annurev.ge.20.120186.002441. [DOI] [PubMed] [Google Scholar]
- 50.Henderson E., Hardin C.C., Walk S.K., Tinoco I., Jr, Blackburn E.H. Telomeric DNA oligonucleotides form novel intramolecular structures containing guanine-guanine base pairs. Cell. 1987;51:899–908. doi: 10.1016/0092-8674(87)90577-0. [DOI] [PubMed] [Google Scholar]
- 51.Henderson E.R., Blackburn E.H. An overhanging 3′ terminus is a conserved feature of telomeres. Mol. Cell. Biol. 1989;9:345–348. doi: 10.1128/mcb.9.1.345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Williamson J.R., Raghuraman M.K., Cech T.R. Monovalent cation-induced structure of telomeric DNA: the G-quartet model. Cell. 1989;59:871–880. doi: 10.1016/0092-8674(89)90610-7. [DOI] [PubMed] [Google Scholar]
- 53.Masuda-Sasa T., Polaczek P., Peng X.P., Chen L., Campbell J.L. Processing of G4 DNA by Dna2 helicase/nuclease and replication protein A (RPA) provides insights into the mechanism of Dna2/RPA substrate recognition. J. Biol. Chem. 2008;283:24359–24373. doi: 10.1074/jbc.M802244200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Lin W., Sampathi S., Dai H., Liu C., Zhou M., Hu J., Huang Q., Campbell J., Shin-Ya K., Zheng L., et al. Mammalian DNA2 helicase/nuclease cleaves G-quadruplex DNA and is required for telomere integrity. EMBO J. 2013;32:1425–1439. doi: 10.1038/emboj.2013.88. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Prescott J.C., Blackburn E.H. Telomerase: Dr Jekyll or Mr Hyde. Curr. Opin. Genet. Dev. 1999;9:368–373. doi: 10.1016/s0959-437x(99)80055-0. [DOI] [PubMed] [Google Scholar]
- 56.Zhu J., Wang H., Bishop J.M., Blackburn E.H. Telomerase extends the lifespan of virus-transformed human cells without net telomere lengthening. Proc. Natl. Acad. Sci. U.S.A. 1999;96:3723–3728. doi: 10.1073/pnas.96.7.3723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Wang Q., Liu J.Q., Chen Z., Zheng K.W., Chen C.Y., Hao Y.H., Tan Z. G-quadruplex formation at the 3′ end of telomere DNA inhibits its extension by telomerase, polymerase and unwinding by helicase. Nucleic Acids Res. 2011;39:6229–6237. doi: 10.1093/nar/gkr164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Redman J.E., Ladame S., Reszka A.P., Neidle S., Balasubramanian S. Discovery of G-quadruplex stabilizing ligands through direct ELISA of a one-bead-one-compound library. Org. Biomol. Chem. 2006;4:4364–4369. doi: 10.1039/b611716c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Jain A.K., Bhattacharya S. Recent developments in the chemistry and biology of G-quadruplexes with reference to the DNA groove binders. Curr. Pharm. Des. 2012;18:1917–1933. doi: 10.2174/138161212799958396. [DOI] [PubMed] [Google Scholar]
- 60.Biffi G., Tannahill D., Balasubramanian S. An intramolecular G-quadruplex structure is required for binding of telomeric repeat-containing RNA to the telomeric protein TRF2. J. Am. Chem. Soc. 2012;134:11974–11976. doi: 10.1021/ja305734x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Kar R.K., Suryadevara P., Jana J., Bhunia A., Chatterjee S. Novel G-quadruplex stabilizing agents: in-silico approach and dynamics. J. Biomol. Struct. Dyn. 2013;31:1497–1518. doi: 10.1080/07391102.2012.742246. [DOI] [PubMed] [Google Scholar]
- 62.Tsai Y.C., Qi H., Lin C.P., Lin R.K., Kerrigan J.E., Rzuczek S.G., LaVoie E.J., Rice J.E., Pilch D.S., Lyu Y.L., et al. A G-quadruplex stabilizer induces M-phase cell cycle arrest. J. Biol. Chem. 2009;284:22535–22543. doi: 10.1074/jbc.M109.020230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.De Cian A., Cristofari G., Reichenbach P., De Lemos E., Monchaud D., Teulade-Fichou M.P., Shin-Ya K., Lacroix L., Lingner J., Mergny J.L. Reevaluation of telomerase inhibition by quadruplex ligands and their mechanisms of action. Proc. Natl. Acad. Sci. U.S.A. 2007;104:17347–17352. doi: 10.1073/pnas.0707365104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Miyazaki T., Pan Y., Joshi K., Purohit D., Hu B., Demir H., Mazumder S., Okabe S., Yamori T., Viapiano M., et al. Telomestatin impairs glioma stem cell survival and growth through the disruption of telomeric G-quadruplex and inhibition of the proto-oncogene, c-Myb. Clin. Cancer Res. 2012;18:1268–1280. doi: 10.1158/1078-0432.CCR-11-1795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Aladjem M.I. Replication in context: dynamic regulation of DNA replication patterns in metazoans. Nat. Rev. Genet. 2007;8:588–600. doi: 10.1038/nrg2143. [DOI] [PubMed] [Google Scholar]
- 66.Cayrou C., Coulombe P., Puy A., Rialle S., Kaplan N., Segal E., Mechali M. New insights into replication origin characteristics in metazoans. Cell Cycle. 2012;11:658–667. doi: 10.4161/cc.11.4.19097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Besnard E., Babled A., Lapasset L., Milhavet O., Parrinello H., Dantec C., Marin J.M., Lemaitre J.M. Unraveling cell type-specific and reprogrammable human replication origin signatures associated with G-quadruplex consensus motifs. Nat. Struct. Mol. Biol. 2012;19:837–844. doi: 10.1038/nsmb.2339. [DOI] [PubMed] [Google Scholar]
- 68.Valton A.L., Hassan-Zadeh V., Lema I., Boggetto N., Alberti P., Saintome C., Riou J.F., Prioleau M.N. G4 motifs affect origin positioning and efficiency in two vertebrate replicators. EMBO J. 2014;33:732–746. doi: 10.1002/embj.201387506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Piazza A., Serero A., Boule J.B., Legoix-Ne P., Lopes J., Nicolas A. Stimulation of gross chromosomal rearrangements by the human CEB1 and CEB25 minisatellites in Saccharomyces cerevisiae depends on G-quadruplexes or Cdc13. PLoS Genet. 2012;8:e1003033. doi: 10.1371/journal.pgen.1003033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Brooks T.A., Kendrick S., Hurley L. Making sense of G-quadruplex and i-motif functions in oncogene promoters. FEBS J. 2010;277:3459–3469. doi: 10.1111/j.1742-4658.2010.07759.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Simonsson T., Pecinka P., Kubista M. DNA tetraplex formation in the control region of c-myc. Nucleic Acids Res. 1998;26:1167–1172. doi: 10.1093/nar/26.5.1167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Grand C.L., Han H., Munoz R.M., Weitman S., Von Hoff D.D., Hurley L.H., Bearss D.J. The cationic porphyrin TMPyP4 down-regulates c-MYC and human telomerase reverse transcriptase expression and inhibits tumor growth in vivo. Mol. Cancer Ther. 2002;1:565–573. [PubMed] [Google Scholar]
- 73.Siddiqui-Jain A., Grand C.L., Bearss D.J., Hurley L.H. Direct evidence for a G-quadruplex in a promoter region and its targeting with a small molecule to repress c-MYC transcription. Proc. Natl. Acad. Sci. U.S.A. 2002;99:11593–11598. doi: 10.1073/pnas.182256799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Seenisamy J., Rezler E.M., Powell T.J., Tye D., Gokhale V., Joshi C.S., Siddiqui-Jain A., Hurley L.H. The dynamic character of the G-quadruplex element in the c-MYC promoter and modification by TMPyP4. J. Am. Chem. Soc. 2004;126:8702–8709. doi: 10.1021/ja040022b. [DOI] [PubMed] [Google Scholar]
- 75.Phan A.T., Modi Y.S., Patel D.J. Propeller-type parallel-stranded G-quadruplexes in the human c-myc promoter. J. Am. Chem. Soc. 2004;126:8710–8716. doi: 10.1021/ja048805k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Davis T.L., Firulli A.B., Kinniburgh A.J. Ribonucleoprotein and protein factors bind to an H-DNA-forming c-myc DNA element: possible regulators of the c-myc gene. Proc. Natl. Acad. Sci. U.S.A. 1989;86:9682–9686. doi: 10.1073/pnas.86.24.9682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Berberich S.J., Postel E.H. PuF/NM23-H2/NDPK-B transactivates a human c-myc promoter-CAT gene via a functional nuclease hypersensitive element. Oncogene. 1995;10:2343–2347. [PubMed] [Google Scholar]
- 78.Gonzalez V., Guo K., Hurley L., Sun D. Identification and characterization of nucleolin as a c-myc G-quadruplex-binding protein. J. Biol. Chem. 2009;284:23622–23635. doi: 10.1074/jbc.M109.018028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Kang H.J., Le T.V., Kim K., Hur J., Kim K.K., Park H.J. Novel interaction of the Z-DNA binding domain of human ADAR1 with the oncogenic c-Myc promoter G-quadruplex. J. Mol. Biol. 2014;426:2594–2604. doi: 10.1016/j.jmb.2014.05.001. [DOI] [PubMed] [Google Scholar]
- 80.Jordano J., Perucho M. Chromatin structure of the promoter region of the human c-K-ras gene. Nucleic Acids Res. 1986;14:7361–7378. doi: 10.1093/nar/14.18.7361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Cogoi S., Quadrifoglio F., Xodo L.E. G-rich oligonucleotide inhibits the binding of a nuclear protein to the Ki-ras promoter and strongly reduces cell growth in human carcinoma pancreatic cells. Biochemistry. 2004;43:2512–2523. doi: 10.1021/bi035754f. [DOI] [PubMed] [Google Scholar]
- 82.Guo K., Pourpak A., Beetz-Rogers K., Gokhale V., Sun D., Hurley L.H. Formation of pseudosymmetrical G-quadruplex and i-motif structures in the proximal promoter region of the RET oncogene. J. Am. Chem. Soc. 2007;129:10220–10228. doi: 10.1021/ja072185g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Kumari S., Bugaut A., Huppert J.L., Balasubramanian S. An RNA G-quadruplex in the 5′ UTR of the NRAS proto-oncogene modulates translation. Nat. Chem. Biol. 2007;3:218–221. doi: 10.1038/nchembio864. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Morris M.J., Basu S. An unusually stable G-quadruplex within the 5′-UTR of the MT3 matrix metalloproteinase mRNA represses translation in eukaryotic cells. Biochemistry. 2009;48:5313–5319. doi: 10.1021/bi900498z. [DOI] [PubMed] [Google Scholar]
- 85.Gomez D., Guedin A., Mergny J.L., Salles B., Riou J.F., Teulade-Fichou M.P., Calsou P. A G-quadruplex structure within the 5′-UTR of TRF2 mRNA represses translation in human cells. Nucleic Acids Res. 2010;38:7187–7198. doi: 10.1093/nar/gkq563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Beaudoin J.D., Perreault J.P. 5′-UTR G-quadruplex structures acting as translational repressors. Nucleic Acids Res. 2010;38:7022–7036. doi: 10.1093/nar/gkq557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Bugaut A., Balasubramanian S. 5′-UTR RNA G-quadruplexes: translation regulation and targeting. Nucleic Acids Res. 2012;40:4727–4741. doi: 10.1093/nar/gks068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Yu C.H., Teulade-Fichou M.P., Olsthoorn R.C. Stimulation of ribosomal frameshifting by RNA G-quadruplex structures. Nucleic Acids Res. 2014;42:1887–1892. doi: 10.1093/nar/gkt1022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Horsburgh B.C., Kollmus H., Hauser H., Coen D.M. Translational recoding induced by G-rich mRNA sequences that form unusual structures. Cell. 1996;86:949–959. doi: 10.1016/S0092-8674(00)80170-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Lopez Castel A., Cleary J.D., Pearson C.E. Repeat instability as the basis for human diseases and as a potential target for therapy. Nat. Rev. Mol. Cell Biol. 2010;11:165–170. doi: 10.1038/nrm2854. [DOI] [PubMed] [Google Scholar]
- 91.Saha T., Usdin K. Tetraplex formation by the progressive myoclonus epilepsy type-1 repeat: implications for instability in the repeat expansion diseases. FEBS Lett. 2001;491:184–187. doi: 10.1016/s0014-5793(01)02190-1. [DOI] [PubMed] [Google Scholar]
- 92.Larson E.D., Duquette M.L., Cummings W.J., Streiff R.J., Maizels N. MutSalpha binds to and promotes synapsis of transcriptionally activated immunoglobulin switch regions. Curr. Biol. 2005;15:470–474. doi: 10.1016/j.cub.2004.12.077. [DOI] [PubMed] [Google Scholar]
- 93.Smargiasso N., Gabelica V., Damblon C., Rosu F., De Pauw E., Teulade-Fichou M.P., Rowe J.A., Claessens A. Putative DNA G-quadruplex formation within the promoters of Plasmodium falciparum var genes. BMC Genomics. 2009;10:362. doi: 10.1186/1471-2164-10-362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Kuryavyi V., Cahoon L.A., Seifert H.S., Patel D.J. RecA-binding pilE G4 sequence essential for pilin antigenic variation forms monomeric and 5′ end-stacked dimeric parallel G-quadruplexes. Structure. 2012;20:2090–2102. doi: 10.1016/j.str.2012.09.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Maizels N., Gray L.T. The G4 genome. PLoS Genet. 2013;9:e1003468. doi: 10.1371/journal.pgen.1003468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Metifiot M., Marchand C., Pommier Y. HIV integrase inhibitors: 20-year landmark and challenges. Adv. Pharmacol. 2013;67:75–105. doi: 10.1016/B978-0-12-405880-4.00003-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Jouvenet N., Laine S., Pessel-Vivares L., Mougel M. Cell biology of retroviral RNA packaging. RNA Biol. 2011;8:572–580. doi: 10.4161/rna.8.4.16030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Hu W.S., Temin H.M. Retroviral recombination and reverse transcription. Science. 1990;250:1227–1233. doi: 10.1126/science.1700865. [DOI] [PubMed] [Google Scholar]
- 99.Charpentier C., Nora T., Tenaillon O., Clavel F., Hance A.J. Extensive recombination among human immunodeficiency virus type 1 quasispecies makes an important contribution to viral diversity in individual patients. J. Virol. 2006;80:2472–2482. doi: 10.1128/JVI.80.5.2472-2482.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Ly H., Nierlich D.P., Olsen J.C., Kaplan A.H. Moloney murine sarcoma virus genomic RNAs dimerize via a two-step process: a concentration-dependent kissing-loop interaction is driven by initial contact between consecutive guanines. J. Virol. 1999;73:7255–7261. doi: 10.1128/jvi.73.9.7255-7261.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Ly H., Nierlich D.P., Olsen J.C., Kaplan A.H. Functional characterization of the dimer linkage structure RNA of Moloney murine sarcoma virus. J. Virol. 2000;74:9937–9945. doi: 10.1128/jvi.74.21.9937-9945.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Paillart J.C., Skripkin E., Ehresmann B., Ehresmann C., Marquet R. A loop-loop ‘kissing’ complex is the essential part of the dimer linkage of genomic HIV-1 RNA. Proc. Natl. Acad. Sci. U.S.A. 1996;93:5572–5577. doi: 10.1073/pnas.93.11.5572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Sundquist W.I., Heaphy S. Evidence for interstrand quadruplex formation in the dimerization of human immunodeficiency virus 1 genomic RNA. Proc. Natl. Acad. Sci. U.S.A. 1993;90:3393–3397. doi: 10.1073/pnas.90.8.3393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Awang G., Sen D. Mode of dimerization of HIV-1 genomic RNA. Biochemistry. 1993;32:11453–11457. doi: 10.1021/bi00093a024. [DOI] [PubMed] [Google Scholar]
- 105.Marquet R., Paillart J.C., Skripkin E., Ehresmann C., Ehresmann B. Dimerization of human immunodeficiency virus type 1 RNA involves sequences located upstream of the splice donor site. Nucleic Acids Res. 1994;22:145–151. doi: 10.1093/nar/22.2.145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Shen W., Gorelick R.J., Bambara R.A. HIV-1 nucleocapsid protein increases strand transfer recombination by promoting dimeric G-quartet formation. J. Biol. Chem. 2011;286:29838–29847. doi: 10.1074/jbc.M111.262352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Piekna-Przybylska D., Sharma G., Bambara R.A. Mechanism of HIV-1 RNA dimerization in the central region of the genome and significance for viral evolution. J. Biol. Chem. 2013;288:24140–24150. doi: 10.1074/jbc.M113.477265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Zennou V., Petit C., Guetard D., Nerhbass U., Montagnier L., Charneau P. HIV-1 genome nuclear import is mediated by a central DNA flap. Cell. 2000;101:173–185. doi: 10.1016/S0092-8674(00)80828-4. [DOI] [PubMed] [Google Scholar]
- 109.Riviere L., Darlix J.L., Cimarelli A. Analysis of the viral elements required in the nuclear import of HIV-1 DNA. J. Virol. 2010;84:729–739. doi: 10.1128/JVI.01952-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Darlix J.L., Lapadat-Tapolsky M., de Rocquigny H., Roques B.P. First glimpses at structure-function relationships of the nucleocapsid protein of retroviruses. J. Mol. Biol. 1995;254:523–537. doi: 10.1006/jmbi.1995.0635. [DOI] [PubMed] [Google Scholar]
- 111.Godet J., Boudier C., Humbert N., Ivanyi-Nagy R., Darlix J.L., Mely Y. Comparative nucleic acid chaperone properties of the nucleocapsid protein NCp7 and Tat protein of HIV-1. Virus Res. 2012;169:349–360. doi: 10.1016/j.virusres.2012.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Lyonnais S., Gorelick R.J., Mergny J.L., Le Cam E., Mirambeau G. G-quartets direct assembly of HIV-1 nucleocapsid protein along single-stranded DNA. Nucleic Acids Res. 2003;31:5754–5763. doi: 10.1093/nar/gkg716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Kankia B.I., Barany G., Musier-Forsyth K. Unfolding of DNA quadruplexes induced by HIV-1 nucleocapsid protein. Nucleic Acids Res. 2005;33:4395–4403. doi: 10.1093/nar/gki741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Rajendran A., Endo M., Hidaka K., Tran P.L., Mergny J.L., Gorelick R.J., Sugiyama H. HIV-1 nucleocapsid proteins as molecular chaperones for tetramolecular antiparallel G-quadruplex formation. J. Am. Chem. Soc. 2013;135:18575–18585. doi: 10.1021/ja409085j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Kuryavyi V., Phan A.T., Patel D.J. Solution structures of all parallel-stranded monomeric and dimeric G-quadruplex scaffolds of the human c-kit2 promoter. Nucleic Acids Res. 2010;38:6757–6773. doi: 10.1093/nar/gkq558. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Perrone R., Nadai M., Frasson I., Poe J.A., Butovskaya E., Smithgall T.E., Palumbo M., Palu G., Richter S.N. A dynamic G-quadruplex region regulates the HIV-1 long terminal repeat promoter. J. Med. Chem. 2013;56:6521–6530. doi: 10.1021/jm400914r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Amrane S., Kerkour A., Bedrat A., Vialet B., Andreola M.L., Mergny J.L. Topology of a DNA G-quadruplex structure formed in the HIV-1 promoter: a potential target for anti-HIV drug development. J. Am. Chem. Soc. 2014;136:5249–5252. doi: 10.1021/ja501500c. [DOI] [PubMed] [Google Scholar]
- 118.Piekna-Przybylska D., Sullivan M.A., Sharma G., Bambara R.A. U3 Region in the HIV-1 genome adopts a G-quadruplex structure in its RNA and DNA sequence. Biochemistry. 2014;53:2581–2593. doi: 10.1021/bi4016692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Perrone R., Nadai M., Poe J.A., Frasson I., Palumbo M., Palu G., Smithgall T.E., Richter S.N. Formation of a unique cluster of G-quadruplex structures in the HIV-1 Nef coding region: implications for antiviral activity. PloS One. 2013;8:e73121. doi: 10.1371/journal.pone.0073121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Patel P.K., Bhavesh N.S., Hosur R.V. NMR observation of a novel C-tetrad in the structure of the SV40 repeat sequence GGGCGG. Biochem. Biophys. Res. Commun. 2000;270:967–971. doi: 10.1006/bbrc.2000.2479. [DOI] [PubMed] [Google Scholar]
- 121.Topalis D., Andrei G., Snoeck R. The large tumor antigen: a ‘Swiss Army knife’ protein possessing the functions required for the polyomavirus life cycle. Antiviral Res. 2013;97:122–136. doi: 10.1016/j.antiviral.2012.11.007. [DOI] [PubMed] [Google Scholar]
- 122.Baran N., Pucshansky L., Marco Y., Benjamin S., Manor H. The SV40 large T-antigen helicase can unwind four stranded DNA structures linked by G-quartets. Nucleic Acids Res. 1997;25:297–303. doi: 10.1093/nar/25.2.297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Plyler J., Jasheway K., Tuesuwan B., Karr J., Brennan J.S., Kerwin S.M., David W.M. Real-time investigation of SV40 large T-antigen helicase activity using surface plasmon resonance. Cell Biochem. Biophys. 2009;53:43–52. doi: 10.1007/s12013-008-9038-z. [DOI] [PubMed] [Google Scholar]
- 124.Tuesuwan B., Kern J.T., Thomas P.W., Rodriguez M., Li J., David W.M., Kerwin S.M. Simian virus 40 large T-antigen G-quadruplex DNA helicase inhibition by G-quadruplex DNA-interactive agents. Biochemistry. 2008;47:1896–1909. doi: 10.1021/bi701747d. [DOI] [PubMed] [Google Scholar]
- 125.Sharma S., Doherty K.M., Brosh R.M., Jr DNA helicases as targets for anti-cancer drugs. Curr. Med. Chem. Anticancer Agents. 2005;5:183–199. doi: 10.2174/1568011053765985. [DOI] [PubMed] [Google Scholar]
- 126.Sun H., Karow J.K., Hickson I.D., Maizels N. The Bloom's syndrome helicase unwinds G4 DNA. J. Biol. Chem. 1998;273:27587–27592. doi: 10.1074/jbc.273.42.27587. [DOI] [PubMed] [Google Scholar]
- 127.Akagi K., Li J., Broutian T.R., Padilla-Nash H., Xiao W., Jiang B., Rocco J.W., Teknos T.N., Kumar B., Wangsa D., et al. Genome-wide analysis of HPV integration in human cancers reveals recurrent, focal genomic instability. Genome Res. 2014;24:185–199. doi: 10.1101/gr.164806.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Tluckova K., Marusic M., Tothova P., Bauer L., Sket P., Plavec J., Viglasky V. Human papillomavirus G-quadruplexes. Biochemistry. 2013;52:7207–7216. doi: 10.1021/bi400897g. [DOI] [PubMed] [Google Scholar]
- 129.Kutok J.L., Wang F. Spectrum of Epstein-Barr virus-associated diseases. Annu. Rev. Pathol. 2006;1:375–404. doi: 10.1146/annurev.pathol.1.110304.100209. [DOI] [PubMed] [Google Scholar]
- 130.Norseen J., Johnson F.B., Lieberman P.M. Role for G-quadruplex RNA binding by Epstein-Barr virus nuclear antigen 1 in DNA replication and metaphase chromosome attachment. J. Virol. 2009;83:10336–10346. doi: 10.1128/JVI.00747-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Murat P., Zhong J., Lekieffre L., Cowieson N.P., Clancy J.L., Preiss T., Balasubramanian S., Khanna R., Tellam J. G-quadruplexes regulate Epstein-Barr virus-encoded nuclear antigen 1 mRNA translation. Nat. chem. Biol. 2014;10:358–364. doi: 10.1038/nchembio.1479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Balasubramanian S., Hurley L.H., Neidle S. Targeting G-quadruplexes in gene promoters: a novel anticancer strategy? Nat. Rev. Drug Discov. 2011;10:261–275. doi: 10.1038/nrd3428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Shalaby T., Fiaschetti G., Nagasawa K., Shin-ya K., Baumgartner M., Grotzer M. G-quadruplexes as potential therapeutic targets for embryonal tumors. Molecules. 2013;18:12500–12537. doi: 10.3390/molecules181012500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Renaud de la Faverie A., Hamon F., Di Primo C., Largy E., Dausse E., Delauriere L., Landras-Guetta C., Toulme J.J., Teulade-Fichou M.P., Mergny J.L. Nucleic acids targeted to drugs: SELEX against a quadruplex ligand. Biochimie. 2011;93:1357–1367. doi: 10.1016/j.biochi.2011.05.022. [DOI] [PubMed] [Google Scholar]
- 135.Phan A.T., Kuryavyi V., Gaw H.Y., Patel D.J. Small-molecule interaction with a five-guanine-tract G-quadruplex structure from the human MYC promoter. Nat. Chem. Biol. 2005;1:167–173. doi: 10.1038/nchembio723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Hurley L.H., Wheelhouse R.T., Sun D., Kerwin S.M., Salazar M., Fedoroff O.Y., Han F.X., Han H., Izbicka E., Von Hoff D.D. G-quadruplexes as targets for drug design. Pharmacol. Ther. 2000;85:141–158. doi: 10.1016/s0163-7258(99)00068-6. [DOI] [PubMed] [Google Scholar]
- 137.Ou T.M., Lu Y.J., Tan J.H., Huang Z.S., Wong K.Y., Gu L.Q. G-quadruplexes: targets in anticancer drug design. ChemMedChem. 2008;3:690–713. doi: 10.1002/cmdc.200700300. [DOI] [PubMed] [Google Scholar]
- 138.Perrone R., Butovskaya E., Daelemans D., Palu G., Pannecouque C., Richter S.N. Anti-HIV-1 activity of the G-quadruplex ligand BRACO-19. J. Antimicrob. Chemother. 2014 doi: 10.1093/jac/dku280. in press. [DOI] [PubMed] [Google Scholar]
- 139.Santosh B., Yadava P.K. Nucleic acid aptamers: research tools in disease diagnostics and therapeutics. Biomed. Res. Int. 2014;2014:540451. doi: 10.1155/2014/540451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 140.Perry C.M., Balfour J.A. Fomivirsen. Drugs. 1999;57:375–380. doi: 10.2165/00003495-199957030-00010. [DOI] [PubMed] [Google Scholar]
- 141.Ng E.W., Shima D.T., Calias P., Cunningham E.T., Jr, Guyer D.R., Adamis A.P. Pegaptanib, a targeted anti-VEGF aptamer for ocular vascular disease. Nat. Rev. Drug Discov. 2006;5:123–132. doi: 10.1038/nrd1955. [DOI] [PubMed] [Google Scholar]
- 142.Bock L.C., Griffin L.C., Latham J.A., Vermaas E.H., Toole J.J. Selection of single-stranded DNA molecules that bind and inhibit human thrombin. Nature. 1992;355:564–566. doi: 10.1038/355564a0. [DOI] [PubMed] [Google Scholar]
- 143.Avino A., Fabrega C., Tintore M., Eritja R. Thrombin binding aptamer, more than a simple aptamer: chemically modified derivatives and biomedical applications. Curr. Pharma. Des. 2012;18:2036–2047. doi: 10.2174/138161212799958387. [DOI] [PubMed] [Google Scholar]
- 144.Girvan A.C., Teng Y., Casson L.K., Thomas S.D., Juliger S., Ball M.W., Klein J.B., Pierce W.M., Jr, Barve S.S., Bates P.J. AGRO100 inhibits activation of nuclear factor-kappaB (NF-kappaB) by forming a complex with NF-kappaB essential modulator (NEMO) and nucleolin. Mol. Cancer Ther. 2006;5:1790–1799. doi: 10.1158/1535-7163.MCT-05-0361. [DOI] [PubMed] [Google Scholar]
- 145.Rosenberg J.E., Bambury R.M., Van Allen E.M., Drabkin H.A., Lara P.N., Jr, Harzstark A.L., Wagle N., Figlin R.A., Smith G.W., Garraway L.A., et al. A phase II trial of AS1411 (a novel nucleolin-targeted DNA aptamer) in metastatic renal cell carcinoma. Invest. New Drugs. 2014;32:178–187. doi: 10.1007/s10637-013-0045-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 146.Tuerk C., Gold L. Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. Science. 1990;249:505–510. doi: 10.1126/science.2200121. [DOI] [PubMed] [Google Scholar]
- 147.Bunka D.H., Platonova O., Stockley P.G. Development of aptamer therapeutics. Curr. Opin. Pharmacol. 2010;10:557–562. doi: 10.1016/j.coph.2010.06.009. [DOI] [PubMed] [Google Scholar]
- 148.Famulok M., Mayer G. Aptamers as tools in molecular biology and immunology. Curr. Top. Microbiol. Immunol. 1999;243:123–136. doi: 10.1007/978-3-642-60142-2_7. [DOI] [PubMed] [Google Scholar]
- 149.Mescalchin A., Wunsche W., Sczakiel G. Specific recognition of proteins by array-bound hexanucleotides. Angew. Chem. Int. Ed. Engl. 2011;50:1052–1054. doi: 10.1002/anie.201003462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 150.Blaum B.S., Wunsche W., Benie A.J., Kusov Y., Peters H., Gauss-Muller V., Peters T., Sczakiel G. Functional binding of hexanucleotides to 3C protease of hepatitis A virus. Nucleic Acids Res. 2012;40:3042–3055. doi: 10.1093/nar/gkr1152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 151.Lynch J.P., 3rd, Walsh E.E. Influenza: evolving strategies in treatment and prevention. Semin. Respir. Crit. Care Med. 2007;28:144–158. doi: 10.1055/s-2007-976487. [DOI] [PubMed] [Google Scholar]
- 152.Suarez D.L., Perdue M.L. Multiple alignment comparison of the non-structural genes of influenza A viruses. Virus Res. 1998;54:59–69. doi: 10.1016/s0168-1702(98)00011-2. [DOI] [PubMed] [Google Scholar]
- 153.Donelan N.R., Basler C.F., Garcia-Sastre A. A recombinant influenza A virus expressing an RNA-binding-defective NS1 protein induces high levels of beta interferon and is attenuated in mice. J. Virol. 2003;77:13257–13266. doi: 10.1128/JVI.77.24.13257-13266.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 154.Woo H.M., Kim K.S., Lee J.M., Shim H.S., Cho S.J., Lee W.K., Ko H.W., Keum Y.S., Kim S.Y., Pathinayake P., et al. Single-stranded DNA aptamer that specifically binds to the influenza virus NS1 protein suppresses interferon antagonism. Antiviral Res. 2013;100:337–345. doi: 10.1016/j.antiviral.2013.09.004. [DOI] [PubMed] [Google Scholar]
- 155.Tan J., Vonrhein C., Smart O.S., Bricogne G., Bollati M., Kusov Y., Hansen G., Mesters J.R., Schmidt C.L., Hilgenfeld R. The SARS-unique domain (SUD) of SARS coronavirus contains two macrodomains that bind G-quadruplexes. PLoS Pathog. 2009;5:e1000428. doi: 10.1371/journal.ppat.1000428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 156.Shum K.T., Tanner J.A. Differential inhibitory activities and stabilisation of DNA aptamers against the SARS coronavirus helicase. Chembiochem. 2008;9:3037–3045. doi: 10.1002/cbic.200800491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 157.Tuerk C., MacDougal S., Gold L. RNA pseudoknots that inhibit human immunodeficiency virus type 1 reverse transcriptase. Proc. Natl. Acad. Sci. U.S.A. 1992;89:6988–6992. doi: 10.1073/pnas.89.15.6988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 158.Ecker D.J., Vickers T.A., Hanecak R., Driver V., Anderson K. Rational screening of oligonucleotide combinatorial libraries for drug discovery. Nucleic Acids Res. 1993;21:1853–1856. doi: 10.1093/nar/21.8.1853. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 159.Wyatt J.R., Vickers T.A., Roberson J.L., Buckheit R.W., Jr, Klimkait T., DeBaets E., Davis P.W., Rayner B., Imbach J.L., Ecker D.J. Combinatorially selected guanosine-quartet structure is a potent inhibitor of human immunodeficiency virus envelope-mediated cell fusion. Proc. Natl. Acad. Sci. U.S.A. 1994;91:1356–1360. doi: 10.1073/pnas.91.4.1356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 160.Buckheit R.W., Jr, Roberson J.L., Lackman-Smith C., Wyatt J.R., Vickers T.A., Ecker D.J. Potent and specific inhibition of HIV envelope-mediated cell fusion and virus binding by G quartet-forming oligonucleotide (ISIS 5320) AIDS Res. Hum. Retroviruses. 1994;10:1497–1506. doi: 10.1089/aid.1994.10.1497. [DOI] [PubMed] [Google Scholar]
- 161.Stoddart C.A., Rabin L., Hincenbergs M., Moreno M., Linquist-Stepps V., Leeds J.M., Truong L.A., Wyatt J.R., Ecker D.J., McCune J.M. Inhibition of human immunodeficiency virus type 1 infection in SCID-hu Thy/Liv mice by the G-quartet-forming oligonucleotide, ISIS 5320. Antimicrob. Agents Chemother. 1998;42:2113–2115. doi: 10.1128/aac.42.8.2113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 162.Rando R.F., Ojwang J., Elbaggari A., Reyes G.R., Tinder R., McGrath M.S., Hogan M.E. Suppression of human immunodeficiency virus type 1 activity in vitro by oligonucleotides which form intramolecular tetrads. J. Biol. Chem. 1995;270:1754–1760. doi: 10.1074/jbc.270.4.1754. [DOI] [PubMed] [Google Scholar]
- 163.Mazumder A., Neamati N., Ojwang J.O., Sunder S., Rando R.F., Pommier Y. Inhibition of the human immunodeficiency virus type 1 integrase by guanosine quartet structures. Biochemistry. 1996;35:13762–13771. doi: 10.1021/bi960541u. [DOI] [PubMed] [Google Scholar]
- 164.Mukundan V.T., Do N.Q., Phan A.T. HIV-1 integrase inhibitor T30177 forms a stacked dimeric G-quadruplex structure containing bulges. Nucleic Acids Res. 2011;39:8984–8991. doi: 10.1093/nar/gkr540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 165.Marchand C., Maddali K., Metifiot M., Pommier Y. HIV-1 IN inhibitors: 2010 update and perspectives. Curr. Top. Med. Chem. 2009;9:1016–1037. doi: 10.2174/156802609789630910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 166.Este J.A., Cabrera C., Schols D., Cherepanov P., Gutierrez A., Witvrouw M., Pannecouque C., Debyser Z., Rando R.F., Clotet B., et al. Human immunodeficiency virus glycoprotein gp120 as the primary target for the antiviral action of AR177 (Zintevir) Mol. Pharmacol. 1998;53:340–345. doi: 10.1124/mol.53.2.340. [DOI] [PubMed] [Google Scholar]
- 167.Hotoda H., Koizumi M., Koga R., Kaneko M., Momota K., Ohmine T., Furukawa H., Agatsuma T., Nishigaki T., Sone J., et al. Biologically active oligodeoxyribonucleotides. 5. 5′-End-substituted d(TGGGAG) possesses anti-human immunodeficiency virus type 1 activity by forming a G-quadruplex structure. J. Med. Chem. 1998;41:3655–3663. doi: 10.1021/jm970658w. [DOI] [PubMed] [Google Scholar]
- 168.Andreola M.L., Pileur F., Calmels C., Ventura M., Tarrago-Litvak L., Toulme J.J., Litvak S. DNA aptamers selected against the HIV-1 RNase H display in vitro antiviral activity. Biochemistry. 2001;40:10087–10094. doi: 10.1021/bi0108599. [DOI] [PubMed] [Google Scholar]
- 169.Metifiot M., Leon O., Tarrago-Litvak L., Litvak S., Andreola M.L. Targeting HIV-1 integrase with aptamers selected against the purified RNase H domain of HIV-1 RT. Biochimie. 2005;87:911–919. doi: 10.1016/j.biochi.2005.03.013. [DOI] [PubMed] [Google Scholar]
- 170.de Soultrait V.R., Lozach P.Y., Altmeyer R., Tarrago-Litvak L., Litvak S., Andreola M.L. DNA aptamers derived from HIV-1 RNase H inhibitors are strong anti-integrase agents. J. Mol. Biol. 2002;324:195–203. doi: 10.1016/s0022-2836(02)01064-1. [DOI] [PubMed] [Google Scholar]
- 171.Andreola M.L. Closely related antiretroviral agents as inhibitors of two HIV-1 enzymes, ribonuclease H and integrase: ‘killing two birds with one stone’. Curr. Pharma. Des. 2004;10:3713–3723. doi: 10.2174/1381612043382648. [DOI] [PubMed] [Google Scholar]
- 172.Phan A.T., Kuryavyi V., Ma J.B., Faure A., Andreola M.L., Patel D.J. An interlocked dimeric parallel-stranded DNA quadruplex: a potent inhibitor of HIV-1 integrase. Proc. Natl. Acad. Sci. U.S.A. 2005;102:634–639. doi: 10.1073/pnas.0406278102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 173.Faure-Perraud A., Metifiot M., Reigadas S., Recordon-Pinson P., Parissi V., Ventura M., Andreola M.L. The guanine-quadruplex aptamer 93del inhibits HIV-1 replication ex vivo by interfering with viral entry, reverse transcription and integration. Antiviral Ther. 2011;16:383–394. doi: 10.3851/IMP1756. [DOI] [PubMed] [Google Scholar]
- 174.Metifiot M., Faure A., Guyonnet-Duperat V., Bellecave P., Litvak S., Ventura M., Andreola M.L. Cellular uptake of ODNs in HIV-1 human-infected cells: a role for viral particles in DNA delivery. Oligonucleotides. 2007;17:151–165. doi: 10.1089/oli.2006.0061. [DOI] [PubMed] [Google Scholar]
- 175.Liang Y., Zhang Z., Wei H., Hu Q., Deng J., Guo D., Cui Z., Zhang X.E. Aptamer beacons for visualization of endogenous protein HIV-1 reverse transcriptase in living cells. Biosens. Bioelectron. 2011;28:270–276. doi: 10.1016/j.bios.2011.07.031. [DOI] [PubMed] [Google Scholar]
- 176.Alvarado-Mora M.V., Locarnini S., Rizzetto M., Pinho J.R. An update on HDV: virology, pathogenesis and treatment. Antiviral Ther. 2013;18:541–548. doi: 10.3851/IMP2598. [DOI] [PubMed] [Google Scholar]
- 177.Beaudoin J.D., Perreault J.P. Potassium ions modulate a G-quadruplex-ribozyme's activity. RNA. 2008;14:1018–1025. doi: 10.1261/rna.963908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 178.Reymond C., Beaudoin J.D., Perreault J.P. Modulating RNA structure and catalysis: lessons from small cleaving ribozymes. Cell. Mol. Life Sci. 2009;66:3937–3950. doi: 10.1007/s00018-009-0124-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 179.Hull R., Covey S.N., Maule A.J. Structure and replication of caulimovirus genomes. J. Cell Sci. Suppl. 1987;7:213–229. doi: 10.1242/jcs.1987.supplement_7.16. [DOI] [PubMed] [Google Scholar]
- 180.Qiu B., Zhang Y.S., Lin Y.B., Lu Y.J., Lin Z.Y., Wong K.Y., Chen G.N. A novel fluorescent biosensor for detection of target DNA fragment from the transgene cauliflower mosaic virus 35S promoter. Biosens. Bioelectron. 2013;41:168–171. doi: 10.1016/j.bios.2012.08.017. [DOI] [PubMed] [Google Scholar]
- 181.Neidle S., Balasubramanian S. Quadruplex Nucleic Acids. Cambridge: RSC; 2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 182.Neidle S. Therapeutic Applications of Quadruplex Nucleic Acids. 1st edn. Waltham, MA: Academic Press; 2011. [Google Scholar]
- 183.Crooks G.E., Hon G., Chandonia J.M., Brenner S.E. WebLogo: a sequence logo generator. Genome Res. 2004;14:1188–1190. doi: 10.1101/gr.849004. [DOI] [PMC free article] [PubMed] [Google Scholar]