Skip to main content
Cellular and Molecular Neurobiology logoLink to Cellular and Molecular Neurobiology
. 2015 Oct 11;36(6):955–964. doi: 10.1007/s10571-015-0281-8

Maximal Expression of the Evolutionarily Conserved Slit2 Gene Promoter Requires Sp1

Jacquelyn Saunders 1, D Roonalika Wisidagama 1, Travis Morford 1, Cindy S Malone 1,
PMCID: PMC4828330  NIHMSID: NIHMS729956  PMID: 26456684

Abstract

Slit2 is a neural axon guidance and chemorepellent protein that stimulates motility in a variety of cell types. The role of Slit2 in neural development and neoplastic growth and migration has been well established, while the genetic mechanisms underlying regulation of the Slit2 gene have not. We identified the core and proximal promoter of Slit2 by mapping multiple transcriptional start sites, analyzing transcriptional activity, and confirming sequence homology for the Slit2 proximal promoter among a number of species. Deletion series and transient transfection identified the Slit2 proximal promoter as within 399 base pairs upstream of the start of transcription. A crucial region for full expression of the Slit2 proximal promoter lies between 399 base pairs and 296 base pairs upstream of the start of transcription. Computer modeling identified three transcription factor-binding consensus sites within this region, of which only site-directed mutagenesis of one of the two identified Sp1 consensus sites inhibited transcriptional activity of the Slit2 proximal promoter (−399 to +253). Bioinformatics analysis of the Slit2 proximal promoter −399 base pair to −296 base pair region shows high sequence conservation over twenty-two species, and that this region follows an expected pattern of sequence divergence through evolution.

Electronic supplementary material

The online version of this article (doi:10.1007/s10571-015-0281-8) contains supplementary material, which is available to authorized users.

Keywords: Promoter, Slit2, Transcription, Phylogenetics, Evolution

Introduction

The SLIT family of proteins are secreted by midline glial cells, while their receptor, Roundabout (Robo), is expressed on the axon growth cone. In mammals, Slit-Robo signaling is required for the proper development of the central nervous system (CNS), and organs including the lung, kidney, and mammary gland. Slit-Robo signaling also plays a role in leukocyte chemotaxis, and in the migration of endothelial cells (Ypsilanti et al. 2010). Recent studies suggest that Slit-Robo signaling affects tumorigenic properties such as angiogenesis, adhesion, apoptosis, and proliferation in complex ways that often contradict each other (Ballard and Hinck 2012).

The membrane-bound Slit2 protein was originally thought of as a mid-line guidance molecule only: chemorepulsive-to-trunk neural crest cell migration (Kidd et al. 1999; De Bellard et al. 2003; Blockus and Chédotal 2014). Specifically, an in vitro extracellular Slit2 protein gradient results in a reversal in the direction of migration of neuronal precursor cells without affecting their migration speed (Xu et al. 2004). In fact, Slit2 gain-of-function mutations significantly impaired neural crest cell migration, while Slit2 loss-of-function mutations favored migration (Giovannone et al. 2012). But, Slit2′s role in nervous system development and function has extended far beyond migration and patterning to include the outgrowth of dendrites and the formation of the spine itself (Blockus and Chédotal 2014).

In original studies, Slit2 gene expression was found in many human stationary tumors (Wang et al. 2003), but not in aggressive metastatic tumors (Dallol et al. 2002, 2003). A recent compilation of tumor types suggests that this categorization is much more complicated. In fact, Slit2 expression is down-regulated in many tumor types including lung, brain, cervical, acute lymphocytic leukemia, and breast cancers (Blockus and Chédotal 2014). In fact, in ~50 % of sampled human breast tumors, the Slit2 gene was silenced. In addition, breast cancer cells that actually over-expressed Slit2 exhibited decreased proliferation and migration capabilities compared with control cells under in vitro, and in vivo conditions (Marlow et al. 2008). Increased levels of Slit2 expression are also associated with colon and prostate cancers (Blockus and Chédotal 2014).

Appropriately controlling the expression of Slit2 is absolutely critical for proper CNS formation and development, and for controlling its tumorigenic and metastatic potential. Here, we identify and characterize the Slit2 gene promoter as an evolutionarily conserved, TATA-less, dispersive transcription start site promoter that relies on an Sp1 site for maximal transcription.

Materials and Methods

Rapid Amplification of cDNA Ends

The Invitrogen 5′ RACE system (Invitrogen) was used to perform 5′ RACE as described. Gene-Specific Primers—GSP1 (Reverse), GSP3 (Forward), and GSP2 (nested, Reverse) (Integrated DNA Technologies)—were designed using the Primer 3 software as per the Invitrogen 5′ RACE system protocol.

GSP1: 5′-CCTTCCTTGGAATTGCTTGA -3′.

GSP2 (nested): 5′-TCTCGATGGTGCTGATTCTG -3′.

GSP3: 5′-CTCCTGCCCCATATCACTGT -3′.

Total RNA was extracted from NIH3T3 cells using the RNeasy mini prep kit (Qiagen) as per the protocol. 5′ RACE parameters were as per protocol using total RNA and primers with Tm <60 °C.

Plasmid Construction and Mutagenesis

The mouse Slit2 putative promoter sequence was obtained using a mouse cDNA for Slit2 (Accession number NM_178804.4) and the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search Protocol (BLAST) algorithm (Altschul et al. 1990, 1997; Zhang and Madden 1997). Putative transcription factor-binding sites in the mouse Slit2 promoter were identified using the following transcription factor analysis databases: TF SEARCH, MATCH public version 1.0, Alibaba 2.1, ConSite, PROMO, and Motif Viz. The Slit2 promoter region was PCR amplified from mouse genomic DNA using primers at −1023 bp upstream and +239 bp downstream (just before the first ATG start of translation) of the Genbank recorded transcription start site. −1023 F: 5′-GAAACTTCCTGCACTTTGAGGCTGAGGAC-3′,

+200 R: 5′-CTTCCCTAATAGACTCCTTGCTTTGAAGG-3′. The 1223-bp fragment was directionally cloned into the pGL3 Basic Vector (Promega). Slit2 promoter deletion constructs were generated using existing or by creating restriction sites using the Quik-change site-directed mutagenesis kit (Stratagene), excising the internal fragment, and re-ligating the vector. Complementary oligonucleotides for site-directed mutagenesis are as follows:

NheI-153: 5′-ACTGGAATCGGATTTGCTAGcGGGTTCGTGGGATCA-3′.

XhoI-674: 5′-GCTCTTCTGGGCTCGagTATCTGCAAATCTTGC-3′.

MluI-490: 5′-GCGTCGCCAAGGACGCgtGGAGGGAGGCGGGATTG-3′.

MluI-399: 5′-GGAAGTGGAGAAGACGcgTGAGGCTGCTGCTGT-3′.

MluI-296: 5′- GTGCGCCCCCACGCGtGCCATGACCAGTCTC-3′.

PstI-100: 5′-GCAACCAGCCTCTGCAgCTTCGCGGACACTGC-3′.

The Slit2 constructs with mutated Sp1 or USF sites were created using the Quik-change site-directed mutagenesis kit (Stratagene, La Jolla, CA, USA) and the following complimentary oligonucleotides:

Sp1 (−359 bp): 5′-GAGCCCTGAGaaCCGCATCACCGCCCGGCACCCCC-3′.

Sp1 (−345 bp): 5′-GCATCACCGCtCGGCACttCCAAGGCTGTCTGAATACC-3′.

USF (−305 bp): 5′-GAATACCCGGTGCGCCCCCAatCGCGCCATGACCAGTCTC-3′.

Cell Culture, Transfections, and Dual Luciferase Assays

Cell culture media was supplemented with 3.7 g NaHCO3, 1X Sodium Pyruvate, 1X Non-essential Amino Acids, 1X Penicillin–Streptomycin, and 10 % Fetal Bovine Serum.

HEK 293T cells were grown in RPMI, and NIH 3T3 cells were grown in DMEM. Cell lines were cotransfected using 2–3.5 ul Effectene transfection reagent (Qiagen), 0.6 micrograms pGL3 constructs, 0.4 micrograms pRL SV40 for normalization, and assayed as described using the Dual Luciferase assay kit (Promega). Results are the averages of at least three independent transfections using at least two preparations of Mini-prep DNA (Qiagen). Statistical analysis was performed using the Student’s two-sided t test in Microsoft Excel.

Multiple Sequence Alignment

The Slit2 promoter sequence was used in conjunction with a BLAST to obtain orthologous Slit2 promoter sequences from other species in the NCBI nucleotide database. A total of 21 species with a highly conserved DNA sequence (>50 %) in the −399 to −296 bp Slit2 promoter region were found in NCBI (not including the mouse Slit2 promoter query sequence). The species and accession numbers the BLAST protocols are listed in Supplemental Table 1. A multiple sequence alignment (MSA) was constructed using CLUSTAL W 3.2 (http://workbench.sdsc.edu), and BioEdit 7.0.9.0 software (http://www.mbio.ncsu.edu/BioEdit/bioedit.html).

Model Testing

To determine the best-fit substitution model for the MSA, model testing was performed in the Model Test Program within MEGA 5.1. Model testing included the Bayesian information criterion (BIC) and the Akaike information criterion (AIC) to assess a substitution model’s goodness of fit to the dataset (Guindon and Gascuel 2003; Posada and Buckley 2004; Erb and van Nimwegen 2011). The Model test data indicate that the Tamura 3-parameter with Gamma distributed rates (T92+G) model, having the lowest AIC score, will provide the best DNA model for the pairwise distances of the DNA sequences. (Supplemental Table 2). Each parameter of the T92+G model including the individual base frequencies, nucleotide substitution rates, transition/transversion bias (R), and the gamma distribution shape parameter (+G), which was determined to be essentially one (1.08). Invariable sites (+I) were not detected. Base substitution rates were also calculated.

Tree Construction

Bayesian Analysis was performed using Mr. Bayes 3.1.2 (Huelsenbeck and Ronquist 2001). The Bayesian analysis parameters include three million generations, generation of a tree every 100 generations, and a “burn-in” value of 7000 generations. MEGA 5.1 was used to produce a Maximum Likelihood tree using parametric bootstrapping at 5000 replicates. Maximum Parsimony and Neighbor-Joining trees were both produced with bootstrapping for 5000 and 10,000 replicates using MEGA software.

Results

The Slit2 Gene has Multiple, Dispersed Transcription Start Sites

Analyses of core promoters have led to the discovery of consensus sequence motifs such as the TATA box, the Initiator element (Inr), the motif ten element (MTE), the downstream core element (DCE), and the downstream promoter element (DPE) that drive basal levels of transcription from focused single transcription start sites (TSS) (Juven-Gershon and Kadonaga 2010). These elements are generally not found or not functional in promoters with multiple dispersed TSS (Juven-Gershon et al. 2008). Using the Slit2 gene mRNA sequence, NCBI BLAST was performed and the genomic DNA region upstream of the 5′UTR was identified. Computer analyses of the putative promoter region using multiple transcription factor scanning software failed to identify a consensus TATA box sequence, an Inr, an MTE, a DCE, or a DPE in the Slit21 putative promoter. The lack of these consensus sequences suggests that transcription initiation of the Slit2 promoter occurs at multiple dispersed sites (Juven-Gershon and Kadonaga 2010). 5′ RACE confirmed multiple dispersed TSS in the putative Slit2 promoter, confirming the computer analyses (Fig. 1). The TSS suggested by mRNA sequences submitted to GenBank™ is located 253 bp upstream of the ATG translation start site. 5′ RACE analysis identified six independent TSS in the Slit2 promoter that were located 38, 57, 68, 72, 202, and 314 bp upstream of the first ATG (Fig. 2) and are consistent with a dispersed mode of transcription initiation. Interestingly, none of the experimentally identified TSS aligned with the suggested start site from mRNA sequences submitted to GenBank™ (253 bp).

Fig. 1.

Fig. 1

The Slit2 gene promoter sequence identifying 5′ RACE transcription start sites and construct lengths. Primer sequences used for generating the Slit2 −1075 to +253 construct are shown as black arrowhead underlines. The GenBank-assigned start of transcription is shown in red text labeled (+1) with a large red arrow above the sequence. The start of translation is shown in bold black text “ATG”. 5′ RACE was used to experimentally identify the transcription start site(s) and are shown as labeled red sequences and small red arrows. PstI-607 and EcoRV-490 are endogenous restriction sites and site-directed mutagenesis was used to create the -674 (Xho1), -469 (Mlu1), -399 (Mlu1), -296 (Mlu1), -153 (Nhe1), and -100 (Pst1) restriction sites for deletion of intervening sequence and re-ligation of deletion constructs. CpG dinucleotides are denoted in blue text. Putative transcription factor-binding sites identified by database analyses are outlined in orange or green and labeled by putative transcription factor name (Color figure online)

Fig. 2.

Fig. 2

The Slit2 −399 to +253 promoter construct shows maximal activity and defines the Slit2 proximal promoter. Transient transfection of the Slit2 promoter deletion constructs −1075, −674, −607, −490, −469, −399, −296, −153, −100 bp, and pGL3 basic control vector were performed in the HEK293T cell line. The Slit2 promoter deletion construct lengths are represented by continually shorter labeled lines that are not drawn to scale. Slit2 promoter sequences for each construct are shown in Fig. 1. The activity of each construct is expressed as the fold activation over promoterless pGL3 basic vector. Luciferase activity is pRL SV40-luciferase promoter and enhancer vector normalized and are the average ± SD of at least four independent transfections using at least three preparations of DNA. All construct activities were significantly higher than pGL3 basic by the Student two-sided t test (p ≤ 0.05). Slit2 −399 bp activity was significantly higher than all other constructs (p ≤ 0.05) except for the Slit2 −1075 bp construct

Identification of the Slit2 Gene Proximal Promoter

To define the boundaries of the most active Slit2 gene promoter sequences and to identify cis elements that may govern the transcriptional activity of Slit2, a series of deletion constructs was prepared (Fig. 1) and tested for promoter activity using transient transfection of a mouse fibroblast cell line (NIH 3T3) and the dual luciferase assay system (Fig. 2). The ten constructs tested, −1075, −674, −607, −490, −469, −399, −296, −153, and −100 bp all showed significant activity over promoterless control (Fig. 2). The Slit2 promoter sequence for each deletion construct is delineated in Fig. 1.

The Slit2 −296, −153, and −100 bp constructs all showed a statistically significantly lower activity than the Slit2 −399 bp construct (Fig. 2). The significant drop in activity between the Slit2 −399 bp construct and the Slit2 −296 bp suggests the putative transcription factor-binding sites within this region may be essential for maximal proximal promoter transcription of Slit2. Therefore, the Slit2 −399 and −296 bp region was analyzed for putative transcription factor-binding sites using TFSEARCH, AliBaba2.1, MATCH, ConSite, PROMO, and Motif Viz databases.

Evolutionarily Conserved Transcription Factor Consensus Sequences

Three putative transcription factor-binding consensus site sequences were identified within the Slit2 −399 and −296 bp region, two Sp1 sites (−359 and −345 bp) and a USF site (−305 bp), by at least two of the above six search algorithms (See Fig. 1). Both Sp1 and USF are well known transcription factors that commonly interact with sequences in TATA-less promoters like Slit2 (Workman et al. 1990; Emami et al. 1998; Erb and van Nimwegen 2011). Other putative transcription factor-binding site consensus sequences identified within the entire Slit2 promoter −1075 bp are shown in Fig. 1 as well. Although using the CLUSTALW Algorithm and BioEdit7.0.9.0 to create a MSA of the human, rat and mouse Slit2 −1023 promoter regions revealed high conservation (>70 %, data not shown), adding additional species to the alignment resulted in a loss of conservation. These additional orthologous species were identified by NCBI BLAST analysis using the mouse promoter sequences as a reference. An MSA of the Slit2 −399 to −296 bp region of the proximal promoter revealed 21 species with highly conserved DNA sequences in this region (data not shown) (Brown et al. 1998). The MSA of the mouse Slit2 proximal promoter showed a 51.5 % conservation of sequence in the least related species, the Giant Panda, with the highest conservation of sequence seen in the Norway Rat sequence (90.4 %). A high level of conservation among all of the Primate species, with the exception of the Galago, and a high level of sequence conservation among the Ungulates was also found (see phylogenetic tree in Fig. 4).

Fig. 4.

Fig. 4

Species relatedness by Bayesian inference analysis of the Slit2 proximal promoter −399 to −296 bp from 21 species. The tree was produced using Mr. Bayes 3.1.2 Bayesian inference analysis demonstrating posterior probabilities, noted next to each clade between 0 and 100, with 100 equaling 100 % agreement. Branch lengths indicate the amount of character change in proportion to estimated evolutionary distances. The scale bar represents units of genetic distance (0.05 mutations per site). Three million generations were performed with a tree produced every 100 generations and the first 7000 trees were discarded as “burn-in”

Several transcription factor-binding site search algorithms were used to identify conserved transcription factor-binding consensus sequences in the areas of highest conservation among the 21 species. Figure 1 shows the identified transcription factor-binding consensus sites for Sp1(−359), Sp1(−345) and USF1(−305). Among the 21 species, the Sp1(−359) site had only 2/8 identical base pairs (25 %) and 3/8 base pairs that were conserved (37.5 %), suggesting this consensus sequence was conserved, but not as highly conserved through evolution as the other two sites. Both the Sp1(−345) site and the USF1 site showed higher conservation of identical and conserved sequences among the 21 species analyzed. The Sp1(−345) site was 5/13 identical base pairs (38.5 %) and 9/13 conserved base pairs (69 %). The USF1 site was 6/12 identical base pairs (50 %) and 7/12 conserved base pairs (58 %). These data lend strong evidence for evolutionary conservation of these transcription factor-binding consensus sites, and for the conclusion that this region is in fact the proximal promoter of the Slit2 gene. Figure 4 illustrates this Slit2 −399 to −296 bp promoter region MSA graphically in phylogenetic tree form showing the evolutionary relationships among the 21 species.

Phylogenetic Analysis of the Slit2 −399 to −296 bp Proximal Promoter

Model testing was performed on the MSA to determine the evolutionary best-fit model for phylogenetic tree construction (Supplemental Table 2). Bayesian inference was performed using the Tamura (1992) model with Gamma distributed rates (T92+G) substitution model for tree construction (Fig. 4). All of the primate clades show high Bayesian support values save for the clade containing the Marmoset and the Bolivian Squirrel Monkey, due to a higher degree of divergence between the two species’ sequences. As expected, the Mouse and Norway rat are grouped in a single clade with a high Bayesian support value, as is the clade containing the Orca, Sheep, and Cow. The branch that gives rise to the Horse, Walrus, White Rhinoceros, and African Elephant has a much lower Bayesian support value than most of the clades in the tree, as does the branch that separates the Galago, Mouse, and Norway rat from the Giant Panda, European Shrew, Star-nosed mole, Sheep, Cow, Orca, Horse, Walrus, White Rhinoceros, and African Elephant. This is likely due to a higher degree of divergence in the Slit2 −399 to −296 bp proximal promoter sequence between these species. As the degree of divergence in the nucleotide sequence increases, the Bayesian support of species relatedness decreases. These data support the sequence conservation identified by MSA (data not shown and Fig. 4).

A Single Sp1 Site is Essential for Maximal Slit2 Proximal Promoter Activity

The three most highly conserved transcription factor-binding consensus sites were mutagenized to determine their importance in the Slit2 −399 bp proximal promoter. Site-directed mutagenesis of the Sp1(−359 bp), Sp1(−345 bp) and USF1(−305 bp) sites, using defined sequences known to abolish binding of the respective transcription factors, were introduced into the Slit2 −399 bp promoter construct and tested for activity in transient transfection. Figure 5 shows that mutations in the Sp1(−359 bp) site resulted only in a slight inhibition of transcriptional activity that was not statistically significant (p > 0.05). Surprisingly, mutations in the USF1 site did not affect promoter activity either, even though it is highly conserved through evolution (Fig. 3). When the Sp1 (−345 bp) with high sequence identity and conservation through evolution was mutated, transient transfection of the Slit2 −399 bp promoter construct resulted in a statistically significant inhibition of transcriptional activity (p < 0.001), indicating that this site is crucial for maximal transcription of the Slit2 −399 bp proximal promoter (Fig. 5).

Fig. 5.

Fig. 5

The Slit2 −399 to +253 mutant Sp1-345 bp promoter construct shows loss of activity and confirms the evolutionary importance of the Sp1-345 bp site. Transient transfection of the Slit2 promoter deletion construct −399, −399 bp mutant Sp1-359, -399 bp mutant Sp1-345, -399 bp mutant USF1-305, -296 bp, and pGL3 basic control vector were performed in the HEK293T cell line. Slit2 promoter sequences for each deletion construct are shown in Fig. 1 and mutations generated within the transcription factor consensus sequences are shown in Fig. 3. The activity of each construct is expressed as the fold activation over promoterless pGL3 basic vector. Luciferase activity is pRL SV40-luciferase promoter and enhancer vector normalized and are the average ± SD of at least four independent transfections using at least three preparations of DNA. All construct activities were significantly higher than pGL3 basic by the Student two-sided t test (p ≤ 0.05), except Slit2 promoter deletion construct −399 bp mutant Sp1-345. The only construct with significantly lower activity than the Slit2 −399 bp deletion construct was the Slit2 promoter deletion construct -399 bp mutant Sp1-345 (p ≤ 0.05)

Fig. 3.

Fig. 3

Multiple sequence alignment of the −362 to −303 bp Slit2 promoter reveals high evolutionary conservation of the Sp1 (−345) and USF1 (−305) sites. MSA using the Slit2 proximal promoter −399 to −296 bp from 21 species was performed. The −362 to −303 bp region of the Slit2 proximal promoter with the two Sp1 sites (−359 and −345, labeled, boxed, and shown in green text) and the USF1 site (−305, labeled, boxed, and shown in green text) is shown here. Conserved and identical sequences among species, omitting species with gapped sequences in the analyses, are labeled “Conservation” and “Identity” and shown in green text. Mutations incorporated into the Slit2 proximal promoter −399 to +253 bp by site-directed mutagenesis are shown in bold, green, underlined text and labeled “Mutagenesis” (Color figure online)

Discussion

Gene expression is generally governed by the presence of certain cis-acting motifs within a promoter, including the TATA box, the downstream core promoter element (DPE), and the Initiator (Inr), that interact with transcription initiation proteins (Juven-Gershon and Kadonaga 2010). When transcription initiation occurs from promoters that lack these motifs, multiple transcription start sites result (Anish et al. 2009). Dispersed transcription initiation from multiple TSS occurs in about 70–72 % of promoters in vertebrates, while focused initiation is the predominant mechanism in simple organisms (Juven-Gershon et al. & Anish et al.). These data are consistent with what we have found for the Slit2 promoter lacking the common transcription initiation motifs, TATA, DPE, and Inr and having multiple transcription start sites. Maximal activity of the mouse Slit2 (muSlit2) gene promoter lies in the −399 bp region, suggesting important transcription factors interact there. Although computer analyses found many transcription factor-binding consensus sequences, only ten were found commonly among the four algorithms used to analyze this region. The three consensus sequences found in the region between the −399 and −296 bp of the Slit2 promoter were subject to scrutiny because this region harbored the highest levels of promoter activity. Of the identified two Sp1 sites and USF1 site, mutations at only one of the Sp1 sites, Sp1(−345 bp), showed a statistically significant reduction in transcriptional activity. The mutations created in both of the Sp1 sites and the USF site were previously shown to reduce binding of these transcription factors and to reduce transcriptional activity (Sims et al. 1993; Tamaki et al. 1995; Jiang et al. 1997; Huang et al. 2013).

The Sp1 transcription factor is a zinc-finger protein that can recruit the basal transcription factor TBP/TFIID to the initiation machinery, explaining previously characterized ability of Sp1 to induce transcription of TATA-less genes (Brandeis et al. 1994; Emami et al. 1998; Paonessa et al. 2013). The Sp1 core consensus sequence, CCGCCC or GGGCGG, has been further defined as (G/T)GGGCGG(G/A)(G/A)(C/T) (Song et al. 2001), although the Sp1 transcription factor is known to bind a variety of GC-rich sequences known as GC boxes (Solomon et al. 2008). In fact, just the core sequence, CGCC or GGcG was sufficient for DNA binding in vitro (Thiesen and Bach 1990). Sp1 sites tend to occur and contribute to the unmethylated state of CpG islands and therefore gene expression levels (Futscher et al. 2002) in TATA-less promoters (Brandeis et al. 1994; French et al. 2003; Cheng et al. 2004). CpG islands are generally at least 200 bp in length where the overall GC content is greater than 55 %, and includes more CpG sequences than by random chance compared to the rest of the genome (Deaton and Bird 2011). While the majority of CG dinucleotides outside CpG islands in the human genome are methylated, those within the islands are not (Bird 1980; Gardiner-Garden and Frommer 1987). CpG island promoters typically initiate from multiple start sites, unlike the single start sites characteristic of TATA box, DPE, and Inr promoter types (Suzuki et al. 2001), and are the most common type of promoter in the mammalian genome (Gagniuc and Ionescu-Tirgoviste 2012). The Slit2 promoter classifies as a CpG island due to its high GC content (62 %) and 26 CpG sites within the −399 bp promoter alone. In fact, the Slit2 gene was silenced by promoter hypermethylation in many aggressive metastatic tumors (Dallol et al. 2002, 2003), including 29 % of neuroblastomas, 38 % of Wilms’ tumors, and 25 % of renal cell carcinomas studied (Astuti et al. 2004), while it was expressed in many stationary tumors (Wang et al. 2003). Taken together, these data are consistent with our mutagenesis results showing that Sp1(−345) is critical for Slit2 gene expression from the −399 bp promoter. The USF transcription factor is a basic helix-loop-helix leucine zipper family protein that activates transcription at initiator (Inr) elements (C/T C/T A N A/T C/T C/T) and E-box motifs (5′-CACGTG-3′) (Roy et al. 1991; Jones 2004). The Slit2 promoter contains an evolutionarily conserved non-canonical E-box motif (5′-CACGCG-3′) that when mutated (5′-CAatCG-3′) did not affect transcription of the Slit2 −399 proximal promoter, suggesting that it was not a functional E-box in our experiments (Carr and Sharp 1990; Gregor et al. 1990; Sims et al. 1993; Chen et al. 2012; Huang et al. 2013). Because USF is known to interact with other transcription factors, such as Sp1 (Ge et al. 2003), it is possible that we did not see any difference in activity without combinatorially mutating the Sp1 and USF sites. The region just 5′ adjacent to the non-canonical E-box was highly evolutionarily conserved as well, but the transcription factor consensus sequence analyses did not consistently identify any transcription factor-binding consensus sites there (See Figs. 1, 3). Further mutational analyses of this region would be necessary to uncover the significance of the evolutionary conservation.

Interestingly, the region just upstream of the Slit2 −399 region shows a much lower level of activity, suggesting the presence of silencer elements or negative regulation within the −469 to −400 Slit2 promoter region (Malone et al. 2000; Malone et al. 2001; Malone et al. 2006). Another evolutionarily conserved E box is present within this region, potentially interacting with USF transcription factors (See Fig. 1). Although most DNA interactions with USF promote gene activation, it can interact with negative cofactors (Anantharaman et al. 2011). In this case, however, USF and its negative cofactor repressed TATA-driven promoters, and the Slit2 promoter does not contain a TATA box (Meisterernst et al. 1991). Again, a thorough mutational analysis of this region would be needed to implicate the perpetrators of the negative regulation in this region.

Model testing of the entire conserved Slit2 −399 to −296 bp promoter MSA indicated that the T92+G model provided the best DNA model for pairwise distances. This method provided the lowest AIC score, likely because the T92 model accounts for the differences between rates of transition mutations (a change from one purine to another) and transversion mutations (change from a purine to a pyrimidine), allowing for variable substitution rates, unlike the Jukes-Cantor model, which has a higher AIC score. The T92 model also accounts for G/C content bias, which is a difference in the number of G and C nucleotides in a DNA sequence from that of A and T nucleotides, unlike the Kimura 2-parameter model which assumes that the frequencies of all four nucleotides A, T, C and G remains constant (0.025) over evolutionary time (Tamura 1992).

Three methods of phylogenetic tree generation were performed that resulted in trees with similar topology. Maximum likelihood, Neighbor Joining, and Bayesian inference trees were generated, but only the Bayesian inference tree is shown in Fig. 4 because it was produced using the Markov Chain Monte Carlo simulation. This process results in a posterior probability distribution and gives an actual statistical probability that the tree was accurately constructed (Huelsenbeck and Ronquist 2001). Each of the three methods grouped the primates in a single clade or, as is the case with the Bayesian inference tree (Fig. 4) with the primates originating from the same branch, and then grouped the others into different clades according to the degree of species relatedness. Each of the trees grouped the ungulates in a clade, with the other orders grouped therein (Felsenstein 1981; Efron 1982; Felsenstein 1985). The Bayesian inference tree is presented here because overall, the bootstrapping values are highest for the Bayesian inference analysis compared to Maximum Likelihood and Neighbor Joining analyses (data not shown). The higher the percent value for Bootstrapping, the more confident the analysis was in the relatedness of species and how the tree is branched into clades. All phylogenetic trees generated from the MSA show that the Slit2 promoter sequences of Primates, Ungulates, Carnivora, Rodentia, Soricomorpha, Lagomorpha and Proboscidea have differences that define them as separate orders, but the critical region of the Slit2 promoter that drives maximal transcription is highly conserved through evolution. These data support the hypothesis that that while coding and non-coding DNA sequences are equally susceptible to mutation, selective pressures exist on protein coding and regulatory DNA sequences that reduce the mutation frequencies in populations, as compared to other non-coding sequences which have a smaller effect on fitness and therefore lack these selective pressures (Strachan and Read 2004; Kryukov et al. 2005). While sequence variability for transcription factor-binding sites in promoter regions can vary greatly among different species, there tends to be some conservation of promoter sequences over broad phylogenetic groups (Florquin et al. 2005; Blanco et al. 2006; Doniger and Fay 2007; Eisermann et al. 2008). For example, high degrees of promoter conservation between species tend to be associated with genes for cell signaling, transcription factors, and genes that regulate development, like Slit2, and that lower degrees of conservation occur among the promoter regions of genes involved in metabolic processes (Chiba et al. 2008). It is clear from our genetic and phylogenetic data that the evolutionarily conserved Slit2 −399 to −296 bp promoter region is necessary for maximal transcription and that the Sp1 (−345 bp) site it critical for activity in all the species analyzed, as is shown here for the mouse Slit2 promoter (Fig. 5).

Electronic supplementary material

Below is the link to the electronic supplementary material.

Acknowledgments

This work was supported by the California State University Northridge Research, Scholarship, and Creative Activity Awards, the California State University Northridge Probationary Faculty Support Awards, the California State University Program for Education and Research in Biotechnology: CSU Faculty-Student Collaborative Research Seed Grant Program, an NIH R15 AREA award GM080683-01, and the California Institute for Regenerative Medicine CSUN-UCLA Bridges to Stem Cell Research Program Educational Enhancement: TB1-01183.

References

  1. Altschul SF, Gish W, Miller W et al (1990) Basic local alignment search tool. J Mol Biol. doi:10.1016/S0022-2836(05)80360-2 [DOI] [PubMed] [Google Scholar]
  2. Altschul SF, Madden TL, Schäffer AA et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Anantharaman A, Lin I-J, Barrow J et al (2011) Role of helix-loop-helix proteins during differentiation of erythroid cells. Mol Cell Biol 31:1332–1343. doi:10.1128/MCB.01186-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Anish R, Hossain MB, Jacobson RH, Takada S (2009) Characterization of transcription from TATA-less promoters: identification of a new core promoter element XCPE2 and analysis of factor requirements. PLoS One. doi:10.1371/journal.pone.0005103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Astuti D, Da Silva NF, Dallol A et al (2004) SLIT2 promoter methylation analysis in neuroblastoma, Wilms’ tumour and renal cell carcinoma. Br J Cancer 90:515–521. doi:10.1038/sj.bjc.6601447 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Ballard MS, Hinck L (2012) A roundabout way to cancer. Adv Cancer Res 114:187–235. doi:10.1016/B978-0-12-386503-8.00005-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bird AP (1980) DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res 8:1499–1504. doi:10.1093/nar/8.7.1499 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Blanco E, Messeguer X, Smith TF, Guigó R (2006) Transcription factor map alignment of promoter regions. PLoS Comput Biol 2:403–416. doi:10.1371/journal.pcbi.0020049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Blockus H, Chédotal A (2014) The multifaceted roles of Slits and Robos in cortical circuits: from proliferation to axon guidance and neurological diseases. Curr Opin Neurobiol 27:82–88. doi:10.1016/j.conb.2014.03.003 [DOI] [PubMed] [Google Scholar]
  10. Brandeis M, Frank D, Keshet I et al (1994) Sp1 elements protect a CpG island from de novo methylation. Nature 371:435–438. doi:10.1038/371435a0 [DOI] [PubMed] [Google Scholar]
  11. Brown NP, Leroy C, Sander C (1998) MView: a web-compatible database search or multiple alignment viewer. Bioinformatics 14:380–381. doi:10.1093/bioinformatics/14.4.380 [DOI] [PubMed] [Google Scholar]
  12. Carr CS, Sharp PA (1990) A helix-loop-helix protein related to the immunoglobulin E box-binding proteins. Mol Cell Biol 10:4384–4388. doi:10.1128/MCB.10.8.4384.Updated [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chen B, Hsu R, Li Z et al (2012) Upstream stimulatory factor 1 activates GATA5 expression through an E-box motif. Biochem J 446:89–98 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Cheng L, Jin Z, Liu L et al (2004) Characterization and promoter analysis of the mouse nestin gene. FEBS Lett 565:195–202. doi:10.1016/j.febslet.2004.03.097 [DOI] [PubMed] [Google Scholar]
  15. Chiba H, Yamashita R, Kinoshita K, Nakai K (2008) Weak correlation between sequence conservation in promoter regions and in protein-coding regions of human-mouse orthologous gene pairs. BMC Genom 9:152. doi:10.1186/1471-2164-9-152 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Dallol A, Da Silva NF, Viacava P et al (2002) SLIT2, a human homologue of the Drosophila Slit2 gene, has tumor suppressor activity and is frequently inactivated in lung and breast cancers. Cancer Res 62:5874–5880 [PubMed] [Google Scholar]
  17. Dallol A, Krex D, Hesson L et al (2003) Frequent epigenetic inactivation of the SLIT2 gene in gliomas. Oncogene 22:4611–4616. doi:10.1038/sj.onc.1206687 [DOI] [PubMed] [Google Scholar]
  18. De Bellard ME, Rao Y, Bronner-Fraser M (2003) Dual function of Slit2 in repulsion and enhanced migration of trunk, but not vagal, neural crest cells. J Cell Biol 162:269–279. doi:10.1083/jcb.200301041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Deaton AM, Bird A (2011) CpG islands and the regulation of transcription. Genes Dev 25:1010–1022. doi:10.1101/gad.2037511 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Doniger SW, Fay JC (2007) Frequent gain and loss of functional transcription factor binding sites. PLoS Comput Biol 3:0932–0942. doi:10.1371/journal.pcbi.0030099 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Efron B (1982) The jackknife, the bootstrap and other resampling plans. SIAM, Philadelphia [Google Scholar]
  22. Eisermann K, Tandon S, Bazarov A et al (2008) Evolutionary conservation of zinc finger transcription factor binding sites in promoters of genes co-expressed with WT1 in prostate cancer. BMC Genom 9:337. doi:10.1186/1471-2164-9-337 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Emami KH, Burke TW, Smale ST (1998) Sp1 activation of a TATA-less promoter requires a species-specific interaction involving transcription factor IID. Nucleic Acids Res 26:839–846. doi:10.1093/nar/26.3.839 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Erb I, van Nimwegen E (2011) Transcription factor binding site positioning in yeast: proximal promoter motifs characterize Tata-Less promoters. PLoS One. doi:10.1371/journal.pone.0024279 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Felsenstein J (1981) Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol 17:368–376. doi:10.1007/BF01734359 [DOI] [PubMed] [Google Scholar]
  26. Felsenstein J (1985) Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39:783–791. doi:10.2307/2408678 [DOI] [PubMed] [Google Scholar]
  27. Florquin K, Saeys Y, Degroeve S et al (2005) Large-scale structural analysis of the core promoter in mammalian and plant genomes. Nucleic Acids Res 33:4255–4264. doi:10.1093/nar/gki737 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. French SW, Malone CS, Shen RR et al (2003) Sp1 transactivation of the TCL1 oncogene. J Biol Chem 278:948–955. doi:10.1074/jbc.M207166200 [DOI] [PubMed] [Google Scholar]
  29. Futscher BW, Oshiro MM, Wozniak RJ et al (2002) Role for DNA methylation in the control of cell type specific maspin expression. Nat Genet 31:175–179. doi:10.1038/ng886 [DOI] [PubMed] [Google Scholar]
  30. Gagniuc PA, Ionescu-Tirgoviste C (2012) Eukaryotic genomes may exhibit up to 10 generic classes of gene promoters. BMC Genom 13:512 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Gardiner-Garden M, Frommer M (1987) CpG islands in vertebrate genomes. J Mol Biol 196:261–282. doi:10.1016/0022-2836(87)90689-9 [DOI] [PubMed] [Google Scholar]
  32. Ge Y, Jensen TL, Matherly LH, Taub JW (2003) Physical and functional interactions between USF and Sp1 proteins regulate human deoxycytidine kinase promoter activity. J Biol Chem 278:49901–49910. doi:10.1074/jbc.M305085200 [DOI] [PubMed] [Google Scholar]
  33. Giovannone D, Reyes M, Reyes R et al (2012) Slits affect the timely migration of neural crest cells via robo receptor. Dev Dyn 241:1274–1288. doi:10.1002/dvdy.23817 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Gregor PD, Sawadogo M, Roeder RG (1990) The adenovirus major late transcription factor USF is a member of the helix-loop-helix group of regulatory proteins and binds to DNA as a dimer. Genes Dev 4:1730–1740. doi:10.1101/gad.4.10.1730 [DOI] [PubMed] [Google Scholar]
  35. Guindon S, Gascuel O (2003) A simple, fast, and accurate method to estimate large phylogenies by maximum likelihood. Syst Biol 52:696–704. doi:10.1080/10635150390235520 [DOI] [PubMed] [Google Scholar]
  36. Huang G, Wen Q, Zhao Y et al (2013) NF-κB plays a key role in inducing CD274 expression in human monocytes after lipopolysaccharide treatment. PLoS One. doi:10.1371/journal.pone.0061602 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Huelsenbeck JP, Ronquist F (2001) MRBAYES: bayesian inference of phylogeny. Bioinformatics 17:754–755. doi:10.1093/bioinformatics/17.8.754 [DOI] [PubMed] [Google Scholar]
  38. Jiang JG, Bell A, Liu Y, Zarnegar R (1997) Transcriptional regulation of the hepatocyte growth factor gene by the nuclear receptors chicken ovalbumin upstream promoter transcription factor and estrogen receptor. J Biol Chem 272:3928–3934. doi:10.1074/jbc.272.7.3928 [DOI] [PubMed] [Google Scholar]
  39. Jones S (2004) An overview of the basic helix-loop-helix proteins. Genome Biol 5:226. doi:10.1186/gb-2004-5-6-226 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Juven-Gershon T, Kadonaga JT (2010) Regulation of gene expression via the core promoter and the basal transcriptional machinery. Dev Biol 339:225–229 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Juven-Gershon T, Juven-Gershon T, Hsu J-Y et al (2008) The RNA polymerase II core promoter—the gateway to transcription. Curr Opin Cell Biol 20:253–259. doi:10.1016/j.ceb.2008.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Kidd T, Bland KS, Goodman CS (1999) Slit is the midline repellent for the robo receptor in Drosophila. Cell 96:785–794. doi:10.1016/S0092-8674(00)80589-9 [DOI] [PubMed] [Google Scholar]
  43. Kryukov GV, Schmidt S, Sunyaev S (2005) Small fitness effect of mutations in highly conserved non-coding regions. Hum Mol Genet 14:2221–2229. doi:10.1093/hmg/ddi226 [DOI] [PubMed] [Google Scholar]
  44. Malone CS, Patrone L, Buchanan KL et al (2000) An upstream oct-1- and oct-2-binding silencer governs B29 (Ig) gene expression. J Immunol 164:2550–2556. doi:10.4049/jimmunol.164.5.2550 [DOI] [PubMed] [Google Scholar]
  45. Malone CS, Omori SA, Gangadharan D, Wall R (2001) Leukocyte-specific expression of the pp52 (LSP1) promoter is controlled by the cis-acting pp52 silencer and anti-silencer elements. Gene 268:9–16. doi:10.1016/S0378-1119(01)00430-9 [DOI] [PubMed] [Google Scholar]
  46. Malone CS, Kuraishy AI, Fike FM et al (2006) B29 gene silencing in pituitary cells is regulated by its 3′ enhancer. J Mol Biol 362:173–183. doi:10.1016/j.jmb.2006.07.046 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Marlow R, Strickland P, Ji SL et al (2008) SLITs suppress tumor growth in vivo by silencing Sdf1/Cxcr4 within breast epithelium. Cancer Res 68:7819–7827. doi:10.1158/0008-5472.CAN-08-1357 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Meisterernst M, Roy AL, Lieu HM, Roeder RG (1991) Activation of class II gene transcription by regulatory factors is potentiated by a novel activity. Cell 66:981–993 [DOI] [PubMed] [Google Scholar]
  49. Paonessa F, Latifi S, Scarongella H et al (2013) Specificity protein 1 (Sp1)-dependent activation of the synapsin I gene (SYN1) is modulated by RE1-silencing transcription factor (REST) and 5′-cytosine-phosphoguanine (CPG) methylation. J Biol Chem 288:3227–3239. doi:10.1074/jbc.M112.399782 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Posada D, Buckley TR (2004) Model selection and model averaging in phylogenetics: advantages of akaike information criterion and bayesian approaches over likelihood ratio tests. Syst Biol 53:793–808. doi:10.1080/10635150490522304 [DOI] [PubMed] [Google Scholar]
  51. Roy AL, Meisterernst M, Pognonec P, Roeder RG (1991) Cooperative interaction of an initiator-binding transcription initiation factor and the helix-loop-helix activator USF. Nature 354:245–248. doi:10.1038/354245a0 [DOI] [PubMed] [Google Scholar]
  52. Sims SH, Cha Y, Romine MF et al (1993) A novel interferon-inducible domain: structural and functional analysis of the human interferon regulatory factor 1 gene promoter. Mol Cell Biol 13:690–702 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Solomon SS, Majumdar G, Martinez-Hernandez A, Raghow R (2008) A critical role of Sp1 transcription factor in regulating gene expression in response to insulin and other hormones. Life Sci 83:305–312 [DOI] [PubMed] [Google Scholar]
  54. Song J, Ugai H, Ogawa K et al (2001) Two consecutive zinc fingers in Sp1 and in MAZ are essential for interactions with cis-elements. J Biol Chem 276:30429–30434. doi:10.1074/jbc.M103968200 [DOI] [PubMed] [Google Scholar]
  55. Strachan T, Read AP (2004) Chapter 9: organization of the human genome. In: Human molecular genetics, Garland, New York, 3–10
  56. Suzuki Y, Tsunoda T, Sese J et al (2001) Identification and characterization of the potential promoter regions of 1031 kinds of human genes. Genome Res 11:677–684. doi:10.1101/gr.GR-1640R [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Tamaki T, Ohnishi K, Hartl C et al (1995) Characterization of a GC-rich region containing Sp1 binding site(s) as a constitutive responsive element of the α2(I) collagen gene in human fibroblasts. J Biol Chem 270:4299–4304. doi:10.1074/jbc.270.9.4299 [DOI] [PubMed] [Google Scholar]
  58. Tamura K (1992) Estimation of the number of nucleotide substitutions when there are strong transition-transversion and G+C-content biases. Mol Biol Evol 9:678–687 [DOI] [PubMed] [Google Scholar]
  59. Thiesen HJ, Bach C (1990) Target Detection Assay (TDA): a versatile procedure to determine DNA binding sites as demonstrated on SP1 protein. Nucleic Acids Res 18:3203–3209 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Wang B, Xiao Y, Ding BB et al (2003) Induction of tumor angiogenesis by Slit-Robo signaling and inhibition of cancer growth by blocking Robo activity. Cancer Cell 4:19–29. doi:10.1016/S1535-6108(03)00164-8 [DOI] [PubMed] [Google Scholar]
  61. Workman JL, Roeder RG, Kingston RE (1990) An upstream transcription factor, USF (MLTF), facilitates the formation of preinitiation complexes during in vitro chromatin assembly. EMBO J 9:1299–1308 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Xu H-T, Yuan X, Guan C et al (2004) Calcium signaling in chemorepellant Slit2-dependent regulation of neuronal migration. Proc Natl Acad Sci USA 101:4296–4301. doi:10.1073/pnas.0303893101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Ypsilanti AR, Zagar Y, Chedotal A (2010) Moving away from the midline: new developments for Slit and Robo. Development 137:1939–1952. doi:10.1242/dev.044511 [DOI] [PubMed] [Google Scholar]
  64. Zhang J, Madden TL (1997) PowerBLAST: a new network BLAST application for interactive or automated sequence analysis and annotation. Genome Res 7:649–656. http://www.ncbi.nih.gov/entrez/query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=9199938 [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Cellular and Molecular Neurobiology are provided here courtesy of Springer

RESOURCES