Table 2. Major elements in the 5′UTR and the 300 bp promoter region.
Element | Sequence a | Location b,c | Putative function |
---|---|---|---|
ACGT Sequence | ACGT | -120(+,-) | ACGT sequence (from -155 to -152) required for etiolation-induced expression of erd1 (early responsive to dehydration) in Arabidopsis [37]. |
ARR1AT | NGATT | -257(+), 36(+) | "ARR1-binding element" found in Arabidopsis; required for transcriptional activation in response to cytokinin [38]. |
CACTFT PPCA1 | YACT | -36(+), -21(+), -137(+), -10(+), -154(-), -139(-), -113(-), -88(-), 9(+), 28(+), 66(+) | The tetranucleotide CACT is a key component of Mem1 (mesophyll expression module 1), which directs mesophyll-specific gene expression [19]. |
CARE element | CAACTC | -203(-) | CAREs (CAACTC regulatory elements) are required for GA-inducible expression of hydrolase genes in germinating seeds [39]. |
CARGCW8GAT | CWWWWWWWWG | -220(+,-) | A variant of the CArG motif with a longer A/T-rich core; a preferential binding site for the transcriptional regulator AGL15, which accumulates during embryo development [17]. |
CCAAT BOX1 | CCAAT | -72(-), 38(-) | Common sequence found in the 5'-non-coding regions of eukaryotic genes, which is involved in increasing promoter activity [40]. |
CPB Sequence | TATTAG | -216(+) | The sequence is critical for Cytokinin-enhanced Protein Binding in vitro [27]. |
CURE CORE | GTAC | -138(+,-), -118(+,-) | Copper-response element, also involved in oxygen-response of some genes [41]. |
DOF CORE | AAAG | -43(-), 49(-), 62(-) | Core site is required for binding of Dof proteins, which may be associated with the plant-specific pathway for carbon metabolism in maize [42]. |
DPBF CORE | ACACNNG | -115(-) | The binding core sequence of the bZIP transcription factors DPBF-1 and DPBF-2 (Dc3 promoter-binding factors 1 and 2); involved in embryo-specific expression and in the response to ABA [31]. |
E2F CONSENSUS | WTTSSCSS | -72(+) | Consensus sequence of the different E2F-DP-binding motifs, which are involved in cell cycle regulation, DNA replication, and chromatin dynamics [43]. |
E BOX | CANNTG | -170(+,-), -115(+,-), -21(+,-) | A cis-element found in the promoter regions of most genes encoding storage proteins [18]. |
ERE Motif | AWTTCAAA | -253(-) | Ethylene-responsive element that mediates ethylene-induced transcription [28]. |
GATA BOX | GATA | -297(+), -235(+), -165(+), -29(-) | Required for high-level, light-regulated, and tissue-specific expression [23]. |
GT1 CONSENSUS | GRWAAW | -235(+), -130(+), 50(-), 73(-) | Consensus GT-1 binding site in the promoter regions of many light-regulated genes [24]. |
GTGA Motif | GTGA | -193(+), -132(+) | "GTGA motif" found in the promoters of the tobacco late pollen gene g10 and the tomato gene lat56; required for gene expression in pollen [44]. |
I BOX CORE | GATAA | -235(+) | Conserved sequence upstream of light-regulated genes of both monocots and dicots. |
POLLEN1 LELAT52 | AGAAA | -285(-), 75(-) | One of two co-dependent regulatory elements (AGAAA and TCCACCATA) responsible for pollen-specific gene activation [21]. |
RAV1A AT | CAACA | -243(-) | Binding consensus sequence of the Arabidopsis transcription factor RAV1, which is expressed at relatively high levels in rosette leaves and roots [32]. |
ROOT MOTIF | ATATT | -294(+), -217(+), -190(-) | Motif found in the promoter of rolD, which is strongly expressed in roots [20]. |
SEF4 MOTIF | RTTTTTR | -248(+) | Binding site of SEF4, one of the soybean embryo factors (SEFs) [15]. |
SORLIP1 AT | GCCAC | -23(+) | One of the "Sequences Over-Represented in Light-Induced Promoters" (SORLIPs) in Arabidopsis; involved in phyA-regulated gene expression [45]. |
TAAAG Motif | TAAAG | -233(+), 49(-) | TAAAG motif controls guard cell-specific gene expression [46]. |
WRKY71 OS | TGAC | -81(+), -96(-) | Core of the TGAC-containing W-box; binding site of rice WRKY71, which acts as a transcriptional repressor in the gibberellin signaling pathway or in the regulation of pathogenesis-related genes [25]. |
a N = G/A/C/T; R = A/G; S = C/G; W = A/T; Y = T/C.
b The symbol ‘+’ or ‘-’ in parentheses indicates the DNA strand on which the element is located.
c A positive number indicates a location in the 5′UTR; a negative number indicates a location in the promoter.
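
The notation in footnotes a to c can be illustrated with a minimal Python sketch; it is not the tool used to build Table 2. It expands the degenerate codes of footnote a into a regular expression, scans both strands, and reports positions relative to the start of the 5′UTR. The function names (`scan`, `to_relative`, and so on) are hypothetical. The coordinate convention is also an assumption: the first 5′UTR base is taken as +1, the base just upstream as -1, and a minus-strand hit is reported at its leftmost plus-strand coordinate.

```python
import re

# Degenerate codes from footnote a, plus the four plain bases.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "N": "[ACGT]", "R": "[AG]", "S": "[CG]", "W": "[AT]", "Y": "[CT]"}

# Complement of each code (R <-> Y; S, W and N are self-complementary).
COMPLEMENT = str.maketrans("ACGTNRSWY", "TGCANYSWR")


def reverse_complement(motif):
    """Reverse complement of a motif written with the codes above."""
    return motif.translate(COMPLEMENT)[::-1]


def motif_to_regex(motif):
    """Expand degenerate codes; the lookahead lets overlapping hits match."""
    return re.compile("(?=" + "".join(IUPAC[base] for base in motif) + ")")


def to_relative(index, promoter_len):
    """Map a 0-based index to a signed position (assumed: no position 0)."""
    offset = index - promoter_len
    return offset + 1 if offset >= 0 else offset


def scan(seq, motif, promoter_len):
    """Return sorted (position, strand) hits for a motif on both strands."""
    seq = seq.upper()
    hits = set()
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        for match in motif_to_regex(pattern).finditer(seq):
            hits.add((to_relative(match.start(), promoter_len), strand))
    return sorted(hits)
```

Under these assumptions, a call such as `scan(sequence, "ACGT", promoter_len=300)` on a promoter-plus-5′UTR sequence returns one entry per strand for a palindromic motif, which corresponds to the `(+,-)` notation used in the Location column.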