Table 8.
String search (dna-pattern) |
Matrix analysis (patser) |
Highest scoring ARE |
||||||||
---|---|---|---|---|---|---|---|---|---|---|
SwissProt acc. no. | Protein name | Fold-change | Number of consensus sequences (RTGABNNNGCA) | Number of matching sequences | Highest score | Mean score | SD | Location from to | Sequence | |
P02762 | Major urinary protein 6 | 0.35 | 0 | 14 | 4.89 | 2.03 | 1.07 | −1935 | −1923 | ttccCTGTCACTAAGCAtgtt |
P10649 | Glutathione S-transferase Mu 1 | 0.41 | 4 | 15 | 4.40 | 2.42 | 1.09 | −56 | −44 | gtggGCAGGACAAAACAgcgg |
P19157 | Glutathione S-transferase P 1 | 0.54 | 0 | 13 | 4.02 | 2.11 | 0.98 | −68 | −56 | aacgTGTTGAGTCAGCAtccg |
Q91WG8 | Bifunctional UDP-N-acetylglucosamine 2-epimerase/N-acetylmannosamine kinase | 0.55 | 0 | 12 | 5.95 | 2.50 | 1.70 | −387 | −375 | gcagGGGTGGCAAAGCTtaaa |
P17717 | UDP-glucuronosyltransferase 2B5 | 0.55 | 1 | 13 | 5.59 | 2.40 | 1.23 | −398 | −386 | cagtCCATGACTGAGTTtgaa |
Q99P30 | Peroxisomal coenzyme A diphosphatase NUDT7 | 1.41 | 1 | 8 | 4.68 | 2.49 | 1.14 | −848 | −836 | caagGCATTACACAGCCcagg |
Q8JZR0 | Long-chain-fatty-acid-CoA ligase 5 | 1.57 | 1 | 10 | 7.66 | 2.56 | 1.90 | −1234 | −1222 | cttaGAATGACCCAGCCcttg |
Q91V92 | ATP-citrate synthase | 1.75 | 1 | 9 | 10.02 | 3.26 | 2.58 | −1899 | −1887 | agaaAAATGACTAAGCAggta |
Q8VCH0 | 3-ketoacyl-CoA thiolase B, peroxisomal | 2.21 | 2 | 15 | 5.84 | 2.55 | 1.44 | −137 | −125 | tgggGGAAGACTCAGGAagag |
Q05816 | Fatty acid-binding protein, epidermal | 2.81 | 0 | 15 | 4.37 | 2.59 | 0.86 | −1728 | −1716 | agtgGGATGTCGCAGCTcagg |
Mean values for all Nrf2-regulated proteins | 1.26 | 1.25 | 13.69 | 5.62 | 2.50 | 1.33 | ||||
Mean values for all down-regulated Nrf2-dependent proteins | 0.57 | 1.00 | 15.40 | 5.20 | 2.54 | 1.23 | ||||
Mean values for all up-regulated Nrf2-dependent proteins | 1.57 | 1.36 | 12.91 | 5.81 | 2.49 | 1.37 | ||||
Mean values for all proteins identified | 1.21 | 13.20 | 6.48 | 2.03 | 1.62 |
Sequences of the genes of Nrf2-regulated proteins were obtained from the ENSMBL mouse genome database and interrogated for ARE and ARE/like consensus sequences using the RSAT analysis software (http://rsat.ulb.ac.be/rsat/). Both string-based (dna-pattern) and matrix-based (patser) pattern searching strategies were adopted (see text for details). For the dna-pattern analysis, returned sequences were rated against the ‘perfect’ consensus sequence RTGABNNNGCA. For the patser analysis, the number of sequences matching the position specific scoring matrix with a score > 1 are given, along with the highest score attained. For comparison, equivalent data from the entire set of identified proteins is included at the foot of the table.