Abstract
The dataset presented in this article is related to the research article entitled “One-pot, two-step transaminase and transketolase synthesis of l-gluco-heptulose from l-arabinose” (Bawn et al., 2018 in press) [1]. This article presents data on initial experiments that were carried out to investigate new thermostable transketolase (TK) activities with l-arabinose. Transaminase (TAm) sequences from an in-house library of thermophilic strains were analyzed to compare homologies to characterized TAms with desired activity. DNA and amino acid sequences are presented for all the enzymes investigated. Calibration curves for products of the TK and TAm reactions are also presented along with chromatographic analysis of the various one-pot reactions.
Specifications Table
Subject area | Biology |
More specific subject area | Biocatalysis |
Type of data | Tables, text file, figures |
How data was acquired | Experiments/in vitro assays and high-performance anion-exchange chromatography with pulsed amperometric detection (HPAEC-PAD) |
Data format | Analyzed and tabulated |
Experimental factors | All enzymes and substrates were freshly prepared before use |
Experimental features | Experiments were carried out in triplicate |
Data source location | United Kingdom, London, University College London (UCL) |
Data accessibility | The data are accessible only within this article |
Value of the data
- The data presented in this article give new insight into thermostable enzyme activities that have not been published before.
- The data provide the rationale for selecting the TKs and TAms used in the one-pot reaction.
- The product of the one-pot reaction, l-gluco-heptulose, is a pharmaceutically relevant compound.
1. Data
l-Arabinose is a major monosaccharide of sugar beet pulp (SBP), a by-product of sucrose extraction that is currently sold as a low-value animal feed [1]. The main focus of this work was to create a value-added product from the monosaccharides that make up SBP via enzymatic routes. Building on previous work [2], [3], the present study produces l-gluco-heptulose, a high-value, pharmaceutically relevant compound, from l-arabinose using a two-step thermostable enzyme cascade. A thermostable TK catalyzed the synthesis of l-gluco-heptulose from l-arabinose and β-hydroxypyruvate (HPA), the latter being produced in situ from l-serine and α-ketoglutaric acid by a thermostable TAm.
Table 1 lists the thermostable TKs used and whether the Seliwanoff assay [4] showed them to be active towards l-arabinose. Table 2 describes the TAms investigated and compares their sequence homologies to TAms previously shown to have the activities required for this reaction. Examples of HPAEC-PAD traces (Fig. 2, Fig. 3) demonstrate how the TK and TAm one-pot reactions were monitored for the presence of l-gluco-heptulose.
Table 1.
TK Strain | UniProtKB accession code | Plasmid name and abbreviated name | Active towards l-arabinose from Seliwanoff assay |
---|---|---|---|
Deinococcus geothermalis DSM 11300 | Q1IW07 | pQR1758 (TKDgeo) | ✓ |
Deinococcus radiodurans DSM 20539 | Q9RS71 | pQR1759 (TKDrad) | ✓ |
Geobacillus stearothermophilus DSM 22 | KFL15812.1 | pQR1743 | ✓ |
Thermobifida fusca strain YX | Q47ND4 | pQR1744 | ✗ |
Thermotoga maritima DSM 3109 | Q9X283 | pQR1745 | ✗ |
Table 2.
TAm Strain | UniProtKB accession code | Plasmid name and abbreviated name | Homology to DGEO_0713, Deinococcus geothermalis (%) | Homology to SPAT, Sulfolobus solfataricus (%) | Homology to CV2025, Chromobacterium violaceum (%) |
---|---|---|---|---|---|
Deinococcus radiodurans DSM 20539 | Q9RWP3 | pQR1746 | 78 | 38 | 31 |
Geobacillus stearothermophilus DSM 22 | Q59228 | pQR1756 (TAmGste) | 25 | 38 | 40 |
Thermobifida fusca strain YX | Q47LH8 | pQR1748 | 26 | 37 | 41 |
Thermotoga maritima DSM 3109 | G4FE93 | pQR1749 | 30 | 58 | 31 |
Deinococcus geothermalis DSM 11300 | Q1IZC2 | pQR1757 (TAmDgeo) | 29 | 29 | 44 |
Xanthomonas campestris pv. campestris DSM 3586 | Q8PDQ2 | pQR1751 | 28 | 25 | 30 |
Thermotoga maritima DSM 3109 | Q9X1C0 | pQR1752 | 33 | 34 | 36 |
Pectobacterium carotovorum subsp. carotovorum DSM 30168 | A0A0B3YSH6 | pQR1755 | 24 | 31 | 33 |
2. Experimental design, materials and methods
2.1. TK activity
Thermostable TKs were cloned and subsequently expressed in E. coli BL21(DE3). Cell lysates were used to determine activity towards l-arabinose using the colorimetric Seliwanoff assay, which distinguishes between ketoses and aldoses using 6 M HCl and resorcinol (Seliwanoff's reagent) [4]. After 24 h incubation of enzyme and l-arabinose, Seliwanoff's reagent was added to the reaction and the mixture was heated at 100 °C. Colour formation due to the presence of the ketose, l-gluco-heptulose, was observed within 15 min (Table 1).
2.2. TAm sequence analysis
TAm sequences were obtained from the NCBI database [5] and the UniProt Knowledgebase (UniProtKB) [6] and then aligned using Clustal W [7].
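The homology percentages in Table 2 come from alignments of this kind. As an illustration only, the sketch below computes percent identity for one pair of already-aligned sequences. It assumes identity is counted over columns where neither sequence has a gap; the article does not state which definition was used, and Clustal W's own pairwise score may differ. The function name `percent_identity` and the toy sequences are hypothetical and are not taken from the dataset.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two rows of a pairwise alignment.

    Assumption: only columns where neither row has a gap ('-') are compared.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Made-up toy alignment, for illustration only.
    toy_a = "MKLAKR-VASL"
    toy_b = "MTLAKRAVSSL"
    print(f"{percent_identity(toy_a, toy_b):.0f}% identity")
```

Other conventions, such as dividing by the full alignment length or by the shorter sequence length, give different percentages, so values from this sketch need not reproduce Table 2.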
2.3. Product analysis using HPAEC-PAD
Quantitative analysis of l-arabinose, l-gluco-heptulose and HPA was performed using HPAEC-PAD (ICS 5000+, Dionex) equipped with a Dionex Aminopac™ PA1 anion exchange column (4 × 250 mm) fitted with a Dionex Aminopac™ PA1 guard column (4 × 50 mm), an electrochemical detector system, and an eluent generator with a KOH 500 cartridge. The elution times of each compound are shown in Fig. 1. Fig. 2 and Fig. 3 are examples of one-pot reaction analyses with various TKs and TAmDgeo. Standard calibration curves of l-gluco-heptulose and HPA were used for quantification (Fig. 4, Fig. 5).
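Quantification against a standard calibration curve can be sketched as follows. This is a minimal illustration that assumes a linear detector response fitted by ordinary least squares; the article does not describe the fitting procedure. The function names and the standard concentrations and peak areas are hypothetical placeholders, not values from Fig. 4 or Fig. 5.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def concentration_from_area(area, slope, intercept):
    """Invert the calibration line to estimate concentration from a peak area."""
    return (area - intercept) / slope


if __name__ == "__main__":
    # Hypothetical standards: concentration (mM) versus PAD peak area (arbitrary units).
    standards_mM = [0.0, 1.0, 2.0, 4.0]
    peak_areas = [0.5, 2.5, 4.5, 8.5]
    slope, intercept = fit_line(standards_mM, peak_areas)
    unknown_area = 6.5  # hypothetical sample peak area
    estimate = concentration_from_area(unknown_area, slope, intercept)
    print(f"slope={slope:.3f}, intercept={intercept:.3f}, estimate={estimate:.3f} mM")
```

Samples whose peak areas fall outside the range of the standards would normally be diluted and re-run rather than extrapolated.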
2.4. TK DNA/amino acid sequences
DNA sequences were retrieved from the NCBI database [5] and amino acid sequences were obtained through the UniProtKB [6].
>TKDgeo
ATGAGTCCCGAACAGCAGGCCGTGCGTCAGGATGTCGATCAGCTGAGCATCAACACCATCAGGACGCTTGCCATCGATGCGGTGCAGCGGGCCAACAGTGGCCACCCCGGCGCGCCGCTCGGCATGGCCCCGATGGGCTACGTGCTGTGGCAGCGCTTCCTGCGCCACAATCCGAAACATCCCGAGTGGCCGGGCCGCGACCGCTTCGTGCTGTCGGCAGGGCACGCCAGCATGCTGATCTACTCGCTGCTGCACCTCACCGGCTACGACCTGCCGCTGGAGGACATCAAGAACTTCCGCCAGTGGGGCAGCAAGACGCCTGGGCATCCCGAGTTCTTCCACACCCCAGGCCTAGACGCCACCACCGGCCCGCTCGGTCAGGGTGCGGCGATGACGGTGGGCATGGCGATGGCCGAAGCGCACCTCGCCGCACGCTACAACCGCCCCGGCTTCAAGGTCTTTGACAACTACACCTACGCGATCTTGGGGGACGGCGACCTGCAAGAAGGCGTCAACCACGAGGCCGCGTCGCTGGCAGGGCACCTCAAGCTGGGCAAGCTGATCTGGCTGCACGACGACAACCAGGTGCAGCTGGACACCGCCACGTTCAAGGCGGCCAACGAGGATACTGCGGAGCGTTACCGCGCCTACGGCTGGGAAGTTCTGCGTGTGCAAGACGGCAACAATCTCACGGAGATCGAGAACGCGATCCGCCAGGCACGGATGAACACCGAGCAGCCCACCCTGATCCAGGTTCGCACGGTGATCGGCTTCGGCAGTCCCCGTGCGGGCACCAGCAAGGCGCACGGCGAGCCGCTGGGCGAGGAAGGCGTGCAGGAGACCAAGGCGGCCCTGGGCTGGGACTACCCGCCCTTCACGGTGCCCGACGAGGTCAAGGCGCATATGGACGCGACTGAGCGTGGCGCGGAGTGGGAGGCCGACTGGAACGCGCTGATGGAGCGCTACCGTGCCGAGTACCCCGATCTCGCGGCGGAGGTTGACGCGCTGCTGGCGCGCGAACTGCCCGCCAATCTCGCCGAAGTGCTCCCCTCCTACGAAGTGGGCAGCAAGGCCATCGCCACCCGCAACGCGAGCGGTGAAGTCATCAATGCGCTGGCGCAGGTGGTGCCGGGCCTGATGGGGGGCAGTGCGGATCTCTCCGGCAGCACCAAAACCACCATCAAGGACGGCGGCGAGTTTCTGCCAGGAAACTACGGGGGCCGCAACGTCTACTTTGGCGTCCGCGAGTTTGGGATGGCCGCAGCGGGCAATGGCCTTTCGCTCTACGGAGGTGTTCGGCCCCTGGTGGGGACCTTCCTGGTGTTTGCGGACTACCTCAAGCCCGCCTTCCGCCTCTCCGCCCTTCAGTTCCAGCCGGTTACCTATGTCCTGACCCATGACTCCATTGGCCTGGGCGAAGACGGCCCAACCCACCAGCCTATTGAGCAGCTCGCCATGCTGCGCGCCGTGCCGGGTGCCCACGTGATTCGCCCCGCCGACGCCAACGAGACGGCGGCGGCCTGGCAGATGGCGCTGGAGTACGACAAGGGACCAACCGCTCTGGCCCTCTCCCGCCAGGATCTCCCAGTGCTGCCCCGCAACCACGCGGGCGTGAAGAAGGGCGCCTACGTGGTTCGCGACGCCGAAGGGGGGCCGGCACAGATCATCTTGATCGCCACCGGCTCGGAGGTGAGCCTGGCGCTGGATGCTGCCCAAGCGCTGGCGGAGGAAGGCATCCAGGCTCGGGTCGTCTCAATGCCCTGCATGGAAGTCTTCCGCCAGCAGGACGCCAGTTATCGGGACAGCGTGCTCACCCCCGGCGTGAAACGCGTGGCCATCGAGGCTGCCAGCCCGCTCCCCTGGTATGAGTGGGTGGGCTTTGACGGCGCGGTGATCGGAATGACCACCTTTGGCGCCTCGGCCCCAGCCAAAGTCCTCTTTGAGAAATTCGGCTTCAACGTGCCGAACGTCGTGCAGGTCGTCAAGGGCGTTTTGCAGAG
GTGA
>TKDgeo
MSPEQQAVRQDVDQLSINTIRTLAIDAVQRANSGHPGAPLGMAPMGYVLWQRFLRHNPKHPEWPGRDRFVLSAGHASMLIYSLLHLTGYDLPLEDIKNFRQWGSKTPGHPEFFHTPGLDATTGPLGQGAAMTVGMAMAEAHLAARYNRPGFKVFDNYTYAILGDGDLQEGVNHEAASLAGHLKLGKLIWLHDDNQVQLDTATFKAANEDTAERYRAYGWEVLRVQDGNNLTEIENAIRQARMNTEQPTLIQVRTVIGFGSPRAGTSKAHGEPLGEEGVQETKAALGWDYPPFTVPDEVKAHMDATERGAEWEADWNALMERYRAEYPDLAAEVDALLARELPANLAEVLPSYEVGSKAIATRNASGEVINALAQVVPGLMGGSADLSGSTKTTIKDGGEFLPGNYGGRNVYFGVREFGMAAAGNGLSLYGGVRPLVGTFLVFADYLKPAFRLSALQFQPVTYVLTHDSIGLGEDGPTHQPIEQLAMLRAVPGAHVIRPADANETAAAWQMALEYDKGPTALALSRQDLPVLPRNHAGVKKGAYVVRDAEGGPAQIILIATGSEVSLALDAAQALAEEGIQARVVSMPCMEVFRQQDASYRDSVLTPGVKRVAIEAASPLPWYEWVGFDGAVIGMTTFGASAPAKVLFEKFGFNVPNVVQVVKGVLQR
>TKDrad
ATGACAGACCAGAGCGTTTCCCAAAACGTGGCGCGGCTGAGTGTGAACACCATTCGCACGCTCGCCATTGACGCGGTGCAGGCCGCCAACTCGGGCCACCCCGGTGCGCCGCTGGGCATGGCCCCGATGGGCTACGTGCTGTGGCACAAGTTCCTGCGCCACAACCCCGCGCACCCTGAGTGGCCGGGCCGCGACCGCTTCGTGCTGTCGGCGGGGCACGCCTCCATGCTGATCTACAGCCTGCTGCACCTGACCGGCTACCAGGAAATGACGCTCGACGACCTGCGCCACTTCCGGCAGTGGGGCTACCACACCCCCGGCCACCCCGAGTTTTTCCACACCAAGGGTCTGGACGCGACCACCGGCCCGCTTGGGCAGGGCGCGGCGATGACGGTGGGCATGGCGATGGCCGAAGCACACCTCGCCGCCCGCTACAACCGCGAAGGCTTTCCGATTTTCGACAACCGCACCTACGCCATCATGGGCGACGGCGATCTGCAAGAAGGCATCAACCACGAAGCCGCCGCGCTCGCCGGGCACCTGAAACTCGGCAAGCTGATCTGGCTGCACGACGACAACCACATCCAGCTCGACACGCCCACGAACAAGGCCGAGTCCGAGGACACCGCCGCCCGCTTCCGCGCCTATGGCTGGAACGTGCTGAAGGTGGAAGACGGCGACAATCTGGACGAAATTGAAAAGGCGATTGCCGAGGCCCGCAGCCAAAGCGAGCGGCCCACGCTGATTCAGGTGCGCACCATCATCGGCTTCGGCAGCCCGCGCGCCGGCACGAGCAAGGCGCACGGCGAGCCGCTCGGCGAAGAGGGCGTGGCCGAGACGAAGAAAGCGTTGGGCTGGGAGTACCCCGCTTTTACCGTGCCCGACGAAGTGGCTGCGCACATGAACGCTCGCGCTAAGGGTGCTCAACTCGAAGCCGACTGGGAAAAACTGATGGCCGACTACCGCACCGCGCACCCCGACCTCGGCAAGGAAGTGGACGCGCTGCTCGCCCGTGAACTGCCCGCCAACCTCGCCGACCTGCTGCCCAAGTACGAAGTCGGCGGCAAGGCGGCGGCCACCCGCAACGCGAGCGGCGAAGTCATCAACGCGCTGGCGAAGGTGCTTCCCGGTTTGATGGGCGGCAGCGCGGACCTCTCGGGCTCGACCAAGACCACCATCAAGGACGGCGGCGAGATGGAAGCGGGCACCATGGGCGGGCGCAACGTGCTGTTCGGCGTGCGCGAGTTCGGCATGAGCGCCGCGGGCAATGGCCTGAGCCTCTACGGCGGCCTGCACCCGATGGTAGGCACCTTCCTGGTATTCGCCGACTACCTCAAGCCGGCTTTCCGCCTCTCGGCGCTGCAAATGCAGCCGGTGACTTACGTGCTGACCCACGACTCCATCGGTCTGGGCGAAGACGGGCCGACCCACCAGCCGGTGGACCAGCTCGCCATGCTGCGAGCGGTGCCGGGCGCCCACGTCATTCGCCCCGCCGACGCCAACGAAACCGCCGCCGCGTGGCTGATGGCGCTGGAATACGACAAGGGCCCCACCGCGCTCGCCCTCTCGCGCCAGGATCTGCCGATTCTGCCCGCCAACATCGAAGGCGTGAAGAAGGGCGCGTATGTCCTCCGAGATGTGGACGGTGCCGATGGTCAGGGGGCTCAAGTCATCCTGATC
GCCAGCGGCTCGGAAGTCGCCCTGGCCCTGAGCAGCGCCGAGCGGCTGGCCGAAGAGGGCGTGCAGGCCCGCGTGGTGTCCATGCCGTGCATGGAGGTCTTTCGCCAGCAGGAGCAGAGCTACCGCGACAGCGTGCTGACCCCCGGCGTGAAGCGCGTCGCCATCGAGGCCGCCAGCCCGCAGCCCTGGTACGAGTGGACGCTCGGCGGCCCAGTCATCGGCATGACGACCTTCGGTGCGTCGGCCCCGGCCAAGGTGCTGTTTGAGAAGTTCGGCTTCAGCGTGGAAAACGTGGTGAAGGTGGTCCACTCCGTGCTGTAA
>TKDrad
MTDQSVSQNVARLSVNTIRTLAIDAVQAANSGHPGAPLGMAPMGYVLWHKFLRHNPAHPEWPGRDRFVLSAGHASMLIYSLLHLTGYQEMTLDDLRHFRQWGYHTPGHPEFFHTKGLDATTGPLGQGAAMTVGMAMAEAHLAARYNREGFPIFDNRTYAIMGDGDLQEGINHEAAALAGHLKLGKLIWLHDDNHIQLDTPTNKAESEDTAARFRAYGWNVLKVEDGDNLDEIEKAIAEARSQSERPTLIQVRTIIGFGSPRAGTSKAHGEPLGEEGVAETKKALGWEYPAFTVPDEVAAHMNARAKGAQLEADWEKLMADYRTAHPDLGKEVDALLARELPANLADLLPKYEVGGKAAATRNASGEVINALAKVLPGLMGGSADLSGSTKTTIKDGGEMEAGTMGGRNVLFGVREFGMSAAGNGLSLYGGLHPMVGTFLVFADYLKPAFRLSALQMQPVTYVLTHDSIGLGEDGPTHQPVDQLAMLRAVPGAHVIRPADANETAAAWLMALEYDKGPTALALSRQDLPILPANIEGVKKGAYVLRDVDGADGQGAQVILIASGSEVALALSSAERLAEEGVQARVVSMPCMEVFRQQEQSYRDSVLTPGVKRVAIEAASPQPWYEWTLGGPVIGMTTFGASAPAKVLFEKFGFSVENVVKVVHSVL
>pQR1743
ATGGCGCATTCGATCGAAGAATTAGCGATTACCACCATTCGAACGCTGTCGATTGACGCGATCGAAAAAGCGAAATCCGGGCACCCGGGCATGCCGATGGGCGCGGCCCCGATGGCGTATACGCTCTGGACGAAATTTATGAACCATAATCCAGCGAATCCCAACTGGTTTAACCGCGACCGGTTTGTTTTGTCCGCTGGGCACGGGTCGATGCTGCTTTACAGCCTGCTTCATCTAAGCGGCTACGATGTCACGATGGACGACTTGAAACAGTTCCGCCAATGGGGAAGCAAAACGCCGGGCCATCCGGAATACGGCCATACGCCAGGGGTGGAGGCAACGACCGGCCCGCTCGGCCAAGGGATTGCGATGGCGGTCGGCATGGCGATGGCGGAACGGCATTTGGCGGCTGCATACAATCGCGATGGATTTGACATTATCAACCACTACACGTATGCGATTTGCGGCGACGGCGATTTGATGGAAGGAGTGGCGAGCGAAGCGGCGTCACTCGCCGGCCACTTGAAGCTCGGCCGTCTGATCGTCCTGTATGACTCGAACGACATTTCGCTGGACGGCGAGCTCAACTTGTCGTTTTCGGAAAACGTCGCCCAACGTTTCCAAGCGTACGGCTGGCAATATTTGCGCGTTGAGGACGGCAACAATATTGAAGAAATCGCCAAAGCGCTCGAGGAGGCGCGGACGGACCTCAGCCGGCCGACGCTCATTGAAGTAAAAACGACGATTGGCTACGGCGCGCCAAATAAAGCGGGCACGTCCGGCGTCCACGGCGCTCCGCTCGGCGCCCAAGAGGCGAAGTTGACGAAAGAAGCGTACCGCTGGACGTTTTCCGAAGATTTCTACGTGCCGGATGAAGTGTACGCTCATTTCCGGGAAACGGTGCAAGAAGCCGGAGCGAGAAAAGAAGCGGAGTGGAATGAGCGCTTCGTTGCTTACGAGCGGGCGCATCCGGAATTGGCCGCCGAGCTGAAGCAGGCGATTGAAGGGAAGCTTCCGGATGGCTGGGAAACATCGCTGCCGGTGTATGAAGCGGGCAAAAGCTTGGCGACCCGCTCATCGTCCGGGGAAGTGATCAACGCCATCGCCAAAGCGGTGCCGCATTGTTTGGCGGTTCGGCGGACTTGGCAAGCTCGAATAAAACGCTTATCAAAGGCGGCGGCAACTTCTTGCCGGACAGCTACGAAGGGCGCAACATTTGGTTTGGCGTGCGCGAGTTTGCCATGGGCGCGGCGTTAAACGGCATGGCGCTTCACGGCGGGTTGAAAGTGTTCGGCGGCACGTTCTTCGTGTTCTCCGACTACTTGCGCCCGGCGATTCGGCTGGCGGCGCTCATGGGCTTGCCGGTGACGTACGTGCTGACGCACGACAGCATCGCCGTCGGGGAAGACGGCCCGACGCATGAGCCGGTCGAGCATCTCGCTTCACTTCGGGCGATGCCGAACTTGTCAGTCATCCGGCCGGCTGACGCAAACGAAACGGCGGCCGCCTGGCGGCTGGCGCTCGAGTCGACGAACAAGCCGACTGCGCTCGTCTTGACGCGTCAAGATGTGCCGACATTGCCGACAACCGCTCAGTTGGCGTATGAAGGCGTGAAAAAAGGCGCGTACGTCGTTTCACCGGCGAAAAACGGCGCTCCGGAGGCGCTGTTGTTGGCGACTGGCTCGGAAGTCGGTCTGGCCGTCAAAGCGCAAGAAGCGCTCGCCGCTGAGGGCATCCATGTCTCCGTCATCAGCATGCCATCGTGGGACCGCTTCGAAGCGCAGCCAAAATCGTACCGCGATGAAGTGCTGCCGCCGGCCGTGACGAAGCGGCTCGCCATTGAAATGGGCGCCTCGCTCGGTTGGGAGCGCTACGTCGGCGCCGAGGGCGACATTTTGGCCATCGACCGATTCGGTGCTTCCGCTCCGGGAGAGAAAATCATGGCCGAGTATGGCTTTACGGTTGACAACGTCGTCCGCCGCACAAAAGCGCTGCTCGGC
AAGTAA
>pQR1743
MAHSIEELAITTIRTLSIDAIEKAKSGHPGMPMGAAPMAYTLWTKFMNHNPANPNWFNRDRFVLSAGHGSMLLYSLLHLSGYDVTMDDLKQFRQWGSKTPGHPEYGHTPGVEATTGPLGQGIAMAVGMAMAERHLAAAYNRDGFDIINHYTYAICGDGDLMEGVASEAASLAGHLKLGRLIVLYDSNDISLDGELNLSFSENVAQRFQAYGWQYLRVEDGNNIEEIAKALEEARTDLSRPTLIEVKTTIGYGAPNKAGTSGVHGAPLGAQEAKLTKEAYRWTFSEDFYVPDEVYAHFRETVQEAGARKEAEWNERFVAYERAHPELAAELKQAIEGKLPDGWETSLPVYEAGKSLATRSSSGEVINAIAKAVPQLFGGSADLASSNKTLIKGGGNFLPDSYEGRNIWFGVREFAMGAALNGMALHGGLKVFGGTFFVFSDYLRPAIRLAALMGLPVTYVLTHDSIAVGEDGPTHEPVEHLASLRAMPNLSVIRPADANETAAAWRLALESTNKPTALVLTRQDVPTLPTTAQLAYEGVKKGAYVVSPAKNGAPEALLLATGSEVGLAVKAQEALAAEGIHVSVISMPSWDRFEAQPKSYRDEVLPPAVTKRLAIEMGASLGWERYVGAEGDILAIDRFGASAPGEKIMAEYGFTVDNVVRRTKALLGK
>pQR1744
ATGAACACCGGCACCCCAAAGACCCTGGACTGGTCTGATCTCGATAGACGTACCGTAGACGTGGTTCGTGCCCTGGCGATGGACGCGGTCGAAGAAGCGGGATCCGGGCACCCTGGAACCGCGATGAGTCTGGCGCCTGTGGCCTACCTGCTCTTCCAGAAGGTGATGCGGCACGATCCGACAGATCCGAAGTGGATCGGCCGCGACCGCTTCGTCCTGTCCTGCGGGCACTCCAGCCTCACGCTCTACATCCAGCTCTACCTGGCTGGCTACGGGCTGAGCCTGAACGACATCAAGCGGCTGCGCCAGTGGGGCAGCCTCACCCCGGGCCACCCCGAATACGGGCACACCGCCGGGGTGGAAACCACCACCGGCCCCTTGGGGCAGGGCATCGGCAACGCGGTCGGCATGGCCATGGCCGCCCGCCGGGAGCGGGGCCTGTTCGACCCGGACACCCCGATCGGGGAAAGCCCGTTCGACCACTACATCTACGTCCTGTGCTCTGACGGCGACGTCCAGGAGGGCATCAGCCACGAAGTAAGTGCCCTCGCCGGCACGCAGAAGCTCGGCAACCTCATCGTCATCTGGGACGACAACCGCATCTCCATCGAAGACGACACCCAGATCGCATTCACCGAAGACGTCGTCGCCCGCTACGCCGCCTACGGCTGGCACGTCCAAGAGGTCGAGTGGGTCGGCGAGGACGGCTCCTACCACGAAGACGTGGCGGCGCTGTACGACGCGATCCGGGCCGCCCAGGCGGAGACGGAACGTCCCTCTTTCATCCGGCTGCGCACCATCATCGGCTGGCCGTCCCCGAACAAGCAGAACACGGGGGCGATCCACGGCGCCGCGCTGGGGGCTGAAGAGGTCGCCGCCACCAAGCGGGTGCTGGGCCTCAACCCTGAGGCGCAGTTCGACGTGCCCAACGAGCTGCTGGAGCACGCCCGGGGCGTGGTGGCGCGGGGCCGCGCCGCCCGCCAGGAATGGGAGGCCTTGTTCGCCAAGTGGCGGGCCAACGCGGGCGAGCGTGCCGAACTGTTCGACCGGCTGATGGCAGGCTCGCTCCCGGACGGTTGGGAGAAGGCGATCCCGACCTTCGAGCCCAGCGCTAAGGGCATGGCCACCCGGAAAGCGTCCGGTGAGGTGCTGAGCGCGATCGCCCCGGTGCTGCCGGAGCTGTGGGGCGGCTCGGCGGACTTGGCCGGATCCAACAACACCACGCCTAAGGGCGAGCCGTCGTTCATCCCCGAGGAGCGGTCCACGAAGGCGTTCTCCGGCCACCGCTACGGCCGGGTGCTGCACTTCGGGATCCGTGAACACGGCATGGGGGCGATCCTCAACGGGATCGCGCTGCACGGCCCCACCCGCCCCTACGGTGGCACCTTCCTCGTGTTCAGCGACTACATGCGGCCGTCGGTGCGGCTGGCTGCCCTGATGAAGCTGCCGGTCACGTACGTGTGGACCCACGACTCGATCGGTCTGGGCGAAGACGGACCCACCCACCAGCCGGTGGAGCACCTGTGGTCGCTGCGCGCCATCCCCGGCCTGGCGGTGGTGCGTCCCGCCGACGCCAACGAGACGGCAGTGGCCTGGCGCACCATCCTGGAACGCAATGACGGCCCGGTGGCGCTCGCGCTGACCCGGCAGTCGGTTCCGGTTCTGGACCGCTCCGAGCTCGCCTCTGCGGAGCTGGTCTCCCGCGGCGGGTACATCCTGGCCGAAGCCAGCAACGGCCGTCCGGAGGCGATCATCATCGCCACCGGAAGTGAGGTGCAGATCGCGTTGGAGGCGCGTTCCCGCCTGGAGGAGTCGGGTACTCCTACCCGTGTGGTGTCGATGCCGTGCCTGGAGTGGTTCAACGAGCAGGACGACGCCTACCGCCAGCAGGTGCTTCCACCGTCGGTCCGGGTCCGGGTCTCCGTGGAAGCCGGGGTCGCCTTGGGCTGGCGCGAGCTGGTGGGCGAGTATGGCGAGTCGGTGAGTCTGGAACACTTCGG
CGCTTCGGCTCCGTACGCGACTCTCTACGAGCAGTTCGGGCTCACCGCCGACCGGGTAGTGGCAGCCGTACACTCCAGCGCTGCCAAGCTCGGCGGTGACCGTGGATCAACGACCGGCAACTGA
>pQR1744
MNTGTPKTLDWSDLDRRTVDVVRALAMDAVEEAGSGHPGTAMSLAPVAYLLFQKVMRHDPTDPKWIGRDRFVLSCGHSSLTLYIQLYLAGYGLSLNDIKRLRQWGSLTPGHPEYGHTAGVETTTGPLGQGIGNAVGMAMAARRERGLFDPDTPIGESPFDHYIYVLCSDGDVQEGISHEVSALAGTQKLGNLIVIWDDNRISIEDDTQIAFTEDVVARYAAYGWHVQEVEWVGEDGSYHEDVAALYDAIRAAQAETERPSFIRLRTIIGWPSPNKQNTGAIHGAALGAEEVAATKRVLGLNPEAQFDVPNELLEHARGVVARGRAARQEWEALFAKWRANAGERAELFDRLMAGSLPDGWEKAIPTFEPSAKGMATRKASGEVLSAIAPVLPELWGGSADLAGSNNTTPKGEPSFIPEERSTKAFSGHRYGRVLHFGIREHGMGAILNGIALHGPTRPYGGTFLVFSDYMRPSVRLAALMKLPVTYVWTHDSIGLGEDGPTHQPVEHLWSLRAIPGLAVVRPADANETAVAWRTILERNDGPVALALTRQSVPVLDRSELASAELVSRGGYILAEASNGRPEAIIIATGSEVQIALEARSRLEESGTPTRVVSMPCLEWFNEQDDAYRQQVLPPSVRVRVSVEAGVALGWRELVGEYGESVSLEHFGASAPYATLYEQFGLTADRVVAAVHSSAAKLGGDRGSTTGN
>pQR1745
ATGGAAAGGTTTCCCTATGAAAAACTTCCAGAAAGCGAACTCAAAGAGTTGAAAGAACTCGGAAGGCTCTGCCGTGGCGACATACTGAAAATGACCTACATAGCTAACTCAGGCCATCCTGGAGGATCCATGTCTTCGATCGATCTTTATCTTACCGTCTTCAAGTACGCAAAACTCAGACCCGTCGATGATCCTGCAAGAGACAGAATCGTGATCAGCCATGGACACACTTCTCCGGGTGTCTACGCAGCTATGGCTCGTTTGGGGTTTGTCGATCTCGATGAAGTCCTCGCAGGATTCAGACACCCCGCTTCCGTTTTTGAAGGACACGTGACCCGAGGTGTTGGGATCATCGACTGGACAACCGGAAACCTCGGTCAGGGTCTTTCAGCCGGACTCGGTTTTGCCCTCGCATCCAGGTTCACAGGAAAAGATTACCACGTCTTTGTTCTCATGAGTGACGCAGAACAGGCAAAAGGACAGGTGGCGGAGGCAAGAAGAGTGGCGAAAAAGTACGGTGTCACGAATCTCACAGTGATCATCGACTACAACGACGCCCAGATCAGTGGCCGTGCCAGAGACGTCATGCCCGTGAACATAAAGGAAAACTACTTAGCGGACGGCTGGAGGGTCATCGAGATCGATGGGCACGACTACGAACAGATCTATCTCGCACTGAAAGAAGCGGTAGAAGACGAAC
TGAATCCCGTTGCCATAATCGCCAAAACGGTCATGGGAAAAGGCGTATCTTTCATGGAAAACGAGGTGAAATACCACGGAAAGCCTTTGAACAGAGAAGAACTCGAAAAAGCCCTCGCGGAACTCGGAATTGAAAACGATGTTGATGTGTACATCGAAAAAAGAAAACAACTTCCAGTGGAAAAACACAAGAAAGTCTACAAAACTTACCCGATCAAGATCGACACGGGAGAGCCCATCACCTACACCTCACCCACTGACAACAGAAGCGCATTCGGAAAAGCTATTCTGGATCTGGTGAAGAAGAACGTAAACAATCCAGAAACCACACCCATCGTCGCTGTGGACTGCGACCTGAAGGGATCGGTCAAACTCGACCTGCTCGACAAAGAGTTCCCTGAGAGACTCCTGGAAGTGGGCGTTCAGGAACACAACGCTGCCGCTATGGCGGGGGCACTCTCCGCAGAGGGTGTGATCACGTTCTTCGCTGATTTTGGTGTTTTTGGAATTTCTGAAACCTACAACCAGCACAGGCTGAACGCCATCAATGGAACGAACCTCAAAGTCGTTGTCACACACTGCGGACTCAACGTGGGAGAGGACGGAAAAACTCATCACGGACTCGACTACGTTTCCGGGCCGATGAACTGGTACGGTTTCAAAGTGATCGTCCCTGGTGATCCCAACCAGACGGATAGAGTTGTCAGATACGCCGCGAAGGAATACGGGAACTTCGTAATCGCCATGGGAAGATCTAAGCTTCCCATCATCCTCGATGAAAACGGGAAACCTTTCTTCGGAGAGGGTTACACCTTCGAATATGGGAAGATCGATGTCGTTAGAAAAGGTGACGACGCGGTGATCATAACTTACGGTTCTACACTCTGTGAAGCCGTAAATGCCGCAGACGAACTCAAGAAAGAAGGAGTAAACGTAGCCGTTCTGAATGTCTCCTGTCCGGTGGATCTCGATATAGAGACCTTGAAGATGGTCGATGGAAAACCCGTTCTCGTTGTGGAGGATCACAACGTTTTCACAGGACTTGGAAGCTTCCTTGGAACCACCCTTCTTGAAAACGGCATCATCCCGAAGAAATACGTGAGAGTAGGTGTTCCAGAATTCGCCGTGTCCGGCAGTTACACGATGCTCTACAAACTCTACGGCCTGGATAAAGATGGAATAATTTCCAGACTCAGAGAGATGCTCTAA
>pQR1745
MERFPYEKLPESELKELKELGRLCRGDILKMTYIANSGHPGGSMSSIDLYLTVFKYAKLRPVDDPARDRIVISHGHTSPGVYAAMARLGFVDLDEVLAGFRHPASVFEGHVTRGVGIIDWTTGNLGQGLSAGLGFALASRFTGKDYHVFVLMSDAEQAKGQVAEARRVAKKYGVTNLTVIIDYNDAQISGRARDVMPVNIKENYLADGWRVIEIDGHDYEQIYLALKEAVEDELNPVAIIAKTVMGKGVSFMENEVKYHGKPLNREELEKALAELGIENDVDVYIEKRKQLPVEKHKKVYKTYPIKIDTGEPITYTSPTDNRSAFGKAILDLVKKNVNNPETTPIVAVDCDLKGSVKLDLDKEFPERLLEVGVQEHNAAAMAGALSAEGVITFFADFGVFGISETYNQHRLNAINGTNLKVVVTHCGLNVGEDGKTHHGLDYVSGPMNWYGFKVIVPGDPNQTDRVVRYAAKEYGNFVIAMGRSKLPIILDENGKPFFGEGYTFEYGKIDVVRKGDDAVIITYGSTLCEAVNAADELKKEGVNVAVLNVSCPVDLDIETLKMVDGKPVLVVEDHNVFTGLGSFLGTTLLENGIIPKKYVRVGVPEFAVSGSYTMLYKLYGLDKDGIISRLREML
2.5. TAm DNA/amino acid sequences
DNA sequences were retrieved from the NCBI database [5] and amino acid sequences were obtained through UniProtKB [6].
>TAmGste
ATGAAATTGGCAAAACGGGTGGCGTCGCTGACGCCATCGGCGACTTTGGCCATTACGGAGAAAGCAAAAGAACTAAAAGCGGCCGGGCATGACGTGATTGGTCTCGGAGCTGGCGAACCGGATTTCAACACGCCACAGCACATTCTTGATGCCGCCATCAAGGCAATGAACGAAGGACATACGAAATATACACCATCGGGCGGTTTGCCGGCGTTAAAGGAGGAAATTATAAAAAAATTCGCCCGCGACCAAGGCTTGGATTATGAGCCGGCTGAAGTGATTGTATGCGTCGGAGCGAAGCACGCCCTTTACACGCTGTTCCAAGTATTGCTCGATGAAGGCGACGAAGTGATCATTCCGACGCCATACTGGGTGAGCTATCCGGAACAAGTGAAACTGGCGGGCGGTGTTCCGGTTTACGTCGAAGGGCTTGAACAAAATCATTTTAAAATTACGCCGGAGCAGCTGAAACAGGCAATCACGCCGCGGACGAAAGCGGTTATCATCAACTCGCCGAGCAACCCGACTGGCATGATTTATACAGCCGAAGAGTTGAAGGCGCTTGGTGAGGTGTGCCTAGCGCATGGTGTATTGATCGTGTCAGATGAAATTTACGAAAAATTGACTTACGGCGGGGCGAAGCATGTGTCCATCGCTGAGTTGTCGCCGGAGCTGAAGGCGCAGACAGTCATCATTAACGGCGTGTCAAAGTCGCATTCGATGACGGGCTGGCGCATTGGTTATGCGGCGGGGCCGAAAGATATTATTAAGGCAATGACAGATTTGGCGAGCCACAGCACGTCCAACCCGACGTCAATCGCCCAATACGCGGCCATCGCTGCTTACAGCGGGCCGCAGGAGCCGGTCGAACAAATGCGCCAAGCGTTTGAACAACGGCTCAATATCATTTACGACAAGCTCGTGCAAATTCCAGGATTCACGTGCGTTAAGCCACAAGGGGCGTTTTATTTGTTCCCGAACGCCCGCGAAGCGGCTGCAATGGCCGGCTGCCGCACGGTCGACGAGTTCGTCGCTGCCTTGTTGGAGGAAGCGAAAGTCGCGCTTGTGCCCGGCTCTGGGTTTGGAGCGCCGGATAACGTTCGCTTGTCATACGCGACATCGCTCGATGCACTGGAAACCGCCGTGGAACGCATCCACCGGTTTATGGAAGCGCGCGCTTAA
>TAmGste
MKLAKRVASLTPSATLAITEKAKELKAAGHDVIGLGAGEPDFNTPQHILDAAIKAMNEGHTKYTPSGGLPALKEEIIKKFARDQGLDYEPAEVIVCVGAKHALYTLFQVLLDEGDEVIIPTPYWVSYPEQVKLAGGVPVYVEGLEQNHFKITPEQLKQAITPRTKAVIINSPSNPTGMITAEELKALGEVCLAHGVLIVSDEIYEKLTYGGAKHVSIAELSPELKAQTVIINGVSKSHSMTGWRIGYAAGPKDIIKAMTDLASHSTSNPTSIAQYAAIAAYSGPQEPVEQMRQAFEQRLNIIYDKLVQIPGFTCVKPQGAFYLFPNAREAAAMAGCRTVDEFVAALLEEAKVALVPGSGFGAPDNVRLSYATSLDALETAVERIHRFMEARA
>TAmDgeo
ATGTTCGAGGACACGCCCGCACCCTTTCCACCGCACATTCTGCTGACGCCCGGTCCGACACCGATTCACCCCCGGGCCCAGCGGGCGCTGCTGCGCGAGATGCTGGGGCACATGGACCCTGAGGTGTTCGCCCTGAACCGCGAGATCCAGGCGGACTTGCGGGTGATGTACGGGACGGGGCCCCAGACCTTTACGGCGCTGCTGGCGGGCACCGGGAGCCTGGGCATGGAGGCGGGCTTCGCCAACTTGGTGGAGAGGGGAGACGACGTGCTGATCTGCGTCAATGGTGCCTTTGGTCAGCGCATGGCCGAGATGGCGGCGCGCTACGGTGCGAATGTACGGCGGGTGACCGCGCCGCTGGGCGAGCCGATCGACCCGGCCGACGTGGCTGCGCGGTTGAGCGGCGCGCGGCTGGTGGCGGTGGTGCATGGGGAGACGAGCACGGGTGTGCTCAATCCGCTTCCGGAGATTGCCGAGGCCGTGCGCGGGAGCGGGGCATTGCTGGCCGTGGACGCCGTGACGACCGCCGGGATGGAACCCTTCCATATGGCGGACTGGGGCGTGGACTACGCCTATACCGGCGCGCAGAAGTGCCTCTCGGCACCGCCCGGCCTGGCCCCGGTGGCGATCAGCGACCGTGCTCTCGCTCGCCACGCGGCCCGCCGCACGCCCACGCCGCTGTGGTACTGCGATTTTGAGGGCCTGCGCGACTACTGGGACCGGCACAGCTACCACCACACGGTCCCGGTGAATCTGCACTACGCCTTCCACGCCGCCCTGCGCGCCGCACTCGAAGAAGGCCTCCAAGCCCGGCAGGCCCGCGTGCGCGACCTTGGCCAGGCGGTGCTGGCGGCCCTGACGCCGCTGGGCTTCACGCCGTATGTGGCCGATCCCGCCGCGCGGCTGCCCACCGTCTTGGCCCTGCGTCTTCCTCCCGGCTTCGACGACGCGGGCGTTCGCCAGGCCCTACGGGAACGCGGGATCAGCGTGACCGGCGGCCTGGGACCGACGGCAGGGCTGATCTGGCGTCTGGGCCTGATGGGGGAAGCGGCTCGCCCCGCGCCCTACCGCGCGCTGATGCTCGCCCTGGAAGACCTGCTGGGCGAGCGGGGCTTGGTGGCGCGCTTCGAGGAGGCGCTGGGCGTCGCGGCCTGA
>TAmDgeo
MFEDTPAPFPPHILLTPGPTPIHPRAQRALLREMLGHMDPEVFALNREIQADLRVMYGTGPQTFTALLAGTGSLGMEAGFANLVERGDDVLICVNGAFGQRMAEMAARYGANVRRVTAPLGEPIDPADVAARLSGARLVAVVHGETSTGVLNPLPEIAEAVRGSGALLAVDAVTTAGMEPFHMADWGVDYAYTGAQKCLSAPPGLAPVAISDRALARHAARRTPTPLWYCDFEGLRDYWDRHSYHHTVPVNLHYAFHAALRAALEEGLQARQARVRDLGQAVLAALTPLGFTPYVADPAARLPTVLALRLPPGFDDAGVRQALRERGISVTGGLGPTAGLIWRLGLMGEAARPAPYRALMLALEDLLGERGLVARFEEALGVAA
>pQR1746
ATGACCTCTCCTTTCCGCCTCTCCGCCCGCGCCCAGAGCCTCAAGCCGTCTGCGACAGTGGCGGTCACGTCCCGCGCCCTGGAACTCCAGCGTCAGGGCCTGGACGTGATTTCCATGAGCGTGGGCGAGCCGGATTTCGACACGCCGCCACATGTCAAGGCCGCCGGCATCGCCGCCATCGAGGAAGGCAAGACCAAATACACCCCGGTCAGCGGCATTCCCGAACTGCGCGAGGCCATCAGCGCCAAGTTTCGGCGCGAAAACGGCCTGGACTACGCGCCGAACGCCGTGACGGTAACGAGCGGCGGTAAACAGGCGCTGTTCAACGCCTTTTTCGCGTTGCTGAACCCCGGCGACGAGGTGCTGATTCCCGCGCCCCACTGGGTCAGCTACCCCGAAATGGTCGCGCTGACCGGCGCGGTGCCGGTAACCGTACCCACTACGCCGCAGCAGGGCTTTCAACTCGACCCGGACGCCCTCGCCGCCGCCATCACGCCGCGCACCCGCATGGTGATTCTCAACAGCCCCGGCAACCCGACGGGCGCGGTGTTTCCGCCGGAAACCTTGCGGGCGGTGGCCGACCTCGCCACGCAGCACGGCTTGATGATCGTCACCGACGAAATCTACGAGCACCTCGTCTACGACGCCGAGCAGGTCAGCATCGGCACCTACGCGCCGGAGCACACCCTGACCATCAATGGCGCGAGCAAAGCGTATGCCATGACCGGCTGGCGCATCGGCTACGCGGGCGGGCCGCGCGAGGTGATTGCCGCCATGAACGCGCTGCAATCGCAAAGCACCAGCAACGCCAGCAGCGTCAGCCAGTACGCCGCCCTCGCCGCGCTGGAACAGCACGAGGAAACCATGCGCTTCATCGACAGGGCCCGCACCGCCTACCGCGAACGGCGCGACCGCATCGTGGCGGGCCTCAACGCGCTGGGGTTGCCCACGCCCACGCCGCAAGGGGCCTTTTACGTGATGGCCGACACCCGCGCCATTCACACCGACGAACTCGAAGCCGCCCGCATCATTCTGGATGAGGCGCAGGTCGCCGTCGTGCCCGGCACCGATTTCGCCGCGCCGGGACAGGTGCGCCTGAGCTACGCGACCAGCATGGACAACATCGAGGAAGTGCTGCGGCGGCTGGAAGGGGTCGTGCGGCGCTAA
>pQR1746
MTSPFRLSARAQSLKPSATVAVTSRALELQRQGLDVISMSVGEPDFDTPPHVKAAGIAAIEEGKTKYTPVSGIPELREAISAKFRRENGLDYAPNAVTVTSGGKQALFNAFFALLNPGDEVLIPAPHWVSYPEMVALTGAVPVTVPTTPQQGFQLDPDALAAAITPRTRMVILNSPGNPTGAVFPPETLRAVADLATQHGLMIVTDEIYEHLVYDAEQVSIGTYAPEHTLTINGASKAYAMTGWRIGYAGGPREVIAAMNALQSQSTSNASSVSQYAALAALEQHEETMRFIDRARTAYRERRDRIVAGLNALGLPTPTPQGAFYVMADTRAIHTDELEAARIILDEAQVAVVPGTDFAAPGQVRLSYATSMDNIEEVLRRLEGVVRR
>pQR1748
ATGACTGACCGACCTCGTATCTCCGCACGCATCGGCGGTATCTCCGAGTCAGCGACCCTGGCGGTGGACGCCAAGGCCAAGGCCCTGAAGGCCGCTGGGCATCCCGTGATCGGCTTCGGCGCCGGGGAGCCTGACTTCCCCACGCCCGACTACATCGTGGAGGCAGCGGTCGCCGCCTGCCGCGACTCGCGCTTCCACCGCTACACCCCGGCGGGAGGCCTCCCCGAACTCAAGGAAGCCATCGCGGCTAAGACGCTGCGCGACTCCGGCTACCGGGTGGAGCCGAACCAAGTCCTGGTCACCAACGGCGGCAAGCAAGCGATCTACGAGGCGTTCGCCACGCTGCTCGATCCGGGCGACGAGGTCATCGTGATCGCGCCCTACTGGACCACCTACCCTGAATCGATCCGGCTGGCCGGAGGAACCCCGGTCTACGTGGTCACCGACGAGTCCACTGGCTACCTGGCCACGGTCGAGCAGCTGGAGGCGGCCCGCACCGACCGCACCAAGGTGCTGCTGTTCGTCTCCCCCTCGAACCCGACCGGCGCGGTGTACTCGCCCGAGCAGGTCCGCGAGATCGGCCGGTGGGCCCTCGAACACAACCTGTGGGTGCTCACCGACGAGATCTACGAGCACCTCGTCTACGGGGACGCCCGGTTCTCCTCGATGCCGGTGGAAGTTCCGGAACTGGCCGACCGCACCGTGGTGGTCAACGGGGTGGCCAAGACCTACGCCATGACCGGGTGGCGGGTCGGCTGGCTCATCGGCCCCGTGGACGTGGTCAAGGCTGCGACCAACCTGCAGTCGCACGCCACCTCCAATGTGGCCAACGTCTCGCAGGCCGCGGCTCTGGCAGCGGTCTCCGGCGACCTGTCGGCCGTGGAGGAGATGAAGCAGGCCTTCGACCGGCGGCGGCAGACCATTGTGCGGATGCTCAACGAGATCCCCGGTGTGGTGTGCCCCGAGCCCCAGGGCGCGTTCTACGCCTACCCGTCGGTCAAGGAGATCCTCGGCAAGGAGATCCGCGGTCAGCGTCCGCAGACCTCCAGCGAGCTGGCGTCGCTGATCCTGGAGCACGCCAAGGTCGCGGTGGTCCCGGGCGAGGCGTTCGGCACTCCGGGCTACCTGCGGTTGTCCTACGCGTTGAGCGACGCCGATCTGGTCGAAGGGGTCAGCCGGATCGCCAAGCTGCTGAGCGAAGCCCACTGA
>pQR1748
MTDRPRISARIGGISESATLAVDAKAKALKAAGHPVIGFGAGEPDFPTPDYIVEAAVAACRDSRFHRYTPAGGLPELKEAIAAKTLRDSGYRVEPNQVLVTNGGKQAIYEAFATLLDPGDEVIVIAPYWTTYPESIRLAGGTPVYVVTDESTGYLATVEQLEAARTDRTKVLLFVSPSNPTGAVYSPEQVREIGRWALEHNLWVLTDEIYEHLVYGDARFSSMPVEVPELADRTVVVNGVAKTYAMTGWRVGWLIGPVDVVKAATNLQSHATSNVANVSQAAALAAVSGDLSAVEEMKQAFDRRRQTIVRMLNEIPGVVCPEPQGAFYAYPSVKEILGKEIRGQRPQTSSELASLILEHAKVAVVPGEAFGTPGYLRLSYALSDADLVEGVSRIAKLLSEAH
>pQR1749
ATGGTATCCAGGAGAATATCAGAGATTCCCATATCGAAAACCATGGAACTCGACGCGAAGGCCAAAGCCCTCATAAAAAAGGGAGAAGACGTGATCAATCTAACGGCTGGTGAGCCGGATTTTCCCACACCGGAACCCGTCGTGGAAGAAGCGGTGAGATTTCTCCAGAAAGGAGAAGTGAAATACACAGATCCTCGTGGTATCTACGAACTCAGAGAGGGTATAGCGAAAAGGATAGGCGAGAGATACAAAAAAGATATCTCACCGGATCAGGTCGTGGTGACGAATGGAGCGAAACAGGCTCTGTTCAATGCTTTCATGGCCCTTCTCGATCCCGGTGACGAAGTGATCGTGTTTTCTCCCGTCTGGGTCAGCTACATTCCTCAGATCATCCTTGCTGGTGGCACGGTGAACGTGGTTGAGACGTTCATGAGTAAAAATTTCCAGCCCAGTCTGGAAGAGGTGGAAGGGCTTCTTGTTGGGAAAACGAAAGCCGTTCTTATCAACTCGCCGAACAATCCCACTGGTGTGGTGTACAGAAGAGAGTTCCTTGAAGGACTTGTGAGACTTGCCAAGAAGAGGAATTTCTACATAATCAGCGACGAAGTCTACGATTCCCTTGTTTACACGGATGAATTCACATCGATACTCGATGTTTCTGAAGGATTCGACCGGATAGTTTACATAAACGGCTTCTCGAAGTCTCACTCCATGACCGGCTGGAGGGTGGGTTACCTGATATCGAGCGAAAAAGTAGCGACCGCTGTGTCGAAGATCCAGTCTCACACCACCTCCTGTATCAACACGGTAGCACAGTACGCCGCCTTGAAGGCTCTGGAAGTGGACAACTCTTACATGGTTCAGACCTTTAAAGAAAGAAAAAATTTCGTGGTGGAAAGATTGAAAAAGATGGGTGTTAAGTTCGTGGAACCAGAAGGTGCGTTCTACCTCTTTTTCAAAGTCCGGGGTGACGATGTGAAATTCTGTGAAAGGCTCCTCGAAGAAAAGAAGGTTGCACTCGTTCCAGGATCCGCTTTTCTGAAGCCTGGATTTGTGAGGCTTTCTTTTGCCACATCTATAGAAAGACTTACGGAGGCGCTGGATAGAATTGAAGACTTCCTCAATTCTCGTTGA
>pQR1749
MVSRRISEIPISKTMELDAKAKALIKKGEDVINLTAGEPDFPTPEPVVEEAVRFLQKGEVKYTDPRGIYELREGIAKRIGERYKKDISPDQVVVTNGAKQALFNAFMALLDPGDEVIVFSPVWVSYIPQIILAGGTVNVVETFMSKNFQPSLEEVEGLLVGKTKAVLINSPNNPTGVVYRREFLEGLVRLAKKRNFYIISDEVYDSLVYTDEFTSILDVSEGFDRIVYINGFSKSHSMTGWRVGYLISSEKVATAVSKIQSHTTSCINTVAQYAALKALEVDNSYMVQTFKERKNFVVERLKKMGVKFVEPEGAFYLFFKVRGDDVKFCERLLEEKKVALVPGSAFLKPGFVRLSFATSIERLTEALDRIEDFLNSR
>pQR1751
ATGGCACCTGACCTGCGCCACCTGCACACCTTCGGCGAACTGGATCCGCCGCAACGCCTGTTGATGGGCCCCGGCCCGGTCAATGCGCATCCACGCGTGCTGCGTGCGATGGCGGCCGACCTGCTTGGCCAGTTCGACCCGGAAATGACCACCTACATGAACGAGGTGATGGCGCTGTACCGCCCCTTGTTCGGCACCCAGAACCGCTGGACCTTTCTGGTCGATGGCACGGCGCGCGCCGGCATCGAAGCCGCGCTGGTGTCGCTGGTGCAGCCGGGCGACCGTGTGCTGGTGATCAACTTCGGCCGCTTCGGTTTGTTGCTGACCGAAATCCTTGGCCGGCTCGGCGCCGACGTCCACACCGTGGATGCGCCGTGGGGCGAGGTGGTGCCGCTGGCGGCGATTGCCGAGGCGATCGCAAGCGTGGCACCCAAGCTGGTGGCCACCGTGCACGGCGACACCTCCACCACCATGGCGCAGCCGCTCGATGGCCTAGGCGCGCTATGCCGGGCGGCCGGCGCGCTGAGTTACGTAGACGCCACAGCCACCATCGGCGGCATGGACATCGCCAGCGACCGCTGGGAGGTGGACGTGGTCACCGCGGGGCTGCAGAAATGCCTGGGCGGGCCGTCCGGCTCGGCGCCGATCACTGTCTCTGCCGCGGCAGCGGAGGCGATCTTTGCGCGGCGGCATGTCGAACGCGGCATCGTGCGCGAGGACATCGCCAACGGCAGCGGCCCACGCATCGCCTCGAATTATTTCGACCTGGCGATGATCATGGATTACTGGTCCGACAAGCGCCTCAATCACCACACCGAAGCCACCACCATGCTGTACGGCGCGCGCGAATGCGCACGCGTGGCCTTGCAGGAAGGCCTGGAGGCGCGCTACGCCCGGCATGCGGCTGCCGGCCGCGCGGTCAGCGCCGGCGTGCGCGCACTGGGGCTGGAGGTGTTCGGCGACGATGCGCACCGCATGAGCAATGTCACCGGCGTGGTGATCCCGCACGGCGTCGACAGTGAAGCAGTGCGGCGGCGCATGCGCGAGGATTTCGAAATCGAGATCGGCACCGCGTTCGGCCCGCTGCAAGGCAGGATCTGGCGCATCGGTGCGATGGGCTACAACGCGATGAAGCACAAGGTGCTGCTCACCCTGGCCGCACTGGAAGCGGTGCTGCGCGCCGAGGGCTACGCGTGCACCCAAGGCCTGGCGGTCGAAGCCGCACGCGCCGCCTGGCATGCGGAGCCGGCTGCATGA
>pQR1751
MAPDLRHLHTFGELDPPQRLLMGPGPVNAHPRVLRAMAADLLGQFDPEMTTYMNEVMALYRPLFGTQNRWTFLVDGTARAGIEAALVSLVQPGDRVLVINFGRFGLLLTEILGRLGADVHTVDAPWGEVVPLAAIAEAIASVAPKLVATVHGDTSTTMAQPLDGLGALCRAAGALSYVDATATIGGMDIASDRWEVDVVTAGLQKCLGGPSGSAPITVSAAAAEAIFARRHVERGIVREDIANGSGPRIASNYFDLAMIMDYWSDKRLNHHTEATTMLYGARECARVALQEGLEARYARHAAAGRAVSAGVRALGLEVFGDDAHRMSNVTGVVIPHGVDSEAVRRRMREDFEIEIGTAFGPLQGRIWRIGAMGYNAMKHKVLLTLAALEAVLRAEGYACTQGLAVEAARAAWHAEPAA
>pQR1752
ATGGGAAAGTTTCTTAAGAAACACTACATAATGGCACCTGGACCAACACCAGTCCCAAACGATATTTTAACAGAAGGAGCGAAGGAAACAATACACCACAGAACACCTCAGTTTGTTTCCATAATGGAAGAGACCCTCGAAAGTGCAAAGTACATCTTTCAGACAAAACACAACGTGTACGCCTTTGCTTCCACAGGAACTGGCGCTATGGAAGCGGCGGTGGCGAATCTTGTGAGCCCTGGAGACAAAGTGATCGTGGTTGTGGCTGGAAAGTTCGGTGAAAGATGGAGAGAGCTCTGTCAGGCTTACGGTGCTGATATCGTAGAAATCGCCCTCGAATGGGGAGACGCGGTCACACCTGAACAGATCGAAGAGGCTCTCAACAAAAACCCCGATGCGAAGGTCGTCTTCACCACCTACAGTGAAACATCGACGGGTACAGTCATAGACCTCGAAGGAATTGCCAGAGTCACGAAGGAAAAAGACGTTGTTCTTGTGACAGACGCTGTCAGCGCTCTTGGAGCAGAACCACTGAAGATGGATGAATGGGGTGTGGATCTCGTTGTCACAGGTTCACAGAAGGGTTTGATGTTACCTCCAGGACTGGCGCTCATCTCTCTCAACGACAAAGCGTGGGGGCTCGTGGAAAAATCCAGATCTCCAAGGTACTACTTCGATCTGAGGGCCTACAGGAAATCTTACCCCGACAATCCTTACACCCCCGCAGTAAACATGATATACATGTTGAGAAAGGCTCTTCAGATGATAAAAGAGGAAGGCATAGAAAACGTATGGGAAAGGCACAGAATACTGGGAGACGCAACAAGAGCAGCGGTGAAAGCACTTGGACTGGAACTCCTCTCGAAAAGACCGGGAAACGTTGTAACAGCCGTGAAAGTGCCTGAGGGCATCGATGGAAAACAGATTCCCAAGATCATGAGAGACAAGTACGGTGTGACCATCGCCGGTGGACAGGCTAAACTCAAGGGAAAAATATTCAGGATAGCACACCTCGGATACATGTCACCTTTCGACACCATAACTGCCATTTCCGCTCTTGAATTAACCTTGAAGGAACTCGGTTATGAGTTCGAACTCGGAGTCGGTGTTAAGGCAGCCGAAGCTGTCTTCGCTAAAGAATTCATTGGGGAGTGA
>pQR1752
MGKFLKKHYIMAPGPTPVPNDILTEGAKETIHHRTPQFVSIMEETLESAKYIFQTKHNVYAFASTGTGAMEAAVANLVSPGDKVIVVVAGKFGERWRELCQAYGADIVEIALEWGDAVTPEQIEEALNKNPDAKVVFTTYSETSTGTVIDLEGIARVTKEKDVVLVTDAVSALGAEPLKMDEWGVDLVVTGSQKGLMLPPGLALISLNDKAWGLVEKSRSPRYYFDLRAYRKSYPDNPYTPAVNMIYMLRKALQMIKEEGIENVWERHRILGDATRAAVKALGLELLSKRPGNVVTAVKVPEGIDGKQIPKIMRDKYGVTIAGGQAKLKGKIFRIAHLGYMSPFDTITAISALELTLKELGYEFELGVGVKAAEAVFAKEFIGE
>pQR1755
ATGACTCAGATTTTTAATTTTAGCGCCGGTCCAGCAATGCTGCCGGTTGAAGTACTGCGTCGTGCTGAACAGGAATTGTGTAATTGGAATGGCCTGGGCACATCGGTTATGGAAATCAGCCACCGCAGTAAAGAGTTTATGCAGGTTGCCGCTGAATCCGAACAGGATCTGCGTGATTTGCTGAAAATCCCCTCCAACTACAAAGTGCTCTTTTGCCACGGCGGTGCTCGTGCGCAATTCGCCGCAGTGCCGTTAAATCTTCTGGGCGAACGCTCAACGGCCGACTACATCGACGGCGGGTATTGGGCGCACAGCGCAATCAATGAAGCAGAAAAATACTGCACGCCTAACGTGATTGACGTGAAAATGCGCGTGGGCGAACTGCGTGGCATTAAGCCGATGCGTGAATGGAAATTGTCTGATGACGCGGCGTTTGTGCATTACTGCCCGAATGAAACCATCGACGGTATTGCGATCGAAGAAGAGCCGGACTTTGGCGATAAAATTGTGGTCGCCGACTATTCTTCCAGCATCCTGTCTCGTCGTATTGATGTCAGCCGTTACGGCGTGATCTATGCCGGTGCGCAGAAAAATATCGGCCCTGCCGGCCTGACGCTGGTTATCGTACGTGAAGATTTGCTGGGCAAGGCGCGCCGTGAGCTGCCATCGATTCTGGATTACCAGGTTCTGGCGGACAATGACTCCATGTTTAACACGCCACCGACCTTTGCCTGGTACCTGTCCGGTATGGTCTTCAAATGGCTGAAAGAGTACGGCGGTCTGGCTGAAATGGAAAAACGTAACCAGGAGAAGGCTGACCTGCTGTATAGCGCGATTGACGGTAACGATTTCTATCGTAATGACGTTGCGGTAGCGAACCGTTCTCGCATGAATGTGCCATTCCTGTTGGCGGATTCTGCGCTGGATAAAGTCTTCCTGGAAGAATCAGTCGCTGCAGGTCTGCACGCGCTGAAAGGCCATCGCGTAGTAGGCGGCATGCGTGCCTCTATCTACAATGCGATGCCGTTGGAAGGCGTGAAAGTGCTGACGGAATTTATGGCTGACTTCGCTCGTCGCCACGGTTGA
>pQR1755
MTQIFNFSAGPAMLPVEVLRRAEQELCNWNGLGTSVMEISHRSKEFMQVAAESEQDLRDLLKIPSNYKVLFCHGGARAQFAAVPLNLLGERSTADYIDGGYWAHSAINEAEKYCTPNVIDVKMRVGELRGIKPMREWKLSDDAAFVHYCPNETIDGIAIEEEPDFGDKIVVADYSSSILSRRIDVSRYGVIYAGAQKNIGPAGLTLVIVREDLLGKARRELPSILDYQVLADNDSMFNTPPTFAWYLSGMVFKWLKEYGGLAEMEKRNQEKADLLYSAIDGNDFYRNDVAVANRSRMNVPFLLADSALDKVFLEESVAAGLHALKGHRVVGGMRASIYNAMPLEGVKVLTEFMADFARRHG
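Each enzyme above is listed as a DNA record followed by an amino acid record under the same name, so the two can be cross-checked by translating the DNA with the standard genetic code. The sketch below is an illustration, not part of the original workflow. The function name `translate` is hypothetical, and the code assumes reading frame 1 and translation up to the first stop codon. It also removes whitespace, so a DNA record wrapped across several lines can be passed in as one string.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 until the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    characters other than A, C, G or T are reported as 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break  # stop codon reached
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons of the TKDgeo DNA record listed above.
    print(translate("ATGAGTCCCGAACAGCAG"))
```

Comparing the translated string with the listed protein record flags any position where the two disagree; a mismatch may reflect a transcription or line-wrapping issue in either record and should be checked against the UniProtKB entry.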
Acknowledgements
The authors are grateful to the UK Engineering and Physical Sciences Research Council (EPSRC) for financial support of this work (EP/K014897/1). Furthermore, we acknowledge the advice and support from the project Industrial Advisory Board.
Footnotes
The Transparency document associated with this article can be found in the online version at doi: 10.1016/j.dib.2018.05.140.
References
- 1. Bawn M., Subrizi F., Lye G.J., Sheppard T.D., Hailes H.C., Ward J.M. One-pot, Two-step Transaminase and Transketolase Synthesis of L-gluco-heptulose from L-arabinose. Enzyme Microb. Technol. 2018;116:16–22. doi: 10.1016/j.enzmictec.2018.05.006.
- 2. Cárdenas-Fernández M., Bawn M., Hamley-Bennett C., Bharat P.K.V., Subrizi F., Suhaili N., Ward D.P., Bourdin S., Dalby P.A., Hailes H.C., Hewitson P., Ignatova S., Kontoravdi C., Leak D.J., Shah N., Sheppard T.D., Ward J.M., Lye G.J. An integrated biorefinery concept for conversion of sugar beet pulp into value-added chemicals and pharmaceutical intermediates. Faraday Discuss. 2017;346:183–190. doi: 10.1039/c7fd00094d.
- 3. Subrizi F., Cárdenas-Fernández M., Lye G.J., Ward J.M., Dalby P.A., Sheppard T.D., Hailes H.C. Transketolase catalysed upgrading of L-arabinose: the one-step stereoselective synthesis of L-gluco-heptulose. Green Chem. 2016;18:3158–3165.
- 4. Seliwanoff T. Notiz über eine Fruchtzuckerreaction. Chem. Ber. 1887;20:181–182.
- 5. National Center for Biotechnology Information (NCBI), National Library of Medicine (US), 1988. https://www.ncbi.nlm.nih.gov/.
- 6. The UniProt Consortium. UniProt: the universal protein knowledgebase. Nucleic Acids Res. 2017;45:D158–D169. doi: 10.1093/nar/gkw1099.
- 7. Larkin M.A., Blackshields G., Brown N.P., Chenna R., McGettigan P.A., McWilliam H., Valentin F., Wallace I.M., Wilm A., Lopez R., Thompson J.D., Gibson T.J., Higgins D.G. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–2948. doi: 10.1093/bioinformatics/btm404.