Skip to main content
Data in Brief logoLink to Data in Brief
. 2018 May 31;19:1341–1354. doi: 10.1016/j.dib.2018.05.140

Data on a thermostable enzymatic one-pot reaction for the production of a high-value compound from l-arabinose

Maria Bawn a, Fabiana Subrizi b, Gary J Lye a, Tom D Sheppard b, Helen C Hailes b, John M Ward a,
PMCID: PMC6140825  PMID: 30229010

Abstract

The dataset presented in this article is related to the research article entitled “One-pot, two-step transaminase and transketolase synthesis of l-gluco-heptulose from l-arabinose” (Bawn et al., 2018 in press) [1]. This article presents data on initial experiments that were carried out to investigate new thermostable transketolase (TK) activities with l-arabinose. Transaminase (TAm) sequences from an in-house library of thermophilic strains were analyzed to compare homologies to characterized TAms with desired activity. DNA and amino acid sequences are presented for all the enzymes investigated. Calibration curves for products of the TK and TAm reactions are also presented along with chromatographic analysis of the various one-pot reactions.


Specifications Table

Subject area Biology
More specific subject area Biocatalysis
Type of data Tables, text file, figures
How data was acquired Experiments/ in-vitro assays and high performance anion exchange chromatography with pulsed amperometric detection (HPAEC-PAD)
Data format Analyzed and tabulated
Experimental factors All enzymes and substrates were freshly prepared before use
Experimental features Experiments were carried out in triplicate
Data source location United Kingdom, London, University College London (UCL)
Data accessibility The data are accessible only within this article

Value of the data

  • The data presented in this article gives new insight into the activities of thermostable enzymes not published before.

  • The data represents a rationale behind why TKs and TAms were selected for the one-pot reaction.

  • Product of one-pot reaction, l-gluco-heptulose, is a pharmaceutically-relevant compound.

1. Data

l-Arabinose is a major monosaccharide of sugar beet pulp (SBP), a by-product of sucrose extraction which is currently produced and sold as a low value animal feed [1]. The main focus of this work was to create a value-added product from the monosaccharides that make up SBP via enzymatic routes. Building on previous work [2], [3], this present study produces l-gluco-heptulose, a high value, pharmaceutically relevant compound from l-arabinose using a two-step thermostable enzyme cascade. A thermostable TK catalyzed the synthesis of l-gluco-heptulose from l-arabinose and β-hydroxypyruvate (HPA) in which the latter was produced in situ from l-serine and α-ketoglutaric acid using a thermostable TAm.

Table 1 identifies thermostable TKs utilized and whether they were active towards l-arabinose via the Seliwanoff assay [4]. Table 2 describes the TAms investigated and compares sequence homologies to TAms previously showing activities required for this reaction. Examples of HPAEC-PAD traces (Fig. 2, Fig. 3) demonstrate how the TK and TAm one-pot reactions were monitored for the presence of l-gluco-heptulose.

Table 1.

TKs showing activity with l-arabinose via Seliwanoff assay.

TK Strain UniProtKB accession code Plasmid name and abbreviated name Active towards l-arabinose from Seliwanoff assay
Deinococcus geothermalis DSM 11300 Q1IW07 pQR1758 (TKDgeo)
Deinococcus radiodurans DSM 20539 Q9RS71 pQR1759 (TKDrad)
Geobacillus stearothermophilus DSM 22 KFL15812.1 pQR1743
Thermobifida fusca strain YX Q47ND4 pQR1744
Thermotoga maritima DSM 3109 Q9X283 pQR1745

Table 2.

Sequence similarity values between new cloned TAms and DGEO_0713, SPAT and CV2025.

TAm Strain UniProtKB accession code Plasmid name and abbreviated name Homology to DGEO_0713 Homology to SPAT Homology to CV2025
Deinococcus geothermalis(%) Sulfolobus solfataricus(%) Chromobacterium violaceum(%)
Deinococcus radiodurans DSM 20539 Q9RWP3 pQR1746 78 38 31
Geobacillus stearothermophilus DSM 22 Q59228 pQR1756 (TAmGste) 25 38 40
Thermobifida fusca strain YX Q47LH8 pQR1748 26 37 41
Thermotoga maritima DSM 3109 G4FE93 pQR1749 30 58 31
Deinococcus geothermalis DSM 11300 Q1IZC2 pQR1757 (TAmDgeo) 29 29 44
Xanthomonas campestris pv. Campestris DSM 3586 Q8PDQ2 pQR1751 28 25 30
Thermotoga maritima DSM 3109 Q9X1C0 pQR1752 33 34 36
Pectobacterium carotovorum subsp. Carotovorum DSM 30168 A0A0B3YSH6 pQR1755 24 31 33

Fig. 2.

Fig. 2

HPAEC-PAD trace showing l-gluco-heptulose production from one-pot reaction with TAmDgeo and TKDgeo after 24 h.

Fig. 3.

Fig. 3

HPAEC-PAD trace showing l-gluco-heptulose production from one-pot reaction with TAmDgeo and TKDrad after 24 h.

2. Experimental design, materials and methods

2.1. TK activity

Thermostable TKs were cloned and subsequently expressed in E.coli BL21 DE3. Cell lysates were used to determine activity towards l-arabinose using the colorimetric assay, Seliwanoff assay. The Seliwanoff assay distinguishes between ketoses and aldoses using 6 M HCl and resorcinol (Seliwanoff׳s reagent) [4]. After 24 h incubation of enzyme and l-arabinose, Seliwanoff reagent was added to the reaction and heated at 100 °C. Colour formation due to the presence of the ketose, l-gluco-heptulose, was observed within 15 min (Table 1).

2.2. TAm sequence analysis

TAm sequences were obtained from the NCBI database [5] and the UniProt Knowledgebase (UniProtKB) [6] followed by a sequence alignment using Clustal W [7].

2.3. Product analysis using HPAEC-PAD

Quantitative analysis of l-arabinose, l-gluco-heptulose and HPA was performed using HPAEC-PAD (ICS 5000+, Dionex) equipped with a Dionex AminopacTM PA1 anion exchange column 4×250 mm2 fitted with a Dionex AminopacTM PA1 guard column 4×50 mm2, an electrochemical detector system, and an eluent generator with a KOH 500 cartridge. The elution times of each compound can be observed in Fig. 1. Fig. 3, Fig. 4 are examples of a one-pot reaction analysis with various TKs and TAmDgeo. Standard calibration curves of l-gluco-heptulose and HPA were used for quantification purposes (Fig. 4, Fig. 5).

Fig. 1.

Fig. 1

HPAEC-PAD trace showing the elution of l-arabinose, l-gluco-heptulose and HPA.

Fig. 4.

Fig. 4

Calibration curve for determination of l-gluco-heptulose yield.

Fig. 5.

Fig. 5

Calibration curve for determination of HPA yield.

2.4. TK DNA/ amino acid sequences

DNA sequences were retrieved from the NCBI database [5] and amino acid sequences were obtained through the UniProtKB [6].

  • >TKDgeo

ATGAGTCCCGAACAGCAGGCCGTGCGTCAGGATGTCGATCAGCTGAGCATCAACACCATCAGGACGCTTGCCATCGATGCGGTGCAGCGGGCCAACAGTGGCCACCCCGGCGCGCCGCTCGGCATGGCCCCGATGGGCTACGTGCTGTGGCAGCGCTTCCTGCGCCACAATCCGAAACATCCCGAGTGGCCGGGCCGCGACCGCTTCGTGCTGTCGGCAGGGCACGCCAGCATGCTGATCTACTCGCTGCTGCACCTCACCGGCTACGACCTGCCGCTGGAGGACATCAAGAACTTCCGCCAGTGGGGCAGCAAGACGCCTGGGCATCCCGAGTTCTTCCACACCCCAGGCCTAGACGCCACCACCGGCCCGCTCGGTCAGGGTGCGGCGATGACGGTGGGCATGGCGATGGCCGAAGCGCACCTCGCCGCACGCTACAACCGCCCCGGCTTCAAGGTCTTTGACAACTACACCTACGCGATCTTGGGGGACGGCGACCTGCAAGAAGGCGTCAACCACGAGGCCGCGTCGCTGGCAGGGCACCTCAAGCTGGGCAAGCTGATCTGGCTGCACGACGACAACCAGGTGCAGCTGGACACCGCCACGTTCAAGGCGGCCAACGAGGATACTGCGGAGCGTTACCGCGCCTACGGCTGGGAAGTTCTGCGTGTGCAAGACGGCAACAATCTCACGGAGATCGAGAACGCGATCCGCCAGGCACGGATGAACACCGAGCAGCCCACCCTGATCCAGGTTCGCACGGTGATCGGCTTCGGCAGTCCCCGTGCGGGCACCAGCAAGGCGCACGGCGAGCCGCTGGGCGAGGAAGGCGTGCAGGAGACCAAGGCGGCCCTGGGCTGGGACTACCCGCCCTTCACGGTGCCCGACGAGGTCAAGGCGCATATGGACGCGACTGAGCGTGGCGCGGAGTGGGAGGCCGACTGGAACGCGCTGATGGAGCGCTACCGTGCCGAGTACCCCGATCTCGCGGCGGAGGTTGACGCGCTGCTGGCGCGCGAACTGCCCGCCAATCTCGCCGAAGTGCTCCCCTCCTACGAAGTGGGCAGCAAGGCCATCGCCACCCGCAACGCGAGCGGTGAAGTCATCAATGCGCTGGCGCAGGTGGTGCCGGGCCTGATGGGGGGCAGTGCGGATCTCTCCGGCAGCACCAAAACCACCATCAAGGACGGCGGCGAGTTTCTGCCAGGAAACTACGGGGGCCGCAACGTCTACTTTGGCGTCCGCGAGTTTGGGATGGCCGCAGCGGGCAATGGCCTTTCGCTCTACGGAGGTGTTCGGCCCCTGGTGGGGACCTTCCTGGTGTTTGCGGACTACCTCAAGCCCGCCTTCCGCCTCTCCGCCCTTCAGTTCCAGCCGGTTACCTATGTCCTGACCCATGACTCCATTGGCCTGGGCGAAGACGGCCCAACCCACCAGCCTATTGAGCAGCTCGCCATGCTGCGCGCCGTGCCGGGTGCCCACGTGATTCGCCCCGCCGACGCCAACGAGACGGCGGCGGCCTGGCAGATGGCGCTGGAGTACGACAAGGGACCAACCGCTCTGGCCCTCTCCCGCCAGGATCTCCCAGTGCTGCCCCGCAACCACGCGGGCGTGAAGAAGGGCGCCTACGTGGTTCGCGACGCCGAAGGGGGGCCGGCACAGATCATCTTGATCGCCACCGGCTCGGAGGTGAGCCTGGCGCTGGATGCTGCCCAAGCGCTGGCGGAGGAAGGCATCCAGGCTCGGGTCGTCTCAATGCCCTGCATGGAAGTCTTCCGCCAGCAGGACGCCAGTTATCGGGACAGCGTGCTCACCCCCGGCGTGAAACGCGTGGCCATCGAGGCTGCCAGCCCGCTCCCCTGGTATGAGTGGGTGGGCTTTGACGGCGCGGTGATCGGAATGACCACCTTTGGCGCCTCGGCCCCAGCCAAAGTCCTCTTTGAGAAATTCGGCTTCAACGTGCCGAACGTCGTGCAGGTCGTCAAGGGCGTTTTGCAGAGGTGA

  • >TKDgeo

MSPEQQAVRQDVDQLSINTIRTLAIDAVQRANSGHPGAPLGMAPMGYVLWQRFLRHNPKHPEWPGRDRFVLSAGHASMLIYSLLHLTGYDLPLEDIKNFRQWGSKTPGHPEFFHTPGLDATTGPLGQGAAMTVGMAMAEAHLAARYNRPGFKVFDNYTYAILGDGDLQEGVNHEAASLAGHLKLGKLIWLHDDNQVQLDTATFKAANEDTAERYRAYGWEVLRVQDGNNLTEIENAIRQARMNTEQPTLIQVRTVIGFGSPRAGTSKAHGEPLGEEGVQETKAALGWDYPPFTVPDEVKAHMDATERGAEWEADWNALMERYRAEYPDLAAEVDALLARELPANLAEVLPSYEVGSKAIATRNASGEVINALAQVVPGLMGGSADLSGSTKTTIKDGGEFLPGNYGGRNVYFGVREFGMAAAGNGLSLYGGVRPLVGTFLVFADYLKPAFRLSALQFQPVTYVLTHDSIGLGEDGPTHQPIEQLAMLRAVPGAHVIRPADANETAAAWQMALEYDKGPTALALSRQDLPVLPRNHAGVKKGAYVVRDAEGGPAQIILIATGSEVSLALDAAQALAEEGIQARVVSMPCMEVFRQQDASYRDSVLTPGVKRVAIEAASPLPWYEWVGFDGAVIGMTTFGASAPAKVLFEKFGFNVPNVVQVVKGVLQR

  • >TKDrad

ATGACAGACCAGAGCGTTTCCCAAAACGTGGCGCGGCTGAGTGTGAACACCATTCGCACGCTCGCCATTGACGCGGTGCAGGCCGCCAACTCGGGCCACCCCGGTGCGCCGCTGGGCATGGCCCCGATGGGCTACGTGCTGTGGCACAAGTTCCTGCGCCACAACCCCGCGCACCCTGAGTGGCCGGGCCGCGACCGCTTCGTGCTGTCGGCGGGGCACGCCTCCATGCTGATCTACAGCCTGCTGCACCTGACCGGCTACCAGGAAATGACGCTCGACGACCTGCGCCACTTCCGGCAGTGGGGCTACCACACCCCCGGCCACCCCGAGTTTTTCCACACCAAGGGTCTGGACGCGACCACCGGCCCGCTTGGGCAGGGCGCGGCGATGACGGTGGGCATGGCGATGGCCGAAGCACACCTCGCCGCCCGCTACAACCGCGAAGGCTTTCCGATTTTCGACAACCGCACCTACGCCATCATGGGCGACGGCGATCTGCAAGAAGGCATCAACCACGAAGCCGCCGCGCTCGCCGGGCACCTGAAACTCGGCAAGCTGATCTGGCTGCACGACGACAACCACATCCAGCTCGACACGCCCACGAACAAGGCCGAGTCCGAGGACACCGCCGCCCGCTTCCGCGCCTATGGCTGGAACGTGCTGAAGGTGGAAGACGGCGACAATCTGGACGAAATTGAAAAGGCGATTGCCGAGGCCCGCAGCCAAAGCGAGCGGCCCACGCTGATTCAGGTGCGCACCATCATCGGCTTCGGCAGCCCGCGCGCCGGCACGAGCAAGGCGCACGGCGAGCCGCTCGGCGAAGAGGGCGTGGCCGAGACGAAGAAAGCGTTGGGCTGGGAGTACCCCGCTTTTACCGTGCCCGACGAAGTGGCTGCGCACATGAACGCTCGCGCTAAGGGTGCTCAACTCGAAGCCGACTGGGAAAAACTGATGGCCGACTACCGCACCGCGCACCCCGACCTCGGCAAGGAAGTGGACGCGCTGCTCGCCCGTGAACTGCCCGCCAACCTCGCCGACCTGCTGCCCAAGTACGAAGTCGGCGGCAAGGCGGCGGCCACCCGCAACGCGAGCGGCGAAGTCATCAACGCGCTGGCGAAGGTGCTTCCCGGTTTGATGGGCGGCAGCGCGGACCTCTCGGGCTCGACCAAGACCACCATCAAGGACGGCGGCGAGATGGAAGCGGGCACCATGGGCGGGCGCAACGTGCTGTTCGGCGTGCGCGAGTTCGGCATGAGCGCCGCGGGCAATGGCCTGAGCCTCTACGGCGGCCTGCACCCGATGGTAGGCACCTTCCTGGTATTCGCCGACTACCTCAAGCCGGCTTTCCGCCTCTCGGCGCTGCAAATGCAGCCGGTGACTTACGTGCTGACCCACGACTCCATCGGTCTGGGCGAAGACGGGCCGACCCACCAGCCGGTGGACCAGCTCGCCATGCTGCGAGCGGTGCCGGGCGCCCACGTCATTCGCCCCGCCGACGCCAACGAAACCGCCGCCGCGTGGCTGATGGCGCTGGAATACGACAAGGGCCCCACCGCGCTCGCCCTCTCGCGCCAGGATCTGCCGATTCTGCCCGCCAACATCGAAGGCGTGAAGAAGGGCGCGTATGTCCTCCGAGATGTGGACGGTGCCGATGGTCAGGGGGCTCAAGTCATCCTGATC

GCCAGCGGCTCGGAAGTCGCCCTGGCCCTGAGCAGCGCCGAGCGGCTGGCCGAAGAGGGCGTGCAGGCCCGCGTGGTGTCCATGCCGTGCATGGAGGTCTTTCGCCAGCAGGAGCAGAGCTACCGCGACAGCGTGCTGACCCCCGGCGTGAAGCGCGTCGCCATCGAGGCCGCCAGCCCGCAGCCCTGGTACGAGTGGACGCTCGGCGGCCCAGTCATCGGCATGACGACCTTCGGTGCGTCGGCCCCGGCCAAGGTGCTGTTTGAGAAGTTCGGCTTCAGCGTGGAAAACGTGGTGAAGGTGGTCCACTCCGTGCTGTAA

  • >TKDrad

MTDQSVSQNVARLSVNTIRTLAIDAVQAANSGHPGAPLGMAPMGYVLWHKFLRHNPAHPEWPGRDRFVLSAGHASMLIYSLLHLTGYQEMTLDDLRHFRQWGYHTPGHPEFFHTKGLDATTGPLGQGAAMTVGMAMAEAHLAARYNREGFPIFDNRTYAIMGDGDLQEGINHEAAALAGHLKLGKLIWLHDDNHIQLDTPTNKAESEDTAARFRAYGWNVLKVEDGDNLDEIEKAIAEARSQSERPTLIQVRTIIGFGSPRAGTSKAHGEPLGEEGVAETKKALGWEYPAFTVPDEVAAHMNARAKGAQLEADWEKLMADYRTAHPDLGKEVDALLARELPANLADLLPKYEVGGKAAATRNASGEVINALAKVLPGLMGGSADLSGSTKTTIKDGGEMEAGTMGGRNVLFGVREFGMSAAGNGLSLYGGLHPMVGTFLVFADYLKPAFRLSALQMQPVTYVLTHDSIGLGEDGPTHQPVDQLAMLRAVPGAHVIRPADANETAAAWLMALEYDKGPTALALSRQDLPILPANIEGVKKGAYVLRDVDGADGQGAQVILIASGSEVALALSSAERLAEEGVQARVVSMPCMEVFRQQEQSYRDSVLTPGVKRVAIEAASPQPWYEWTLGGPVIGMTTFGASAPAKVLFEKFGFSVENVVKVVHSVL

  • >pQR1743

ATGGCGCATTCGATCGAAGAATTAGCGATTACCACCATTCGAACGCTGTCGATTGACGCGATCGAAAAAGCGAAATCCGGGCACCCGGGCATGCCGATGGGCGCGGCCCCGATGGCGTATACGCTCTGGACGAAATTTATGAACCATAATCCAGCGAATCCCAACTGGTTTAACCGCGACCGGTTTGTTTTGTCCGCTGGGCACGGGTCGATGCTGCTTTACAGCCTGCTTCATCTAAGCGGCTACGATGTCACGATGGACGACTTGAAACAGTTCCGCCAATGGGGAAGCAAAACGCCGGGCCATCCGGAATACGGCCATACGCCAGGGGTGGAGGCAACGACCGGCCCGCTCGGCCAAGGGATTGCGATGGCGGTCGGCATGGCGATGGCGGAACGGCATTTGGCGGCTGCATACAATCGCGATGGATTTGACATTATCAACCACTACACGTATGCGATTTGCGGCGACGGCGATTTGATGGAAGGAGTGGCGAGCGAAGCGGCGTCACTCGCCGGCCACTTGAAGCTCGGCCGTCTGATCGTCCTGTATGACTCGAACGACATTTCGCTGGACGGCGAGCTCAACTTGTCGTTTTCGGAAAACGTCGCCCAACGTTTCCAAGCGTACGGCTGGCAATATTTGCGCGTTGAGGACGGCAACAATATTGAAGAAATCGCCAAAGCGCTCGAGGAGGCGCGGACGGACCTCAGCCGGCCGACGCTCATTGAAGTAAAAACGACGATTGGCTACGGCGCGCCAAATAAAGCGGGCACGTCCGGCGTCCACGGCGCTCCGCTCGGCGCCCAAGAGGCGAAGTTGACGAAAGAAGCGTACCGCTGGACGTTTTCCGAAGATTTCTACGTGCCGGATGAAGTGTACGCTCATTTCCGGGAAACGGTGCAAGAAGCCGGAGCGAGAAAAGAAGCGGAGTGGAATGAGCGCTTCGTTGCTTACGAGCGGGCGCATCCGGAATTGGCCGCCGAGCTGAAGCAGGCGATTGAAGGGAAGCTTCCGGATGGCTGGGAAACATCGCTGCCGGTGTATGAAGCGGGCAAAAGCTTGGCGACCCGCTCATCGTCCGGGGAAGTGATCAACGCCATCGCCAAAGCGGTGCCGCATTGTTTGGCGGTTCGGCGGACTTGGCAAGCTCGAATAAAACGCTTATCAAAGGCGGCGGCAACTTCTTGCCGGACAGCTACGAAGGGCGCAACATTTGGTTTGGCGTGCGCGAGTTTGCCATGGGCGCGGCGTTAAACGGCATGGCGCTTCACGGCGGGTTGAAAGTGTTCGGCGGCACGTTCTTCGTGTTCTCCGACTACTTGCGCCCGGCGATTCGGCTGGCGGCGCTCATGGGCTTGCCGGTGACGTACGTGCTGACGCACGACAGCATCGCCGTCGGGGAAGACGGCCCGACGCATGAGCCGGTCGAGCATCTCGCTTCACTTCGGGCGATGCCGAACTTGTCAGTCATCCGGCCGGCTGACGCAAACGAAACGGCGGCCGCCTGGCGGCTGGCGCTCGAGTCGACGAACAAGCCGACTGCGCTCGTCTTGACGCGTCAAGATGTGCCGACATTGCCGACAACCGCTCAGTTGGCGTATGAAGGCGTGAAAAAAGGCGCGTACGTCGTTTCACCGGCGAAAAACGGCGCTCCGGAGGCGCTGTTGTTGGCGACTGGCTCGGAAGTCGGTCTGGCCGTCAAAGCGCAAGAAGCGCTCGCCGCTGAGGGCATCCATGTCTCCGTCATCAGCATGCCATCGTGGGACCGCTTCGAAGCGCAGCCAAAATCGTACCGCGATGAAGTGCTGCCGCCGGCCGTGACGAAGCGGCTCGCCATTGAAATGGGCGCCTCGCTCGGTTGGGAGCGCTACGTCGGCGCCGAGGGCGACATTTTGGCCATCGACCGATTCGGTGCTTCCGCTCCGGGAGAGAAAATCATGGCCGAGTATGGCTTTACGGTTGACAACGTCGTCCGCCGCACAAAAGCGCTGCTCGGCAAGTAA

  • > pQR1743

MAHSIEELAITTIRTLSIDAIEKAKSGHPGMPMGAAPMAYTLWTKFMNHNPANPNWFNRDRFVLSAGHGSMLLYSLLHLSGYDVTMDDLKQFRQWGSKTPGHPEYGHTPGVEATTGPLGQGIAMAVGMAMAERHLAAAYNRDGFDIINHYTYAICGDGDLMEGVASEAASLAGHLKLGRLIVLYDSNDISLDGELNLSFSENVAQRFQAYGWQYLRVEDGNNIEEIAKALEEARTDLSRPTLIEVKTTIGYGAPNKAGTSGVHGAPLGAQEAKLTKEAYRWTFSEDFYVPDEVYAHFRETVQEAGARKEAEWNERFVAYERAHPELAAELKQAIEGKLPDGWETSLPVYEAGKSLATRSSSGEVINAIAKAVPQLFGGSADLASSNKTLIKGGGNFLPDSYEGRNIWFGVREFAMGAALNGMALHGGLKVFGGTFFVFSDYLRPAIRLAALMGLPVTYVLTHDSIAVGEDGPTHEPVEHLASLRAMPNLSVIRPADANETAAAWRLALESTNKPTALVLTRQDVPTLPTTAQLAYEGVKKGAYVVSPAKNGAPEALLLATGSEVGLAVKAQEALAAEGIHVSVISMPSWDRFEAQPKSYRDEVLPPAVTKRLAIEMGASLGWERYVGAEGDILAIDRFGASAPGEKIMAEYGFTVDNVVRRTKALLGK

  • >pQR1744

ATGAACACCGGCACCCCAAAGACCCTGGACTGGTCTGATCTCGATAGACGTACCGTAGACGTGGTTCGTGCCCTGGCGATGGACGCGGTCGAAGAAGCGGGATCCGGGCACCCTGGAACCGCGATGAGTCTGGCGCCTGTGGCCTACCTGCTCTTCCAGAAGGTGATGCGGCACGATCCGACAGATCCGAAGTGGATCGGCCGCGACCGCTTCGTCCTGTCCTGCGGGCACTCCAGCCTCACGCTCTACATCCAGCTCTACCTGGCTGGCTACGGGCTGAGCCTGAACGACATCAAGCGGCTGCGCCAGTGGGGCAGCCTCACCCCGGGCCACCCCGAATACGGGCACACCGCCGGGGTGGAAACCACCACCGGCCCCTTGGGGCAGGGCATCGGCAACGCGGTCGGCATGGCCATGGCCGCCCGCCGGGAGCGGGGCCTGTTCGACCCGGACACCCCGATCGGGGAAAGCCCGTTCGACCACTACATCTACGTCCTGTGCTCTGACGGCGACGTCCAGGAGGGCATCAGCCACGAAGTAAGTGCCCTCGCCGGCACGCAGAAGCTCGGCAACCTCATCGTCATCTGGGACGACAACCGCATCTCCATCGAAGACGACACCCAGATCGCATTCACCGAAGACGTCGTCGCCCGCTACGCCGCCTACGGCTGGCACGTCCAAGAGGTCGAGTGGGTCGGCGAGGACGGCTCCTACCACGAAGACGTGGCGGCGCTGTACGACGCGATCCGGGCCGCCCAGGCGGAGACGGAACGTCCCTCTTTCATCCGGCTGCGCACCATCATCGGCTGGCCGTCCCCGAACAAGCAGAACACGGGGGCGATCCACGGCGCCGCGCTGGGGGCTGAAGAGGTCGCCGCCACCAAGCGGGTGCTGGGCCTCAACCCTGAGGCGCAGTTCGACGTGCCCAACGAGCTGCTGGAGCACGCCCGGGGCGTGGTGGCGCGGGGCCGCGCCGCCCGCCAGGAATGGGAGGCCTTGTTCGCCAAGTGGCGGGCCAACGCGGGCGAGCGTGCCGAACTGTTCGACCGGCTGATGGCAGGCTCGCTCCCGGACGGTTGGGAGAAGGCGATCCCGACCTTCGAGCCCAGCGCTAAGGGCATGGCCACCCGGAAAGCGTCCGGTGAGGTGCTGAGCGCGATCGCCCCGGTGCTGCCGGAGCTGTGGGGCGGCTCGGCGGACTTGGCCGGATCCAACAACACCACGCCTAAGGGCGAGCCGTCGTTCATCCCCGAGGAGCGGTCCACGAAGGCGTTCTCCGGCCACCGCTACGGCCGGGTGCTGCACTTCGGGATCCGTGAACACGGCATGGGGGCGATCCTCAACGGGATCGCGCTGCACGGCCCCACCCGCCCCTACGGTGGCACCTTCCTCGTGTTCAGCGACTACATGCGGCCGTCGGTGCGGCTGGCTGCCCTGATGAAGCTGCCGGTCACGTACGTGTGGACCCACGACTCGATCGGTCTGGGCGAAGACGGACCCACCCACCAGCCGGTGGAGCACCTGTGGTCGCTGCGCGCCATCCCCGGCCTGGCGGTGGTGCGTCCCGCCGACGCCAACGAGACGGCAGTGGCCTGGCGCACCATCCTGGAACGCAATGACGGCCCGGTGGCGCTCGCGCTGACCCGGCAGTCGGTTCCGGTTCTGGACCGCTCCGAGCTCGCCTCTGCGGAGCTGGTCTCCCGCGGCGGGTACATCCTGGCCGAAGCCAGCAACGGCCGTCCGGAGGCGATCATCATCGCCACCGGAAGTGAGGTGCAGATCGCGTTGGAGGCGCGTTCCCGCCTGGAGGAGTCGGGTACTCCTACCCGTGTGGTGTCGATGCCGTGCCTGGAGTGGTTCAACGAGCAGGACGACGCCTACCGCCAGCAGGTGCTTCCACCGTCGGTCCGGGTCCGGGTCTCCGTGGAAGCCGGGGTCGCCTTGGGCTGGCGCGAGCTGGTGGGCGAGTATGGCGAGTCGGTGAGTCTGGAACACTTCGGCGCTTCGGCTCCGTACGCGACTCTCTACGAGCAGTTCGGGCTCACCGCCGACCGGGTAGTGGCAGCCGTACACTCCAGCGCTGCCAAGCTCGGCGGTGACCGTGGATCAACGACCGGCAACTGA

  • >pQR1744

MNTGTPKTLDWSDLDRRTVDVVRALAMDAVEEAGSGHPGTAMSLAPVAYLLFQKVMRHDPTDPKWIGRDRFVLSCGHSSLTLYIQLYLAGYGLSLNDIKRLRQWGSLTPGHPEYGHTAGVETTTGPLGQGIGNAVGMAMAARRERGLFDPDTPIGESPFDHYIYVLCSDGDVQEGISHEVSALAGTQKLGNLIVIWDDNRISIEDDTQIAFTEDVVARYAAYGWHVQEVEWVGEDGSYHEDVAALYDAIRAAQAETERPSFIRLRTIIGWPSPNKQNTGAIHGAALGAEEVAATKRVLGLNPEAQFDVPNELLEHARGVVARGRAARQEWEALFAKWRANAGERAELFDRLMAGSLPDGWEKAIPTFEPSAKGMATRKASGEVLSAIAPVLPELWGGSADLAGSNNTTPKGEPSFIPEERSTKAFSGHRYGRVLHFGIREHGMGAILNGIALHGPTRPYGGTFLVFSDYMRPSVRLAALMKLPVTYVWTHDSIGLGEDGPTHQPVEHLWSLRAIPGLAVVRPADANETAVAWRTILERNDGPVALALTRQSVPVLDRSELASAELVSRGGYILAEASNGRPEAIIIATGSEVQIALEARSRLEESGTPTRVVSMPCLEWFNEQDDAYRQQVLPPSVRVRVSVEAGVALGWRELVGEYGESVSLEHFGASAPYATLYEQFGLTADRVVAAVHSSAAKLGGDRGSTTGN

  • >pQR1745

ATGGAAAGGTTTCCCTATGAAAAACTTCCAGAAAGCGAACTCAAAGAGTTGAAAGAACTCGGAAGGCTCTGCCGTGGCGACATACTGAAAATGACCTACATAGCTAACTCAGGCCATCCTGGAGGATCCATGTCTTCGATCGATCTTTATCTTACCGTCTTCAAGTACGCAAAACTCAGACCCGTCGATGATCCTGCAAGAGACAGAATCGTGATCAGCCATGGACACACTTCTCCGGGTGTCTACGCAGCTATGGCTCGTTTGGGGTTTGTCGATCTCGATGAAGTCCTCGCAGGATTCAGACACCCCGCTTCCGTTTTTGAAGGACACGTGACCCGAGGTGTTGGGATCATCGACTGGACAACCGGAAACCTCGGTCAGGGTCTTTCAGCCGGACTCGGTTTTGCCCTCGCATCCAGGTTCACAGGAAAAGATTACCACGTCTTTGTTCTCATGAGTGACGCAGAACAGGCAAAAGGACAGGTGGCGGAGGCAAGAAGAGTGGCGAAAAAGTACGGTGTCACGAATCTCACAGTGATCATCGACTACAACGACGCCCAGATCAGTGGCCGTGCCAGAGACGTCATGCCCGTGAACATAAAGGAAAACTACTTAGCGGACGGCTGGAGGGTCATCGAGATCGATGGGCACGACTACGAACAGATCTATCTCGCACTGAAAGAAGCGGTAGAAGACGAAC

TGAATCCCGTTGCCATAATCGCCAAAACGGTCATGGGAAAAGGCGTATCTTTCATGGAAAACGAGGTGAAATACCACGGAAAGCCTTTGAACAGAGAAGAACTCGAAAAAGCCCTCGCGGAACTCGGAATTGAAAACGATGTTGATGTGTACATCGAAAAAAGAAAACAACTTCCAGTGGAAAAACACAAGAAAGTCTACAAAACTTACCCGATCAAGATCGACACGGGAGAGCCCATCACCTACACCTCACCCACTGACAACAGAAGCGCATTCGGAAAAGCTATTCTGGATCTGGTGAAGAAGAACGTAAACAATCCAGAAACCACACCCATCGTCGCTGTGGACTGCGACCTGAAGGGATCGGTCAAACTCGACCTGCTCGACAAAGAGTTCCCTGAGAGACTCCTGGAAGTGGGCGTTCAGGAACACAACGCTGCCGCTATGGCGGGGGCACTCTCCGCAGAGGGTGTGATCACGTTCTTCGCTGATTTTGGTGTTTTTGGAATTTCTGAAACCTACAACCAGCACAGGCTGAACGCCATCAATGGAACGAACCTCAAAGTCGTTGTCACACACTGCGGACTCAACGTGGGAGAGGACGGAAAAACTCATCACGGACTCGACTACGTTTCCGGGCCGATGAACTGGTACGGTTTCAAAGTGATCGTCCCTGGTGATCCCAACCAGACGGATAGAGTTGTCAGATACGCCGCGAAGGAATACGGGAACTTCGTAATCGCCATGGGAAGATCTAAGCTTCCCATCATCCTCGATGAAAACGGGAAACCTTTCTTCGGAGAGGGTTACACCTTCGAATATGGGAAGATCGATGTCGTTAGAAAAGGTGACGACGCGGTGATCATAACTTACGGTTCTACACTCTGTGAAGCCGTAAATGCCGCAGACGAACTCAAGAAAGAAGGAGTAAACGTAGCCGTTCTGAATGTCTCCTGTCCGGTGGATCTCGATATAGAGACCTTGAAGATGGTCGATGGAAAACCCGTTCTCGTTGTGGAGGATCACAACGTTTTCACAGGACTTGGAAGCTTCCTTGGAACCACCCTTCTTGAAAACGGCATCATCCCGAAGAAATACGTGAGAGTAGGTGTTCCAGAATTCGCCGTGTCCGGCAGTTACACGATGCTCTACAAACTCTACGGCCTGGATAAAGATGGAATAATTTCCAGACTCAGAGAGATGCTCTAA

  • > pQR1745

MERFPYEKLPESELKELKELGRLCRGDILKMTYIANSGHPGGSMSSIDLYLTVFKYAKLRPVDDPARDRIVISHGHTSPGVYAAMARLGFVDLDEVLAGFRHPASVFEGHVTRGVGIIDWTTGNLGQGLSAGLGFALASRFTGKDYHVFVLMSDAEQAKGQVAEARRVAKKYGVTNLTVIIDYNDAQISGRARDVMPVNIKENYLADGWRVIEIDGHDYEQIYLALKEAVEDELNPVAIIAKTVMGKGVSFMENEVKYHGKPLNREELEKALAELGIENDVDVYIEKRKQLPVEKHKKVYKTYPIKIDTGEPITYTSPTDNRSAFGKAILDLVKKNVNNPETTPIVAVDCDLKGSVKLDLDKEFPERLLEVGVQEHNAAAMAGALSAEGVITFFADFGVFGISETYNQHRLNAINGTNLKVVVTHCGLNVGEDGKTHHGLDYVSGPMNWYGFKVIVPGDPNQTDRVVRYAAKEYGNFVIAMGRSKLPIILDENGKPFFGEGYTFEYGKIDVVRKGDDAVIITYGSTLCEAVNAADELKKEGVNVAVLNVSCPVDLDIETLKMVDGKPVLVVEDHNVFTGLGSFLGTTLLENGIIPKKYVRVGVPEFAVSGSYTMLYKLYGLDKDGIISRLREML

2.5. TAm DNA/ amino acid sequences

DNA sequences were retrieved from the NCBI database [5] and amino acid sequences were obtained through UniProtKB [6].

  • >TAmGste

ATGAAATTGGCAAAACGGGTGGCGTCGCTGACGCCATCGGCGACTTTGGCCATTACGGAGAAAGCAAAAGAACTAAAAGCGGCCGGGCATGACGTGATTGGTCTCGGAGCTGGCGAACCGGATTTCAACACGCCACAGCACATTCTTGATGCCGCCATCAAGGCAATGAACGAAGGACATACGAAATATACACCATCGGGCGGTTTGCCGGCGTTAAAGGAGGAAATTATAAAAAAATTCGCCCGCGACCAAGGCTTGGATTATGAGCCGGCTGAAGTGATTGTATGCGTCGGAGCGAAGCACGCCCTTTACACGCTGTTCCAAGTATTGCTCGATGAAGGCGACGAAGTGATCATTCCGACGCCATACTGGGTGAGCTATCCGGAACAAGTGAAACTGGCGGGCGGTGTTCCGGTTTACGTCGAAGGGCTTGAACAAAATCATTTTAAAATTACGCCGGAGCAGCTGAAACAGGCAATCACGCCGCGGACGAAAGCGGTTATCATCAACTCGCCGAGCAACCCGACTGGCATGATTTATACAGCCGAAGAGTTGAAGGCGCTTGGTGAGGTGTGCCTAGCGCATGGTGTATTGATCGTGTCAGATGAAATTTACGAAAAATTGACTTACGGCGGGGCGAAGCATGTGTCCATCGCTGAGTTGTCGCCGGAGCTGAAGGCGCAGACAGTCATCATTAACGGCGTGTCAAAGTCGCATTCGATGACGGGCTGGCGCATTGGTTATGCGGCGGGGCCGAAAGATATTATTAAGGCAATGACAGATTTGGCGAGCCACAGCACGTCCAACCCGACGTCAATCGCCCAATACGCGGCCATCGCTGCTTACAGCGGGCCGCAGGAGCCGGTCGAACAAATGCGCCAAGCGTTTGAACAACGGCTCAATATCATTTACGACAAGCTCGTGCAAATTCCAGGATTCACGTGCGTTAAGCCACAAGGGGCGTTTTATTTGTTCCCGAACGCCCGCGAAGCGGCTGCAATGGCCGGCTGCCGCACGGTCGACGAGTTCGTCGCTGCCTTGTTGGAGGAAGCGAAAGTCGCGCTTGTGCCCGGCTCTGGGTTTGGAGCGCCGGATAACGTTCGCTTGTCATACGCGACATCGCTCGATGCACTGGAAACCGCCGTGGAACGCATCCACCGGTTTATGGAAGCGCGCGCTTAA

  • >TAmGste

MKLAKRVASLTPSATLAITEKAKELKAAGHDVIGLGAGEPDFNTPQHILDAAIKAMNEGHTKYTPSGGLPALKEEIIKKFARDQGLDYEPAEVIVCVGAKHALYTLFQVLLDEGDEVIIPTPYWVSYPEQVKLAGGVPVYVEGLEQNHFKITPEQLKQAITPRTKAVIINSPSNPTGMITAEELKALGEVCLAHGVLIVSDEIYEKLTYGGAKHVSIAELSPELKAQTVIINGVSKSHSMTGWRIGYAAGPKDIIKAMTDLASHSTSNPTSIAQYAAIAAYSGPQEPVEQMRQAFEQRLNIIYDKLVQIPGFTCVKPQGAFYLFPNAREAAAMAGCRTVDEFVAALLEEAKVALVPGSGFGAPDNVRLSYATSLDALETAVERIHRFMEARA

  • >TAmDGeo

ATGTTCGAGGACACGCCCGCACCCTTTCCACCGCACATTCTGCTGACGCCCGGTCCGACACCGATTCACCCCCGGGCCCAGCGGGCGCTGCTGCGCGAGATGCTGGGGCACATGGACCCTGAGGTGTTCGCCCTGAACCGCGAGATCCAGGCGGACTTGCGGGTGATGTACGGGACGGGGCCCCAGACCTTTACGGCGCTGCTGGCGGGCACCGGGAGCCTGGGCATGGAGGCGGGCTTCGCCAACTTGGTGGAGAGGGGAGACGACGTGCTGATCTGCGTCAATGGTGCCTTTGGTCAGCGCATGGCCGAGATGGCGGCGCGCTACGGTGCGAATGTACGGCGGGTGACCGCGCCGCTGGGCGAGCCGATCGACCCGGCCGACGTGGCTGCGCGGTTGAGCGGCGCGCGGCTGGTGGCGGTGGTGCATGGGGAGACGAGCACGGGTGTGCTCAATCCGCTTCCGGAGATTGCCGAGGCCGTGCGCGGGAGCGGGGCATTGCTGGCCGTGGACGCCGTGACGACCGCCGGGATGGAACCCTTCCATATGGCGGACTGGGGCGTGGACTACGCCTATACCGGCGCGCAGAAGTGCCTCTCGGCACCGCCCGGCCTGGCCCCGGTGGCGATCAGCGACCGTGCTCTCGCTCGCCACGCGGCCCGCCGCACGCCCACGCCGCTGTGGTACTGCGATTTTGAGGGCCTGCGCGACTACTGGGACCGGCACAGCTACCACCACACGGTCCCGGTGAATCTGCACTACGCCTTCCACGCCGCCCTGCGCGCCGCACTCGAAGAAGGCCTCCAAGCCCGGCAGGCCCGCGTGCGCGACCTTGGCCAGGCGGTGCTGGCGGCCCTGACGCCGCTGGGCTTCACGCCGTATGTGGCCGATCCCGCCGCGCGGCTGCCCACCGTCTTGGCCCTGCGTCTTCCTCCCGGCTTCGACGACGCGGGCGTTCGCCAGGCCCTACGGGAACGCGGGATCAGCGTGACCGGCGGCCTGGGACCGACGGCAGGGCTGATCTGGCGTCTGGGCCTGATGGGGGAAGCGGCTCGCCCCGCGCCCTACCGCGCGCTGATGCTCGCCCTGGAAGACCTGCTGGGCGAGCGGGGCTTGGTGGCGCGCTTCGAGGAGGCGCTGGGCGTCGCGGCCTGA

  • >TAmDGeo

MFEDTPAPFPPHILLTPGPTPIHPRAQRALLREMLGHMDPEVFALNREIQADLRVMYGTGPQTFTALLAGTGSLGMEAGFANLVERGDDVLICVNGAFGQRMAEMAARYGANVRRVTAPLGEPIDPADVAARLSGARLVAVVHGETSTGVLNPLPEIAEAVRGSGALLAVDAVTTAGMEPFHMADWGVDYAYTGAQKCLSAPPGLAPVAISDRALARHAARRTPTPLWYCDFEGLRDYWDRHSYHHTVPVNLHYAFHAALRAALEEGLQARQARVRDLGQAVLAALTPLGFTPYVADPAARLPTVLALRLPPGFDDAGVRQALRERGISVTGGLGPTAGLIWRLGLMGEAARPAPYRALMLALEDLLGERGLVARFEEALGVAA

  • >pQR1746

ATGACCTCTCCTTTCCGCCTCTCCGCCCGCGCCCAGAGCCTCAAGCCGTCTGCGACAGTGGCGGTCACGTCCCGCGCCCTGGAACTCCAGCGTCAGGGCCTGGACGTGATTTCCATGAGCGTGGGCGAGCCGGATTTCGACACGCCGCCACATGTCAAGGCCGCCGGCATCGCCGCCATCGAGGAAGGCAAGACCAAATACACCCCGGTCAGCGGCATTCCCGAACTGCGCGAGGCCATCAGCGCCAAGTTTCGGCGCGAAAACGGCCTGGACTACGCGCCGAACGCCGTGACGGTAACGAGCGGCGGTAAACAGGCGCTGTTCAACGCCTTTTTCGCGTTGCTGAACCCCGGCGACGAGGTGCTGATTCCCGCGCCCCACTGGGTCAGCTACCCCGAAATGGTCGCGCTGACCGGCGCGGTGCCGGTAACCGTACCCACTACGCCGCAGCAGGGCTTTCAACTCGACCCGGACGCCCTCGCCGCCGCCATCACGCCGCGCACCCGCATGGTGATTCTCAACAGCCCCGGCAACCCGACGGGCGCGGTGTTTCCGCCGGAAACCTTGCGGGCGGTGGCCGACCTCGCCACGCAGCACGGCTTGATGATCGTCACCGACGAAATCTACGAGCACCTCGTCTACGACGCCGAGCAGGTCAGCATCGGCACCTACGCGCCGGAGCACACCCTGACCATCAATGGCGCGAGCAAAGCGTATGCCATGACCGGCTGGCGCATCGGCTACGCGGGCGGGCCGCGCGAGGTGATTGCCGCCATGAACGCGCTGCAATCGCAAAGCACCAGCAACGCCAGCAGCGTCAGCCAGTACGCCGCCCTCGCCGCGCTGGAACAGCACGAGGAAACCATGCGCTTCATCGACAGGGCCCGCACCGCCTACCGCGAACGGCGCGACCGCATCGTGGCGGGCCTCAACGCGCTGGGGTTGCCCACGCCCACGCCGCAAGGGGCCTTTTACGTGATGGCCGACACCCGCGCCATTCACACCGACGAACTCGAAGCCGCCCGCATCATTCTGGATGAGGCGCAGGTCGCCGTCGTGCCCGGCACCGATTTCGCCGCGCCGGGACAGGTGCGCCTGAGCTACGCGACCAGCATGGACAACATCGAGGAAGTGCTGCGGCGGCTGGAAGGGGTCGTGCGGCGCTAA

  • >pQR1746

MTSPFRLSARAQSLKPSATVAVTSRALELQRQGLDVISMSVGEPDFDTPPHVKAAGIAAIEEGKTKYTPVSGIPELREAISAKFRRENGLDYAPNAVTVTSGGKQALFNAFFALLNPGDEVLIPAPHWVSYPEMVALTGAVPVTVPTTPQQGFQLDPDALAAAITPRTRMVILNSPGNPTGAVFPPETLRAVADLATQHGLMIVTDEIYEHLVYDAEQVSIGTYAPEHTLTINGASKAYAMTGWRIGYAGGPREVIAAMNALQSQSTSNASSVSQYAALAALEQHEETMRFIDRARTAYRERRDRIVAGLNALGLPTPTPQGAFYVMADTRAIHTDELEAARIILDEAQVAVVPGTDFAAPGQVRLSYATSMDNIEEVLRRLEGVVRR

  • >pQR1748

ATGACTGACCGACCTCGTATCTCCGCACGCATCGGCGGTATCTCCGAGTCAGCGACCCTGGCGGTGGACGCCAAGGCCAAGGCCCTGAAGGCCGCTGGGCATCCCGTGATCGGCTTCGGCGCCGGGGAGCCTGACTTCCCCACGCCCGACTACATCGTGGAGGCAGCGGTCGCCGCCTGCCGCGACTCGCGCTTCCACCGCTACACCCCGGCGGGAGGCCTCCCCGAACTCAAGGAAGCCATCGCGGCTAAGACGCTGCGCGACTCCGGCTACCGGGTGGAGCCGAACCAAGTCCTGGTCACCAACGGCGGCAAGCAAGCGATCTACGAGGCGTTCGCCACGCTGCTCGATCCGGGCGACGAGGTCATCGTGATCGCGCCCTACTGGACCACCTACCCTGAATCGATCCGGCTGGCCGGAGGAACCCCGGTCTACGTGGTCACCGACGAGTCCACTGGCTACCTGGCCACGGTCGAGCAGCTGGAGGCGGCCCGCACCGACCGCACCAAGGTGCTGCTGTTCGTCTCCCCCTCGAACCCGACCGGCGCGGTGTACTCGCCCGAGCAGGTCCGCGAGATCGGCCGGTGGGCCCTCGAACACAACCTGTGGGTGCTCACCGACGAGATCTACGAGCACCTCGTCTACGGGGACGCCCGGTTCTCCTCGATGCCGGTGGAAGTTCCGGAACTGGCCGACCGCACCGTGGTGGTCAACGGGGTGGCCAAGACCTACGCCATGACCGGGTGGCGGGTCGGCTGGCTCATCGGCCCCGTGGACGTGGTCAAGGCTGCGACCAACCTGCAGTCGCACGCCACCTCCAATGTGGCCAACGTCTCGCAGGCCGCGGCTCTGGCAGCGGTCTCCGGCGACCTGTCGGCCGTGGAGGAGATGAAGCAGGCCTTCGACCGGCGGCGGCAGACCATTGTGCGGATGCTCAACGAGATCCCCGGTGTGGTGTGCCCCGAGCCCCAGGGCGCGTTCTACGCCTACCCGTCGGTCAAGGAGATCCTCGGCAAGGAGATCCGCGGTCAGCGTCCGCAGACCTCCAGCGAGCTGGCGTCGCTGATCCTGGAGCACGCCAAGGTCGCGGTGGTCCCGGGCGAGGCGTTCGGCACTCCGGGCTACCTGCGGTTGTCCTACGCGTTGAGCGACGCCGATCTGGTCGAAGGGGTCAGCCGGATCGCCAAGCTGCTGAGCGAAGCCCACTGA

  • >pQR1748

MTDRPRISARIGGISESATLAVDAKAKALKAAGHPVIGFGAGEPDFPTPDYIVEAAVAACRDSRFHRYTPAGGLPELKEAIAAKTLRDSGYRVEPNQVLVTNGGKQAIYEAFATLLDPGDEVIVIAPYWTTYPESIRLAGGTPVYVVTDESTGYLATVEQLEAARTDRTKVLLFVSPSNPTGAVYSPEQVREIGRWALEHNLWVLTDEIYEHLVYGDARFSSMPVEVPELADRTVVVNGVAKTYAMTGWRVGWLIGPVDVVKAATNLQSHATSNVANVSQAAALAAVSGDLSAVEEMKQAFDRRRQTIVRMLNEIPGVVCPEPQGAFYAYPSVKEILGKEIRGQRPQTSSELASLILEHAKVAVVPGEAFGTPGYLRLSYALSDADLVEGVSRIAKLLSEAH

  • >pQR1749

ATGGTATCCAGGAGAATATCAGAGATTCCCATATCGAAAACCATGGAACTCGACGCGAAGGCCAAAGCCCTCATAAAAAAGGGAGAAGACGTGATCAATCTAACGGCTGGTGAGCCGGATTTTCCCACACCGGAACCCGTCGTGGAAGAAGCGGTGAGATTTCTCCAGAAAGGAGAAGTGAAATACACAGATCCTCGTGGTATCTACGAACTCAGAGAGGGTATAGCGAAAAGGATAGGCGAGAGATACAAAAAAGATATCTCACCGGATCAGGTCGTGGTGACGAATGGAGCGAAACAGGCTCTGTTCAATGCTTTCATGGCCCTTCTCGATCCCGGTGACGAAGTGATCGTGTTTTCTCCCGTCTGGGTCAGCTACATTCCTCAGATCATCCTTGCTGGTGGCACGGTGAACGTGGTTGAGACGTTCATGAGTAAAAATTTCCAGCCCAGTCTGGAAGAGGTGGAAGGGCTTCTTGTTGGGAAAACGAAAGCCGTTCTTATCAACTCGCCGAACAATCCCACTGGTGTGGTGTACAGAAGAGAGTTCCTTGAAGGACTTGTGAGACTTGCCAAGAAGAGGAATTTCTACATAATCAGCGACGAAGTCTACGATTCCCTTGTTTACACGGATGAATTCACATCGATACTCGATGTTTCTGAAGGATTCGACCGGATAGTTTACATAAACGGCTTCTCGAAGTCTCACTCCATGACCGGCTGGAGGGTGGGTTACCTGATATCGAGCGAAAAAGTAGCGACCGCTGTGTCGAAGATCCAGTCTCACACCACCTCCTGTATCAACACGGTAGCACAGTACGCCGCCTTGAAGGCTCTGGAAGTGGACAACTCTTACATGGTTCAGACCTTTAAAGAAAGAAAAAATTTCGTGGTGGAAAGATTGAAAAAGATGGGTGTTAAGTTCGTGGAACCAGAAGGTGCGTTCTACCTCTTTTTCAAAGTCCGGGGTGACGATGTGAAATTCTGTGAAAGGCTCCTCGAAGAAAAGAAGGTTGCACTCGTTCCAGGATCCGCTTTTCTGAAGCCTGGATTTGTGAGGCTTTCTTTTGCCACATCTATAGAAAGACTTACGGAGGCGCTGGATAGAATTGAAGACTTCCTCAATTCTCGTTGA

  • >pQR1749

MVSRRISEIPISKTMELDAKAKALIKKGEDVINLTAGEPDFPTPEPVVEEAVRFLQKGEVKYTDPRGIYELREGIAKRIGERYKKDISPDQVVVTNGAKQALFNAFMALLDPGDEVIVFSPVWVSYIPQIILAGGTVNVVETFMSKNFQPSLEEVEGLLVGKTKAVLINSPNNPTGVVYRREFLEGLVRLAKKRNFYIISDEVYDSLVYTDEFTSILDVSEGFDRIVYINGFSKSHSMTGWRVGYLISSEKVATAVSKIQSHTTSCINTVAQYAALKALEVDNSYMVQTFKERKNFVVERLKKMGVKFVEPEGAFYLFFKVRGDDVKFCERLLEEKKVALVPGSAFLKPGFVRLSFATSIERLTEALDRIEDFLNSR

  • >pQR1751

ATGGCACCTGACCTGCGCCACCTGCACACCTTCGGCGAACTGGATCCGCCGCAACGCCTGTTGATGGGCCCCGGCCCGGTCAATGCGCATCCACGCGTGCTGCGTGCGATGGCGGCCGACCTGCTTGGCCAGTTCGACCCGGAAATGACCACCTACATGAACGAGGTGATGGCGCTGTACCGCCCCTTGTTCGGCACCCAGAACCGCTGGACCTTTCTGGTCGATGGCACGGCGCGCGCCGGCATCGAAGCCGCGCTGGTGTCGCTGGTGCAGCCGGGCGACCGTGTGCTGGTGATCAACTTCGGCCGCTTCGGTTTGTTGCTGACCGAAATCCTTGGCCGGCTCGGCGCCGACGTCCACACCGTGGATGCGCCGTGGGGCGAGGTGGTGCCGCTGGCGGCGATTGCCGAGGCGATCGCAAGCGTGGCACCCAAGCTGGTGGCCACCGTGCACGGCGACACCTCCACCACCATGGCGCAGCCGCTCGATGGCCTAGGCGCGCTATGCCGGGCGGCCGGCGCGCTGAGTTACGTAGACGCCACAGCCACCATCGGCGGCATGGACATCGCCAGCGACCGCTGGGAGGTGGACGTGGTCACCGCGGGGCTGCAGAAATGCCTGGGCGGGCCGTCCGGCTCGGCGCCGATCACTGTCTCTGCCGCGGCAGCGGAGGCGATCTTTGCGCGGCGGCATGTCGAACGCGGCATCGTGCGCGAGGACATCGCCAACGGCAGCGGCCCACGCATCGCCTCGAATTATTTCGACCTGGCGATGATCATGGATTACTGGTCCGACAAGCGCCTCAATCACCACACCGAAGCCACCACCATGCTGTACGGCGCGCGCGAATGCGCACGCGTGGCCTTGCAGGAAGGCCTGGAGGCGCGCTACGCCCGGCATGCGGCTGCCGGCCGCGCGGTCAGCGCCGGCGTGCGCGCACTGGGGCTGGAGGTGTTCGGCGACGATGCGCACCGCATGAGCAATGTCACCGGCGTGGTGATCCCGCACGGCGTCGACAGTGAAGCAGTGCGGCGGCGCATGCGCGAGGATTTCGAAATCGAGATCGGCACCGCGTTCGGCCCGCTGCAAGGCAGGATCTGGCGCATCGGTGCGATGGGCTACAACGCGATGAAGCACAAGGTGCTGCTCACCCTGGCCGCACTGGAAGCGGTGCTGCGCGCCGAGGGCTACGCGTGCACCCAAGGCCTGGCGGTCGAAGCCGCACGCGCCGCCTGGCATGCGGAGCCGGCTGCATGA

  • >pQR1751

MAPDLRHLHTFGELDPPQRLLMGPGPVNAHPRVLRAMAADLLGQFDPEMTTYMNEVMALYRPLFGTQNRWTFLVDGTARAGIEAALVSLVQPGDRVLVINFGRFGLLLTEILGRLGADVHTVDAPWGEVVPLAAIAEAIASVAPKLVATVHGDTSTTMAQPLDGLGALCRAAGALSYVDATATIGGMDIASDRWEVDVVTAGLQKCLGGPSGSAPITVSAAAAEAIFARRHVERGIVREDIANGSGPRIASNYFDLAMIMDYWSDKRLNHHTEATTMLYGARECARVALQEGLEARYARHAAAGRAVSAGVRALGLEVFGDDAHRMSNVTGVVIPHGVDSEAVRRRMREDFEIEIGTAFGPLQGRIWRIGAMGYNAMKHKVLLTLAALEAVLRAEGYACTQGLAVEAARAAWHAEPAA

  • >pQR1752

ATGGGAAAGTTTCTTAAGAAACACTACATAATGGCACCTGGACCAACACCAGTCCCAAACGATATTTTAACAGAAGGAGCGAAGGAAACAATACACCACAGAACACCTCAGTTTGTTTCCATAATGGAAGAGACCCTCGAAAGTGCAAAGTACATCTTTCAGACAAAACACAACGTGTACGCCTTTGCTTCCACAGGAACTGGCGCTATGGAAGCGGCGGTGGCGAATCTTGTGAGCCCTGGAGACAAAGTGATCGTGGTTGTGGCTGGAAAGTTCGGTGAAAGATGGAGAGAGCTCTGTCAGGCTTACGGTGCTGATATCGTAGAAATCGCCCTCGAATGGGGAGACGCGGTCACACCTGAACAGATCGAAGAGGCTCTCAACAAAAACCCCGATGCGAAGGTCGTCTTCACCACCTACAGTGAAACATCGACGGGTACAGTCATAGACCTCGAAGGAATTGCCAGAGTCACGAAGGAAAAAGACGTTGTTCTTGTGACAGACGCTGTCAGCGCTCTTGGAGCAGAACCACTGAAGATGGATGAATGGGGTGTGGATCTCGTTGTCACAGGTTCACAGAAGGGTTTGATGTTACCTCCAGGACTGGCGCTCATCTCTCTCAACGACAAAGCGTGGGGGCTCGTGGAAAAATCCAGATCTCCAAGGTACTACTTCGATCTGAGGGCCTACAGGAAATCTTACCCCGACAATCCTTACACCCCCGCAGTAAACATGATATACATGTTGAGAAAGGCTCTTCAGATGATAAAAGAGGAAGGCATAGAAAACGTATGGGAAAGGCACAGAATACTGGGAGACGCAACAAGAGCAGCGGTGAAAGCACTTGGACTGGAACTCCTCTCGAAAAGACCGGGAAACGTTGTAACAGCCGTGAAAGTGCCTGAGGGCATCGATGGAAAACAGATTCCCAAGATCATGAGAGACAAGTACGGTGTGACCATCGCCGGTGGACAGGCTAAACTCAAGGGAAAAATATTCAGGATAGCACACCTCGGATACATGTCACCTTTCGACACCATAACTGCCATTTCCGCTCTTGAATTAACCTTGAAGGAACTCGGTTATGAGTTCGAACTCGGAGTCGGTGTTAAGGCAGCCGAAGCTGTCTTCGCTAAAGAATTCATTGGGGAGTGA

  • >pQR1752

MGKFLKKHYIMAPGPTPVPNDILTEGAKETIHHRTPQFVSIMEETLESAKYIFQTKHNVYAFASTGTGAMEAAVANLVSPGDKVIVVVAGKFGERWRELCQAYGADIVEIALEWGDAVTPEQIEEALNKNPDAKVVFTTYSETSTGTVIDLEGIARVTKEKDVVLVTDAVSALGAEPLKMDEWGVDLVVTGSQKGLMLPPGLALISLNDKAWGLVEKSRSPRYYFDLRAYRKSYPDNPYTPAVNMIYMLRKALQMIKEEGIENVWERHRILGDATRAAVKALGLELLSKRPGNVVTAVKVPEGIDGKQIPKIMRDKYGVTIAGGQAKLKGKIFRIAHLGYMSPFDTITAISALELTLKELGYEFELGVGVKAAEAVFAKEFIGE

  • >pQR1755

ATGACTCAGATTTTTAATTTTAGCGCCGGTCCAGCAATGCTGCCGGTTGAAGTACTGCGTCGTGCTGAACAGGAATTGTGTAATTGGAATGGCCTGGGCACATCGGTTATGGAAATCAGCCACCGCAGTAAAGAGTTTATGCAGGTTGCCGCTGAATCCGAACAGGATCTGCGTGATTTGCTGAAAATCCCCTCCAACTACAAAGTGCTCTTTTGCCACGGCGGTGCTCGTGCGCAATTCGCCGCAGTGCCGTTAAATCTTCTGGGCGAACGCTCAACGGCCGACTACATCGACGGCGGGTATTGGGCGCACAGCGCAATCAATGAAGCAGAAAAATACTGCACGCCTAACGTGATTGACGTGAAAATGCGCGTGGGCGAACTGCGTGGCATTAAGCCGATGCGTGAATGGAAATTGTCTGATGACGCGGCGTTTGTGCATTACTGCCCGAATGAAACCATCGACGGTATTGCGATCGAAGAAGAGCCGGACTTTGGCGATAAAATTGTGGTCGCCGACTATTCTTCCAGCATCCTGTCTCGTCGTATTGATGTCAGCCGTTACGGCGTGATCTATGCCGGTGCGCAGAAAAATATCGGCCCTGCCGGCCTGACGCTGGTTATCGTACGTGAAGATTTGCTGGGCAAGGCGCGCCGTGAGCTGCCATCGATTCTGGATTACCAGGTTCTGGCGGACAATGACTCCATGTTTAACACGCCACCGACCTTTGCCTGGTACCTGTCCGGTATGGTCTTCAAATGGCTGAAAGAGTACGGCGGTCTGGCTGAAATGGAAAAACGTAACCAGGAGAAGGCTGACCTGCTGTATAGCGCGATTGACGGTAACGATTTCTATCGTAATGACGTTGCGGTAGCGAACCGTTCTCGCATGAATGTGCCATTCCTGTTGGCGGATTCTGCGCTGGATAAAGTCTTCCTGGAAGAATCAGTCGCTGCAGGTCTGCACGCGCTGAAAGGCCATCGCGTAGTAGGCGGCATGCGTGCCTCTATCTACAATGCGATGCCGTTGGAAGGCGTGAAAGTGCTGACGGAATTTATGGCTGACTTCGCTCGTCGCCACGGTTGA

  • >pQR1755

MTQIFNFSAGPAMLPVEVLRRAEQELCNWNGLGTSVMEISHRSKEFMQVAAESEQDLRDLLKIPSNYKVLFCHGGARAQFAAVPLNLLGERSTADYIDGGYWAHSAINEAEKYCTPNVIDVKMRVGELRGIKPMREWKLSDDAAFVHYCPNETIDGIAIEEEPDFGDKIVVADYSSSILSRRIDVSRYGVIYAGAQKNIGPAGLTLVIVREDLLGKARRELPSILDYQVLADNDSMFNTPPTFAWYLSGMVFKWLKEYGGLAEMEKRNQEKADLLYSAIDGNDFYRNDVAVANRSRMNVPFLLADSALDKVFLEESVAAGLHALKGHRVVGGMRASIYNAMPLEGVKVLTEFMADFARRHG

Acknowledgements

The authors are grateful to the UK Engineering and Physical Sciences Research Council (EPSRC) for financial support of this work (EP/K014897/1). Furthermore, we acknowledge the advice and support from the project Industrial Advisory Board.

Footnotes

Transparency document

Transparency document associated with this article can be found in the online version at 10.1016/j.dib.2018.05.140.

Transparency document. Supplementary material

Supplementary material

mmc1.docx (12.4KB, docx)

.

References

  • 1.Bawn M., Subrizi F., Lye G.J., Sheppard T.D., Hailes H.C., Ward J.M. One-pot, Two-step Transaminase and Transketolase Synthesis of L-gluco-heptulose from L-arabinose. Enzyme Microb. Technol. 2018;116:16–22. doi: 10.1016/j.enzmictec.2018.05.006. [DOI] [PubMed] [Google Scholar]
  • 2.Cárdenas-Fernández M., Bawn M., Hamley-Bennett C., Bharat P.K.V., Subrizi F., Suhaili N., Ward D.P., Bourdin S., Dalby P.A., Hailes H.C., Hewitson P., Ignatova S., Kontoravdi C., Leak D.J., Shah N., Sheppard T.D., Ward J.M., Lye G.J. An integrated biorefinery concept for conversion of sugar beet pulp into value-added chemicals and pharmaceutical intermediates. Faraday Discuss. 2017;346:183–190. doi: 10.1039/c7fd00094d. [DOI] [PubMed] [Google Scholar]
  • 3.Subrizi F., Cárdenas-Fernández M., Lye G.J., Ward J.M., Dalby P.A., Sheppard T.D., Hailes H.C. Transketolase catalysed upgrading of L-arabinose: the one-step stereoselective synthesis of L-gluco-heptulose. Green Chem. 2016;18:3158–3165. [Google Scholar]
  • 4.Seliwanoff T. Notiz iiber eke Fruchtzucker- reaction. Chem. Ber. 1887;20:181–182. [Google Scholar]
  • 5.N.C. for B.I. National Library of Medicine (US), National Center for Biotechnology Information (NCBI), 1988. 〈https://www.ncbi.nlm.nih.gov/〉.
  • 6.Bateman A., Martin M.J., O’Donovan C., Magrane M., Alpi E., Antunes R., Bely B., Bingley M., Bonilla C., Britto R., Bursteinas B., Bye-AJee H., Cowley A., Da Silva A., De Giorgi M., Dogan T., Fazzini F., Castro L.G., Figueira L., Garmiri P., Georghiou G., Gonzalez D., Hatton-Ellis E., Li W., Liu W., Lopez R., Luo J., Lussi Y., MacDougall A., Nightingale A., Palka B., Pichler K., Poggioli D., Pundir S., Pureza L., Qi G., Rosanoff S., Saidi R., Sawford T., Shypitsyna A., Speretta E., Turner E., Tyagi N., Volynkin V., Wardell T., Warner K., Watkins X., Zaru R., Zellner H., Xenarios I., Bougueleret L., Bridge A., Poux S., Redaschi N., Aimo L., ArgoudPuy G., Auchincloss A., Axelsen K., Bansal P., Baratin D., Blatter M.C., Boeckmann B., Bolleman J., Boutet E., Breuza L., Casal-Casas C., De Castro E., Coudert E., Cuche B., Doche M., Dornevil D., Duvaud S., Estreicher A., Famiglietti L., Feuermann M., Gasteiger E., Gehant S., Gerritsen V., Gos A., Gruaz-Gumowski N., Hinz U., Hulo C., Jungo F., Keller G., Lara V., Lemercier P., Lieberherr D., Lombardot T., Martin X., Masson P., Morgat A., Neto T., Nouspikel N., Paesano S., Pedruzzi I., Pilbout S., Pozzato M., Pruess M., Rivoire C., Roechert B., Schneider M., Sigrist C., Sonesson K., Staehli S., Stutz A., Sundaram S., Tognolli M., Verbregue L., Veuthey A.L., Wu C.H., Arighi C.N., Arminski L., Chen C., Chen Y., Garavelli J.S., Huang H., Laiho K., McGarvey P., Natale D.A., Ross K., Vinayaka C.R., Wang Q., Wang Y., Yeh L.S., Zhang J. UniProt: the universal protein knowledgebase. Nucleic Acids Res. 2017;45:D158–D169. doi: 10.1093/nar/gkw1099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Larkin M.A., Blackshields G., Brown N.P., Chenna R., Mcgettigan P.A., McWilliam H., Valentin F., Wallace I.M., Wilm A., Lopez R., Thompson J.D., Gibson T.J., Higgins D.G. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–2948. doi: 10.1093/bioinformatics/btm404. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

mmc1.docx (12.4KB, docx)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES