Different Evolutionary Modifications as a Guide to Rewire Two-Component Systems

Beate Krueger; Torben Friedrich; Frank Förster; Jörg Bernhardt; Roy Gross; Thomas Dandekar

doi:10.4137/BBI.S9356

2012 May 3;6:97–128. doi: 10.4137/BBI.S9356

PMCID: PMC3348925 PMID: 22586357

Abstract

Two-component systems (TCS) are short signalling pathways that occur mainly in prokaryotes. They frequently regulate prokaryotic stimulus responses and are therefore of interest for engineering in biotechnology and synthetic biology. The aim of this study is to better understand and describe the rewiring of TCS by investigating different evolutionary scenarios. Based on large-scale screens of TCS in different organisms, this study provides detailed data, concrete alignments, and structure analysis for three general modification scenarios in which TCS were rewired for new responses and functions: (i) sequence exchanges within single TCS domains; (ii) exchange of whole TCS domains; (iii) addition of new components that modulate TCS function. As a result, the replacement of stimulus and promoter cassettes to rewire TCS is well defined when the alignments given here are exploited. The diverged TCS examples are non-trivial, and their design is challenging. Designed connector proteins may also be useful for modifying TCS in selected cases.

Keywords: histidine kinase, engineering, promoter, sensor, response regulator, synthetic biology, sequence alignment, connector, Mycoplasma

Introduction

A key mechanism used by bacteria for sensing their environment is based on two-component systems (TCS). These systems typically consist of a sensor protein with a membrane-bound histidine kinase domain (HisKA) and a corresponding regulator protein with a response regulator domain (RR). The sensor protein detects specific changes in the environment and subsequently binds adenosine triphosphate (ATP). This causes a structural change of the sensor protein and, after autophosphorylation at a histidine residue, evokes phosphotransfer to the corresponding response regulator. The response regulator then changes its structure and mediates a cellular response.¹ The standard structure of TCS is well conserved.²,³ Several databases describe different aspects of TCS.⁴–⁷ Mutational analyses of individual TCS components have been described in previous reports.⁸,⁹ Design, rewiring, and modification of TCS have been studied for a long time, including efforts in biotechnology.¹⁰–¹⁶ Still, successfully engineering TCS remains a major challenge, as direct design attempts only work well for controlled cases and evolutionarily short distances.¹⁷ On closer inspection, information on individual functional sites and sequences is often lacking for specific cases. Therefore, we looked closely at evolutionary changes in TCS in order to create a more solid basis for future design attempts. In synthetic biology, rewiring TCS allows the construction of synthetic networks.¹⁸ For this, the exchange of TCS promoters, the partial or full replacement of sensor and regulator, and the addition of further components are key.¹⁹ The specific motifs involved and the overall topology of the system determine the observed switching behavior.²⁰

Consequently, the aim of this study is to describe and review evolutionary scenarios as a guide to rewire two-component systems.

Taking a large-scale screen of available TCS from various databases as our basis (see Supplementary material), we considered three general scenarios ranging from local to more global changes of TCS: (i) Individual amino acid changes. These lead to direct sequence changes of sensors and regulators, e.g., changing the specificity for a stimulus or allowing the regulation of new genes. (ii) More radical changes such as domain swapping. We performed large-scale screens and identified events in which such exchanges lead to a change in the overall function of a TCS. This can be exploited for more drastic engineering strategies, whose outcome is otherwise very difficult to predict. (iii) Modifications that do not interfere with the sensor or regulator of the TCS itself. Additional proteins or domains, so-called connectors, interact with one or both of them. This again modulates the output and performance of the TCS. Starting from a known event (SafA in Escherichia coli), we consider further proteins that could have such connector functions and examine their potential to change TCS function.

Results and Discussion

We screened various databases for TCS and their modifications. Table S1 in the Supplementary material illustrates this with a screen listing the most frequently occurring contexts in which histidine kinase or response regulator domains were found. The databases we screened include, among others, the protein family database Pfam,²¹ the protein database UniProt,²² and further repositories such as MIST2,⁴ SENTRA,⁶ and P2CS.⁷ Furthermore, there are numerous sensors with periplasmic, membrane-embedded, and cytoplasmic sensor domains and a great diversity of regulator protein contexts.

TCS rewiring by changing residues in sequences

Sequence mutations change sensors and regulators, for instance the specificity of the stimulus recognized or the genes regulated. To gain concrete information useful for engineering, we looked closely at sequences from several bacterial model organisms, focusing especially on the stimulus recognition site and on the DNA and promoter binding sites. Annotated information on these signatures is often not available, so their identification relies on detailed manual annotation as well as sequence comparisons. We revalidated predictions by extensive sequence-structure comparisons (for more information, see Supplementary material).

TCS stimulus signatures

We annotated several stimulus recognition sites in different model organisms (E. coli 536, E. coli CFT073, E. coli K12 W3110, E. coli O157:H7 EDL933, E. coli K12 MG1655, E. coli O157:H7 Sakai pO157, E. coli UTI89, Salmonella, Bacillus subtilis, Staphylococcus aureus, Legionella pneumophila, Listeria monocytogenes, Pseudomonas aeruginosa, and Mycoplasma pneumoniae) and for different stimuli (Table 1A; phosphor, iron, copper, osmotic, stress, citrate, fumarate and nitrate/nitrite;²³–²⁵ for sequence, genome and domain analysis, see Materials and methods). Table 1A shows the best consensus derived. However, for concrete engineering experiments and for detection in new genomes, the signatures themselves are important; they are given in detail, summarizing all investigated sequences, and can be used directly for engineering. Detailed alignments are given in Supplementary material, section 1.2.

Table 1A. Stimulus recognition consensus sequences for various TCS stimuli.

Stimulus	No. of sequences	Position	Recognition sequence¹
Phosphor	1	29–32	GYLP
Osmotic	4	36–158	NFAILPSLQQFNKVLAYEVRMLMTDKLQLEDGTQLVVPPAFRREIyrelgISLYTNEAAEEAGLRWAQHYEFLSHQMAQQLGGPTEVRVEVNKSSPVVWLKTWLSPNIWVRVPLTEIHQGDFS
Stress	6	25–135	LVYKFTAERAGRQSLDDLMNSSLYLMRSELREIPPHDWGKTLKEmdlnlsfdlrveplskyhlddismhrlrggeivALDDQYTFIQRIPRSHYVLAVGPVPYLYYLHQMr
Iron	6	35–64	HESTEQIQLFEQALRDNRNNDRHIMREIRE
Copper	3	37–86	HSVKVHFAEQDINDLKEISATLERVLNHPDETQARRLMTLEDIVSGYSNVLISLADSHGKTVYHSPGAPDIREFARDAIPDKDARGGEVFLLSGPTMMMPGHGHGHMEHSNWRMISLPVGPLVDGKPIYTLYIALSIDFHLHYINDLMNK
Citrate	4	43–182	asfedyltlhvrdmamnqakiiasndsvisavktrdykrlatianklQRDTDFDYVVIGDRHSIRLYHPNPEKIGYPMQFTKPGALEKGESYFITGKGSMGMAMRAKTPIFDDDGKVIGVVSIGYLVSKIDSWRAEFLLP
Fumarate	4	42–181	SQISDMTRDGLANKALAVARTLADSPEIRQGLQKKPQESGIQAIAEAVRKRNDLLFIVVTDMHSLRYSHPEAQRIGQPFKGDDILKALNGEENVAINRGFLAQALRVFTPIYDENHISKAQIGVVAIGLELSRVtqqindsrw
Nitrate/Nitrite	8	38–151	sslrDAHAINKAGSLRMQSYRLGYDLPSGEPDKNAHRQMFQQAlhspvltnlnvwyvpeavkTRYAHRNANWDGMNNRLQGGDDPWYNENIPNYMNQQDRFTLALDHYQerkqffec
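
As a minimal sketch of how a signature from Table 1A could be used to screen a new protein sequence, the following Python snippet reports every exact, case-insensitive occurrence of a signature with 1-based positions. The function name `find_signature` and the toy sequence are illustrative assumptions and not part of the study; a realistic screen would normally rely on alignment-based searches that tolerate substitutions.

```python
from typing import List, Tuple


def find_signature(protein: str, signature: str) -> List[Tuple[int, int]]:
    """Return 1-based (start, end) positions of every exact signature match."""
    seq = protein.upper()
    sig = signature.upper()
    hits = []
    start = seq.find(sig)
    while start != -1:
        hits.append((start + 1, start + len(sig)))
        # Continue one residue further so overlapping matches are also found.
        start = seq.find(sig, start + 1)
    return hits


# Hypothetical toy sequence (not a real protein): 28 alanines followed by the
# PhoR signature GYLP, so the match falls at positions 29-32 as in Table 1A.
toy = "A" * 28 + "GYLP" + "A" * 10
print(find_signature(toy, "GYLP"))  # [(29, 32)]
```

Exact matching is only adequate for short, strictly conserved signatures such as GYLP; for the longer consensus regions, mismatches between organisms are expected.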

Regulated gene	Sequence
OmpC	TTTACATTTTGAAACATCT
OmpF	T[GT][GT][TG]TA[CG][AC][TA][AC]TTT[TC]
OmpF/OmpC	TTT[TA]C-TTTT[TG]
NarG1	1 TACCCATTAA 10
NarG2	1 TAACCAT--- 7
NarG3	1 TAATTAT--- 7
NarG4	1 TACTTTA--- 7
NarG5	1 -AGGGGTA-- 7
NarG6	1 TAGGAAT--- 7
NarG7	TTTAACCCGAtcggggtatg
NarK	TAC[TC][CG][CA]T
CitB	agtAATTTAATTaatt
LytT	[TA][AC][CA]GTTN[AG][TG]
LytT	taaggAAATAAAACTGATTTTcacgtca
AlgR	aaatGAATATTTATTCAAat
GlnG/GlnK	tgcaCCACCATGGTGCA
Spo1	1 ------------TTTGTCGAATGTAA----------- 14
Spo2	1 --AATTTCATTTTTAGTCGAAAAACAGAGAAAAACAT 35
Spo3	1 AAAAGAAGATTTTTCGACAAATTCA------------ 25
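
The bracket-style binding motifs listed above (for example NarK and LytT) are close to regular-expression syntax, so a promoter region can be scanned for them with a few lines of Python. This is a sketch under stated assumptions: `N` is interpreted as any nucleotide, only the given strand is searched, and the helper names `motif_to_regex` and `scan` are hypothetical. The test fragment is the NarL/NarK binding region listed for Salmonella in the binding-site table of this section.

```python
import re


def motif_to_regex(motif: str):
    """Compile a bracket-style DNA motif; N is treated as any nucleotide."""
    return re.compile(motif.upper().replace("N", "[ACGT]"))


def scan(dna: str, motif: str):
    """Return (1-based start, matched text) for each non-overlapping hit."""
    pattern = motif_to_regex(motif)
    return [(m.start() + 1, m.group()) for m in pattern.finditer(dna.upper())]


NARK = "TAC[TC][CG][CA]T"          # NarK motif as listed above
LYTT = "[TA][AC][CA]GTTN[AG][TG]"  # LytT motif as listed above

# NarL/NarK binding region (Salmonella) from the binding-site table.
narl_site = "AatagCCTACTCATTAAGGGTAATaacta"
print(scan(narl_site, NARK))  # [(8, 'TACTCAT')]
```

Motifs that contain alignment gaps (written with `-`) would need separate handling and are not covered by this sketch.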

Family	Regulated gene	Function	Example organism	Sequence
NtrC	GlnH	Transcription factor	Salmonella	GacatTTGCACTTAAATAGTGCACaaccc
NtrC	GlnA	Transcription factor	Salmonella	ttctaTTGCACCAATGTGGTGCTTaatgt cattgAAGCACTATTTTGGTGCAAcatag
NtrC	GlnK	Transcription factor	Salmonella	CcattATGCACCGTCGTGGTGCGTttttc
NtrC	GlnA	Transcription factor	Salmonella	CtataATGCACTAAAATGGTGCAAccttt
NarL	NarK	Transcription factor	Salmonella	AatagCCTACTCATTAAGGGTAATaacta
NtrC	GlnG	Transcription factor	Shigella flexneri	CtataATGCACTAAAATGGTGCAAcctgt
ArgR	ArgA	Transcription factor	Salmonella	actaaTTTCGAATAATAATTCACTAgtggg
ArgR	ArgC	Transcription factor	Salmonella	cgttaATGAATAAAAATACATaatta

Response regulator protein	Regulated gene	Repetition	Distance [NS]
Citrate utilization protein B (CitB)	Citrate lyase (CitC)	6	40
Nitrogen regulation protein (NtrC)	Glutamine synthetase (GlnA)	2	63
Nitrogen regulation protein (NtrC)	Nitrogen regulator protein (GlnK)	7–12	Variable
Nitrate/Nitrite response regulator protein (NarL)	Respiratory nitrate reductase (NarG)	Variable	Ca. 6
Nitrate/Nitrite response regulator protein (NarL)	Nitrite extrusion protein (NarK)	Variable	Variable
Osmolarity response regulator (OmpR)	Outer membrane protein C and F (OmpC/OmpF)	3	7

Family	Identification	Stimulus	Sensor²	Regulator²	Strain	Function
(A) L. pneumophila str. Philadelphia¹
OmpR	Iterative sequence searches with cut-off e-30 using OmpR sequences from Enterobacter cloacae	Mg starvation	QseC; GI:52841522; known/annotated by PMID 15448271	GI:52841523, potentially similar to QseB	Philadelphia 1	Regulated protein FliC; GI:52841570; flagella regulation
NarL	Iterative sequence searches with cut-off e-30 using NP_288375 from E. coli O157:H7 str. EDL933	Carbon	BarA; GI:52842130; known/annotated by PMID 15448271	GI:52842852, potentially similar to UvrY	Philadelphia 1	Regulated protein CsrA; GI:52841018; carbon storage regulator
NarL	Iterative sequence searches with cut-off e-30 in E. coli ETEC H10407	Pheromone		GI:52840952, potentially similar to EvgA	Philadelphia 1	Regulated protein EmrY; GI:52841684; antibiotic resistance

Family	Identification	Stimulus	Sensor*	Regulator*	Strain	Function
(B) Listeria monocytogenes³
NarL	Iterative sequence searches with cut-off e-30 in E. coli ETEC H10407		Q4EKW8_LISMO, potentially similar to EvgS	GI:16804553, potentially similar to EvgA	EGD-e	Antibiotic resistance
OmpR	Iterative sequence searches with cut-off e-30 in B. subtilis; the sequences of these proteins were used to search the Listeria genome	Stress	GI:16804620; GI:16803101; potentially similar to CSSS_BACSU	GI:16804621, potentially similar to CSSR_BACSU	EGD-e	Regulated protein HtrA; serine protease
OmpR	PSI-BLAST search in B. subtilis with cut-off e-60; the sequences of these sensors were used to search the Listeria genome	Mg starvation	GI:16803061, potentially similar to ZP_03239257	PhoP; GI:16804539; known/annotated by PMID 11679669	EGD-e	Virulence, antimicrobial peptide resistance
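
The identification column above relies on sequence searches with an E-value cut-off of e-30. A minimal sketch of applying such a cut-off to search results is shown below. It assumes the hits are available in the standard 12-column BLAST tabular layout (query, subject, percent identity, alignment length, mismatches, gap openings, query start/end, subject start/end, E-value, bit score); the source does not state which output format was used, and the function name `filter_hits` and the example lines are invented for illustration.

```python
def filter_hits(lines, max_evalue=1e-30):
    """Keep (query, subject, evalue) for tabular hits at or below the cut-off."""
    kept = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        evalue = float(fields[10])  # column 11 holds the E-value
        if evalue <= max_evalue:
            kept.append((fields[0], fields[1], evalue))
    return kept


# Hypothetical hit lines; identifiers and numbers are invented for illustration.
example = [
    "query1\tsubjA\t62.5\t230\t80\t3\t1\t228\t4\t231\t3e-45\t160",
    "query1\tsubjB\t31.0\t120\t75\t5\t10\t125\t8\t127\t2e-08\t52.1",
]
print(filter_hits(example))  # [('query1', 'subjA', 3e-45)]
```

For the stricter PSI-BLAST screen mentioned in the table, the same function could be called with `max_evalue=1e-60`.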

Domain	Protein	Context	Function
HisKin	Pyruvate dehydrogenase kinase	Glucose metabolism in S. cerevisiae	Inhibits the mitochondrial pyruvate dehydrogenase complex by phosphorylation of the E1 alpha subunit, thus contributing to the regulation of glucose metabolism
HisKin	Adenylate cyclase	Sporulation in some organisms	Stringent response, protein kinases are activated (PKAs)
HisKin	BCKD-kinase	Valine, leucine and isoleucine catabolic pathways in mouse	Catalyzes the phosphorylation and inactivation of the branched-chain alpha-ketoacid dehydrogenase complex, the key regulatory enzyme of the valine, leucine and isoleucine catabolic pathways. Key enzyme that regulates the activity state of the BCKD complex
HisKin	Phytochrome A	Regulatory photoreceptor in Deinococcus	Regulatory photoreceptor which exists in two forms that are reversibly interconvertible by light: the Pr form that absorbs maximally in the red region of the spectrum and the Pfr form that absorbs maximally in the far-red region. Photoconversion of Pr to Pfr induces an array of morphogenic responses, whereas reconversion of Pfr to Pr cancels the induction of those responses. Pfr controls the expression of a number of nuclear genes including those encoding the small subunit of ribulose-bisphosphate carboxylase, chlorophyll A/B binding protein, protochlorophyllide reductase, rRNA, etc. It also controls the expression of its own gene(s) in a negative feedback fashion
Response Reg	Adventurous-gliding motility protein Z	Chemosensory system in Myxococcus	Required for adventurous-gliding motility, in response to environmental signals sensed by the frz chemosensory system. Forms ordered clusters that span the cell length and that remain stationary relative to the surface across which the cells move, serving as anchor points that allow the bacterium to move forward. Clusters disassemble at the lagging cell pole
Response Reg	Adenylate cyclase	Sporulation in some organisms	Stringent response, response regulators are activated
Response Reg	Serine/threonine-protein kinase ppk18	Schizosaccharomyces pombe	Serine/threonine-protein kinase ppk18 plays pivotal roles in cell proliferation and cell growth in response to nutrient status

Protein	Description	Organism	STRING score
NP_310132	Hypothetical protein ECs2105	E. coli O157	0.9 to EvgS
ZP_02799272	Conserved hypothetical protein	E. coli O157	0.9 to EvgS
YP_540723	Hypothetical protein C1714	E. coli UTI89	0.9 to EvgS
NP_837211	Hypothetical protein S1655	S. flexneri	0.76 to EvgS
NP_458304	Putative phosphodiesterase	S. typhi	0.65 to ygiM (put. signal transduction protein)
NP_462516	Putative phosphodiesterase	S. typhimurium	0.61 to lon

Protein with EAL-Domain	Interaction partner¹
>Q21G90_SACD2 Diguanylate cyclase/phosphodiesterase Saccharophagus degradans (full protein with two domains)	Sde_3649 GGDEF family protein; Sde_2537 hypothetical protein; Sde_3232 hypothetical protein; Sde_3313 putative diguanylate phosphodiesterase; Sde_1079 putative diguanylate phosphodiesterase; Sde_3648 Formamidopyrimidine-DNA glycolase; Sde_0078 GGDEF domain protein; Sde_3427 Putative diguanylate cyclase (GGDEF); Sde_3693 res_reg receiver domain protein (CheY-like); Sde_1063 GGDEF family protein
>A6Q1G4_NITSB Signal transduction response regulator Nitratiruptor sp.	dgkA Diacylglycerol kinase; NIS_0211 Putative uncharacterized protein; dnaG DNA primase DnaG; NIS_0567 Putative uncharacterized protein; NIS_0004 Putative uncharacterized protein; NIS_1647 Putative uncharacterized protein; NIS_1732 Putative uncharacterized protein; NIS_0150 Putative uncharacterized protein; NIS_0136 Putative uncharacterized protein
>A1AD34_ECOK1 Putative uncharacterized protein rtn E. coli O1	yedQ hypothetical protein; yaiC Putative uncharacterized protein; ydeH Putative uncharacterized protein ydeH; yeaP Putative uncharacterized protein yeaP; ycdT predicted diguanylate cyclase; yfiN Putative diguanylate cyclase; yneF Putative uncharacterized protein yneF; yeaI Putative uncharacterized protein yeaI; yejA Putative uncharacterized protein yejA; yejB Predicted oligopeptide transporter subunit

Phosphor
>PHOR_ECOLI 29-32 (4) GYLP
Osmotic
>ENVZ_ECOLI 36-158 (123) NFAILPSLQQFNKVLAYEVR MLMTDKLQLEDGTQLVVPP AFRREIYRELGISLYSNEAAE EAGLRWAQHYEFLSHQMAQQ LGGPTEVRVEVNKSSPVVWLK TWLSPNIWVRVPLTEIHQGDFS	>ENVZ_SALTY 36-158 (123) NFAILPSLQQFNKVLAYEVR MLMTDKLQLEDGTQLVVPP AFRREIYRELGISLYTNEAAE EAGLRWAQHYEFLSHQMAQQ LGGPTEVRVEVNKSSPVVWLK TWLSPNIWVRVPLTEIHQGDFS	>Q02EG5_PSEAB 15-117 TLWLVLIVVLFSKALTLVYLLMN EDVIVDRQYSHGAALTIRAFWAA DEESRAAIAKASGLRWVPSSAD QPGEQHWPYTEIFQRQMQMELG PDTETRLRIHQPS
>ENVZ_SALTI 36-158 (123) NFAILPSLQQFNKVLAYEVR MLMTDKLQLEDGTQLVVPP AFRREIYRELGISLYTNEAAE EAGLRWAQHYEFLSHQMAQ QLGGPTEVRVEVNKSSPVVW LKTWLSPNIWVRVPLTEIHQ GDFS	>ENVZ_SHIFL 36-158 (123) NFAILPSLQQFNKVLAYEVR MLMTDKLQLEDGTQLVVPP AFRREIYRELGISLYSNEAAE EAGLRWAQHYEFLSHQMA QQLGGPTEVRVEVNKSSPVV WLKTWLSPNIWVRVPLTEIH QGDFS
Stress
>RSTB_ECOLI 25-135 (111) LVYKFTAERAGKQSLDDLM NSSLYLMRSELREIPPHDWG KTLKEMDLNLSFDLRVEPLS KYHLDDISMHRLRGGEIVAL DDQYTFLQRIPRSHYVLAVG PVPYLYYLHQMR	>B3AUE7_ECO57 25-135 (111) LVYKFTAERAGKQSLDDLM NSSLYLMRSELREIPPHDWG KTLKEMDLNLSFDLRVEPLS KYHLDDISMHRLRGGEIVAL DDQYTFLQRIPRSHYVLAVG PVPYLYYLHQMR	>Q8ZPL6_SALTY 25-135 (111) LVYKFTAERAGRQSLDDLMKSS LYLMRSELREIPPREWGKTLKEM DLNLSFDLRVEPLNHYKLDAATT QRLREGDIVALDDQYTFIQRIPRS HYVLAVGPVPYLYFLHQMR
>Q8XED5_ECO57 25-135 (111) LVYKFTAERAGKQSLDDLM NSSLYLMRSELREIPPHDWG KTLKEMDLNLSFDLRVEPLS KYHLDDISMHRLRGGEIVAL DDQYTFLQRIPRSHYVLAVG PVPYLYYLHQMR	>Q8Z6R8_SALTI 25-135 (111) LVYKFTAERAGRQSLDDLM KSSLYLMRSELREIPPREWG KTLKEMDLNLSFDLRVEPL NHYKLDAATTQRLREGDIVA LDDQYTFIQRIPRSHYVLAV GPVPYLYFLHQMR	>Q83KZ3_SHIFL 25-135 (111) LVYKFTAERAGRQSLDDLMKSS LYLMRSELREIPPREWGKTLKEM DLNLSFDLRVEPLNHYKLDAATT QRLREGDIVALDDQYTFIQRIPRS HYVLAVGPVPYLYFLHQMR
Iron
>BASS_ECOLI 35-64 (30) HESTEQIQLFEQALRDNRNN DRHIMREIRE	>BASS_SALTY 35-64 (30) HESTEQIQLFEQALRDNRNN DRHIMREIRE	>Q8FAU6_ECOL6 38-67 (30) HESTEQIQLFEQALRDNRNNDR HIMREIRE
>B2NQU4_ECO57 38-67 (30) HESTEQIQLFEQALRDNRNN DRHIMREIRE	>Q83PA1_SHIFL 38-67 (30) HESTEQIQLFEQALRDNRNN DRHIMREIRE	>Q8Z1P5_SALTI 38-67 (30) HESTEQIQLFEQALRDNRNNDR HIMREIRE
Copper
>CUSS_ECOLI 37-86 (150) HSVKVHFAEQDINDLKEISA TLERVLNHPDETQARRLMT LEDIVSGYSNVLISLADSQGK TVYHSPGAPDIREFTRDAIPD KDAQGGEVYLLSGPT MMMPGHGHGHMEHSN WRMINLPVGPLVDGKPI YTLYIALSIDFHLHYIND LMNK	>CUSS_ECO57 37-86 (150) HSVKVHFAEQDINDLKEISAT LERVLNHPDETQARRLMTL EDIVSGYSNVLISLADSHGK TVYHSPGAPDIREFARDAIPD KDARGGEVFLLSGPTMMMP GHGHGHMEHSNWRMISLP VGPLVDGKPIYTLYIALSIDF HLHYINDLMNK	>CUSS_ECOL6 37-86 (150) HSVKVHFAEQDINDLKEISATLE RVLNHPDETQARRLMTLEDIVS GYSNVLISLADSHG KTVYHSPGAPDIREFARDAIP DKDARGGEVFLLSGPTMMM PGHGHGHMEHSNWRMISLP VGPLVDGKPIYTLYIALSIDF HLHYINDLMNK
Citrate
>DPIB_ECOLI 43-182 (140) ASFEDYLTLHVRDMAMNQA KIIASNDSVISAVKTRDYKRL ATIANKLQRDTDFDYVVIGD RHSIRLYHPNPEKIGYPMQFT KQGALEKGESYFITGKGSM GMAMRAKTPIFDDDGKVIG VVSIGYLVSKIDSWRAEFLLP	>Q8XBS0_ECO57 43-182 (140) ASFEDYLTLHVRDMAMNQA KIIASNDSVISEVKTRDYKRL ATIANKLQRDTDFDYVVIGD RHSIRLYHPNPEKIGYPMQFT KQGALEKGESYFITGKGSMG MAMRAKTPIFDDDGKVIGV VSIGYLVSKIDSWRAEFLLP	>Q8Z8I7_SALTI 43-182 (140) ASFEDYLASHVRDMAMNQA KIIASNDSIIAAVKNRDYKRL AIIANKLQRGTDFDYVVIGD RHSIRLYHPNPEKIGYPMQFT KPGALERGESYFITGKGSIGM AMRAKTPIFDNEGNVIGVVS IGYLVSKIDSWRLDFLLP
>Q8FJZ9_ECOL6 63-202 (140) ASFEDYLTLHVRDMAMNQA KIIASNDSIISAVKTRDYKRL ATIADKLQRDTDFDYVVIGD RHSIRLYHPNPEKIGYPMQFT KPGALEKGESYFITGKGSIGM AMRAKTPIFDDDGKVIGVVS IGYLVSKIDSWRAEFLLP
Fumarate
>Ecoli_dcsu 42-181 (140) SQISDMTRDGLANKALAVAR TLADSPEIRQGLQKKPQESGI QAIAEAVRKRNDLLFIVVTD MQSLRYSHPEAQRIGQPFKG DDILKALNGEENVAINRGFL AQALRVFTPIYDENHKQIGV VAIGLELSRVTQQINDSRW	>DCUS_ECOL6 42-181 (140) SQISDMTRDGLANKALAVA RTLADSPEIRQGLQKKPQES GIQAIAEAVRKRNDLLFIVVT DMHSLRYSHPEAQRIGQPFK GDDILKALNGEENVAINRGF LAQALRVFTPIYDENHKQIG VVAIGLELSRVTQQINDSRW	>DCUS_SHIFL 42-181 (140) SQISDMTRDGLANKALAVAR TLADSPEIRQGLQKKPQESGI QAIAEAVRKRNDLLFIVVTD MHSLRYSHPEAQRIGQPFKG DDILKALNGEENVAINRGFL AQALRVFTPIYDENHKQIGV VAIGLELSRVTQQINDSRW
>DCUS_ECO57 42-181 (140) SQISDMTRDGLANKALAVAR TLADSPEIRQGLQKKPQESGI QAIAEAVRKRNDLLFIVVTD MQSLRYSHPEAQRIGQPFKG DDILKALNGEENVAINRGFL AQALRVFTPIYDENHKQIGV VAIGLELSRVTQQINDSRW
Nitrate/Nitrite
>NARX_ECOLI 38-151 (114) QGVQGSAHAINKAGSLRMQ SYRLLAAVPLSEKDKPLIKE MEQTAFSAELTRAAERDGQ LAQLQGLQDYWRNELIPAL MRAQNRETVSADVSQFVAG LDQLVSGFDRTTEMRIET	>NARQ_ECOLI 35-146 (112) SSLRDAEAINIAGSLRMQSY RLGYDLQSGSPQLNAHRQL FQQALHSPVLTNLNVWYVP EAVKTRYAHLNANWLEMN NRLSKGDLPWYQANINNYV NQIDLFVLALQHYAERK	>Q8Z4S5_SALTI 35-146 (112) SSLRDAEAINIAGSLRMQSYRLG YDLQSGSPQLNAHRQLFQQALH SPVLTNLNVWYVPEAVKTRYAH LNANWLEMNNRLSKGDLPWYQ ANINNYVNQIDLFVLALQHYAE RK
>NARX_ECO57 38-151 (114) QGVQGSAHAINKAGSLRMQ SYRLLAAVPLSEKDKPLIKE MEQTAFSAELTRAAERDGQL AQLQGLQDYWRNELIPALM RAQNRETVSADVSQFVAGL DQLVSGFDRTTEMRIET	>Q8FF85_ECOL6 40-151 (112) SSLRDAEAINIAGSLRMQSY RLGYDLQSGSPQLNAHRQL FQQALHSPVLTNLNVWYVP EAVKTRYAHLNANWLEMN NRLSKGDLPWYQANINNYV NQIDLFVLALQHYAERK	>Q8ZN78_SALTY 35-146 (112) SSLRDAEAINIAGSLRMQSYRLG YDLQSGSPQLNAHRQLFQQALH SPVLTNLNVWYVPEAVKTRYAH LNANWLEMNNRLSKGDLPWYQ ANINNYVNQIDLFVLALQHYAER
>NARX_SHIFL 38-151 (114) QGVQGSAHAINKAGSLRMQ SYRLLAAVPLSEKDKPLIKE MEQTAFSAELTRAAERDGQL AQLQGLQDYWRNELIPALM RAQNRETVSADVSQFVAGL DQLVSGFDRTTEMRIET	>Q8XBE5_ECO57 35-146 (112) SSLRDAEAINIAGSLRMQSY RLGYDLQSGSPQLNAHRQL FQQALHSPVLTNLNVWYVP EAVKTRYAHLNANWLEMNN RLSKGDLPWYQANINNYVN QIDLFVLALQHYAERK
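
A consensus can be derived from pre-aligned recognition regions such as those listed above by taking the most frequent residue in each column. The sketch below does this in Python and writes a column in lower case when the sequences disagree. This case convention is an assumption made for illustration only and does not necessarily reproduce the convention used in Table 1A; the function name `consensus` is likewise hypothetical. The example uses the first 19 residues of three of the stress-sensor regions listed above.

```python
from collections import Counter


def consensus(seqs):
    """Column-wise majority consensus; variable columns are lower-cased."""
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be pre-aligned to equal length")
    out = []
    for column in zip(*(s.upper() for s in seqs)):
        residue, count = Counter(column).most_common(1)[0]
        # Upper case only if every sequence carries the same residue.
        out.append(residue if count == len(seqs) else residue.lower())
    return "".join(out)


# First 19 residues of the stress-sensor regions listed above.
stress = [
    "LVYKFTAERAGKQSLDDLM",  # RSTB_ECOLI
    "LVYKFTAERAGRQSLDDLM",  # Q8ZPL6_SALTY
    "LVYKFTAERAGRQSLDDLM",  # Q8Z6R8_SALTI
]
print(consensus(stress))  # LVYKFTAERAGrQSLDDLM
```

The sketch expects sequences of equal length without gaps; regions of different length, such as the Pseudomonas osmotic sensor entry, would first have to be aligned.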

Combination of sensor domains	Response regulator domains
HisKA +	HATPase_c +
(n * HAMP + m *
PAS + p * Hpt)¹
HATPase_c	Response_reg * s²
HAMP	Response_reg + GerE
His_kinase + HATPase_c	Response_reg + HTH
HisKA + HATPase_c	Response_reg + LytTR
HWE_HK	Response_reg + HisKA domain
HisKA_2 + HATPase_c	Response_reg + CheB or CheW
HisKA_2	Response_reg + Sigma
HisKA_3	Response_reg + Spo
HisKA	Response_reg + GGDEF
	Response_reg + EAL
	Response_reg + HDOD

Pfam-A	Description	Entry type	Seq start	Seq end	HMM from	To	Bits score	E-value
Response_reg dicdi	Response regulator receiver domain	Domain	2	86	1	80	24.6	2.6e-06
Response_reg AGLZ	Response regulator receiver domain	Domain	2	83	1

Organism	MiST annotation/ScanProsite or SMART count¹
	HisKA	Response reg
E. coli K-12	29/77	31/39
Staphylococcus aureus (STAAN)	18/30	17/285
Listeria monocytogenes (LISMO) EGD	16/56	16/54
Arabidopsis thaliana (ARATH)	16/61	22/285
Zea mays (MAIZE)	20/25	22/44

Organism	Protein ID	Protein name	Score	E-value
E. coli O157	NP_310132.1	Hypothetical protein ECs2105	100	5e-23
E. coli O157	ZP_02799272.2	Conserved hypothetical protein	88.2	2e-19
E. coli UTI89	YP_540723.1	Hypothetical protein UTI89_C1714	97.4	2e-22
Shigella flexneri 2a str. 2457T	NP_837211.1	Hypothetical protein S1655	91.5	2e-17


