Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2011 Nov 1;6(11):e25570. doi: 10.1371/journal.pone.0025570

Protein Domain of Unknown Function 3233 is a Translocation Domain of Autotransporter Secretory Mechanism in Gamma proteobacteria

Ananth Prakash 1, S Yogeeshwari 1, Sanchari Sircar 1, Shipra Agrawal 1,2,*
Editor: Eugene A Permyakov3
PMCID: PMC3206015  PMID: 22073138

Abstract

Vibrio cholerae, the enteropathogenic gram negative bacteria is one of the main causative agents of waterborne diseases like cholera. About 1/3rd of the organism's genome is uncharacterised with many protein coding genes lacking structure and functional information. These proteins form significant fraction of the genome and are crucial in understanding the organism's complete functional makeup. In this study we report the general structure and function of a family of hypothetical proteins, Domain of Unknown Function 3233 (DUF3233), which are conserved across gram negative gammaproteobacteria (especially in Vibrio sp. and similar bacteria). Profile and HMM based sequence search methods were used to screen homologues of DUF3233. The I-TASSER fold recognition method was used to build a three dimensional structural model of the domain. The structure resembles the transmembrane beta-barrel with an axial N-terminal helix and twelve antiparallel beta-strands. Using a combination of amphipathy and discrimination analysis we analysed the potential transmembrane beta-barrel forming properties of DUF3233. Sequence, structure and phylogenetic analysis of DUF3233 indicates that this gram negative bacterial hypothetical protein resembles the beta-barrel translocation unit of autotransporter Va secretory mechanism with a gene organisation that differs from the conventional Va system.

Introduction

Domain of Unknown Function (DUF) 3233 (PFAM: PF11557) is a family of uncharacterised hypothetical proteins conserved among gram negative gammaproteobacteria. Representative members of this domain include marine bacteria from genus Vibrio, Shewanella, Colwellia and Alcanivorax of which Vibrio cholerae, Vibrio parahaemolyticus, Vibrio splendidus and Vibrio vulnificus are pathogenic to human and aquatic life. Vibrio cholerae causes seasonal outbreaks of cholera of epidemic proportions in developing countries with high mortality rates [1]. The enterotoxins produced by the bacteria after colonising the host small intestine disrupts the ion transport by the intestinal epithelial cells causing outflow of large volumes of fluids into the intestine leading to watery diarrhoea, dehydration and in severe cases, death [1] [2].

Significant fraction of genomes of Vibrio species lack structure function annotation and most of these gene products are classified as hypothetical proteins or domains of unknown function [3]. The PFAM [4] database in its 24th release lists about 3000 DUF families. Many of these DUF families are kingdom specific (DUF2883, DUF3328, DUF3329), limited/shared between kingdoms (DUF1497, DUF3609) or restricted/specific to certain organisms (DUF1196, DUF2667). The specific and ubiquitous nature of these domains suggests their functional importance in organism specific niches or a common biological role.

Identifying homologous protein families through sequence based search marks the first step in the annotation of DUFs, providing an initial broad picture of the protein's probable family and function. Sequence homology search becomes increasingly powerful when we advance from normal sequence-sequence based searches to methods that uses profile or HMM information like HHsenser [5], which increases the efficiency of finding remote homologues. In silico structure prediction methods together with sequence similarity detection methods assist the annotation of fold-function space. Fold recognition methods like I-TASSER [6] help predict the 3 dimensional (3D) structure and functions of proteins that share low sequence identity with other known structures.

In this study we analyse the sequence and structural characteristics of DUF3233 using computational approaches and try to infer various properties of this domain. Sequence search by HHsenser identifies similarity with the beta-barrel translocation unit of autotransporter Va secretory proteins. Sequence homology combined with secondary structure prediction indicates a beta-barrel domain of 12 beta-strands. The predicted 3D model from I-TASSER shows the structure with an overall beta-barrel topology with an N terminal helix running along the central barrel axis perpendicular to the 12 antiparallel strands that form the barrel. Amphipathicity and membrane barrel discrimination analysis suggest the domain is a potential outer membrane gram negative beta-barrel protein.

Autotransporter translocation units belong to the transmembrane beta-barrel fold in SCOP database [7], defined by a beta-barrel of 12 to 14 antiparallel strands with an N terminal helix perpendicular to the barrel. Finally with the analysis of genomic context of DUF3233 we could infer that this outer membrane beta-barrel translocation domain has a gene organisation that is not typical of the autotransporter Va secretory mechanism.

Results

Sequence based characterization of DUF3233 as autotransporter β-domain protein

Sequence search for homologues with PSI-BLAST using a representative query, Vibrio cholerae DUF3233 (RefSeq: NP_232949) against the NCBI nr database with a threshold 0.005, reached convergence at the 4th iteration. The resulting sequences identified were hypothetical proteins conserved among gram negative proteobacteria. For improved search and better coverage of homologous sequence space, information from aligned regions of DUF3233 sequences in the form of a multiple sequence alignment profile was queried with HHsenser. From the resulting sequences in the permissive alignment list we were able to infer homology between DUF3233 and the outer membrane beta-barrel translocation domain of autotransporter proteins. DUF3233 shares sequence similarity with outer membrane beta-barrel domain of Ochrobactrum intermedium autotransporter (e-value 1E-34, 95% coverage, 22% identity), Rhizobium leguminosarum adhesin autotransporter (e-value 2E-29, 94% coverage, 18% identity) and Yersinia aldovae AidA adhesin autotransporter (e-value 2E-26, 89% coverage, 18% identity). Interestingly, a number of gram negative hypothetical proteins were picked up as potential homologues, which showed fair amount of similarity to the autotransporter beta-domain (Table S1).

Position specific scoring matrix (PSSM) profile based discrimination analysis using TMBETADISC-RBF [8], predicts the outer membrane beta-barrel nature of DUF3233. To confirm homology with autotransporter protein family we queried DUF3233 sequences against putative outer membrane proteins (OMPs). A pairwise hidden Markov model (HMM) search by HHomp [9] identifies DUF3233 as an OMP. V. cholerae DUF3233 shares homology with HHomp cluster 12.1.6 (96% probability). This cluster comprises profile HMMs of autotransporter sequences whose 12 stranded beta-barrel transmembrane domains conform to the translocation unit of autotransporter NalP [9]. DUF3233 sequences from Colwellia (94% probability), Shewanella (96% probability) and Ferrimonas (95% probability) were all found to share homology with autotransporter protein family.

DUF3233 is a solitary outer membrane autotransporter β-barrel domain

Proteins targeted for transport across membranes posses leader sequence or signal peptide at their N-terminus, which directs translocation. We analysed DUF3233 sequences using a combination of artificial neural networks and HMMs implemented in SignalP [10] to predict the presence and location of signal peptide cleavage sites. SignalP identified the presence of N-terminal signal peptide of an average length of 23 amino acid residues having a positively charged amino terminal followed by a hydrophobic region and hydrophilic carboxy terminal. Signal peptides are cleaved from the exported protein by specific proteases called signal peptidases (SPases) [11]. Prediction of cleavage mechanism of these signal sequences by LipoP [12] identifies SPase 1 target site, indicating DUF3233 might be a non-lipoprotein.

We browsed DUF3233 genomic region of all representative organisms with STRING [13] to look for possible gene fusion events with other domains and found no such occurrence. DUF3233 is a single domain protein found on the small chromosome 2 in Vibrio species with an upstream gamma-glutamyltranspeptidase (GGT) or response regulatory protein transcribed in one potential operon (Table 1). These upstream proteins lack the N-terminal signal sequence for inner membrane transport and analysis through SecretomeP [14] indicates that these proteins are not exported through non-classical secretory system. Gene organisation of DUF3233 therefore suggests a solitary translocation unit with an absent upstream secretory protein.

Table 1. Genomic context of DUF3233: Proteins upstream of DUF3233 in various representative organisms are organised in operons and have a probable pathogenic role.

DUF3233 Gene Organism Upstream protein DOOR Operon ID*
Fbal_1102 F. balearica DSM 9799 OmpA domain protein transmembrane region-containing protein -
VCA0559 V. cholerae O1 biovar El Tor str. N16961 Gamma-glutamyltranspeptidase, putative -
VC0395_0494 V. cholerae O395 Putative gamma-glutamyltranspeptidase 329019
VPA0926 V. parahaemolyticus RIMD 2210633 Putative regulatory components of sensory transduction system 88118
VEA_000122 Vibrio sp. Ex25 Response regulator -
VV2_0321 V. vulnificus CMCP6 Response regulator 80840
VVA0816 V. vulnificus YJ016 Putative regulatory components of sensory transduction system 102681

*For few genomic strains which are not represented in DOOR v2.0, operon ids are left blank.

Structure based validation of DUF3233 as transmembrane β-barrel domain of autotransporter proteins

DUF3233 sequence based PSI-BLAST search for proteins with known structures (72,386 structures in PDB as of April 2011) fetched results with a maximum alignment length covering 51 residues. Secondary structure assignment by PSIPRED [15] predicts an N-terminal α-helix (αN) followed by 12 consecutive β-strands (β1–β12) interspersed by two short turns of α-helices α1 and α2 predicted to occur between β1–β2 and between β5–β6 respectively. With no suitable template with significant sequence homology available, we used the fold recognition algorithm implemented in I-TASSER to predict a 3D model of DUF3233. The translocation unit of NalP from N. meningitidis (PDB: 1UYN_X, 15% identity, 85% coverage, normalised Z-score above 1) was identified by I-TASSER in the top four threading templates to model V. cholerae DUF3233. The predicted structure of V. cholerae DUF3233 (Figure 1) resembles the beta-barrel translocation unit of autotransporter proteins, aligning over 75% structurally equivalent positions with the template and an RMSD of 2.3. The domain has an N-terminal α-helix running along the central axis surrounded by beta-barrel formed by twelve anti-parallel beta-strands. Predicted strand assembly within the outer membrane shows the carboxy and amino terminal of the beta-barrel point towards the periplasmic space, the central helix is oriented such that its N-terminal is pointed towards the external environment. Secondary structure based sequence alignment of DUF3233 with the translocation unit of autotransporters shows a similar domain organisation (Figure 2). Using alignment of DUF3233 sequences the average hydropathy, amphipathicity and similarity plots was generated with AveHAS [16]. Figure 3, shows 12 hydrophobicity and amphipathicity peaks with an average stretch of 10 to 15 residues per peak that may form transmembrane beta-strands.

Figure 1. Predicted 3D model of DUF3233.

Figure 1

A) The predicted structure resembles the beta-barrel translocation unit of autotransporter proteins. The 12 beta-strands (blue) orient in anti-parallel fashion enclosing the N-terminal helix (red), the two additional helices α1 and α2 are shown in green. B) Superimposed structures of DUF3233 (magenta) and NalP translocation unit of N. meningitidis 1uyn_x (blue). Figures were generated with Rasmol.

Figure 2. Sequence alignment of DUF3233 and translocation units of select autotransporters.

Figure 2

Secondary structure diagram for NalP is shown at the top. The two extra helices (α1 and α2) present among DUF3233 proteins are shown in green. DUF3233 and autotransporter sequences representing each cluster are used in the alignment. Nme_Neisseria meningitidis, Pae_Pseudomonas aeruginosa, Eco_Escherichia coli, Sen_Salmonella enterica, Bpe_Bordetella pertussis, Ctr_Chlamydia trachomatis, Hin_Haemophilus influenzae, Rri_Rickettsia rickettsii, Mca_Moraxella catarrhalis, Vch_Vibrio cholerae, Cps_Colwellia psychrerythraea, Slo_Shewanella loihica.

Figure 3. Average hydropathy, amphipathicity and similarity plot of 18 DUF3233 representative sequences.

Figure 3

The plot shows 12 hydrophobic (green) and amphipathic (red) peaks corresponding to possible transmembrane beta-strand forming segments The plot was generated using AveHAS program.

DUF3233 is evolutionarily linked to the autotransporters

To infer evolutionary relation with type V secretory proteins, we analysed DUF3233 representatives with members of both Va and Vb family (Table S2). The third type of proteins found in the type V secretory system, type Vc or AT-2 proteins, which are characterised by trimeric C-terminal beta-barrel [17] were not considered for phylogenetic analysis. The inferred phylogenetic tree (Figure 4) classifies members of the two families into two separate clans. Proteins are grouped into clusters with similar function, architecture and organism type as analysed in [18] and [19]. DUF3233 family sequences though related to autotransporters form a distinct group from the main autotransporter clan indicating that these domains represent new cluster of autotransporters.

Figure 4. Bootstrap phylogenetic tree of DUF3233 with Va and Vb family proteins.

Figure 4

51 Va family (red), 25 Vb family (blue) translocation unit and 3 DUF3233 (green) representative sequences were used to generate a 1000 replicate bootstrap phylogenetic tree. Eukaryotic Vb sequences representing clusters 1 and 2 were not considered in the analysis. Sequences of each family are grouped into previously reported clusters (C 1 to 11 and C 1 to 4); DUF3233 sequences form a new cluster of the autotransporter family. Numbers at the node indicate neighbour-joining bootstrap percentages. Phylogenetic analyses were conducted in MEGA4.

Discussion

A vital part in the survival and adaptation mechanism of bacteria lies in the constant interaction with their extra-cellular environment. Bacteria secrete a wide range of molecules into the extra-cellular milieu that includes enzymes, which break down carbohydrates, proteins and lipids, and virulence factors such as adhesins and toxins by those involved in pathogenesis. Transport of these molecules is mediated by protein complexes through conserved secretory pathways. Of the 6 types of secretory mechanisms known in gram-negative bacteria (type I to type VI), type V represents the simplest transport system. Proteins of the type V secretory system fall under the autotransporter (Va), two partner secretion (Vb) and the AT-2 (Vc) families, and share a similar domain organisation: an N-terminal signal peptide for inner membrane translocation followed by a passenger protein which is normally a virulence determinant and a C-terminal translocation unit for transporting the upstream passenger protein [18].

Proteins of the autotransporter (Va) family were the first to be described [20] and form the largest representation of this system [19]. Autotransporters export a wide range of toxins and enzymes [21] to the cell surface or secrete them into the external environment. The passenger domain and translocation unit of autotransporters are both expressed as a single polypeptide [20] making the translocation unit highly specific and committed for transporting only the upstream passenger. Solved experimental structures of the autotransporter translocation unit [22][24] show that they all possess a similar structure, a beta-barrel of 12 antiparallel strands with a central N-terminal helix running along the barrel axis. Proteins of the two partner secretion (Vb) are widely distributed and follow a similar mode of function, transporting cytolysins, adhesions and metalloproteases [19]. The secreted exoprotein and the transporter unlike the Va proteins are not linked but, are expressed as two separate proteins transcribed in a single operon [25]. Vb transporters are predicted to have a multidomain architecture [26] and a relatively wider barrel made of 16 [27] to 20 [28] beta-strands. The newly discovered AT-2 family (Vc) [29] represents proteins secreted via a homotrimeric mechanism [30]. Proteins secreted through this system are mainly implicated in virulence [31]. With a domain organisation similar to that of autotransporters, the system functions with the coming together of three individual proteins each complete with an N-terminal signal peptide, a passenger unit and four beta-strand domain at the C-terminal which makes a complete closed 12 stranded beta-barrel translocation unit upon trimerisation [31] [32].

The present work describes sequence and structure based characterization of proteobacteria DUF3233 as a beta-barrel transmembrane domain of autotransporter proteins. DUF3233 packs an average 312 amino acid residues (including N-terminal signal peptide) and is currently classified as a domain of unknown function.

DUF3233 is encoded as a single domain protein, homologous to the translocation unit of autotransporters. One aspect of DUF3233 that distinguishes it from other main class autotransporters is that it lacks a covalently linked N-terminal passenger domain, to which C-terminal translocation units of all autotransporters are committed to transport. Few autotransporter representatives of two-polypeptide architecture [19] might suggest the secretion of co-transcribed upstream proteins similar to the TPS system, but considering the cytosolic nature of upstream proteins, extracellular translocation seems unlikely.

Few representative members from the Vibrio genus express DUF3233 and upstream putative GGT or response regulatory proteins in a single operon (Table 1). Over expression of GGT [33] and GGDEF domain proteins [34] [35] are implicated in pathogenesis. The prokaryotic GGT is shown to be a major factor in the colonisation of gut and gastric mucosa [36] [37]. The upstream response regulators are two-domain proteins with an N-terminal CheY-like regulatory and a conserved C-terminal GGDEF effector domain, which is responsible for eliciting pathogenic response through cyclic di-GMP mediated exopolysaccharide synthesis and biofilm formation [38]. Genes encoding virulence products in V. cholerae are organised in clusters or operons [39], and since gene encoding DUF3233 is located among virulent genes, the possibility of the involvement of DUF3233 in pathogenesis cannot be overlooked.

The translocation units of autotransporters exhibit conserved amino acid consensus motif at their carboxy terminus, the barrel closing beta-strand displays alternate arrangement of polar and hydrophobic residues terminating with a conserved aromatic amino acid at the barrel terminus which is usually a phenylalanine or a tryptophan [40]. Hendrixson et al., [41] demonstrated the importance of C-terminal consensus motif on the viability of H. influenzae Hap translocation unit. Deletion of terminal 12 residues proved detrimental to the outer membrane localisation; while the stability and/or outer membrane localisation of the translocation unit was affected with the deletion of all three terminal residues, point mutations of these residues showed no effect on the outer membrane localisation or secretion of the mature protein [41]. DUF3233 displays consensus pattern at its C-terminal that resembles the conserved motif found among autotransporters discussed above (Figure S1). A stretch of alternating polar and hydrophobic residues precedes the terminal beta-strand having a hydrophobic segment and a conserved ‘terminal’ phenylalanine or a tyrosine residue. Interestingly the extreme carboxy terminus harbours three conserved polar residues [N/D][Q/E][D/E] after the ‘terminal’ F/Y. Secondary structure and predicted models of DUF3233 show the hydrophilic residues of the “polar tail” are not part of the terminal beta-sheet, but instead form a short overhang pointed towards the periplasm. As yet, we do not know the significance and possible role of these tail polar residues on the outer membrane localisation and stability.

DUF3233 exhibits certain features that are in common with the translocation units of type Va secretory proteins and yet possesses characteristics that are not typical to the proteins of the above system. DUF3233 represents a translocation unit that is devoid of a secretable passenger unit. Considering its location in the representative genomes alongside other virulence genes, we hypothesize that this domain is involved in pathogenesis. However, the mechanism apparently looks new and different than a typical type Va secretion system.

Our study on the proteobacterial protein DUF3233 with a combination of methods like sequence similarity searches, outer membrane beta-barrel discrimination, phylogenetic analysis, and fold recognition has led us to a consensus at annotating fold and function to this domain.

Sequence similarity search suggests that DUF3233 has remote homology with the translocation unit of autotransporter proteins of the type V secretory system. The domain's outer membrane beta-barrel nature was further emphasised by signal peptide, outer membrane beta-barrel discrimination and amphipathicity analysis. Secondary structure prediction and alignment with translocation units, and inputs from the predicted model suggests that DUF3233 and the translocation unit of autotransporter proteins share a similar domain organisation.

Drawing a consensus from various in silico prediction methods it appears that DUF3233 is a cluster of remote homologues of autotransporters. This is the first report of an autotransporter like protein family in Vibrio species, and though within the realm of bioinformatics we were able to infer its probable family and fold, the pathogenic mechanism still remains to be explored and seeks further experimental studies.

Materials and Methods

Sequence similarity search

20 DUF3233 genes from NCBI comprising one sequence each from Aliivibrio, Colwellia and Ferrimonas, five sequences from Shewanella and twelve from Vibrio species were fetched using their corresponding RefSeq ids. YP_001366070 and YP_001555810 were excluded because of their short domain size and a total of 18 DUF3233 sequences were included in this study. The signal peptides of DUF3233 sequences have not been included in various analyses of this study unless mentioned. PSI-BLAST [42] profile-sequence search was used to probe homologous protein families against NCBI nonredundant (nr) database with default parameters. ClustalW multiple sequence alignment profile of DUF3233 protein sequences was taken in as input for HHsenser to search the nr database with a threshold e-value of 0.001 and default parameters for improved homolog coverage. TMBETADISC-RBF [8], PSSM profile based discrimination of beta-barrel OMPs from other folding types like globular and membrane proteins was used to assess the outer membrane nature of DUF3233. DUF3233 sequences were queried against HHomp database [9] to detect homology with other known OMPs.

Signal peptide and Genomic context analysis

The presence of N-terminal signal peptide and the putative cleavage sites were predicted with SignalP 3.0 [10]. Using LipoP 1.0 [12] the signal peptide sequences were checked for lipoprotein signal peptide signatures that differentiate them from other signal peptides and subsequently cleavage by signal peptidase II from signal peptidase I. SignalP 3.0 and SecretomeP 2.0 [14] were used to determine inner membrane transport of proteins upstream of DUF3233 via Sec dependent or other non classical secretory pathways. STRING (version 8.3) [13] and DOOR (version 2.0) [43] were used for gene neighbourhood and operon analysis.

Structure prediction and transmembrane β-barrel analysis

Secondary structure assignments were made using PSIPRED [15]. 3D structure of V. cholerae DUF3233 (RefSeq: NP_232949) was predicted using I-TASSER fold recognition method [44]. The predicted structure was superimposed with the template using TopMatch [45] and visualised with Rasmol [46]. The WHAT [47] program was used to predict hydropathy and amphipathicity using sliding windows of 13, 15, 17 residues and an angle of 100° for α-helix and 180° for β-strand. Multiple sequence alignment profile was used to plot the average hydropathy, amphipathicity and similarity plot using the AveHAS [16] program. Orientation of the domain within the outer membrane was predicted using the Viterbi method implemented in PRED-TMBB [48]. Secondary structure based sequence alignment of DUF3233 family and with the representative autotransporter translocation units was done with inputs from ClustalX [49], ProbCons [50], Ali2D [51] and the alignment was adjusted manually.

Phylogenetic analysis

Amino acid sequences of translocation unit of autotransporters, and pore forming beta-domain of two-partner secretion (TPS) family members, reported in [18] and [19] were used for phylogeny analysis. Three representative DUF3233 sequences from V. cholerae (RefSeq: NP_232949), C. psychrerythraea (RefSeq: YP_269983) and S. loihica (RefSeq: YP_001095898) along with beta-barrel domain sequences of the Va, Vb secretory systems were analysed with MEGA 4 [52]. Pairwise distances were calculated and the phylogenetic tree of the aligned sequences was generated using minimum evolution method. A bootstrap test of phylogeny was performed with p-distance model on the inferred evolutionary tree and a consensus bootstrap tree was generated from 1000 replicates.

Supporting Information

Figure S1

Sequence alignment of DUF3233 representative sequences. DUF3233 has an average 23 amino acid N-terminal signal sequence (gray) which guides inner membrane translocation. The signal peptidase I cleavage site is marked by a red arrow. Cps_Colwellia psychrerythraea (YP_269983.1), Vfi_Vibrio fischeri (YP_206466.1), Vpa_Vibrio parahaemolyticus (NP_800436.1), Vvu_Vibrio vulnificus (NP_762291.1), Vch_Vibrio cholerae (NP_232949.1), Vsp_Vibrio splendidus (YP_002395311.1), Sba_Shewanella baltica (YP_001052604.1), Slo_Shewanella loihica (YP_001095898.1), Spi_Shewanella piezotolerans (YP_002312093.1), Fba_Ferrimonas balearica (YP_003912385.1).

(TIF)

Table S1

List of DUF3233 homologous sequences. Sequence search by HHsenser (permissive) picked gram-negative and cyanobacterial (Cyn) hypothetical proteins that share significant homology with DUF3233. These hypothetical proteins posses sequence characteristics that apparently resemble the autotransporter beta-domain. # Insignificant Pfam A hits, $ Significant Pfam A hits.

(DOC)

Table S2

List of representative sequences used for phylogeny analysis. C-terminal translocation unit sequences of Autotransporter (Va) [18], Two partner secretion (Vb) [19] proteins and DUF3233 representatives used for phylogenetic analysis.

(DOC)

Acknowledgments

We thank Mr. G. Goldsmith, IBAB for his engaging and fruitful discussions.

Footnotes

Competing Interests: The authors have read the journal's policy and have the following conflicts: Dr. Agrawal is currently affiliated with BioCOS Life Sciences Pvt. Limited. There are no patents, products in development or marketed products to declare. This does not alter the authors' adherence to all the PLoS ONE policies on sharing data and materials, as detailed online in the guide for authors.

Funding: We thank the Department of Information Technology (DIT), Government of India, New Delhi, India for supporting stipends to AP, YS and SS. We also thank BioCOS Life Sciences Private Limited, Bangalore, India (www.biocosls.com) for financial support and technical assistance to Dr. Agrawal. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Faruque SM, Albert MJ, Mekalanos JJ. Epidemiology, genetics, and ecology of toxigenic Vibrio cholerae. Microbiol Mol Biol Rev. 1998;62:1301–14. doi: 10.1128/mmbr.62.4.1301-1314.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Sack DA, Sack RB, Nair GB, Siddique AK. Cholera. Lancet. 2004;363:223–33. doi: 10.1016/s0140-6736(03)15328-7. [DOI] [PubMed] [Google Scholar]
  • 3.Markowitz VM, Korzeniewski F, Palaniappan K, Szeto E, Werner G, et al. The integrated microbial genomes (IMG) system. Nucleic Acids Res. 2006;34:D344–8. doi: 10.1093/nar/gkj024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, et al. The Pfam protein families database. Nucleic Acids Res. 2002;30:276–80. doi: 10.1093/nar/30.1.276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Söding J, Remmert M, Biegert A, Lupas AN. HHsenser: exhaustive transitive profile search using HMM-HMM comparison. Nucleic Acids Res. 2006;34:W374–8. doi: 10.1093/nar/gkl195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zhang Y. I-TASSER server for protein 3D structure prediction. BMC Bioinformatics. 2008;9:40. doi: 10.1186/1471-2105-9-40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Murzin AG, Brenner SE, Hubbard T, Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol. 1995;247:536–40. doi: 10.1006/jmbi.1995.0159. [DOI] [PubMed] [Google Scholar]
  • 8.Ou YY, Gromiha MM, Chen SA, Suwa M. TMBETADISC-RBF: Discrimination of beta-barrel membrane proteins using RBF networks and PSSM profiles. Comput Biol Chem. 2008;32:227–31. doi: 10.1016/j.compbiolchem.2008.03.002. [DOI] [PubMed] [Google Scholar]
  • 9.Remmert M, Linke D, Lupas AN, Söding J. HHomp–prediction and classification of outer membrane proteins. Nucleic Acids Res. 2009;37:W446–51. doi: 10.1093/nar/gkp325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Bendtsen JD, Nielsen H, von Heijne G, Brunak S. Improved prediction of signal peptides: SignalP 3.0. J Mol Biol. 2004;340:783–95. doi: 10.1016/j.jmb.2004.05.028. [DOI] [PubMed] [Google Scholar]
  • 11.Paetzel M, Karla A, Strynadka NC, Dalbey RE. Signal peptidases. Chem Rev. 2002;102:4549–80. doi: 10.1021/cr010166y. [DOI] [PubMed] [Google Scholar]
  • 12.Juncker AS, Willenbrock H, Von Heijne G, Brunak S, Nielsen H, et al. Prediction of lipoprotein signal peptides in Gram-negative bacteria. Protein Sci. 2003;12:1652–62. doi: 10.1110/ps.0303703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Szklarczyk D, Franceschini A, Kuhn M, Simonovic M, Roth A, et al. The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res. 2011;39:D561–8. doi: 10.1093/nar/gkq973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bendtsen JD, Kiemer L, Fausbøll A, Brunak S. Non-classical protein secretion in bacteria. BMC Microbiol. 2005;5:58. doi: 10.1186/1471-2180-5-58. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.McGuffin LJ, Bryson K, Jones DT. The PSIPRED protein structure prediction server. Bioinformatics. 2000;16:404–5. doi: 10.1093/bioinformatics/16.4.404. [DOI] [PubMed] [Google Scholar]
  • 16.Zhai Y, Saier MH., Jr A web-based program for the prediction of average hydropathy, average amphipathicity and average similarity of multiply aligned homologous proteins. J Mol Microbiol Biotechnol. 2001;3:285–6. [PubMed] [Google Scholar]
  • 17.Roggenkamp A, Ackermann N, Jacobi CA, Truelzsch K, Hoffmann H, et al. Molecular analysis of transport and oligomerization of the Yersinia enterocolitica adhesin YadA. J Bacteriol. 2003;185:3735–44. doi: 10.1128/JB.185.13.3735-3744.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Henderson IR, Navarro-Garcia F, Desvaux M, Fernandez RC, Ala'Aldeen D. Type V protein secretion pathway: the autotransporter story. Microbiol Mol Biol Rev. 2004;68:692–744. doi: 10.1128/MMBR.68.4.692-744.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Yen MR, Peabody CR, Partovi SM, Zhai Y, Tseng YH, et al. Protein-translocating outer membrane porins of Gram-negative bacteria. Biochim Biophys Acta. 2002;1562:6–31. doi: 10.1016/s0005-2736(02)00359-0. [DOI] [PubMed] [Google Scholar]
  • 20.Pohlner J, Halter R, Beyreuther K, Meyer TF. Gene structure and extracellular secretion of Neisseria gonorrhoeae IgA protease. Nature. 1987;325:458–62. doi: 10.1038/325458a0. [DOI] [PubMed] [Google Scholar]
  • 21.Henderson IR, Nataro JP. Virulence functions of autotransporter proteins. Infect Immun. 2001;69:1231–43. doi: 10.1128/IAI.69.3.1231-1243.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Oomen CJ, van Ulsen P, van Gelder P, Feijen M, Tommassen J, et al. Structure of the translocator domain of a bacterial autotransporter. EMBO J. 2004;23:1257–66. doi: 10.1038/sj.emboj.7600148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Van den Berg B. Crystal structure of a full-length autotransporter. J Mol Biol. 2010;396:627–33. doi: 10.1016/j.jmb.2009.12.061. [DOI] [PubMed] [Google Scholar]
  • 24.Barnard TJ, Dautin N, Lukacik P, Bernstein HD, Buchanan SK. Autotransporter structure reveals intra-barrel cleavage followed by conformational changes. Nat Struct Mol Biol. 2007;14:1214–20. doi: 10.1038/nsmb1322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Jacob-Dubuisson F, Locht C, Antoine R. Two-partner secretion in Gram-negative bacteria: a thrifty, specific pathway for large virulence proteins. Mol Microbiol. 2001;40:306–13. doi: 10.1046/j.1365-2958.2001.02278.x. [DOI] [PubMed] [Google Scholar]
  • 26.Surana NK, Buscher AZ, Hardy GG, Grass S, Kehl-Fie T, et al. Translocator proteins in the two-partner secretion family have multiple domains. J Biol Chem. 2006;281:18051–8. doi: 10.1074/jbc.M600036200. [DOI] [PubMed] [Google Scholar]
  • 27.Clantin B, Delattre AS, Rucktooa P, Saint N, Méli AC, et al. Structure of the membrane protein FhaC: a member of the Omp85-TpsB transporter superfamily. Science. 2007;317:957–61. doi: 10.1126/science.1143860. [DOI] [PubMed] [Google Scholar]
  • 28.Könninger UW, Hobbie S, Benz R, Braun V. The haemolysin-secreting ShlB protein of the outer membrane of Serratia marcescens: determination of surface-exposed residues and formation of ion-permeable pores by ShlB mutants in artificial lipid bilayer membranes. Mol Microbiol. 1999;32:1212–25. doi: 10.1046/j.1365-2958.1999.01433.x. [DOI] [PubMed] [Google Scholar]
  • 29.Hoiczyk E, Roggenkamp A, Reichenbecher M, Lupas A, Heesemann J. Structure and sequence analysis of Yersinia YadA and Moraxella UspAs reveal a novel class of adhesins. EMBO J. 2000;19:5989–99. doi: 10.1093/emboj/19.22.5989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kim DS, Chao Y, Saier MH., Jr Protein-translocating trimeric autotransporters of gram-negative bacteria. J Bacteriol. 2006;188:5655–67. doi: 10.1128/JB.01596-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Cotter SE, Surana NK, St Geme JW., 3rd Trimeric autotransporters: a distinct subfamily of autotransporter proteins. Trends Microbiol. 2005;13:199–205. doi: 10.1016/j.tim.2005.03.004. [DOI] [PubMed] [Google Scholar]
  • 32.Meng G, Surana NK, St Geme JW, 3rd, Waksman G. Structure of the outer membrane translocator domain of the Haemophilus influenzae Hia trimeric autotransporter. EMBO J. 2006;25:2297–304. doi: 10.1038/sj.emboj.7601132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Larocque RC, Harris JB, Dziejman M, Li X, Khan AI, et al. Transcriptional profiling of Vibrio cholerae recovered directly from patient specimens during early and late stages of human infection. Infect Immun. 2005;73:4488–93. doi: 10.1128/IAI.73.8.4488-4493.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Nakhamchik A, Wilde C, Rowe-Magnus DA. Cyclic-di-GMP regulates extracellular polysaccharide production, biofilm formation, and rugose colony development by Vibrio vulnificus. Appl Environ Microbiol. 2008;74:4199–209. doi: 10.1128/AEM.00176-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Beyhan S, Tischler AD, Camilli A, Yildiz FH. Transcriptome and phenotypic responses of Vibrio cholerae to increased cyclic di-GMP level. J Bacteriol. 2006;188:3600–13. doi: 10.1128/JB.188.10.3600-3613.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Barnes IH, Bagnall MC, Browning DD, Thompson SA, Manning G, et al. Gamma-glutamyl transpeptidase has a role in the persistent colonization of the avian gut by Campylobacter jejuni. Microb Pathog. 2007;43:198–207. doi: 10.1016/j.micpath.2007.05.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Chevalier C, Thiberge JM, Ferrero RL, Labigne A. Essential role of Helicobacter pylori gamma-glutamyltranspeptidase for the colonization of the gastric mucosa of mice. Mol Microbiol. 1999;31:1359–72. doi: 10.1046/j.1365-2958.1999.01271.x. [DOI] [PubMed] [Google Scholar]
  • 38.Ryjenkov DA, Tarutina M, Moskvin OV, Gomelsky M. Cyclic diguanylate is a ubiquitous signalling molecule in bacteria: insights into biochemistry of the GGDEF protein domain. J Bacteriol. 2005;187:1792–8. doi: 10.1128/JB.187.5.1792-1798.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Heidelberg JF, Eisen JA, Nelson WC, Clayton RA, Gwinn ML, et al. DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae. Nature. 2000;406:477–83. doi: 10.1038/35020000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Henderson IR, Navarro-Garcia F, Nataro JP. The great escape: structure and function of the autotransporter proteins. Trends Microbiol. 1998;6:370–8. doi: 10.1016/s0966-842x(98)01318-3. [DOI] [PubMed] [Google Scholar]
  • 41.Hendrixson DR, de la Morena ML, Stathopoulos C, St Geme JW., 3rd Structural determinants of processing and secretion of the Haemophilus influenzae hap protein. Mol Microbiol. 1997;26:505–18. doi: 10.1046/j.1365-2958.1997.5921965.x. [DOI] [PubMed] [Google Scholar]
  • 42.Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Mao F, Dam P, Chou J, Olman V, Xu Y. DOOR: a database for prokaryotic operons. Nucleic Acids Res. 2009;37:D459–63. doi: 10.1093/nar/gkn757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Zhang Y. Template-based modeling and free modeling by I-TASSER in CASP7. Proteins. 2007;69:108–17. doi: 10.1002/prot.21702. [DOI] [PubMed] [Google Scholar]
  • 45.Sippl MJ, Wiederstein M. A note on difficult structure alignment problems. Bioinformatics. 2008;24:426–7. doi: 10.1093/bioinformatics/btm622. [DOI] [PubMed] [Google Scholar]
  • 46.Sayle RA, Milner-White EJ. RASMOL: biomolecular graphics for all. Trends Biochem Sci. 1995;20:374. doi: 10.1016/s0968-0004(00)89080-5. [DOI] [PubMed] [Google Scholar]
  • 47.Zhai Y, Saier MH., Jr A web-based program (WHAT) for the simultaneous prediction of hydropathy, amphipathicity, secondary structure and transmembrane topology for a single protein sequence. J Mol Microbiol Biotechnol. 2001;3:501–2. [PubMed] [Google Scholar]
  • 48.Bagos PG, Liakopoulos TD, Spyropoulos IC, Hamodrakas SJ. PRED-TMBB: a web server for predicting the topology of beta-barrel outer membrane proteins. Nucleic Acids Res. 2004;32:W400–4. doi: 10.1093/nar/gkh417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Jeanmougin F, Thompson JD, Gouy M, Higgins DG, Gibson TJ. Multiple sequence alignment with Clustal X. Trends Biochem Sci. 1998;23:403–5. doi: 10.1016/s0968-0004(98)01285-7. [DOI] [PubMed] [Google Scholar]
  • 50.Do CB, Mahabhashyam MS, Brudno M, Batzoglou S. ProbCons: Probabilistic consistency-based multiple sequence alignment. Genome Res. 2005;15:330–40. doi: 10.1101/gr.2821705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Biegert A, Mayer C, Remmert M, Söding J, Lupas AN. The MPI Bioinformatics Toolkit for protein sequence analysis. Nucleic Acids Res. 2006;34:W335–9. doi: 10.1093/nar/gkl217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Tamura K, Dudley J, Nei M, Kumar S. MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007;24:1596–9. doi: 10.1093/molbev/msm092. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1

Sequence alignment of DUF3233 representative sequences. DUF3233 has an average 23 amino acid N-terminal signal sequence (gray) which guides inner membrane translocation. The signal peptidase I cleavage site is marked by a red arrow. Cps_Colwellia psychrerythraea (YP_269983.1), Vfi_Vibrio fischeri (YP_206466.1), Vpa_Vibrio parahaemolyticus (NP_800436.1), Vvu_Vibrio vulnificus (NP_762291.1), Vch_Vibrio cholerae (NP_232949.1), Vsp_Vibrio splendidus (YP_002395311.1), Sba_Shewanella baltica (YP_001052604.1), Slo_Shewanella loihica (YP_001095898.1), Spi_Shewanella piezotolerans (YP_002312093.1), Fba_Ferrimonas balearica (YP_003912385.1).

(TIF)

Table S1

List of DUF3233 homologous sequences. Sequence search by HHsenser (permissive) picked gram-negative and cyanobacterial (Cyn) hypothetical proteins that share significant homology with DUF3233. These hypothetical proteins posses sequence characteristics that apparently resemble the autotransporter beta-domain. # Insignificant Pfam A hits, $ Significant Pfam A hits.

(DOC)

Table S2

List of representative sequences used for phylogeny analysis. C-terminal translocation unit sequences of Autotransporter (Va) [18], Two partner secretion (Vb) [19] proteins and DUF3233 representatives used for phylogenetic analysis.

(DOC)


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES