Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Sep 4.
Published in final edited form as: J Biomol NMR. 2004 May;29(1):101–102. doi: 10.1023/B:JNMR.0000019469.77977.72

Letter to the Editor: NMR assignment of the hypothetical protein HI0004 from Haemophilus influenzae – a putative essential gene product

Deok Cheon Yeh 1, James F Parsons 1, Lisa M Parsons 1, Fang Liu 1, Edward Eisenstein 1, John Orban 1,*
PMCID: PMC3762882  NIHMSID: NIHMS505811  PMID: 15017148

Biological context

Haemophilus influenzae is a gram negative bacterium which is known to cause meningitis, sepsis, upper respiratory infections, and inner ear infections, particularly in small children. Its 1.83 Mb genome is relatively small, consisting of approximately 1700 open reading frames, and was the first free-living organism to have its genome sequenced entirely (Fleischmann et al., 1995). At the time the genome sequence was released, approximately 32% of the gene products had unknown or poorly understood functions. A significant fraction of these have sequence homologues in prokaryotes, archaea, and eukaryotes, and many are still classified as hypothetical proteins. In an effort to better understand the biochemical functions of these hypothetical proteins, we have been involved in a concerted program to solve their three dimensional structures using X-ray crystallography or NMR spectroscopy (Eisenstein et al., 2000). A summary of the current project status can be found at http://s2f.umbi.umd.edu

One such protein of interest is the 154 residue HI0004 which has over 150 sequence homologues in other organisms but no known function or three-dimensional structure. Mutagenesis experiments indicate a putative essential role for 259 gene products of unknown function in Haemophilus influenzae and HI0004 is in this set, making it an important target for structural and functional annotation (Akerley et al., 2002). Further, the HI0004 family of sequence relatives contains a human homologue, C21orf57, which is a putative transcript located on chromosome 21 near the Down syndrome locus (Reymond et al., 2001). Structure determination of HI0004 will therefore allow homology modeling of a large number of sequences in other organisms and provide the basis for an understanding of their biochemical functions at a molecular level. We report here the 1H, 13C, and 15N chemical shift assignments for HI0004, providing the first step toward detailed structure calculations in solution.

Methods and experiments

The native HI0004 gene was cloned into a pET-28a vector using standard procedures and transformed into Escherichia coli BL21 (DE3) cells (Novagen). Uniformly 13C/15N-labeled samples of HI0004 were prepared by growing these cells in minimal media with 15NH4Cl and 13C6-glucose as the sole nitrogen and carbon sources, respectively. The media were further supplemented with ZnCl2 (2 mg/L), MnSO4 (1 mg/L), CoCl2 (0.1 mg/L), CuSO4 (0.1 mg/L) and thiamine (1 mg/L). E. coli cells were grown to an A600 of 0.8 and protein expression was induced with 1 mM IPTG. After an additional 3 h, the cells were harvested, suspended in lysis buffer (0.1 M Tris, 1 mM EDTA, 1 mM DTT, pH 8.0), and lysed by sonication. The lysed cells were centrifuged and ammonium sulfate was added to the supernatant to a 40% saturation level. The resulting precipitate was removed by centrifugation and further ammonium sulfate was added to the supernatant to give a 70% saturation level. The precipitate was collected by centrifugation, dissolved in lysis buffer, and dialysed against a solution of 25 mM HEPES, 1 mM EDTA, and 1 mM DTT at pH 7.2. The desalted solution was loaded on a Poros HQ-50 anion exchange column (Applied Biosystems) and eluted with a 0–800 mM NaCl gradient. The homogeneity of individual fractions was determined by SDS-PAGE 102 and those containing pure HI0004 were combined, dialysed against NMR buffer (50 mMpotassium phosphate, 100 mM NaCl, 0.5 mM EDTA, 1 mM DTT, pH 7.0) and concentrated.

Both 15N- and 15N/13C-labeled NMR samples of HI0004 were prepared at concentrations of approximately 1.5 mM in 450 µL of NMR buffer containing 90% H2O/10% D2O. The following experiments were acquired: two-dimensional 15N-HSQC,13C-HSQC, and (HB)CB(CGCD)HD/(HB)CB(CGCD)HE, and three-dimensionalHNCACB, CBCA(CO)NH, HNCO, HNCA, HBHA(CO)NH, H(CCO)NH-TOCSY, (H)C (CO)NH-TOCSY, HCCH-TOCSY, HCCH-COSY, 15NNOESY HSQC, and 13C-NOESY HSQC. In addition, a constant time 13C-HSQC spectrum was recorded on a 15% uniformly 13C-labeled sample (1.0 mM) to determine stereospecific assignments of methyl groups in Leu and Val residues (Neri et al., 1989). The methyl groups of 12 out of 14 Val residues and all 17 Leu residues were stereospecifically assigned in this way. With the exception of the 15N-NOESY HSQC, the NMR spectra were collected in a 7 day period at the NMR Facility at Madison (NMRFAM, Wisconsin) on a Bruker DRX-500 equipped with a z-gradient triple resonance cryoprobe. All experiments were recorded at 298K. Spectra were processed using nmrPipe (Delaglio et al., 1995) and analyzed with Sparky (Goddard and Kneller, UCSF).

Extent of assignments and data deposition

Sequence specific backbone assignments were made for 143 of the 145 possible main chain crosspeaks in the 15N-HSQC spectrum of HI0004 (154 residues minus 8 prolines and the N-terminal Met) as illustrated in Figure 1. Residues for which backbone HN/N assignments were not made are G2 and M133. The chemical shift index (CSI) results suggest that HI0004 contains four beta strands and five helices. Interestingly, the number of these secondary structure elements and their locations in the polypeptide chain from CSI are in very good agreement with sequence-based predictions from Psipred (Jones, 1999). Assignments were also made for about 90% of the possible 1H, 13C, and 15N side chain resonances. The chemical shifts of HI0004 have been deposited in the BioMagResBank database (http://www.bmrb.wisc.edu) under BMRB accession number 5942.

Figure 1.

Figure 1

Two-dimensional 15N-HSQC spectrum of HI0004 at 298K and pH 7.0. The downfield peaks due to the indole protons of W27 and W108 are not shown.

Acknowledgements

We are grateful to NMRFAM at Madison, Wisconsin, for access to their cryoprobe facility and to Dr Frits Abildgaard in particular for assistance with pulse programs. This research was supported by NIH grants GM57890 and 1S10RR15744, and the W. M. Keck Foundation.

References

  1. Akerley BJ, Rubin EJ, Novick VL, Amaya K, Judson N, Mekalanos JJ. Proc. Natl. Acad. Sci. USA. 2002;99:966–971. doi: 10.1073/pnas.012602299. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Delaglio F, Grzesiek S, Vuister GW, Zhu G, Pfeifer J, Bax A. J. Biomol. NMR. 1995;6:277–293. doi: 10.1007/BF00197809. [DOI] [PubMed] [Google Scholar]
  3. Eisenstein E, Gilliland GL, Herzberg O, Moult J, Orban J, Poljak RJ, Banerjei L, Richardson D, Howard AJ. Curr. Opin. Biotechnol. 2000;11:25–30. doi: 10.1016/s0958-1669(99)00063-4. [DOI] [PubMed] [Google Scholar]
  4. Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb J-F, Dougherty BA, Merrick JM, McKenney K, Sutton G, FitzHugh W, Fields C, Gocayne JD, Scott J, Shirley R, Liu L-I, Glodek A, Kelley JM, Weidman JF, Phillips CA, Spriggs T, Hedblom E, Cotton MD, Utterback TR, Hanna MC, Nguyen DT, Saudek DM, Brandon RC, Fine LD, Fritchman JL, Fuhrmann JL, Geoghagen NSM, Gnehm CL, McDonald LA, Small KV, Fraser CM, Smith HO, Venter JC. Science. 1995;269:496–512. doi: 10.1126/science.7542800. [DOI] [PubMed] [Google Scholar]
  5. Jones DT. J. Mol. Biol. 1999;292:195–202. doi: 10.1006/jmbi.1999.3091. [DOI] [PubMed] [Google Scholar]
  6. Neri D, Szyperski T, Otting G, Senn H, Wüthrich K. Biochemistry. 1989;28:7510–7516. doi: 10.1021/bi00445a003. [DOI] [PubMed] [Google Scholar]
  7. Reymond A, Friedli M, Henrichsen CN, Chapot F, Deutsch S, Ucla C, Rossier C, Lyle R, Guipponi M, Antonarakis SE. Genomics. 2001;78:46–54. doi: 10.1006/geno.2001.6640. [DOI] [PubMed] [Google Scholar]

RESOURCES