Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2006 Nov 7;35(Database issue):D906–D910. doi: 10.1093/nar/gkl849

SuperHapten: a comprehensive database for small immunogenic compounds

Stefan Günther 1,*, Dorothea Hempel 1, Mathias Dunkel 1, Kristian Rother 1, Robert Preissner 1
PMCID: PMC1669746  PMID: 17090587

Abstract

The immune system protects organisms from foreign proteins, peptide epitopes and a multitude of chemical compounds. Among these, haptens are small molecules, eliciting an immune response when conjugated with carrier molecules. Known haptens are xenobiotics or natural compounds, which can induce a number of autoimmune diseases like contact dermatitis or asthma. Furthermore, haptens are utilized in the development of biosensors, immunomodulators and new vaccines. Although hapten-induced allergies account for 6–10% of all adverse drug effects, the understanding of the correlation between structural and haptenic properties is rather fragmentary. We have developed a manually curated hapten database, SuperHapten, integrating information from literature and web resources. The current version of the database compiles 2D/3D structures, physicochemical properties and references for about 7500 haptens and 25,000 synonyms. The commercial availability is documented for about 6300 haptens and 450 related antibodies, enabling experimental approaches on cross-reactivity. The haptens are classified regarding their origin: pesticides, herbicides, insecticides, drugs, natural compounds, etc. Queries allow identification of haptens and associated antibodies according to functional class, carrier protein, chemical scaffold, composition or structural similarity. SuperHapten is available online at http://bioinformatics.charite.de/superhapten.

INTRODUCTION

Today, many aspects of the immune system like receptor-mediated signalling or induction of autoimmune diseases are only partly understood (13). Nevertheless, the substantial progress in the development of new vaccines (4,5) or therapeutic antibodies (6) demonstrates the great impact of immunological research on drug design and immune therapy. Although not antigenic by themselves, haptens interact with T-cell receptors or specific antibodies when conjugated to a larger antigenic molecule, usually a carrier protein. T-cells recognize haptens which are bound to the major histocompatibility complex (MHC) presented on the surface of various cell types. In contrast, hapten recognition by B-cells is mediated by receptors located on their membranes. Once they are activated, B-cells differentiate into Ig-secreting plasma cells. Such cells are able to produce highly specific antibodies that have the capability to bind haptens without carrier conjugation (7).

Specific antibody recognition is utilized in the development of biosensors (8), catalytic antibodies (9), immunomodulators (10) or new vaccines (11). However, an immune response is not always favourable. Natural and synthetic compounds in food or cosmetic products can cause skin inflammation (12), asthma and other allergic symptoms (13). Even though several mechanisms are discussed to be operative in the pathogenesis, immunogenic compounds are capable of eliciting autoimmune diseases like hepatocellular hepatitis (14) or systemic lupus erythematosus (15). It is estimated that drug allergies account for 6–10% of all adverse drug effects (16).

Many databases exist that provide diverse information for immunology. The International Immunogenetics Information System (17) consists of databases, online-tools and Web resources regarding various immunological aspects. Extensive information is also provided by the HIV Molecular Immunology Database (http://www.hiv.lanl.gov/content/immunology). Data relating to antigenic epitopes are collected in the databases JenPep (18), AntiJen (19) and Epitome (20). Furthermore the databases Kabat (21), MHCBN (22), Bcipep (23) and FIMM (24) provides comprehensive information about MHC binding peptides, B-cell epitopes or antibody sequences.

Haptens have rarely been subject to an extensive description. Recently, a large manually curated immune epitope database (IEDB) was published (25). Although the database focuses on antigenic peptides, it contains 91 haptens. Much more haptens are described within the HaptenDB (26), a collection of approximately 1000 haptens with its main focus on the underlying immunochemical assays. It contains information about the assay method, conjugation ratio of haptens and carrier molecules, sensitivity and specificity of the system or cross reactivity of the antibodies used. Although the database is very helpful to get an overview of the performed immunologic experiments, we noticed that many important immunologic compounds are missing. For example, drugs such as aspirin or beta adrenergic antagonists are capable of eliciting severe autoimmune diseases like bronchial asthma (27) or Psoriasis (28). Chlorated benzenes present in pesticides, solvents and lubricants are frequently described as haptens but apparently not incorporated within the HaptenDB. While the authors processed in detail the data of approximately 250 hapten-related articles, a large proportion of described haptens and associated immunologic articles is ignored. Furthermore, information about the availability of the described haptens and antibodies will be helpful in case a described immunoassay is to be reproduced.

To overcome these problems, we have performed a more extensive semi-automatic literature screening, resulting in a comprehensive dataset.

DATABASE DESCRIPTION

SuperHapten currently contains 7257 different haptens and references to about 10,000 immunologic articles. The basic application of the database is the identification of compounds that elicit immune response. The user interface provides diverse query types, like searching by hapten name, physicochemical properties, industrial use or associated carrier protein. Figure 1 depicts various query types. A built-in molecule editor allows the user to draw a molecule and to screen the database for compounds with similar molecular structure. This feature is of particular interest even if the specific requested molecule is not part of the database. Since compounds with high structural similarity frequently exhibit similar activities (29), this method provides an informative basis for the identification of new immunogenic compounds. Identified haptens are displayed by 3D structure and may act as starting points for further similarity searches. Each compound is provided with a full info page containing synonyms, molecular properties and references to the underlying literature. Another important aspect to assist immunologists is the availability of the haptens for further experiments. We have checked whether the haptens are commercially obtainable and provided 6279 compounds with ordering codes and external links to suppliers. Similarly, the ordering information of 453 hapten related antibodies was determined and included within the database. This procedure may avoid exhausting in vitro antibody generations if specific antibodies are required. Carrier proteins are annotated with UNIPROT-ID, PDB-structure and underlying references to literature. Furthermore, we have checked whether the annotated haptens are structurally resolved in complex with antibodies, carriers or other proteins. The corresponding PDB cross references are included on the hapten related info pages. Some haptens were identified, which are assured to be MHC-mediated. These 46 haptens are specified separately and directly retrievable. Table 1 specifies some key numbers of the database content.

Figure 1.

Figure 1

Queries and results of the SuperHapten web-interface. Query types: (a) Part of the cluster-tree, showing the structure of the scaffold classification. Clicking on the nodes expands the branch: monocyclic aromatic compounds → dinitrobenzene derivatives etc. Clicking on leafs triggers a database search in a separate window. (b) Text query options including a classification regarding the origin of the haptens. (c) Screenshot of the java applet Marvin which allows upload or drawing of own structures for similarity searches in the SuperHapten database showing Dinitrophenol. Results (d) Query results with search options for 2D similarity: Dinitrophenol (C6H4N2O5) and 2,4-dinitro-6-methylphenol (C7H6N2O5). The compounds can be rotated (left mouse button), different display styles are available, structures can be saved (right mouse button) and more detailed information, such as synonymous names and formula or supplier information can be obtained by use of the FULL INFO button. Further information (e) Antibody and suitable information like supplier, ID and source organism are displayed. (f) Information on hapten specific carrier protein(s), references, UNIPROT-, PDB-ID etc. (h) Recent scientific references confirming the haptenic effect including author information and abstract.

Table 1.

Key numbers of the database content

Entities
    Haptens 7257
    Antibodies 453
    Carrier proteins 24
    References to literature 9670
Hapten subgroups
    Confirmed 2009
    Putative 5248
    Commercially available 6285
    Drugs 392
    Natural compounds 1397
    Pesticides 227

Another intention was to allow immunologists an overview of known immunogenic compounds. For that purpose, the haptens were compared all against all and clustered by structural similarity. The resulting 154 clusters containing at least eight similar compounds were manually ordered to obtain a hierarchical compound tree. Each node was named according to the chemistry of the included compounds. For example, halogenated chemicals are frequently responsible for environmental skin diseases (30). About 800 references to articles are retrievable, which describe an immunologic response to halogenated compounds. By the same procedure, chemical classes like sulfurones or triazines are easily accessible. The presented cluster tree could potentially allow researchers to analyse the relationship between molecular structure and immunogenic effect. Exemplarily, Table 2 shows seven chemicals according to different compound groups.

Table 2.

Examples of haptens classified as drugs, contact allergens or pesticides

graphic file with name gkl849t2.jpg

The cluster size specifies the number of highly similar haptens grouped together.

METHODS

Haptens were collected from literature and various web resources. The abstracts of the literature database PubMed were filtered for relevant immunologic articles, using specific keywords. The 15,000 abstracts thus obtained were screened against names and synonyms of 3 million chemical compounds as well as a distinct set of substrings of IUPAC names.

The text passages containing matches were manually curated by a scientific team that confirmed the matching and analysed whether the identified compound was recognized by antibodies or T-cell receptors. About 200 full text review articles were subjected to the same procedure. The resulting 1600 haptens formed the confirmed basic dataset. Two web resources were checked, the DIMDI contact allergen database (http://www.dimdi.de/static/en/db/dbinfo/dbkurz/ka00.htm) and the HaptenDB (26). Additional 400 compounds were detected and included within the SuperHapten. Each compound of the basic dataset was translated into a structural fingerprint, using the chemistry development kit (http://almost.cubic.uni-koeln.de/cdk/). The fingerprint algorithm follows the approach taken by Daylight (http://www.daylight.com/dayhtml/doc/theory/). Up to six connected atoms all connectivity paths within the molecule are determined. For each path, a hash function calculates the location of a representing bit in a Boolean array. The number of possible paths is huge and it is not possible to assign a particular bit to each path, thus a particular bit represents several paths. The combination of all representing bits of all paths yields in a specific pattern or structural fingerprint.

The similarity index used is the Tanimoto coefficient, which is the number of bit positions set to 1 in both fingerprints divided by the number of bit positions set to 1 in at least one of the fingerprints. If a set bit is considered as a feature present in the molecule, the Tanimoto coefficient is a measure of the number of common features in both molecules (31). A Tanimoto coefficient of >0.85 indicates that two molecules have similar activities (32). Calculated fingerprints were used for a further structural screening against 4 million compounds. In this way, 5248 compounds were detected that resulted in Tanimoto coefficients of >0.9. These putative haptens are highly probable candidates for eliciting the same kind of immune response, binding to the same carrier and being recognised by the same antibodies or T-cells as the haptens found in literature (29).

3D representations of all haptens were generated with Discovery Studio, (Accelrys Inc., http://www.accelrys.com/dstudio) and visualized with the free Chime-Plugin from MDL (available for Windows, SGI, Mac). Linux compatibility is warranted by a second visualizer, MarvinView (ChemAxon). The same tool was also used for the built-in molecule editor that allows structural screening with self-edited molecules.

For compatibility, our data model are directed to the schema implemented in the IEDB (25). A diagram of the database schema is shown at the website. The data are implemented as a relational database on a MySQL server and publicly available at http://bioinformatics.charite.de/superhapten.

CONCLUSIONS AND FUTURE DIRECTIONS

By now, the presented database has become a useful tool to retrieve information about specific immunogenic compounds, or to get an overview of the known substances that elicit immune responses. The KEGG-pathway analysis of single haptenic compounds revealed that their synthesis is exclusively performed by bacterial enzymes (33). SuperHapten enables systematic approaches on the position of haptens in metabolic or signalling networks. The included data on purchasability of haptens and related antibodies will enable systematic experimental approaches on the relation between structural similarity and cross-reactivity.

Furthermore, structure comparisons of self-edited molecules to the annotated haptens may allow a first rough estimation of the potential immunogenicity of new chemicals.

Acknowledgments

This work is supported by the Deutsche Forschungsge-meinschaft (DFG). Funding to pay the Open Access publication charges for this article was provided by DFG-Sonderforschungsbereich 449.

Conflict of interest statement. None declared.

REFERENCES

  • 1.Rioux J.D., Abbas A.K. Paths to understanding the genetic basis of autoimmune disease. Nature. 2005;435:584–589. doi: 10.1038/nature03723. [DOI] [PubMed] [Google Scholar]
  • 2.O'Neill L.A. How Toll-like receptors signal: what we know and what we don't know. Curr. Opin. Immunol. 2006;18:3–9. doi: 10.1016/j.coi.2005.11.012. [DOI] [PubMed] [Google Scholar]
  • 3.Mahoney J.A., Rosen A. Apoptosis and autoimmunity. Curr. Opin. Immunol. 2005;17:583–588. doi: 10.1016/j.coi.2005.09.018. [DOI] [PubMed] [Google Scholar]
  • 4.Linhart B., Valenta R. Molecular design of allergy vaccines. Curr. Opin. Immunol. 2005;17:646–655. doi: 10.1016/j.coi.2005.09.010. [DOI] [PubMed] [Google Scholar]
  • 5.Houghton M., Abrignani S. Prospects for a vaccine against the hepatitis C virus. Nature. 2005;436:961–966. doi: 10.1038/nature04081. [DOI] [PubMed] [Google Scholar]
  • 6.Kim S.J., Park Y., Hong H.J. Antibody engineering for the development of therapeutic antibodies. Mol. Cells. 2005;20:17–29. [PubMed] [Google Scholar]
  • 7.Singh K.V., Kaur J., Varshney G.C., Raje M., Suri C.R. Synthesis and characterization of hapten-protein conjugates for antibody production against small molecules. Bioconjug. Chem. 2004;15:168–173. doi: 10.1021/bc034158v. [DOI] [PubMed] [Google Scholar]
  • 8.Cho A.Y., Yi K.S., Rhim J.H., Kim K.I., Park J.Y., Keum E.H., Chung J., Oh S. Detection of abnormally high amygdalin content in food by an enzyme immunoassay. Mol. Cells. 2006;21:308–313. [PubMed] [Google Scholar]
  • 9.Hanson C.V., Nishiyama Y., Paul S. Catalytic antibodies and their applications. Curr. Opin. Biotechnol. 2005;16:631–636. doi: 10.1016/j.copbio.2005.10.003. [DOI] [PubMed] [Google Scholar]
  • 10.Holzer A.M., Kaplan L.L., Levis W.R. Haptens as drugs: contact allergens are powerful topical immunomodulators. J. Drugs Dermatol. 2006;5:410–416. [PubMed] [Google Scholar]
  • 11.Berd D., Sato T., Maguire H.C., Jr, Kairys J., Mastrangelo M.J. Immunopharmacologic analysis of an autologous, hapten-modified human melanoma vaccine. J. Clin. Oncol. 2004;22:403–415. doi: 10.1200/JCO.2004.06.043. [DOI] [PubMed] [Google Scholar]
  • 12.Divkovic M., Pease C.K., Gerberick G.F., Basketter D.A. Hapten-protein binding: from theory to practical application in the in vitro prediction of skin sensitization. Contact Dermatitis. 2005;53:189–200. doi: 10.1111/j.0105-1873.2005.00683.x. [DOI] [PubMed] [Google Scholar]
  • 13.Vanoirbeek J.A., Tarkowski M., Vanhooren H.M., De Vooght V., Nemery B., Hoet P.H. Validation of a mouse model of chemical-induced asthma using trimellitic anhydride, a respiratory sensitizer, and dinitrochlorobenzene, a dermal sensitizer. J. Allergy. Clin. Immunol. 2006;117:1090–1097. doi: 10.1016/j.jaci.2006.01.027. [DOI] [PubMed] [Google Scholar]
  • 14.Ju C. Immunological mechanisms of drug-induced liver injury. Curr. Opin. Drug. Discov. Devel. 2005;8:38–43. [PubMed] [Google Scholar]
  • 15.Pelizza L., De Luca P., La Pesa M., Minervino A. Drug-induced systemic lupus erythematosus after 7 years of treatment with carbamazepine. Acta. Biomed Ateneo Parmense. 2006;77:17–19. [PubMed] [Google Scholar]
  • 16.Ju C., Pohl L.R. Tolerogenic role of Kupffer cells in immune-mediated adverse drug reactions. Toxicology. 2005;209:109–112. doi: 10.1016/j.tox.2004.12.017. [DOI] [PubMed] [Google Scholar]
  • 17.Lefranc M.P. IMGT, the international ImMunoGeneTics information system(R): a standardized approach for immunogenetics and immunoinformatics. Immunome Res. 2005;1:3. doi: 10.1186/1745-7580-1-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.McSparron H., Blythe M.J., Zygouri C., Doytchinova I.A., Flower D.R. JenPep: a novel computational information resource for immunobiology and vaccinology. J. Chem. Inf. Comput. Sci. 2003;43:1276–1287. doi: 10.1021/ci030461e. [DOI] [PubMed] [Google Scholar]
  • 19.Toseland C.P., Clayton D.J., McSparron H., Hemsley S.L., Blythe M.J., Paine K., Doytchinova I.A., Guan P., Hattotuwagama C.K., Flower D.R. AntiJen: a quantitative immunology database integrating functional, thermodynamic, kinetic, biophysical, and cellular data. Immunome Res. 2005;1:4. doi: 10.1186/1745-7580-1-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Schlessinger A., Ofran Y., Yachdav G., Rost B. Epitome: database of structure-inferred antigenic epitopes. Nucleic Acids Res. 2006;34:D777–D780. doi: 10.1093/nar/gkj053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Johnson G., Wu T.T. Kabat database and its applications: 30 years after the first variability plot. Nucleic Acids Res. 2000;28:214–218. doi: 10.1093/nar/28.1.214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Bhasin M., Singh H., Raghava G.P. MHCBN: a comprehensive database of MHC binding and non-binding peptides. Bioinformatics. 2003;19:665–666. doi: 10.1093/bioinformatics/btg055. [DOI] [PubMed] [Google Scholar]
  • 23.Saha S., Bhasin M., Raghava G.P. Bcipep: a database of B-cell epitopes. BMC Genomics. 2005;6:79. doi: 10.1186/1471-2164-6-79. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Schonbach C., Koh J.L., Flower D.R., Wong L., Brusic V. FIMM, a database of functional molecular immunology: update 2002. Nucleic Acids Res. 2002;30:226–229. doi: 10.1093/nar/30.1.226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Sathiamurthy M., Peters B., Bui H.H., Sidney J., Mokili J., Wilson S.S., Fleri W., McGuinness D.L., Bourne P.E., Sette A. An ontology for immune epitopes: application to the design of a broad scope database of immune reactivities. Immunome Res. 2005;1:2. doi: 10.1186/1745-7580-1-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Singh M.K., Srivastava S., Raghava G.P., Varshney G.C. HaptenDB: a comprehensive database of haptens, carrier proteins and anti-hapten antibodies. Bioinformatics. 2006;22:253–255. doi: 10.1093/bioinformatics/bti692. [DOI] [PubMed] [Google Scholar]
  • 27.Pfaar O., Klimek L. Aspirin desensitization in aspirin intolerance: update on current standards and recent improvements. Curr. Opin. Allergy Clin. Immunol. 2006;6:161–166. doi: 10.1097/01.all.0000225153.45027.6a. [DOI] [PubMed] [Google Scholar]
  • 28.O'Brien M., Koo J. The mechanism of lithium and beta-blocking agents in inducing and exacerbating psoriasis. J. Drugs Dermatol. 2006;5:426–432. [PubMed] [Google Scholar]
  • 29.Martin Y.C., Kofron J.L., Traphagen L.M. Do structurally similar molecules have similar biological activity? J. Med. Chem. 2002;45:4350–4358. doi: 10.1021/jm020155c. [DOI] [PubMed] [Google Scholar]
  • 30.Yamamoto O., Tokura Y. Photocontact dermatitis and chloracne: two major occupational and environmental skin diseases induced by different actions of halogenated chemicals. J. Dermatol. Sci. 2003;32:85–94. doi: 10.1016/s0923-1811(03)00097-5. [DOI] [PubMed] [Google Scholar]
  • 31.Delaney J.S. Assessing the ability of chemical similarity measures to discriminate between active and inactive compounds. Mol. Divers. 1996;1:217–222. doi: 10.1007/BF01715525. [DOI] [PubMed] [Google Scholar]
  • 32.Patterson D.E., Cramer R.D., Ferguson A.M., Clark R.D., Weinberger L.E. Neighborhood behavior: a useful concept for validation of ‘molecular diversity’ descriptors. J. Med. Chem. 1996;39:3049–3059. doi: 10.1021/jm960290n. [DOI] [PubMed] [Google Scholar]
  • 33.Honda W., Kawashima S., Kanehisa M. Metabolite antigens and pathway incompatibility. Genome Informatics. 2006;17:184–194. [PubMed] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES