Skip to main content
. 2000 Dec 4;1(6):research0013.1–13.13. doi: 10.1186/gb-2000-1-6-research0013

Table 1.

Interkingdom domain fusions and their probable origins

IKF gene Best 'native' hit Best 'alien' hit Protein function Stand-alone Comment
(GI number and gene (E-value, amino acid (E-value, amino paralog of the
name) and origin residue range, acid residue range, alien domain
of domains species)/domain species)/domain
function function
Archaea
Aeropyrum pernix
  5106104_ 2621953_Mth 2633525_Bs Hydroxymethyl- None Pyrococci encode proteins with
  APE2400 5e-27; 4e-54; pyrimidine the same domain organization
  Archaeal-bacterial 282-445; 16-272; phosphate kinase andclosest similarity to A. pernix;
uncharacterized domain hydroxymethyl- involved in thiamine M. jannaschii encodes a protein
conserved among pyrimidine phosphate biosynthesis with the same domain
archaea (homolog kinase (additional function?) organization but low similarity;
of the amino-terminal Mt encodes a HMP-kinase with
domain of sialic acid moderate similarity
synthase)
Methanococcus jannaschii
  1591138_ 2128140_Mj; 7270033_At; Unknown; None The amino-terminal domain is
  MJ0434 1e-19; 0.003; possible role present in several stand-alone
  Archaeal- 2-94; 120-222; in stress response copies in M. jannaschii, but
  bacterial-eukaryotic uncharacterized AIG2-like otherwise, is seen mostly in
domain stress-related bacteria; the possibility of
protein acquisition of a bacterial gene
by the Methanococcus lineage
is conceivable
Methanobacterium thermoautotrophicum
  2621249_ 5103547_Ap; 1651798_Ssp; Membrane-associated None In Ssp, the amino-terminal
  MTH204 1e-34; 0.002; 5-formyl- domain is fused to another
  Archaeal- 137-326; 8-139; tetrahydrofolate uncharacterized domain. An
  eukaryotic/ 5-formyl- uncharacterized cyclo-ligase(?); ortholog with conserved
  bacterial tetrahydrofolate membrane-associated exact function domain organization is seen
cyclo-ligase domain unknown in Mycobacterium, but many
other bacteria encode stand-
alone versions of this domain,
which could be the actual sources
of horizontal gene transfer
  2621673_ 3256572_Ph; 2984130_Aa; GTPase, possible 2621855
  MTH594 3e-10; 6e-19; role in signal
  Archaeal-bacterial 5-137; 233-390; transduction
inactivated RecA GTPase
domain
  2622642_ 5105992_Ap; 2569943_Axy; Glucose-1-phosphate None
  MTH1523 3e-36; 2e-05; thymidylyl transferase/
  Archaeal-bacterial 5-226; 226-334; glucose-6-phosphate
glucose-1-phosphate mannose-6- isomerase
thymidylyl transferase phosphate isomerase
 Bacteria
  Aquifex aeolicus
  2983622_ 2633696_Bs; 2650176_Af; Signal None
  aq_1151 5e-65; 0.005; transduction
  Bacterial-archaeal 325-795; 116-279; c-di-GMP
c-di-GMP phospho- PAS/PAC phospho-diesterase
diesterase domain
  2984285_ 586875_Bs; 3915955_Mj; Molybdenum None
  aq_2060 4e-63 3e-09; cofactor
  Bacterial-archaeal 1-252; 270-441; bisynthesis enzyme(?)
PHP superfamily pyruvate
hydrolase formate-lyase
activating enzyme
(Fe-S cluster
oxidoreductase)
Bacillus subtilis
  2632283_yaaH, 4980914_Tm 399377_Rn Chitinase 2635915 B. subtilis encodes two
  1945087_ydhD 1e-06 2e-11 paralogous proteins with the
  Bacterial-eukaryotic 2-92; 221-402; same domain architecture
LysM repeat domain chitinase
  2633242_yhcR 645819_Dr; 2622704_Mth; Nuclease-nucleotidase None
  Bacterial-archaeal 1e-64; 0.008 (probable repair
584-1068; 151-257; enzyme)
5'-nucleotidase; nucleic acid-binding
1175987_ domain (OB-fold)
ECR100;
2e-09;
377-521;
thermonuclease
  2632325_yabN 4981449_Tm; 3873806_Ce; Methyl-transferase/ None Other than in chlamydiae,
  Bacterial-eukaryotic 2e-62; 0.003; pyro-phosphatase the SWI domain is seen
223-483; 7-125; (metabolic enzyme in eukaryotic chromatin-
MazG (predicted pyro- SAM-dependent of an unknown associated proteins, leading
phosphatase) methyl-transferase pathway?) to the suggestion that
chlamydial topoisomerase
is involved in chromosome
condensation
Chlamydophyla pneumoniae
  4377077_ 730965_Bs; 3581917_Sp; DNA topoisomerase I, 7189103 SWI is a typical eukaryotic
  CPn0769 e-148; 3e-10; possibly involved in domain not found in
  Bacterial-eukaryotic 1-727; 792-866; chromatin prokaryotes other than
DNA topoisomerase I SWI domain condensation chlamydia (the ortholog
in Chlamydia trachomatis has the
same domain architecture)
Deinococcus radiodurans
  6459294_ 7248325_Sco; 6754878_Mm; DNase None The G9a domain is not
  DR1533 0.001; 9e-28; detectable in other prokaryotes.
  Bacterial-eukaryotic 171-265; 4-148; In eukaryotes, this domain so
McrA family G9a domain (DNA- far has been found only as part
endonuclease binding?) of multidomain nuclear proteins,
including transcription factors
Escherichia coli
  1787179_ 94933_Ppu; 3747107_Rn; Oxidoreductase None The eukaryotic domain is present
  b0947 3e-10; 3e-32; (as a partial sequence) also in the
  Bacterial-eukaryotic 287-367; 4-261; beta-proteobacterium Vogesella.
ferredoxin uncharacterized This domain contains a conserved
domain (thiol pair of cysteines, which together
oxidoreductase?) with the ferredoxin fusion, may
suggest a thiol oxidoreductase
activity. Most of the eukaryotic
proteins containing this domain
appear to be mitochondrial,
suggesting the possibility of an
alternative evolutionary scenario
  1787678_ 487713_Sli; 5459012_Pab; Methyl-transferase/ None
  b1410 3e-05; 1e-17; Lipase (exact function
  Bacterial-archaeal 408-522; 33-274; unclear)
SAM-dependent lyso-phospholipase
methyl-transferase
  1787679_ynbD 1591375_Mj; 7160233_Sp; Membrane-associated None An unusual case of fusion
  Archaeal-eukaryotic 4e-04; 1e-06; bifunctional between an apparently archaeal
50-218; 346-415; phosphatase and a typical eukaryotic domain
membrane-associated tyrosine phosphatase in a bacterium
acid phosphatase
  1788589_ 5763950_Sco; 3860247_At; Bifunctional enzyme; None
  b2255 4e-35; 1e-55; exact function unclear
  Bacterial-eukaryotic 1-259; 318-652;
methionyl-tRNA dTDP-glucose 4-6-
formyl-transferase dehydratase
  1788938_yfiQ 929735_Nsp; 2649370_Af; acetyl-CoA synthetase/ None
  bacterial-Archaeal/ 8e-32; 4e-85; acetyl-transferase; exact
  eukaryotic 637-874; 6-689; function unclear
acetyl-transferase acetyl-CoA synthetase
Mycobacterium tuberculosis
  2909507_ 6469244_Sco; 4151109_Tbr; Adenylate cyclase/ 7476546, M. tuberculosis encodes three
  Rv2488c, 5e-64; 6e-04; ATPase; probable 7476738 paralogous proteins that consist
  2791528_Rv1358, 19-603; 6-167; transcription regulator of three domains, the eukaryotic-
  1419061_ 4726088_Rer; adenylate cyclase type adenylate cyclase, AP
  Rv1358 2e-12; (apoptotic) ATPase and DNA-
  Bacterial-eukaryotic 818-1073 binding response regulator, and
two stand-alone versions of
adenylate cyclase, which show the
closest similarity to the cyclase
domain of the multidomain
proteins
  1314025_ 120037_Tt; 178213_Hs; Ferredoxin/ 2076681 D. radiodurans also encodes the
  Rv0886 1e-11; 4e-65; ferredoxin reductase eukaryotic-type ferredoxin
  Bacterial-eukaryotic 2-79; 93-543; reductase, but the ferredoxin
ferredoxin ferredoxin reductase fusion is unique to mycobacteria
  3261732_ 2661695_Sco; 279520_Dd; cAMP-dependent 4455714
  Rv0998 3e-13; 7e-07; acetyl-transferase(?) (M. leprae)
  Bacterial-eukaryotic 148-328; 30-105;
acetyl-transferase cAMP-binding domain
  2326726_ 421331_Cvi; 2645721_Mm; Bifunctional enzyme of 1929080
  Rv1683 1e-24; 6e-26; poly (3-hydroxy-butyrate)
  Bacterial-eukaryotic 23-359; 456-972; synthesis
poly (3-hydroxy- very-long-chain
butyrate) synthase acyl-CoA synthetase
  1403447_ 6752338_Sco; 3892714_At; Polyfunctional enzyme 2661651 In this protein, the domain of
  Rv2006 2e-27; 8e-27; of trehalose metabolism apparent eukaryotic origin
  Bacterial-eukaryotic 23-240; 264-521; is flanked by bacterial domains
phosphatase; trehalose-6-phosphate from both sides
6448751_Sco; phosphatase
0.0;
534-1320;
trehalose hydrolase
  2896788_ 117648_Ec; 3073773_Mm; Polyfunctional enzyme 2337823 The presence of the stand-alone
  Rv2051c 1e-16; 4e-31; of lipid metabolism (M. leprae); version of the eukaryotic
  Bacterial-eukaryotic 94-514; 588-829; 6468712 domain in Streptomyces suggests
apolipoprotein dolichol-phosphate- (Streptomyces an ancient horizontal transfer
N-acyltransferase mannose synthase coelicolor)
  2791523_ 6225563_Scy; 1098605_Cnu; Multifunctional enzyme None
  Rv2483c 7e-16; 5e-22; of phospholipid
  Bacterial-eukaryotic 36-253; 289-492; metabolism
phosphoserine 1-acyl-sn-
phosphatase glycerol-3-phosphate
acyltransferase
  2894233_ 2633801_Bs; 4538974_At; Molybdopterin synthase 2076687 The same domain organization
  Rv3323c 3e-19; 7e-06; is seen in D. radiodurans, but in
  Bacterial-eukaryotic 89-208; 2-82; this case, both components
molybdopterin molybdopterin appear to be of bacterial origin
synthase large subunit synthase small subunit
(MoaE) (MoaD)
  2960152_ 4753872_Sco; 466119_Ce; cAMP-regulated 2501688 M. tuberculosis encodes two
  Rv3728, 1e-35; 7e-20; efflux pump(?) strongly similar paralogs with
  7477551_ 56-428; 549-964; the same domain architecture
  Rv3239c transmembrane cAMP-binding domain-
  Bacterial-eukaryotic efflux protein phosphoesterase
  2960153_ 4731342_Sl; 1591330_Mj; Bifunctional enzyme 1806159 The amino-terminal domain
  Rv3729 3e-14; 3e-58; of molybdenum stand-alone paralog is more
  Bacterial-archaeal 510-776; molybdenum cofactor biosynthesis similar to archaeal homologs
C5-O-methyl- cofactor biosynthesis than to the stand-alone paralog,
Transferase protein MoaA but nevertheless, the latter
(mitomycin (Fe-S oxidoreductase) appears to be of archaeal origin
biosynthesis)
  3261806_ 40487_Cg; 7304009_Dm; Secreted protein 7649504 The stand-alone version of the
  Rv3811 3e-12; 2e-12; (S. coelicolor) eukaryotic domain is present
  Bacterial-eukaryotic 404-494; 198-384; only in Streptomyces
major secreted peptidoglycan
protein recognition protein
Treponema pallidum
  3322964_ 7225946_Nm; 320868_Sc; Uridine kinase None A co-linear ortholog is present
  TP0667 9e-04; 2e-13; in Thermotoga
  Bacterial-eukaryotic 10-154; 290-488;
threonyl-tRNA uridine kinase
synthetase (TGS and
H3H domains)
Thermotoga maritima
  4981276_ 68516_Bs; 3218401_Sp; Uridine kinase None A co-linear ortholog is present
  TM0751 3e-07; 2e-11; in Treponema
  Bacterial-eukaryotic 11-200; 288-475;
threonyl-tRNA uridine kinase
synthetase (TGS and
H3H domains)
Eukaryotes
Saccharomyces cerevisiae
  536367_ 586134_Bt; 7450047_Aa; Bifunctional signal- 5249 SurE homologs are not
  Ybr094w 9e-10; 8e-09; transduction protein (Yarrowia detectable in eukaryotes other
  Eukaryotic/ tubulin-tyrosine ligase acid phosphatase lipolytica) than yeasts
  Bacterial-archaeal (SurE)
  1431219_ 577625_Hs; 3328426_Ct
  YDL141w 1e-39 5e-27;
  Eukaryotic- Biotin-[propionyl- biotin protein ligase Bifunctional biotin- None An ortholog with an identical
  bacterial CoA-carboxylase(ATP- protein ligase domain architecture is present
hydrolysing)] ligase in S. pombe
  458922_ 477096_Gg; 1653075_Ssp; heat shock NONE An ortholog with an identical
  YHR206W 8e-18; 7e-17; transcription domain architecture is present
  Eukaryotic-bacterial 78-216 375-503; factor in S. pombe (3327019)
heat shock CheY domain
transcription factor
domain 2983676_Aa; Siroheme synthase 2330809 S. pombe also encodes a co-linear
  486539_ 1146165_At; 1e-04; (S. pombe) ortholog (3581882); apparent
  YKR069w 3e-34; 22-188; displacement of the bacterial
  Eukaryotic-bacterial 249-556; precorrin-2 oxidase precorrin-2 oxidase by a distinct
urophorphyrin III Rossmann fold domain
methylase
  1302305_ 4938476_At; 3212189_Hi; Multifunctional enzyme None Co-linear orthologs in S. pombe
  YNL256w 5e-65; 5e-05; of folate biosynthesis (7490442) and Pneumocystis
  Eukaryotic-bacterial 324-861 62-148; carinii (283062)
7,8-dihydro-6- 187-297;
hydroxymethylpterin- dihydro-neopterin
pyro-phosphokinase+ aldolase
Dihydro-pteroate
synthase
  1419887_ 7297709_Dm; 5918510_Sco; Bifunctional RNA 2213559 The known bacterial homologs
  YOL066c 2e-72; 2e-10; modification enzyme (S. pombe) have a two-domain organization;
  Eukaryotic-bacterial 42-408; 436-574; the evolutionary scenario could
large ribosomal pyrimidine deaminase have included domain
subunit pseudoU rearrangements
synthase
  1419865_ 2462827_At; 1075360_Hi; Transcriptional regulator None Yeast encodes three strongly
  YOL055c, 1e-39; 6e-24; of thiamine biosynthesis similar paralogs with identical
  2132251_ 22-390; 342-549; genes(?) domain organization; co-linear
  YPL258c, phosphomethyl transcriptional orthologs are present in other
  2132289_ pyrimidinekinase activator ascomycetes
  YPR121w (thiamine biosynthesis)
  Eukaryotic-bacterial
  1370444_ YPL214c 2746079_Bn; 2648451_Af; Bifunctional thiamine None Except for the one from
  Eukaryotic-archaeal/ 1e-27; 9e-27; biosynthesis enzyme A. fulgidus, all highly conserved
  Bacterial 9-233; 251-531; homologs of the kinase domain
thiamin-phosphate hydroxyethyl-thiazole of this protein are bacterial; it
pyro-phosphorylase kinase appears likely that the A. fulgidus
gene is the result of horizontal
transfer

The following complete genomes were analyzed. Archaea: Aeropyrum pernix (Ap); Archaeoglobus fulgidus (Af); Methanococcus jannaschii (Mj); Methanobacterium thermoautotrophicum (Mth); Pyrococcus horikoshii (Ph); Bacteria: Aquifex aeolicus (Aa); Borrelia burgdorferi (Bb); Bacillus subtilis (Bs); Chlamydophila pneumoniae (Cp); Deinococcus radiodurans (Dr); Escherichia coli (Ec); Haemophilus influenzae (Hi); Helicobacter pylori (Hp); Mycobacterium tuberculosis (Mt); Mycoplasma pneumoniae (Mp); Rickettsia prowazekii (Rp); Synechocystis sp (Ssp); Thermotoga maritima (Tm); Treponema pallidum (Tp). No IKFs were detected in the genomes that are not shown in the table. Additional species name abbreviations: At, Arabidopsis thaliana; Axy, Acetobacter xylinus; Bn, Brassica napus; Ce, Caenorhabditis elegans; Cvi, Chromatium vinosum; Gg, Gallus gallus; Hs, Homo sapiens; Mm, Mus musculus; Rn, Rattus norvegicus; Sco, Streptomyces coelicolor; Sl, Streptomyces lavendulae.