Table 1.
Organism/Common Name | Parent Protein or Gene/GenBank or UniProt Link | Penetratin or Penetratin Analog Sequence * | #Arg/#Lys/CPP Probability $ |
---|---|---|---|
Drosophila melanogaster/fruit fly | pAntp/P02833 | RQIKIWFQNRRMKWKK | 3/4/1.00 |
Homo sapiens/human | Hox-A5/P20719 PDX-1/P52945 HXD8/P13378 Hox-C12/P31275 | RQIKIWFQNRRMKWKK RHIKIWFQNRRMKWKK RQVKIWFQNRRMKWKK QQVKIWFQNRRMKKKR | 3/4/1.00 3/4/0.997 3/4/0.97 3/4/0.97 |
Homo sapiens/human | Pax-6/P26367 Pax-7/P23759 and Pax-3/P23760 | ARIQVWFSNRRAKWRR ARVQVWFSNRRARWRK | 5/1/0.94 5/1/0.94 |
Homo sapiens/human | PITX2/D6RFI4 | ARVRVWFKNRRAKWRKR | 5/3/0.97 |
Ciona intestinalis/sea squirt tunicate | Pax3/7-like/NP_001071798.1 | ARVQVWFSNRRAKWRR | 5/1/0.94 |
Acropora millepora/ stony coral | Pax-6/XP_029212196.2 | ARIQVWFSNRRAKWRK | 4/2/0.94 |
Capitella teleta/ annelid worm | Ct-Pax3/7 (Pax6)/A1XC54, ABC68267.1 | ARVQVWFSNRRARWRK | 5/1/0.94 |
Nematostella vectensis/sea anemone | PaxC homeodomain transcription factor/Q5IGV4 | ARVQVWFSNRRAKWRR | 5/1/0.94 |
Mnemiopsis leidyi/comb jelly | PRD10a homeobox transcription factor/ADO22618.1 | ARIQVWFQNRRAKWRK | 4/2/0.93 |
Amphimedon queenslandica/ sponge | Pax-6/XP_003387530.1 | SRVQVWFQNRRAKWRK | 4/2/0.93 |
Trichoplax adhaerens | PaxB/ACH57172.1 | ARVQVWFSNRRAKWRK | 4/2/0.92 |
Ceratocystis platani/fungi | Pax-6/KKF93291.1 | AKINNWFQNRRAKAKL | 2/3/0.86 |
Galerina marginata/Dikarya higher fungi | Homeobox containing protein fragment/A0A067SZU8 | ARIQVWFSNRRAKWRR | 5/1/0.94 |
Planoprotostelium fungivorum/amoeba | Arf-GAP with homeobox domain/ A0A2P6NXG8 | ARIQVWFSNRRAKWRR | 5/1/0.94 |
Monosiga brevicollis (Choanoflagellate) | Mb_hbx2 homeobox-domain protein/A9UP33 | QQINNWFINARRRLLNR | 4/0/0.76 |
Capsaspora owczarzaki amoebae (Filasterea clade) | CAOG_004648 Homeobox domain-containing protein/ A0A0D2VSA1 | RVIRIWFQNRRAKQRR RRQKARRNQFWIRIVRR§ | 6/1/0.96 7/1/0.96 |
Candida glabrata/budding yeast | Homeobox containing protein PHO2/Q6FKZ3 | KNVRIWFQNRRAKVRK KNVRIWFQNRRAKVRKKGKL | 4/3/0.95 4/5/0.95 |
Hanseniaspora osmophila/wine-making yeast | Regulatory protein PHO2 with homeobox domain 1/A0A1E5RMZ3 | TQVKIWFQNRRMKWKR | 3/3/0.94 |
Acinetobacter baumannii/Gram-negative bacteria | Homeobox domain-containing protein (partial)/WP_139162288.1 | RQVAVWFQNRRARWKT | 4/1/0.87 |
Klebsiella pneumoniae/Gram-negative bacteria | Homeobox domain-containing protein/WP_185963280.1 | TQIKIWFQNRRAKDHR | 3/2/0.76 |
Euryarchaeota archaeon | RYE98021.1 | RQVSVWFTNARKRIWL | 3/1/0.77 |
Acanthamoeba polyphaga mimivirus/giant virus | Putative homeobox protein/AKI80488.1 | RQIQIWFQNRRCKDRK | 4/2/0.87 |
Moumouvirus maliensis/giant virus | Homeodomain containing protein/QGR53678.1 | KQISIWFANRRAYDARK RKNGVKMTKVKKIRRSR& | 3/2/0.63 4/5/0.94 |
Megavirus chiliensis/giant virus | Putative homeobox protein/YP_004894234.1 | RQIQIWFQNRRARDSKKNR | 5/2/0.85 |
Bandra megavirus/ | Homeobox/ AUV58136.1 | RQIQIWFQNRRARDSKKIR | 5/2/0.85 |
Unclassified Mimivirus/giant virus | Homeobox protein/QZX43434.1 | RQIQIWFQNRRARDSRKNR | 6/1/0.86 |
* Bold font marks amino acid residues that are identical in type and sequence position to Drosophila pAntp penetratin; underlined residues mark the C-terminal extension of an extended penetratin analog. In all examined viruses and some Gram-negative bacteria, aspartate (D, italic font) replaces tryptophan (W) at the 14th sequence position.

$ #Arg/#Lys are the numbers of arginines and lysines in the sequence. The third number, after the second slash, is the predicted probability that the sequence is a cell-penetrating peptide (CPP), according to the www.thegleelab.org/MLCPP/ server (accessed on 7 August 2022).

§ Reversed amoebae penetratin (Filasterea clade) with added arginine.

& Homeodomain motif upstream from the penetratin analog; it is also predicted to be a CPP.
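
The #Arg/#Lys columns and the "identical to pAntp" marking described in the footnotes can be reproduced from the sequences alone. The following is a minimal sketch, not the authors' procedure: the function names `count_arg_lys` and `identical_positions` are hypothetical, and the position comparison assumes a simple ungapped, left-aligned comparison against the Drosophila pAntp sequence from the first table row. The CPP probability is not computed here, since it comes from the external MLCPP server.

```python
from __future__ import annotations

# Reference sequence: Drosophila pAntp penetratin, copied from the first table row.
PANTP = "RQIKIWFQNRRMKWKK"


def count_arg_lys(seq: str) -> tuple[int, int]:
    """Return (#Arg, #Lys) for a one-letter amino acid sequence."""
    seq = seq.upper()
    return seq.count("R"), seq.count("K")


def identical_positions(seq: str, ref: str = PANTP) -> list[int]:
    """Return 1-based positions where seq and ref carry the same residue.

    Assumption: ungapped, left-aligned comparison; positions beyond the
    shorter of the two sequences are ignored.
    """
    return [i + 1 for i, (a, b) in enumerate(zip(seq.upper(), ref)) if a == b]


if __name__ == "__main__":
    examples = [
        ("pAntp", PANTP),
        ("human Pax-6", "ARIQVWFSNRRAKWRR"),
    ]
    for name, seq in examples:
        n_arg, n_lys = count_arg_lys(seq)
        print(f"{name}: #Arg/#Lys = {n_arg}/{n_lys}, "
              f"identical to pAntp at positions {identical_positions(seq)}")
```

For the pAntp row this yields 3 arginines and 4 lysines, matching the 3/4 entry in the table.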