Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2008 Jun 27;380(1):206–222. doi: 10.1016/j.jmb.2008.04.070

Ribonuclease A Homologues of the Zebrafish: Polymorphism, Crystal Structures of Two Representatives and their Evolutionary Implications

Konstantina Kazakou 1,, Daniel E Holloway 1,, Stephen H Prior 1,2, Vasanta Subramanian 1,, K Ravi Acharya 1,
PMCID: PMC2582337  PMID: 18508078

Abstract

The widespread and functionally varied members of the ribonuclease A (RNase A) superfamily provide an excellent opportunity to study evolutionary forces at work on a conserved protein scaffold. Representatives from the zebrafish are of particular interest as the evolutionary distance from non-ichthyic homologues is large. We conducted an exhaustive survey of available zebrafish DNA sequences and found significant polymorphism among its four known homologues. In an extension of previous nomenclature, the variants have been named RNases ZF-1a–c,-2a–d,-3a–e and-4. We present the first X-ray crystal structures of zebrafish ribonucleases, RNases ZF-1a and-3e at 1.35-and 1.85 Å resolution, respectively. Structure-based clustering with ten other ribonuclease structures indicates greatest similarity to mammalian angiogenins and amphibian ribonucleases, and supports the view that all present-day ribonucleases evolved from a progenitor with three disulphide bonds. In their details, the two structures are intriguing melting-pots of features present in ribonucleases from other vertebrate classes. Whereas in RNase ZF-1a the active site is obstructed by the C-terminal segment (as observed in angiogenin), in RNase ZF-3e the same region is open (as observed in more catalytically efficient homologues). The progenitor of present-day ribonucleases is more likely to have had an obstructive C terminus, and the relatively high similarity (late divergence) of RNases ZF-1 and-3 infers that the active site unblocking event has happened independently in different vertebrate lineages.

Abbreviations: 2′,5′-ADP, adenosine-2′,5′-diphosphate; Ang, angiogenin; Amph-2, Amphinase-2 from Rana pipiens; d(CpA), 2′-deoxy(cytidylyl-3′,5′-adenosine); 6-FAM-dArUdAdA-TAMRA, 6-carboxyfluorescein-dArUdAdA-6-carboxytetramethylrhodamine; hAng, human angiogenin; hECP, human eosinophil cationic protein; hEDN, human eosinophil-derived neurotoxin; LHM, loop-based Hausdorff measure; mAng-4, murine angiogenin-4; ONC, Onconase from Rana pipiens; Pca, pyrrolidone carboxylic acid (pyroglutamate); PDB, Protein Data Bank; PEG, poly(ethylene glycol); RC-RNase, Rana catesbeiana ribonuclease; RNase A, bovine pancreatic ribonuclease A

Keywords: ribonuclease, angiogenin, X-ray crystallography, protein evolution, zebrafish

Introduction

Pyrimidine-specific ribonucleases with homology to bovine pancreatic ribonuclease A (RNase A; EC 3.1.27.5) are remarkably widespread among vertebrates. Representatives have been isolated from mammals, birds, amphibians, reptiles and, most recently, fish.1–4 Many of these enzymes are endowed with special biological activity(ies); for example, some stimulate the development of vascular endothelial cells,2,5,6 dendritic cells,7 and neurons,8,9 are cytotoxic/anti-tumoral10 and/or anti-pathogenic.11 In some cases, this is dependent on catalytic activity, but in others, it is not. In view of these features, the RNase A superfamily provides an excellent opportunity to study evolutionary forces at work on a conserved protein scaffold.

Current structural data indicate that catalytically active members of the superfamily typically comprise a polypeptide chain of 119 ± 15 residues, have an α + β topology with three conserved disulphide bond positions (SCOP12 code 54076), and employ a catalytic triad of residues (two His, one Lys) to cleave RNA on the 3′-side of pyrimidine nucleotides.13 Structural diversity comes in several forms. Some of these are more biochemical in nature, such as the oligomeric state14 or surface charge density,11 while others are more subtle, such as where the modification of loop regions alters the fine structure of the active site. Here, evolutionary forces have refined the ribonucleolytic performance of different members. Importantly, all the aforementioned forms of structural variation are known to influence biological activity.

Recent surveys of the data from zebrafish cDNA and genome sequencing projects have led to the identification and characterisation of RNase A homologues in this species.2,3 The enzymes have varying ribonucleolytic strength and are endowed variously with angiogenic and anti-bacterial activities. This, allied with sequence homology to mammalian angiogenin (Ang), has led to their classification as Ang-like and has been used to support the view that all present-day RNases evolved from Ang-like progenitors.2,3,15 In the present study, we reinvestigate the available sequence data and find that, in line with the high degree of genetic variation in this species,16 zebrafish ribonucleases exhibit significant polymorphism. At an early point in this investigation (and before the publication of the first report on zebrafish ribonuclease sequences),2 we selected a range of variants for further study. We present here the X-ray crystal structures of two representatives that yielded high-resolution data. Through systematic comparison of the most divergent regions of structure, we generate further evidence for the evolutionary origins of present-day ribonucleases. Most importantly, the structures reveal the three-dimensional details of relationships with other enzymes that are not prominent in amino acid sequence alignments and illustrate the assignment of kinship.

Results

Polymorphism

The combined efforts of two research groups have identified four nucleotide sequences encoding homologues of RNase A in the zebrafish (RNases ZF-1–4).2,3 The tissues and breeding lines from which these sequences were derived are varied, which prompted us to conduct an exhaustive survey of available zebrafish sequences. For RNases ZF-1,-2 and-3, we identified three, four and five variants, respectively, which we designate 1a–c, 2a–d and 3a–e (an extension of the nomenclature used by Pizzo and D’Alessio17), while for RNase ZF-4, we could not add to the single known form (Table 1; Fig. 1). The set of sequences includes representatives from the Tübingen (TU), AB, SJD and Singapore wild-type lines,18 and gives an indication of the variation within and between the different lines. Throughout, signal peptide cleavage sites predicted with SignalP 3.0 were consistent with those reported by Cho and Zhang,3 but were three or four residues upstream of those suggested by Pizzo et al.2

Table 1.

Zebrafish RNase variants

RNase Breeding line cDNA sourcea No. copies Representative example
Clone ID GenBank ID
ZF-1a Mixed RZPD609 1 IMAGE:3716043b,c AI476973
ZF-1b SJD SJD 5d embryo 1 IMAGE:5628280c BM533038
ZF-1c SJD SJD 5d embryo 1 IMAGE:5627130c BM573599
ZF-2a TU d 1 AL928685:89541–89993
TU FDR107 > 100 FDR107-P00072-BR_P04 EH536003
TU NIH_ZGC_7 1 IMAGE:7270177 BC133890, CN510408
AB NIH_ZGC_8 19 IMAGE:7411519 BC110104, CO812928
AB ad hoc 1 e EF382670
Mixed RZPD609 1 IMAGE:3733625b BF717998
ZF-2b AB NIH_ZGC_8 1 IMAGE:7410112c BC092908, CO798809
AB NIH_ZGC_16 1 IMAGE:7213476 BC129341, CK866751
ZF-2c AB NIH_ZGC_8 1 IMAGE:7264666c BC105745, CN326043
ZF-2d AB NIH_ZGC_8 1 IMAGE:7250488c BC142860, CN170062
ZF-3a TU d 1 BX465197:31652–32098
TU NIH_ZGC_10 1 IMAGE:8008790 DT070465
ZF-3b Unspecified ad hoc 1 b
ZF-3c AB ad hoc 1 e EF382669
ZF-3d TU FDR107 17 FDR107-P00029-BR_C05 EH493831
TU NIH_ZGC_10 3 IMAGE:7047538c CK142774
TU NIH_ZGC_7 1 IMAGE:7054926 CK026631
AB NIH_ZGC_16 13 IMAGE:7213270 CK869816
AB NIH_ZGC_8 2 IMAGE:7250821 CN170899
AB NIH_ZGC_20 1 IMAGE:7292890 CO250046
Singapore GISZF001 1 IMAGE:7151363c CK688316
ZF-3e Singapore GISZF001 1 IMAGE:6907218c CB363875
ZF-4 TU d 1 AL928685:53649–54053
AB ad hoc 1 e EF382671
a

Libraries RZPD609 and NIH_ZGC_7 are normalized, all others are not. Specifics of the various libraries are as follows: FDR107, gut and internal organs (adult); GISZF001, whole body (embryo); NIH_ZGC_7, whole body (adult); NIH_ZGC_8, liver (adult); NIH_ZGC_10, whole body (adult); NIH_ZGC_16, gut (adult); NIH_ZGC_20, mixed tissue (adult); RZPD609, whole body (embryo, late somatogenesis) + liver (adult); SJD 5d embryo, whole body (embryo, 5 days). FDR107 and GISZF001 were constructed at the Genome Institute of Singapore (unpublished), NIH_ZGC libraries by the Mammalian Gene Collection Program team,46,87 RZPD609 by Clark et al.19 and SJD 5d embryo by the WashU Zebrafish EST Project team (unpublished). Further details are listed at http://www.ncbi.nlm.nih.gov/UniGene/lbrowse2.cgi?TAXID=7955&CUTOFF=D0.

b

Data from Pizzo et al.2

c

Resequenced during the course of this work.

d

Data from the genomic assembly.

e

Data from Cho and Zhang.3

Fig. 1.

Fig. 1

Alignment of the sequences of zebrafish RNase variants. Sequences were deduced from the sources given in Table 1 (nucleic acids encoding RNases ZF-1a–c,-2b–2d,-3d and-3e were resequenced during the course of this work). Residues predicted to form the signal peptide and mature chain of each protein are written in grey and black text, respectively. Residues conserved throughout are shaded blue, while those that vary within each subclass are shaded gold. Likely members of the catalytic triad are denoted by asterisks.

A gene encoding RNase ZF-1 is yet to be found in the genome assembly (version 7 at the time of writing), and corresponding sequences are currently restricted to cDNA libraries. RNase ZF-1a is the clone identified by Pizzo et al.2 in RZPD 609 (also known as MPMGp609), a library constructed from multiple tissues extracted from multiple, non-inbred individuals.19 RNases ZF-1b and-1c are encoded by clones present in a library prepared from an SJD embryo. We obtained I.M.A.G.E. Consortium clones of RNases ZF-1a–c and found that the deposited sequences were incomplete, lacking five or six residues at the C terminus. RNases ZF-1b and-1c are more similar to each other (two mismatches) than to RNase ZF-1a (four mismatches).

The sequence of RNase ZF-2a corresponds to that reported previously,2,3 is present in the genome assembly and is well represented in the non-normalised cDNA libraries FDR107 and NIH_ZGC_8. RNase ZF-2b is found in a single instance in two independent libraries, while RNases ZF-2c and-2d are single-copy members of NIH_ZGC_8. RNase ZF-2a differs considerably from RNases 2b–d (30 mismatches) which are closely related to one another (two variable positions).

RNase ZF-3a is encoded by a gene in the genome assembly, while RNases ZF-3b and-3c correspond to the cDNA sequence variants obtained by Pizzo et al.2 and Cho and Zhang (“Dr-RNase 1”)3 using an unspecified breeding line and the AB line, respectively. RNase ZF-3d is well represented in cDNA libraries prepared from the TU and AB lines, and is present in the Singapore line, while RNase ZF-3e was detected only in the Singapore line. Among this group, the mature protein sequence shows six positions of variance.

The sequence of RNase ZF-4 was deduced by Cho and Zhang from DNA sequences obtained from the TU and AB lines (“Dr-RNase 3”).3 We found no corresponding cDNA or additional sequence information in the databases.

Ribonucleolytic activity

RNase ZF-1a cleaves tRNA with a specific activity that is almost identical with that of human angiogenin (hAng) (Fig. 2), i.e. some four to five orders of magnitude lower than that of RNase A. This is the first report of its activity toward a polymeric substrate. The relative activities of RNase ZF-1a and hAng are slightly different from those measured for the cleavage of 6-carboxyfluorescein-dArUdAdA-6-carboxytetramethylrhodamine (6-FAM-dArUdAdA-TAMRA; a short fluorogenic substrate), where the specific activity of RNase ZF-1a is 3-fold greater.2 RNase ZF-3e is 17–20-fold more active than RNase ZF-1a or hAng toward tRNA. This is approximately double the figure reported previously for the cleavage of yeast tRNA by RNase ZF-3c,3 or the cleavage of 6-FAM-dArUdAdA-TAMRA by RNase ZF-3b.2

Fig. 2.

Fig. 2

Comparative tRNA cleavage activities. Assays measured the release of perchloric acid-soluble fragments catalysed by RNase ZF-1a (●), RNase ZF-3e (Inline graphic) and hAng (▪) as described in Materials and Methods. Each datum point represents the mean of four or five measurements. In all cases, the standard deviation is less than 5% of the mean.

In the natural host, RNases ZF-1a and-3e most likely possess an N-terminal pyroglutamate residue (Pca1). The proteins assayed here differ in that Pca1 is replaced with Met(–1)–Gln1. When such a replacement is made in hAng, there is no change in ribonucleolytic activity20 owing to the presence of a relatively long N-terminal segment that distances Pca1 from the active site; the same is expected for RNases ZF-1a and-3e. Experiments described below employed forms of RNases ZF-1a and-3e possesing an N-terminal His-tag. Their specific activities differed from those of their Met(–1) counterparts by no more than 30% (data not shown). Since the formation of the catalytic site in an RNase A homologue requires the precise alignment of several non-contiguous structural elements, this indicates that the tags do not have a major impact on the overall protein conformation.

Quality of the crystal structures

His-tagged forms of RNases ZF-1a and-3e were crystallised in the presence of high-molecular mass poly(ethylene glycol) (PEG) at pH 8–8.5, yielding monomeric structures at resolutions of 1.35 Å and 1.85 Å, respectively. The final models comprise residues (–1)–126 and 5–127, respectively (negative residue numbers denote those belonging to the His tag). Both His tags are significantly disordered, as are the C-terminal residue of RNase ZF-1a and the N-terminal four residues of RNase ZF-3e, but elsewhere the main chain is well ordered. Ramachandran plots indicate that the proportions of non-Pro/Gly residues falling in the most favoured regions are 91.6 % and 90.7 %, respectively, with the remainder in the other allowed regions.

Topological overview and structure-based clustering

The main chain of both structures adopts the characteristic bi-lobed α+β-fold of the pancreatic ribonuclease superfamily (Fig. 3a and b). The two Cα atom traces are more similar to each other (rms deviation = 1.1 Å for 115 equivalent atoms) than to any other structure in the Protein Data Bank (PDB). Within the PDB, the best global matches are made with orthologues of RNase 5 (e.g. hAng) and RNase 1 (e.g. RNase A), which are of mammalian origin (Fig. 3c–e). When aligned pairwise with hAng (PDB entry 1B1I),21 RNases ZF-1a and-3e deviate by 1.6 Å (rms) over 112 and 108 equivalent atoms, respectively; with RNase A (PDB entry 1RPG),22 the corresponding figures are 1.5 Å and 1.8 Å over 107 and 112 equivalent atoms.

Fig. 3.

Fig. 3

Topological comparison. Ribbon diagrams of (a) RNase ZF-1a, (b) RNase ZF-3e, (c) hAng (PDB entry 1B1I)21 and (d) RNase A·d(CpA) (PDB entry 1RPG)22, the latter including ball-and-stick and space-filling representations of the dinucleotide inhibitor. Elements of secondary structure are labelled, as are the N-and C-terminal extremities of each structure. Disulphide bonds are shown in black in ball-and-stick form. Several regions (strands B2 and B7, the segments immediately downstream, and the H2–B1 loop) are of particular use in comparing the four proteins and are highlighted in colour. (e) Stereo superposition of the Cα traces of the four proteins obtained with CE-MC.82 Colours correspond to the highlighted regions in (a–d).

Although the RNase A superfamily presents a highly conserved core topology, its constituent clades are defined to a large degree by the structures of extraneous regions such as loops and the fringes of the secondary structure elements. A cursory inspection of the available structures reveals that the zebrafish RNases have a characteristically long H2–B1 loop (Figs. 3 and 4); human eosinophil-derived neurotoxin (hEDN), human eosinophil cationic protein (hECP) and hRNase 7 have an unusually long B6–B7 loop;23–25 amphibian RNases have a general economy of loops;26–28 and the C-terminal segment of angiogenins is obstructive to substrate binding.21,29,30 With this in mind, we sought to identify the three-dimensional relationships among RNase ZF-1a, RNase ZF-3e and a representative set of ribonucleases through alignment of their Cα coordinates and application of a loop-based Hausdorff measure (LHM) of structural dissimilarity.31 The measure was applied to eight variable segments, most being loops but some having the potential for well-defined secondary structure, e.g. those corresponding to strand B2 and the C-terminal portion of strand B7 in RNase A. The LHM scores ranged from 1.45–8.01 Å and the resulting pairwise distance matrix enabled the construction of a clustering tree (Fig. 5). The tree indicates that RNases ZF-1 and-3 are more similar to amphibian RNases and angiogenins than to other mammalian ribonucleases such as RNase A, which is consistent with previous sequence-based phylogenetic analyses.2–4 The tree also supports a pathway for the emergence of the known disulphide-bonding patterns in which all present-day RNases derive from a progenitor with three disulphide bonds.15

Fig. 4.

Fig. 4

Structure-based sequence alignment. The Cα coordinates of RNase ZF-1a, RNase ZF-3e, hAng (PDB entry 1B1I)21 and RNase A·d(CpA) (PDB entry 1RPG)22 were aligned with CE-MC.82 Elements of secondary structure are shaded (α-and 310-helices, blue; β-strands, pink) and labelled below. In the RNase A sequence, residues shown crystallographically to form the B1 and B2 subsites are coloured red and green, respectively, while those that form the P1 subsite are ringed in black. In the RNase ZF-1a and-3e sequences, positions that show polymorphism are ringed in gold. Numbering schemes for hAng and RNase A are given above and below the sequences, respectively. Residues that did not align or are not present in the respective crystal structures are written in grey text. < Q denotes a pyroglutamate residue.

Fig. 5.

Fig. 5

Structural relationships among ribonucleases. Cα coordinates of RNase ZF-1a, RNase ZF-3e, mAng-4 (PDB entry 2J4T, chain A),30 hAng (PDB entry 1B1I),21 RNase A·d(CpA) (PDB entry 1RPG),22 hRNase 4 (PDB entry 1RNF, chain A),37 hECP·2′,5′-ADP (PDB entry 1H1H),24 hEDN (PDB entry 1GQV),23 hRNase 7 (PDB entry 2HKY, model no. 15),25 RC-RNase·d(ACGA) (PDB entry 1M07, chain A),27 Amph-2 (PDB entry 2P7S)28 and ONC (PDB entry 1ONC)26 were aligned with CE-MC,82 and a loop-based Hausdorff measure (LHM) of structural dissimilarity31 was used to compute a pairwise distance matrix. A clustering tree was then constructed using the UNJ method,83 and rooted by reference to the fossil record.85 Branch lengths are scaled according to LHM distance; measured and reconstructed distances differ by 0.23 Å (mean) and peak at 0.66 Å (RNase ZF-3 versus RNase 7). Proteins with similar disulphide-bonding patterns are bracketed (the subscripted letters denote alternative 4 × (S–S) arrangements), while those that have obstructed B1 subsites and are angiogenic are marked with a red circle. Ichthyic, mammalian and amphibian clades are coloured blue, black and green, respectively.

The high resolution of the present structures permits a more detailed analysis of their structural and evolutionary links to other ribonucleases. The region that offers the greatest resource of structure-function data is the active site.13 Its central portion is conventionally divided into three subsites designated P1, B1 and B2, which bind the scissile phosphodiester linkage of the RNA substrate and the nucleotide bases located immediately upstream (a pyrimidine) and downstream (preferentially a purine), respectively.32 For all known RNase A homologues, these subsites are formed by residues located on opposing sides of the cleft between the two lobes (Fig. 3d). Across the superfamily, one of the lobes (composed principally of the B2–B3–B6–B7 β-sheet and its associated loops) exhibits structural variations that impact significantly on enzymatic performance and contribute to novel biological functions. Of particular interest is the β-hairpin formed by strands B2 and B3 at one edge, and strand B7 and the short C-terminal segment at the other. These regions are described in detail below.

P1 subsite and catalytic centre

The structure of the P1 subsite is strongly conserved between RNase A and the zebrafish enzymes. It follows that the catalytic triad of RNase ZF-1a comprises His16, Lys47 and His119, while other phosphate-binding interactions are likely to be made by the side-chain amide group of Gln15 and the main chain N atom of Tyr120 (Fig. 4). In RNase ZF-3e, the corresponding residues are His16, Lys45 and His120, plus Gln15 and Tyr121. Typically, the C-terminal catalytic histidine adopts two main conformations, designated A (productive) and B (non-productive).33 Only A is observed for His119 of RNase ZF-1a, while a 50/50 mixture of A and B is observed for His120 of RNase ZF-3e. The absence of conformation B in RNase ZF-1a appears to be due to the crystal packing arrangement, which forces the side chain of Arg8 into the space occupied by conformation B. In neither of the zebrafish RNases is there any intramolecular interaction that might stabilise conformation A. This contrasts with the most efficient catalysts of the pancreatic ribonuclease superfamily such as RNase A, in which a hydrogen bond between His119 and Asp121 makes a 102-fold contribution to kcat/Km.34

B1 subsite and C-terminal segment

Structural studies on mammalian and amphibian RNases in complex with cytidylic and uridylic inhibitors indicate that the B1 subsite is a well-conserved, close-fitting pocket that makes two or three specific interactions with the base.22,27,35–38 Each of the present structures displays an obvious homology with previously reported structures in which the bulk of the pocket lining is formed by a universally conserved set of residues: in RNase ZF-1a, His16, Val49, Asn50, Thr51 and Tyr120; in RNase ZF-3e, His16, Val47, Asn48, Thr49 and Tyr121 (the corresponding residues of RNase A are His12, Val43, Asn44, Thr45 and Phe120) (Fig. 4).

Thr51 of ZF-RNase-1a and Thr49 of ZF-RNase-3e are positioned in the same way as Thr45 of RNase A and are therefore expected to be the primary functional components of their respective B1 subsites, forming a pair of hydrogen bonds with either cytosine or uracil (Fig. 6a and b). In RNase ZF-1a, the anticipated binding position of the pyrimidine is taken by the side chain of Glu122, which makes two hydrogen bonds with Thr51 (Fig. 6a; Table 2). This is reminiscent of hAng, in which a similar obstruction is posed by the corresponding residue, Gln117.21,29 One residue upstream of the obstructive residue, the main chain begins a deviation from the neighbouring β-strand that marks the beginning of a novel C-terminal conformation (Figs. 3a and 6a). There is no interaction between the two residues that flank the obstructive residue (His121 and Asp123), contrasting with the two hydrogen bonds that link the corresponding residues of hAng (Asp116 and Ser118). Furthermore, the Glu122–Asp123 peptide bond is flipped relative to its hAng counterpart, and the C-terminal segment bulges out from the main body of the protein rather than inwards as is the case with hAng. Overall, the C-terminal conformation is less obstructive than that found in hAng; notably, Val125 points away from the B1 subsite, while its hAng counterpart, Phe120, encroaches on the pyrimidine-binding zone. The segment is stabilised chiefly by a network of electrostatic and polar interactions involving the side chains of two residues from the C-terminal segment (Asp123 and Asn126) and two residues from the main body of the protein (Lys85 and Arg106) (Fig. 6a; Table 3). In this respect, it bears some resemblance to murine angiogenin-4 (mAng-4; PDB entry 2J4T),30 which features an electrostatic interaction between the side chain of Arg99 (the counterpart of Arg106) and the free carboxylate group of the C terminus.

Fig. 6.

Fig. 6

B1 subsite and C-terminal segment. RNase ZF-1a, RNase ZF-3e and hAng (PDB entry 1B1I)21 were aligned with the RNase A·uridine vanadate complex (PDB entry 1RUV)86 on the basis of the Cα positions of His12, Lys41, Thr45 and His119 in the latter. Shown in stereo are: (a) RNase ZF-1a (carbon, gold; nitrogen, blue; oxygen, red) superposed with hAng (grey); and (b) RNase ZF-3e (carbon, green; nitrogen, blue; oxygen, red) superposed with RNase A (grey). In both panels, the uridine vanadate moiety is shown in purple (pyrimidine ring labelled). A chloride ion in the RNase ZF-3e structure and two water molecules in the RNase A·uridine vanadate structure are shown as gold and grey spheres, respectively. Residue labels are coloured in accordance with the colouring of carbon atoms. Broken lines denote hydrogen bonds. The side chain of hAng Arg121 is omitted for clarity.

Table 2.

Potential hydrogen bonds in the B1 subsite and C-terminal regions

RNase A·Uvana
RNase ZF-3e
hAngb
RNase ZF-1a
Bond Length (Å) Bond Length (Å) Bond Length (Å) Bond Length (Å)
A. Pyrimidine binding and mimicry
Thr45 N–O2 Ura 2.88 Thr49 N–Cl CL1 3.18 Thr44 N–Oε1 Gln117 2.87 Thr51 N–Oε2 Glu122 2.91
Ura N3–Oγ1 Thr45 2.79 Wat79 O–Oγ1 Thr49 3.07 Gln117 Nε2–Oγ1 Thr44 3.09 Thr51 Oγ1–Oε1 Glu122 2.64
Thr45 Oγ1–Oδ1 Asp83 2.71 Thr49 Oγ1–Oγ1 Thr86 3.42 Thr44 Oγ1–Oγ1 Thr80 2.84
His12 Nδ1–O Thr45 2.69 His16 Nδ1–O Thr49 2.91 His13 Nδ1–O Thr44 2.93 His16 Nδ1–O Thr51 2.80



B. Stabilisation of C-terminal conformation
(a) Interactions with external regions
 Ile107 N–O Ala122 2.73 Val110 N–O Arg123 2.80
 Ala122 N–O Ile107 3.26 Arg123 N–O Val110 3.37
 His105 N–O Val124 2.82 Lys108 N–O Val125 2.99
 Val124 N–O His105 2.78 Val125 N–O Lys108 2.90
 Lys66 N–Oδ2 Asp121 2.81
 Lys66 Nζ–O Asp121 2.67
 – Arg107 Nη1–O Gly124 2.88 Arg106 Nη1–Oδ1 Asp123 2.92
 – Arg106 Nη2–Oδ2 Asp123 2.96
 – Lys85 Nζ–Oδ2 Asp123 3.18
 – Lys85 Nζ–Oδ1 Asn126 2.77
(b) Internal interactions
 His119 Nε2–Oδ1 Asp121 2.66
 – Phe120 N–O Gln117 2.97
 – Arg121 N–O Ser118 3.14 Asn126 Nδ2–O Asp123 2.73
 – Ser118 N–Oδ1 Asp116 3.07
 – Ser118 Oγ–Oδ1 Asp116 2.46

Potential hydrogen bonds were identified with HBPLUS88 using default criteria (D–H···A angle > 90°, H···A distance < 2.5 Å).

a

PDB entry 1RUV. 86

b

PDB entry 1B1I.21

Table 3.

Crystallographic statistics

RNase ZF-1a RNase ZF-3e
A. Diffraction data
Space group P21 I222
Unit cell parameters
 a (Å) 33.2 43.2
 b (Å) 39.4 61.0
 c (Å) 46.1 115.2
 β (deg) 98.8
Resolution range (Å) 50–1.35 50–1.85
No. reflections measured 188,114 189,062
No. unique reflections 27,012 13,420
Rsymmab 0.043 (0.313) 0.115 (0.248)
I/σ(I) 22.8 (2.0) 14.5 (5.1)
Completeness (%) 93.7 (61.8) 98.8 (90.6)



B. Refined model
Rcrystb 0.185 0.208
Rfreec 0.226 0.260
Deviation from ideality (r.m.s.)
 Bond lengths (Å) 0.007 0.013
 Bond angles (deg) 1.37 1.83
No. atoms
 Protein 1065 982
 Water 159 85
 Chloride 1 2
Mean B-factor (Å2)
 Protein 10.0 28.2
 Water 19.9 34.7
 Chloride 45.8 27.4

Values in parentheses refer to the outermost shell (1.40–1.35Å and 1.92–1.85 Å for RNase ZF-1a and RNase ZF-3e, respectively).

a

Rsymm = ∑hi[|Ii(h)–〈I(h)〉|/∑hiIi(h)], where Ii is the ith measurement and 〈I(h)〉 is the weighted mean of all measurements of I(h).

b

Rcryst = ∑h|FoFc|/∑hFo, where Fo and Fc are the observed and calculated structure factor amplitudes of reflection h, respectively.

c

Rfree is equal to Rcryst for a randomly selected 5 % subset of reflections not used in the refinement.74

In RNase ZF-3e, the B1 subsite is unobstructed and the position expected for the 2-keto group of a bound pyrimidine is occupied by a chloride ion from the crystallisation medium (Fig. 6b). This has also been observed for RNase A when crystallised in the presence of a high concentration of chloride ions, e.g. PDB entry 1RNX.39 The C-terminal segment follows a path similar to that of RNase A, i.e. strand B7 maintains an extended antiparallel link to strand B6 (Figs. 3b and d). This is further stabilised by a hydrogen bond between Gly124 and the side chain of Arg107 (Fig. 6b; Table 3). Arg123, the counterpart of hAng Gln117, is too large a residue to occupy the pyrimidine-binding pocket and its side chain is instead oriented toward the solvent. As such, it is very similar to the Arg/Lys residues found at the corresponding positions in hEDN, hRNase 7 and Amphinase-2 (Amph-2) (PDB entries 1GQV, 2HKY and 2P7S, respectively23,24,28).

The cytidine/uridine specificity of the B1 subsite varies greatly among pancreatic ribonuclease superfamily members and is known to be modulated by two means. First, the hydrogen-bonding selectivity of the main Thr residue can be influenced by a neighbouring residue and second, additional interactions with the substrate may be made by other residues. In RNase ZF-1a, Lys85 lies adjacent to Thr51 and is the counterpart of the modulatory residues Thr80 and Asp83 that are found in hAng and RNase A, respectively (Fig. 6a and b). No other known RNase structure features a Lys residue at this position. The Nζ group of Lys85 makes no interaction with the side-chain hydroxyl group of Thr51 and is instead engaged in interactions with Asp123 and Asn126 of the C-terminal segment (Fig. 6a; Table 3). However, it is likely that substrate binding triggers a significant restructuring of the C-terminal segment. During this process, the side chain of Lys85 could become involved in substrate binding. It may form a hydrogen bond with Thr51 or one directly with the 4-keto group of uracil. The guanidino group of Arg106 may also interact with the 4-keto group of uracil, thus emulating Arg101 of RNase 4, a residue that contributes to the strong uridine preference of that enzyme.37,40

In RNase ZF-3e, Thr86 corresponds well with Thr80 of hAng (Figs. 6a and b), a residue known to promote uridine selectivity by weakening the interaction between the main Thr residue (Thr44) and the N3 atom of cytosine.41,42 The distance between the Oγ1 atoms of Thr49 and Thr86 measures 3.42 Å in the present structure, suggesting that a weak hydrogen bond is present and that Thr86 plays a role similar to that of Thr80 in hAng. Arg107 (the counterpart of Arg101 in RNase 4/Arg106 in RNase ZF-1a ) makes two hydrogen-bonding interactions that orient its guanidino group away from the B1 subsite, reducing the likelihood of a role in substrate binding.

In neither of the zebrafish RNases is there a direct counterpart of Ser123 of RNase A, a residue that forms two energetically significant water-mediated hydrogen bonds with the 4-keto group of uridine.43

B2 subsite and B2–B3 β-hairpin

The ways in which adenylic and guanylic inhibitors are bound by various RNases22,27,38,44,45 infer that the B2 subsites of RNases ZF-1a and-3e have several familiar features in common. First, as is the case throughout the superfamily, the imidazole ring of the C-terminal catalytic histidine is positioned to make stacking interactions with the purine ring system (Fig. 7a and b). Second, Glu113 of RNase ZF-1a and Glu114 of RNase ZF-3e are poised to mimic Glu111 of RNase A/Glu97 of Rana catesbeiana ribonuclease (RC-RNase)/Glu91 of Onconase (ONC) and may, therefore, form hydrogen bonds with the N1 and/or N2 atoms of guanine. Third, any contribution from the loop of the B2–B3 β-hairpin is likely to be modest. In each case, the loop is fairly short (two residues in RNase ZF-1a, five in RNase ZF-3e) but includes at least one residue (Asn73 in RNase ZF-1a, Asn72 and Asp74 in RNase ZF-3e) that could conceivably interact with the heteroatom at position 6 of a bound purine, although this is most likely to be via a water molecule. Thus, this region is more elaborate than in amphibian RNases (where it is abridged) and is most akin to hAng where the loop comprises three residues (Fig. 3c). It contrasts with several mammalian enzymes such as RNases 1–4 and 7 where the loop is somewhat longer (eight residues in RNase A), is stabilised by an internal disulphide bond, and hydrogen-bonds directly to adenine (Figs. 3d and 7b).

Fig. 7.

Fig. 7

B2 subsite. RNase ZF-1a, RNase ZF-3e and the RC-RNase·d(ACGA) complex (PDB entry 1M07)27 were aligned with the RNase A·d(CpA) complex (PDB entry 1RPG)22 on the basis of the Cα positions of Ala109, Glu111 and His119 in the latter. Shown in stereo are: (a) RNase ZF-1a (carbon, gold; nitrogen, blue; oxygen, red) superposed with RC-RNase (grey); and (b) RNase ZF-3e (carbon, green; nitrogen, blue; oxygen, red) superposed with RNase A (grey). In each panel, a portion of dinucleotide (purple; purine ring labelled) and a surface representation of the subsite are contributed by the liganded structure. Residue labels are coloured in accordance with the colouring of carbon atoms. < Q1 denotes a pyroglutamate residue and broken lines denote hydrogen bonds. Conformation B of RNase ZF-1a Glu113, RNase ZF-3e His120 and RNase A Gln69 are omitted for clarity. RNase ZF-3e residues Arg73 and Asp74 are disordered beyond Cβ.

There is one notable difference between the B2 subsites of the two zebrafish RNases. While RNase ZF-3e possesses an Ala residue at position 112 (a likely functional counterpart of Ala109, a simple pocket-lining residue of RNase A), RNase ZF-1a has a Lys residue at the corresponding position (residue number 111). The only precedent for this is Lys95 of RC-RNase. In the crystal structure of RC-RNase in complex with d(ACGA) (PDB entry 1M07), the side chain of Lys95 makes van der Waals interactions with guanine and stabilises the position of Glu97 via an Nζ–Oε2 hydrogen bond (Fig. 7a), thereby making a significant contribution to the guanine preference of the B2 subsite of that enzyme.27 Although in the present structure of RNase ZF-1a the side chain of Lys111 is obstructive to purine binding and appears to stabilise a wayward conformation of Glu113, substrate binding may trigger a rearrangement in which Lys111 and Glu113 come to mimic Lys95 and Glu97 of RC-RNase. Indeed, the two RC-RNase residues are also somewhat mobile, as indicated by their conformations in an unpublished nucleotide-free structure (PDB entry 1KM9).

Discussion

Our survey of zebrafish sequences revealed significant polymorphism of RNases ZF-1–3. Although the inherent error rates of thermostable polymerases, reverse transcriptase and other procedures give rise to artifacts in all cDNA collections, there are several pointers as to the validity of the current analysis. First, the degree to which experimental artifacts contribute to the sequence variations observed in carefully constructed full-length cDNA libraries such as those of the Mammalian Gene Collection is low, at around 10%.46 Second, in several cases (e.g., RNases ZF-2a,-2b and-3b), representatives were found in multiple libraries, which would be highly unlikely to occur by chance. Third, we resequenced representative clones to ensure that the quality of the DNA sequencing was high. With regard to RNase ZF-1 (a copy of which is yet to be found in genomic DNA), each of the three variants that we identified must be taken as equally valid at the present time. In addition, the finding that RNases ZF-2b–d differ from the previously identified RNase ZF-2a at nearly one-quarter of all positions may necessitate the creation of a new RNase ZF-5 class.

Our structures of RNases ZF-1a and-3e are the first ichthyic ribonuclease structures to be reported. Classification using a loop structure-based dissimilarity measure produced a biologically meaningful tree (Fig. 5), contrasting with trees derived from measures of core similarity (not shown) and confirming the potential of this approach for obtaining systematic relationships from structural data.47 Regions that are not part of the conserved core tend not to be critical to the structural integrity of proteins and evolve mostly through insertion-deletion events.48 The tree suggests that this type of event has had a major role in the formation of the present-day ribonuclease clades and in the development of their special biological activities. Attributing statistical significance to particular branches of the tree is currently not possible. Character-based confidence tests, such as bootstrapping,49 are not appropriate, but analytical tests such as those based on weighted least-squares50 could be used if the error of the dissimilarity measure was known. However, this cannot be defined satisfactorily at present, as it depends on multiple factors including the resolution of the structures, the quality of the structural alignment and any disorder in the loop regions.

While sequence data suggest that all present-day ribonucleases descend from a progenitor with three disulphide bonds,2–4,15 the structure of RNase ZF-3e advances the debate. Its combination of three disulphide bonds and an unobstructive C terminus—which is also likely to be a feature of some reptilian and avian ribonucleases6,51—raises the question of whether the C terminus of the progenitor was obstructive or not. The clustering tree supports both possibilities (Fig. 5). In addition, the relatively high level of similarity (late divergence) of RNases ZF-1 and-3 infers that the unblocking/blocking event has happened independently in different vertebrate lineages. Since the low level of catalytic efficiency that results from an obstructive C-terminus is highly correlated with the presence of angiogenic activity, it seems most likely that the ancestral ribonuclease had an obstructive C terminus; the convergent evolution of angiogenic activity in vertebrate classes as distantly related as mammals and fish seems highly improbable in comparison.

Several regions of the new structures provide functional insight. In the region of the C-terminal segment, RNase ZF-1a bears an obvious (though incomplete) resemblance to hAng. The binding of RNA substrates to hAng requires a major structural rearrangement of this region,52 and the same appears to be true for RNase ZF-1a, which most likely underpins its low level of ribonucleolytic activity. The unobstructed B1 subsite of RNase ZF-3e identifies it not with hAng but with most of the other mammalian and all of the known amphibian ribonucleases. As expected, its ribonucleolytic activity is significantly greater than that of RNase ZF-1a or hAng. Since mutations that elevate ribonucleolytic activity are known to be detrimental to the angiogenic activity of mammalian angiogenins,30,53,54 this most likely accounts for the lack of angiogenic activity in its close relative, RNase ZF-3b.2 Its enzymatic prowess is still, however, several orders of magnitude lower than that of some relatives, e.g. RNase A. Contributions to this are likely to come from the lack of interactions stabilising the productive tautomer of His120 and the more rudimentary nature of its substrate-binding subsites.

At the B1 subsite of RNase ZF-1a, the position of Lys85 and the correspondence of Arg106 to Arg101 of RNase 4 suggest that this enzyme will bind uridine more avidly than cytosine,37,40 whereas in RNase ZF-3e, the positions of Thr86 and Arg107 suggest an ambivalence as for hAng.41,42 At the B2 subsite, predictions of base selectivity are hampered somewhat by the variable positioning and orientation of the purine when bound by different enzymes.22,27,38,44,45 The resemblance of RNase ZF-1a to RC-RNase suggests a preference for guanine over adenine here but the likely preference of RNase ZF-3e is unclear.

The 11 polymorphisms identified in RNases ZF-1 and-3 are spread over several structural elements (Fig. 4). The residues found at these positions are hydrophilic or small/non-polar and are all at exposed positions, eight being within loops. Most of the substitutions are unlikely to affect substrate binding or catalysis, although the three substitutions in the region of the second catalytic histidine and C-terminal segment may have small effects. It is possible that the Ala/Thr119 and Arg/Lys123 substitutions contribute to the apparent difference between the activities of RNase ZF-3e and-3b/c.

The wealth of ribonuclease crystal and solution structures permits some predictions about the structures of the remaining zebrafish enzymes. RNase ZF-2 is similar in length to RNases ZF-1 and-3 (Fig. 1). Its H2–B1 and B2–B3 loops are likely to be roughly the same length as those of RNase ZF-1a but its C-terminal segment is unusually long. If we define the position of the second catalytic histidine in the sequence as the origin, there is a Glu residue at +3 and large hydrophobic residues (both Ile) at +5 and +6. The same motif is found in hAng, suggesting that the C terminus of RNase ZF-2 obstructs the B1 subsite. This is consistent with the magnitude of its ribonucleolytic activity, which is small and similar to that of hAng.2,3 The specificity-determining elements of the B1 subsite appear similar to those of RNase ZF-3e, while those of the B2 subsite may be novel. In contrast, the sequence of RNase ZF-4 differs widely from those of RNases ZF-1–3. It is 14–16 residues shorter, with the majority of its deletions predicted to fall in loops such as those in the N-terminal, H2–B1 and B2–B3 segments. As in amphibian RNases such as ONC, the short N-terminal segment may permit the participation of Pca1 in catalysis,55 while the B2–B3 segment appears to be abridged. The C terminus is short and hydrophilic with an Arg residue at position +3. Therefore, the B1 subsite is likely to be unobstructed, as observed for RNase ZF-3e and other enzymes with an Arg/Lys in this position.23,24,28 This is consistent with its elevated ribonucleolytic activity.3 The specificity-determining elements of the B1 subsite appear to be novel but the presence of direct counterparts of ONC residues Thr89 and Glu91 suggest that the B2 subsite may be similar to that of the amphibian enzyme.38

Materials and Methods

Identification of zebrafish ribonuclease variants and selection of clones

Translated zebrafish nucleotide sequences held on the Zebrafish Information Network56 were first compared to known zebrafish ribonuclease amino acid sequences2,3 using TBLASTN.57 Follow-up searches of I.M.A.G.E. Consortium58 clustered expressed sequence tags were conducted using the IMAGEne server, v. 4.9.59 Sequence alignments were performed with ClustalW.60 Signal peptide cleavage positions were determined with SignalP 3.0.61

I.M.A.G.E. Consortium cDNA clones 3716043, 5628280 and 6907218, which encode RNases ZF-1a,-1b and-3e, respectively, were obtained from RZPD Deutsches Ressourcenzentrum für Genomforschung (Berlin, Germany). The plasmid inserts were sequenced from each end at MWG Biotech (London, UK) using appropriate vector-based primers. Clones 5627130, 7410112, 7264666, 7250488, 7047538 and 7151363, which encode RNases ZF-1c,-2b,-2c,-2d,-3d and-3d (again), respectively, were sequenced in the same manner at Geneservice Ltd (Cambridge, UK).

Construction of expression strains

The coding sequences of mature RNases ZF-1a and-3e were amplified from the aforementioned cDNA clones by PCR with KOD Hot Start DNA polymerase (Novagen) in conjunction with the following primers: for RNase ZF-1a

  • 5′-CATATGCAACCTGAACATGTAAAGGAGC-3′ (forward)

  • 5′-CTCGAGTCAGCCTACGTTAACTTCGTCT-3′ (reverse) for RNase ZF-3e

  • 5′-CATATGCAACCAGCAGAAATAAGGCG-3′ (forward)

  • 5′-CTCGAGCTAAACAATAACACCTCTTTCATAGTG-3′ (reverse)

Amplification products were A-tailed with Taq polymerase/dATP and ligated with EcoRV-cleaved, T-tailed pGEM®-T Easy Vector (all Promega) to form pGEM-z1a and pGEM-z3e, respectively. The mixture was then used to transform Escherichia coli JM109 cells and authentic clones were identified by restriction digestion and DNA sequencing.

For crystallographic experiments, each protein was expressed with an N-terminal His6-containing tag. An RNase ZF-1a expression plasmid was created by use of a ligation-independent cloning PCR approach. The gene was first PCR-amplified from pGEM-z1a with the primers

  • 5′-CACCACCACCACATGCAACCTGAACATGTAAAGGAGCG-3′ (forward)

  • 5′-GAGGAGAAGGCGCGTTAGCCTACGTTAACTTCGTCTTCATGATAATGCAC-3′ (reverse)

The products were treated with phage T4 DNA polymerase (Novagen)/dATP to generate single-stranded overhangs and annealed with suitably prepared pET-YSBLIC62 (a pET-28a(+) derivative kindly donated by Dr M.J. Fogg, University of York, UK) to form pYSBLIC-z1a. For expression of RNase ZF-3e, the gene was excised from pGEM-z3e with NdeI and XhoI and ligated with similarly treated pET-15b (Novagen) to form pET15-z3e. The tagged proteins possessed N-terminal extensions of 11 and 21 residues, respectively.

For assay of ribonucleolytic activity, each protein was expressed in Met(–1) form, without a His tag. The sequence encoding RNase ZF-1a was PCR-amplified from pYSBLIC-z1a with the primers

  • 5′-CATCACCACCACCATATGCAACCTGAACATGTAAAG-3′ (forward)

  • 5′-GGAGAACTCGAGTTAGCCTACGTTAACTTCGTC-3′ (reverse)

and treated with NdeI and XhoI, while that encoding RNase ZF-3e was excised from pET15-z3e using the same enzymes. The DNA fragments were ligated with similarly digested pET-22b(+) (Novagen) to form pET22-z1a and pET22-z3e, and the absence of unintended mutation was established by DNA sequencing. These constructs do not make use of the pelB leader or His-tag encoded by pET-22b(+). All expression constructs were used to transform E. coli BL21-CodonPlus(DE3)-RIL cells (Stratagene).

Purification of recombinant proteins

Recombinant proteins were prepared from expression strains as described.63 Briefly, strains were cultured at 37 °C in Terrific Broth64 supplemented with 0.5 % (w/v) glucose and appropriate antibiotic(s) (for pYSBLIC-z1a, 50 μg ml− 1 kanamycin and 25 μg ml− 1 chloramphenicol; for pET15-z3e, pET22-z1a and pET22-z3e, 100 μg ml− 1 ampicillin). Expression was induced by addition of isopropyl-β-D-thiogalactopyranoside to 1 mM, which directed the target proteins into inclusion bodies. These were solubilised and refolded, then purified by SP-Sepharose chromatography followed by C4 reversed-phase HPLC. Purified proteins were lyophilised and reconstituted in AnalaR grade water, at which point purity was > 98 % as judged by SDS-PAGE. Protein concentration was determined by measuring UV absorbance and using an estimated ε28065 of 13,325 M− 1 cm− 1 for all proteins.

Proteins were authenticated by electrospray ionisation time-of-flight mass spectrometry. Analyses were conducted by Dr A.T. Lubben (Department of Chemistry, University of Bath, UK) using a micrOTOF instrument (Bruker Daltonik). The recorded molecular masses of [Met(–1)]-RNase ZF-1a and-3e were 14679.3 Da and 14860.3 Da, respectively, which were within 1 Da of the predicted mass. Those recorded for His-tagged RNase ZF-1a (15,733.4 Da) and-3e (16,892.3 Da) were consistent with the presence of vector-encoded N-terminal extensions of sequence

  • GSSHHHHHHM and

  • MGSSHHHHHHSSGLVPRGSHM,

respectively, and were equally accurate.

Ribonucleolytic activity assay

tRNA cleavage assays were conducted essentially as described,66 except that the buffer was 33 mM Na-Hepes (pH 7.0) and incubations were for 2 h. Assays contained 2 mg ml− 1 yeast tRNA (Sigma), 0.1 mg ml− 1 bovine serum albumin (Worthington Biochemical Corp.) and six concentrations of test protein. The reaction was terminated by the addition of 2 vol. ice-cold 3.4 % (w/v) perchloric acid and incubated on ice for 10 min. The mixture was centrifuged at 13,200 g for 10 min at 4 °C, after which the A260 of the supernatant was used as a measure of RNase activity. The magnitude of the activity in the most linear portion of each plot (three or four data points) was used in the calculation of relative activities. [Met(–1)]-hAng (available from a previous study30) and RNase A (Sigma) were used as reference samples and were found to have activities in accordance with previous measurements.67

X-ray crystallography

Crystals of His-tagged RNase ZF-1a and-3e were grown at 16 °C by the hanging-drop, vapour-diffusion method. Protein solution (10 mg ml− 1 in water) was mixed with an equal volume of reservoir solution (for RNase ZF-1a, 25 % PEG 4000, 0.2 M (NH4)2SO4, 0.1 M Tris–HCl (pH 8.0); for RNase ZF-3e, 25 % PEG 4000, 0.2 M LiCl, 0.1 M Tris–HCl (pH 8.5)). RNase ZF-1a formed amorphous crystals within 3 days while RNase ZF-3e formed plate-like crystals within one week. Diffraction data from single crystals were collected on stations 14.2 (RNase ZF-1a, room temperature) and 10.1 (RNase ZF-3e, 100 K) of the Synchrotron Radiation Source (Daresbury, UK), each of which was equipped with a Quantum-4 CCD detector (Area Detector Systems Corp.). For RNase ZF-3e, the PEG 4000 concentration in the mother liquor was increased to 30 % to provide cryoprotection. All data were processed using the HKL suite68 and intensities were truncated to amplitudes using TRUNCATE.69,70 Matthews coefficients71 indicated that both types of crystal contained one molecule per asymmetric unit (for RNase ZF-1a, VM = 1.9 Å3 Da− 1 and solvent content = 35 %, v/v); for RNase ZF-3e, VM = 2.2 Å3 Da− 1 and solvent content = 45 %, v/v). Detailed data processing statistics are given in Table 3.

Similar approaches were used for determination of each structure. Initial phases were obtained using the molecular replacement routines of PHASER.72 For RNase ZF-1a, the search model was derived from the coordinates of [Met(–1)]-hAng (PDB entry 1ANG29) while for RNase ZF-3e, it was derived from those of RNase ZF-1a. The resultant models were refined using REFMAC573 with 3 % (RNase ZF-1a) or 5 % (RNase ZF-3e) of reflections set aside for cross-validation.74 After an initial round of rigid-body refinement, rounds of restrained refinement were interspersed with electron density map calculations and manual adjustments using COOT.75 On the basis of mFoDFc electron density, side-chain atoms were omitted at some positions (RNase ZF-1a: Lys6; RNase ZF-3e: Ile5, Arg7, Arg8, Arg28, Arg73, Asp74) and dual conformations were modelled at others (RNase ZF-1a: Met(–1), Arg8, Gln15, Met21, Val23, Ser28, Thr38, Ser40, Asn56, Lys57, Ser69, Asn89, Glu98, Arg100, Thr102, Glu113; RNase ZF-3e: Val29, Val111, His120). Chloride ions were added manually, while water molecules were added using ARP-wARP76,77 at positions where mFoDFc electron density peaks exceeded 3σ and potential hydrogen bonds could be made. Model validation was conducted with the RCSB PDB Validation Suite78 and the WHAT_CHECK server.79 Detailed statistics for each model are given in Table 3. Figures were drawn with PyMOL§ (DeLano Scientific, San Carlos, CA, USA).

Structural alignments and clustering

Global similarities between the present crystal structures (minus His tag residues) and those in the Protein Data Bank were identified using the EBI secondary-structure matching tool.80 Subsequent structural alignments were performed with CE (pairwise)81 and CE-MC (multiple).82 A pairwise distance matrix was derived using a loop-based Hausdorff measure (LHM) of structural dissimilarity,31 and proteins were clustered using the unweighted neighbour-joining method, UNJ,83 as implemented in T-REX.84

Protein Data Bank accession codes

The coordinates and structure factors of RNase ZF-1a and RNase ZF-3e have been deposited in the RCSB Protein Data Bank under accession codes 2VQ8 and 2VQ9, respectively.

Acknowledgements

This research is supported by the Wellcome Trust (U.K) through a Programme Grant (083191) to K.R.A. and by the University of Bath through a Postgraduate Studentship to K.K. We thank Dr. Benedict Crabtree (University of Bath) for his contribution during the early stages of this work and Dr. Thomas Madej (National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD) for providing computer scripts and the program for calculation of LHM scores. We acknowledge the support provided by the Station Scientists at the Synchrotron Radiation Source (Daresbury, UK), Dr. Nethaji Thiyagarajan and Dr. Shalini Iyer during X-ray data collection.

Edited by M. Guss

Footnotes

Contributor Information

Vasanta Subramanian, Email: bssvss@bath.ac.uk.

K. Ravi Acharya, Email: bsskra@bath.ac.uk.

References

  • 1.Beintema J.J., Breukelman H.J., Carsana A., Furia A. Evolution of vertebrate ribonucleases: ribonuclease A superfamily. In: D'Alessio G., Riordan J.F., editors. Ribonucleases: Structures and Functions. Academic Press; New York: 1997. pp. 245–269. [Google Scholar]
  • 2.Pizzo E., Buonanno P., Di Maro A., Ponticelli S., De Falco S., Quarto N. Ribonucleases and angiogenins from fish. J. Biol. Chem. 2006;281:27454–27460. doi: 10.1074/jbc.M605505200. [DOI] [PubMed] [Google Scholar]
  • 3.Cho S., Zhang J. Zebrafish ribonucleases are bactericidal: implications for the origin of the vertebrate RNase A superfamily. Mol. Biol. Evol. 2007;24:1259–1268. doi: 10.1093/molbev/msm047. [DOI] [PubMed] [Google Scholar]
  • 4.Pizzo E., Varcamonti M., Maro A.D., Zanfardino A., Giancola C., D'Alessio G. Ribonucleases with angiogenic and bactericidal activities from the Atlantic salmon. FEBS J. 2008;275:1283–1295. doi: 10.1111/j.1742-4658.2008.06289.x. [DOI] [PubMed] [Google Scholar]
  • 5.Fett J.W., Strydom D.J., Lobb R.R., Alderman E.M., Bethune J.L., Riordan J.F., Vallee B.L. Isolation and characterization of angiogenin, an angiogenic protein from human carcinoma cells. Biochemistry. 1985;24:5480–5486. doi: 10.1021/bi00341a030. [DOI] [PubMed] [Google Scholar]
  • 6.Nitto T., Dyer K.D., Czapiga M., Rosenberg H.F. Evolution and function of leukocyte RNase A ribonucleases of the avian species, Gallus gallus. J. Biol. Chem. 2006;281:25622–25634. doi: 10.1074/jbc.M604313200. [DOI] [PubMed] [Google Scholar]
  • 7.Yang D., Chen Q., Rosenberg H.F., Rybak S.M., Newton D.L., Wang Z.Y. Human ribonuclease A superfamily members, eosinophil-derived neurotoxin and pancreatic ribonuclease, induce dendritic cell maturation and activation. J. Immunol. 2004;173:6134–6142. doi: 10.4049/jimmunol.173.10.6134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Subramanian V., Feng Y. A new role for angiogenin in neurite growth and pathfinding: implications for amyotrophic lateral sclerosis. Hum. Mol. Genet. 2007;16:1445–1453. doi: 10.1093/hmg/ddm095. [DOI] [PubMed] [Google Scholar]
  • 9.Subramanian V., Crabtree B., Acharya K.R. Human angiogenin is a neuroprotective factor and amyotrophic lateral sclerosis associated angiogenin variants affect neurite extension/pathfinding and survival of motor neurons. Hum. Mol. Genet. 2008;17:130–149. doi: 10.1093/hmg/ddm290. [DOI] [PubMed] [Google Scholar]
  • 10.Benito A., Ribó M., Vilanova M. On the track of antitumour ribonucleases. Mol. Biosyst. 2005;1:294–302. doi: 10.1039/b502847g. [DOI] [PubMed] [Google Scholar]
  • 11.Rosenberg H.F. RNase A ribonucleases and host defense: an evolving story. J. Leukoc. Biol. 2008;83:1079–1087. doi: 10.1189/jlb.1107725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Murzin A.G., Brenner S.E., Hubbard T., Chothia C. SCOP: A structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 1995;247:536–540. doi: 10.1006/jmbi.1995.0159. [DOI] [PubMed] [Google Scholar]
  • 13.Raines R.T. Ribonuclease A. Chem. Rev. 1998;98:1045–1065. doi: 10.1021/cr960427h. [DOI] [PubMed] [Google Scholar]
  • 14.D'Alessio G. Evolution of oligomeric proteins. The unusual case of a dimeric ribonuclease. Eur. J. Biochem. 1999;266:699–708. doi: 10.1046/j.1432-1327.1999.00912.x. [DOI] [PubMed] [Google Scholar]
  • 15.Cho S., Beintema J.J., Zhang J. The ribonuclease A superfamily of mammals and birds: identifying new members and tracing evolutionary histories. Genomics. 2005;85:208–220. doi: 10.1016/j.ygeno.2004.10.008. [DOI] [PubMed] [Google Scholar]
  • 16.Guryev V., Koudijs M.J., Berezikov E., Johnson S.L., Plasterk R.H.A., van Eeden F.J.M., Cuppen E. Genetic variation in the zebrafish. Genome Res. 2006;16:491–497. doi: 10.1101/gr.4791006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Pizzo E., D'Alessio G. The success of the RNase scaffold in the advance of biosciences and in evolution. Gene. 2007;406:8–12. doi: 10.1016/j.gene.2007.05.006. [DOI] [PubMed] [Google Scholar]
  • 18.Trevarrow B., Robison B. Genetic backgrounds, standard lines, and husbandry of zebrafish. Methods Cell Biol. 2004;77:599–616. doi: 10.1016/s0091-679x(04)77032-6. [DOI] [PubMed] [Google Scholar]
  • 19.Clark M.D., Hennig S., Herwig R., Clifton S.W., Marra M.A., Lehrach H. An oligonucleotide fingerprint normalized and expressed sequence tag characterized zebrafish cDNA library. Genome Res. 2001;11:1594–1602. doi: 10.1101/gr.186901. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Shapiro R., Harper J.W., Fox E.A., Jansen H.W., Hein F., Uhlmann E. Expression of Met-(–1) angiogenin in Escherichia coli: conversion to the authentic < Glu-1 protein. Anal. Biochem. 1988;175:450–461. doi: 10.1016/0003-2697(88)90569-6. [DOI] [PubMed] [Google Scholar]
  • 21.Leonidas D.D., Shapiro R., Allen S.C., Subbarao G.V., Veluraja K., Acharya K.R. Refined crystal structures of native human angiogenin and two active site variants: implications for the unique functional properties of an enzyme involved in neovascularisation during tumour growth. J. Mol. Biol. 1999;285:1209–1233. doi: 10.1006/jmbi.1998.2378. [DOI] [PubMed] [Google Scholar]
  • 22.Zegers I., Maes D., Dao-Thi M.-H., Poortmans F., Palmer R., Wyns L. The structures of RNase A complexed with 3′-CMP and d(CpA): active site conformation and conserved water molecules. Protein Sci. 1994;3:2322–2339. doi: 10.1002/pro.5560031217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Swaminathan G.J., Holloway D.E., Veluraja K., Acharya K.R. Atomic resolution (0.98 Å) structure of eosinophil-derived neurotoxin. Biochemistry. 2002;41:3341–3352. doi: 10.1021/bi015911f. [DOI] [PubMed] [Google Scholar]
  • 24.Mohan C.G., Boix E., Evans H.R., Nikolovski Z., Nogués M.V., Cuchillo C.M., Acharya K.R. The crystal structure of eosinophil cationic protein in complex with 2′,5′-ADP at 2.0 Å resolution reveals the details of the ribonucleolytic active site. Biochemistry. 2002;41:12100–12106. doi: 10.1021/bi0264521. [DOI] [PubMed] [Google Scholar]
  • 25.Huang Y.-C., Lin Y.-M., Chang T.-W., Wu S.-J., Lee Y.-S., Chang M.D.-T. The flexible and clustered lysine residues of human ribonuclease 7 are critical for membrane permeability and antimicrobial activity. J. Biol. Chem. 2007;282:4626–4633. doi: 10.1074/jbc.M607321200. [DOI] [PubMed] [Google Scholar]
  • 26.Mosimann S.C., Ardelt W., James M.N.G. Refined 1.7 Å X-ray crystallographic structure of P-30 protein, an amphibian ribonuclease with anti-tumor activity. J. Mol. Biol. 1994;236:1141–1153. doi: 10.1016/0022-2836(94)90017-5. [DOI] [PubMed] [Google Scholar]
  • 27.Leu Y.-J., Chern S.-S., Wang S.-C., Hsiao Y.-Y., Amiraslanov I., Liaw Y.-C., Liao Y.-D. Residues involved in the catalysis, base specificity, and cytotoxicity of ribonuclease from Rana catesbeiana based upon mutagenesis and X-ray crystallography. J. Biol. Chem. 2003;278:7300–7309. doi: 10.1074/jbc.M206701200. [DOI] [PubMed] [Google Scholar]
  • 28.Singh U.P., Ardelt W., Saxena S.K., Holloway D.E., Vidunas E., Lee H.-S. Enzymatic and structural characterisation of Amphinase, a novel cytotoxic ribonuclease from Rana pipiens oocytes. J. Mol. Biol. 2007;371:93–111. doi: 10.1016/j.jmb.2007.04.071. [DOI] [PubMed] [Google Scholar]
  • 29.Acharya K.R., Shapiro R., Allen S.C., Riordan J.F., Vallee B.L. Crystal structure of human angiogenin reveals the structural basis for its functional divergence from ribonuclease. Proc. Natl Acad. Sci. USA. 1994;91:2915–2919. doi: 10.1073/pnas.91.8.2915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Crabtree B., Holloway D.E., Baker M.D., Acharya K.R., Subramanian V. Biological and structural features of murine angiogenin-4, an angiogenic protein. Biochemistry. 2007;46:2431–2443. doi: 10.1021/bi062158n. [DOI] [PubMed] [Google Scholar]
  • 31.Panchenko A.R., Madej T. Analysis of protein homology by assessing the (dis)similarity in protein loop regions. Proteins: Struct. Funct. Bioinformatics. 2004;57:539–547. doi: 10.1002/prot.20237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Parés X., Nogués M.V., de Llorens R., Cuchillo C.M. Structure and function of ribonuclease A binding subsites. Essays Biochem. 1991;26:89–103. [PubMed] [Google Scholar]
  • 33.Borkakoti N., Moss D.M., Palmer R.A. Ribonuclease-A: least-squares refinement of the structure at 1.45 Å resolution. Acta Crystallogr. B. 1982;38:2210–2217. [Google Scholar]
  • 34.Schultz L.W., Quirk D.J., Raines R.T. His···Asp catalytic dyad of ribonuclease A: structure and function of the wild-type, D121N, and D121A enzymes. Biochemistry. 1998;38:8886–8898. doi: 10.1021/bi972766q. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Richards F.M., Wyckoff H.W., Allewell N. The origin of specificity in binding: a detailed example in a protein–nucleic acid interaction. In: Schmitt F.O., editor. The Neurosciences: Second Study Program. Rockefeller University Press; New York: 1970. pp. 901–912. [Google Scholar]
  • 36.Vitagliano L., Adinolfi S., Riccio A., Sica F., Zagari A., Mazzarella L. Binding of a substrate analog to a domain swapping protein: X-ray structure of the complex of bovine seminal ribonuclease with uridylyl(2′,5′)adenosine. Protein Sci. 1998;7:1691–1699. doi: 10.1002/pro.5560070804. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Terzyan S.S., Peracaula R., de Llorens R., Tsushima Y., Yamada H., Seno M. The three-dimensional structure of human RNase 4, unliganded and complexed with d(Up), reveals the basis for its uridine specificity. J. Mol. Biol. 1999;285:205–214. doi: 10.1006/jmbi.1998.2288. [DOI] [PubMed] [Google Scholar]
  • 38.Lee J.E., Bae E., Bingham C.A., Phillips G.N., Jr, Raines R.T. Structural basis for catalysis by Onconase. J. Mol. Biol. 2008;375:165–177. doi: 10.1016/j.jmb.2007.09.089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Federov A.A., Joseph-McCarthy D.E., Fedorov D., Sirakova E., Graf I., Almo S.C. Ionic interactions in crystalline bovine pancreatic ribonuclease A. Biochemistry. 1996;35:15962–15979. doi: 10.1021/bi961533g. [DOI] [PubMed] [Google Scholar]
  • 40.Hofsteenge J., Moldov C., Vicentini A.M., Zelenko O., Jarai-Kote Z., Neumann U. A single amino acid substitution changes ribonuclease 4 from a uridine-specific to a cytidine-specific enzyme. Biochemistry. 1998;37:9250–9257. doi: 10.1021/bi9803832. [DOI] [PubMed] [Google Scholar]
  • 41.Shapiro R. Structural features that determine the enzymatic potency and specificity of human angiogenin: threonine-80 and residues 58–70 and 116–123. Biochemistry. 1998;37:6847–6856. doi: 10.1021/bi9800146. [DOI] [PubMed] [Google Scholar]
  • 42.Holloway D.E., Chavali G.B., Hares M.C., Baker M.D., Subbarao G.V., Shapiro R., Acharya K.R. Crystallographic studies on structural features that determine the enzymatic specificity and potency of human angiogenin: Thr44, Thr80, and residues 38–41. Biochemistry. 2004;43:1230–1241. doi: 10.1021/bi035654+. [DOI] [PubMed] [Google Scholar]
  • 43.Hodges R.S., Merrifield R.B. The role of serine-123 in the activity and specificity of ribonuclease. Reactivation of ribonuclease 1–118 by the synthetic COOH-terminal tetradecapeptide, ribonuclease 111–124, and its O-methylserine and alanine analogs. J. Biol. Chem. 1975;250:1231–1241. [PubMed] [Google Scholar]
  • 44.Tarragona-Fiol A., Eggelte H.J., Harbron S., Sanchez E., Taylorson C.J., Ward J.M., Rabin B.R. Identification by site-directed mutagenesis of amino acids in the B2 subsite of bovine pancreatic ribonuclease A. Protein Eng. 1993;6:901–906. doi: 10.1093/protein/6.8.901. [DOI] [PubMed] [Google Scholar]
  • 45.Larson S.B., Day J.S., Cudney R., McPherson A. A new crystal form of bovine pancreatic RNase A in complex with 2′-deoxyguanosine-5′-monophosphate. Acta Crystallogr. F. 2007;63:728–733. doi: 10.1107/S1744309107039565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.The MGC Project Team The status, quality, and expansion of the NIH Full-Length cDNA Project: The Mammalian Gene Collection (MGC) Genome Res. 2004;14:2121–2127. doi: 10.1101/gr.2596504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Panchenko A.R., Madej T. Structural similarity of loops in protein families: toward the understanding of protein evolution. BMC Evol. Biol. 2005;5:10. doi: 10.1186/1471-2148-5-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Pascarella W., Argos P. Analysis of insertions/deletions in protein structures. J. Mol. Biol. 1992;224:461–471. doi: 10.1016/0022-2836(92)91008-d. [DOI] [PubMed] [Google Scholar]
  • 49.Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985;39:783–791. doi: 10.1111/j.1558-5646.1985.tb00420.x. [DOI] [PubMed] [Google Scholar]
  • 50.Sanjuán R., Wróbel B. Weighted least-squares likelihood ratio test for branch testing in phylogenies reconstructed from distance measures. Syst. Biol. 2005;54:218–229. doi: 10.1080/10635150590923308. [DOI] [PubMed] [Google Scholar]
  • 51.Katekaew S., Torikata T., Araki T. The complete amino acid sequence of green turtle (Chelonia mydas) egg white ribonuclease. Protein J. 2006;25:316–327. doi: 10.1007/s10930-006-9017-4. [DOI] [PubMed] [Google Scholar]
  • 52.Leonidas D.D., Shapiro R., Subbarao G.V., Russo A., Acharya K.R. Crystallographic studies on the role of the C-terminal segment of human angiogenin in defining enzymatic potency. Biochemistry. 2002;41:2552–2562. doi: 10.1021/bi015768q. [DOI] [PubMed] [Google Scholar]
  • 53.Harper J.W., Vallee B.L. A covalent angiogenin/ribonuclease hybrid with a fourth disulfide bond generated by regional mutagenesis. Biochemistry. 1989;28:1875–1884. doi: 10.1021/bi00430a067. [DOI] [PubMed] [Google Scholar]
  • 54.Holloway D.E., Shapiro R., Hares M.C., Leonidas D.D., Acharya K.R. Guest–host crosstalk in an angiogenin–RNase A chimeric protein. Biochemistry. 2002;41:10482–10489. doi: 10.1021/bi026151r. [DOI] [PubMed] [Google Scholar]
  • 55.Lee J.E., Raines R.T. Contribution of active-site residues to the function of Onconase, a ribonuclease with antitumoral activity. Biochemistry. 2003;42:11443–11450. doi: 10.1021/bi035147s. [DOI] [PubMed] [Google Scholar]
  • 56.Sprague J., Bayraktaroglu L., Clements D., Conlin T., Fashena D., Frazer K. The Zebrafish Information Network: the zebrafish model organism database. Nucleic Acids Res. 2006;34:D581–D585. doi: 10.1093/nar/gkj086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Gertz E.M., Yu Y.-K., Agarwala R., Schäffer A.A., Altschul S.F. Composition-based statistics and translated nucleotide searches: improving the TBLASTN module of BLAST. BMC Biol. 2006;4:41. doi: 10.1186/1741-7007-4-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Lennon G., Auffray C., Polymeropoulos M., Soares M.B. The I.M.A.G.E. Consortium: an integrated molecular analysis of genomes and their expression. Genomics. 1996;33:151–152. doi: 10.1006/geno.1996.0177. [DOI] [PubMed] [Google Scholar]
  • 59.Cariaso M., Folta P., Wagner M., Kuczmarski T., Lennon G. IMAGEne I: clustering and ranking of I.M.A.G.E. cDNA clones corresponding to known genes. Bioinformatics. 1999;15:965–973. doi: 10.1093/bioinformatics/15.12.965. [DOI] [PubMed] [Google Scholar]
  • 60.Chenna R., Sugawara H., Koike T., Lopez R., Gibson T.J., Higgins D.G., Thompson J.D. Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res. 2003;31:3497–3500. doi: 10.1093/nar/gkg500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Bendtsen J.D., Nielsen H., von Heijne G., Brunak S. Improved prediction of signal peptides: SignalP 3.0. J. Mol. Biol. 2004;340:783–795. doi: 10.1016/j.jmb.2004.05.028. [DOI] [PubMed] [Google Scholar]
  • 62.Alzari P.M., Berglund H., Berrow N.S., Blagova E., Busso D., Cambillau C. Implementation of semi-automated cloning and prokaryotic expression screening: the impact of SPINE. Acta Crystallogr. D. 2006;62:1103–1113. doi: 10.1107/S0907444906029775. [DOI] [PubMed] [Google Scholar]
  • 63.Holloway D.E., Hares M.C., Shapiro R., Subramanian V., Acharya K.R. High-level expression of three members of the murine angiogenin family in Escherichia coli and purification of the recombinant proteins. Protein Express. Purif. 2001;22:307–317. doi: 10.1006/prep.2001.1434. [DOI] [PubMed] [Google Scholar]
  • 64.Sambrook J., Fritsch E.F., Maniatis T. 2nd edit. Cold Spring Laborartory Press; Cold Spring Harbor, NY: 1989. Molecular Cloning: A Laboratory Manual. [Google Scholar]
  • 65.Pace N.C., Vajdos F., Fee L., Grimsley G., Gray T. How to measure and predict the molar absorption coefficient of a protein. Protein Sci. 1995;4:2411–2423. doi: 10.1002/pro.5560041120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Shapiro R., Weremowicz S., Riordan J.F., Vallee B.L. Ribonucleolytic activity of angiogenin: essential histidine, lysine, and arginine residues. Proc. Natl Acad. Sci. USA. 1987;84:8783–8787. doi: 10.1073/pnas.84.24.8783. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Russo N., Shapiro R., Acharya K.R., Riordan J.F., Vallee B.L. Role of glutamine-117 in the ribonucleolytic activity of human angiogenin. Proc. Natl Acad. Sci. USA. 1994;91:2920–2924. doi: 10.1073/pnas.91.8.2920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Otwinowski Z., Minor W. Processing of X-ray diffraction data collected in oscillation mode. Methods Enzymol. 1997;276:307–326. doi: 10.1016/S0076-6879(97)76066-X. [DOI] [PubMed] [Google Scholar]
  • 69.Bailey S. The CCP4 suite: programs for protein crystallography. Acta Crystallogr. D. 1994;50:760–763. doi: 10.1107/S0907444994003112. [DOI] [PubMed] [Google Scholar]
  • 70.French S., Wilson K. On the treatment of negative intensity observations. Acta Crystallogr. A. 1978;34:517–525. [Google Scholar]
  • 71.Matthews B.W. Solvent content of protein crystals. J. Mol. Biol. 1968;33:491–497. doi: 10.1016/0022-2836(68)90205-2. [DOI] [PubMed] [Google Scholar]
  • 72.Read R.J. Pushing the boundaries of molecular replacement with maximum likelihood. Acta Crystallogr. D. 2001;57:1373–1382. doi: 10.1107/s0907444901012471. [DOI] [PubMed] [Google Scholar]
  • 73.Murshudov G.N., Vagin A.A., Dodson E.J. Refinement of macromolecular structures by the maximum likelihood method. Acta Crystallogr. D. 1997;53:240–255. doi: 10.1107/S0907444996012255. [DOI] [PubMed] [Google Scholar]
  • 74.Brünger A.T. Free R value: a novel statistical quantity for assessing the accuracy of crystal structures. Nature. 1992;355:472–475. doi: 10.1038/355472a0. [DOI] [PubMed] [Google Scholar]
  • 75.Emsley P., Cowtan K. COOT: model-building tools for molecular graphics. Acta Crystallogr. D. 2004;60:2126–2132. doi: 10.1107/S0907444904019158. [DOI] [PubMed] [Google Scholar]
  • 76.Lamzin V.S., Wilson K.S. Automated refinement of protein models. Acta Crystallogr. D. 1993;49:129–147. doi: 10.1107/S0907444992008886. [DOI] [PubMed] [Google Scholar]
  • 77.Perrakis A., Sixma T.K., Wilson K.S., Lamzin V.S. wARP: improvement and extension of crystallographic phases by weighted averaging of multiple-refined dummy atomic models. Acta Crystallogr. D. 1997;53:448–455. doi: 10.1107/S0907444997005696. [DOI] [PubMed] [Google Scholar]
  • 78.Westbrook J., Feng Z., Burkhardt K., Berman H.M. Validation of protein structures for Protein Data Bank. Methods Enzymol. 2003;374:370–385. doi: 10.1016/S0076-6879(03)74017-8. [DOI] [PubMed] [Google Scholar]
  • 79.Rodriguez R., Chinea G., Lopez N., Pons T., Vriend G. Homology modeling, model and software evaluation: three related resources. Bioinformatics. 1998;14:523–528. doi: 10.1093/bioinformatics/14.6.523. [DOI] [PubMed] [Google Scholar]
  • 80.Krissinal E., Henrick K. Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr. D. 2004;60:2256–2268. doi: 10.1107/S0907444904026460. [DOI] [PubMed] [Google Scholar]
  • 81.Shindyalov I.N., Bourne P.E. Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 1998;11:739–747. doi: 10.1093/protein/11.9.739. [DOI] [PubMed] [Google Scholar]
  • 82.Guda C., Lu S., Scheeff E.D., Bourne P.E., Shindyalov I.N. CE-MC: a multiple protein structure alignment server. Nucleic Acids Res. 2004;32:W100–W103. doi: 10.1093/nar/gkh464. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Gascuel O. Concerning the NJ algorithm and its unweighted version, UNJ. In: Mirkin B., McMorris F., Roberts F., Rzetsky A., editors. Mathematical Hierarchies and Biology. American Mathematical Society; Providence RI: 1997. pp. 149–170. [Google Scholar]
  • 84.Makarenkov M. T-REX: reconstructing and visualizing phylogenetic trees and reticulation networks. Bioinformatics. 2001;17:664–668. doi: 10.1093/bioinformatics/17.7.664. [DOI] [PubMed] [Google Scholar]
  • 85.Donoghue P.C.J., Benton M.J. Rocks and clocks: calibrating the Tree of Life using fossils and molecules. Trends Ecol. Evol. 2007;22:424–431. doi: 10.1016/j.tree.2007.05.005. [DOI] [PubMed] [Google Scholar]
  • 86.Ladner J.E., Wladkowski B.D., Svensson L.A., Sjölin L., Gilliland G.L. X-ray structure of a ribonuclease A–uridine vanadate compex at 1.3 Å resolution. Acta Crystallogr. D. 1997;53:290–301. doi: 10.1107/S090744499601582X. [DOI] [PubMed] [Google Scholar]
  • 87.Mammalian Gene Collection (MGC) Program Team Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. Proc. Natl Acad. Sci. USA. 2002;99:16899–16903. doi: 10.1073/pnas.242603899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.McDonald I.K., Thornton J.M. Satisfying hydrogen bonding potential in proteins. J. Mol. Biol. 1994;238:777–793. doi: 10.1006/jmbi.1994.1334. [DOI] [PubMed] [Google Scholar]

RESOURCES