Statistical analysis of sizes and shapes of virus capsids and their resulting elastic properties

Anže Lošdorfer Božič; Antonio Šiber; Rudolf Podgornik

doi:10.1007/s10867-013-9302-3

. 2013 Mar 1;39(2):215–228. doi: 10.1007/s10867-013-9302-3

Statistical analysis of sizes and shapes of virus capsids and their resulting elastic properties

Anže Lošdorfer Božič ^1,^✉, Antonio Šiber ², Rudolf Podgornik ^1,³

PMCID: PMC3662414 PMID: 23860870

Abstract

From the analysis of sizes of approximately 130 small icosahedral viruses we find that there is a typical structural capsid protein, having a mean diameter of 5 nm and a mean thickness of 3 nm, with more than two thirds of the analyzed capsid proteins having thicknesses between 2 nm and 4 nm. To investigate whether, in addition to the fairly conserved geometry, capsid proteins show similarities in the way they interact with one another, we examined the shapes of the capsids in detail. We classified them numerically according to their similarity to sphere and icosahedron and an interpolating set of shapes in between, all of them obtained from the theory of elasticity of shells. In order to make a unique and straightforward connection between an idealized, numerically calculated shape of an elastic shell and a capsid, we devised a special shape fitting procedure, the outcome of which is the idealized elastic shape fitting the capsid best. Using such a procedure we performed statistical analysis of a series of virus shapes and we found similarities between the capsid elastic properties of even very different viruses. As we explain in the paper, there are both structural and functional reasons for the convergence of protein sizes and capsid elastic properties. Our work presents a specific quantitative scheme to estimate relatedness between different proteins based on the details of the (quaternary) shape they form (capsid). As such, it may provide an information complementary to the one obtained from the studies of other types of protein similarity, such as the overall composition of structural elements, topology of the folded protein backbone, and sequence similarity.

Keywords: Capsid, Virus, Geometry, Icosahedron, Faceting, Elasticity

Introduction

Viruses are the most abundant source of DNA and proteins in Earth’s oceans, which contain on the order of 10³⁰ virions [1]. Yet their status as living entities is often called into question as they do not conform to the “self-reproduction with variations” standard [2]. It appears that much of their features can be understood in terms of thermodynamic equilibrium physics [3–5], especially when they are “dormant”, i.e. outside the cells which they infect, where they in fact turn into little more than very complicated macromolecules with a “life cycle” [6]. The fact that many features of their life cycle can be understood within the equilibrium framework sets them apart from the rest of the living biological systems.

The lack of self-reproduction with variations that we associate with (present-day) life could also mean that viruses predate precellular life [7], which is partially corroborated by the fact that there seems to be no living system that is immune to viruses, including viruses themselves [8]. The long lasting debate about the position of viruses in the great divide between the living and nonliving has been only intensified in recent years with the discovery of viruses with huge genomes that encode proteins which allow for mechanisms that we do associate with life [9–11]. These gigantic viruses are more complex than some bacteria which even further obscures the question of their status and their origin.

The usual approach to trace the origins of a virus is to analyze the virus genome and compare it to other strains and viruses with the hope of uncovering evolutionary relatedness [12]. This can, however, be a futile endeavor as viruses mutate quickly, especially when their genome is RNA-based [13]. The phylogenetic approach thus often reduces to comparison of highly divergent sequences, which in effect prohibits the accurate extraction of the evolutionary information and the determination of the relatedness of different viruses. Viruses are thus typically classified only in terms of the type of genome that they contain (double-stranded (ds) or single-stranded (ss) DNA, dsRNA, plus- and minus-sense ssRNA, and reverse transcribing viruses) or whether they wrap themselves in a piece of cellular membrane or not (enveloped and non-enveloped viruses).

The virus phenotypes, on the other hand, are often strikingly similar. A most obvious similarity is between the protein shells (capsids) they form to protect and pack their genomes. A large number of viruses have an icosahedral capsid that consists of many copies of a single or a few very similar proteins arranged in a highly symmetrical manner that can be defined in precise mathematical terms.1 Thus, even for viruses which are highly diverged and apparently unrelated (e.g., some may infect plant and others may infect animal cells), there are conserved features in their phenotype, i.e., in the proteins. A viable mode of investigation of virus evolution and origin may thus be through the analysis of capsid protein structure and function, which appear to be fairly conserved, even when the sequences coding for the proteins are very divergent.

The icosahedral nature of the capsid requires proteins to assemble in precise relations to their neighbors, and disruption of some key aspects of protein structure, such as spatial distribution of hydrophobic/hydrophilic patches and/or charge, may completely block the assembly of a virus from its constituents. This is an evolutionary dead end for a virus, so there are obviously some physical constraints obeyed and encoded in the viral genome, which must be preserved in the evolution of a virus. The aim of this paper is to accentuate this information through the analysis of shape and size distribution of different icosahedral viruses.

As obvious as it may seem, this is by no means a straightforward and clearly defined task. After all, what does it actually mean to analyze the similarities between the shapes of different viruses and what sort of an information does one gain in the process? We shall seek the answer to this question in the nonlinear theory of elastic shells [15] by carefully comparing a large amount of different viral shape information to numerical predictions, and then extracting a sequence of (Föppl-von Kármán, FvK) numbers pertaining to the sequence of shapes. When dealing with real viruses, one is confronted with an experimentally determined structure that contains spatial coordinates of (ideally) all the atoms that compose the virus. This is a huge amount of data containing a detailed description of the virus surface. Some of these details are of course important for virus attachment (receptor geometry), but there are also some generalities that are expected to be a consequence of the elasticity of the protein shell. Our work presents the attempt to identify the features of the shape related to elastic properties of protein-protein interactions in the capsid, and to numerically quantify these properties.

Analysis

Structural dataset used

In our analysis we have used approximately 130 capsid entries deposited in VIPERdb [16], obtained from X-ray scattering experiments or cryo-electron microscopy. The capsid triangulation numbers range from T = 1 to T = p25.2 The number of viruses with triangulation number T > 1, used in determining the elastic parameters, is approximately 100, with 29 of them having a T-number greater than T = 3.

To the polyomaviruses and papillomaviruses we assign a triangulation number of T = 6 instead of the usually used T = 7 as they are composed of 360 copies of a protein, and should likely be considered as dodecahedral, not icosahedral structures [17]. We do this in order for the number of proteins in a capsid to have a clear correspondence with its triangulation number, enabling us a more consistent analysis.

Calculation of best-fitting prototype shape and effective FvK number of a capsid

We have approached the task of quantifying a capsid shape in the following way. The capsid is compared against a set of “prototype” shapes whose elastic properties are known (determined). These prototype shapes are introduced and described in Section 3.3. Of all the prototype shapes we need to pick one which represents the capsid structure, and thus its elasticity, the best. This task requires a definition of a “quality measure”, i.e., a quantitative parameter describing the quality of the match between the two shapes. The comparison is done by determining the root-mean-square deviation (RMSD) of the two shapes.

To calculate the RMSD, we first rotate the prototype shape so that the five-fold symmetry axes of both shapes coincide. For a rotated prototype shape given by the vertices v_i and the amino acid positions of the capsid r_j we calculate the RMSD as

where Inline graphic is the approximation for the closest point on the triangulated prototype surface to the amino acid position r_j.3 The closest point on the prototype surface to a given amino acid will lie on the radial vector connecting the origin and the amino acid, piercing the prototype surface. Since we do not have the exact surface shape but only its triangulated mesh [19], we take for the point Inline graphic the point where the radial vector pierces the tangential plane of the prototype vertex v_i nearest to the amino acid r_j (an illustration is shown in Fig. 1). This gives a good approximation to the true distance from the surface, provided the meshing is fine enough. We have also tried several different distance measures, and found they influence the values of RMSD (a worse approximation yielding bigger values), but not the position of the minimum FvK number for a given capsid. In order to compare the two shapes, the prototype shape and the experimental shell also have to be properly scaled. We thus let the mean radius of one shell to vary slightly during the minimization to allow for a better fit. To minimize the computational time we do the calculation for 1/60th of a virus, since the rest of the amino acid positions in the capsid can be generated by applying the rotational matrices of the icosahedral symmetry group, and thus do not influence the final result.

Fig. 1 — A sketch illustrating the pairing of an amino acid r_j to its closest point on the prototype surface . The closest vertex of the prototype v_i to the amino acid position is determined by the shortest distance from the radial line given by r_j, and the perpendicular distance to the amino acid position then gives the approximation for the deviation used in calculating the RMSD (1)

Results and discussion

Before proceeding to the shape analysis of a virus, it is important for our goal to perform a simple analysis of the average virus size. This information is pertinent to the analysis of elastic properties of the capsid, especially the thickness of proteins compared to the average capsid size, as will be shown further on in the paper.

Effective size of virus proteins

The sizes of viruses vary across a large interval of mean radii (~ 10–200 nm) and if there is something like a “typical” capsid (structural) protein, having some “typical” size, the larger viruses should contain more copies of it. This means that larger viruses should have a larger (triangulation) T-number,4 instead of being built of a smaller number of copies of a bigger protein and having a T-number comparable to smaller viruses. This proposition has been confirmed by Rossmann and Erickson [20] on a dataset of viruses much smaller than the one we use, so it is of interest to reexamine this issue and update the analysis. To this end, we plot the mean radii of the capsids versus their triangulation numbers in Fig. 2.

Fig. 2 — Calculated mean radii of the capsids versus their triangulation numbers. The dashed line shows the least squares fit of the data to the function , with a = 9.6±0.4 nm, and b = 0.41±0.02. The data consist of approximately 130 viruses, with circles denoting viruses with single-stranded genome, squares viruses with double-stranded genome, diamonds bacteriophages, and triangles T = p3 ssRNA viruses

If the idea of a viral protein of some prototypical size makes sense, then the average capsid radius should be distributed according to

which was obtained simply from equating the area of an approximately spherical virus, Inline graphic , with the total area of assembled capsid proteins, 60 TA_p, where A_p is the average area per protein. The fit of the form

that we performed on the data shows that the idea indeed makes sense, as b = 0.41±0.02 is quite close to the expected 0.5. The average area of a prototypical capsid protein is obtained from the coefficient a and is found to be Inline graphic . A similar result was obtained by Rossmann and Erickson [20]. They find that the ratio of the virus radius and a square root of its T-number is approximately conserved, nm, which is in very good agreement with our results.

Here we mention that the choice of the viruses included in the fit can have a certain effect on the end results, as already noted elsewhere [21]. In Table 1 we compile the results of the fit for various subsets of the viruses included in our analysis. If we perform the fit on viruses with ss genome or with ds genome only, we obtain similar results as when the entire database is included. Bacteriophages, on the other hand, do not conform to the relationship in (2) that well, and excluding them from the fit gives a slope of almost T^1/2.

Table 1.

Fitting coefficients for the dependence of capsid mean radius on its triangulation number, Inline graphic

Dataset	a [nm]	b [ ]
entire	9.6±0.4	0.41±0.02
ss only	9.1±0.6	0.43±0.06
ds only	10.1±0.7	0.45±0.03
phage only	11±1	0.33±0.05
ss and ds only	8.75±0.3	0.49±0.02

Open in a new tab

The results of the fit are shown for the entire dataset, and for subsets selected on the basis of the type of the viral genome contained in the capsid

We next investigate the average thickness of capsid proteins. In order to define the average capsid protein thickness we have analyzed the radial mass distribution of each capsid, which peaks around the capsid mean radius Inline graphic . We have defined the average capsid thickness δ as the full-width-at-half-maximum (FWHM) of this distribution (see [18] for details). Note that our procedure implicitly assumes that the capsids are fairly spherical (as they in fact are for the present purpose). The width may be artificially enlarged due to the nonsphericity of the capsid, but this effect yields corrections of the magnitude which cannot falsify the similarities of thicknesses among different capsid proteins that we obtain.

Our results for average capsid protein thicknesses are shown in Fig. 3. The fit of the form δ(T) = a×T^b gives a small positive exponent for T dependence with a = 2.9±0.2 nm and b = 0.12±0.05. We thus find that the thicknesses of capsid proteins are almost constant and typically in the range of 2–4 nm, with thicker proteins being relatively rare, and intriguingly almost equally distributed in viruses with ss and ds genomes.

Some of the thicker viruses fall into the T = 1 ds genome category, and a closer inspection suggests more pronounced protrusions around five-fold axes than in the T = 1 viruses with the ss genome, which might explain the increased thickness. Some of the other outliers are two caliciviruses, Sindbis virus, Semliki Forest virus, and Nudaurelia capensis ω virus; no obvious characteristic is shared among these viruses.

Taken together, our analysis shows that the concept of a typical capsid protein is indeed viable – the typical capsid protein is prism-like, about 3 nm thick and has an average diameter of 5 nm. The molecular weight pertaining to these protein dimensions can be estimated well by Inline graphic , and the average number of amino acid residues peaks at ≃ 200 residues, with an additional smaller peak at ≃ 390 residues (Fig. 4). Comparing this with the distribution of the most common sizes in a non-redundant representative set of protein chains (as given by, for example, PDBselect [22]) which has a peak at about ≃ 140 residues, one can observe that there are some viral coat proteins which are somewhat larger than the average. What interests us next are the elastic properties of the two-dimensional sheet made of such prototype proteins.

Fig. 4 — Probability distribution of the average number of residues in capsid proteins. The average number of amino acids is obtained by averaging over T-number of constituent proteins in a capsid. There is a prominent peak at the average number of residues around ≃ 200 residues, and a smaller one around the average number of ≃ 390 residues

Elasticity of infinitely thin Icosa(delta)hedral shells: Föppl-von Kármán number

The virus capsid is a shell – as we have shown above, it is a structure formed by a layer of protein material “wrapped” so as to make a closed structure of icosahedral symmetry. The fact that the symmetry of the shell is icosahedral restricts the possible shapes of the capsid. Depending on the elastic properties of the protein capsid, i.e., the energetics of inter-protein contacts, such a shell may be more sphere-like, or more icosahedron-like. The sphere and the icosahedron are in fact two limiting shapes of the infinitely thin, continuum elastic icosahedrally symmetric shells with all the other allowed cases lying somewhere in between the two extremes. This opens the possibility to study the design and conserved elastic features of the virus proteins by analyzing the details of the shape of the icosahedral shell (capsid) they form.

The details of the elastic theory of icosa(delta)hedral shells without the spontaneous curvature have been elaborated in [15] and [19]. We shall present here only the most important issues relevant to our aim. It has been shown that the shape of a continuum icosahedral shell depends on a single parameter. This parameter is termed the Föppl-von Kármán number (γ; FvK) in [15], and it depends on the elastic properties of (formally infinitely thin) two-dimensional sheets made of virus proteins. There are two elastic parameters characterizing the elastic response of such sheets: the two-dimensional Young’s modulus Y which specifies the response of the sheet to stretching, and the bending rigidity κ which quantifies the energetics of the sheet bending. The FvK number of a thin shell with mean radius Inline graphic is a particular dimensionless combination of the elastic parameters of the shell and its radius, defined as

Fitting the shapes: Determining effective FvK number of real viruses

We have approached the problem of extracting the elastic parameters of the proteins from the shape of the capsid by assigning an effective FvK number to a real capsid whose shape (i.e. the positions of atoms) has been determined experimentally. We have first generated sixty “prototype” shapes of different FvK numbers, i.e., the ideal shapes of the icosadeltahedral shells obtained for continuum shell material (formally infinite T-number; we have used T = 625 [19]). These shapes, 30 of which are shown in Fig. 5, were then compared to the real capsid and the one that fitted capsid the best was found. The FvK number of the best-fitting prototype shape was proclaimed to be the effective FvK number of the virus.

Fig. 5 — Prototype shapes. The FvK number increases from left to right and from top to bottom. The range of FvK numbers goes from γ = 10^− 1 (almost a sphere) to γ = 10⁷ (almost an icosahedron), each shape in the sequence having FvK number larger from the previous one by a factor of 1.77. The colors show the relative elastic energy contained in the parts of the shape surface, with yellow-colored regions having largest energy, magenta-colored ones having smallest energy, and blue-colored regions being in between (see [15] and [19] for details)

In order to apply this scheme, one first needs to define the procedure for comparing shapes. We did this by calculating the root-mean-square deviation (RMSD) between a capsid and all 60 of the prototype shapes (details of the procedure are given in Section 2.2). By doing this we obtained the RMSD(γ) dependence for our set of analyzed capsids. The minimum of this curve for each capsid is the effective FvK number associated with a given virus. An example curve is shown in Fig. 6 for the bacteriophage P22 procapsid (yielding a FvK number of γ = 406), and Fig. 7 shows a real virus shape of the bacteriophage PM2 and the corresponding best-fitting prototype with γ = 720. Details of the procedure are given in the Analysis section.

Fig. 6 — Example determination of the effective FvK number of a virus. The virus in question is bacteriophage P22 procapsid (PDB ID 2xyy), and the effective FvK number assigned to it is the FvK number of the prototype shape with the minimum RMSD compared to the capsid. The procedure, detailed in the text, yields γ = 406 in this case

Fig. 7 — Real capsid and the best-fitting prototype shape. Left: The experimentally determined shape of brome mosaic virus (PDB ID 1js9), an icosahedral virus with T = 3. The shape was constructed as a union of spheres of radius 8.4 Å, each one representing an amino acid of a virus. Right: The best-fit prototype shape with γ = 720

A variant of the procedure described was most likely used in the analysis of L-A and HK97 viruses in [15] (the authors do not specify the details). Here we have further developed the procedure into a consistent fitting scheme and applied it to a large set of virus shapes as a tool for determination of elastic properties of capsid proteins. It is of interest to note here that for L-A and HK97 viruses we obtain values similar to those calculated in [15] (718 and 1270 vs. 547 and 1480 obtained in [15]).

The above procedure yields only an estimate of the elastic parameters of the capsids for several reasons: (i) The overall shape of the capsid may sometimes be importantly defined by the protein features not related to protein-protein interactions and the shell elasticity. Some capsids for example, have spikes, troughs, and other peculiarities related to the special properties of the protein subunit. For such capsids, the procedure we propose may be less reliable. (ii) The theory was constructed for infinitely thin shells, so it should be applied with some reservation to viruses whose thickness is non-negligible with respect to their mean radius. (iii) The theory applies to sheets without spontaneous curvature. (iv) The theory applies to continuum shells, which means formally infinite T-numbers. (v) The capsid mean radii are functions of external thermodynamic parameters, especially of the pH value, and vary in quite different manners for different virus classes. (vi) The virus capsid is not really a closed shell but a shell with holes, where the holes are sometimes rather large; this can be the case either in the procapsid state or in the mature capsid.

The point (iv) is particularly important for small viruses (see [23] for a discussion of the buckling of small-T shells). To avoid false signals which may be obtained in the described way, we have decided to analyze only the capsids with T > 1. For these capsids, the results of the fitting procedure are shown in Fig. 8. For viruses with T = 2 and T = 3 a large spread of the ratio of the elastic parameters Y/κ is observed, ranging from 0.01 nm^− 2 to almost 100 nm^− 2.

Fig. 8 — Ratio of capsid elastic parameters Y/κ compared to the capsid mean radius. The values of Y/κ were obtained from best-fit FvK number, . The area between dashed lines shows the interval [0.1,2] nm^− 2, the values of the FvK numbers of the majority of T > 3 capsids. Symbols have the same meaning as in Fig. 2, with the exception of the T = p3 ssRNA viruses being shown in the same way as the rest of ssRNA viruses (circles). The size of the error bars depends on the sampling of FvK shapes – it is the distance of the minimally deviating FvK number to the nearest FvK shape. The points with large error bars correspond to cases where the minimum is very shallow, so that only the left side of the uncertainty interval can be determined on the basis of the fit

When we move to higher T-numbers – viruses with T > 3 – the data show a reasonably converged distribution of Y/κ values, with most of the values falling into the interval [0.1, 2] nm^− 2, indicated by the gray rectangle in Fig. 8. The notable exception though is the point located at mean radius of Inline graphic , having the value of the ratio of Y / κ = 9.1 nm^− 2; this point belongs to PM2 bacteriophage. PM2 bacteriophage is a special virus in the sense that below the outer protein shell it contains a mixed lipid-protein membrane (i.e., it has a proteinaceous lipid core) [24]. However, so does the PRD1 bacteriophage [25], whose Y / κ ratio is within the typical range of values (0.8–1.5 nm^− 2). The reason for the observed large Y / κ ratio in PM2 is thus most likely due to its very pronounced spike proteins around the five-fold axes which may influence our shape analysis and push its effective FvK number towards larger values; the PRD1 bacteriophage, on the other hand, has no such prominent features. These two cases again accentuate the care required in interpretation of our results.

The reason for the wide-spread range of the Y/κ values for T = 3 viruses and a much narrower range of this ratio for viruses with higher T-numbers might be explained, at least qualitatively and conceptually, by a transition between a mostly geometric domain and a continuum domain as proposed in [26]. In the former domain the geometry of the protein subunits influences the capsid morphology. After a certain capsid size or T-number the shapes are then largely described by continuum elasticity where FvK number is the relevant parameter. Due to our limited dataset it is difficult to say where exactly this transition occurs, but the Fig. 8 suggests that this happens somewhere around Inline graphic . Intriguingly, a very similar finding was reported recently in [27], where the authors report a strong correlation between γ and a “degree of buckling” in T = 7 capsids, and lack of such a correlation in T = 3 capsids.

Bending rigidity is related to the energy required to change the angle between two capsid proteins in flat contact, while the Young’s modulus measures how difficult it is to stretch the two capsid proteins in contact, keeping them flat. Having in mind all the intricacies of protein-protein interaction, it is not easy to see that the two quantities ought to be in any particular relation, so that they should be, in general, treated as completely independent parameters. The fact that the ratio of two independent elastic parameters falls in an interval only one order of magnitude wide points again to a possible conservation of this quantity.

The ratio Y/κ can be obtained as a function of thickness of the shell in the case of an isotropic elastic material, which the protein shell is obviously not, as the proteins preferably bind only in 2D. Nevertheless, if we insist on approximating the proteins as isotropically interacting particles, we have that Y / κ = 12 (1 − ν²) / δ², where δ is again the shell thickness and ν the Poisson ratio of the material. Interestingly, for δ~3 nm, and ν~0.3, which is typical for many materials, this gives Y / κ~1 nm^− 2, which fits nicely in the range we obtained (but also with the values found in [15] and [27]), suggesting that the elastic response is essentially fixed with conservation of δ (i.e. conservation of protein size we have already established). This is further corroborated by the examination of extremal values of δ and Y / κ in our set. Sindbis virus (PDB ID 1ld4) has the largest δ in our dataset (11.27 nm), but also one of the smallest ratios of Y / κ, (0.08 nm^− 2), consistent with a simple prediction Y / κ ∝ δ^− 2. However, when we constructed the quantity Yδ² / κ we found that its spread is not significantly smaller than the spread of Y / κ values, suggesting that this ratio contains more information than simply δ.

Summary and conclusions

Using statistical analysis we have shown that capsid proteins of all viruses are similarly sized, prism-like, about 3 nm thick, having an average diameter of 5 nm and an average molecular weight of Inline graphic . This is by no means a trivial finding. In the early days of virus structure research [28] it has been argued that viruses cannot code for large structural proteins as it would require long genomes, and the capsid thus must be assembled from many copies of a smaller protein. Although this sounds reasonable, one should not forget that the quantity of information that can be stored in the capsid scales as Inline graphic , presuming the genome is uniformly distributed within the capsid, which is the case at least for bacteriophages [29, 30]. Were all capsids made by assembling 60 protein units (T = 1) of similar thickness, the quantity of information required for coding the capsid proteins would then scale as Inline graphic , so the percentage of information required for structural proteins would be vanishingly small for sufficiently large viruses, i.e. sufficiently large genomes. This, seemingly obvious statement is perhaps even more strengthened by the fact that there are huge viruses where the spatial constraints do not seem to be the critical issue.5 Yet there are apparently no huge icosahedral viruses with small T-numbers, i.e., made of huge capsid proteins. The capsid protein size appears thus to be an evolutionarily conserved feature.

An important question is whether the elasticity of the capsid is a property which is under evolutionary pressure and thus makes a difference for the functioning of a virus. There is at least one type of virus for which we can be fairly certain that the answer is affirmative – the dsDNA bacteriophages [30]. The bacteriophages pack their dsDNA molecule in the preformed capsid tightly, building up an effective outward mechanical pressure on the capsid up to almost a hundred atmospheres. Such huge pressures would induce large displacements in elastically soft capsid material, resulting in eventual rupture of protein contacts formed by either through hydrophobic-van der Waals association or electrostatic complementarity. Thus, at least for these types of viruses the elastic properties of capsids should be under evolutionary pressure and should converge to some functional range.

This does not exclude the possibility of the conservation of elastic properties in other types of viruses, where possibly different types of elastic constraints might be at work. The situation of RNA virus capsids is in fact exactly reversed, as the (small) force of the genome on the capsid creates an effective inward mechanical pressure, being of electrostatic bridging origin [4]. The elastic constraints in this case would play a prominent role not so much in the context of structural rigidity of the capsid as in the whole self-assembly and maturation process.

Last but not least, in order to penetrate the cell viruses undergo often complicated and multi-stepped paths (i.e., receptor attachment, membrane wrapping, etc.) which include many interactions, eventually resulting in effective mechanical forces on the capsid. The architecture of the capsid and mechanical properties of its building blocks must be well suited to successfully complete this, maybe the most fragile, part of the viral life-cycle.

By combining the statistical analysis with the theory of elasticity we have analyzed the elastic properties of the virus capsids. Our results suggest a reasonable convergence of the elastic properties of the viruses we inspected. To fully evaluate the power of the methods we proposed to discriminate among different virus architectures and possibly even lineages, a significantly larger virus dataset would need to be analyzed which is at present not possible.

Acknowledgements

We thank Alberto Vianelli for informing us about [20]. We also thank the two anonymous reviewers for their suggestions on improving the manuscript. A.L.B. acknowledges the support from the Slovene Agency for Research and Development under the young researcher grant. A.Š. acknowledges support from the Ministry of Science, Education, and Sports of Republic of Croatia (Grant No. 035-0352828-2837). R.P. acknowledges the support from the Slovene Agency for Research and Development (Grant No. P1-0055 and J1-4297).

Footnotes

This is the basis of the Caspar-Klug classification scheme [14] that classifies different capsid icosahedra in terms of the triangulation number T, which is, roughly, a way to divide the icosahedron in similar, nearly equivalent parts that represent individual proteins.

The triangulation numbers denoted with p (pseudo) do not completely conform to the Caspar-Klug principle of quasi-equivalence since the basic unit is composed of different (but morphologically similar) proteins.

The positions of the amino acids are represented either by the positions of their centers-of-mass (see also [18]), or, where this data was unavailable, by the positions of their C_α atoms.

⁴

There are 60T proteins in a virus of a given T-number.

⁵

The analysis of available space and restricted length of the genome is somewhat different in ssRNA viruses, where the ssRNA molecule is held within the capsid mostly by electrostatic interactions with the capsid proteins. In this case, the quantity of information that can be stored in the capsid scales as Inline graphic [4], so that the production of large proteins always requires the same percentage of the ssRNA information, irrespectively of the radius of the virus.

References

1.Suttle C. Marine viruses – major players in the global ecosystem. Nat. Rev. Microbiol. 2007;5:801–812. doi: 10.1038/nrmicro1750. [DOI] [PubMed] [Google Scholar]
2.Trifonov EN. Definition of life: navigation through uncertainties. Biomol. J. Struct. Dyn. 2012;29:647–650. doi: 10.1080/073911012010525017. [DOI] [PubMed] [Google Scholar]
3.Roos WH, Bruinsma R, Wuite GJL. Physical virology. Nat. Phys. 2010;6:733–743. doi: 10.1038/nphys1797. [DOI] [Google Scholar]
4.Šiber A, Lošdorfer Božič A, Podgornik R. Energies and pressures in viruses: contribution of nonspecific electrostatic interactions. Phys. Chem. Chem. Phys. 2012;14:3746–3765. doi: 10.1039/c1cp22756d. [DOI] [PubMed] [Google Scholar]
5.Zandi R, Reguera D, Bruinsma RF, Gelbart WM, Rudnick J. Origin of icosahedral symmetry in viruses. Proc. Natl. Acad. Sci. USA. 2004;101:15556–15560. doi: 10.1073/pnas.0405844101. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Cello J, Paul A, Wimmer E. Chemical synthesis of poliovirus cDNA: Generation of infectious virus in the absence of natural template. Science. 2002;97:1016–1018. doi: 10.1126/science.1072266. [DOI] [PubMed] [Google Scholar]
7.Koonin EV, Martin W. On the origin of genomes and cells within inorganic compartments. Trends Genet. 2005;647–654:021910. doi: 10.1016/j.tig.2005.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Pearson H. Virophage suggests viruses are alive. Nature. 2008;454:677. doi: 10.1038/454677a. [DOI] [PubMed] [Google Scholar]
9.Suzan-Monti M, La Scola B, Raoult D. Genomic and evolutionary aspects of mimivirus. Virus Res. 2006;117:135–155. doi: 10.1016/j.virusres.2005.07.011. [DOI] [PubMed] [Google Scholar]
10.Claverie JM, Abergel C. Mimivirus: the emerging paradox of quasi-autonomous viruses. Trends Genet. 2010;26:431–437. doi: 10.1016/j.tig.2010.07.003. [DOI] [PubMed] [Google Scholar]
11.Arslan D, Legendre M, Seltzer V, Abergel C, Claverie JM. Distant mimivirus relative with a larger genome highlights the fundamental features of megaviridae. Proc. Natl. Acad. Sci. USA. 2011;108:17486. doi: 10.1073/pnas.1110889108. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Holmes, E.C.: The Evolution and Emergence of RNA Viruses. Oxford University Press (2009)
13.Koonin, E.V., Mushegian, A.R., Dolja, V.V.: Computer analysis of amino acid sequences. In: Foster, G.D., Taylor, S.C., Walker, J.M. (eds.) Plant Virology Protocols, vol. 81 of Methods in Molecular Biology, pp. 319–337, Humana Press (1998) [DOI] [PubMed]
14.Caspar DLD, Klug A. Physical principles in the construction of regular viruses. Cold Spring Harbor Symp. 1962;27:1–24. doi: 10.1101/SQB.1962.027.001.005. [DOI] [PubMed] [Google Scholar]
15.Lidmar J, Mirny L, Nelson DR. Virus shapes and buckling transitions in spherical shells. Phys. Rev. 2003;E68:051910. doi: 10.1103/PhysRevE.68.051910. [DOI] [PubMed] [Google Scholar]
16.Carrillo-Tripp M, Shepherd CM, Borelli IA, Venkataraman S, Lander G, Natarajan P, Johnson JE, Brooks CL, III, Reddy VS. Viperdb2: an enhanced and web api enabled relational database for structural virology. Nucleic Acids Res. 2009;37:436–442. doi: 10.1093/nar/gkn840. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Konevtsova OV, Rochal SB, Lorman VL. Chiral quasicrystalline order and dodecahedral geometry in exceptional families of viruses. Phys. Rev. Lett. 2012;108:038102. doi: 10.1103/PhysRevLett.108.038102. [DOI] [PubMed] [Google Scholar]
18.Lošdorfer Božič A, Šiber A, Podgornik R. How simple can a model of an empty viral capsid be? charge distributions in viral capsids. J. Biol. Phys. 2012;38:657–671. doi: 10.1007/s10867-012-9278-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Šiber A. Buckling transition in icosahedral shells subjected to volume conservation constraint and pressure: Relations to virus maturation. Phys. Rev. E. 2006;E73:061915. doi: 10.1103/PhysRevE.73.061915. [DOI] [PubMed] [Google Scholar]
20.Rossmann, M.G., Erickson, J.W.: Structure and assembly of icosahedral shells. In: Casjens, S. (ed.) Virus Structure and Assembly. Jones and Bartless Publishers (1985)
21.Ting CL, Wu J, Wang ZG. Thermodynamic basis for the genome to capsid charge relationship in electrostatically-driven viral encapsidation. Proc. Natl. Acad. Sci. USA. 2011;108:16986–16991. doi: 10.1073/pnas.1109307108. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Griep S, Hobohm U. PDBselect 1992–2009 and PDBfilter-select. Nucleic Acids Res. 2010;38:318–319. doi: 10.1093/nar/gkp786. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Aznar M, Luque A, Reguera D. Relevance of capsid structure in the buckling and maturation of spherical viruses. Phys. Biol. 2012;9:036003. doi: 10.1088/1478-3975/9/3/036003. [DOI] [PubMed] [Google Scholar]
24.Kivela HM, Kalkkinen N, Bamford DH. Bacteriophage PM2 has a protein capsid surrounding a spherical proteinaceous lipid core. Virology J. 2002;76:8169–8178. doi: 10.1128/JVI.76.16.8169-8178.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Benson SD, Bamford JKH, Bamford DH, Burnett RM. Viral evolution revealed by bacteriophage PRD1 and human adenovirus coat protein structures. Cell. 1999;98:825–833. doi: 10.1016/S0092-8674(00)81516-0. [DOI] [PubMed] [Google Scholar]
26.Mannige RV, Brooks CL., III Periodic table of virus capsids: implications for natural selection and design. PLoS ONE. 2010;5:9423. doi: 10.1371/journal.pone.0009423. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.May ER, Brooks C., III On the morphology of viral capsids: Elastic properties and buckling transitions. Phys. J. Chem. 2012;B116:8604–8609. doi: 10.1021/jp300005g. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Morgan GJ. Historical review: viruses, crystals and geodesic domes. Trends Biochem. Sci. 2003;28:86–90. doi: 10.1016/S0968-0004(02)00007-5. [DOI] [PubMed] [Google Scholar]
29.Catalano CE. In: Viral Genome Packaging Machines: Genetics, Structure, and Mechanism. Catalano CE, editor. Georgetown, TX: Landes Bioscience; 2005. pp. 1–4. [Google Scholar]
30.Gelbart WM, Knobler CM. The physics of phages. Phys. Today. 2008;61:42–47. doi: 10.1063/1.2835152. [DOI] [Google Scholar]

[CR1] 1.Suttle C. Marine viruses – major players in the global ecosystem. Nat. Rev. Microbiol. 2007;5:801–812. doi: 10.1038/nrmicro1750. [DOI] [PubMed] [Google Scholar]

[CR2] 2.Trifonov EN. Definition of life: navigation through uncertainties. Biomol. J. Struct. Dyn. 2012;29:647–650. doi: 10.1080/073911012010525017. [DOI] [PubMed] [Google Scholar]

[CR3] 3.Roos WH, Bruinsma R, Wuite GJL. Physical virology. Nat. Phys. 2010;6:733–743. doi: 10.1038/nphys1797. [DOI] [Google Scholar]

[CR4] 4.Šiber A, Lošdorfer Božič A, Podgornik R. Energies and pressures in viruses: contribution of nonspecific electrostatic interactions. Phys. Chem. Chem. Phys. 2012;14:3746–3765. doi: 10.1039/c1cp22756d. [DOI] [PubMed] [Google Scholar]

[CR5] 5.Zandi R, Reguera D, Bruinsma RF, Gelbart WM, Rudnick J. Origin of icosahedral symmetry in viruses. Proc. Natl. Acad. Sci. USA. 2004;101:15556–15560. doi: 10.1073/pnas.0405844101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Cello J, Paul A, Wimmer E. Chemical synthesis of poliovirus cDNA: Generation of infectious virus in the absence of natural template. Science. 2002;97:1016–1018. doi: 10.1126/science.1072266. [DOI] [PubMed] [Google Scholar]

[CR7] 7.Koonin EV, Martin W. On the origin of genomes and cells within inorganic compartments. Trends Genet. 2005;647–654:021910. doi: 10.1016/j.tig.2005.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Pearson H. Virophage suggests viruses are alive. Nature. 2008;454:677. doi: 10.1038/454677a. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Suzan-Monti M, La Scola B, Raoult D. Genomic and evolutionary aspects of mimivirus. Virus Res. 2006;117:135–155. doi: 10.1016/j.virusres.2005.07.011. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Claverie JM, Abergel C. Mimivirus: the emerging paradox of quasi-autonomous viruses. Trends Genet. 2010;26:431–437. doi: 10.1016/j.tig.2010.07.003. [DOI] [PubMed] [Google Scholar]

[CR11] 11.Arslan D, Legendre M, Seltzer V, Abergel C, Claverie JM. Distant mimivirus relative with a larger genome highlights the fundamental features of megaviridae. Proc. Natl. Acad. Sci. USA. 2011;108:17486. doi: 10.1073/pnas.1110889108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Holmes, E.C.: The Evolution and Emergence of RNA Viruses. Oxford University Press (2009)

[CR13] 13.Koonin, E.V., Mushegian, A.R., Dolja, V.V.: Computer analysis of amino acid sequences. In: Foster, G.D., Taylor, S.C., Walker, J.M. (eds.) Plant Virology Protocols, vol. 81 of Methods in Molecular Biology, pp. 319–337, Humana Press (1998) [DOI] [PubMed]

[CR14] 14.Caspar DLD, Klug A. Physical principles in the construction of regular viruses. Cold Spring Harbor Symp. 1962;27:1–24. doi: 10.1101/SQB.1962.027.001.005. [DOI] [PubMed] [Google Scholar]

[CR15] 15.Lidmar J, Mirny L, Nelson DR. Virus shapes and buckling transitions in spherical shells. Phys. Rev. 2003;E68:051910. doi: 10.1103/PhysRevE.68.051910. [DOI] [PubMed] [Google Scholar]

[CR16] 16.Carrillo-Tripp M, Shepherd CM, Borelli IA, Venkataraman S, Lander G, Natarajan P, Johnson JE, Brooks CL, III, Reddy VS. Viperdb2: an enhanced and web api enabled relational database for structural virology. Nucleic Acids Res. 2009;37:436–442. doi: 10.1093/nar/gkn840. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Konevtsova OV, Rochal SB, Lorman VL. Chiral quasicrystalline order and dodecahedral geometry in exceptional families of viruses. Phys. Rev. Lett. 2012;108:038102. doi: 10.1103/PhysRevLett.108.038102. [DOI] [PubMed] [Google Scholar]

[CR18] 18.Lošdorfer Božič A, Šiber A, Podgornik R. How simple can a model of an empty viral capsid be? charge distributions in viral capsids. J. Biol. Phys. 2012;38:657–671. doi: 10.1007/s10867-012-9278-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Šiber A. Buckling transition in icosahedral shells subjected to volume conservation constraint and pressure: Relations to virus maturation. Phys. Rev. E. 2006;E73:061915. doi: 10.1103/PhysRevE.73.061915. [DOI] [PubMed] [Google Scholar]

[CR20] 20.Rossmann, M.G., Erickson, J.W.: Structure and assembly of icosahedral shells. In: Casjens, S. (ed.) Virus Structure and Assembly. Jones and Bartless Publishers (1985)

[CR21] 21.Ting CL, Wu J, Wang ZG. Thermodynamic basis for the genome to capsid charge relationship in electrostatically-driven viral encapsidation. Proc. Natl. Acad. Sci. USA. 2011;108:16986–16991. doi: 10.1073/pnas.1109307108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Griep S, Hobohm U. PDBselect 1992–2009 and PDBfilter-select. Nucleic Acids Res. 2010;38:318–319. doi: 10.1093/nar/gkp786. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Aznar M, Luque A, Reguera D. Relevance of capsid structure in the buckling and maturation of spherical viruses. Phys. Biol. 2012;9:036003. doi: 10.1088/1478-3975/9/3/036003. [DOI] [PubMed] [Google Scholar]

[CR24] 24.Kivela HM, Kalkkinen N, Bamford DH. Bacteriophage PM2 has a protein capsid surrounding a spherical proteinaceous lipid core. Virology J. 2002;76:8169–8178. doi: 10.1128/JVI.76.16.8169-8178.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Benson SD, Bamford JKH, Bamford DH, Burnett RM. Viral evolution revealed by bacteriophage PRD1 and human adenovirus coat protein structures. Cell. 1999;98:825–833. doi: 10.1016/S0092-8674(00)81516-0. [DOI] [PubMed] [Google Scholar]

[CR26] 26.Mannige RV, Brooks CL., III Periodic table of virus capsids: implications for natural selection and design. PLoS ONE. 2010;5:9423. doi: 10.1371/journal.pone.0009423. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.May ER, Brooks C., III On the morphology of viral capsids: Elastic properties and buckling transitions. Phys. J. Chem. 2012;B116:8604–8609. doi: 10.1021/jp300005g. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Morgan GJ. Historical review: viruses, crystals and geodesic domes. Trends Biochem. Sci. 2003;28:86–90. doi: 10.1016/S0968-0004(02)00007-5. [DOI] [PubMed] [Google Scholar]

[CR29] 29.Catalano CE. In: Viral Genome Packaging Machines: Genetics, Structure, and Mechanism. Catalano CE, editor. Georgetown, TX: Landes Bioscience; 2005. pp. 1–4. [Google Scholar]

[CR30] 30.Gelbart WM, Knobler CM. The physics of phages. Phys. Today. 2008;61:42–47. doi: 10.1063/1.2835152. [DOI] [Google Scholar]

PERMALINK

Statistical analysis of sizes and shapes of virus capsids and their resulting elastic properties

Anže Lošdorfer Božič

Antonio Šiber

Rudolf Podgornik

Abstract

Introduction