Abstract
High-speed atomic force microscopy (HS-AFM) experiments allow direct observation of biomolecular dynamics at the single-molecule level, acquiring a large amount of topographic imaging data that visualizes changes in the molecular surface during functional activity over an extended period of time. Since images have no atomistic resolution, a major challenge has been to develop post-experimental computational methods to infer atomistic information from measurements. The recently developed NMFF-AFM flexible fitting method provides a computationally efficient approach promising to infer atomistic-precision models of conformational dynamics from resolution-limited AFM imaging data. We report the software integration of this method into the well-established BioAFMviewer platform and demonstrate its first applications to experimental HS-AFM imaging data. To facilitate applications, we developed a direct workflow from raw experimental AFM data to the visualization and analysis of fitting results. The presented applications to experimental data of a single protein domain, a protein complex, and a megadalton-sized protein filament demonstrate the versatility of NMFF-AFM modeling to reproduce large-amplitude conformational motions of biomolecular dynamics from HS-AFM imaging. As a first step toward automated large-scale analysis of AFM imaging data, we furthermore demonstrate reconstruction of an atomistic molecular movie of protein dynamics, involving large-amplitude conformational transitions, from a measured HS-AFM movie sequence. Implementation of flexible fitting within the stand-alone user-friendly interactive BioAFMviewer software provides the opportunity for a broad range of applications to facilitate the understanding of resolution-limited HS-AFM measurements.
Keywords: high-speed atomic force microscopy, protein dynamics, AFM flexible fitting, 3D atomistic reconstruction, AFM software


Despite the recent revolution in structural biology techniques , and the use of artificial intelligence for solving the protein structure prediction problems, , revealing the dynamical mechanisms by which proteins implement their biological function remains an outstanding challenge. Well-established single-molecule experimental methods such as fluorescence spectroscopy can make predictions about the dynamical behavior based on optical measurements, but individual molecules themselves are invisible. High-speed atomic force microscopy (HS-AFM) is the only experimental technique to directly observe single proteins during their functional activity under near-physiological conditions. By rapid scanning of a protein placed on a solid substrate with a probing tip, conformational dynamics are monitored as consecutive topographic images visualizing changes in the molecular surface. Directly filming the molecular dynamics of individual proteins, their interactions with other biomolecules, and their functional assemblies has made important contributions to the understanding of biological processes with interdisciplinary applications in the field of Life Science.
HS-AFM can image proteins during their operation over a long time, acquiring a large amount of imaging data, but intrinsic experimental limitations prevent a detailed understanding of the observed dynamics. Visualized are only motions of the biomolecular surface by using a scanning tip that is typically larger by 1 order of magnitude compared to atomic size, preventing to resolve details at the sub-nanometer range. Although recent technological improvements to speed up scanning, , approaching the time resolution of 10 ms, are promising to expand the range of observable biomolecular dynamics, AFM measurements alone will inevitably provide insufficient information for detailed understanding of biomolecular function.
To fully exploit the explanatory power of HS-AFM measurements, computational methods to reconstruct three-dimensional structures and conformational dynamics with atomistic-resolution from post-experimental analysis of imaging data have been developed in the past years (see ref ). Such approaches are based on integrating the vast amount of available high-resolution structural data of proteins and molecular modeling methods with HS-AFM data. Previously developed rigid-body fitting, , correlating known 3D atomistic static structural data with measured topographies, considerably restricts the interpretation of dynamic topography imaging data. On the other side, flexible fitting methods considering conformational changes of structural data have been developed to provide feasible atomistic models of dynamic conformations from imaging data. − A major drawback in current methods is that dynamically steering an input structure into fitting a target AFM image relies on an exhaustive sampling of conformations. Hence, the massive computational expense required to conduct flexible fitting has so far prevented applications for the analysis of AFM measurement data.
Recently, two separate studies have proposed highly accelerated flexible fitting methods, NMFF-AFM (normal-mode flexible fitting AFM) and AFMfit. For the computational sampling of biomolecular dynamics to identify an atomistic structure that matches an experimental image, these methods focus on the reduced conformational subspace derived from normal-mode calculations of an elastic-network presentation of a known biomolecular atomistic structure. Normal-mode analysis (NMA) has been demonstrated to be powerful to reproduce functionally important molecular motions including large-amplitude conformational transitions in proteins, − avoiding the heavy computations inherent to molecular dynamics simulations. The NMA approach was previously employed in flexible fitting applications to electron microscopy maps (NMFF tool , and hybrid electron microscopy normal-mode analysis HEMNMA) to extract biomolecular dynamics from measurement data.
NMA-based modeling is particularly suitable for the analysis of AFM imaging since visualized global topography changes shall correspond to conformational changes characterized by a small number of low-frequency normal modes. While applications of AFMfit are missing to resolve motions that are significantly different from the initial structure, the NMFF-AFM method has demonstrated the potential of flexible fitting to reproduce atomistic-precision models of large-amplitude conformational transitions observed under HS-AFM microscopy.
Here, we report the software implementation of the NMFF-AFM method into the well-established stand-alone BioAFMviewer platform, allowing for convenient applications to experimental imaging data within a user-friendly interactive graphical interface with no dependencies on other software. While in the original publication the model performance was verified in control applications of synthetic AFM data, we now present the developed software-integrated workflow of flexible fitting and demonstrate several applications to HS-AFM imaging data.
As we show, flexible fitting achieves prediction for atomistic models of large-amplitude conformational motions from HS-AFM images with remarkable computational efficiency, requiring as few as tens of seconds of calculation time for the presented applications to a single protein domain and a protein complex. This significant advancement of computational modeling opens the opportunity for automatized large-scale analysis of AFM imaging data, also including the interpretation of measurements for large-size protein aggregates such as filaments.
To outline such prospects, we demonstrate the application of the NMFF-AFM method to reconstruct a molecular movie of atomistic-precision models of protein dynamics involving functional conformational transitions from an HS-AFM movie. The computational efficiency is furthermore highlighted in the application of flexible fitting for a megadalton-sized actin filament to provide an atomistic model from topographic imaging, a task which would require surreal computational expense if attempted by fitting using molecular dynamics simulations.
Integrating flexible fitting into a user-friendly software workflow, from raw experimental AFM imaging data to visualization and analysis, is expected to enable a broad range of applications by the Bio-AFM community. This will facilitate a detailed understanding of molecular processes on the nanoscale from topographic AFM measurements.
Results
Software Integration of Flexible Fitting
We first summarize the concept of the NMFF-AFM computational method and refer to the original publication for a detailed description. This method relies on an iterative modeling scheme to gradually deform an atomistic biomolecular structure toward a dynamic conformation that fits into a target experimental AFM topographic image. The deformations are generated by computing conformational motions arising from the normal-mode analysis of an elastic network representation of the biomolecular structure.
Starting with an initial structure, known either from previous experimental data (Protein Data Bank structures) or AlphaFold predictions, the iteration scheme performs, in each step, first normal-mode analysis to generate a set of slightly deformed structures representing motions of individual modes and amplitudes. Next, AFM imaging simulation to translate this set of atomistic conformations into pseudo-AFM topographic images allows comparison with the target AFM image to select a single normal-mode motion and its direction (sign of the amplitude) that best improves image similarity. The deformed structure obtained by using the selected motion then enters the next round of iterations (with its elastic network possibly changed). The iteration scheme terminates when fitting deformation cannot much further improve the image similarity of the simulated AFM topography for the deformed structure and the experimental AFM image.
The method has been validated in applications of different proteins with various characteristics of conformational changes under idealized conditions, where the target AFM image of a protein was generated as a simulated topography from a known atomistic structure (ground truth). Flexible fitting starting from another known structure of the same protein in a completely different conformation was then able to sufficiently reproduce the ground truth structure.
We implemented the NMFF-AFM method within the BioAFMviewer software for convenient applications to real-world experimental HS-AFM imaging data. The BioAFMviewer platform was designed toward integrating all available high-resolution structural and modeling data of biomolecules with resolution-limited scanning probe measurement data. The software package contains a high-quality molecular viewer for the 3D visualization of biomolecular structures combined with synchronized computation and visualization of corresponding simulated AFM images (live simulation AFM), allowing to correlate atomistic structures with measured topographies as previously demonstrated by applications of the integrated rigid-body fitting method.
With the prospects of flexible fitting allowing for efficient applications to large sets of HS-AFM imaging data, we have also implemented a new software tool for automated integration of experimental AFM raw data in the BioAFMviewer, the BioAFM-ASD viewer. This app processes data encoded in the ASD file format (a standard format used by AFM experimentalists) to visualize the dynamics of measured biomolecular surface topographies as 2D color-coded multiframe images, thus establishing a communication pipeline to correlate experimental imaging data with simulated AFM topographies of dynamic atomistic conformations generated during fitting. The BioAFM-ASD viewer includes options for post-experimental image processing prior to the application of flexible fitting, such as AFM-stage tilt correction, Gaussian filtering of pixel information along the scanning grid, and a method for automated molecule recognition to define the target AFM image relevant for fitting (see the Methods section for details). As an alternative to ASD files, measured AFM topographies can also be uploaded as image files in standard formats including tiff and multi-tiff.
For the convenient integration of the NMFF-AFM workflow, we have implemented a flexible toolbox app. Like existing BioAFMviewer apps, it provides a user-friendly interface to choose parameters of the model and the fitting process. The app directly communicates with the BioAFMviewer molecular viewer, simulation AFM, and ASD viewer, to conduct the fitting procedure. Thereby, it integrates the cycle of iterative normal-mode analysis of the biomolecular structure and comparison of simulated topographies for dynamic conformations with the target experimental AFM image toward eventually identifying the best matching fitting result. After completion of the fitting process, the app allows for direct visualization and analysis of the obtained fitting results. Furthermore, fitting data can be exported as movie files and PDB files of molecular dynamics trajectories. We refer to the Methods section for more detailed information.
Figure shows a schematic workflow for the application of the NMFF-AFM method within BioAFMviewer, involving the three stages of loading data, application of the flexible fitting toolbox, and graphical output combined with data analysis.
1.

Workflow for BioAFMviewer software integration of NMFF-AFM flexible fitting to AFM imaging data. The BioAFMviewer molecular viewer allows 3D visualization of available biomolecular structures combined with interactive live simulation AFM to compute and visualize corresponding pseudo-AFM topographic images to be correlated with experimental images during the fitting processes. The developed ASD file viewer incorporates the import of AFM raw data to visualize measured topographic images and provides image processing methods, including molecule recognition to autonomously identify targets for fitting. The flexible fitting toolbox app allows for the convenient selection of model and fitting parameters and integrates all computations of the NMFF-AFM iteration scheme. The interactive BioAFMviewer software interface allows visualization of fitting results, data analysis, and convenient export of all obtained data.
Applications to HS-AFM Imaging Data
HECT Domain Large-Amplitude Conformational Transition
The first application is for HS-AFM imaging of the structural dynamics of HECT (homologous to E6AP C-terminus) domain (∼350 amino acids) that plays an important role in post-translation protein ubiquitination by HECT E3 ligase enzymes to regulate numerous cellular processes. HS-AFM experiments visualized highly dynamic topographic changes showing frequent transitions between spherical and oval molecular shapes (ref , and Figure c for a few examples), which suggests large-amplitude conformational motions within the HECT domain under physiological conditions. Previous application of rigid-body fitting allowed to assign known static atomistic structures in very different conformations to either of the measured topographic shapes.
2.
HECT domain large-amplitude conformational transition. (a) Ribbon representation of the HECT domain structure in the catalytic conformation. The C-lobe domain (blue) and finger loop region are indicated. (b) Snapshots of structure changes in normal modes 7 and 8, respectively. The HECT structure is colored, and deformed conformations are shown in transparent gray. (c) Examples of typical HS-AFM images for HECT topographies in spherical and oval shapes. (d) The ASD image selected for fitting and the probability density of topography heights. (e) The same ASD image after applying filtering according to the height threshold determined by the Otsu method (denoised image). The corresponding height probability density shows variations of the HECT domain shape by green color, rescaled after erasing the background (shown as a red box for illustration purposes only). In (d, e), the region of interest (ROI) for fitting is shown in red. (f) Side-by-side image presentation of the initial HECT conformation for fitting (atomistic VdW representation), the corresponding simulated AFM image, the ROI of the target AFM image, and the error map. (g) The corresponding images after application of flexible fitting. (h) Plots of image similarity scores along the fitting process. The graph of the image cc score used for fitting is shown in blue; graphs of other scores are displayed in gray (for the p-rmsd an alternative scale is used). The iteration point determined by the fitting termination criterion is marked with a red dot. (i) Plot of the mode number selected along the fitting process; deformation modes are shown in blue. (j) Fitting results employing the denoised target AFM image (representation similar to that in panel (g)). (k) Corresponding plots of image similarity scores.
Here, we demonstrate flexible fitting to model large-amplitude transitions in applications for HECT domain HS-AFM imaging data. Figure a shows the HECT domain structure in its so-called catalytic conformation (PDB ID 3jvz), which has been identified to represent measured spherical shape AFM topographies. The conformation is characterized by a globular C-lobe domain connected in a central position with respect to the rest of the protein via an unstructured linker. Moreover, an extended finger loop is present in the other protein domain. Previous analysis of crystallographic B-factors and molecular dynamics simulations revealed structural flexibility of those protein parts. Normal-mode analysis (NMA) of this HECT structure preceding flexible fitting shows that the first four deformation modes indeed describe relative motions of the C-lobe domain and the finger loop; see Figure b for snapshots of structural changes in modes 7 and 8, respectively.
In our setup of flexible fitting, the HECT catalytic conformation is the starting structure. Figure c shows typical HS-AFM images depicting spherical and oval molecular shapes. As a target experimental image for fitting, we selected an image visualizing the HECT domain in the oval-shaped topography, which was previously correlated with a static structure in a completely different HECT conformation by rigid-body fitting. For the selected ASD frame (Figure d), we first computed the probability density of topography heights, which shows a narrow distribution (peak at ∼0.5 nm) corresponding to imaging noise of the AFM mica substrate (Figure d). Application of the Otsu algorithm determined a height threshold of ∼1.4 nm to separate the HECT shape from the noisy background (denoised image shown in Figure e) and to determine the region of interest (ROI) for fitting (shown in Figure d,e). We performed flexible fitting for both noisy and denoised experimental target images.
To choose the initial molecular orientation for flexible fitting, we use information from previous rigid-body fitting of the HECT structure to HS-AFM images (see the Methods section). Figure f shows the initial molecular orientation, the corresponding spherical shape simulated AFM topography, the target AFM image in the oval shape topography, and the error map visualizing their deviations. The discrepancies between the simulated and experimental AFM images before fitting are characterized by differences in the overall topography shape (spherical versus oval) and in the pattern of height protrusions.
The results obtained from flexible fitting (see the Methods section for details on parameters) are shown in Figure g. During the fitting process, significant structural changes of the HECT domain are observed, involving internal large-amplitude conformational motions of the flexible C-lobe domain and the finger loop combined with rigid-body rearrangements (see SI Movie 1). Changes between the initial structure and that obtained after fitting are quantified by an RMSD value of 14.5 Å, and 8.4 Å after structure alignment. The simulated AFM image of the fitted structure nicely resembles the oval shape and protrusions of the experimental AFM image, as evidenced by the corresponding error map. The fitting process is visualized in a side-by-side view of the molecular movie, the simulation AFM movie, and the target AFM image and as a superposition of the molecular movie with the target AFM image (SI Movie 1).
The quantitative improvement in image similarity scores along the fitting process is shown in Figure h. The image cc score used to drive the fitting process changes from 0.88 for the initial conformation and saturates at a value of 0.97, where the termination criterion detected the iteration step to determine the structure considered as the fitting result (see the Methods section). This choice is consolidated by monitoring other image similarity measures during fitting (p-rmsd, ssim; Figure h), showing that further conformational changes are unfavorable to improve image similarity. The distribution of selected normal modes along the fitting process (Figure i) confirms that motions of the HECT domain result from a combination of rigid-body rearrangements and internal conformational changes, which are dominated by the softest deformation mode.
We repeated the exact same flexible fitting using the denoised AFM image (Figure e) as a target. The results are shown in Figure j,k, and the fitting process is visualized in SI Movie 2. Comparing the molecular structures obtained from the two fittings (Figure g,j) shows some differences, most visible in the position of the flexible finger loop region. The overall differences are quantified by a RMSD value of 4.4 Å (without alignment). However, the two obtained structures align quite well with a much smaller RMSD of 1.9 Å. The implication of this observation is that differences between the fitting results are mainly attributed to a change in the orientation of the HECT structure relative to the AFM substrate, while similar internal large-amplitude conformational motions toward the target AFM image are produced during flexible fitting (regardless of the presence of noise). We argue that for the denoised case, the fitting procedure generated additional rotations of the HECT structure (see SI Movie 2) to improve image similarity toward the target AFM image in which denoising has removed the boundary of the protein topography at heights comparable to the background noise.
The quantitative improvement in image similarity scores along the fitting process (see Figure k) shows a change in the image cc from 0.86 to 0.96 (at the termination iteration) comparable to that for fitting to the noisy AFM image. Note that for the denoised fitting, the cosine similarity score, which does not consider the average of topography information, and the image c.c. are similar.
To compute simulated AFM images during fitting, we had to decide on parameters characterizing the tip-shape geometry by its probe sphere radius and the cone half-angle. In particular, the size of the probe sphere affects simulated topographies, , thereby possibly changing the results of flexible fitting to the target experimental AFM image. We addressed this issue by performing flexible fitting for different tip shapes varying in the size of the probe sphere radius (R = 1 nm, R = 2 nm, R = 3 nm). All three fitting procedures generated similar conformational changes of the flexible C-lobe domain and the finger loop (see SI Figure 1 and SI Movie 3) and showed comparable improvements in the image similarity score. For the fitting presented above, we favored fitting using a tip shape with a probe sphere radius of 2 nm and a cone half-angle of 10°, recording the highest image c.c. between the simulated AFM image of the fitted model conformation with the AFM target image.
Finally, we aimed to compare the dynamic HECT domain conformation obtained from flexible fitting with the static structure in the L-shape conformation (PDB ID 1D5F) that was previously correlated with measured oval shape HS-AFM topographies based on rigid-body fitting and is considered functional to transfer ubiquitin to target proteins. Since the structures represent different types of HECT domains (E6AP-type HECT used for flexible fitting and the protein used in experiments; NEDD4L-type HECT for L-shape) with poor sequence similarity, we characterized the conformations in terms of their domain geometry rather than using atomistic RMSD analysis. The position and orientation of the C-lobe domain relative to the N-lobe domain was characterized by the distance between their geometric centers and a hinge angle (see the Methods section), showing very different values for the HECT domain catalytic conformation used as the initial state for flexible fitting (angle = 108°; D = 3 nm) and the static structure L-shape conformation (angle = 67°; D = 3.1 nm) (see SI Figure 2). As we find, the HECT protein conformation obtained from flexible fitting is similar to the L-shape geometry (angle = 79°; D = 3 nm), thus supporting experimental observations of E6AP-type HECT to adopt functionally important conformations under equilibrium physiological conditions.
HECT Domain Complex with Bound E2 Enzyme
To demonstrate the versatility of the integrated fitting toolbox, we considered as a second example the application to a protein complex, combining rigid-body fitting to determine an initial molecular orientation and optimal tip shape parameters with subsequent flexible fitting to provide an atomistic model of experimental HS-AFM imaging.
Before the HECT-E3 ligase can ubiquitinate target proteins, it must accept a ubiquitin (Ub) molecule delivered by the small E2 enzyme in the first step. HS-AFM experiments, including both HECT and E2-Ub enzymes, could visualize topographies that were different from spherical or oval shapes of a single HECT domain, but it was not previously possible to clearly verify the formation of a protein complex using rigid-body fitting of available static atomistic structures. We therefore applied a combination of rigid-body and flexible fitting to provide an atomistic model for the measured HS-AFM image.
Figure a shows the results of rigid-body fitting of the HECT-E2 protein complex (494 amino acids, PDB ID 1c4z) obtained by the fitting toolbox in the BioAFMviewer software. The simulated AFM image of the molecular orientation compares to the HS-AFM image by an image cc score of 0.94. Indeed, the agreement is good for the region corresponding to the topography of the HECT domain. Notably, the predicted orientation of HECT is very similar to those obtained from previous rigid-body fitting of the single HECT domain to several measured HS-AFM images. However, the agreement between the simulated and HS-AFM image is less good for the part that represents the bound E2 enzyme, as indicated by the error map.
3.
HECT protein complex with bound E2 enzyme. (a) Side-by-side view of the HECT-E2 structure in the orientation obtained from rigid-body fitting, the corresponding simulated AFM topography, the target experimental AFM image, the superposition of the molecular structure with the AFM image, and the error map between the simulated and target AFM images. (b) Landscape of image c.c. values scoring similarity of simulated AFM images with variable tip shape parameters and the target AFM image obtained from the quick fit algorithm of refined rigid-body fitting. The optimal choice is indicated by a red dotted line with the parameter and score values shown. (c) Results from flexible fitting in the same representation as shown in panel (a). (d) Plots of image similarity scores along the fitting process. The graph of the image c.c. score used for fitting is shown in blue; graphs of other scores are displayed in gray (for the p-rmsd an alternative scale is used). The iteration point determined by the fitting termination criterion is marked as a red dot.
This shows that the observed dynamic HECT-E2 complex is clearly different from that of the crystal structure. Before proceeding with flexible fitting, we used the Quick-Fit function of the BioAFMviewer which refines the result of rigid-body fitting by iteratively sampling orientations that further improve image similarity toward the target AFM image considering different tip shape geometries to compute simulated AFM images. Collecting such results, we could therefore construct a landscape of image c.c. values for varying tip shape parameters shown in Figure b. While the cone angle value does not affect the comparison of the simulated images with the experimental AFM image, a strong dependence on the probe sphere radius can be clearly seen. Deviations from R = 2.0 (used in the previous example of flexible fitting) to either sharper or more blunt tips result in decreasing image c.c. scores. The optimal tip shape parameters are determined as (R = 2.3 nm, ϑ = 14°).
For the application of flexible fitting, the HECT-E2 complex is described by a single elastic network, modeling intra- and intermolecular interactions in the same way. We used the optimal tip shape for the computation of simulated AFM images and started fitting from the refined molecular orientation of the HECT-E2 crystal structure obtained by rigid-body Quick-Fit. The results from flexible fitting are shown in Figure c (see the Methods section for details on parameters). The fitting process mainly involves large-amplitude motions of the E2 protein relative to the HECT domain quantified by an RMSD value of 10.1 and 8.3 Å after structure alignment. The visualization is provided in a side-by-side view of the molecular movie, the simulation AFM movie, and the target AFM image, and as a superposition of the molecular movie with the target AFM image (SI Movie 4). The simulated AFM image of the fitted structure compares well with the target AFM image (image cc value of 0.96) both in the overall shape topography and the position of protrusions arising from the HECT domain as demonstrated visually by the error map. The quantitative improvement in image similarity scores along the fitting process (Figure d) shows that beyond the iteration step determined by the fitting determination criterion, the image c.c. score used for fitting saturates, indicating no further improvement.
Reconstructing Atomistic-Precision Molecular Movies from HS-AFM Imaging
The presented applications demonstrate that NMA flexible fitting can capture large-amplitude structural changes by efficiently steering a known static protein structure toward a dynamic conformation representing a plausible atomistic model for a single HS-AFM topographic image. Within the BioAFMviewer software, the performed fitting computations proceeded within a few tens of seconds on normal laptop machines (see the Methods section for benchmarking details). This remarkable computational efficiency combined with the implementation of automated molecule detection from HS-AFM raw data allows us to extend the application of flexible fitting to reconstruct atomistic conformations from a time series of measured protein topographies.
To provide a demonstration, we analyze again HS-AFM data for the structural dynamics of the HECT domain, visualizing conformations in the spherical and oval topography shapes, and transitions between them. We consider experimental data for a single molecule whose dynamic topography is traced in our ASD file viewer to determine for each frame the region of interest which defines the corresponding target AFM image (see the Methods section for the data set). For each target AFM image, flexible fitting was performed individually, always starting from the same molecular orientation of the HECT domain catalytic conformation used for the previous fitting (see Figure f). The model and fitting parameters were also the same, and similar fitting termination criteria were employed to determine the atomistic model behind each target AFM image (see the Methods section).
Collecting the results of fitting processes, we first quantified the structural change of each obtained conformation compared with the initial structure. We find that conformations group into those with minor changes (RMSD 2–4 Å), moderate changes (RMSD 4–7 Å), and large internal motions (RMSD > 8 Å) (Figure a). HECT conformations obtained from fitting with minor changes are representative of dynamic structures in the near-catalytic state (i.e., close to the initial conformation), corresponding to oval-shaped topographies of the analyzed HS-AFM video frames. Figure b shows an example of such fittings. On the other side, dynamic structures obtained by larger internal motions are representative of conformations corresponding to AFM topographies in spherical-to-oval intermediate shape (see Figure c for an example case) or those which correspond to clearly oval shape AFM topographies (see Figure d for an example case). It should be noted that the results shown in Figure d are consistent with our previous flexible fitting to an AFM image in a different oval shape topography (see Figure g).
4.
Reconstructing atomistic-precision molecular movies from HS-AFM imaging. (a) Structure change of dynamic HECT domain conformations with respect to the initial structure as obtained from flexible fitting toward respective video frame target images. The RMSD value representing only internal structural changes is plotted and visualized by a color gradient. Exemplary snapshots representative of dynamic HECT structures in near-catalytic conformation (b), intermediate conformation (c), and near L-shape conformation (d) are shown in a side-by-side view of molecular structure, simulated AFM, and target AFM image. Corresponding RMSD values and image-centered center values are given. (e) Geometry analysis of HECT dynamic conformations from fitting visualized in a polar plot of the C–N-lobe domain distance and the hinge angle (highlighted by a color-coded value). Corresponding values for the static catalytic and L-shape PDB structures are indicated. (f) Plot of the image cc value for each individual fitting process, visualizing improvement in the image similarity score from values correlating the initial structure simulation AFM image with respective video frame target images (gray colored dots) to correlations obtained after flexible fitting (green colored dots).
In SI Movie 5, the obtained entire HECT dynamics molecular movie is shown in a side-by-side view with the corresponding sequence of simulation AFM images, the sequence of target HS-AFM images, and a superposition of the molecular movie with the HS-AFM movie. The inferred atomistic models for HECT dynamic conformations are characterized by large-amplitude rotations of the flexible C-lobe domain and relative changes of the small finger loop region compared with the HECT catalytic state structure.
The geometry analysis of all collected dynamic HECT conformations indeed shows large variations in the C–N-lobe hinge angle, with values ranging between those for the static catalytic and L-shaped structures (Figure e). Here, we recall our previously published findings: First we demonstrated by rigid-body fitting, that the static catalytic HECT and L-shape conformations can be correlated with spherical shape and oval shape AFM topographies, respectively. Second, from modeling the conformational dynamics along the transition between both states combined with simulation AFM, we previously concluded that spherical-to-oval shape changes in AFM topographies are likely related to the rotation of the C-lobe domain relative to the N-lobe, although it was not possible to directly correlate modeling results with HS-AFM imaging data. The results from flexible fitting, now providing atomistic models for HECT dynamic conformations directly from measured AFM dynamic topographies, allow the detailed level verification for the interpretation of HECT protein functional dynamics from imaging data.
Looking further into the obtained results, we find that in the majority of cases, fitting generated dynamic HECT conformations which can be viewed as high-confidence atomistic models of corresponding HS-AFM movie frames, as evidenced by image cc values between simulated AFM images and experimental topographies as high as 0.95 to 0.97 (Figure f). For the few cases with lower image cc values (i.e., 0.92 and ∼0.94), we inspected the individual fitting processes and found two different scenarios. In one scenario, the image cc score quickly increases during a first phase of fitting, then saturates for a few iteration steps, before further increasing almost linearly until the preset last iteration step is reached (see Figure S3a–c). This rare type of behavior is not well represented when applying our automated fitting termination criteria which as a simplification assumes an exponential increase in the image cc improvement as fitting progresses (see the Methods section), hence resulting in premature termination close to the saturation stage. In such rare cases, we repeated the individual fitting procedures, this time extending the number of iteration steps to 200, which allowed saturation of image cc scores to obtain higher confidence atomistic models (see Figure S3a–c).
In the other scenario, the improvement of image cc score during fitting clearly saturates within the preset range of iteration steps, as detected by the automated termination criteria. However, further improvement in image similarity by fitting is hampered due to the presence of experimental artifacts in the corresponding HS-AFM images (see Figure S3d–f). Such topographies clearly show distortions along the scanning directions indicative of scanning with a contaminated tip or tip parachuting effects, thus limiting the flexible fitting prediction quality.
Scalability of Flexible FittingApplication to 4 Megadalton Actin Filament
The computational efficiency of the software-integrated NMFF-AFM method shall allow expansion of applications to the analysis of AFM imaging data for protein assemblies, such as filaments with a size of several hundred nanometers.
We provide a demonstration of the scalability of flexible fitting in an application for HS-AFM imaging of the actin filament, which plays an important role in the dynamics of the cell cytoskeleton and contraction of muscle cells. The HS-AFM technique was previously employed to observe dynamical changes in the actin filament in response to interactions with binding proteins, , but data analysis could be performed only at the level of measured limited-resolution topographic data. It is therefore generally desirable to infer atomistic models of dynamic actin conformations from AFM imaging for detailed analysis and an understanding of measurements.
Here, we provide an example application of flexible fitting considering a single HS-AFM experimental image of the actin filament, which was observed in a bent shape (Figure a), much different from the textbook perception being straight. To perform fitting, we first constructed an atomistic model of a straight actin filament consisting of 96 protomer units shown in Figure a (278,304 atoms in total, see the Methods section). Considering that AFM observations proceed with the actin filament placed on the 2D substrate allowed us to determine a suitable initial orientation of the atomistic model (see the Methods section for details). Furthermore, the comparison of simulated AFM topographies with the measured HS-AFM image allowed us to estimate tip-shape parameters for fitting as (R = 6.5 nm, ϑ = 10°) (see the Methods section and Figure S4).
5.
Flexible fitting of 4 megadalton actin filament. (a) Initial 96-protomer atomistic model of the actin filament, its corresponding simulation AFM image obtained with optimal tip-shape scanning, the target HS-AFM image of bent actin shape, and the topography error map before fitting (from top to bottom). (b) Left: Plots of image similarity scores along the fitting process. The graph of the image c.c. score used for fitting is shown in blue; graphs of other scores are displayed in gray (for the p-rmsd an alternative scale is used). The iteration point to terminate the fitting is marked as a red dot. Right: Plot of the mode number selected along the fitting process. Deformation modes are indicated as blue dots. Modes corresponding to rigid-body rotation (numbers 1 to 3) and translation (numbers 4 to 6) are shown as red dots. (c) Mid-stage result of fitting resulting solely from global bending motions of the actin model favored in the first stage of fitting. The instantaneous actin model at iteration step 11 is shown in superposition with its corresponding simulation AFM image (top), and the target HS-AFM image (bottom). (d) Results after finalized flexible fitting. From top to bottom: atomistic model of the fitted actin filament, its corresponding simulation AFM image, its superposition with the HS-AFM target image, and the topography error map.
The results from flexible fitting are shown in Figure (b–d) (see the Methods section for parameter details). We found that fitting proceeds in two stages. During the first stage, the fitting algorithm is solely driven to level the differences between the simulated AFM straight topography shape with respect to the measured bent shape topography, since only global bending motions of the actin filament characterized by the first nontrivial normal mode (mode number 7) are performed (see Figure b). During this part of fitting, the image cc value already significantly improved from 0.39 to 0.79 (see Figure b). The dynamic atomistic actin filament model resulting from the first fitting stage is shown in Figure c together with the corresponding simulated AFM image and the AFM experimental target AFM image. An interesting observation at this fitting stage is that while globally bent shapes of simulated and measured AFM topographies are already comparable by an image cc value of 0.79, they mismatch in terms of protrusion produced by the periodic arrangement of protomers along the filament. This is because the first stage fitting process also favored global bending motions that deviate the actin filament orientation from the planar AFM x–y surface (see topography height scale of the simulated AFM image in Figure c), apparently to better match the protrusion pattern in the central HS-AFM image region where the actin filament is rather straight.
The second fitting stage is then dominated by rigid-body motions of the actin filament combined with further internal structural changes (see Figure b for the selected normal modes). As we find, the rigid-body rearrangements consist of rotations and translations to place the bent filament back toward the planar orientation in which the simulated AFM image represents well the periodic protrusion pattern of the measured AFM image. Regarding internal structural changes, it is interesting that this fitting stage includes several steps which employ motions of higher-frequency normal modes (numbers 32 to 50). In contrast to low-frequency modes describing homogeneous bending of the actin filament, such modes rather correspond to deformations changing locally the arrangement between neighboring protomers, thus generating slight deviations from the global symmetrically bent filament structure. This can be seen in the atomistic model of the actin filament structure after finalized fitting shown in Figure d together with the simulated AFM image, the superposition of atomistic model with the target HS-AFM image, and the topography error map. The confidence of the obtained atomistic model to represent the experimental HS-AFM image is quantified by an image similarity cc score of 0.96 (Figure b).
The entire fitting process is visualized in a top-to-bottom view of the molecular movie, the simulation AFM movie, the target AFM image, and as a superposition of the molecular movie with the target AFM image (SI Movie 6).
Discussion
We provide the software implementation of the flexible fitting NMFF-AFM computational method within the popular BioAFMviewer platform and demonstrate the first applications to experimental HS-AFM imaging data. The developed BioAFM-ASD viewer app provides the interface to visualize and analyze measured AFM topographic data, while the NMFF-AFM app provides a graphical interface for flexible fitting and integrates all required computations. The implementation capitalizes on the BioAFMviewer stand-alone high-quality molecular viewer to visualize atomistic biomolecular structures, its synchronously operating AFM simulator, previously implemented rigid-body fitting methods, AFM topography analysis tools, and the graphical interface to directly visualize obtained results and export data. Hence, the established user-friendly environment, with no dependencies on other software products, offers the opportunity for broad applications of flexible fitting within the Bio-AFM community to infer 3D atomistic-precision models of resolution-limited topographic imaging.
The presented applications to experimental observations of a single protein domain, a protein complex, and a megadalton-sized filamentous protein clearly demonstrate the versatility of flexible fitting modeling large-amplitude conformational transitions to provide atomistic-precision models of biomolecular dynamics from HS-AFM imaging, thus overcoming the limitations of previous rigid-body fitting methods.
The chosen application of flexible fitting to HS-AFM data of the rather small HECT domain was challenging because the conformational dynamics observed under the AFM microscope are represented by topographies with a spatial resolution-limited to a few tenths of pixels in each x and y scanning direction. While this limited information on tip-convoluted topographies should generally restrict the confidence of predictions from any fitting procedure, our results obtained from flexible fitting are very much consistent with previous structural data analysis and molecular modeling emphasizing the importance of flexible C-lobe domain rotations for HECT function. HECT dynamics observed in HS-AFM experiments corresponds to transitions between the spherical and L-shape protein shapes under equilibrium conditions, which may regulate binding of the ubiquitin delivering E2 enzyme, known to preferably bind to the HECT domain in its L-shape conformation. The role of flexible motions of the finger loop in the HECT N-lobe region is so far much less explored (to the best of our knowledge), but its proximity to the E2 enzyme binding site may suggest involvement in the formation of the HECT-E2 complex. Further HS-AFM experiments considering HECT mutants are planned to investigate this aspect at the single-molecule level. A very recent experimental work has indeed characterized HECT domain C-lobe and N-terminal region as key structural elements for ubiquitination in the HECT-type E3 ligase Ufd4.
We expect the flexible fitting method to generally perform better in applications to HS-AFM imaging of larger proteins, where domain motions are visualized by kilopixel resolution (and beyond) topographies. Apparently, high-quality imaging data obtained with a sharp scanning tip are favorable to improving the quality of atomistic model predictions.
The application of flexible fitting to the actin filament demonstrates the scalability of the method, allowing for the atomistic-level interpretation of HS-AFM imaging far beyond the range of single protein topographies at reasonably low computational cost compared to the surreal expense required by molecular dynamics. Within the NMFF-AFM method, the generation of feasible dynamic conformations of a biomolecular structure is generally fast, but the computational efficiency of the entire fitting process is mainly gained from parallel GPU calculations of AFM simulation images within the BioAFMviewer. All applications of flexible fitting were performed on standard laptop computers, with computations taking between tens of seconds and less than a minute for the analysis of a single AFM frame (except for the actin filament case).
The new software implementation of the NMFF-AFM method therefore opens the opportunity for the large-scale analysis of scanning probe microscopy imaging data. The presented application for reconstructing an atomistic molecular movie of HECT protein dynamics from a sequence of 50 HS-AFM movie frames demonstrates a first advancement in that direction.
In principle, the computationally efficient application of flexible fitting can be extended to the automated analysis of all measured topographies and their dynamic changes accumulated from the time course of experimental observations. While this ambitious aim was not attempted in this work, we nonetheless discuss several challenging tasks currently preventing fully automated large-scale application of flexible fitting. The implemented method of automated molecule recognition from HS-AFM imaging data can reliably provide individual target topographies for the presented flexible fitting applications, but it would generally not be possible to detect a single-molecule topography in AFM images of clustered molecules or distinguish between different molecule types.
Further challenges for automation are related to the flexible fitting method. As demonstrated before in applications to synthetic AFM data, the performance of flexible fitting, and therefore the reliability of the obtained dynamic model conformation, may be affected by the chosen molecular orientation of the initial structure. Within the BioAFMviewer workflow, the rigid-body fitting app provides an automated procedure for the initial alignment of the starting structure with respect to the target AFM image prior to the application of flexible fitting. However, for the application to the HECT domain, the predicted alignments were not reliable because simulated AFM images of the starting structure are very different from the target HS-AFM image. In that case, our choice for the initial alignment was based on information about the molecule placement relative to the AFM substrate acquired from previous applications of rigid-body fitting of the starting structure to a selected set of HS-AFM images that represented HECT protein topographies close to the same equilibrium conformation. We argue that this application of flexible fitting is an exemplary case where stark differences in the topography shapes of the initial and target image may present a general challenge for confident initial molecule alignment by automation. Nonetheless, manual intervention to determine the initial alignment allowed us to further reconstruct an atomistic-precision molecular movie of dynamic models, including motions of large-amplitude conformational transitions, inferred by the completely automatized application of flexible fitting to 50 measured HS-AFM imaging frames.
The presented application to the HS-AFM image of the HECT-E2 complex provides a demonstration of the ideal BioAFMviewer fitting workflow, consisting of reliable automated rigid-body alignment and the estimation of optimal tip-shape parameters prior to the actual flexible fitting process.
Conclusions
In summary, with the prospects of flexible fitting for large-scale automated analysis of AFM imaging data, the provided NMFF-AFM software implementation can be viewed as a first advancement toward that direction. Along this path, with several future challenges ahead, the presented contribution allows for the immediate practical application of flexible fitting to AFM measurement data within the versatile, user-friendly BioAFMviewer software interface. The BioAFMviewer software is available as a free download from the project website www.bioafmviewer.com. All new presented developments are implemented in BioAFMviewer version 5.0. A promotional film highlighting flexible fitting in the BioAFMviewer is available on the project YouTube channel https://www.youtube.com/@bioafmviewer.
Methods
Implementation of the BioAFM-ASD Viewer
We developed our own computer code to process AFM experimental raw data encoded in the ASD file format and visualize measured topographic images. The ASD viewer app includes postexperimental image processing for AFM-stage tilt correction and for Gaussian filtering of AFM pixel information along the scanning grid. For the implementation, we employed previously described algorithms. For the automated detection of measured topographies to be considered as target AFM images, we calculated for each ASD frame the probability density of topography heights and then applied the Otsu method to determine the height threshold separating relevant information from noisy background (the opencv library was used). The boundary for each isolated topography in the actual ASD frame was then characterized by a box with Nx pixels width and Ny pixels height. The corresponding region of interest (ROI) was constructed by extending the box size by a specified number of pixel units. Alternatively, the ROI can be manually drawn within the ASD frame pixel canvas. The size of the target AFM image is fixed by the ROI, but there is an option to proceed fitting with either all topography height information retained or the denoised case from application of the Otsu method threshold.
Communication between the ASD Viewer and BioAFMviewer Simulation AFM
Once the fitting toolbox is activated, the scan step to generate simulated images is fixed to the value extracted from the loaded experimental ASD file. The ROI for a molecule in the ASD canvas would result in the same-size ROI centered at the origin of the simulation AFM canvas. In both image canvases, the ROIs can be moved but not resized, as they must have the same size to be compared by the image similarity scoring functions.
Implementation of the Flexible Fitting Toolbox
The flexible fitting toolbox app integrates all computations of the NMFF-AFM iteration scheme and allows users to modify parameters of the model and the fitting process.
The only model parameter is the elastic network cutoff distance.
Elastic Network Cutoff Distance
The cutoff distance determines the connectivity of atoms within the elastic network constructed from the input PDB structure, i.e., the pattern of effective harmonic interactions of each atom with neighboring atoms. The default value is chosen as 8 Å. Generally, a large cutoff distance would result in stiff elastic networks, preventing normal modes to capture intramolecular flexibility. If the value is too small, the elastic network would contain sparsely connected regions which give rise to unnatural motions (indicating the presence of additional trivial normal modes). In the latter case, the app will announce a warning alert.
Integrating the NMFF-AFM iteration scheme relies on the computation of normal modes of the instantaneous atomistic structure and the corresponding simulated AFM images of deformed conformations.
Computation of Elastic Network Normal Modes
To integrate the NMFF-AFM procedure into the BioAFMviewer, the original source code (see repository https://github.com/TamaLab/nmff-afm/tree/main) was rewritten in C++. For the normal-mode analysis, the original Fortran source code (see repository https://github.com/TamaLab/nma) was compiled as a C++ library that was called within the BioAFMviewer iterative fitting procedure. Normal-mode analysis is performed using the RTB method for computational efficiency. The assignment of amino acid residues into blocks (residues per block, rpb) depended on the total number of atoms N of the biomolecular structure in the following way: N between 0 and 1000, 1 rpb; N between 1000 and 4000, 3 rpb; N between 4000 and 8000, 5 rpb; N between 8000 and 11,000, 7 rpb; N > 11000, 11 rpb.
Simulation of AFM Images
AFM simulation to translate atomistic structures into synthetic AFM topographies is at the core of BioAFMviewer software. Simulated scanning is based on the nonelastic collisions of a rigid cone-shaped tip model (probe sphere radius R, and cone half-angle ϑ) with the rigid van der Waals atomic model of the biomolecular structure (for details, see ref ). The calculation speed is massively enhanced by our implemented GPU-based atom filtering algorithm and computations. Furthermore, the computation of simulated AFM images for all of the deformed biomolecular conformations generated in each single iteration step is performed in parallel threads on the graphics card.
The following parameters are relevant to the fitting process.
Number of Normal Modes
Starting with the elastic network constructed from the initial input PDB structure, the conformation is deformed in each iterative step according to a single favorable normal mode, which is automatically selected from the number of specified normal modes. The total number of normal modes for an (anisotropic) elastic network of N atoms is 3N. The first 6 modes describe trivial translations and rotations of the structure and are required to include rigid-body rearrangements during iterative fitting. The remaining nontrivial modes 7···3N capture internal structural changes. Generally, the first few non-trivial modes describe collective domain motions, whereas higher number modes correspond to more localized structural changes. By default, the number of modes is 15. Choosing a larger number of modes will increase the space of sampled conformational motions at the cost of additional computational time needed to complete fitting.
RMSD of Structural Deformation
The RMSD value quantifies the conformational change in each iterative step. It refers to the root-mean-square deviation (RMSD) of the atomic structure before and after the iteration step, i.e., , where N is the number of atoms. The 3N-dimensional vectors R⃗ j and R⃗ j+1 contain atomic positions of the structure before and after iteration step j, respectively. Since R⃗ j+1 = R⃗ j + A j ·E⃗ j , and normal-mode vectors are normalized, the amplitude A j chosen to apply a structure change according to the selected normal mode E⃗ j is given by the relation . The default RMSD value is 0.5 Å. Using much larger values can result in unnatural structural changes, and it is recommended to use a larger number of iteration steps instead.
Image Similarity Scoring Functions
To quantify the similarity between simulated AFM images of all deformed structures generated in each iteration step with the target AFM image, we use scoring functions previously applied in computational AFM analysis. They are based on a pixel-wise comparison between the simulated and measured height topographies across the scanning grid. The image correlation coefficient (image c.c.) is computed as . The cosine similarity (cos. sim.) considers absolute height values neglecting averages, i.e., . The pixel RMSD measures the root-mean-square deviation of height differences, i.e., . In the equations above, h ij represents the height value for the pixel at the position (i,j) of the scanning grid, and h̅ denotes the average of all pixel heights. The structural similarity metric (SSIM) is defined by SSIM = ((2h̅ sim h̅ exp + C 1)·(2σsim,exp + C 2))/(((h̅ sim)2 + (h̅ exp)2 + C 1)·((σsim)2 + (σexp)2 + C 2)), where σsim and σexp are the standard deviations of heights for the simulated and the experimental image, respectively, and σsim,exp is the covariance. The two constants C 1 = (k 1 L)2 and C 2 = (k 2 L)2 prevent divergence in the score. The value L represents the dynamic range of input data and is set as the maximum height found in the simulated and experimental image, and standard values k 1 = 0.01 and k 2 = 0.03 are used.
Number of Iterations and Criterion for Automated Termination of Fitting
Flexible fitting terminates after a preset number of iterations. The default value is 100 steps, but the choice is dependent on the particular application. Additionally, we implemented a method for automated selection of a sufficiently fitted deformed structure based on a posteriori analysis of all iterations. Here, we follow the strategy formulated in the original paper aimed to prevent structural deformations that only marginally increase image similarity of simulated and target experimental AFM image. This situation typically occurs when the image similarity is already sufficiently high, but fitting may continue to generate internal motions that cannot be justified due to the limited information provided by AFM surface topography imaging. To automatically select the sufficiently fitted structure, we traced the change of the image similarity score in each iteration, applied exponential fitting to the resulting data, and detected the iteration at which fitting should terminate by the criteria that it improved the score by a value less than 3–5% compared to the change at the initial iteration.
Fitting Mode
We provide a choice between single-frame and multiframe flexible fitting. In the first case, a single HS-AFM movie frame can be selected in the ASD viewer, and fitting proceeds for a chosen topography (automatically detected or manually selected). For multiframe fitting, aimed at reproducing a molecular movie from a time series of HS-AFM images, the user can specify the range of movie frames in the ASD viewer, and individual fitting processes are performed frame by frame employing automated molecule detection to identify the corresponding target topography.
Data Analysis, Graphical Output, and Data Export
Structural Change during Fitting
The structural change during fitting is quantified by the root-mean-square displacement (RMSD) of the atomic positions of the instantaneous conformation with respect to the initial conformation. The RMSD is also evaluated by alignment of the instantaneous conformation with the initial conformation, quantifying only internal conformational changes. Corresponding plots can be visualized after fitting and exported as image and text files.
Image Similarity Scores during Fitting
The selected score, quantifying image similarity between the simulated and target AFM images along the fitting process, can be visualized as a line plot. Plots of the other available scoring functions are also provided for comparison, but it should be noted that the fitting results employing them would be different. The data can be exported as images and text files.
Selected Normal Modes during Fitting
During the fitting process, the number of normal modes selected in each iteration step is recorded and can be displayed as a point plot. The data can be exported as image and text files.
Error Map
The error map provides a graphical representation of differences between the simulated and target AFM topography. For the region of interest (ROI), the pairwise comparison of height values for pixels across the scanning grid is performed and deviations are visualized by a color gradient. For the presented applications, deviations relative to mean heights were considered, i.e., for each cell of the scanning grid, the height difference Δh ij = (h ij – h̅ sim) – (h ij – h̅ exp) was evaluated. The visualization of the error map can be modified by color bar adjustments. The obtained data can be exported as images and text files.
Movie and PDB Data Export
The molecular viewer and synchronized simulation AFM in the BioAFMviewer allow visualization of the fitting process as a molecular movie of atomistic motions from the initial orientation of the structure toward the conformation reached by fitting, together with the corresponding simulation AFM movie. In the case of the multiframe flexible fitting, the atomistic molecular movie is displayed as a collection of predicted models for each frame together with the corresponding simulation AFM movie. Two types of visualizations are provided for export in video file format. A side-by-side view of the molecular movie, the simulation AFM movie, and the target AFM image (movie). And second, a superposition of the molecular movie with the target AFM image (movie). Additionally, fitting results can be exported as PDB files of atomic coordinates and their dynamic changes. For example, in the case of flexible fitting applied to a time series of AFM images, atomistic conformations of final models representing the molecular movie inferred from all AFM image frames can be exported as a multiframe PDB file.
Details for Application to HS-AFM Imaging Data
HECT Domain
Flexible fitting was performed with the following set of parameters. Elastic network cutoff distance: 8 Å. Number of normal modes: 10. RMSD of structural deformation: 0.5 Å. Number of iterations: 100. Image similarity scoring: image c.c. The tip shape parameters for simulation AFM were: probe sphere radius R = 2 nm and cone half-angle ϑ = 10° (Figure and SI Movie 1), and R = 1.0, 3.0 nm (SI Figure 1 and SI Movie 3).
HECT-E2 Protein Complex
Rigid-body fitting of the HECT-E2 protein complex was performed by sampling 3D molecular orientations with an angular grid spacing of 10° and using tip shape parameters R = 2 nm and cone half-angle ϑ = 10° for simulation AFM. Application of the Quick-Fit procedure proceeded with the best matching molecular orientation based on image c.c. scoring, considering variations in the tip shape parameters (ΔR = ±1.5 nm, step size 0.1 nm; Δϑ = ±5°, step size 1°). The molecular orientation obtained with the optimal tip shape (R = 2.3 nm, ϑ = 14°) was used as the initial conformation for flexible fitting, which was performed with the following set of parameters: Elastic network cutoff distance: 8 Å. Number of normal modes: 15. RMSD of structural deformation: 0.5 Å. Number of iterations: 100. Image similarity scoring: image c.c.
HECT Protein Molecular Movie
The experimental raw data was processed in the ASD viewer to track the measured HECT topographies and determine the ROI for fitting within each frame. Flexible fitting was performed with the following set of parameters. Elastic network cutoff distance: 8 Å. Number of normal modes: 10. RMSD of structural deformation: 0.5 Å. Number of iterations: 100. Image similarity scoring: image c.c.
HECT Domain PDB Structures for Fitting and Analysis
For the HECT domain application of flexible fitting, the crystal structure PDB ID 3jvz (E6AP-type HECT catalytic conformation; chain C: Glu574-Gly950) was used. Other HECT domain structures PDB ID 1d5f (NEDD4L-type HECT L-shape conformation; chain A: Asn497-Ala846) and PDB ID 1nd7 (reverse T-shape; chain A: His544-Glu917) were considered for a comparison of flexible fitting results with static structural data. The domain geometry of static HECT structures and dynamic conformations obtained from flexible fitting was characterized by the distance between the C-lobe domain and N-lobe domain geometric centers and a hinge angle. For 3jvz, the α-carbon atoms of residues Val838-Gly950 (C-lobe), Glu574-Asn746 and Asn783-Gly834 (N-lobe), and Asp747-Glu782 (finger loop) were considered. For 1d5f, the α-carbon atoms of residues Leu742-Ala846 (C-lobe), Asn497-Tyr645 and Glu685-Cys737 (N-lobe), and Glu646-Asn684 (finger loop) were considered. The orientation of the N-lobe domain in the respective HECT conformation was determined by computing the three principal axes of the inertia. The hinge angle characterizing the orientation of the C-lobe relative to the N-lobe domain was then defined by the vector connecting the geometric centers and the major principal axis.
For the HECT-E2 application of flexible fitting, the crystal structure PDB ID 1c4z (HECT domain, chain A: Asn497-Ala846; E2 enzyme; chain D: Ser4-Tyr147) was used.
Actin Filament
The 96-protomer atomistic model for fitting was constructed using the open-source code provided in ref . The resulting structure was already aligned along the X-axis in the coordinate system of our molecular viewer. Considering that actin bending toward the shape observed in the target HS-AFM image proceeds with the filament placed on the 2D substrate allowed us to determine the degree of filament rotation around the x-axis required to prepare a suitable initial orientation for fitting. In other words, since global bending of the actin filament is described by motions corresponding to the first nontrivial normal mode (mode number 7), we have roughly aligned the filament such that bending motions would proceed within the x–y plane, which represents the AFM substrate plane in our coordinate system. After that, we additionally shifted the filament position in the x-direction such that the periodic height protrusion pattern in the simulated AFM image (coming from the actin helical pitch) roughly matches that of the AFM image in the straight central region of the filament. It should be noted that a precise alignment is not necessary because further adjustments are taken care of during the flexible fitting process. To estimate the optimal tip shape for fitting, we compared the cross-section height profiles of simulated AFM topographies for the initial orientation of the actin model with the measured HS-AFM topography. Using the BioAFMviewer topography analysis toolbox, we chose a cross section in the region where the filament is straight and varied the tip probe sphere radius value for computing simulated AFM images to roughly match the height profile in the measured image (see Figure S4).
Flexible fitting was performed with the following set of parameters: Elastic network cutoff distance: 8 Å. Number of normal modes: 50. RMSD of structural deformation: 5 Å. Number of iterations: 100. Image similarity scoring: image c.c.
HS-AFM Experimental Imaging
For the application of flexible fitting to HECT/E2 protein dynamics, we used a data set of previously obtained HS-AFM imaging and refer to the original publications for further experimental details.
For the application of flexible fitting to the actin filament, original HS-AFM data was used. Experiments for actin were performed using a laboratory-built machine in tapping mode with small cantilevers (BLAC10DS-A2, Olympus) as previously described. A DPPC/DPTAP/Biotin-Cap-DPPE (89:10:1) lipid bilayer was deposited (10 min) on the mica surface before the incubation of actin (10 min) (see ref for the detailed method of mica-supported lipid bilayer). A conventional actin buffer (25 mM KCl, 0.5 mM EGTA, 2 mM MgCl2, 20 mM Imidazol-HCl pH 7.0) was used for imaging.
Computation Benchmark Details
All flexible fitting computations were performed on standard laptop computers. The computational efficiency evaluated using the two architectures is summarized in Table .
1. Computation Benchmark Details .
| Single HECT image | HECT-E2 | HECT movie (50 images) | Actin | |
|---|---|---|---|---|
| Lenovo ThinkPad X1 Carbon (7th Gen, 2019) | 29 s | 44 s | 26.5 min | 4.1 h |
| CPU: Intel Core i5–7200U 2.50 GHz (up to 2.71 GHz), 2 cores, 4 threads | ||||
| RAM: 8GB | ||||
| GPU: Intel HD Graphics 620 | ||||
| Lenovo ThinkPad P14s (4th Gen, 2023) | 15 s | 17 s | 14 min | 1.5 h |
| CPU: Intel Core i7–1370P 1.90 GHz (up to 5.20 GHz), 14 cores, 20 threads | ||||
| RAM: 64GB | ||||
| GPU: NVIDIA RTX A500 Laptop, 4GB |
Details of the two laptop computer architectures used for flexible fitting are provided together with the corresponding computation times for the four applications of HS-AFM imaging data.
Supplementary Material
Acknowledgments
This work was supported by the Ministry of Education, Culture, Sports, Science and Technology (MEXT), Japan, through the World Premier International Research Center (WPI) Initiative (R.A., N.K., H.K., and H.F.), Grant-in-Aid for Scientific Research KAKENHI Nos 25K01631 (H.K.) and 24H02260 (O.M., F.T.) from the Japan Society for the Promotion of Science (JSPS), the Transdisciplinary Research Promotions grant (R.A., N.K., H.K., and H.F.) from WPI-NanoLSI, and in part by MEXT as “Program for Promoting Researches on the Supercomputer Fugaku” (Development and application of large-scale simulation-based inferences for biomolecules JPMXP1020230119) (O.M., F.T.).
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acsnano.5c10073.
HECT domain large-amplitude conformational transition - fitting with different tip shape (SI Figure 1). Geometry analysis of HECT crystal structures (SI Figure 2). Reconstructing atomistic-precision molecular movies from HS-AFM imaging (SI Figure 3). Tip shape estimate for actin filament flexible fitting (SI Figure 4) (PDF)
Flexible fitting for HECT domain (Movie 1) (MP4)
Flexible fitting for HECT domain with denoised HS-AFM target image (Movie 2) (MP4)
Flexible fitting for HECT domain using different tip shapes (Movie 3) (MP4)
Flexible fitting for HECT-E2 protein complex (Movie 4) (MP4)
Flexible fitting for HECT dynamics HS-AFM movie (Movie 5) (MP4)
Flexible fitting for 4 megadalton actin filament (Movie 6) (MP4)
Conceptualization: R.A., O.M., F.T., H.F. Methodology: R.A., O.M., X.W., F.T., H.F. Software: R.A., O.M., F.T. Validation: R.A., K.T., N.K., H.K., O.M., F.T., H.F. Formal analysis: R.A., H.F. Investigation: R.A., K.T., H.K., N.K., H.F. Resources: R.A., H.K., N.K., O.M., F.T., H.F. Data curation: R.A., N.K., H.K. Writingoriginal draft: H.F. Writingreview and editing: R.A., N.K., H.K., O.M., F.T., H.F. Visualization: R.A., H.F. Supervision: O.M., F.T., H.F. Project administration: O.M., F.T., H.F. Funding acquisition: R.A., O.M., F.T., H.F.
A previous version of this manuscript was posted on a preprint server.
The authors declare no competing financial interest.
References
- Nogales E., Mahamid J.. Bridging structural and cell biology with cryo-electron microscopy. Nature. 2024;628:47–56. doi: 10.1038/s41586-024-07198-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Banari A., Samanta A. K., Munke A., Laugks T., Bajt S., Grünewald K., Marlovits T. C., Küpper J., Maia F. R. N. C., Chapman H. N., Oberthür D., Seuring C.. Advancing time-resolved structural biology: latest strategies in cryo-EM and X-ray crystallography. Nat. Methods. 2025;22:1420–1435. doi: 10.1038/s41592-025-02659-6. [DOI] [PubMed] [Google Scholar]
- Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., Tunyasuvunakool K., Bates R., Zidek A., Potapenko A., Bridgland A., Meyer C., Kohl S. A. A., Ballard A. J., Cowie A., Romera-Paredes B., Nikolov S., Jain R., Adler J., Back T.. et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Abramson J., Adler J., Dunger J., Evans R., Green T., Pritzel A., Ronneberger O., Willmore L., Ballard A. J., Bambrick J., Bodenstein S. W., Evans D. A., Hung C.-C., O’Neill M., Reiman D., Tunyasuvunakool K., Wu Z., Zemgulyte A., Arvaniti E., Beattie C.. et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature. 2024;630:493–500. doi: 10.1038/s41586-024-07487-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ando T., Fukuda S., Ngo K. X., Flechsig H.. High-speed atomic force microscopy for filming protein molecules in dynamic action. Annu. Rev. Biophys. 2024;53:19–39. doi: 10.1146/annurev-biophys-030722-113353. [DOI] [PubMed] [Google Scholar]
- Ando, T. High-Speed Atomic Force Microscopy in Biology; Springer: Berlin, Heidelberg, 2022. [Google Scholar]
- Fukuda S., Ando T.. Faster high-speed atomic force microscopy for imaging of biomolecular processes. Rev. Sci. Instrum. 2021;92:033705. doi: 10.1063/5.0032948. [DOI] [PubMed] [Google Scholar]
- Shimizu M., Okamoto C., Umeda K., Watanabe S., Ando T., Kodera N.. An ultrafast piezoelectric Z-scanner with a resonance frequency above 1.1 MHz for high-speed atomic force microscopy. Rev. Sci. Instrum. 2022;93:013701. doi: 10.1063/5.0072722. [DOI] [PubMed] [Google Scholar]
- Flechsig H., Ando T.. Protein dynamics by the combination of high-speed AFM and computational modeling. Curr. Opin. Struct. Biol. 2023;80:102591. doi: 10.1016/j.sbi.2023.102591. [DOI] [PubMed] [Google Scholar]
- Niina T., Matsunaga Y., Takada S.. Rigid-body fitting to atomic force microscopy images for inferring probe shape and biomolecular structure. PLoS Comput. Biol. 2021;17:e1009215. doi: 10.1371/journal.pcbi.1009215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amyot R., Marchesi A., Franz C. M., Casuso I., Flechsig H.. Simulation atomic force microscopy for atomic reconstruction of biomolecular structures from resolution-limited experimental images. PLoS Comput. Biol. 2022;18:e1009970. doi: 10.1371/journal.pcbi.1009970. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Niina T., Fuchigami S., Takada S.. Flexible fitting of biomolecular structures to atomic force microscopy images via biased molecular simulations. J. Chem. Theory Comput. 2020;16:1349–1358. doi: 10.1021/acs.jctc.9b00991. [DOI] [PubMed] [Google Scholar]
- Dasgupta B., Miyashita O., Tama F.. Reconstruction of low-resolution molecular structures from simulated AFM force microscopy images. Biochim. Biophys. Acta, Gen. Subj. 2020;1864:129420. doi: 10.1016/j.bbagen.2019.129420. [DOI] [PubMed] [Google Scholar]
- Dasgupta B., Miyashita O., Uchihashi T., Tama F.. Reconstruction of three-dimensional conformations of bacterial ClpB from high-speed atomic-force-microscopy images. Front. Mol. Biosci. 2021;8:704274. doi: 10.3389/fmolb.2021.704274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu X., Miyashita O., Tama F.. Modeling conformational transitions of biomolecules from atomic force microscopy images using normal mode analysis. J. Phys. Chem. B. 2024;128:9363–9372. doi: 10.1021/acs.jpcb.4c04189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vuillemot, R. ; Pellequer, J.-L. ; Grudinin, S. . AFMfit: deciphering conformational dynamics in AFM data using fast nonlinear NMA and FFT-based search bioRxiv 2024. [DOI] [PMC free article] [PubMed]
- Tama F., Sanejouand Y. H.. Conformational change of proteins arising from normal mode calculations. Protein Eng., Des. Sel. 2001;14:1–6. doi: 10.1093/protein/14.1.1. [DOI] [PubMed] [Google Scholar]
- Gur M., Zomot E., Bahar I.. Global motions exhibited by proteins in micro- to milliseconds simulations concur with anisotropic network model predictions. J. Chem. Phys. 2013;139:121912. doi: 10.1063/1.4816375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mahajan S., Sanejouand Y. H.. On the relationship between low-frequency normal modes and the large-scale conformational changes of proteins. Arch. Biochem. Biophys. 2015;567:59–65. doi: 10.1016/j.abb.2014.12.020. [DOI] [PubMed] [Google Scholar]
- Tama F., Miyashita O., Brooks C. L.. Flexible multi-scale fitting of atomic structures into low-resolution electron density maps with elastic network normal mode analysis. J. Mol. Biol. 2004;337:985–999. doi: 10.1016/j.jmb.2004.01.048. [DOI] [PubMed] [Google Scholar]
- Tama F., Miyashita O., Brooks C. L.. Normal mode based flexible fitting of high-resolution structure into low-resolution experimental data from cryo-EM. J. Struct. Biol. 2004;147:315–326. doi: 10.1016/j.jsb.2004.03.002. [DOI] [PubMed] [Google Scholar]
- Jin Q., Sorzano C. O. S., de al Rosa-Trevin J. M., Bilbao-Castro J. R., Nunez-Ramirez R., Llorca O., Tama F., Jonic S.. Iterative elastic 3D-to-2D alignment method using normal modes for studying structural dynamics of large macromolecular complexes. Structure. 2014;22:496–506. doi: 10.1016/j.str.2014.01.004. [DOI] [PubMed] [Google Scholar]
- Amyot R., Flechsig H.. BioAFMviewer: an interactive interface for simulated AFM scanning of biomolecular structures and dynamics. PLoS Comput. Biol. 2020;16:e1008444. doi: 10.1371/journal.pcbi.1008444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weber J., Polo S., Maspero E.. HECT E3 ligases: a tale with multiple facets. Front. Physiol. 2019;10:370. doi: 10.3389/fphys.2019.00370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takeda K., Flechsig H., Muro I., Amyot R., Kobayashi F., Kodera N., Ando T., Konno H.. Structural dynamics of E6AP E3 ligase HECT domain and involvement of a flexible hinge loop in the ubiquitin chain synthesis mechanism. Nano Lett. 2023;23:11940–11948. doi: 10.1021/acs.nanolett.3c04150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Otsu N.. A threshold selection method from gray-level histograms. IEEE Trans. Syst., Man, Cybern. 1979;9:62–66. doi: 10.1109/TSMC.1979.4310076. [DOI] [Google Scholar]
- Huang L., Kinnucan E., Wang G., Beaudenon S., Howley P. M., Huibregtse J. M., Pavletich N. P.. Structure of an E6AP-UbcH7 complex: insights into ubiquitination by the E2-E3 enzyme cascade. Science. 1999;286:1321–1326. doi: 10.1126/science.286.5443.1321. [DOI] [PubMed] [Google Scholar]
- Ngo K. X., Kodera N., Katayama E., Ando T., Uyeda T. Q. P.. Cofilin-induced unidirectional cooperative conformational changes in actin filaments revealed by high-speed atomic force microscopy. eLife. 2015;4:e04806. doi: 10.7554/eLife.04806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ngo K. X., Vu H. T., Umeda K., Trinh M. N., Kodera N., Uyeda T.. Deciphering the actin structure-dependent preferential cooperative binding of cofilin. eLife. 2024;13:RP95257. doi: 10.7554/eLife.95257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu X., Ai H., Mao J., Cai H., Liang L.-J., Tong Z., Deng Z., Zheng Q., Liu L., Pan M.. Structural visualization of HECT-type E3 ligase Ufd4 accepting and transferring ubiquitin to form K29/K48-branched polyubiquitination. Nat. Commun. 2025;16:4313. doi: 10.1038/s41467-025-59569-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tama F., Gadea F. X., Marques O., Sanejouand Y. H.. Building-block approach for determining low-frequency normal modes of macromolecules. Proteins. 2000;41:1–7. doi: 10.1002/1097-0134(20001001)41:1<1::AID-PROT10>3.0.CO;2-P. [DOI] [PubMed] [Google Scholar]
- Amyot R., Kodera N., Flechsig H.. Atom filtering algorithm and GPU-accelerated calculation of simulation atomic force microscopy images. Algorithms. 2024;17:38. doi: 10.3390/a17010038. [DOI] [Google Scholar]
- Reynolds M. J., Hachicho C., Carl A. G., Gong R., Alushin G. M.. Bending forces and nucleotide state jointly regulate F-actin structure. Nature. 2022;611:380–386. doi: 10.1038/s41586-022-05366-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Uchihashi T., Kodera N., Ando T.. Guide to video recording of structure dynamics and dynamic processes of proteins by high-speed atomic force microscopy. Nat. Protoc. 2012;7:1193–1206. doi: 10.1038/nprot.2012.047. [DOI] [PubMed] [Google Scholar]
- Yoshimi K., Takeshita K., Kodera N., Shibumura S., Yamauchi Y., Omatsu M., Umeda K., Kunihiro Y., Yamamoto M., Mashimo T.. Dynamic mechanisms of CRISPR interference by Escherichia coli CRISPR-Cas3. Nat. Commun. 2022;13:4917. doi: 10.1038/s41467-022-32618-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amyot, R. ; Miyashita, O. ; Wu, X. ; Takeda, K. ; Kodera, N. ; Konno, H. ; Tama, F. ; Flechsig, H. . Flexible fitting to infer atomistic-precision models of large-amplitude conformational dynamics in biomolecules from high-speed atomic force microscopy imaging bioRxiv 2025. https://www.biorxiv.org/content/10.1101/2025.05.14.653975v1.abstract 10.1101/2025.05.14.653975. (accessed September 10, 2025). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




