Skip to main content
Royal Society Open Science logoLink to Royal Society Open Science
. 2021 Aug 18;8(8):210160. doi: 10.1098/rsos.210160

Insights into the aggregation mechanism of RNA recognition motif domains in TDP-43: a theoretical exploration

Wei Liu 1, Chaoqun Li 2,, Jiankai Shan 1, Yan Wang 1, Guangju Chen 1,
PMCID: PMC8371369  PMID: 34457335

Abstract

The transactive response DNA-binding protein 43 (TDP-43) is associated with several diseases such as amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration (FTLD) due to pathogenic aggregations. In this work, we examined the dimer, tetramer and hexamer models built from the RRM domains of TDP-43 using molecular dynamics simulations in combination with the protein–protein docking. Our results showed that the formations of the dimer models are mainly achieved by the interactions of the RRM1 domains. The parallel β-sheet layers between the RRM1 domains provide most of the binding sites in these oligomer models, and thus play an important role in the aggregation process. The approaching of the parallel β-sheet layers from small oligomer models gradually expand to large ones through the allosteric communication between the α1/α2 helices of the RRM1 domains, which maintains the binding affinities and interactions in the larger oligomer models. Using the repeatable-superimposing method based on the tetramer models, we proposed a new aggregation mechanism of RRM domains in TDP-43, which could well characterize the formation of the large aggregation models with the repeated, helical and rope-like structures. These new insights help to understand the amyloid-like aggregation phenomena of TDP-43 protein in ALS and FTLD diseases.

Keywords: TDP-43 protein, RRM1 and RRM2 domains, molecular dynamics simulation, aggregation

1. Introduction

Many human neurodegenerative diseases are correlated with the abnormal accumulation of proteins in the neurons tissues, such as amyotrophic lateral sclerosis (ALS), Parkinson's disease (PD), Alzheimer's disease (AD) and frontotemporal lobar degeneration (FTLD) etc. [13]. Recent evidence has further connected the progressive disorder diseases of ALS, FTLD and AD to the aggregation or polymer formation of proteins [47]. The transactive response RNA-binding protein 43 (TDP-43) is not only a primarily nuclear RNA-binding protein but also one of the potential aggregation proteins [8]. It has been evidenced that aggregation in nervous tissues of wild-type TDP-43 is a potential risk in ALS and FTLD [911]. Therefore, investigation of TDP-43 aggregation mechanism has gained momentum in recent years [12].

TDP-43 was first discovered in 1995 and is implicated in mRNA metabolism, such as mRNA transport, mRNA stability, pre-mRNA splicing and miRNA processing [8,13]. TDP-43 contains 414 amino acids and consists of one N-terminal domain (NTD), two repeated RNA recognition motif domains (RRM domains), one C-terminal domain (CTD) and linking loops [14]. The domain schematic is shown in figure 1a. Specifically, NTD (residues 1–101) includes one α-helix, six β-strands and several loops, while CTD (residues 274–414) contains a helix-turn-helix structure and unstructured loops [15,16]. RRM domains feature two independent domains of RRM1 (residues 102–176) and RRM2 (residues 192–267) with each including two α-helixes (α1, α2), five β-strands (β1–β5) and a α-linker loop between them (figure 1b) [17,18]. The aggregation phenomena mostly mediated by independent NTD and CTD have been extensively investigated in vivo and in vitro [1921]. Some investigated evidence showed that the CTD, included in the fragments of ALS-associated TDP-43 inclusions, is classified as prion-like, and is aggregation prone both in vitro and in cell [22,23]. Other studies predicted that the NTD was predicted to adopt a stable fold and was found to drive the formation of large oligomers [24,25]. For instance, Miguel Mompeán and Douglas V. Laurents' group provided specific structural evidence of CTD in TDP-43 with β-hairpins assembling into a parallel β-turn for amyloid-like structure, via electron microscopy (EM), X-ray diffraction and solid-state NMR techniques [26]. Nicolas L. Fawzi's group elucidated the structural basis of dynamic granule formation of TDP-43 via its CTD through ALS-associated mutations [27]. Then, the NMR structure of head-to-tail NTD dimer was solved by them, which induces the native polymerization [28]. Using the NMR and other analytical methods, the studies of Emanuele Buratti and Douglas V. Laurents' group strongly supported that a stably folded NTD is essential for correct TDP-43 function and also enhances dimerization, which is in relation to the pathological aggregation and protein's activities [15].

Figure 1.

Figure 1.

Compositions of TDP-43 protein. (a) The RRM12 domains are connected to NTD (in red) and CTD (in magenta) by the N-loop and C-loop, respectively. RRM12 domains contain the RRM1 (in purple) domain and RRM2 (in orange) domain that are connected by a α-linker (in cyan). (b) The domain organization of the RRM12 domains are represented by the five β-strands (β1, β2, β3, β4, β5) and two mutually vertical α-helixes (α1, α2) of each RRM1 (in purple) domain and RRM2 (in orange) domain, respectively.

Recently, some studies have shown that the RRM domains will not only help the interaction between TDP-43 and nucleic acids, but also tend to self-assembling using the nucleic acid binding sites as nucleic acids are absent, which also plays an important role in pathological aggregation of TDP-43 [2932]. Especially, Elsa Zacco and Annalisa Pastore's group investigated the supramolecular conformations constructed by RRM1 and RRM1-2 through the CD analysis, the ThT-binding assay and the AFM techniques. By analysing the resulting formations after the protein constructing to fully aggregate after 72 h, they found that the aggregates formed by RRM1 or RRM1-2 appeared uniformly thick or softer, almost cotton-like with an average thickness of 365 or 173 nm, respectively [33]. In 2019, they validated that the stabilization of the structure of the two RRM domains and the aggregation propensity are simultaneously increased through the D169G mutation adjacent to RRM1, which also reduces the RNA binding tendency. The mutation causes an insistent exposure of the RNA-binding interface, which not only enhances the aggregation potential but also promotes the production of toxic aggregates [34]. Moreover, the studies of Han-Jou Chen and Christopher E. Shaw's group in 2019 showed that the TDP-43 mutations, such as the RRM adjacent K181E mutation, will largely improve aggregation and are possibly served as pathogenic since they promote wild-type TDP-43 to aggregate dominant-negatively [35]. Jianxing Song's group revealed that the TDP-43 RRM12 domains without ATP became opaque during the incubation, which indicated formation of fibrils via the analysis of heteronuclear single quantum coherence (HSQC) peaks and the fluorescence characteristics [36]. It has been found by David Eisenberg's group that amyloid-like fibril formation presents a runaway domain swap through exchanging the C-terminal β-strands between two monomers from the sample of RNase A with Q10 expansion [37,38]. In addition, RNA was shown to be an important factor in preventing aggregation of TDP-43 resulting from the fact that RRM domains with RNA binding will maintain TDP-43 solubility [3436].

Molecular dynamics (MD) simulations of two mutated structures (D169G and K263E) of RRM domains have been reported, which predicted the mutation leading aggregation tendency [39]. Protein–protein docking studies indicated that the cysteine residues in NTD contribute to aggregation propensity [40]. Atomic MD simulation studies elucidated the structural unfolding and stability of NTD and RRMs, which indicated that these unfolded intermediate structures induce the aggregation of TDP-43 [41,42]. MD simulation studies have also shown that the 341–366 segment of CTD was able to drive aggregation of the entire TDP-43 protein [43]. Feng Ding and co-workers theoretically studied the oligomerization of full length hIAPP through multiple molecular systems of increasing number of peptides and suggested that oligomers larger than a trimer allowed the formation of more stable β-sheets of the human islet amyloid polypeptide [44]. However, theoretical studies on the aggregation structures of RRM domains of wild TDP-43 are scarce so far. Especially, the detailed aggregation processes by which small oligomers evolve into large oligomers leading to an amyloid-like structure of TDP-43 are elusive but necessary to understand the aggregation mechanisms of RRM1 and RRM2 domains. To gain an in-depth understanding of the structural and dynamical basis for the aggregation mechanism and the oligomerization process of RRM domains in TDP-43, we used protein–protein docking technique to build the initial structures of dimer, tetramer and oligomer models, and carried out MD simulations with free energy calculations at the atomic levels for these models. Our main objectives were: (i) to establish as many small oligomer models as possible of RRM1 and RRM2 domains, calculate and compare their stabilities; (ii) to elucidate the possible aggregation mechanism by which small oligomers promote large oligomers. This study will help better understanding the RRM domains amyloid-like aggregation mechanism.

2. Methods

2.1. Model construction for the RRM domains of TDP-43

Based on the previous experimental studies, two tandem RRM1 and RRM2 domains of TDP-43 protein play an important role in the TDP-43 protein aggregation [3335]. So, the two RRM1 and RRM2 domains were chosen to investigate the aggregation mechanism of this protein. The initial structure of the tandem RRM1 and RRM2 domains was constructed by using the nuclear magnetic resonance (NMR) structure of the RRM domains of TDP-43 protein with RNA binding (Protein Data Bank (PDB) code: 4BS2) (assigned as RRM12 model). In order to eliminate the disturbance on the intra-domain structure from RNA binding, the coordinates of the RRM1 and RRM2 domains were corrected by the NMR structures of the two isolate RRM1 and RRM2 domains (PDB codes: 2CQG and 1WF0). Namely, first, the basis structure of RRM12 model was taken from the NMR structure of the RNA binding RRM domains (PDB code: 4BS2) with the direct superposing of isolate RRM1 and RRM2 on the corresponding positions; second, the corrected coordinates of RRM1 and RRM2 domains captured from that of the NMR structure of isolate RRM1 and RRM2 (PDB code: 2CQG and 1WF0) were merged into the corresponding positions of NMR structure of the RNA binding RRM domains (PDB code: 4BS2); finally, besides removing the bound RNA structure, the full residue coordinates from Lys102 to Asn267 of the RRM1 and RRM2 domains and the linking loop of Asn179 to Val193 between the two domains in the RRM12 model were constructed by the NMR experimental data (PDB code: 4BS2) with the deletion of the extra residues in the RRM1 domain and with the mutated Gly200 residue recovered by the original Glu200 residue in the RRM2 domain for recovering the primary sequence of this domain [45]. Five initial dimer models were produced from the average structure of the monomer RRM12 model by using the protein–protein docking software [4648]. The tetramer models were constructed based on the average structure of the most stable dimer model using the same method. The large aggregation models, such as hexamer, octamer, dodecamer and so on, were constructed using the superimposing-creating method. The details of construction processes for these models can be found in §§3.2, 3.3.1 and 3.3.2, respectively. For MD simulation, each model system was explicitly solvated by using the transferable intermolecular potential 3P (TIP3P) water inside a rectangular box large enough to ensure the solvent shell with the depth of 14 Å in all the six directions of the model.

2.2. Molecular dynamics simulation and ZDOCK

MD simulations for these models, including minimization, heating and equilibration, were performed by using the sander module of AMBER16 package with a classical AMBER parm99 [49,50]. The details of the MD procedure are given in the electronic supplementary material. The Cpptraj and Ptraj utilities of the AMBER16 program were used to obtain root-mean-square deviation (RMSD), average structures, hydrogen bond and hydrophobic interactions etc. [51]. ZDOCK algorithm based on fast Fourier transform searches all possible binding modes by changing the translational and rotational positions of proteins; and each mode was evaluated by a comprehensive energy scoring function that includes some terms of shape complementarity, electrostatics and statistical potential [5254]. ZDOCK software was used to search the possible binding conformations of these models (v. 3.0.2, http://zdock.umassmed.edu/) [46,5254]. The details of the docking procedure are available in the electronic supplementary material.

2.3. Binding free energy analysis

The molecular mechanics Poisson-Boltzmann surface area (MM-PBSA) method was used to perform the binding free energy analysis in AMBER16 package [55,56]. The binding free energy (ΔGbinding) was calculated based on the followed equation:

ΔGbinding=Gcomplex(Gligand+Greceptors), 2.1

where Gcomplex, Gligand and Greceptors are the free energies of complex, ligand and receptor calculated from the snapshots of the MD trajectories, respectively. The binding free energy was calculated as follows:

ΔGbinding=ΔHbindingTΔS=ΔEMM+ΔGsolvTΔS. 2.2

The details of the MM-PBSA method can be found in the electronic supplementary material.

3. Results

3.1. Structural characteristics of the monomer RRM12 model

For determining the structure of the RRM1 and RRM2 domains, the 1000 ns MD simulation for the RRM12 model was carried out. The root-mean-square deviation (RMSD) values of all backbone atoms of the RRM1 and RRM2 domains referenced to the corresponding starting structure over the trajectory of RRM12 model were examined to determine if the system had attained equilibrium, and are shown in figure 2a. It can be seen from figure 2a that the RRM12 model reached equilibrium after short simulation times, and the energies were found to be stable during the remainder of the simulation. Therefore, the equilibrated conformation for this model was extracted from the trajectory analysis. The corresponding average structure of RRM12 model is shown in figure 2b. In this average structure, the RRM1 and RRM2 domains are linked by a small α-helix as the flexible linking loop. Both domains display the similar conformation, i.e. the five β-strands of β1, β2, β3, β4, β5 form a curve sheet layer nearby the plane constructed by two almost vertical α1-α2 helixes; the angles of the α1-α2 helixes and the mass centre distances between the α1–α2 helixes plane and the β-strands sheet layer are 104.3° and 98.2°, 8.2 and 7.6 Å in RRM1 and RRM2 domains, respectively. Each β-strand and each α-helix in the two domains are linked by the corresponding linking loop, named by the two linked units as Lβ1-α1, Lα1-β2, Lβ2-β3, Lβ3-α2, Lα2-β4 and Lβ2-β5. The linking order is following by β1-α1-β2-β3-α2-β4-β5 in each domain with the β4 strand representing a loop structure in the RRM1 domain (figure 2b). The structural characteristics of each domain in the RRM12 model show a ‘hotdog' fold sketch, such that the curve β-sheet wraps up two α-helices in one side. The relative position of two domains can be described in detail, by the mass centre distance, and the relative rotation angles of the two α1 helices and the two α2 helices in RRM1 and RRM2 domains, to be 24.9 Å, 21.1° and 49.6°, respectively. It can be seen from the results that the two domains linked by the small α-helix linking loop lay nearby each other with the certain spiral and tilt angle (figure 2b). Especially, comparing the above calculated distance and angles to the experimental data of 23.9 Å, 16.4° and 24.2° in the RNA binding RRM domain structure, respectively, it is found that the RNA binding causes the decrease of spiral angle amount between the two α2 in two domains and results in the tendency of two domains paralleling with each other. Also, it can be seen that the current RRM12 model constructed by the superimposing technique reproduces the experimental structures [18]. This calculated monomer RRM12 model will be prepared to construct the oligomer aggregation models for the investigation of the aggregation mechanism of RRM domains of TDP-43 protein.

Figure 2.

Figure 2.

(a) RMSD values of heavy atoms with respect to the initial structure of the RRM12 system. (b) The average structure of the RRM12 system. The RRM1 domain and the RRM2 domain are coloured in purple and orange, respectively.

3.2. The structure and interaction characteristics of RRM domain dimer models

To explain the aggregation process of RRM domains of TDP-43, the initial dimer models were produced from the average conformation of the calculated RRM12 model by protein–protein docking method using ZDOCK software (http://zdock.umassmed.edu/) [46]. Based on this method, five dimer models were screened from the top 250 docking results with the high ZDOCK scores and the groups of some similar contacted modes. The details of the docking procedure are available in the electronic supplementary material. Finally, MD runs of 1000 ns for five models were carried out by following the same protocol above.

3.2.1. Structure analysis of the dimer models

Based on the similar trajectory analysis, MD simulations of 1000 ns for all five dimer models reached equilibrium in the time range of 400–600 ns, and the corresponding energies were found to be stable during the remainder of the simulation. Therefore, the equilibrated conformations have been extracted from 800 to 1000 ns of simulation time, and are shown in figure 3 with the corresponding non-transparent surface structures. The RMSD values of all backbone atoms referenced to the corresponding starting structures over the trajectories of the five dimer models are shown in electronic supplementary material, figure S1a–e. Based on the asymmetrical ellipsoid shape of the monomer RRM12 model (figure 2b), four outer edges of each model were defined as ‘shoulder12-α’, ‘shoulder12-β', and ‘head-1', ‘head-2', which will be applied to easily identify the types and names of the five dimer models. The shoulder12-α in each model represents the outer edge consisted of the α2-helix in RRM2 domain, and the linking loop (or the α-helix-linker) between RRM1 and RRM2 domains; the shoulder12-β represents the outer edge consisted of two β-sheet layers in RRM1 and RRM2 domains; then head-1 represents the outer edge constructed by the α1, α2 helixes of the RRM1 domain, and head-2 represents the outer edge constructed by the β-sheet layer tail of the RRM2 domain (figure 2b). From the above, the five dimer models were defined as SαSβ/D, SβSβ/D, SβH1/D, SβH2/D and H1H1/D models with three classified groups of shoulder-to-shoulder, i.e. SαSβ/D and SβSβ/D models, shoulder-to-head, i.e. SβH1/D and SβH2/D models, and head-to-head, i.e. H1H1/D model, respectively (figure 3).

Figure 3.

Figure 3.

Average structures and the corresponding non-transparent surfaces of (a) dimer SαSβ/D, (b) SβSβ/D, (c) SβH1α1/D, (d) SβH2/D and (e) H1H1/D systems. Two monomers in the five dimer models are coloured in cyan and green.

It can be seen from the average structure in figure 3a that the SαSβ/D model presents that the α-helix linking loop and β5 strand of RRM1 domain in the shoulder12-α side of one monomer RRM12 model (cyan in figure 3a) combines face-to-face with the curve β-sheet layer of RRM1 domain in the shoulder12-β side of the other monomer RRM12 model (green in figure 3a). Meanwhile, the α-helix linking loop of one monomer occupies the interstice of two domains of the other monomer, and vice versa. In figure 3b, the SβSβ/D model presents that the curve β-sheet layer of the RRM1 domain in the shoulder12-β side of one monomer RRM12 model (cyan in figure 3b) combines interlaced with the β-sheet layer of RRM1 in the shoulder12-β side of the other monomer RRM12 model (green in figure 3b). The above two dimer models were formed by the interaction between the shoulder sides, and thus were called the group of shoulder-to-shoulder. The SβH1α1/D model in figure 3c presents that the α2-helix of RRM1 domain in the head-1 side in one monomer RRM12 model (cyan in figure 3c) inserts tilted into the interstice of two β-sheet layers, nearby the β-sheet layer of RRM1 domain, in the shoulder12-β side of the other monomer RRM12 model (green in figure 3c). Moreover, the linking loop in the same monomer RRM12 model interacts with the β-sheet layer of RRM1 in the shoulder12-β side of the other monomer RRM12 model. Then, the SβH2/D model in figure 3d presents that the α2-helix of RRM1 domain in the head-1 side in one monomer RRM12 model (cyan in figure 3d) inserts vertically into the interstice of two β-sheet layers in the shoulder12-β side of the other monomer RRM12 model (green in figure 3d). So, the SβH1α1/D and SβH2/D dimer models were called the group of shoulder-to-head. The H1H1/D model in figure 3e presents that the α1-helix of RRM1 domain in the head-1 side in one monomer combines with the loose β4 and β5 in the other monomer, and vice versa, as the group of head-to-head. In conclusion, the combinations of two RRM1 domains dominate the formations of these dimer models, which is consistent with the experimental results of RRM1 function in the TDP-43 protein aggregation [33].

3.2.2. Analyses of binding free energies and interactions of the dimer models

To address the combination abilities for these five dimer models of SαSβ/D, SβSβ/D, SβH1/D, SβH2/D and H1H1/D, their binding free energies have been carried out by using the MM-PBSA method based on their corresponding MD simulations. All energy terms and the total binding free energies for these models are given in table 1. It can be seen that the binding free energies for SαSβ/D, SβSβ/D, SβH1/D, SβH2/D and H1H1/D dimer models are −57.66, −41.97, −23.11, −17.80 and −4.22 kcal mol−1, respectively, which reasonably show the stability and possible existence of the five dimer models from the favourable binding energies. This result supports the experimental suggestion that the RRM domains can possibly cause the aggregation of TDP-43 proteins themselves under the absence of RNA [33,34,57]. Moreover, based on the differences of the energies of the five dimer models, their combination abilities between two monomer RRM12 models along the shoulder sides are obviously greater than that along the head sides, which proposes that the further aggregation along the shoulder sides would be favourable.

Table 1.

Components of the MM-PBSA free energies (kcal mol−1) for the SαSβ/D, SβSβ/D, SβH1α1/D, SβH2/D and H1H1/D dimer models. ΔGnp = ΔEvdw + ΔGnp/solv, ΔGpb = ΔEele + ΔGpb/solv, ΔHbinding = ΔGnp + ΔGpb + ΔEint, ΔGbinding = ΔHTΔS.

model SαSβ/D SβSβ/D SβH1/D SβH2/D H1H1/D
ΔEele −574.68 −160.57 −307.64 −252.14 −188.48
ΔEvdw −147.57 −146.63 −155.24 −74.73 −58.48
ΔEint 0.00 0.00 0.00 0.00 0.00
ΔGnp/solv −19.14 −18.15 −19.26 −10.29 −7.05
ΔGpb/solv 633.36 231.30 397.46 279.29 209.91
ΔGnp −166.71 −164.78 378.20 269.00 202.86
ΔGpb 58.69 70.73 −174.50 −85.02 −65.53
ΔHbinding −108.02 −94.05 −79.86 −57.87 −44.10
TΔS −50.36 −52.08 −61.57 −33.23 −39.88
ΔGbinding −57.66 −41.97 −23.11 −17.80 −4.22

To further explore the interaction mechanisms between the two RRM12 monomer models in these dimers with the different binding ways, the percentages of occurrences of all possible hydrogen bond and hydrophobic interactions located at the interfaces for the five dimer models were extracted from the MD simulations, and are shown in table 2. The criteria include a donor–acceptor distance of <3.5 Å and a donor–proton–acceptor angle of greater than 120° for a hydrogen bond, while a C-C distance of less than 4.5 Å for a hydrophobic interaction. It can be seen from table 2 that the total occupancies of hydrogen bond and hydrophobic interaction are 628.64% and 554.72% for SαSβ/D model, 302.29% and 638.81% for SβSβ/D, 520.81% and 431.48% for SβH1/D, 190.74% and 593.42% for SβH2/D, and 215.96% and 112.83% for H1H1/D, respectively. Obviously, the total occupancies for the hydrogen bonds or hydrophobic interactions for SαSβ/D, SβSβ/D, SβH1/D and SβH2/D models along the shoulder sides in RRM1 domains between two monomer RRM12 models are larger than those for H1H1/D one along the head sides, which are consistent with the binding free energy calculations. Some binding sites have been found at the corresponding interfaces with the binding type of shoulder or head for these models. For example, the interaction residues of hydrogen bonds and hydrophobic interactions for SαSβ/D model locate at Ser163, Asp174, Cys175, Lys176, Gln182, Gln184, Asp185, Arg227 and Met162, Leu177, Pro262 in the shoulder12-α side of RRM1 domain of one RRM12 monomer (cyan in figure 3a), and Lys145, Arg165, Lys176, Asn179, Gln182, Arg227, Glu261 and Leu109, Phe147, Phe149, Pro225 in the shoulder12-β side of RRM1 domain of the other RRM12 monomer (green in figure 3a), respectively. Moreover, the ionic interaction analysis for SαSβ/D, SβSβ/D, SβH1/D, SβH2/D and H1H1/D models were calculated, and are shown in figure 4. Similarly, the criteria for an ionic interaction include a distance of less than 5.0 Å between the HN group of positive Arg or Lys and the O atom of negative Asp or Glu. It can be seen that the ionic interactions mainly occurred at the residue-pairs of Asp174-Lys176 and Glu156-Arg171 with the total occupancy of 161.1% for SαSβ/D model, the residue-pairs of Glu186-Arg171 and Lys140-Glu200 with the total occupancy of 104.1% for SβSβ/D model, the residue-pairs of Asp185-Lys136 and Asp174-Arg165 with the total occupancy of 175.9% for SβH1/D model, the Glu156-Arg171 residue-pair with the occupancy of 51.6% for SβH2/D model. Conversely, there is no ionic interaction for H1H1/D model. These results are consistent with the analyses of binding energies and interactions discussed above. The non-transparent surfaces in figure 3 show the binding interfaces in the studied dimer models, and as expected the rank of the binding areas in this interface is in consistent with the order of the binding free energies and interactions in this interface. In conclusion, the greater the combination ability, the larger the binding area.

Table 2.

Occupancies (%) of hydrogen bonds and hydrophobic interactions between different RRM12 monomers for the SαSβ/D, SβSβ/D, SβH1α1/D, SβH2/D and H1H1/D models.

SαSβ/D occupancy%
SβH2/D occupancy%
hydrogen bonds hydrogen bonds
Lys 176 NZ-HZ* … OE1 182 Gln 84.25 Ser 163 O … H*-N* 197 Arg 73.84
Leu 177 O … HD21-ND2 179 Asn 67.49 Ser 125 O … H-N 266 Ser 62.39
Asp 174 OD2 … HH*-NH* 165 Arg 66.19 Asp 185 OD1 … H*-N* 165 Arg 55.27
Ser 163 O … HE-NE 165 Arg 61.19 Asp 174 OD2 … HZ*-NZ 136 Lys 51.10
Asp 174 OD1 … HH*-NH* 165 Arg 61.10 Asn 267 O … HH*-NH* 171 Arg 46.94
Gln 184 OE1 … HH12-NH1 227 Arg 59.77 Gln 182 O … HH*-NH2 171 Arg 46.52
Asp 185 OD1 … HZ*-NZ 145 Lys 55.37 Asp 185 OD2 … H*-N* 165 Arg 43.98
Asp 174 OD1 … HZ*-NZ 176 Lys 39.71 Gln 164 NE2-HE21 … O 228 Ala 42.95
Gln 182 OE1 … H*-N* 179 Asn 37.73 Met 162 O … HZ*-NZ 136 Lys 34.79
Cys 175 O … HZ*-NZ 176 Lys 33.73 Ser 266 O … HH*-NH1 171 Arg 32.29
Asp 174 OD2 … HZ*-NZ 176 Lys 33.60 Gln 164 OE1 … HE2-NE2 256 His 30.74
Arg 227 NH2-HH* … OE* 261 Glu 28.51
hydrophobic interactions hydrophobic interactions
Leu 177 CD2 … CD2 147 Phe 88.36 Phe 127 CZ … CD2 229 Phe 92.20
Met 162 CB … CD2 109 Leu 84.51 Trp 172 CZ3 … CB 228 Ala 90.93
Leu 177 CD1 … CD1 147 Phe 73.50 Leu 177 CD1 … CD2 147 Phe 77.41
Leu 177 CD1 … CD2 147 Phe 68.80 Leu 177 CD1 … CD2 109 Leu 55.16
Leu 177 CD2 … CZ 149 Phe 56.31 Leu 177 CD1 … CD1 147 Phe 38.18
Pro 262 CD … CD 225 Pro 54.98 Val 159 CG1 … CE1 149 Phe 27.95
Leu 177 CD1 … CZ 147 Phe 42.82 Leu 177 CD1 … CD1 109 Leu 25.86
Met 162 CE … CD1 109 Leu 33.71 Val 159 CG1 … CE1 149 Phe 23.79
Leu 177 CD1 … CD1 109 Leu 30.98
Leu 177 CD2 … CD1 147 Phe 20.75
SβSβ/D occupancy%
SβH2/D occupancy%
hydrogen bonds hydrogen bonds
Arg 165TRP172ARG165TYR155 Asp 185 N-H … O 170 Gly 56.93 Arg 165 NH2-HH22…O 259 Asn 62.26
Arg 171 NH*-HH* … O 257 Ile 39.28 Trp 172 NE1-HE1…O 261 Glu 52.92
Glu 186 OE* … .H*-N* 171 Arg 32.04 Arg 165 NH2-HH21…OG 258 Ser 51.93
Ser 183 O … H-N 172 Trp 30.28 Tyr 155 OH-HH…O 145 Lys 23.63
Ala 228 N-H … O 141 Thr 26.26 hydrophobic interactions
Asn 179 ND2-HD21 … O 172 Trp 25.77 Val 159 CG1 … CD2 147 Phe 79.74
Arg 171 NH1-HH1 … O 246 Glu 24.71 Val 159 CG1 … CD1 147 Phe 73.65
Gln 182 NE2-HE22 … O 172 Trp 23.45 Val 159 CG1 … CD2 109 Leu 71.89
Asn 259 O … HH*-NH* 171 Arg 23.00 Val 159 CG2 … CD1 147 Phe 61.31
Arg 171 NH1-HH1 … O 178 Pro 20.57 Met 167 CE … CD1 188 Leu 50.17
hydrophobic interactions Met 167 CE … CD2 188 Leu 47.86
Phe 147 CD1 … CZ 147 Phe 94.88 Trp 172 CD1 … CB 262 Pro 48.54
Trp 113 CD1 … CZ 194 Phe 90.63 Trp 172 CD1 … CD 262 Pro 36.94
Trp 113 CD1 … CD1 194 Phe 71.93 Val 159 CG1 … CZ 147 Phe 36.02
Trp 113 CD1 … CD2 194 Phe 70.50 Val 159 CG2 … CD2 147 Phe 34.38
Phe 147 CD2 … CZ 147 Phe 64.14 Val 159 CG1 … CD2 109 Leu 26.98
Leu 109 CD2 … CZ 147 Phe 49.35 Val 159 CG2 … CZ 147 Phe 25.94
Phe 147 CZ … CZ 147 Phe 46.07 H1H1/D
Trp 113 CE3 … CZ 229 Phe 45.86 hydrogen bonds
Trp 113 CZ2 … CD1 194 Phe 33.55 167 Met O … HE21-NE2 164 Gln 70.14
Trp 113 CE3 … CZ 194 Phe 27.96 168 Ile O … H-N 167 Met 65.99
Leu 109 CD2 … CD1 147 Phe 23.49 114 Lys NZ … HZ*-O 167 Met 59.12
Leu 109 CD1 … CD2 109 Leu 20.45 122 Glu OE* … HH22-NH2 165 Arg 20.71
hydrophobic interactions
168 Ile CG2 … CH2 172 Trp 80.31
123 Tyr CD2 … CH2 172 Trp 32.52
Figure 4.

Figure 4.

The ionic interaction occupancies between two monomer RRM12 from the simulations of the dimer models of (a) SαSβ/D, (b) SβSβ/D, (c) SβH1/D and (d) SβH2/D, respectively.

To compare the differences of the energies and interactions of the dimerization, and to further provide the quantitative information of energy contributions of the residues, the decompositions of the corresponding binding free energies into the key residues were performed for the five dimer models, and are shown in figure 5a and b only for SαSβ/D and H1H1/D dimer models, respectively. It can be seen from the energy decompositions of two models with the energy contributions more than 2 kcal mol−1 (i.e. the decomposition energy values less than −2 kcal mol−1) that the distinct differences of energy decompositions occur mainly at the crucial residues of Glu156, Met162, Ser163, Asp174, Leu177, Asp185, Arg227, Pro262 in one monomer (i.e. the cyan one in figure 3a) and Leu109, Phe147, Phe149, Arg165, Arg171, Asn179, Gln182, Phe221, Phe231 in the other one (i.e. the green one in figure 3a) for SαSβ/D dimer model, and at the crucial residues of Lys114, Met167, Ile168 and Lys114, Arg165, Met167, Trp172 for H1H1/D dimer model (figure 3e). Obviously, the crucial interaction sites in SαSβ/D dimer model are more than those in H1H1/D one. The binding residue numbers and locations support the binding free energy calculations, and the results of hydrogen bonds and hydrophobic interactions. Other details of the decompositions for other dimer models can be found in electronic supplementary material, figure S2.

Figure 5.

Figure 5.

MM-PBSA energy decompositions between two monomer RRM12 representing (kcal mol−1) into the residues of RRM12 domains for the dimer (a) SαSβ/D and (b) H1H1/D models.

3.3. The oligomer aggregation mechanism

3.3.1. The structure and interaction characteristics of tetramer models

To investigate the aggregation mechanism of RRM1 and RRM2 domains of TDP-43 protein, two tetramer models of SαSβ1/T and SαSβ2/T based on the most stable SαSβ/D dimer model were built by using the ZDOCK software with the top energy scoring, and simulated for 50 ns by using the same method described in §2.2. These two tetramer models represent the typical aggregation modes among the first eight docking structures with the top scores from the 250 docking models screened by the ZDOCK software. The equilibrated conformations of SαSβ1/T and SαSβ2/T are shown in figure 6a and b, and RMSD values are shown in electronic supplementary material, figure S1f. It can be seen from figure 6a that the SαSβ1/T tetramer model presents that both head-1 sides of RRM1 domains in the cyan and green SαSβ/D dimer model combine to both head-1 sides of RRM1 domains in the yellow and magenta SαSβ/D dimer model, i.e. two dimers are bound together in the way of head-to-head along the shoulder horizontal direction. However, the SαSβ2/T tetramer model in figure 6b presents that both head-2 sides of RRM2 domains in the cyan and green SαSβ/D dimer model combine with both head-2 sides of RRM2 domains in the yellow and magenta SαSβ/D dimer model in the direction of head-to-head opposite to the first dimer, i.e. head-2 to head-2 along the tilt head direction. The binding free energies (shown in electronic supplementary material, tables S1 and S2) in the head-1-to-head-1 or head-2-to-head-2 interfaces between two SαSβ/D dimer models for SαSβ1/T and SαSβ2/T tetramer models are, respectively, −17.61 and −32.05 kcal mol−1; meanwhile, the average binding free energies of −54.47 and −53.66 kcal mol−1 in the shoulder-to-shoulder interfaces of SαSβ/D dimer (in figure 6a) in the tetramer SαSβ1/T and SαSβ2/T models, respectively, are similar to the binding free energy of −57.66 kcal mol−1 in the same interface of the SαSβ/D model, which suggests that the affinity of shoulder-to-shoulder interface in the tetramer models presents the consistent stability compared with the corresponding dimer model, which would favour the formation of tetramer aggregation. The details of others tetramer models built from the SβSβ/D and SβH1/D dimer models are also analysed, and are shown in section S3, figure S1f and S3 of the electronic supplementary material.

Figure 6.

Figure 6.

Average structures of the tetramer models of (a) SαSβ1/T and (b) SαSβ2/T. The shoulder-to-shoulder, head-1-to-head-1 and head-2-to-head-2 interfaces are marked by dash lines.

To investigate the relationship between structural dynamics and interactions, the dynamical fluctuations of every residue of one same RRM1 domain in the RRM12, SαSβ/D and SαSβ1/T models were determined by RMSF values (figure 7a). It can be observed that the RMSF values in the dimer SαSβ/D model and tetramer SαSβ1/T model are smaller than those in the monomer RRM12 model, which is usually considered directly linking to the formations of new hydrogen bonds, hydrophobic and ionic interactions, and further supports the interaction analysis (table 2 and figure 4; electronic supplementary material, table S2). Moreover, due to the interactions for the dimer SαSβ/D model and the tetramer SαSβ1/T models mainly achieved by the parallel β-sheet layers of RRM1 domains and the connected flexible loops, as expected, the residues in β1, β4 and β5 of parallel β-sheet layer of one RRM1 domain have the lower RMSF values. To address further the stability of affinity at the shoulder-to-shoulder interface in the tetramer SαSβ1/T model, the motion correlations of the allosteric communication for the backbone atoms during the first 10 ns simulation time in the aggregation process from the dimer SαSβ/D model to tetramer SαSβ1/T model have been calculated, and the corresponding cross correlation map is shown in figure 7b. In this figure, the range of the residue motion correlations is labelled from low anticorrelation (blue) to high correlations (red). It can be found that the motions of α1 and α2 helices of the RRM1 domains in the head-to-head interface significantly correlate to the motions of the parallel β-sheet layer (β1, β4 and β5) of the same RRM1 domains in the shoulder-to-shoulder interface, which implies the allosteric communication in the RRM1 domains from the α1 and α2 helices in the head-to-head interface to the parallel β-sheet layer in the shoulder-to-shoulder interface.

Figure 7.

Figure 7.

(a) The residues fluctuations of RRM1 domain for the RRM12 (black line), SαSβ/D (red line) and SαSβ1/T (blue line) models. (b) Dynamical cross-correlation maps calculated from the first 10 ns of the SαSβ1/T model for the residues of the SαSβ/D dimer (yellow and magenta in figure 6a). The key subregions are marked by squared in black.

Moreover, the binding free energy contributed from the RRM1 domains for the SαSβ1/T tetramer model is more favourable than that from the RRM2 domains for the SαSβ1/T tetramer model, which is consistent with the calculations of binding free energies for the dimer models, also suggests that the aggregation coming from the RRM1 domains is dominant, and that from the RRM2 domains is the second choice. For this reason, it is found that the residues of Leu109 of β1, Asp165 of β4, Arg171, Asp174, Lys176, Leu177 of β5 on the β-sheet layers in the RRM1 domain might be the main contribution to the oligomer aggregation; accordingly, these residues are absent in the RRM2 domain for the SαSβ/D model. This result is consistent with the experimental fact that the β-sheet layers of the RRM1 domain is an important motif with the highly conserved sequence for RNA binding ability and the pathological of TDP-43 aggregation [57].

3.3.2. The mechanism of the large oligomer aggregations

To explore the aggregation mechanism and to explain the pathogenic aggregation phenomenal in vivo [33], we try to construct the large aggregation models with the periodic structures from the small aggregation models. It is obvious that the SαSβ1/T and SαSβ2/T tetramer models present the similar structures besides of the different combining directions of head-1-to-head-1 or head-2-to-head-2 from two SαSβ/D dimer models. Especially, each SαSβ/D dimer in the two tetramers still shows the similar conformations to the corresponding isolate SαSβ/D dimer. To construct further the large aggregation models, the superimposing-creating pattern of the hexamer model (as called SαSβ/H model) from the SαSβ1/T and SαSβ2/T tetramer models is shown in figure 8 with the scheme of model superimposing by firstly using an SαSβ/D dimer of the SαSβ2/T tetramer model overlapping on the SαSβ1/T model, and then removing the repetitive SαSβ/D dimer with the superimposing technique of the PyMol program (http://www.pymol.org). Based on this superimposing method, the two possible paths for establishing the periodic octamer model from the SαSβ1/T and SαSβ2/T tetramer models are shown in figure 9a, in which an octamer model can be obtained along two different overlapping directions on the SαSβ1/T tetramer model, i.e. by using the left SαSβ/D dimer of the SαSβ2/T tetramer model overlapping on the right SαSβ/D dimer of the SαSβ1/T tetramer model (path 1 in figure 9a) or by using the right SαSβ/D dimer overlapping on the left SαSβ/D dimer (path 2 in figure 9a). The two octamer models constructed from these two paths are shown in figure 9b, and present full equal conformations with the building unit as the same as the SαSβ1/T tetramer model. Consequently, the large aggregation models could be established by using the similar constructing pathways. For example, the dodecamer model constructed by using the same repeatable-superimposing method is shown in figure 9c. Interestingly, the structure of the dodecamer model exhibits the appearance of repeatable, helical and rope-like aggregation pattern which may imply the structural characteristics of the protein aggregation as observed in experiment [30].

Figure 8.

Figure 8.

The hexamer model of SαSβ/H was constructed using the superimposing technique from the SαSβ1/T and SαSβ2/T tetramer models. The scheme of SαSβ/H model superimposing by using firstly an SαSβ/D dimer of the SαSβ2/T tetramer model overlapping on the SαSβ1/T model, then, by removing the SαSβ/D dimer with the superimposing technique of the PyMol program (http://www.pymol.org).

Figure 9.

Figure 9.

(a) The two possible paths for establishing the periodic octamer model from the SαSβ1/T and SαSβ2/T tetramer models were depicted, i.e. by using the left SαSβ/D dimer of the SαSβ2/T tetramer model overlapping on the right SαSβ/D dimer of the SαSβ1/T tetramer model (path 1) or by using the right SαSβ/D dimer of that overlapping on the left SαSβ/D dimer (path 2). (b) Representation of two octamer models constructed from two possible paths. (c) The dodecamer model was constructed using the same repeatable-superimposing method.

Furthermore, the SαSβ/H hexamer model constructed by path 1 was simulated for 50 ns by using the same method described in §2.2, and the equilibrated conformation is shown in electronic supplementary material, figure S4. The small RMSD values analysed from the trajectory of this model support the stable existence possibility of this hexamer model, and are shown in electronic supplementary material, figure S1 g. The average binding free energy for two head-to-head interface in the hexamer SαSβ/H model are −18.2 kcal mol−1. Moreover, the average binding free energy of −54.20 kcal mol−1 for the shoulder-to-shoulder interface of hexamer SαSβ/H model is similar to that of −54.47 kcal mol−1 for that of tetramer SαSβ1/T model, which suggests that the affinity of shoulder-to-shoulder interface in hexamer SαSβ/H model also maintains the consistent stability by the formation of new head-1-to-head-1 and head-2-to-head-2 interfaces. This result indicates that there is the concerted interaction affinity between the shoulder-to-shoulder and head-to-head interfaces. However, since it is the head-to-head interaction that contributes to the growth of the polymerization, it is expected this concerted interaction affinity will reach a constant as the number of the head-to-head interfaces produced by the aggregation process becomes large enough. The details of binding free energies were shown in electronic supplementary material, table S3. These results revealed the stability existence possibility of the hexamer model, that is, the possible aggregation process from tetramer to large oligomer aggregation using the repeatable-superimposing method is feasible, and may produce a repeatable, helical and rope-like structure as a mimicked aggregation mode.

4. Discussion

4.1. Analyses of dimer structures

To investigate the possible aggregation mechanism of small oligomer models about RRM1 and RRM2 domains, the structural characteristics and interactions of the dimer SαSβ/D, SβSβ/D, SβH1/D, SβH2/D and H1H1/D models were further elucidated. As the energy decomposition and interaction analyses discussed above, the dimerization for the dimer SαSβ/D, SβSβ/D, SβH1α1/D models is mainly achieved by the parallel β-sheet layers of two RRM1 domains and the connected flexible loops, but RRM2 domains in these dimer models present the weaker interactions and affinities for the dimerization. For example, the interactions and affinities in the SαSβ/D model occur mainly at Leu109 of β1; Phe147, Phe149 of β3; Glu156, Met162, Ser163 of α2; Asp165 of β4; Arg171, Asp174, Lys176, Leu177 of β5 in RRM1, which are associated with the interaction area of TDP-43 for RNA binding in experimental results [17,18]. Especially, β1, β4 and β5 of two RRM1 domains form the typical interacting surface on the parallel β-sheet layers for RNA binding. The similar results can be obtained in the dimer SβSβ/D and SβH1α1/D models that are shown in electronic supplementary material, table S4. It can be seen that the binding sites located at the parallel β-sheet layers for the dimerization of RRM1 in TDP-43 are almost in the same positions for RNA binding domains [18], which suggest that TDP-43 firstly binding to RNA will prevent or slow down TDP-43 aggregations due to the same occupied binding sites. These results imply that the parallel β-sheet layer of two RRM1 domains plays an important role in the protein dimerization, and RNA binding can protect the pathogenic aggregation of TDP-43, which are consistent with experimental results [33,58,59]. In addition, the parallel β-sheet layers as the feature of amyloid-like fibril structure is universally accepted [60], so the TDP-43 protein might form the large aggregation structure on basis of the interaction formations of the parallel β-sheet layer of two RRM1 domains.

Moreover, the distances between two β-sheet layers in two RRM1 domains for the SαSβ/D, SβSβ/D and SβH1α1/D models are 17.9, 17.7 and 21.4 Å (figure 10a), and the two β-sheet layers are parallel to each other, which might favour to form the parallel β-sheet structure as the character of amyloid-like fibril structure. Otherwise, the distances for the SβH2/D and H1H1/D models are 25.5 and 23.2 Å (figure 10a), respectively, and the two β-sheet layers of two RRM1 domains are not parallel in the two models (figure 3d and e), which indicates that the parallel β-sheet structure cannot be formed due to the corresponding long distances of the two β-sheet layers and their structural arrangement. That is, the SαSβ/D, SβSβ/D and SβH1α1/D models might possibly form the large oligomer aggregation. Moreover, the secondary structures were changed during the dimerization from the monomer RRM12 model to some dimer models. For example, it is found through comparing the average structures of the RRM1 and RRM2 domains in the monomer RRM12 with the dimer SβSβ/D model (see figures 2b and 3b) that the loose β-strand link between α2 and β5 in monomer forms the stable β4-strand (from His166 to Ile168) and the β5 length of RRM1 domain is increased from three residues (from Asp174 to Lys176) in monomer to five residues (from Arg171 to Cys176) in the SβSβ/D model (green in figure 3b). However, the length of α2 helix with seven residues (from Asp237 to Leu243) in RRM2 domain of monomer RRM12 decreases to that with four residues (from Ala240 to Leu243); the α-helix linking loop between the RRM1 and RRM2 domains in monomer RRM12 model changes to the loop linking conformation in the same dimer SβSβ/D model.

Figure 10.

Figure 10.

(a) Distances of time-dependences between two β-sheet layers in two RRM1 domains for the dimer SαSβ/D (black), SβSβ/D (red), SβH1/D (blue), SβH2/D (magenta), H1H1/D (green) models and (b) those for the SαSβ/D dimer (yellow and magenta in figure 6a) in the tetramer SαSβ1/T (black) and SαSβ2/T (red) models; (c) the distances of two α1 in both RRM1 domains, and those of two α2 in both RRM1 domains for the dimer SαSβ/D model (orange) and the tetramer SαSβ1/T model (light blue).

4.2. Analyses of the oligomer aggregation

To elucidate further the possible mechanism of large aggregation through addressing the formation progress from small oligomer to large oligomer, the oligomer models about RRM1 and RRM2 domains were built on the basis of the periodic aggregation mode from the dimer models. The amyloid aggregation possesses a cross-β structure, in which β-strands orient perpendicularly to the aggregation axis (figure 11), and are assembled into β-sheet layers that run the length of the aggregation, as detected by experiment results [37,38,60]. In the tetramer models of SαSβ1/T and SαSβ2/T, the interactions in the shoulder-to-shoulder interface mainly occurred at Leu109 of β1; Phe147, Phe149 of β3; Glu156, Met162, Ser163 of α2; Asp165 of β4; Arg171, Asp174, Lys176, Leu177 of β5 in RRM1, which are almost similar to those in the SαSβ/D dimer model. The details of the interactions in shoulder-to-shoulder interface for SαSβ1/T, SαSβ2/T models are shown in electronic supplementary material, table S5. As expected, the residues in β1, β4 and β5 of the β-sheet layer of RRM1 domain in the tetramer SαSβ1/T model have the lower RMSF values compared with those in the SαSβ/D model. Further, the distances between the two β-sheet layers of two RRM1 domains for the SαSβ/D dimer (yellow and magenta in figure 6a) in the SαSβ1/T and SαSβ2/T models are, respectively, 16.7 and 17.4 Å (figure 10b), which are smaller than that of 17.9 Å in the SαSβ/D model discussed above. The approaching of the parallel β-sheet layers induces energetically to the consistent interaction affinity in the shoulder-to-shoulder interface for the tetramer model as discussed in §3.3.1. As discussed in the correlation analyses in figure 7, α1 and α2 helices of the RRM1 domains in the head-to-head interface correlates to the parallel β-sheet layers (including β1, β4 and β5) of the same RRM1 domains in the shoulder-to-shoulder interface with large cross-correlate coefficients, which represents the allosteric communication from α1 and α2 to the parallel β-sheet layers (i.e. β1, β4 and β5). Such communications relate to the structural changes around α1, α2 and the parallel β-sheet layers. Namely, the tetramer aggregation caused the movement of α1 and α2 in the head-to-head interface of the SαSβ/D dimer, i.e. the distances of two α1 in both RRM1 domains and two α2 in both RRM1 domains of the SαSβ/D dimer (yellow and magenta in figure 6a) decrease from 36.3 to 35.9 Å and from 24.3 to 23.3 Å, respectively, as shown in figure 10c. Because α1 is connected to β1, and α2 is connected to β4 and β5, the movements of α1 and α2 relate definitely to the movements of the parallel β-sheet layers, as expected in the correlations of α1 and α2 to the parallel β-sheet layers. The approaching of the parallel β-sheet layers indicates that the parallel β-sheet layers of RRM1 of TDP-43 in the dimer model are also retained in the corresponding tetramer model, and were perpendicular to the aggregation axis, which may favour the formation of large aggregation.

Figure 11.

Figure 11.

The aggregation axis (black line) existed in the large oligomer model. The inset represents that the parallel β-sheet layer orients perpendicularly to the aggregation axis.

Furthermore, the large aggregation models were constructed by using the same repeatable-superimposing method described in §3.3.2, in which the structural character with the repeatable, helical and rope-like as a mimicked aggregation mode has been shown by the oligomer models. The results in the tetramer models might also meet the large aggregation models. That is, the parallel β-sheet layers still exist in the large aggregation, such as hexamer, octamer, dodecamer and so on, and orient perpendicularly to the aggregation axis; and the interactions between the parallel β-sheet layers favour the formation of the large aggregation (figure 11).

Summarily, the studied oligomer aggregation process presents two types of combination of outer edges of the two domains of TDP-43 protein, i.e. one is the shoulder-to-shoulder type which occurs at the interface between the two β-sheet layers or between one β-sheet layer and the α-helix linking loop in the different shoulder sides; the other is the head-to-head type which occurs at the interface between the two α1-α2 helix planes or between the two β-sheet layer tails in the different head sides. Consequently, the conformational changes mainly involve the tertiary structures, but the secondary structures were changed slightly in the dimer, tetramer and hexamer models. For example, the mass centre distances of 25.8, 25.8 and 25.4 Å between the RRM1 and RRM2 domains in the dimer SαSβ/D, tetramer SαSβ1/T and hexamer SαSβ/H models, respectively, are longer than that of 23.9 Å in the monomer RRM12 model, which expanded the expected oligomer aggregation edges and induced the looser conformation of oligomer in the shoulder-to-shoulder interface, favouring the aggregation process.

5. Conclusion

We performed MD simulations, binding free energy calculations and interaction analyses of the monomers, dimers, tetramers and larger oligomers of the RRM1 and RRM2 domains in TDP-43 to reveal their aggregation mechanism and aggregation process from small oligomer to large aggregation. Among the five dimer models of TDP-43, the SαSβ/D model was found the most stable. The dimerization for these models is mainly achieved by the interactions between the parallel β-sheet layers of two RRM1 domains. Based on the analysis of the structures, energies and the motion correlations for two tetramer models built by the most stable SαSβ/D dimer model, it is found that the changes of positions of the α1 and α2 helixes in the head-to-head interface of the RRM1 domains in each tetramer model compared with that in the dimer model induce the approaching of the parallel β-sheet layers in the shoulder-to-shoulder interface of the same RRM1 domains, which induces energetically the consistent interaction affinity in the shoulder-to-shoulder interface. This character is still valid for the large oligomer aggregations. Using the repeatable-superimposing method based on the tetramer models, we proposed a new aggregation mechanism of RRM domains in TDP-43, which characterizes the formation of the large aggregation models with the repeated, helical and rope-like structures. This RRM domains-mediated structural characteristic may drive the fibril-like aggregation. Our results imply that the RRM domains could play an important role in TDP-43 aggregation process. It is also found that in all the models the RRM domains aggregate employing the same binding sites by which TDP-43 binds to RNA, which could explain the experimental result that the RNA binding to RNA will prevent the aggregation of TDP-43. The mimicked aggregation mode is helpful for understanding the amyloid-like aggregation mechanism of TDP-43, and providing theoretical support for ALS and FTLD disease prevention.

Supplementary Material

Acknowledgements

This work was supported by grants from the National Natural Science Foundation of China (nos. 22073010 and 21573020); Natural Science Foundation of Hebei province, P.R. China (grant no. B2021109004); Colleges and Universities in Hebei Province Science and Technology Research Youth Talent Support Program (grant no. BJ2019204).

Contributor Information

Chaoqun Li, Email: lichaoqun@hdc.edu.cn.

Guangju Chen, Email: gjchen@bnu.edu.cn.

Data accessibility

Our data are deposited at Dryad Digital Repository: https://doi.org/10.5061/dryad.w3r2280q0. The data are provided in electronic supplementary material [61].

Authors' contributions

W.L. designed the experiments, performed the experiments and wrote the paper; W.L., J.S. and Y.W. analysed the data; C.L. and G.C. designed the experiments and reviewed the paper.

Competing interests

We declare we have no competing interests.

Funding

We received no funding for this study.

References

  • 1.Moreno F, et al. 2015A novel mutation P112H in the TARDBP gene associated with frontotemporal lobar degeneration without motor neuron disease and abundant neuritic amyloid plaques. Acta Neuropathol. Commun. 3, 19. ( 10.1186/s40478-015-0190-6) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Josephs KA, et al. 2014TDP-43 is a key player in the clinical features associated with Alzheimer's disease. Acta Neuropathol. 127, 811-824. ( 10.1007/s00401-014-1269-z) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ross CA, Poirier MA. 2004Protein aggregation and neurodegenerative disease. Nat. Med. 10, S10-S17. ( 10.1038/nm1066) [DOI] [PubMed] [Google Scholar]
  • 4.Shiina Y, Arima K, Tabunoki H, Satoh J. 2010TDP-43 dimerizes in human cells in culture. Cell. Mol. Neurobiol. 30, 641-652. ( 10.1007/s10571-009-9489-9) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Davis SA, Gan KA, Dowell JA, Cairns NJ, Gitcho MA. 2017TDP-43 expression influences amyloidβ plaque deposition and tau aggregation. Neurobiol. Dis. 103, 154-162. ( 10.1016/j.nbd.2017.04.012) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Roychaudhuri R, Yang M, Hoshi MM, Teplow DB. 2009Amyloid beta-protein assembly and Alzheimer disease. J. Biol. Chem. 284, 4749-4753. ( 10.1074/jbc.R800036200) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Sun Y, Arslan PE, Won A, Yip CM, Chakrabartty A. 2014Binding of TDP-43 to the 3'UTR of its cognate mRNA enhances its solubility. Biochemistry 53, 5885-5894. ( 10.1021/bi500617x) [DOI] [PubMed] [Google Scholar]
  • 8.Buratti E, Baralle FE. 2012TDP-43: gumming up neurons through protein-protein and protein-RNA interactions. Trends Biochem. Sci. 37, 237-247. ( 10.1016/j.tibs.2012.03.003) [DOI] [PubMed] [Google Scholar]
  • 9.Arnold ES, et al. 2013ALS-linked TDP-43 mutations produce aberrant RNA splicing and adult-onset motor neuron disease without aggregation or loss of nuclear TDP-43. Proc. Natl Acad. Sci. USA 110, E736-E745. ( 10.1073/pnas.1222809110) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Afroz T, et al. 2017Functional and dynamic polymerization of the ALS-linked protein TDP-43 antagonizes its pathologic aggregation. Nat. Commun. 8, 45. ( 10.1038/s41467-017-00062-0) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Seyfried NT, Gozal YM, Dammer EB, Xia Q, Duong DM, Cheng D, Lah JJ, Levey AI, Peng J. 2010Multiplex SILAC analysis of a cellular TDP-43 proteinopathy model reveals protein inclusions associated with SUMOylation and diverse polyubiquitin chains. Mol. Cell. Proteomics 9, 705-718. ( 10.1074/mcp.M800390-MCP200) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.He R-Y, Huang Y-C, Chiang C-W, Tsai Y-J, Ye T-J, Gao H-D, Wu C-Y, Lee H-M, Huang JJ-T. 2015Characterization and real-time imaging of the FTLD-related protein aggregation induced by amyloidogenic peptides. Chem Commun. (Camb) 51, 8652-8655. ( 10.1039/c5cc00513b) [DOI] [PubMed] [Google Scholar]
  • 13.Ou SH, Wu F, Harrich D, García-Martínez LF, Gaynor RB. 1995Cloning and characterization of a novel cellular protein, TDP-43, that binds to human immunodeficiency virus type 1 TAR DNA sequence motifs. J. Virol. 69, 3584-3596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Ayala YM, Zago P, D'Ambrogio A, Xu Y-F, Petrucelli L, Buratti E, Baralle FE. 2008Structural determinants of the cellular localization and shuttling of TDP-43. J. Cell Sci. 121, 3778-3785. ( 10.1242/jcs.038950) [DOI] [PubMed] [Google Scholar]
  • 15.Mompeán M, Romano V, Pantoja-Uceda D, Stuani C, Baralle FE, Buratti E, Laurents DV. 2017Point mutations in the N-terminal domain of transactive response DNA-binding protein 43 kDa (TDP-43) compromise its stability, dimerization, and functions. J. Biol. Chem. 292, 11 992-12 006. ( 10.1074/jbc.M117.775965) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Wang I-F, Chang H-Y, Hou S-C, Liou G-G, Way T-D, James Shen C-K. 2012The self-interaction of native TDP-43 C terminus inhibits its degradation and contributes to early proteinopathies. Nat. Commun. 3, 766. ( 10.1038/ncomms1766) [DOI] [PubMed] [Google Scholar]
  • 17.Kuo P-H, Doudeva LG, Wang Y-T, Shen C-KJ, Yuan HS. 2009Structural insights into TDP-43 in nucleic-acid binding and domain interactions. Nucleic Acids Res. 37, 1799-1808. ( 10.1093/nar/gkp013) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Lukavsky PJ, Daujotyte D, Tollervey JR, Ule J, Stuani C, Buratti E, Baralle FE, Damberger FF, Allain FH-T. 2013Molecular basis of UG-rich RNA recognition by the human splicing factor TDP-43. Nat. Struct. Mol. Biol. 20, 1443-1449. ( 10.1038/nsmb.2698) [DOI] [PubMed] [Google Scholar]
  • 19.Johnson BS, McCaffery JM, Lindquist S, Gitler AD. 2008A yeast TDP-43 proteinopathy model: exploring the molecular determinants of TDP-43 aggregation and cellular toxicity. Proc. Natl Acad. Sci. USA 105, 6439-6444. ( 10.1073/pnas.0802082105) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhang Y-J, et al. 2013The dual functions of the extreme N-terminus of TDP-43 in regulating its biological activity and inclusion formation. Hum. Mol. Genet. 22, 3112-3122. ( 10.1093/hmg/ddt166) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Lim L, Wei Y, Lu Y, Song J. 2016ALS-causing mutations significantly perturb the self-assembly and interaction with nucleic acid of the intrinsically disordered prion-like domain of TDP-43. PLoS Biol. 14, e1002338. ( 10.1371/journal.pbio.1002338) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Zhang Y-J, et al. 2009Aberrant cleavage of TDP-43 enhances aggregation and cellular toxicity. Proc. Natl Acad. Sci. USA 106, 7607-7612. ( 10.1073/pnas.0900688106) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Jiang L-L, Zhao J, Yin X-F, He W-T, Yang H, Che M-X, Hu H-Y. 2016Two mutations G335D and Q343R within the amyloidogenic core region of TDP-43 influence its aggregation and inclusion formation. Sci. Rep. 6, 23928. ( 10.1038/srep23928) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Qin H, Lim L-Z, Wei Y, Song J. 2014TDP-43 N terminus encodes a novel ubiquitin-like fold and its unfolded form in equilibrium that can be shifted by binding to ssDNA. Proc. Natl Acad. Sci. USA 111, 18 619-18 624. ( 10.1073/pnas.1413994112) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Tsoi PS, Choi K-J, Leonard PG, Sizovs A, Moosa MM, MacKenzie KR, Ferreon JC, Ferreon ACM. 2017The N-terminal domain of ALS-linked TDP-43 assembles without misfolding. Angew. Chem. 129, 12 764-12 767. ( 10.1002/ange.201706769) [DOI] [PubMed] [Google Scholar]
  • 26.Mompeán M, et al. 2015Structural evidence of amyloid fibril formation in the putative aggregation domain of TDP-43. J. Phys. Chem. Lett. 6, 2608-2615. ( 10.1021/acs.jpclett.5b00918) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Conicella AE, Zerze GH, Mittal J, Fawzi NL. 2016ALS mutations disrupt phase separation mediated by α-helical structure in the TDP-43 low-complexity C-terminal domain. Structure 24, 1537-1549. ( 10.1016/j.str.2016.07.007) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Wang A, et al. 2018A single N-terminal phosphomimic disrupts TDP-43 polymerization, phase separation, and RNA splicing. EMBO J. 37, e97452. ( 10.15252/embj.201797452) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ihara R, et al. 2013RNA binding mediates neurotoxicity in the transgenic Drosophila model of TDP-43 proteinopathy. Hum. Mol. Genet. 22, 4474-4484. ( 10.1093/hmg/ddt296) [DOI] [PubMed] [Google Scholar]
  • 30.Garnier C, et al. 2017Zinc binding to RNA recognition motif of TDP-43 induces the formation of amyloid-like aggregates. Sci. Rep. 7, 6812. ( 10.1038/s41598-017-07215-7) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Chou C-C, et al. 2018TDP-43 pathology disrupts nuclear pore complexes and nucleocytoplasmic transport in ALS/FTD. Nat. Neurosci. 21, 228-239. ( 10.1038/s41593-017-0047-3) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Cohen TJ, Hwang AW, Restrepo CR, Yuan C-X, Trojanowski JQ, Lee VM. 2015An acetylation switch controls TDP-43 function and aggregation propensity. Nat. Commun. 6, 5845. ( 10.1038/ncomms6845) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Zacco E, Martin SR, Thorogate R, Pastore A. 2018The RNA-recognition motifs of TAR DNA-binding protein 43 may play a role in the aberrant self-assembly of the protein. Front. Mol. Neurosci. 11, 372. ( 10.3389/fnmol.2018.00372) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Zacco E, Graña-Montes R, Martin SR, Groot NS de, Alfano C, Tartaglia GG, Pastore A. 2019RNA as a key factor in driving or preventing self-assembly of the TAR DNA-binding protein 43. J. Mol. Biol. 431, 1671-1688. ( 10.1016/j.jmb.2019.01.028) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Chen H-J, et al. 2019RRM adjacent TARDBP mutations disrupt RNA binding and enhance TDP-43 proteinopathy. Brain 142, 3753-3770. ( 10.1093/brain/awz313) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Dang M, Kang J, Lim L, Li Y, Wang L, Song J. 2019ATP is a cryptic binder of TDP-43 RRM domains to enhance stability and inhibit ALS/AD-associated fibrillation. Biochem. Biophys. Res. Commun. 522, 247-253. ( 10.1016/j.bbrc.2019.11.088) [DOI] [PubMed] [Google Scholar]
  • 37.Sambashivan S, Liu Y, Sawaya MR, Gingery M, Eisenberg D. 2005Amyloid-like fibrils of ribonuclease A with three-dimensional domain-swapped and native-like structure. Nature 437, 266-269. ( 10.1038/nature03916) [DOI] [PubMed] [Google Scholar]
  • 38.Liu Y, Gotte G, Libonati M, Eisenberg D. 2001A domain-swapped RNase A dimer with implications for amyloid formation. Nat. Struct. Biol. 8, 211-214. ( 10.1038/84941) [DOI] [PubMed] [Google Scholar]
  • 39.Bhandare VV, Ramaswamy A. 2018The proteinopathy of D169G and K263E mutants at the RNA recognition motif (RRM) domain of tar DNA-binding protein (tdp43) causing neurological disorders: a computational study. J. Biomol. Struct. Dyn. 36, 1075-1093. ( 10.1080/07391102.2017.1310670) [DOI] [PubMed] [Google Scholar]
  • 40.Bozzo F, Salvatori I, Iacovelli F, Mirra A, Rossi S, Cozzolino M, Falconi M, Valle C, Carrì MT. 2016Structural insights into the multi-determinant aggregation of TDP-43 in motor neuron-like cells. Neurobiol. Dis. 94, 63-72. ( 10.1016/j.nbd.2016.06.006) [DOI] [PubMed] [Google Scholar]
  • 41.Prakash A, Kumar V, Meena NK, Hassan MI, Lynn AM. 2019Comparative analysis of thermal unfolding simulations of RNA recognition motifs (RRMs) of TAR DNA-binding protein 43 (TDP-43). J. Biomol. Struct. Dyn. 37, 178-194. ( 10.1080/07391102.2017.1422026) [DOI] [PubMed] [Google Scholar]
  • 42.Prakash A, Kumar V, Meena NK, Lynn AM. 2018Elucidation of the structural stability and dynamics of heterogeneous intermediate ensembles in unfolding pathway of the N-terminal domain of TDP-43. RSC Adv. 8, 19 835-19 845. ( 10.1039/C8RA03368D) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Mompeán M, Buratti E, Guarnaccia C, Brito RMM, Chakrabartty A, Baralle FE, Laurents DV. 2014Structural characterization of the minimal segment of TDP-43 competent for aggregation. Arch. Biochem. Biophys. 545, 53-62. ( 10.1016/j.abb.2014.01.007) [DOI] [PubMed] [Google Scholar]
  • 44.Sun Y, Kakinen A, Xing Y, Pilkington EH, Davis TP, Ke PC, Ding F. 2019Nucleation of β-rich oligomers and β-barrels in the early aggregation of human islet amyloid polypeptide. Biochim. Biophys. Acta Mol. Basis Dis. 1865, 434-444. ( 10.1016/j.bbadis.2018.11.021) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Sun Y, Chakrabartty A. 2017Phase to phase with TDP-43. Biochemistry 56, 809-823. ( 10.1021/acs.biochem.6b01088) [DOI] [PubMed] [Google Scholar]
  • 46.Pierce BG, Wiehe K, Hwang H, Kim B-H, Vreven T, Weng Z. 2014ZDOCK server: interactive docking prediction of protein-protein complexes and symmetric multimers. Bioinformatics 30, 1771-1773. ( 10.1093/bioinformatics/btu097) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Liu J, He X, Zhang JZH. 2013Improving the scoring of protein-ligand binding affinity by including the effects of structural water and electronic polarization. J. Chem. Inf. Model. 53, 1306-1314. ( 10.1021/ci400067c) [DOI] [PubMed] [Google Scholar]
  • 48.Villa A, Wöhnert J, Stock G. 2009Molecular dynamics simulation study of the binding of purine bases to the aptamer domain of the guanine sensing riboswitch. Nucleic Acids Res. 37, 4774-4786. ( 10.1093/nar/gkp486) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Li C, Zhu Y, Wang Y, Chen G. 2014Molecular dynamics simulation on the conformational transition of the Mad2 protein from the open to the closed state. Int. J. Mol. Sci. 15, 5553-5569. ( 10.3390/ijms15045553) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Li C, Zhao X, Liu W, Yin F, Hu J, Zhang G, Chen G. 2020DNA structural distortions induced by a monofunctional trinuclear platinum complex with various cross-links using molecular dynamics simulation. J. Chem. Inf. Model. 60, 1700-1708. ( 10.1021/acs.jcim.0c00002) [DOI] [PubMed] [Google Scholar]
  • 51.Meng H, Li C, Wang Y, Chen G. 2014Molecular dynamics simulation of the allosteric regulation of eIF4A protein from the open to closed state, induced by ATP and RNA substrates. PLoS ONE 9, e86104. ( 10.1371/journal.pone.0086104) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Chronopoulou EG, Vlachakis D, Papageorgiou AC, Ataya FS, Labrou NE. 2019Structure-based design and application of an engineered glutathione transferase for the development of an optical biosensor for pesticides determination. Biochim. Biophys. Acta (BBA) – General Subjects 1863, 565-576. ( 10.1016/j.bbagen.2018.12.004) [DOI] [PubMed] [Google Scholar]
  • 53.Healy EF, Little C, King PJ. 2014A model for small heat shock protein inhibition of polyglutamine aggregation. Cell Biochem. Biophys. 69, 275-281. ( 10.1007/s12013-013-9795-1) [DOI] [PubMed] [Google Scholar]
  • 54.Tarus B, Straub JE, Thirumalai D. 2005Probing the initial stage of aggregation of the Aβ10-35-protein: assessing the propensity for peptide dimerization. J. Mol. Biol. 345, 1141-1156. ( 10.1016/j.jmb.2004.11.022) [DOI] [PubMed] [Google Scholar]
  • 55.Xue T, Song C, Wang Q, Wang Y, Chen G. 2016Investigations of the CLOCK and BMAL1 proteins binding to DNA: a molecular dynamics simulation study. PLoS ONE 11, e0155105. ( 10.1371/journal.pone.0155105) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Song C, Wang Q, Xue T, Wang Y, Chen G. 2016Molecular dynamics simulations on the conformational transitions from the GA98 (GA88) to GB98 (GB88) proteins. J. Mol. Recognit. 29, 580-595. ( 10.1002/jmr.2558) [DOI] [PubMed] [Google Scholar]
  • 57.Huang Y-C, Lin K-F, He R-Y, Tu P-H, Koubek J, Hsu Y-C, Huang JJ-T. 2013Inhibition of TDP-43 aggregation by nucleic acid binding. PLoS ONE 8, e64002. ( 10.1371/journal.pone.0064002) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Shodai A, et al. 2013Aberrant assembly of RNA recognition motif 1 links to pathogenic conversion of TAR DNA-binding protein of 43 kDa (TDP-43). J. Biol. Chem. 288, 14 886-14 905. ( 10.1074/jbc.M113.451849) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Takagi S, et al. 2013RNP2 of RNA recognition motif 1 plays a central role in the aberrant modification of TDP-43. PLoS ONE 8, e66966. ( 10.1371/journal.pone.0066966). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Chiti F, Dobson CM. 2017Protein misfolding, amyloid formation, and human disease: a summary of progress over the last decade. Annu. Rev. Biochem. 86, 27-68. ( 10.1146/annurev-biochem-061516-045115) [DOI] [PubMed] [Google Scholar]
  • 61.Liu W, Li C, Shan J, Wang Y, Chen G. 2021Insights into the aggregation mechanism of RNA recognition motif domains in TDP-43: a theoretical exploration. Figshare. [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. Liu W, Li C, Shan J, Wang Y, Chen G. 2021Insights into the aggregation mechanism of RNA recognition motif domains in TDP-43: a theoretical exploration. Figshare. [DOI] [PMC free article] [PubMed]

Supplementary Materials

Data Availability Statement

Our data are deposited at Dryad Digital Repository: https://doi.org/10.5061/dryad.w3r2280q0. The data are provided in electronic supplementary material [61].


Articles from Royal Society Open Science are provided here courtesy of The Royal Society

RESOURCES