Figure - PMC

Skip to main content

View full-text article in PMC

. Author manuscript; available in PMC: 2022 Jul 14.

Published in final edited form as: Nat Comput Sci. 2022 Apr 28;2(4):265–275. doi: 10.1038/s43588-022-00232-1

Fig. 1 | — The flowchart is illustrated with a three-domain protein from the iron-dependent regulator of Mycobacterium tuberculosis (pDB ID 1fx7A). Starting from the query sequence, domain boundaries are first predicted by FUpred¹⁴ and ThreaDom¹⁵, and models of each domain are generated by D-I-TASSEr¹⁶. Meanwhile, inter-domain distances are predicted with a deep convolutional neural-network predictor DomainDist. Second, each of the domain models is independently fit to the density map by quasi-Newton searching. Third, the initial full-length models are optimized by a two-step rigid-body rEMC simulation to minimize the DCS) between the density map and full-length model (equation (1)). Fourth, the lowest DCS model selected from the rigid-body assembly simulations is refined by flexible assembly with atom-, segment- and domain-level refinements using rEMC simulation guided by the DCS, inter-domain distance profiles and a knowledge-based force field, with the resulting decoy conformations clustered by SpICKEr⁵³ to obtain a centroid model. Finally, the flexible assembly simulation is performed again for the full-atomic model with constraints from centroid models adding to the energy, and the final model is created from the lowest-energy model after side-chain repacking with FASpr⁵⁴ and FG-MD¹⁸.