The clinical datasets used in this study consisted of a total of 154 stacks used for testing brain localization and extraction. Motion level has been qualitatively graded “Minor” (little,1, or none,0), “Moderate” (2,3) or “Severe” (4,5,excluded when necessary). A subset of 87 stacks that led to successful baseline reconstruction were further used to evaluate and compare the complete reconstruction pipeline. Selection criteria for a stack to be used in the reconstruction included: homogeneous in-plane and slice resolution in all stacks; stacks with thinner slices in cases where different slice thicknesses where available; fetuses with at least one stack per slice-select acquisition direction; and having no spin history or significant motion-induced intensity distortion artifact that could corrupt the reconstruction. The goal in this paper was not to challenge motion estimation or super-resolution reconstruction but to evaluate the performance of the proposed brain localization and extraction methods and their impact on final reconstruction.