2020 Jul 14;267(Suppl 1):185–196. doi: 10.1007/s00415-020-10062-8

Table 2.

Overview of results

A
Data set	Noise	Scale	Otsu’s	Cutoff 6	Cutoff 8	Cutoff 10
D2	Bl	1	99.4%	95.0%	98.0%	98.7%
		2	97.6%	93.7%	95.5%	96.7%
		3	93.8%	92.0%	93.4%	94.6%
		4	90.3%	90.3%	91.2%	92.7%
		6	85.8%	90.0%	90.8%	91.5%
	Sc	10	98.5%	88.4%	93.6%	97.9%
		20	88.7%	87.9%	93.0%	96.9%
		30	81.2%	87.7%	92.2%	93.5%
		40	76.7%	87.4%	90.5%	87.6%
		50	74.3%	86.5%	88.0%	82.3%
		60	73.2%	85.6%	86.3%	80.5%

B
Data set	Validation	M	Cutoff 6	Cutoff 8	Cutoff 10
D3	DS	Gold standard	97.0% ± 0.7 range 95.6–97.9	96.6% ± 0.8 range 95.0–97.7	95.9% ± 0.9 range 93.8–97.2
	V_E	16.7 mm³ ± 5.5 range 8.8–30.7	11.5 mm³ ± 5.7 range 5.0–25.5	17.1 mm³ ± 7.4 range 8.4–33.6	23.3 mm³ ± 8.7 range 13.0–41.0
	V_E/M	1	0.7 ± 0.2 range 0.4–0.9	1 ± 0.2 range 0.7–1.5	1.4 ± 0.3 range 1.0–2.1
	V_T	276.2 mm³ ± 37.6 range 223.6–347.6

As an artificial data set, D2 provided a known ground truth for testing the VOLT cutoff versions and comparing them with Otsu’s method (O). Part A shows an overview of the Dice scores (DS) of each segmentation method (Otsu’s, cutoff 6, cutoff 8, cutoff 10) under noise imitating real-world MRI, which was added stepwise as either increasing blurriness noise (Bl, Gaussian blur kernel, SD range 1–6 voxels in x/y/z-direction; SD = standard deviation) or increasing scatter noise (Sc, SD range of intensity variation: 0–50 SD). For a visualization of the added noise and the results, see Fig. 1a. D3 included real-world data sets from consecutive patients of the interdisciplinary German Center for Vertigo and Balance Disorders, Munich, Germany. Part B shows the mean results of each segmentation method: manual segmentation (M), which was considered the gold standard, and VOLT with three different cutoffs (6, 8, and 10). Segmentation accuracy was evaluated using the Sørensen-Dice overlap coefficient, and segmentation precision was estimated by comparing the volume of the ELS (V_E). The ratio V_E/M shows the deviation of each cutoff from the gold standard, i.e., the manual segmentation. The V_E ranges include all grades of endolymphatic hydrops.

± standard deviation, Bl blurriness, DS Dice score, M manual segmentation (gold standard), Sc scatter, V_E volume of the endolymphatic space, V_T volume of the total fluid space
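The two measures reported in the table, the Sørensen-Dice overlap coefficient and the ratio V_E/M, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `dice_score` and `volume_ratio` and the toy masks are hypothetical, and masks are assumed to be flattened binary voxel lists of equal length. The ratio is computed here from the mean volumes in Part B, whereas the table presumably averages per-patient ratios, so agreement is only expected after rounding.

```python
def dice_score(mask_a, mask_b):
    """Sørensen-Dice overlap of two binary masks (equal-length sequences of 0/1)."""
    if len(mask_a) != len(mask_b):
        raise ValueError("masks must contain the same number of voxels")
    overlap = sum(1 for a, b in zip(mask_a, mask_b) if a and b)
    size_a = sum(1 for a in mask_a if a)
    size_b = sum(1 for b in mask_b if b)
    if size_a + size_b == 0:
        # Assumption: two empty masks count as perfect agreement.
        return 1.0
    return 2.0 * overlap / (size_a + size_b)


def volume_ratio(v_method, v_manual):
    """Ratio of a method's ELS volume to the manual (gold standard) volume."""
    return v_method / v_manual


# Hypothetical toy masks (flattened voxel grids), not data from the study.
manual = [0, 1, 1, 1, 0, 0, 1, 0]
automatic = [0, 1, 1, 0, 0, 1, 1, 0]
toy_dice = dice_score(manual, automatic)

# Mean V_E values (mm³) taken from Part B of the table.
mean_v_e = {"M": 16.7, "Cutoff 6": 11.5, "Cutoff 8": 17.1, "Cutoff 10": 23.3}
ratios = {
    name: round(volume_ratio(volume, mean_v_e["M"]), 1)
    for name, volume in mean_v_e.items()
}

print(f"toy Dice score: {toy_dice:.2f}")
for name, ratio in ratios.items():
    print(f"V_E/M for {name}: {ratio}")
```

A ratio below 1 means that a cutoff yields a smaller ELS volume than the manual segmentation on average, and a ratio above 1 a larger one. Rounded to one decimal place, the ratios of the mean volumes agree with the V_E/M row of Part B.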