Abstract
A common study design for comparing the performances of diagnostic imaging tests is to obtain ratings from multiple readers of multiple cases whose true statuses are known. Typically, there is overlap between the tests, readers, and/or cases, and special analytical methods are needed to perform statistical comparisons. We present our new MATLAB MRMCaov toolbox, which is designed for multi-reader multi-case comparisons of two or more diagnostic tests. The toolbox allows for statistical comparison of reader performance metrics, such as area under the receiver operating characteristic curve (ROC AUC), with analysis of variance methods originally proposed by Obuchowski and Rockette (1995) and later unified and improved by Hillis and colleagues (2005, 2007, 2008, 2018). MRMCaov is open-source software with an integrated command-line interface for performing multi-reader multi-case statistical analysis, plotting, and presenting results. Its features include (1) ROC AUC, likelihood ratios of positive or negative ratings, sensitivity, specificity, and expected utility reader performance metrics; (2) reader-specific ROC curves; (3) user-definable performance metrics; (4) test-specific estimates of mean performance along with confidence intervals and p-values for statistical comparisons; (5) support for factorial, nested, or partially paired study designs; (6) inference for random or fixed readers and cases; (7) DeLong, jackknife, or unbiased covariance estimation; and (8) compatibility with Microsoft Windows, Mac OS, and Linux.
Keywords: multi-reader multi-case, ANOVA, ROC analysis, diagnostic radiology, software
1. INTRODUCTION
A common study design for comparing the diagnostic performance of imaging modalities, or diagnostic tests, is to obtain modality-specific ratings from multiple readers of multiple cases (MRMC) whose true statuses are known. In such a design, receiver operating characteristic (ROC) metrics, such as area under the ROC curve (ROC AUC), can be used to quantify correspondence between reader ratings and case status. Metrics can then be compared statistically to determine if there are differences between modalities. Special statistical methods are needed when readers or cases represent a random sample from a larger population of interest and there is overlap in readers and/or cases across modalities. An ANOVA model designed for the characteristics of MRMC studies was initially proposed by Dorfman et al.1 and Obuchowski and Rockette2 and later unified and improved by Hillis and colleagues.3–6 Their models are implemented in the MRMCaov MATLAB toolbox.7
MRMCaov performs multi-reader multi-case analysis of variance for the comparison of reader performance across imaging modalities. This software is the first MATLAB implementation of the Hillis unified methodology and builds upon his OR-DBM MRMC SAS software.8 It is designed to be user-friendly, integrate with the MATLAB programming and graphics environment, and offer new features and methodologies. Current features of the toolbox are summarized below. Usage of the software is illustrated with a medical imaging example in the subsequent sections.
MRMCaov provides both graphical and tabular analysis results, including reader-specific ROC curves and AUC estimates, modality-specific estimates, confidence intervals, and p-values for statistical comparisons. The toolbox includes a new method for unbiased covariance estimation as well as other features not collectively available in any other existing MATLAB toolbox.
MRMCaov MATLAB toolbox features.
Empirical ROC curves.
Reader-specific ROC curves and performance metrics.
Performance metric functions to compute area under the ROC curve, expected utility, sensitivity for a specified specificity, and specificity for a specified sensitivity.
User-definable performance metrics.
Modality-specific estimates of mean performance along with confidence intervals and p-values for statistical comparisons.
Comparison of two or more modalities.
Support for factorial, nested, and partially paired study designs.
Inference for random readers and cases, random readers and fixed cases, or fixed readers and random cases.
DeLong, jackknife, or unbiased covariance estimation.
Compatibility with Microsoft Windows, Mac OS, and Linux.
1.1. Installation
The MATLAB toolbox is currently available for download from https://github.com/brian-j-smith/MRMCaov.m. Installation instructions are provided at the download site. Once installed, the toolbox may be used in the MATLAB desktop environment.9
1.2. Data
Input data for MRMCaov analysis should be given as MATLAB vectors for reader, test, and case identifiers as well as true event statuses and reader ratings. A table of example data, named VanDyke, is provided with the toolbox and displayed in example 1.1. The data come from a study in which the relative performance of cinematic presentation of MRI (1 = CINE MRI) was compared to single spin-echo magnetic resonance imaging (2 = SE MRI) for the detection of thoracic aortic dissection.10 Forty-five patients with aortic dissection and 69 without dissection were imaged with both modalities. Based on the images, five radiologists rated patient disease status as 1 = definitely no aortic dissection, 2 = probably no aortic dissection, 3 = unsure about aortic dissection, 4 = probably aortic dissection, or 5 = definitely aortic dissection. Interest lies in estimating ROC curves for each combination of reader and modality and in comparing modalities with respect to summary statistics from the curves.
Descriptions of the table variables are as follows.
reader: unique identifiers for the five radiologists.
treatment: identifiers for the imaging modality (1 = CINE MRI, 2 = SE MRI).
case: identifiers for the 114 cases.
truth: indicator for thoracic aortic dissection (1 = dissection, 0 = no dissection).
rating: five-point ratings given to case images by the readers.
case2: example identifiers representing nesting of cases within readers.
case3: example identifiers representing nesting of cases within treatments.
MATLAB Example 1.1: First 10 observations in VanDyke table.
>> load VanDyke.mat
>> head(VanDyke, 10)

    reader    treatment    case    truth    rating    case2    case3
    ______    _________    ____    _____    ______    _____    _____
      1           1          1       0        1        1.1      1.1
      1           2          1       0        3        1.1      2.1
      2           1          1       0        2        2.1      1.1
      2           2          1       0        3        2.1      2.1
      3           1          1       0        2        3.1      1.1
      3           2          1       0        2        3.1      2.1
      4           1          1       0        1        4.1      1.1
      4           2          1       0        2        4.1      2.1
      5           1          1       0        3        5.1      1.1
      5           2          1       0        2        5.1      2.1
2. READER PERFORMANCE METRICS
In an MRMCaov analysis, true case statuses, reader ratings, and a reader performance metric quantifying correspondence between them are specified with objects based on the PerformanceVariate class. Several PerformanceVariate classes are provided and include area under an ROC curve (ROCAUCVariate), expected utility of an ROC curve (ROCEUVariate), specificity (SpecificityVariate), sensitivity (SensitivityVariate), and likelihood ratio of a positive (ROCLRposVariate) or a negative (ROCLRnegVariate) rating. In addition, users may create their own PerformanceVariate classes to define other performance metrics to analyze. Functions that create PerformanceVariate objects take column vectors of true case statuses (truth) and reader ratings (rating) as the first two arguments. Performance metrics measure the degree to which higher case ratings are associated with positive case statuses, where positive status is taken to be the highest level of values in the truth vector.
PerformanceVariate class.
Syntax
obj = PerformanceVariate(truth, rating, metric)
Description
Returns a PerformanceVariate class object that contains true case statuses and reader ratings with which to compute a reader performance metric for multi-reader, multi-case analysis. This function is available for users who wish to define their own performance metrics.
Input Arguments
truth: column array of true binary statuses.
rating: column array of numeric ratings.
metric: handle to a function defined with arguments truth and rating to compute a performance metric on them.
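Conceptually, the metric argument is just a function of truth and rating that returns a scalar. The idea of a user-defined metric can be sketched language-agnostically (shown here in Python rather than MATLAB; youden_j and its threshold argument are hypothetical names, not part of the toolbox):

```python
def youden_j(truth, rating, threshold=0.5):
    """Hypothetical custom metric: Youden's J = sensitivity + specificity - 1
    computed at a fixed rating threshold."""
    pos = [r for t, r in zip(truth, rating) if t == 1]
    neg = [r for t, r in zip(truth, rating) if t == 0]
    sens = sum(r >= threshold for r in pos) / len(pos)
    spec = sum(r < threshold for r in neg) / len(neg)
    return sens + spec - 1

# Example: two positives rated above threshold, one of two negatives below
youden_j([0, 0, 1, 1], [0.1, 0.6, 0.7, 0.9])  # 1.0 + 0.5 - 1 = 0.5
```

Any scalar-valued function with this (truth, rating) signature could, in principle, serve as the performance metric passed to PerformanceVariate.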
2.1. Area Under the ROC Curve
Area under the ROC curve (ROC AUC) is a measure of concordance between numerical reader ratings and true case statuses. It provides an estimate of the probability that a randomly selected positive case will have a higher rating than a negative case. ROC AUC values range from 0 to 1, with 0.5 representing chance-level concordance and 1 perfect concordance. ROC curves in the ROCAUCVariate class are estimated empirically as described by Pepe.11 ROCAUCVariate also has name-value arguments (options) to allow for calculation of partial area under the curve over a range of sensitivities or specificities.
ROCAUCVariate class.
Syntax
obj = ROCAUCVariate(truth, rating, options)
Description
Returns an ROCAUCVariate class object that inherits from PerformanceVariate and defines area under the ROC curve as the reader performance metric for MRMC analysis.
Input Arguments
truth, rating: see PerformanceVariate.
Name-Value Arguments
partial: character vector specifying whether to compute area under the entire curve (''), over a range of sensitivities ('sensitivity'), or over a range of specificities ('specificity') [default: ''].
min: numeric value from 0 to 1 for the minimum sensitivity or specificity if computing partial area [default: 0].
max: numeric value from 0 to 1 for the maximum sensitivity or specificity if computing partial area [default: 1].
normalize: logical indicating whether to divide partial area by max - min [default: false].
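For intuition, the empirical AUC is equivalent to the Mann-Whitney concordance statistic: the proportion of positive-negative case pairs in which the positive case receives the higher rating, with ties counted as one half. A minimal sketch (Python for illustration only; this is not the toolbox's implementation):

```python
def empirical_auc(truth, rating):
    """Empirical ROC AUC: probability that a randomly chosen positive case
    outrates a randomly chosen negative case, counting ties as 1/2."""
    pos = [r for t, r in zip(truth, rating) if t == 1]
    neg = [r for t, r in zip(truth, rating) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Example: one tied pair among the four positive-negative pairs
empirical_auc([1, 1, 0, 0], [3, 2, 2, 1])  # (1 + 1 + 0.5 + 1) / 4 = 0.875
```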
2.2. Expected Utility of the ROC Curve
As an alternative to AUC as a summary of ROC curves, Abbey et al.12 propose an expected utility metric defined as

EU = max over the ROC curve of [ TPR(FPR) − β × FPR ],

where TPR(FPR) are true positive rates on the ROC curve and FPR are false positive rates ranging from 0 to 1. This expected utility can be viewed as a generalization of Youden's J statistic,13 which Perkins and Schisterman14 describe as having an optimal weighting factor of β = (1 − p)/(r × p), where p is the population prevalence of positive cases and r is the cost associated with a false negative classification relative to a false positive one.
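Under this max-over-operating-points form, the metric can be sketched over an empirical ROC curve as follows (Python for illustration only; sweeping thresholds over the observed rating values is an assumption about the empirical curve, not toolbox code):

```python
def expected_utility(truth, rating, slope=1.0):
    """Expected utility: maximum of TPR - slope * FPR over the empirical
    ROC operating points (a generalized Youden index)."""
    pos = [r for t, r in zip(truth, rating) if t == 1]
    neg = [r for t, r in zip(truth, rating) if t == 0]
    best = 0.0  # the (FPR, TPR) = (0, 0) corner of the ROC curve
    for c in sorted(set(rating)):
        tpr = sum(r >= c for r in pos) / len(pos)
        fpr = sum(r >= c for r in neg) / len(neg)
        best = max(best, tpr - slope * fpr)
    return best

# Perfectly separated ratings give the maximal utility of 1
expected_utility([0, 0, 1, 1], [1, 2, 3, 4])  # 1.0 at the threshold c = 3
```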
ROCEUVariate class.
Syntax
obj = ROCEUVariate(truth, rating, slope)
Description
Returns an ROCEUVariate class object that inherits from PerformanceVariate and defines expected utility of an ROC curve12 as the reader performance metric for MRMC analysis.
Input Arguments
truth, rating: see PerformanceVariate.
slope: numeric slope (β) value at which to compute expected utility [default: 1].
2.3. Sensitivity and Specificity
Sensitivity is the probability of a positive rating (T+) for a positive case (D+), and specificity the probability of a negative rating (T−) for a negative case (D−); i.e.,

Se = Pr(T+ | D+),    Sp = Pr(T− | D−).

These metrics are estimated from the subset of cases that are either positive or negative. For instance, sensitivity is calculated from data as the proportion of positive cases that have positive ratings. As such, sensitivity/specificity will not be estimable for reader-test combinations that do not have any positive/negative cases.
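With binary (0/1) ratings, these estimators are simple proportions. A sketch (Python for illustration; the error guards mirror the estimability caveat above):

```python
def sensitivity(truth, rating):
    """Proportion of positive cases (truth == 1) with positive ratings."""
    pos = [r for t, r in zip(truth, rating) if t == 1]
    if not pos:
        raise ValueError("sensitivity is not estimable without positive cases")
    return sum(pos) / len(pos)

def specificity(truth, rating):
    """Proportion of negative cases (truth == 0) with negative ratings."""
    neg = [r for t, r in zip(truth, rating) if t == 0]
    if not neg:
        raise ValueError("specificity is not estimable without negative cases")
    return sum(1 - r for r in neg) / len(neg)

# Example: 2 of 3 positives rated positive; 1 of 2 negatives rated negative
sensitivity([1, 1, 1, 0, 0], [1, 1, 0, 0, 1])  # 2/3
specificity([1, 1, 1, 0, 0], [1, 1, 0, 0, 1])  # 1/2
```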
SensitivityVariate and SpecificityVariate classes.
Syntax
obj = SensitivityVariate(truth, rating)
obj = SpecificityVariate(truth, rating)
Description
Return a SensitivityVariate or SpecificityVariate class object that inherits from PerformanceVariate and defines sensitivity or specificity as the reader performance metric for MRMC analysis.
Input Arguments
truth: see PerformanceVariate.
rating: column array of binary ratings.
2.4. Likelihood Ratios of Positive and Negative Ratings
The likelihood ratio of a positive/negative rating (LR+/LR−) is defined as the probability of a positive/negative rating (T+/T−) for a positive case (D+) relative to a negative case (D−). Mathematically, the ratios are

LR+ = Pr(T+ | D+) / Pr(T+ | D−) = Se / (1 − Sp),
LR− = Pr(T− | D+) / Pr(T− | D−) = (1 − Se) / Sp.

Estimability of these metrics requires both positive and negative cases as well as tests that are positive (LR+) or negative (LR−).
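Given sensitivity and specificity estimates, the ratios follow directly (an illustrative Python sketch; the infinite values flag the non-estimable situations noted above):

```python
def likelihood_ratios(sens, spec):
    """LR+ = Se / (1 - Sp) and LR- = (1 - Se) / Sp.
    A zero denominator (no positive or no negative test results among the
    relevant cases) makes the corresponding ratio non-estimable."""
    lr_pos = sens / (1 - spec) if spec < 1 else float("inf")
    lr_neg = (1 - sens) / spec if spec > 0 else float("inf")
    return lr_pos, lr_neg

# Example: Se = 0.9, Sp = 0.8 gives LR+ = 0.9/0.2 and LR- = 0.1/0.8
likelihood_ratios(0.9, 0.8)  # (4.5, 0.125)
```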
ROCLRposVariate and ROCLRnegVariate classes.
Syntax
obj = ROCLRposVariate(truth, rating)
obj = ROCLRnegVariate(truth, rating)
Description
Return an ROCLRposVariate or ROCLRnegVariate class object that inherits from PerformanceVariate and defines the likelihood ratio of a positive (LR+) or negative (LR−) rating as the reader performance metric for MRMC analysis. LR+ and LR− are computed as sensitivity / (1 − specificity) and (1 − sensitivity) / specificity, respectively.
Input Arguments
truth: see PerformanceVariate.
rating: column array of binary ratings.
3. MRMC ANALYSIS
3.1. Analysis Specification
The first step in conducting a multi-reader, multi-case analysis with the toolbox is a call to the function mrmc to specify the data inputs, performance metric of interest, and covariance estimation method. A summary of the function is given below.
Multi-reader multi-case analysis function.
Syntax
fit = mrmc(y, test, reader, id, options)
Description
Returns an MRMCFit class object containing data that can be used to estimate and compare reader performance metrics in a multi-reader, multi-case statistical analysis.
Input Arguments
y: PerformanceVariate object defining true case statuses, corresponding reader ratings, and a reader performance metric to compute on them.
test, reader, id: column arrays of grouping variables that identify the test modality, reader, and case for the observations in y. Each variable can be a categorical, numeric, character, or string array and must have the same number of observations as y.
By default, all grouping variables are treated as random effects in the analysis. One, but not both, of the reader or case variables may be designated as fixed by wrapping it as FixedVariate(reader) or FixedVariate(id).
Name-Value Argument
cov: character vector specifying the method of estimating within-reader rating covariances as 'DeLong', 'jackknife', or 'unbiased' [default: 'jackknife'].
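The 'jackknife' method estimates the needed covariances from delete-one-case replicates of the performance metric. A simplified generic sketch (Python for illustration; jackknife_cov and its arguments are hypothetical names, not the toolbox API):

```python
def jackknife_cov(metric, truth, rating_a, rating_b):
    """Jackknife covariance between a metric computed on two tests' ratings
    of the same cases: delete one case at a time, recompute the metric for
    each test, and scale the centered cross-products by (n - 1) / n."""
    n = len(truth)
    drop = lambda xs, k: xs[:k] + xs[k + 1:]
    a = [metric(drop(truth, k), drop(rating_a, k)) for k in range(n)]
    b = [metric(drop(truth, k), drop(rating_b, k)) for k in range(n)]
    ma, mb = sum(a) / n, sum(b) / n
    return (n - 1) / n * sum((x - ma) * (y - mb) for x, y in zip(a, b))
```

With rating_a equal to rating_b, this reduces to the jackknife variance of the metric; this works for any scalar metric, which is why the jackknife option is compatible with every performance metric.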
Study design is automatically inferred from the case identifiers. If the identifiers are the same across readers and tests, a factorial design is assumed. Cases are assumed to be nested within readers if their identifiers differ across readers, and to be nested within tests if their identifiers differ across tests. Examples of the latter two designs are given with the case2 and case3 variables in the VanDyke table. The case variable is coded for the full factorial design originally employed in the study. Methods for calculating covariances include 'DeLong',15 'jackknife',16 and 'unbiased'.6 The methods that can be used in an analysis depend on the specified performance metric: jackknife will work with any metric, whereas DeLong and unbiased can be used only with empirically estimated AUC. The mrmc function automatically checks for and informs users of an incompatible combination of covariance method and performance metric.
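One way such design inference could work is sketched below (a hypothetical Python helper, not the toolbox's implementation): a design is treated as nested within readers when every case identifier appears under exactly one reader, nested within tests when every identifier appears under exactly one test, and factorial otherwise.

```python
def infer_design(case, reader, test):
    """Classify the study design from case/reader/test identifier overlap."""
    readers_per_case, tests_per_case = {}, {}
    for c, r, t in zip(case, reader, test):
        readers_per_case.setdefault(c, set()).add(r)
        tests_per_case.setdefault(c, set()).add(t)
    if all(len(s) == 1 for s in readers_per_case.values()):
        return "cases nested within readers"
    if all(len(s) == 1 for s in tests_per_case.values()):
        return "cases nested within tests"
    return "factorial"

# Mirrors the VanDyke case2 coding: each reader rates its own cases
infer_design(["1.1", "1.1", "2.1", "2.1"], [1, 1, 2, 2], [1, 2, 1, 2])
# -> "cases nested within readers"
```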
In example 3.1, mrmc is called to specify an analysis in which the VanDyke imaging modalities are compared with respect to area under the empirically estimated ROC curve, with unbiased covariance estimation. Fit information is returned as an MRMCFit class object containing the fields summarized in example 3.2. Results of the function call are saved to a new variable fit. As can be seen from the displayed results, the function calculates and returns several quantities needed for statistical comparison of the imaging modalities, including areas under the ROC curves, analysis of variance sums of squares, and estimated error variance and covariances. Covariances are provided for metrics from different tests used by the same reader (Cov1), different readers using the same test (Cov2), and different readers using different tests (Cov3). Plots of the mrmc fit are generated in example 3.3 and show the ROC curves for each reader and test.
MATLAB Example 3.1: mrmc function call.
>> y = ROCAUCVariate(VanDyke.truth, VanDyke.rating);
>> fit = mrmc(y, VanDyke.treatment, VanDyke.reader, VanDyke.case, 'cov', 'unbiased');
>> disp(fit)

ROCAUCVariate ANOVA data:

    reader    test       y        N
    ______    ____    _______    ___
      1        1      0.91965    114
      1        2      0.94783    114
      2        1      0.85878    114
      2        2      0.90531    114
      3        1      0.90386    114
      3        2      0.92174    114
      4        1      0.97311    114
      4        2      0.99936    114
      5        1      0.82979    114
      5        2      0.92995    114

ANOVA Table:

    Source         d.f.     Sum Sq.      Mean Sq.
    reader          4      0.015345     0.0038
    test            1      0.0047962    0.0048
    reader*test     4      0.0022041    5.5103e-04

Obuchowski-Rockette error variance and covariance estimates:

             Estimate      Correlation
    Error    0.00078839        NaN
    Cov1     0.00034167      0.43338
    Cov2     0.00033906      0.43007
    Cov3     0.00023561      0.29885
MATLAB Example 3.2: MRMCFit fields.
>> fields(fit)

7×1 cell array
    {'y'       }
    {'factors' }
    {'design'  }
    {'data'    }
    {'anova'   }
    {'cov'     }
    {'testfits'}
Descriptions
y: PerformanceVariate class object defining the response variable for the analysis.
factors: table of the test, reader, and case grouping levels for each rating.
design: structure containing information about the study design.
data: table of performance metrics, reader levels, and test levels for the ANOVA model.
anova: structure of results from the ANOVA.
cov: matrix of covariances between metrics by reader and test.
testfits: array of MRMCTestFit class objects containing results from test-specific ANOVAs.
MATLAB Example 3.3: Plot of mrmc ROC AUC fit.
>> plot(fit)
By default, readers and cases are assumed to be random factors in the MRMC analysis. As illustrated in example 3.4, either one of the factors may be designated as fixed in calls to mrmc with the syntax FixedVariate(<variable name>), where <variable name> is the name of the corresponding reader or case variable.
MATLAB Example 3.4: Designation of fixed readers or cases.
% Fixed readers
mrmc(y, VanDyke.treatment, FixedVariate(VanDyke.reader), VanDyke.case, ...
     'cov', 'unbiased');

% Fixed cases
mrmc(y, VanDyke.treatment, VanDyke.reader, FixedVariate(VanDyke.case), ...
     'cov', 'unbiased');
3.2. Statistical Analysis Summary
Statistical comparisons of treatment modalities are obtained by calling the summary function with output returned by mrmc. The summary call produces ANOVA results for a global test of equality of ROC AUC means across all treatment modalities and tests of pairwise differences, along with confidence intervals for the differences and intervals for the individual modalities. Study design information and analysis results are available individually in the fields of the MRMCSummary class object returned by summary (see example 3.6).
MRMC statistical analysis summary function.
Syntax
res = summary(obj, options)
Description
Returns an MRMCSummary class object of statistical results from a multi-reader, multi-case analysis.
Input Arguments
obj: MRMCFit object from mrmc().
Name-Value Arguments
alpha: numeric value between 0 and 1 specifying the significance level α; confidence intervals are constructed at the 100 × (1 − α)% level [default: 0.05].
In the summary given in example 3.5 for the present analysis, mean ROC AUC does not differ significantly between the two imaging modalities at the 5% level (p = 0.0512). Estimated means are 0.90 (95% CI: 0.83, 0.97) for CINE MRI and 0.94 (95% CI: 0.89, 0.99) for SE MRI.
MATLAB Example 3.5: MRMC statistical analysis summary (factorial design).
>> res = summary(fit);
>> disp(res)

Multi-Reader Multi-Case Analysis of Variance

Experimental design: factorial
Factor types: random readers and random cases
Response: ROCAUCVariate
Covariance method: unbiased
Confidence interval level: 95%

Obuchowski-Rockette variance component and covariance estimates:

                    Estimate      Correlation
    reader         0.0015365          NaN
    reader*test    0.00020776         NaN
    Error          0.00078839         NaN
    Cov1           0.00034167       0.43338
    Cov2           0.00033906       0.43007
    Cov3           0.00023561       0.29885

ANOVA global statistical test of equal tests:

      MS(T)       MS(T:R)       Cov2          Cov3       Denominator      F       df1     df2      p-value
    0.0047962    0.00055103    0.00033906    0.00023561    0.0010683    4.4896     1     15.034    0.051162

Pairwise test differences:

    Comparison    Estimate     StdErr       df               CI                 t        p-value
     "1 - 2"      -0.0438     0.020672    15.034    (-0.087852, 0.0002513)    -2.1189    0.051162

Test means based only on the data for each one:

    Test    Estimate      MS(R)        Cov2        StdErr       df            CI
     1      0.89704     0.0030826    0.00047718    0.033071    12.588    (0.82535, 0.96872)
     2      0.94084     0.0013046    0.00020095    0.021491    12.534    (0.89423, 0.98744)
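Under the Hillis procedure, the global F statistic reported in example 3.5 can be reproduced by hand from the Obuchowski-Rockette quantities in its tables. A numerical check (Python for illustration only, not toolbox code):

```python
# Quantities reported by summary() for the factorial VanDyke analysis
ms_t, ms_tr = 0.0047962, 0.00055103  # MS(T) and MS(T:R)
cov2, cov3 = 0.00033906, 0.00023561  # covariance estimates
t, r = 2, 5                          # numbers of tests and readers

# Denominator of the F statistic: MS(T:R) + r * max(Cov2 - Cov3, 0)
denom = ms_tr + r * max(cov2 - cov3, 0.0)

# Global test of equal tests: F = MS(T) / denominator, with df1 = t - 1
F = ms_t / denom

# Hillis denominator degrees of freedom:
# df2 = denom^2 / (MS(T:R)^2 / ((t - 1) * (r - 1)))
df2 = denom ** 2 / (ms_tr ** 2 / ((t - 1) * (r - 1)))

print(denom, F, df2)  # approx. 0.0010683, 4.4896, 15.034
```

These values match the Denominator, F, and df2 columns of the global test table above.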
MATLAB Example 3.6: MRMCSummary fields.
>> fields(res)

9×1 cell array
    {'response'         }
    {'design'           }
    {'alpha'            }
    {'vcov'             }
    {'test_equality'    }
    {'test_means'       }
    {'test_diffs'       }
    {'reader_test_diffs'}
    {'reader_means'     }
Descriptions
response: class of the response variable used in the analysis.
design: structure containing information about the study design.
alpha: significance level used to construct confidence intervals.
vcov: table of variances for main effects and covariances between metrics for the following combinations: 1) same reader/different test, 2) different reader/same test, and 3) different reader/different test.
test_equality: table of the global test of equal tests.
test_means: table of test means.
test_diffs: table of pairwise test differences.
reader_test_diffs: table of reader-specific pairwise test differences.
reader_means: table of reader-specific test means.
Finally, examples 3.7 and 3.8 illustrate summary statistical results for hypothetical study designs in which cases are nested within readers and cases are nested within tests, respectively.
MATLAB Example 3.7: MRMC statistical analysis summary (cases nested within readers).
>> fit2 = mrmc(y, VanDyke.treatment, VanDyke.reader, VanDyke.case2, 'cov', 'unbiased');
>> summary(fit2)

Multi-Reader Multi-Case Analysis of Variance

Experimental design: cases nested within readers
Factor types: random readers and random cases
Response: ROCAUCVariate
Covariance method: unbiased
Confidence interval level: 95%

Obuchowski-Rockette variance component and covariance estimates:

                    Estimate      Correlation
    reader         0.0015743          NaN
    reader*test    0.00046223         NaN
    Error          0.00015708         NaN
    Cov1           6.828e-05        0.43468
    Cov2           0                0
    Cov3           0                0

ANOVA global statistical test of equal tests:

      MS(T)       MS(T:R)      Cov2    Cov3    Denominator      F      df1    df2    p-value
    0.0047962    0.00055103     0       0      0.00055103     8.704     1      4     0.041959

Pairwise test differences:

    Comparison    Estimate     StdErr     df              CI                 t        p-value
     "1 - 2"      -0.0438     0.014846     4     (-0.08502, -0.0025804)    -2.9503    0.041959

Test means based only on the data for each one:

    Test    Estimate      MS(R)       Cov2    StdErr      df           CI
     1      0.89704     0.0030826     NaN     0.02483      4     (0.8281, 0.96598)
     2      0.94084     0.0013046     NaN     0.016153     4     (0.89599, 0.98569)
MATLAB Example 3.8: MRMC statistical analysis summary (cases nested within tests).
>> fit3 = mrmc(y, VanDyke.treatment, VanDyke.reader, VanDyke.case3, 'cov', 'unbiased');
>> summary(fit3)

Multi-Reader Multi-Case Analysis of Variance

Experimental design: cases nested within tests
Factor types: random readers and random cases
Response: ROCAUCVariate
Covariance method: unbiased
Confidence interval level: 95%

Obuchowski-Rockette variance component and covariance estimates:

                    Estimate      Correlation
    reader         0.0016426          NaN
    reader*test    0.00032719         NaN
    Error          0.00039326         NaN
    Cov1           0                0
    Cov2           0.00016942       0.4308
    Cov3           0                0

ANOVA global statistical test of equal tests:

      MS(T)       MS(T:R)       Cov2       Cov3    Denominator      F       df1     df2      p-value
    0.0047962    0.00055103    0.00016942    0      0.0013981     3.4305     1     25.751    0.075502

Pairwise test differences:

    Comparison    Estimate     StdErr       df               CI                t        p-value
     "1 - 2"      -0.0438     0.023648    25.751    (-0.092433, 0.0048325)   -1.8521    0.075502

Test means based only on the data for each one:

    Test    Estimate      MS(R)        Cov2        StdErr       df            CI
     1      0.89704     0.0030826    0.0002385     0.029241    7.6934    (0.82914, 0.96494)
     2      0.94084     0.0013046    0.00010033    0.019007    7.6677    (0.89668, 0.985)
4. CONCLUSIONS
MRMCaov brings a new statistical toolbox for MRMC analysis to the MATLAB software environment and its large community of users. The toolbox enables comparison of imaging modalities with an interactive interface and flexible options for performance metrics, study designs, covariance estimation methods, and statistical estimation and testing of performance. A demonstration of features currently implemented in the toolbox is provided in this paper. Proper statistical methods and readily available software are crucial for the evaluation and comparison of multi-reader multi-case studies of imaging modalities. The MRMCaov software is designed to help ensure the application of such methods.
ACKNOWLEDGMENTS
This research was supported by the National Institutes of Health grant R01 EB025174.
REFERENCES
- [1].Dorfman DD, Berbaum KS, and Metz CE, “Receiver operating characteristic rating analysis. Generalization to the population of readers and patients with the jackknife method.,” Investigative Radiology 27, 723–731 (1992). [PubMed] [Google Scholar]
- [2].Obuchowski NA and Rockette HE, “Hypothesis testing of diagnostic accuracy for multiple readers and multiple tests: an ANOVA approach with dependent observations,” Communications in Statistics–Simulation and Computation 24, 285–308 (1995). [Google Scholar]
- [3].Hillis SL, Obuchowski NA, Schartz KM, and Berbaum KS, “A comparison of the DorfmanBerbaum-Metz and Obuchowski-Rockette methods for receiver operating characteristic (ROC) data,” Statistics in Medicine 24, 1579–1607 (2005). [DOI] [PubMed] [Google Scholar]
- [4].Hillis SL, “A comparison of denominator degrees of freedom methods for multiple observer ROC analysis,” Statistics in Medicine 26, 596–619 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Hillis SL, Berbaum KS, and Metz CE, “Recent developments in the Dorfman-Berbaum-Metz procedure for multireader ROC study analysis,” Academic Radiology 15, 647–661 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Hillis SL, “Relationship between Roe and Metz simulation model for multireader diagnostic data and Obuchowski-Rockette model parameters,” Statistics in Medicine 37, 2067–2093 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Smith BJ, Hillis SL, and Pesce LL, MRMCaov: Multi-Reader Multi-Case Analysis of Variance (2022). MATLAB toolbox version 0.2.
- [8].Schartz KM, Hillis SL, Pesce LL, Berbaum KS, and Metz CE, OR-DBM MRMC (2019). version 2.52.
- [9].MATLAB, [Version 9.11.0 (R2021b)], The MathWorks Inc., Natick, Massachusetts: (2021). [Google Scholar]
- [10].VanDyke CW, White RD, Obuchowski NA, Geisinger MA, Lorig RJ, and Meziane MA, “Cine MRI in the diagnosis of thoracic aortic dissection,” 79th Radiological Society of North America Meetings (1993). [Google Scholar]
- [11].Pepe MS, [The Statistical Evaluation of Medical Tests for Classification and Prediction], Oxford University Press, New York: (2003). [Google Scholar]
- [12].Abbey CK, Samuelson FW, and Gallas BD, “Statistical power considerations for a utility endpoint in observer performance studies,” Academic Radiology 207, 798–806 (2013). [DOI] [PubMed] [Google Scholar]
- [13].Youden WJ, “Index for rating diagnostic tests,” Cancer 3(1), 32–35 (1950). [DOI] [PubMed] [Google Scholar]
- [14].Perkins NJ and Schisterman EF, “The inconsistency of ‘optimal’ cutpoints obtained using two criteria based on the receiver operating characteristic curve,” American Journal of Epidemiology 163(7), 670–675 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].DeLong ER, DeLong DM, and Clarke-Pearson DL, “Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach,” Biometrics 44, 837–845 (1988). [PubMed] [Google Scholar]
- [16].Efron B, [The Jackknife, the bootstrap and other resampling plans], SIAM, Philadelphia: (1982). [Google Scholar]