Reliable Gene Mutation Prediction in Clear Cell Renal Cell Carcinoma through Multi-classifier Multi-objective Radiogenomics Model

Xi Chen; Zhiguo Zhou; Raquibul Hannan; Kimberly Thomas; Ivan Pedrosa; Payal Kapur; James Brugarolas; Xuanqin Mou; Jing Wang

doi:10.1088/1361-6560/aae5cd

. Author manuscript; available in PMC: 2019 Oct 24.

Published in final edited form as: Phys Med Biol. 2018 Oct 24;63(21):215008. doi: 10.1088/1361-6560/aae5cd

Reliable Gene Mutation Prediction in Clear Cell Renal Cell Carcinoma through Multi-classifier Multi-objective Radiogenomics Model

Xi Chen ¹, Zhiguo Zhou ², Raquibul Hannan ², Kimberly Thomas ³, Ivan Pedrosa ⁴, Payal Kapur ^5,⁶, James Brugarolas ^6,⁷, Xuanqin Mou ¹, Jing Wang ²

PMCID: PMC6240911 NIHMSID: NIHMS1511079 PMID: 30277889

Abstract

Genetic studies have identified associations between gene mutations and clear cell renal cell carcinoma (ccRCC). Since the complete gene mutational landscape cannot be characterized through biopsy and sequencing assays for each patient, non-invasive tools are needed to determine the mutation status for tumors. Radiogenomics may be an attractive alternative tool to identify disease genomics by analyzing amounts of features extracted from medical images. Most current radiogenomics predictive models are built based on a single classifier and trained through a single objective. However, since many classifiers are available, selecting an optimal model is challenging. On the other hand, a single objective may not be a good measure to guide model training. We proposed a new multi-classifier multi-objective (MCMO) radiogenomics predictive model. To obtain more reliable prediction results, similarity-based sensitivity and specificity were defined and considered as the two objective functions simultaneously during training. To take advantage of different classifiers, the evidential reasoning (ER) approach was used for fusing the output of each classifier. Additionally, a new similarity-based multi-objective optimization algorithm (SMO) was developed for training the MCMO to predict ccRCC related gene mutations (VHL, PBRM1 and BAP1) using quantitative CT features. Using the proposed MCMO model, we achieved a predictive area under the receiver operating characteristic curve (AUC) over 0.85 for VHL, PBRM1 and BAP1 genes with balanced sensitivity and specificity. Furthermore, MCMO outperformed all the individual classifiers, and yielded more reliable results than other optimization algorithms and commonly used fusion strategies.

1. Introduction

Kidney cancer, most predominantly, renal cell carcinoma (RCC) remains one of the most common renal malignancies with 63,990 new cases expected to be diagnosed and with 14,400 deaths in the United States in 2017 (Siegel et al, 2017). Clear cell RCC (ccRCC) is the most abundant (~75%) subtype of RCC and the most likely to metastasize outside the kidney (Motzer et al, 2002). Most cases of ccRCC present with somatic (or germline) inactivating mutations in the von Hippel–Lindau tumor suppressor (VHL) gene, which are generally absent in other cancers (Gnarra et al, 1994; Varela et al, 2011; Guo et al, 2012; Cancer Genome Atlas Research Network, 2013). Several other mutations in genes involved in regulating chromatin states, including those in the BRCA1-associated protein 1 (BAP1), polybromo 1 (PBRM1), SET domain containing 2 (SETD2), and lysine (K)-specific demethylase 5C (KDM5C), were recently identified (Dalgliesh et al, 2010; Varela et al, 2011; Duns et al, 2010; Peña-Llopis et al, 2012). Mutations in BAP1 and SETD2 were found to be associated with advanced stage and poor outcome (Cancer Genome Atlas Research Network, 2013; Hakimi et al, 2013; Kapur et al, 2013). The genes mutated within a tumor can be used as biomarkers and may help with prognosis, treatment selection, and treatment response prediction. However, inter- and intra-tumoral heterogeneity in gene mutations has previously been described in ccRCC (Gerlinger et al, 2012, 2014; McGranahan and Swanton, 2015). As ccRCC metastasizes, additional gene mutations accumulate. Because the complete gene mutational landscape is hard to be characterized for each patient through biopsy and sequencing assays, a non-invasive tool would be useful to identify the mutations within the tumor.

Radiogenomics (Rutman and Kuo, 2009; Jaffe, 2012; Kuo and Jamshidi, 2014; Karlo et al, 2014; Shinagare et al, 2015; Sala et al, 2017), an integrated approach that combines radiology and genomics, is based on extracting and analyzing amounts of data from medical images and clinical information by high-throughput computing. Therefore, radiogenomics is a promising solution for predicting gene mutation in ccRCC. Contrast-enhanced computed tomography (CT) is commonly used to diagnose and characterize renal masses, monitor growth in pathologically-proven RCC undergoing active surveillance, assess RCC location and extent, and determine stage and treatment response (Stewartmerrill et al, 2015; Motzer et al, 2017). Furthermore, the diagnostic standard of reference has expanded to the genomic level and has led to the attempt to use imaging as a noninvasive determinant of mutational status (Reznek, 2004; Powles and Albers, 2012; Carles et al, 2012; Kuo and Yamamoto, 2011). Therefore, a CT based radiogeneomics predictive model would be helpful.

In recent years, researchers have investigated predictable and systematic associations between imaging features and underlying molecular and genomic alterations in different cancers. Yamamoto et al (2012) carried out a radiogenomic analysis of breast cancer with MRI, a novel approach that may help reveal the underlying molecular biology of breast cancers. Gevaert et al (2017) used CT image features to predict the mutation status of EGFR in non small cell lung cancer (NSCLC). Aerts et al (2014) revealed that a prognostic radiomic feature set, capturing intra-tumor heterogeneity, is associated with underlying gene-expression patterns. One study reported associations between CT features of 58 ccRCCs and the underlying karyotype (Sauk et al, 2011). Others studied the association between CT imaging features and mutational status of ccRCC (Karlo et al, 2014; Shinagare et al, 2015). For example, the BAP1 mutation was associated with ill-defined tumor margins and calcification (Shinagare et al, 2015).

By quantitatively analyzing large amounts of information from medical images, radiogenomics holds great potential to predict gene mutation. However, several challenges need to be addressed to build an optimal predictive model. First, a single classifier is typically used to build a radiogenomics predictive model. Aerts et al (2014) used the Cox proportional hazards regression model to predict survival in patients with lung and head-and-neck cancer. Other researchers tested different types of classifiers and chose one or two “preferred” ones for specific applications. Valdes et al (2016) evaluated three different classifiers, including decision trees, random forests, and RUSBoost, to predict radiation pneumonitis in patients with stage I NSCLC treated with stereotactic body radiation therapy (SBRT). Higher accuracy was achieved when the RUSBoost algorithm was used with regularization. These findings indicate how difficult it is to select a “preferred” classifier for a specific application. Instead of trying to find the most suitable classifier for a particular application, a model that combines multiple classifiers can fully use information from different classifiers to improve accuracy in radiogenomics. Second, most current radiogenomics models adopt a single objective function (e.g. accuracy, AUC), which may not be a good measure for building the predictive model, especially when positive and negative cases are imbalanced. To overcome the disadvantages of using a single classifier and a single objective function, we sought to develop a multi-classifier multi-objective (MCMO) radiogenomics model predict most mutations in most commonly mutated genes in ccRCC. In MCMO, multiple classifiers are used for building the model and a multi-objective optimization algorithm is used for training the model.

2. Materials and Methods

2.1. Data

2.1.1. Patients.

We conducted an institutional review board-approved, Health Insurance Portability and Accountability Act-compliant (HIPAA), retrospective study including 57 ccRCC patients from two independent cohorts. The first cohort consisted of 33 patients (median age 62 years, range 28–83) from the University of Texas Southwestern Medical Center (UTSW). The other cohort consisted of 24 patients (median age 59 years, range 26–74) from The Cancer Genome Atlas Kidney Renal Clear Cell Carcinoma (TCGA-KIRC) data collection. The TCGA-KIRC data collection is part of The Cancer Genome Atlas (TCGA), an ongoing project funded by the National Cancer Institute (NCI) and the National Human Genome Research Institute (NHGRI), which created an atlas of genetic changes related to more than 20 tumor types, including ccRCC. Clinical, genetic, and pathological data reside in the TCGA data portal, and radiological data is stored in The Cancer Imaging Archive (TCIA). Both TCGA and TCIA are accessible for public download (Smith et al, 2016; Clark et al, 2013).

All 57 patients fulfilled the following criteria: (a) histopathologic diagnosis of ccRCC and exome sequencing, including information on VHL, PBRM1, and BAP1 gene mutations, considered as frequent mutations in ccRCC; (b) availability of images from a pretreatment contrast-enhanced CT including a corticomedullary phase. For each gene, the numbers of patients with mutation and without mutation are listed (table 1).

Table 1.

Number of patients

	VHL		PBRM1		BAP1

	Mutation	Non-mutation	Mutation	Non-mutation	Mutation	Non-mutation
UTSW	26	7	19	14	5	28
TCGA	10	14	3	21	2	22
Total	36	21	22	35	7	50

Open in a new tab

2.1.2. CT Image features.

CT images from UTSW were acquired by GE LightSpeed VCT (GE Healthcare, Waukesha, WI) or TOSHIBA Aquilion ONE (Canon Medical Systems USA, Tustin, CA). CT image size was 512 × 512 with a pixel size of 0.7~0.9 mm, and slice thickness was 3 or 5mm. CT images from TCIA were acquired by the SIEMENS Sensation 64 / Definition AS+ (Siemens Medical Solution, Malvern, PA), Philips Brilliance 64 (Philips Healthcare, Andover, MA), or GE LightSpeed VCT. CT image size was 512 × 512 with a pixel size of 0.7~1 mm and slice thickness was 1.25 or 5 mm.

The primary tumor contour was delineated by a radiation oncologist with 4 years of experience and reviewed by a radiation oncologist with 9 years of experience. Contrast enhanced CT images acquired during the corticomedullary phase were used in all 57 cases for image analysis. A region of interest (ROI) was drawn along the outer contour of the mass using the Velocity 3.2.0 software excluding adjacent tissues (e.g. renal parenchyma, peri-renal fat) (figure 1).

Figure 1. — Tumor contour of ccRCC investigated in this study. (a) one case from UTSW. (b) one case from TCIA.

We resampled all images of the same slice thickness at 5mm. We only considered primary tumors and defined 43 quantitative image features describing tumor characteristics, including 13 geometry features, 9 intensity features, and 21 texture features (table 2). Geometry features describing tumor shape and size were calculated according to the actual pixel size. Features Size_X, Size_Y and Size_Z (table 2) describe the tumor size along the X, Y, and Z axes of the digital imaging and communications in medicine (DICOM) coordinate system (figure 2(a)). The shape and location of the tumors differed from patient to patient. To intuitively describe the size of the tumor, a principal component analysis (PCA) was applied to the tumor contour points to transform the data into a new coordinate system. Size_P1 is the maximum 3 dimensional diameter of the tumor, measured as the largest pairwise Euclidean distance between the voxels on the surface of the tumor volume; size_P2 and size_P3 are the tumor size along the directions orthogonal to the direction of maximum size (figure 2(b)).

Table 2.

Quantitative CT Image Features

Geometry features	Intensity features	Texture features
Volume	Minimum	Auto correlation
Size_X	Maximum	Contrast
Size_Y	Mean	Correlation
Size_Z	Median	Cluster prominence
Size_P1	Sum	Cluster shade
Size_P2	Variance	Dissimilarity
Size_P3	Standard deviation	Energy
Roundness	Skewness	Entropy
Surface area	Kurtosis	Homogeneity
Compactness_1		Maximum probability
Compactness_2		Variance
Spherical disproportion		Sum average
Surface to volume ratio		Sum variance
		Sum entropy
		Difference variance
		Difference entropy
		Information measure of correlation_1
		Information measure of correlation_2
		Inverse difference
		Inverse difference normalized
		Inverse difference moment normalized

Open in a new tab

Figure 2. — Coordinate systems. (a) The digital imaging and communications in medicine (DICOM) coordinate system. (b) Illustration of the transformation of the DICOM coordinate system into the PCA coordinate system.

Intensity features are first-order statistics that describe the distribution of the voxel intensities within the tumor on the CT image through commonly used basic metrics (Aerts et al, 2014). Intensity features provide information related to the gray-level distribution of the image; however, they do not provide any information on the relative position of the various gray levels over the image. Therefore, we included textural features that either describe the patterns or the spatial distribution of voxel intensities, which were calculated from the gray level co-occurrence matrix (GLCM) (Haralick et al, 1973). Texture matrix representation requires the voxel intensity values within the volume of interest to be discretized. In this work, voxel intensities were resampled into 64 equally spaced bins using a bin-width of 25 Hounsfield Units (HU). The detailed methodology of extracting intensity and texture features was previously described by Aerts et al (2014) (supplementary document). The calculated features of the 57 patients were normalized using the Z-scores method (Cheadle et al, 2003).

2.2. MCMO predictive model

2.2.1. Evidential reasoning based classifier fusion

The evidential reasoning (ER) approach was used for fusing the individual classifier probability output (Yang and Xu, 2002, 2013). Our study is a binary classification problem (mutation or non-mutation). Assuming there are classifiers, for a test sample, the output probability of each classifier is denoted by $P_{i} = {P_{i}^{1}, P_{i}^{1}}, i = 1, \cdot \cdot \cdot, M,$ which satisfies:

P_{i}^{1} + P_{i}^{2} = 1.

(1)

Where $P_{i}^{1}$ is output probability of gene mutation and $P_{i}^{2}$ is output probability of non mutation. Assume the relative weight of each classifier as w = ${w_{1}, w_{1}, \cdot \cdot \cdot, w_{m}},$ which satisfies the following constraint:

\sum_{i = 1}^{M} w_{i} = 1, 0 \leq w_{i} \leq 1.

(2)

The final output probabilities $P_{f i n}^{j}, j = 1, 2$ are obtained by classifier fusion through the ER approach (Yang and Xu, 2002, 2013):

P_{f i n}^{j} = E R (P_{i}^{J}, w_{i}), i = 1, \cdot \cdot \cdot M, j = 1, 2,

(3)

where ER represents the ER analytic algorithm (Wang et al, 2006), which is calculated as:

P_{f i n}^{j} = \frac{μ \times [\prod_{i = 1}^{M} (w_{i} P_{i}^{j} + 1 - ω_{i}) - \prod_{i = 1}^{M} (1 - w_{i})]}{1 - μ \times [\prod_{i = 1}^{M} (1 - w_{i})]}, j = 1, 2,

(4)

where µ is calculated as:

μ = {[\sum_{j = 1}^{2} \prod_{i = 1}^{M} (w_{i} P_{i}^{j} + 1 - w_{i}) - \prod_{i = 1}^{M} (1 - w_{i})]}^{- 1} .

(5)

For a binary classification problem, if $P_{f i n}^{1} > P_{f i n}^{2},$ the test sample belongs to class 1;if $P_{f i n}^{1} < P_{f i n}^{2},$ the test sample belongs to class 2; if the test sample belongs to either class.In this study, We sought to predict either the presence or absence of gene mutation. Therefore, if $P_{f i n}^{1} > P_{f i n}^{2},$ we considered the test sample had mutation; if $P_{f i n}^{1} \leq P_{f i n}^{2},$ we considered the test sample had non-mutation.

2.2.2. Reliable outcome prediction based on output probability similarity

To obtain more reliable predictive results, reliable outcome prediction (RCP) is proposed and defined as maximize the similarity between predicted output probability and true label vector. For example, we assume two models that predict the VHL gene mutation with the label vector [1, 0]. Model A has prediction probabilities (0.8, 0.2) (the probability of mutation is 0.8, the probability of non-mutation is 0.2, and the threshold is 0.5), and model B has prediction probabilities (0.55, 0.45). As 0.8 is closer to 1 than 0.55, the result of model A is more reliable than that of model B. In other words, the similarity between the probability output of model A and the label vector is higher than those observed for model B, which means model A is more reliable than model B in this prediction.

In RCP, the aim is to maximize the similarity between predicted output probability and true label vector T while training the single classifier model and weights. For a training sample, its label vector is denoted by T = [ T₁, T₂]. T is a binary vector, T = [1,0] (mutation) or T = [0,1] (non-mutation). Assuming that the predictive model has q parameters denoted by R = {R₁, R₂,⋯, R_q}, the objective function is expressed as:

f = \underset{w, R}{m a x} \sum_{k = 1}^{K} s i m (P^{k}, T^{k}),

(6)

where K represents the number of training samples and is the similarity measure. Since the above problem can be considered as the similarity of probability distribution and the dice coefficient (Sung-Hyuk, 2007) is effective for measuring similarity, it is used here:

s i m (P^{k}, T^{k}) = \frac{2 \sum_{j = 1}^{2} P_{j}^{k} T_{j}^{k}}{\sum_{j = 1}^{2} {(P_{j}^{k})}^{2} + \sum_{j = 1}^{2} {(T_{j}^{k})}^{2}} .

(7)

Since a single objective may not be a good measure when the training dataset is imbalanced, we consider sensitivity and specificity simultaneously as a better solution as follows:

f_{s e n} = \frac{T P}{T P + F N}, f_{s p e} = \frac{T N}{T N + F P},

(8)

where TP is the number of true positives, TN is the number of true negatives, FP is the number of false positives, and FN is the number of false negatives. In our previous study, f_sen and f_spe were both considered as objective functions (Zhou et al, 2017).

However, f_sen and f_spe are label based measures, while we sought to maximize the similarity of probability output and the true label vector. Therefore, we defined the new similarity-based sensitivity and specificity denoted by f_{sim_sen_} and f_{sim_spe}, respectively. Assume that ${P_{t p}^{1}, P_{t p}^{1}, \cdot \cdot \cdot, P_{t p}^{T P}}$ represents the probability output of true positives and the corresponding true label vector is ${T_{t p}^{1}, T_{t p}^{1}, \cdot \cdot \cdot, T_{t p}^{T P}} .$ The similarity of true positives TP_simis defined as:

T P_{s i m} = \sum_{k = 1}^{T P} s i m (P_{t p}^{k}, T_{t p}^{k}) = \sum_{k = 1}^{T P} \frac{2 \sum_{j = 1}^{2} P_{t p, j}^{k} T_{t p, j}^{k}}{\sum_{j = 1}^{2} {(P_{t p, j}^{k})}^{2} + \sum_{j = 1}^{2} {(T_{t p, j}^{k})}^{2}},

(9)

In gene mutation prediction, j=1 represents mutation, j=2 represents non-mutation, and is the mutation probability. Also, $P_{t p, 1}^{k} = P^{k}, P_{t p, 2}^{k} = 1 - P^{k}, T_{t p, 1}^{k} k = 1,$ and $T_{t p, 2}^{k} k = 0 .$ Therefore, equation (9) can be simplified as:

T P_{s i m} = \sum_{k = 1}^{T P} \frac{P^{k}}{{(P^{k})}^{2} - P^{k} + 1},

(10)

Similarly, we define the similarity of true negatives 𝑇𝑁_𝑠𝑖𝑚, false positives 𝐹𝑃_𝑠𝑖𝑚, and false negatives 𝐹𝑁_𝑠𝑖𝑚:

T N_{s i m} = \sum_{k = 1}^{T N} \frac{1 - P^{k}}{{(P^{k})}^{2} - P^{k} + 1},

(11)

F P_{s i m} = \sum_{k = 1}^{F P} \frac{1 - p^{k}}{{(P^{k})}^{2} - P^{k} + 1},

(12)

F N_{s i m} = \sum_{k = 1}^{F N} \frac{p^{k}}{{(p^{k})}^{2} - p^{k} + 1} .

(13)

Then, 𝑓_{𝑠𝑖𝑚_𝑠𝑒𝑛}, 𝑓_{𝑠𝑖𝑚_𝑠𝑝𝑒} are calculated as:

f_{s i m_{-} s e n} = \frac{T P_{s i m}}{T P_{s i m} + F N_{s i m}}, f_{s i m_{-} s p e} = \frac{T P_{s i m}}{T P_{s i m} + F P_{s i m},}

(14)

Our aim was to maximize the two similarity-based objective functions simultaneously as:

f_{s i m} = \max_{w, R} (f_{s i m_{-} s e n}, f_{s i m_{-} s p e}) .

(15)

Once training is finished, the Pareto-optimal solution set is generated, and the best model parameters and weights are selected based on the clinical needs. In the following subsection, we describe a new algorithm that was developed to solve the similarity-based multi-objective optimization problem.

2.3. Similarity-based multi-objective optimization (SMO) algorithm

Multi-objective evolutionary algorithms (MOEA) have demonstrated the superior performance for multi-objective optimization (Deb, 2001). Based on MOEA, we have proposed an iterative multi-objective immune algorithm (IMIA), which adopts the traditional sensitivity and specificity as the optimized objective functions (Zhou et al, 2017). Based on IMIA, we propose a new SMO algorithm. The major difference between IMIA and SMO is that the reliability is measured through similarity between output probability and label vector during the training process. Additionally, weighting coefficients needs to be optimized in SMO. For conciseness, we just gave a brief description of SMO and focused on the difference between SMO and IMIA. For a full and detailed algorithm, please refer to our previous paper (Zhou et al, 2017).

As IMIA, SMO consists of the 7 steps: initialization, cloning, mutation, deletion, solution updating, termination and best solution selection. In initialization, model parameters R and weights w were both initialized. We generated the initial solution set 𝐷(𝑡)={𝑑₁,⋯,𝑑_𝐻}( 𝑡=0) randomly, 𝑑_𝑖(𝑖=1,⋯,𝐻) is a particular solution, His the number of solutions, and 𝑡 is the number of generation. In cloning step, there is a big difference. In SMO, new similarity-based proportional cloning operation was proposed, where the solution with higher similarity was reproduced multiple times. Specifically, the clonal time CLT_i for each solution is calculated as:

C L T_{i} = ⌈ n_{c} \times \frac{s i m (d_{i})}{\sum_{i = 1}^{H} s i m (d_{i})} ⌉,

(15)

where 𝑛_𝑐 is the expected value of the clonal solution set and ⌈⌉ is the ceiling operator. The similarity measure for solution 𝑑_𝑖 denoted by sim(𝑑_𝑖) is calculated as:

s i m (d_{i}) = \sum_{k = 1}^{K} \frac{2 \sum_{j = 1}^{2} P_{j}^{k} T_{j}^{k}}{\sum_{j = 1}^{2} {(P_{j}^{k})}^{2} + \sum_{j = 1}^{2} {(T_{j}^{k})}^{2}},

(16)

where 𝐾 is the number of training samples and $T_{j}^{k}$ is the label vector.

The mutation and deletion in SMO are the same as those in IMIA. In this paragraph, “mutation” refers to the operation performed on the cloned solution set, not the gene mutation. The mutation probability threshold 𝑀𝑃 is determined empirically and an operation probability 𝑅𝑃_𝑖 is generated randomly. If 𝑅𝑃_𝑖 > 𝑀𝑃, a mutation operation is performed in which a new solution $d_{i}^{m}$ was generated randomly and replace the original solution 𝑑_𝑖. Then, a newly generated mutated solution set 𝑀(𝑡) and solution set 𝐷(𝑡) constitute the new solution set denoted by 𝐹(𝑡). Same solutions in (𝑡) were removed and anew solution set (𝑡) is generated.

In solution updating step of SMO, the aim is to select 𝐻 solutions from (𝑡) to maintain the population size. For each solution, we can obtain the similarities based on the probability outputs of all the training samples, according to equation (16). 𝑓_{𝑠𝑖𝑚_𝑠𝑒𝑛} and 𝑓_{𝑠𝑖𝑚_𝑠𝑝𝑒} can also be calculated according to equation (14). Using the MOEA (Deb, 2001; Deb et al, 2002), we selected Hsolutions. Unlike most traditional MOEAs, the solution in (𝑡) is sorted according to the similarity of each solution. Then, the new solution set (𝑡) is generated.

When t reaches the maximal number of generations 𝐺_𝑚𝑎𝑥, the algorithm terminates. The best solution is selected from the Pareto-optimal solution set (𝐺_𝑚𝑎𝑥) according to clinical needs. We selected the best solution according to the similarity-based sensitivity, specificity, and AUC. First, the thresholds T_{sim_sen} and T_{sim_spe} are determined for similarity-based sensitivity and specificity according to clinical needs. Second, for each solution 𝑑_𝑖 in (𝐺_𝑚𝑎𝑥), we calculate its similarity-based sensitivity $f_{s i m_{-} s e n}^{i}$ and specificity $f_{s i m_{-} s p e}^{i} .$ If $f_{s i m_{-} s e n}^{i} > T_{s i m_{-} s e n}$ , 𝑑_𝑖 is selected as a candidate solution. Third, we select the solution with the highest AUC from the candidate solutions as our final solution.

2.4. Training and testing procedure of the MCMO model

The training process mainly consists of three stages: feature calculation, feature selection, and predictive model construction. To achieve optimal performance for each classifier, we adopted our multi-objective feature selection method (Zhou et al, 2017). After selecting the features for each classifier, model parameters R_i and weights w_i (i = 1,2, …M ) were trained. The workflow is illustrated in figure 3.

Figure 3. — Training process of MCMO predictive model.

The testing process consists of three stages (figure 4). For a test sample, first, the features for each classifier are selected; second, each classifier are selected; second, each classifier outputs a probability $P_{i}^{j} (j = 1, 2; i = 1, 2, … M);$ ; third, the final mutation probability $P_{f i n}^{j}$ is obtained by combining all $P_{i}^{j}$ and 𝑤_𝑖 using the ER approach. Then the label can be determined.

We used six different classifiers in the MCMO model, including support vector machine (SVM) (Keerthi and Lin, 2003), logistic regression (LR) (Freedman, 2009), discriminant analysis (DA) (Hastie and Tibshirani, 1996), decision tree (DT) (Breiman, 2001), K-nearest-neighbor (KNN) (Keller et al, 2012), and naive Bayesian (NB) (Goldszmidt and Moises, 1997). Since SVM has two model parameters and other classifiers use default parameters, we train eight parameters ${R_{S V M_{-} 1,} R_{S V M_{-} 2,} w_{1}, … w_{6}}$ for the predictive model.

3. Results

3.1. Experimental setup

In MCMO, the population number H and the maximal generation number G_max were both set to 100. In the clone operation, n_c was set to 200. In the mutation operation, the mutation probability MP was set to 0.9. The proposed MCMO predictive model was compared to three types of predictive model: (1) models with single classifiers; (2) models with different multi-objective optimization; (3) models with different fusion strategies. Because multi-objective optimization is better than the single-objective model (Zhou et al, 2017), we did not compare our proposed model to models with single-objective optimization. Area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, similarity-based sensitivity (Sim-sensitivity), and similarity-based specificity (Sim-specificity) were used for evaluating the model performance.

In our study, eight parameters in the predictive model need to be trained and our dataset has 57 cases. We adopted two-fold cross-validation in this work. Cross-validation is a widely used model validation technique which can test the model’s ability to predict new data that were not used in training, in order to flag problems like overfitting (Kohavi, 1995). One round of cross-validation involves partitioning a sample of data into complementary subsets. In our work, two-fold cross-validation was used, where for each round, half cases (training set) were selected randomly for training and the other half cases (validation set) were used for validation, then reverse. To reduce variability caused by subset partition, ten rounds of two-fold cross-validation were performed for each model, and the validation results were averaged over the 10 rounds to give an estimate of the model’s predictive performance (table 3). Prediction results of AUC, accuracy, sensitivity and specificity of training set were also listed in table 3. The prediction accuracy of training set is higher than that obtained from the validation set, which is considered as normal for a machine learning algorithm. On the other hand, the accuracies of the training set are not close to 1 and the predictive results on the validation set are acceptable. In the remaining of the paper, all the prediction results were from the validation set.

Table 3.

Results of MCMO prediction models (training set vs. validation set)

Gene	Data set	AUC	Accuracy	Sensitivity	Specificity
VHL	Training set	0.96±0.02	0.91±0.01	0.93±0.02	0.88±0.02
VHL	Validation set	0.88±0.01	0.81±0.02	0.79±0.04	0.86±0.02

PBRM1	Training set	0.95±0.01	0.90±0.03	0.87±0.03	0.92±0.02
PBRM1	Validation set	0.86±0.02	0.78±0.02	0.75±0.02	0.80±0.02

BAP1	Training set	0.97±0.01	0.92±0.01	0.9±0.03	0.92±0.01
BAP1	Validation set	0.93±0.02	0.90±0.02	0.87±0.02	0.90±0.03

Open in a new tab

3.2. Selected features for the three genes

We obtained the classifier-specific feature subset and summarized the features selected by all (six) classifiers (table 4). Statistics using the unpaired T test were also listed. The feature selected by all classifiers indicates that the feature is important to predict mutation. P-value smaller than 0.05 indicated significant differences between presence and absence of mutations. However, four selected features (Minimum, Contrast, Variance and Sum variance) for the PBRM1 gene and three selected features (Variance, Sum average and Sum variance) for the BAP1 gene had P-values greater than 0.05. Because we selected the optimal features according to multi-objective model, using the AUC as the figure of metric, these selected features did not necessarily have P-values smaller than 0.05. Example boxplots were plotted to show the potential of a single feature for differentiating between the presence and absence of mutations (figure 5).

Table 4.

Selected features by all (six) classifiers for VHL, PBRM1 and BAP1 genes.

	Geometry (P-value)	Intensity (P-value)	Texture (P-value)
VHL	Mean (0.020)	Kurtosis (0.002)

PBRM1		Minimum (0.088)	Contrast (0.085)
		Mean (0.020)	Maximum probability (0.030)
		Median (0.041)	Variance (0.162)
		Skewness (0.042)	Sum variance (0.168)
		Kurtosis (0.005)

BAP1	Size_X (0.006)	Sum (0.048)	Homogeneity (0.023)
			Variance (0.171)
			Sum average (0.159)
			Sum variance (0.168)

Open in a new tab

Figure 5. — Boxplots for (a) Kurtosis (*VHL*), (b) Mean (*PBRM1*), and (c) Homogeneity (*BAP1*).

Two intensity features including Mean and Kurtosis are the most frequently selected features in VHL gene prediction. A histogram with a more elongated tail indicates smaller Kurtosis. A tumor with smaller Kurtosis is more likely to carry a VHL mutation (figure 5(a) and figure 6). For the PBRM1 gene prediction, nine features from intensity and texture features, were most frequently selected. A boxplot of the Mean is illustrated in figure 5(b). Five features were selected as the most prominent contributors in BAP1. Four texture features were selected as follows: Homogeneity, Variance, Sum average, and Sum variance. A lower similarity in intensity between a voxel and its neighbors led to higher Variance and Sum variance. A less uniform or more focal intensity distribution led to reduced Homogeneity. Therefore, larger Variance, and smaller Homogeneity were associated with the likelihood that a tumor carried a BAP1 mutation. Boxplot of Homogeneity is illustrated in figure 5(c).

Figure 6. — Kurtosis in tumor CT images. (a) A tumor without *VHL* mutation, (b) Histogram, with kurtosis= 26.57 (Z-score= 1.92), (c) A tumor with *VHL* mutation, (d) Histogram with kurtosis= 7.17 (Z-score= −1.33).

It is noted that one single feature may not achieve accurate predictive results. For each classifier, feature set is necessary. For KNN classifier, a feature set consisting of 12 features (Volume, Size_P3, Minimum, Maximum, Mean, Sum, Variance, Standard deviation, Kurtosis, Cluster shade, Energy, Inverse difference) was selected to predict VHL mutation; while for LR classifier, a feature set consisting of 19 features (Volume, Size_Z, Size_P2, Roundness, Surface area, Mean, Standard deviation, Skewness, Kurtosis, Contrast, Dissimilarity, Energy, Entropy, Homogeneity, Sum entropy, Information measure of correlation_1, Information measure of correlation_2, Inverse difference, Inverse difference normalized) was selected to predict VHL mutation.

3.3. Performance evaluation of MCMO vs. single classifiers

MCMO yielded better AUC, accuracy, sensitivity, and specificity results than other single classifiers (table 5). The prediction accuracy of MCMO is 0.81, 0.78, and 0.90 for VHL, PBRM1, and BAP1 genes, respectively, with AUC >= 0.86 sensitivity > =0.75 and specificity > =0.80. MCMO yielded better results than other single classifiers. KNN is the best single classifier for the VHL and PBRM1 genes, with specificities of 0.66 and 0.62, respectively. MCMO can achieve specificities of 0.86 and 0.80 for VHL and PBRM1 genes, respectively. SVM and DA achieved similar results for the BAP1 gene, which are better than other single classifiers, but sensitivities were only 0.57 and 0.63, respectively; the sensitivity obtained by MCMO was 0.87. Some single classifiers achieved higher sensitivities for the VHL gene, but the corresponding specificities were poor. MCMO achieved the highest AUC and accuracy with balanced sensitivities and specificities (difference < 0.1). Also, MCMO is a stable predictive model because its standard deviations are much smaller than those of the single classifiers.

Table 5.

Results of different prediction models (MOMC vs. single classifier)

Gene	Classifier	AUC	Accuracy	Sensitivity	Specificity
	SVM	0.71±0.06	0.69±0.05	0.88±0.07	0.37±0.11
	NB	0.33±0.14	0.67±0.05	0.84±0.10	0.36±0.16
	LR	0.73±0.04	0.71±0.05	0.73±0.05	0.67±0.09
VHL	KNN	0.80±0.04	0.78±0.03	0.84±0.04	0.66±0.08
	DT	0.67±0.06	0.68±0.07	0.71±0.08	0.61±0.11
	DA	0.71±0.05	0.66±0.05	0.68±0.11	0.64±0.10
	MCMO	0.88±0.01	0.81±0.02	0.79±0.04	0.86±0.02

	SVM	0.68±0.06	0.63±0.06	0.52±0.14	0.70±0.06
	NB	0.41±0.09	0.58±0.05	0.52±0.11	0.62±0.08
	LR	0.62±0.10	0.63±0.09	0.52±0.14	0.69±0.09
PBRM1	KNN	0.67±0.10	0.64±0.10	0.65±0.16	0.62±0.11
	DT	0.52±0.08	0.54±0.07	0.44±0.10	0.60±0.10
	DA	0.59±0.07	0.59±0.06	0.54±0.10	0.62±0.09
	MCMO	0.86±0.02	0.78±0.02	0.75±0.02	0.80±0.02

	SVM	0.80±0.07	0.81±0.03	0.57±0.11	0.85±0.04
	NB	0.74±0.11	0.02±0.02	0.24±0.0	0.88±0.02
	LR	0.69±0.08	0.81±0.03	0.54±0.21	0.84±0.04
BAP1	KNN	0.72±0.06	0.80±0.05	0.57±0.10	0.83±0.06
	DT	0.54±0.09	0.81±0.06	0.24±0.18	0.89±0.08
	DA	0.82±0.08	0.80±0.05	0.63±0.20	0.82±0.04
	MCMO	0.93±0.02	0.90±0.02	0.87±0.02	0.90±0.03

Open in a new tab

3.4. Comparative study of objective functions

One group of the Pareto-optimal solution set and the selected final solution in SMO is shown in figure 7. As described in section 2.3, the best solution was selected according to the similarity-based sensitivity, specificity (equation (14)), and AUC. First, thresholds T_{sim_sen} and T_{sim_spe} were determined for similarity-based sensitivity and specificity based on clinical needs. In this study, the thresholds T_{sim_sen} and T _{sim_spe} are both 0.9. The selected candidate solutions were included within the red rectangle and the selected final solution (highest AUC) was marked in red.

We evaluated the performance of our MCMO by comparing it to the iterative multi-objective immune algorithm (IMIA), which adopts the traditional sensitivity and specificity as the optimized objective functions (Zhou et al, 2017) (figure 8). The two methods were compared with the unpaired T test at a significance level 0.05 (table 6). Results were similar based on AUC, accuracy, sensitivity and specificity (P-value> 0.05). For VHL and PBRM1 genes, SMO achieved a little higher AUCs. For all three genes, SMO achieved significantly higher similarity scores (P-value <= 0.01), indicating that these results are more reliable. For the prediction result with higher AUC, the difference of 𝑓_{𝑠𝑖𝑚_𝑠𝑒𝑛} or 𝑓_{𝑠𝑖𝑚_𝑠𝑝𝑒} between SMO and IMIA is small. For example, for the BAP1gene, the difference of 𝑓_{𝑠𝑖𝑚_𝑠𝑒𝑛} and that of 𝑓_{𝑠𝑖𝑚_𝑠𝑝𝑒} are 0.03 and 0.02. However, for the prediction result with lower AUC, such as for the PBRM1 gene, the difference of 𝑓_{𝑠𝑖𝑚_𝑠𝑒𝑛} is 0.13 and that of 𝑓_{𝑠𝑖𝑚_𝑠𝑝𝑒} is 0.07.

Figure 8 . — Results of using different objective functions. (a) *VHL*; (b) *PBRM1*; (c) *BAP1*.

Table 6.

Results of P-values compared between SMO and IMIA

Gene	AUC	Accuracy	Sensitivity	Specificity	Sim-sensitivity	Sim-specificity
VHL	0.116	0.449	0.226	0.206	0.003	0.0002
PBRM1	0.412	0.690	0.355	0.355	<0.0001	<0.0001
BAP1	0.574	0.536	0.330	0.472	0.01	0.007

Open in a new tab

3.5. Comparative study of fusing method

We used the ER approach (equation (4)) for fusing the output of different classifiers. The classic weighted fusion (WF) method is used for comparison, as:

P = \sum_{i = 1}^{M} P_{i} w_{i}

(17)

where 𝑃_𝑖 is the individual classifier output probability and 𝑤_𝑖 is the relative weight. SMO is used in both fusion strategies, and the comparative results are shown in figure 9. The two methods were compared with the unpaired t test at a significance level 0.05 (table 7). For VHL and PBRM1 genes, the ER approach achieved higher AUCs (P-value < 0.05). Also, sim-sensitivity and sim-specificity in ER are higher than WF (P-value <= 0.02), which indicates that more reliable results can be obtained when using ER fusion.

Figure 9. — Results of using different fusion strategies (ER vs. WF). (a) *VHL*; (b) *PBRM1*; (c) *BAP1*.

Table 7.

Results of P-values compared between ER and WF

Gene	AUC	Accuracy	Sensitivity	Specificity	Sim-sensitivity	Sim-specificity
VHL	0.021	0.083	0.760	0.018	0.007	<0.0001
PBRM1	0.0001	0.244	0.306	0.492	0.04	0.02
BAP1	0.882	0.045	0.330	0.074	0.02	0.0009

Open in a new tab

4. Discussion

The study of the association between diagnostic imaging features and mutations is a first critical step in the radiogenomics of ccRCC (Kuo and Yamamoto, 2011). While the genomic landscape of ccRCC is first characterized by the loss of VHL function, recent advances in cancer genome sequencing have identified additional, prognostically significant mutations. Two hypothesis-generating studies indicated the potential association between individual CT features and mutations of the VHL gene, and also mutations in the PBRM1, BAP1, SETD2 and, KDM5C genes (Karlo et al, 2014; Shinagare et al, 2015). The CT features identified by the radiologists were primarily morphological (e.g. necrosis, ill or well defined margins, renal vein invasion). In contrast, all of our features were quantitative descriptors extracted from the contoured tumor image. The feature extraction was automated, eliminating subjectivity and improving reproducibility.

The most frequently selected features varied depending on the gene. The most frequently selected features of the VHL gene were intensity features (Mean and Kurtosis), which described the mean values and intensity distribution in tumor volume. Intensity and texture features were found to be important in the PBRM1 predictive model. The most selected features of PBRM1 consisted of five intensity features that measured the intensity distribution in tumor volume, and four texture features that measured the local differences within an image. Karlo et al (2014) reported that nodular, heterogeneous enhancement and visibility of intratumoral blood vessels in tumors were more common among ccRCCs with underlying VHL mutations. Also, investigators found an association between a well-defined tumor margin and the VHL mutation, and an association between solid ccRCC and mutations in VHL and PBRM1. However, Shinagare et al (2015) did not observe any imaging characteristics associated with PBRM1 and VHL mutations. Because both evaluations were subjective and their features were morphological, we were unable to directly compare our quantitative results with those morphological features.

The most frequently selected features of BAP1 consisted of one geometry feature, one intensity feature, and four texture features. Texture features were the most prominent contributors in the BAP1 predictive model. In a study by Shinagare et al (2015), the BAP1 mutation was found to be associated with ill-defined tumor margins and calcification. We did not study tumor margins because they are not a quantitative feature. However, the presence of calcification may be associated with selected features such as Homogeneity and Variance.

We proposed a MCMO radiogenomics model that predicts gene mutations in ccRCC. Multi-classifier models can fully use information extracted by different classifiers and potentially improve prediction accuracy. We used all six different classifiers without considering the performance of individual classifiers. In future work, we will test the performance of single classifiers, and remove those with lower performance for the multi-classifier model. SVM and NB results for the VHL gene were found to be poor (table 5), therefore eliminating them and using LR, KNN, DT and DA for fusion may improve prediction results.

Both similarity-based sensitivity and specificity were considered simultaneously as the objects that guided construction of the predictive model. For the first time, we propose reliable outcome prediction, which refers to maximizing the similarity of output probability and true label (probability is 1). Higher similarity means higher reliability. We designed similarity-based sensitivity and specificity as optimized objective functions, which differ from traditional ones. Moreover, an SMO algorithm was developed to train the model to increase accuracy and confidence of predictive results. Compared with our previous IMIA method, similarity was adopted as a non-dominated sorting criterion upon updating the solution set. Also, the solution with higher similarity was kept. These findings indicate that the prediction results are more reliable (higher f_{sim_sen} and f_{sim_spe} ) when using our model. As for the fusion strategy, the ER approach is better than the classic WF in terms of similarity-based optimization. SMO algorithm and ER approach both contribute to reliability increase.

Our study presents a number of limitations. First, the number of patients used in our study is relatively small. In our model, eight parameters (2 parameters of SVM and 6 weights) need to be estimated and two-fold cross-validation were used. While two-fold cross-validation results showed that our model achieved satisfactory results, a larger dataset and multi-classifier fusion could help to reduce the potential risk of overfitting (Dietterich, 2000). In a future work, we can apply this MCMO model in different classification where dataset of larger size is available. Second, recent advances in genetics have led to the identification of several mutations associated with ccRCC, including those involving the VHL, BAP1, PBRM1, SETD2, MUC4 and KDM5C genes. The genomic information of all six genes were available for the 24 patients in TCGA/TCIA data collection. However, genomic information of the three genes (SETD2, MUC4 and KDM5C) was not available for most of the 33 UTSW cases used in the present study. Thus, we only considered VHL, PBRM1 and BAP1 genes in this work. Third, the feature stability was not addressed in the current study. Our patient data was acquired by different CT scanners at different institutions with different protocols, which resulted in differences in pixel size and slice thickness, while the differences of scanner and protocol have influence on feature calculations (Mackin et al, 2015). Additionally, tumor delineation was conducted by one physician and reviewed by another physician in this study. This could also introduce inter-observer variability in tumor delineation. Standardization of image acquisition protocols, automatic segmentation or consensus contours from more physicians may further improve the performance of the model developed in this work.

5. Conclusion

We proposed a multi-classifier multi-objective (MCMO) radiogenomics model that predicts VHL, PBRM1, and BAP1 gene mutations in ccRCC using quantitative CT feature set. Using our feature selection strategy and model, we achieved a predictive AUC greater than 0.85 for all three genes. Compared to single classifiers, multi-classifiers fused through ER and trained by developed SMO algorithm can greatly improve prediction accuracy and reliability. In MCMO, the concept of reliable outcome prediction was first proposed and applied to the optimization procedure, generating more reliable results. The MCMO model should not only be applied to radiogenomics, but also to solving other outcome prediction problems in medicine.

Acknowledgement

This work was supported in part by the American Cancer Society ACS-IRG-02-196 (J. Wang), the US National Institutes of Health P50CA196516 (J. Brugarolas, P. Kapur, I. Pedrosa) and R01CA154475 (I. Pedrosa), and the National Natural Science Foundation of China 61401349 (X. Chen) and 61571359 (X. Mou). The authors thank Dr. Damiana Chiavolini for editing the manuscript.

References

Aerts HJ, Velazquez ER, Leijenaar RT, Parmar C, Grossmann P, Carvalho S, Bussink J, Monshouwer R, Haibe-Kains B and Rietveld D 2014. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach Nat. Commun 5 4006. [DOI] [PMC free article] [PubMed] [Google Scholar]
Breiman L 2001. Random Forests Mach. Learn 45 5–32 [Google Scholar]
Cancer Genome Atlas Research Network 2013. Comprehensive molecular characterization of clear cell renal cell carcinoma Nature 499 43–9 [DOI] [PMC free article] [PubMed] [Google Scholar]
Carles J, Chirivella I, Climent MÁ, Gallardo E, Del Alba AG, Maroto JP, Mellado B and Del Muro FXG 2012. Evaluation of patients with metastatic renal cell carcinoma after failure of first-line treatment Cancer Metastasis Rev 31 S3–S9 [DOI] [PubMed] [Google Scholar]
Cheadle C, Vawter MP, Freed WJ and Becker KG 2003. Analysis of microarray data using Z score transformation J. Mol. Diagn 5 73–81 [DOI] [PMC free article] [PubMed] [Google Scholar]
Clark KVB, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Maffitt D, Pringle M, Tarbox L, Prior F 2013. The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository J. Digit. Imaging 26 1045–57 [DOI] [PMC free article] [PubMed] [Google Scholar]
Dalgliesh GL, Furge K, Greenman C, Chen L, Bignell G, Butler A, Davies H, Edkins S, Hardy C and Latimer C 2010. Systematic sequencing of renal carcinoma reveals inactivation of histone modifying genes Nature 463 360–3 [DOI] [PMC free article] [PubMed] [Google Scholar]
Deb K 2001. Multi-Objective Optimization Using Evolutionary Algorithms vol 16 (New York: Wiley; ) [Google Scholar]
Deb K, Pratap A, Agarwal S and Meyarivan T 2002. A fast and elitist multiobjective genetic algorithm: NSGA-II IEEE T. Evolut. Comput 6 182–97 [Google Scholar]
Dietterich TG 2000. Ensemble methods in machine learning International Workshop on Multiple classifier systems Springer, Berlin, Heidelberg: 1–15 [Google Scholar]
Duns G, van den Berg E, van Duivenbode I, Osinga J, Hollema H, Hofstra RM and Kok K 2010. Histone methyltransferase gene SETD2 is a novel tumor suppressor gene in clear cell renal cell carcinoma Cancer Res 70 4287–91 [DOI] [PubMed] [Google Scholar]
Freedman DA 2009. Statistical Models: Theory and Practice (Cambridge, England: Cambridge University Press; ) [Google Scholar]
Gerlinger M, Horswell S, Larkin J, Rowan AJ, Salm MP, Varela I, Fisher R, McGranahan N, Matthews N and Santos CR 2014. Genomic architecture and evolution of clear cell renal cell carcinomas defined by multiregion sequencing Nat. Genet 46 225–33 [DOI] [PMC free article] [PubMed] [Google Scholar]
Gerlinger M, Rowan AJ, Horswell S, Larkin J, Endesfelder D, Gronroos E, Martinez P, Matthews N, Stewart A and Tarpey P 2012. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing N. Engl. J. Med 366 883–92 [DOI] [PMC free article] [PubMed] [Google Scholar]
Gevaert O, Echegaray S, Khuong A, Hoang CD, Shrager JB, Jensen KC, Berry GJ, Guo HH, Lau C and Plevritis SK 2017. Predictive radiogenomics modeling of EGFR mutation status in lung cancer Sci Rep 7 41674. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gnarra JR, Tory K, Weng Y, Schmidt L, Wei MH, Li H, Latif F, Liu S, Chen F and Duh FM 1994. Mutations of the VHL tumour suppressor gene in renal carcinoma Nature Genetics 7 85–90 [DOI] [PubMed] [Google Scholar]
Goldszmidt and Moises 1997. Bayesian Network Classifiers Mach. Learn 29 131–63 [Google Scholar]
Guo G, Gui Y, Gao S, Tang A, Hu X, Huang Y, Jia W, Li Z, He M and Sun L 2012. Frequent mutations of genes encoding ubiquitin-mediated proteolysis pathway components in clear cell renal cell carcinoma Nat. Genet 44 17–9 [DOI] [PubMed] [Google Scholar]
Hakimi AA, Chen Y-B, Wren J, Gonen M, Abdel-Wahab O, Heguy A, Liu H, Takeda S, Tickoo SK and Reuter VE 2013. Clinical and pathologic impact of select chromatin-modulating tumor suppressors in clear cell renal cell carcinoma Eur. Urol 63 848–54 [DOI] [PMC free article] [PubMed] [Google Scholar]
Haralick RM, Shanmugam K and Dinstein IH 1973. Textural features for image classification IEEE Trans. Syst. Man Cyber SMC-3 610–21 [Google Scholar]
Hastie T and Tibshirani R 1996. Discriminant Analysis by Gaussian Mixtures J. Roy. Stat. Soc 58 155–76 [Google Scholar]
Jaffe CC 2012. Imaging and genomics: is there a synergy? Radiology 264 329–31 [DOI] [PubMed] [Google Scholar]
Kapur P, Peña-Llopis S, Christie A, Zhrebker L, Pavía-Jiménez A, Rathmell WK, Xie X-J and Brugarolas J 2013. Effects on survival of BAP1 and PBRM1 mutations in sporadic clear-cell renal-cell carcinoma: a retrospective analysis with independent validation Lancet Oncol 14 159–67 [DOI] [PMC free article] [PubMed] [Google Scholar]
Karlo CA, Di Paolo PL, Chaim J, Hakimi AA, Ostrovnaya I, Russo P, Hricak H, Motzer R, Hsieh JJ and Akin O 2014. Radiogenomics of clear cell renal cell carcinoma: associations between CT imaging features and mutations Radiology 270 464–71 [DOI] [PMC free article] [PubMed] [Google Scholar]
Keerthi SS and Lin C-J 2003. Asymptotic behaviors of support vector machines with Gaussian kernel Neural Comput 15 1667–89 [DOI] [PubMed] [Google Scholar]
Keller JM, Gray MR and Givens JA 2012. A fuzzy K-nearest neighbor algorithm IEEE T. Syst. Man Cy SMC-15 580–5 [Google Scholar]
Kohavi R 1995. A study of cross-validation and bootstrap for accuracy estimation and model selection International Joint Conference on Artificial Intelligence. Morgan Kaufmann Publishers Inc; 1137–43. [Google Scholar]
Kuo MD and Jamshidi N 2014. Behind the numbers: decoding molecular phenotypes with radiogenomics—guiding principles and technical considerations Radiology 270 320–5 [DOI] [PubMed] [Google Scholar]
Kuo MD and Yamamoto S 2011. Next generation radiologic-pathologic correlation in oncology: Rad-Path 2.0 Am. J. Roentgenol 197 990–7 [DOI] [PubMed] [Google Scholar]
Mackin D, Fave X, Zhang L, Fried D, Yang J, Taylor B, Rodriguezrivera E, Dodge C, Jones AK and Court L 2015. Measuring CT scanner variability of radiomics features Investigative Radiology 50 757. [DOI] [PMC free article] [PubMed] [Google Scholar]
McGranahan N and Swanton C 2015. Biological and therapeutic impact of intratumor heterogeneity in cancer evolution Cancer Cell 27 15–26 [DOI] [PubMed] [Google Scholar]
Motzer RJ, Bacik J, Mariani T, Russo P, Mazumdar M and Reuter V 2002. Treatment outcome and survival associated with metastatic renal cell carcinoma of non–clear-cell histology J. Clin. Oncol 20 2376–81 [DOI] [PubMed] [Google Scholar]
Motzer RJ, Jonasch E, Agarwal N, Bhayani S, Bro WP, Chang SS, Choueiri TK, Costello BA, Derweesh IH and Fishman M 2017. Kidney Cancer, Version 2. 2017, NCCN Clinical Practice Guidelines in Oncology J Natl Compr Canc Netw 15 804–34 [DOI] [PubMed] [Google Scholar]
Peña-Llopis S, Vega-Rubín-de-Celis S, Liao A, Leng N, Pavía-Jiménez A, Wang S, Yamasaki T, Zhrebker L, Sivanand S and Spence P 2012. BAP1 loss defines a new class of renal cell carcinoma Nat. Genet 751–9 [DOI] [PMC free article] [PubMed] [Google Scholar]
Powles T and Albers P 2012. Management of favorable-risk patients with metastatic renal cell carcinoma: when to start and when to stop targeted therapy Clin. Genitourin. Cancer 10 213–8 [DOI] [PubMed] [Google Scholar]
Reznek RH 2004. CT/MRI in staging renal cell carcinoma Cancer Imaging 4 S25–S32 [DOI] [PMC free article] [PubMed] [Google Scholar]
Rutman AM and Kuo MD 2009. Radiogenomics: creating a link between molecular diagnostics and diagnostic imaging Eur. J. Radiol 70 232–41 [DOI] [PubMed] [Google Scholar]
Sala E, Mema E, Himoto Y, Veeraraghavan H, Brenton JD, Snyder A, Weigelt B and Vargas HA 2017. Unravelling tumour heterogeneity using next-generation imaging: radiomics, radiogenomics, and habitat imaging Clin. Radiol 72 3–10 [DOI] [PMC free article] [PubMed] [Google Scholar]
Sauk SC, Hsu MS, Margolis DJ, Lu DS, Rao NP, Belldegrun AS, Pantuck AJ and Raman SS 2011. Clear cell renal cell carcinoma: multiphasic multidetector CT imaging features help predict genetic karyotypes Radiology 261 854–62 [DOI] [PubMed] [Google Scholar]
Shinagare AB, Vikram R, Jaffe C, Akin O, Kirby J, Huang E, Freymann J, Sainani NI, Sadow CA and Bathala TK 2015. Radiogenomics of clear cell renal cell carcinoma: preliminary findings of The Cancer Genome Atlas–Renal Cell Carcinoma (TCGA–RCC) Imaging Research Group Abdom. Imaging 40 1684–92 [DOI] [PMC free article] [PubMed] [Google Scholar]
Siegel RL, Miller KD and Jemal A 2017. Cancer Statistics, 2017. Ca-Cancer J. Clin 67 7–30 [DOI] [PubMed] [Google Scholar]
Smith KCK, Bennett W, Nolan T, Kirby J, Wolfsberger M, Moulton J, Vendt B, Freymann J 2016. Radiology Data from The Cancer Genome Atlas Kidney Renal Clear Cell Carcinoma (TCGA-KIRC) collection [Google Scholar]
Stewartmerrill SB, Thompson RH, Boorjian SA, Psutka SP, Lohse CM, Cheville JC, Leibovich BC and Frank I 2015. Oncologic Surveillance After Surgical Resection for Renal Cell Carcinoma: A Novel Risk-Based Approach Journal of Urology 193 e650–e1 [DOI] [PubMed] [Google Scholar]
Sung-Hyuk C 2007. Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions International Journal of Mathematical Models & Methods in Applied Sciences 1 300–7 [Google Scholar]
Valdes G, Solberg TD, Heskel M, Ungar L and Simone CB 2016. Using machine learning to predict radiation pneumonitis in patients with stage I non-small cell lung cancer treated with stereotactic body radiation therapy Phys. Med. Biol 61 6105–20 [DOI] [PMC free article] [PubMed] [Google Scholar]
Varela I, Tarpey P, Raine K, Huang D, Ong CK, Stephens P, Davies H, Jones D, Lin M-L and Teague J 2011. Exome sequencing identifies frequent mutation of the SWI/SNF complex gene PBRM1 in renal carcinoma Nature 469 539–42 [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang Y, Yang J and Xu DL 2006. Environmental impact assessment using the evidential reasoning approach Eur. J. Oper. Res 174 1885–913 [Google Scholar]
Yamamoto S, Maki DD, Korn RL and Kuo MD 2012. Radiogenomic analysis of breast cancer using MRI: a preliminary study to define the landscape Am. J. Roentgenol 199 654–63 [DOI] [PubMed] [Google Scholar]
Yang JB and Xu DL 2002. On the evidential reasoning algorithm for multiple attribute decision analysis under uncertainty IEEE T. Syst. Man Cy. A 32 289–304 [Google Scholar]
Yang JB and Xu DL 2013. Evidential reasoning rule for evidence combination Artif. Intell 205 1–29 [Google Scholar]
Zhou Z, Folkert M, Iyengar P, Westover K, Zhang Y, Choy H, Timmerman R, Jiang S and Wang J 2017. Multi- objective radiomics model for predicting distant failure in lung SBRT Phys. Med. Biol 62 4460. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] Aerts HJ, Velazquez ER, Leijenaar RT, Parmar C, Grossmann P, Carvalho S, Bussink J, Monshouwer R, Haibe-Kains B and Rietveld D 2014. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach Nat. Commun 5 4006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R51] Breiman L 2001. Random Forests Mach. Learn 45 5–32 [Google Scholar]

[R2] Cancer Genome Atlas Research Network 2013. Comprehensive molecular characterization of clear cell renal cell carcinoma Nature 499 43–9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] Carles J, Chirivella I, Climent MÁ, Gallardo E, Del Alba AG, Maroto JP, Mellado B and Del Muro FXG 2012. Evaluation of patients with metastatic renal cell carcinoma after failure of first-line treatment Cancer Metastasis Rev 31 S3–S9 [DOI] [PubMed] [Google Scholar]

[R4] Cheadle C, Vawter MP, Freed WJ and Becker KG 2003. Analysis of microarray data using Z score transformation J. Mol. Diagn 5 73–81 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] Clark KVB, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Maffitt D, Pringle M, Tarbox L, Prior F 2013. The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository J. Digit. Imaging 26 1045–57 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] Dalgliesh GL, Furge K, Greenman C, Chen L, Bignell G, Butler A, Davies H, Edkins S, Hardy C and Latimer C 2010. Systematic sequencing of renal carcinoma reveals inactivation of histone modifying genes Nature 463 360–3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] Deb K 2001. Multi-Objective Optimization Using Evolutionary Algorithms vol 16 (New York: Wiley; ) [Google Scholar]

[R8] Deb K, Pratap A, Agarwal S and Meyarivan T 2002. A fast and elitist multiobjective genetic algorithm: NSGA-II IEEE T. Evolut. Comput 6 182–97 [Google Scholar]

[R9] Dietterich TG 2000. Ensemble methods in machine learning International Workshop on Multiple classifier systems Springer, Berlin, Heidelberg: 1–15 [Google Scholar]

[R10] Duns G, van den Berg E, van Duivenbode I, Osinga J, Hollema H, Hofstra RM and Kok K 2010. Histone methyltransferase gene SETD2 is a novel tumor suppressor gene in clear cell renal cell carcinoma Cancer Res 70 4287–91 [DOI] [PubMed] [Google Scholar]

[R11] Freedman DA 2009. Statistical Models: Theory and Practice (Cambridge, England: Cambridge University Press; ) [Google Scholar]

[R12] Gerlinger M, Horswell S, Larkin J, Rowan AJ, Salm MP, Varela I, Fisher R, McGranahan N, Matthews N and Santos CR 2014. Genomic architecture and evolution of clear cell renal cell carcinomas defined by multiregion sequencing Nat. Genet 46 225–33 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] Gerlinger M, Rowan AJ, Horswell S, Larkin J, Endesfelder D, Gronroos E, Martinez P, Matthews N, Stewart A and Tarpey P 2012. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing N. Engl. J. Med 366 883–92 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] Gevaert O, Echegaray S, Khuong A, Hoang CD, Shrager JB, Jensen KC, Berry GJ, Guo HH, Lau C and Plevritis SK 2017. Predictive radiogenomics modeling of EGFR mutation status in lung cancer Sci Rep 7 41674. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] Gnarra JR, Tory K, Weng Y, Schmidt L, Wei MH, Li H, Latif F, Liu S, Chen F and Duh FM 1994. Mutations of the VHL tumour suppressor gene in renal carcinoma Nature Genetics 7 85–90 [DOI] [PubMed] [Google Scholar]

[R16] Goldszmidt and Moises 1997. Bayesian Network Classifiers Mach. Learn 29 131–63 [Google Scholar]

[R17] Guo G, Gui Y, Gao S, Tang A, Hu X, Huang Y, Jia W, Li Z, He M and Sun L 2012. Frequent mutations of genes encoding ubiquitin-mediated proteolysis pathway components in clear cell renal cell carcinoma Nat. Genet 44 17–9 [DOI] [PubMed] [Google Scholar]

[R18] Hakimi AA, Chen Y-B, Wren J, Gonen M, Abdel-Wahab O, Heguy A, Liu H, Takeda S, Tickoo SK and Reuter VE 2013. Clinical and pathologic impact of select chromatin-modulating tumor suppressors in clear cell renal cell carcinoma Eur. Urol 63 848–54 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] Haralick RM, Shanmugam K and Dinstein IH 1973. Textural features for image classification IEEE Trans. Syst. Man Cyber SMC-3 610–21 [Google Scholar]

[R20] Hastie T and Tibshirani R 1996. Discriminant Analysis by Gaussian Mixtures J. Roy. Stat. Soc 58 155–76 [Google Scholar]

[R21] Jaffe CC 2012. Imaging and genomics: is there a synergy? Radiology 264 329–31 [DOI] [PubMed] [Google Scholar]

[R22] Kapur P, Peña-Llopis S, Christie A, Zhrebker L, Pavía-Jiménez A, Rathmell WK, Xie X-J and Brugarolas J 2013. Effects on survival of BAP1 and PBRM1 mutations in sporadic clear-cell renal-cell carcinoma: a retrospective analysis with independent validation Lancet Oncol 14 159–67 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] Karlo CA, Di Paolo PL, Chaim J, Hakimi AA, Ostrovnaya I, Russo P, Hricak H, Motzer R, Hsieh JJ and Akin O 2014. Radiogenomics of clear cell renal cell carcinoma: associations between CT imaging features and mutations Radiology 270 464–71 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] Keerthi SS and Lin C-J 2003. Asymptotic behaviors of support vector machines with Gaussian kernel Neural Comput 15 1667–89 [DOI] [PubMed] [Google Scholar]

[R25] Keller JM, Gray MR and Givens JA 2012. A fuzzy K-nearest neighbor algorithm IEEE T. Syst. Man Cy SMC-15 580–5 [Google Scholar]

[R26] Kohavi R 1995. A study of cross-validation and bootstrap for accuracy estimation and model selection International Joint Conference on Artificial Intelligence. Morgan Kaufmann Publishers Inc; 1137–43. [Google Scholar]

[R27] Kuo MD and Jamshidi N 2014. Behind the numbers: decoding molecular phenotypes with radiogenomics—guiding principles and technical considerations Radiology 270 320–5 [DOI] [PubMed] [Google Scholar]

[R28] Kuo MD and Yamamoto S 2011. Next generation radiologic-pathologic correlation in oncology: Rad-Path 2.0 Am. J. Roentgenol 197 990–7 [DOI] [PubMed] [Google Scholar]

[R29] Mackin D, Fave X, Zhang L, Fried D, Yang J, Taylor B, Rodriguezrivera E, Dodge C, Jones AK and Court L 2015. Measuring CT scanner variability of radiomics features Investigative Radiology 50 757. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] McGranahan N and Swanton C 2015. Biological and therapeutic impact of intratumor heterogeneity in cancer evolution Cancer Cell 27 15–26 [DOI] [PubMed] [Google Scholar]

[R31] Motzer RJ, Bacik J, Mariani T, Russo P, Mazumdar M and Reuter V 2002. Treatment outcome and survival associated with metastatic renal cell carcinoma of non–clear-cell histology J. Clin. Oncol 20 2376–81 [DOI] [PubMed] [Google Scholar]

[R32] Motzer RJ, Jonasch E, Agarwal N, Bhayani S, Bro WP, Chang SS, Choueiri TK, Costello BA, Derweesh IH and Fishman M 2017. Kidney Cancer, Version 2. 2017, NCCN Clinical Practice Guidelines in Oncology J Natl Compr Canc Netw 15 804–34 [DOI] [PubMed] [Google Scholar]

[R33] Peña-Llopis S, Vega-Rubín-de-Celis S, Liao A, Leng N, Pavía-Jiménez A, Wang S, Yamasaki T, Zhrebker L, Sivanand S and Spence P 2012. BAP1 loss defines a new class of renal cell carcinoma Nat. Genet 751–9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R34] Powles T and Albers P 2012. Management of favorable-risk patients with metastatic renal cell carcinoma: when to start and when to stop targeted therapy Clin. Genitourin. Cancer 10 213–8 [DOI] [PubMed] [Google Scholar]

[R35] Reznek RH 2004. CT/MRI in staging renal cell carcinoma Cancer Imaging 4 S25–S32 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] Rutman AM and Kuo MD 2009. Radiogenomics: creating a link between molecular diagnostics and diagnostic imaging Eur. J. Radiol 70 232–41 [DOI] [PubMed] [Google Scholar]

[R37] Sala E, Mema E, Himoto Y, Veeraraghavan H, Brenton JD, Snyder A, Weigelt B and Vargas HA 2017. Unravelling tumour heterogeneity using next-generation imaging: radiomics, radiogenomics, and habitat imaging Clin. Radiol 72 3–10 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R38] Sauk SC, Hsu MS, Margolis DJ, Lu DS, Rao NP, Belldegrun AS, Pantuck AJ and Raman SS 2011. Clear cell renal cell carcinoma: multiphasic multidetector CT imaging features help predict genetic karyotypes Radiology 261 854–62 [DOI] [PubMed] [Google Scholar]

[R39] Shinagare AB, Vikram R, Jaffe C, Akin O, Kirby J, Huang E, Freymann J, Sainani NI, Sadow CA and Bathala TK 2015. Radiogenomics of clear cell renal cell carcinoma: preliminary findings of The Cancer Genome Atlas–Renal Cell Carcinoma (TCGA–RCC) Imaging Research Group Abdom. Imaging 40 1684–92 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] Siegel RL, Miller KD and Jemal A 2017. Cancer Statistics, 2017. Ca-Cancer J. Clin 67 7–30 [DOI] [PubMed] [Google Scholar]

[R41] Smith KCK, Bennett W, Nolan T, Kirby J, Wolfsberger M, Moulton J, Vendt B, Freymann J 2016. Radiology Data from The Cancer Genome Atlas Kidney Renal Clear Cell Carcinoma (TCGA-KIRC) collection [Google Scholar]

[R42] Stewartmerrill SB, Thompson RH, Boorjian SA, Psutka SP, Lohse CM, Cheville JC, Leibovich BC and Frank I 2015. Oncologic Surveillance After Surgical Resection for Renal Cell Carcinoma: A Novel Risk-Based Approach Journal of Urology 193 e650–e1 [DOI] [PubMed] [Google Scholar]

[R43] Sung-Hyuk C 2007. Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions International Journal of Mathematical Models & Methods in Applied Sciences 1 300–7 [Google Scholar]

[R44] Valdes G, Solberg TD, Heskel M, Ungar L and Simone CB 2016. Using machine learning to predict radiation pneumonitis in patients with stage I non-small cell lung cancer treated with stereotactic body radiation therapy Phys. Med. Biol 61 6105–20 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R45] Varela I, Tarpey P, Raine K, Huang D, Ong CK, Stephens P, Davies H, Jones D, Lin M-L and Teague J 2011. Exome sequencing identifies frequent mutation of the SWI/SNF complex gene PBRM1 in renal carcinoma Nature 469 539–42 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R46] Wang Y, Yang J and Xu DL 2006. Environmental impact assessment using the evidential reasoning approach Eur. J. Oper. Res 174 1885–913 [Google Scholar]

[R47] Yamamoto S, Maki DD, Korn RL and Kuo MD 2012. Radiogenomic analysis of breast cancer using MRI: a preliminary study to define the landscape Am. J. Roentgenol 199 654–63 [DOI] [PubMed] [Google Scholar]

[R48] Yang JB and Xu DL 2002. On the evidential reasoning algorithm for multiple attribute decision analysis under uncertainty IEEE T. Syst. Man Cy. A 32 289–304 [Google Scholar]

[R49] Yang JB and Xu DL 2013. Evidential reasoning rule for evidence combination Artif. Intell 205 1–29 [Google Scholar]

[R50] Zhou Z, Folkert M, Iyengar P, Westover K, Zhang Y, Choy H, Timmerman R, Jiang S and Wang J 2017. Multi- objective radiomics model for predicting distant failure in lung SBRT Phys. Med. Biol 62 4460. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Reliable Gene Mutation Prediction in Clear Cell Renal Cell Carcinoma through Multi-classifier Multi-objective Radiogenomics Model

Xi Chen

Zhiguo Zhou

Raquibul Hannan

Kimberly Thomas

Ivan Pedrosa

Payal Kapur

James Brugarolas

Xuanqin Mou

Jing Wang

Abstract

1. Introduction

2. Materials and Methods

2.1. Data

2.1.1. Patients.

Table 1.

2.1.2. CT Image features.

Figure 1.

Table 2.

Figure 2.

2.2. MCMO predictive model

2.2.1. Evidential reasoning based classifier fusion

2.2.2. Reliable outcome prediction based on output probability similarity

2.3. Similarity-based multi-objective optimization (SMO) algorithm

2.4. Training and testing procedure of the MCMO model

Figure 3.

Figure 4.

3. Results

3.1. Experimental setup

Table 3.

3.2. Selected features for the three genes

Table 4.

Figure 5.

Figure 6.

3.3. Performance evaluation of MCMO vs. single classifiers

Table 5.

3.4. Comparative study of objective functions

Figure 7.

Figure 8 .

Table 6.

3.5. Comparative study of fusing method

Figure 9.

Table 7.

4. Discussion

5. Conclusion

Acknowledgement

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases