A neural pathomics framework for classifying colorectal cancer histopathology images based on wavelet multi-scale texture analysis

Eleftherios Trivizakis; Georgios S Ioannidis; Ioannis Souglakos; Apostolos H Karantanas; Maria Tzardi; Kostas Marias

doi:10.1038/s41598-021-94781-6

. 2021 Jul 30;11:15546. doi: 10.1038/s41598-021-94781-6

A neural pathomics framework for classifying colorectal cancer histopathology images based on wavelet multi-scale texture analysis

Eleftherios Trivizakis ^1,^2,^✉, Georgios S Ioannidis ², Ioannis Souglakos ^3,⁴, Apostolos H Karantanas ^2,⁵, Maria Tzardi ¹, Kostas Marias ^2,⁶

PMCID: PMC8324876 PMID: 34330946

Abstract

Colorectal cancer (CRC) constitutes the third most commonly diagnosed cancer in males and the second in females. Precise histopathological classification of CRC tissue pathology is the cornerstone not only for diagnosis but also for patients’ management decision making. An automated system able to accurately classify different CRC tissue regions may increase diagnostic precision and alleviate clinical workload. However, tissue classification is a challenging task due to the variability in morphological and textural characteristics present in histopathology images. In this study, an artificial neural network was trained to classify between eight classes of CRC tissue image patches derived from a public dataset with 5000 CRC histopathology image tiles. A total of 532 multi-level pathomics features examined at different scales were extracted by visual descriptors such as local binary patterns, wavelet transforms and Gabor filters. An exhaustive evaluation involving a variety of wavelet families and parameters was performed in order to shed light on the impact of scale on pathomics based CRC tissue differentiation. Our model achieved a performance accuracy of 95.3% with tenfold cross validation demonstrating superior performance compared to 87.4% reported in recent studies. Furthermore, we experimentally showed that the first and the second levels of the wavelet approximations can be used without compromising classification performance.

Subject terms: Diagnostic markers, Information technology, Computer science

Introduction

Pathomics are features stemming from the analysis of digitized histopathology images with the use of image analysis methods that extract sub-visual attributes for characterizing disease appearance and behavior. As in radiomics, the main image processing methodology used to extract a large number of characteristic image features is texture analysis. However, is it well known that texture may exist in different scales¹ and for this reason wavelet transforms have been used to characterize tissue properties as part of artificial intelligence and radiomics pipelines. The main rationale for using wavelets is their ability to decipher textural information from different scales, which has been shown to significantly improve image classification². However, the impact regarding the choice of wavelet families or level of decomposition on the performance of radiomics/pathomics methods, remains a poorly addressed topic in the literature. Wavelets are widely used in many fields of research such as time-series analysis, signal denoising³ and filtering for radiomics analysis in medical imaging⁴. In particular, Sharma et al.⁵ investigated a feature extraction method based on different multi-resolution schemes by incorporating wavelet packet transform (WPT) and Gabor wavelet transform (GWT) over the ROIs (Regions of Interest) of breast ultrasound images. The examined dataset comprised 167 cases of breast lesions including 60 fibro-adenoma, 50 carcinomas and 57 metastases. A support vector machine (SVM) classifier was used for differentiating among breast lesions yielding accuracy (ACC) up-to 89.5% and sensitivity (SN) up-to 95.5%. Wang et al.⁶ studied 3 datasets with a total of 381 T2-weighted brain images for pathological brain detection (PBD). Discrete Wavelet Packet Transform (DWPT) of the Haar wavelet family and Tsallis entropy (TE) was used for feature extraction. In particular, sixteen wavelet sub-bands were extracted from the original image and TE was calculated on sub-band coefficients. The classification of breast lesions was performed by a Feed-Forward Neural Network (FNN) trained on DWPT-TE features and achieving a performance accuracy of 99.5%. In another study⁷, the WPT of the Daubechies family was examined as a pre-processing method for the B-mode ultrasound liver images over the RF frames. Statistical and texture features such as first-order gray level parameters, gray-level co-occurrence matrix (GLCM), and local binary patterns (LBP) were calculated and machine-learning selection and classification was applied. The author claims a significant reduction in computational time up-to 75% with a cost of only 1% in terms of performance accuracy. Takruri et al.⁸ proposed a feature extraction methodology by combining the energy of multiple wavelet coefficients from melanoma histopathology images, achieving a performance accuracy of 87.1%.

Some noteworthy studies on the examined histopathology dataset⁹, introduced machine and deep learning analyses on this challenging multi-class problem. A variety of imaging features (GLCM, LBP, Gabor, histogram) and machine learning classifiers (SVM, decision trees, nearest neighbor) were examined, achieving an accuracy of 87.4%¹⁰. Cascianelli et al.¹¹ investigated a pre-trained “off-the-shelf” deep learning (ACC 84%) and machine learning approaches (ACC 72.8%-79.6%) with texture spectrum features and principal component analysis (PCA) over a wide range of datasets including the examined. Sarkar et al.¹² proposed a saliency guided dictionary learning (SDL) framework achieving ACC from 51.1 to 73.7% for a 7-class subset of the original set excluding patches from the background class. However, it should be noted that these studies did not follow a multi-scale approach in feature extraction which can be the reason for the limited performance reported.

In this paper, the WPT was used both as a composite pre-processing (filtering), image decomposition scheme and as a feature extraction methodology for improving the performance and accelerating the convergence of machine learning and artificial neural network (ANN) models. It is noteworthy that in oncology imaging studies with wavelets^13–18 the impact of different families or level of decomposition on the performance of their experimental methods is rarely investigated. The lack of such an analysis highlights a critical scientific question regarding the optimal use of wavelets in the context of radiomics or pathomics approaches in medical image analysis in order to evaluate the effectiveness of image features at different frequency bands. To this end, an exhaustive multi-scale pathomics analysis integrating WPT filtering with all available wavelet transforms and different levels of decomposition was conducted to identify the optimal set of level, sub-band and family for improved CRC tissue differentiation. In particular, improvements such as lower computational cost of the analysis and increased classification performance (up-to ACC 95.3%) were observed in the examined histopathology dataset compared to the current state-of-the-art literature¹⁰ (ACC 87.4%). The main contribution of the presented paper is the first in-depth application of wavelet analysis for pathomics-based multi-class pathology tissue classification, investigating the role of scale through wavelet image decomposition and achieving improved performance compared to the state-of-the-art methods.

Results

Sample selection bias is a common risk when applying machine learning techniques, particularly in ANN. This can lead to a compromised model with limited generalization capacity. A hold-out method was performed on the full dataset for randomly selecting the 10% of the images as a validation set for applying feature selection on the SVM experiments and hyperparameter optimization during the ANN convergence. In addition to the hold-out method, an iterative tenfold cross-validation process was applied on the rest 90% for splitting the images into a training set for model fitting and an unseen testing set per split for model evaluation. In particular, the validation set consisted of 500 images and at each cross-validation split, 4050 images were used as a training set and 450 for the testing set. The class balance across all sets was preserved to assess an unbiased performance analysis. There are 625 tissue samples (12.5%) of each of the eight categories in the examined colorectal dataset. Maintaining these ratios in the smaller validation and testing sets is important for both the feature selection and model evaluation tasks respectively, because it ensures that each class is represented equally and no bias is introduced from the data stratification process. The studied tissue tiles were subjected to wavelet decomposition up to six layers and using the six discrete wavelet families (bior, rbio, db, sym, haar, and coif), generating 378 sets of approximated images. A corresponding number of sets of imaging features were calculated from the WPT images. This resulted in the same number of models per analysis pipeline (baseline SVM, selected features with SVM and ANN). A multi-level feature extraction method (Fig. 4) was applied on the decomposed wavelet images. A total of 532 features comprised of first order statistics of pixel intensities, local binary patterns, and gabor filtered images combined with gray level co-occurrence matrices, and higher order features such as contrast, dissimilarity, homogeneity, correlation, angular second moment, and energy. A subset of these features were selected based on the two criteria: (a) the statistical significance of F with a p-value less than 0.05 and (b) the F should be greater than the F_{critical value}. The F_{critical value} was calculated by the percent point function¹⁹ using the aforementioned threshold in conjunction with the degrees of freedom of the dataset. It should be noted that the best features were comprised of first order statistics over the raw wavelet and gabor image (4 pixels per circle), local binary pattern features with finer quantization of the angular space, energy, dissimilarity, correlation, angular second moment from grey level features with a distance of one pixel. Three classification strategies were applied: support vector machines with radial basis function trained on full pathomics or selected pathomics and artificial neural networks trained on full pathomics. Only two parameters were variable across each analysis pipeline: the mother wavelet and the level of decomposition. Hyper-parameter optimization (number of layers, number of neurons, activation functions) was applied to assess the fitting status and mitigate the overfitting of the ANN. On average, more than 200 epochs were required for the ANN model to converge. The use of an early-stopping technique, in which the training is terminated when the validation error reaches a global minima, aided in preventing model overtraining. A few thousand models were evaluated during the experimental phase of this study. The computational infrastructure used during the experiments comprises an 8-core i7 processor with 24 gigabytes of RAM and a GTX 1070 graphics processor with 8 gigabytes of VRAM. Approximately 150 hours required for the proposed feature extraction process to complete on the entire dataset of 5000 colorectal image patches. This includes the calculation of all the examined wavelet families with up-to six levels of decomposition, selection and the extraction of the multi-level features. Additionally, 20 hours required for the SVM classification (with and without feature selection) and approximately 40 hours for the ANN convergence and evaluation. The SVM-based models (with and without feature selection) achieved accuracy up-to 84%, which is comparable to the published “off-the-shelf” deep learning approach¹¹. It is worth noting that models based on ANNs scored the highest performance (up-to ACC 95.3%), particularly in the second level of approximation of symlet wavelet family, as evident by Table 1. A comprehensive analysis of the performance grouped by wavelet family is presented in Fig. 1 and by the level of decomposition in Fig. 2. Additional resources such as the source code and the corresponding optimized parameters can be made available upon request.

The proposed multi-level feature extraction strategy including several combinations of WPT, Gabor filters, LBP, GLCM, first and high order statistics. *WPT* wavelet packet transform, *LBP* local binary patterns, *GLCM* grey level co-occurrence matrices, *FOS* first order statistics, *HOS* higher order statistics. (Figure created in PowerPoint 2013, https://www.microsoft.com/microsoft-365/powerpoint).

Table 1.

Performance analysis of the current literature in comparison to the tenfold average accuracy of the examined SVM-based and ANN models. Bold font indicates the best performance.

	State-of-the-art	Method	Input features	ACC (%)
Literature	Kather et al.¹⁰	SVM	GLCM, LBP, Gabor	87.4
	Cascianelli et al.¹¹	NNC/VGG-based “off-the-shelf”	LBP, Texture Spectrum, PCA / Deep Features	79.60/84.00
	Sarkar et al.¹²	SDL	Local Gabor filtering	73.7

	WPT family	Levels	# Features	ACC (avg ± std%)
Baseline SVM	rbio	1	155	84.04 ± 1.98
	bior	2	155	83.22 ± 2.34
	coif	2	140	83.21 ± 1.24
	sym	1	135	83.04 ± 1.63
	rbio	2	532	82.76 ± 1.67
	bior	2	532	82.18 ± 1.35
	coif	2	532	82.16 ± 1.57
	db	2	532	82.10 ± 1.68

	Artificial neural network		Input layers	Hidden layers	ACC (avg ± std%)
Neural pathomics	Number of neurons	64	532	1 × sigmoid	87.84 ± 1.16
		128			90.66 ± 1.23
		256			92.46 ± 1.24
		512			94.34 ± 0.77
		1024			94.91 ± 0.91
		2048			93.54 ± 2.31
		64	532	2 × sigmoid	86.4 ± 1.58
		128			89.68 ± 1.84
		256			92.77 ± 1.71
		512			94.56 ± 2.01
		1024			95.32 ± 2.16
		2048			94.13 ± 1.94

Open in a new tab

Performance analysis of wavelet families: Each line shows the percentage of each mother wavelet with respect to the total number of models achieving different levels of accuracy. db daubechies, *sym* symlets, *coif* coiflets, *bior* biorthogonal, *rbio* reverse biorthogonal. (Figure created in Excel 2013, https://www.microsoft.com/microsoft-365/excel).

Performance analysis among different levels of wavelet packet transform: the first and second levels exhibit high accuracies while in higher levels of decomposition the texture analysis error is significantly affected. (Figure created in Excel 2013, https://www.microsoft.com/microsoft-365/excel).

Discussion

To date, wavelet decomposition has been used for feature extraction or as a filtering method in medical image analysis and radiomics but the selection of its parameters has not been thoroughly investigated^13–18 which is indicative of the lack of research on the role of scale in most radiomics and pathomics studies. Since wavelet image transforms produce a series of band-pass outputs at each scale, i.e. the wavelet coefficients, there is a need for an exhaustive analysis for determining the impact of scale representation in the feature extraction process and the performance of the developed models. Depending on the application, different frequency bands might provide more invariant image information and therefore salient features for classification. In this study, we presented a pathomics methodology for addressing a challenging multiclass CRC tissue classification problem on an open-access dataset while investigating in parallel the impact of different mother wavelets on the discriminative power of the proposed analysis in order to incorporate optimal, scale-related information within the proposed pathomics pipeline. The WPT approximation pipeline enabled a low classification error despite projecting the original image to a much lower spatial resolution. Biorthogonal, reverse biorthogonal and symlets were among the best performing mother wavelets across all ANN models for this pathomics classification problem. Notably, the same three families were among the most accurate SVM models according to Table 1. Overall, ANN models achieved the highest accuracy because of their superior fully-automated selection and combination of pathomic features. The state-of-the-art machine learning analysis¹⁰ on full image tile resolution (150 by 150 pixels) performed significantly lower than the proposed methodology (ACC 87.4% versus 95.3%), as shown in Table 1. The reduced spatial dimensionality is a significant advantage of this approach (at least for the first and second level) with the examined samples spanning across 39 by 39 to 79 by 79 pixels in contrast to an analysis with full pixel array resolution. Despite this substantial reduction in spatial information prior to feature extraction, the proposed SVM analysis achieves similar performance to the state-of-the-art approaches (Table 1, baseline SVM 84% versus 87.4¹⁰/79.6-84¹¹/73.7¹²%). ANN models were introduced to address the limitations of a feature-based selection process that was followed by the proposed SVM and examined studies^10–12, enabling a fully automated analysis of the complete pathomics feature vector. As a result, a significant reduction in terms of classification error was achieved. The use of WPT in the proposed pathomics classification approach led to an efficient feature extraction pipeline exploiting a variety of mother wavelet transformations with lower computational complexity. Nevertheless, the prediction error increased when deeper levels of WPT decomposition were applied regardless of the mother wavelet, feature extraction or classification method. This effect can be observed in Fig. 2 and can be attributed to the reduced spatial resolution and information content of the approximated image which depends on the applied level of decomposition. The execution times of the WPT-based over the original image tiles were improved on average from 28.4% up to 49% per image depending on the level of decomposition, as reported in

Table 2 Additionally, it is worth noting that in this study the full dataset with all 8 classes was used for the proposed analysis while Sarkar et al.¹² excluded the background class. The first and the second decomposition levels were found (Fig. 2) to perform best compared to the higher levels of scale. This can be explained by the significant loss of spatial information after the second level of WPT, even below the previously mentioned 39 by 39 pixels. The insights from the parameter optimization process of WPT (Figs. 1, 2) can be useful for image analysis tasks that deal with complex tissue differentiation as for example in radiomic applications with wavelet filtering. Additionally, the encouraging results of the presented study could be the first step towards an automated annotation system providing a fast method for patch extraction over multiple high resolution histopathology images with no manual pixel- or patch-based annotations available. A few potential limitations of WPT for image analysis include the non-intuitive parameter optimization process involving mother wavelets, sub-bands and level of decomposition. Additionally, the feature selection process and the lack of a well-founded process for optimal combination of the best sub-bands or high-level features can render such analysis hard to perform. An interesting endeavor in future studies would be to analyze the impact of wavelet filtering and decomposition on deep learning convergence or as a data augmentation technique with the additional benefits of enhanced feature engineering. Sub-class discrimination for oncology and radio-, patho-genomics could also be attractive application domains for further research on the use of wavelets transformations as a mean to augment information extraction in a multi-scale fashion.

Table 2.

Execution times of the proposed feature extraction methodology for the original image and different levels of WPT.

Wavelet—level	Pixel array size (pixels)	Time per image (avg ± std second)
Original tile	150 by 150	1.41 ± 0.26
Bior1.1—1	75 by 75	1.01 ± 0.16
Bior1.1—2	38 by 38	0.81 ± 0.22
Bior1.1—3	19 by 19	0.75 ± 0.11
Bior1.1—4	10 by 10	0.72 ± 0.09
Bior1.1—5	5 by 5	0.72 ± 0.08
Bior1.1—6	3 by 3	0.71 ± 0.12

Open in a new tab

In this study, an extensive optimization of WPT parameters was implemented and significant improvements in terms of discrimination accuracy for the examined CRC tissue classification problem were achieved. This is in line with our initial working hypothesis that texture is a scale-dependent phenomenon that may significantly affect performance in radiomics/pathonics applications. Our results indicate that proper customization of a wavelet analysis should be prioritized in the design of a pathomics computational pipeline. The mother wavelet and sub-band selection can affect the classification performance of the analysis, thus providing a multi-resolution framework for implicit feature engineering. Furthermore, efficiency in computational cost of the extraction of complex features can be achieved with the projection into a lower dimensional space through WPT decomposition. Our results also confirm that ANNs with their superior fully-automated analysis can address the shortcomings of other machine learning techniques in terms of feature selection, combination and differentiation. The results of the current study should be tested in independent cohorts and in correlation with genomics findings and patients’ outcome.

Methods

Histopathology images dataset

The examined colorectal cancer dataset⁹ titled “Collection of textures in colorectal cancer histology” is available online as an open-access repository via the following link (https://doi.org/10.5281/zenodo.53169). The formalin-fixed paraffin-embedded colorectal primary tumor tiles were gathered from the University Medical Center Mannheim (Heidelberg University) in Germany. Manual extraction of 5000 patches with 74 by 74 μm dimension (0.495 μm per pixel) was performed in a set of ten digitized hematoxylin & eosin (H&E) stained tissue slides. In particular, the patches were organized into eight classes (Fig. 3) including seven tissue types from the tumor epithelium, simple stroma, complex stroma, lymphoid follicles, debris, mucosal glands, adipose and background patches with no tissue.

Samples from the examined dataset and the proposed multi-level textural analysis pipeline featuring wavelet packet transform, local binary patterns, Gabor, grey level co-occurrence matrices, first order statistics concatenated in a pathomics signature for support vector machines and artificial neural networks tissue classification. (Figure created in PowerPoint 2013, https://www.microsoft.com/microsoft-365/powerpoint).

Experimental protocol

The wide variety of cell and tissue types in the studied dataset constitute a complex classification problem and was the main motivation for proposing a framework for investigating the impact of multi-scale wavelet transformations in terms of texture analysis. WPT was applied on the histopathology tiles using every discrete mother wavelet and level of decomposition available in the pywavelets library²⁰. In particular, six families of wavelets with a total of 105 variations and up-to 6 levels of decompositions were applied on the original tissue tiles and resulting in 378 datasets of approximated images were calculated. For each one of these derived datasets with different wavelet setting, a composite feature extraction process was executed as is illustrated in Fig. 4. The features included multiple combinations of texture-based features (WPT-LBP, WPT-Gabor filters, WPT-GLCM, WPT-LBP-GLCM, WPT-Gabor-GLCM, and other as depicted in Fig. 4), first order statistics (FOS) and higher order statistics (HOS) as described in the “Statistical features” section below. Thus, the resulted pathomics vector consisted of 532 features extracted from each level of decomposition in the proposed analysis (Fig. 4). The overall analysis is presented in Fig. 3. “Best practices” in machine learning analysis²¹ were considered mainly by applying a class-preserving hold-out method with 10% of the full dataset for determining a subset of pathomics for feature selection (SVM analysis) and hyperparameter optimization/early stopping (ANN analysis). Additionally, on the remaining images a 10-fold cross-validation was performed iteratively for splitting into training and unseen testing sets. The three separate sets ensured a fair and unbiased model evaluation. In total, three different classification strategies were performed, two based on SVMs and one on ANN. The SVMs were applied on the full pathomics vector and on a set of statistical significant features. An extensive hyperparameter optimization was performed to identify the best configuration of ANN, including the learned parameters (number of neurons and layers) and other parameters such as learning rate, dropout rate, activation functions and optimizer. The ANN models were trained on the 532-pathomics vector.

Decomposition and feature extraction

The proposed feature extraction approach is comprised of individual operators such as WPT, Gabor, LBP and GLCM paired with first-order statistics and combinations of higher order statistics methods. This diverse set of simple and complex features with multiple levels of analysis aims in forming a discriminative and robust pathomics signature. The final vector consists of 532 imaging features. A detailed depiction of the proposed multi-level feature extraction methodology is provided in Fig. 4.

Wavelet packet transform (WPT)

Since a thorough description of the mathematical background is demanding and beyond the scope of this work, we briefly, present the one dimensional continuous wavelet transform below. Further in depth information regarding the two dimensional discrete wavelet transform applied to images can be found in literarure^22,23.

Assume a function f(t) in the set of square integrable functions $L^{2} (R)$ . A continuous wavelet transform (CWT) shows how f(t) is decomposed into a set of basis functions g(t), called wavelets, and is given by:

C W T_{g} f (s, τ) = \underset{- \infty}{\int^{\infty}} f (t) g_{s, t}^{*} (t) d t

In Eq. (1), *represents the complex conjugate; $s, τ$ are the variables related to scale and translation after the transform and are typically the new dimensions. In the generalized case of two dimensional CWT a scaling function and three two dimensional wavelets are compulsory which are responsible for functional variations (image intensity changes) along different directions such as variations along columns (horizontal edges), variations along rows (vertical edges) and variations along diagonals giving the sense of image texture.

To explore textural information from the studied histopathology images a variety of bases of discrete mother wavelets were used from the pywavelets library such as Haar (haar), Daubechies (db), Symlets (sym), Coiflets (coif), Biorthogonal (bior), and Reverse biorthogonal (rbio). Other families of such as Gaussian wavelets (gaus), Mexican hat wavelet (mexh), Morlet wavelet (morl), Complex Gaussian wavelets (cgau), Shannon wavelets (shan), Frequency B-Spline wavelets (fbsp), and Complex Morlet wavelets (cmor) wavelets were rejected because of their continuous nature. An indicative illustration of these wavelet transforms applied to a lymphoid follicles tile image is depicted in Fig. 5.

Differences among mother wavelets at the second level of decomposition for part (red bounding box) of the original image. db daubechies, *sym* symlets, *coif* coiflets, *bior* biorthogonal, *rbio* reverse biorthogonal. (Figure created in PowerPoint 2013, https://www.microsoft.com/microsoft-365/powerpoint).

Gabor filters

The wavelet transformed image tiles were used as the input for filtering with multiple Gabor kernels²⁴ calculated over five directions and four wavelengths from 4 to 20 pixels per cycle. Additionally, various rotation angles (from 70° to 190°) were employed on the original input image.

G_{c [i, j]} = K e^{- \frac{(i^{2} + j^{2})}{2 σ^{2}}} \cos (2, π, f, (i c o s θ + j s i n θ))

G_{s [i, j]} = K e^{- \frac{(i^{2} + j^{2})}{2 σ^{2}}} s i n (2 π f (i c o s θ + j s i n θ))

In Eqs. (2 and 3) K is the scale factor, f the frequency of the complex harmonic function, θ is the direction of the filter and σ the standard deviation in any filter direction.

Local binary patterns (LBP)

The LBP images were calculated over the WPT approximations with three methods including gray scale invariant, variance measures of the intensities of local texture, and uniform patterns with finer set points²⁵. A radius of one pixel and with eight symmetric neighbors was selected for calculating this visual descriptor. The arrow path in Fig. 4 represents the integration of LBP images with statistical, Gabor and GLCM operators.

{LBP}_{P, R} (x_{c}, y_{c}) = \sum_{P = 0}^{P - 1} s (i_{P} - i_{c}) 2^{P}

s (x) = \{\begin{matrix} 1 i f x \geq 0 \\ 0 i f x < 0 \end{matrix})

In Eq. (4) P denotes the neighbor pixels, c the central pixel of the operator, i the gray-level value of their respective subscript, and s is defined in the fifth equation.

Statistical features

First order features were extracted over the approximated WPT, Gabor filtered or LBP images including metrics such as maximum, minimum, mean and standard deviation of pixel values of the feature maps. Gray Level Co-occurrence Matrix (GLCM) features are based on the co-occurrence of intensities of the pixels at a given direction and distance²⁶. In particular, four angles from 0° to 135° and three distances from 1 to 5 pixels were calculated. A set of higher order statistical features such as contrast [Eq. (6)], dissimilarity [Eq. (7)], homogeneity [Eq. (8)], correlation [Eq. (9)], angular second moment (ASM) [Eq. (10)] and energy [Eq. (11)], were computed from the calculated co-occurrence matrices.

C o n t r a s t : \sum_{i, j = 0}^{N - 1} P_{i, j} {(i - j)}^{2}

D i s s i m i l a r i t y : \sum_{i, j = 0}^{N - 1} P_{i, j} | i - j |

H o m o g e n e i t y : \sum_{i, j = 0}^{N - 1} \frac{P_{i, j}}{1 + {(i - j)}^{2}}

C o r r e l a t i o n : \sum_{i, j = 0}^{N - 1} P_{i, j} [\frac{(i - μ_{i}) (j - μ_{j})}{\sqrt{(σ_{i}^{2}) (σ_{j}^{2})}}]

A S M : \sum_{i, j = 0}^{N - 1} P_{i, j}^{2}

E n e r g y : \sqrt{ASM}

In the above equations μ denotes mean, σ² the variance of the intensities of the reference pixels, N is the number of gray levels and P_i,j the normalized GLCM element.

Feature selection

The analysis of variance (ANOVA²⁷ Test) was used for feature selection on the extracted pathomics in one SVM model of the presented analysis (the other used the full feature vector). The joint analysis of p-values with the corresponding F-values of the examined vector supported the identification of a subset of features establishing a compact and discriminative signature. This method was applied during the experimental phase and solely on the validation set reducing the dimensionality of feature vector. Based on these derived feature labels, the statistically significant components were then selected in both training and testing sets. The later remained unseen throughout this process. No feature selection method was applied during the ANN experiments.

Classification

The classification was performed among eight classes including seven tissue types and the background of the tiles. Two different machine learning approaches for classification were applied: support vector machines and artificial neural networks. Multiple radial basis function (rbf) based SVM models were trained on selected and full pathomics to establish the baseline of the analysis. ANN were trained and evaluated on the complete pathomics signature as high-level data analysis models for end-to-end and fully-automated feature selection and classification. A cross-validation data stratification technique was applied to separate the dataset in ten parts one serving as the unseen testing set and the rest as the training set with iteratively interchangeable roles among the splits.

Performance evaluation

The studied dataset is comprised of 8-classes, as discussed mentioned. Thus, the multiclass model was assessed by the classification accuracy where TP, TN, FP, and FN stand for true-positive, true-negative, false-positive and false-negative respectively. The performance metrics presented in the following sections refer to the average accuracy of the 10-fold cross-validation.

A C C = \frac{T P + T N}{T P + T N + F P + F N}

Author contributions

E.T. conceived and designed the study. E.T., G.I., K.M. contributed in the performed analysis and interpretation of data and drafted the manuscript. E.T., G.I., I.S., A.K., M.T. and K.M. contributed in the literature research, interpretation of data and revised the manuscript. I.S., A.K., M.T. contributed in the clinical aspects as well in the critical revision of the paper. K.M. contributed in the critical revision of the paper and was the guarantor of integrity of entire study. All authors agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated, and finally approved the version of the manuscript to be published.

Funding

This study was financially supported by the Stavros Niarchos Foundation within the framework of the project ARCHERS (‘Advancing Young Researchers' Human Capital in Cutting Edge Technologies in the Preservation of Cultural Heritage and the Tackling of Societal Challenges’).

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Ferro CJS, Warner TA. Scale and texture in digital image classification. Photogramm. Eng. Remote Sensing. 2002;68:51–63. [Google Scholar]
2.de Siqueira FR, RobsonSchwartz W, Pedrini H. Multi-scale gray level co-occurrence matrices for texture description. Neurocomputing. 2013;120:336–345. doi: 10.1016/j.neucom.2012.09.042. [DOI] [Google Scholar]
3.Gao, R. X. & Yan, R. Wavelet Packet Transform. in Wavelets 69–81 (Springer, 2011). 10.1007/978-1-4419-1545-0_5.
4.Van Griethuysen JJM, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77:e104–e107. doi: 10.1158/0008-5472.CAN-17-0339. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Sharma, S., Jain, S. & Bhusri, S. Classification of breast lesions using gabor wavelet filter for three classes. in 4th International Conference on “Computing for Sustainable Global Development” 6282–6284 (2017).
6.Wang S, et al. Pathological brain detection via wavelet packet tsallis entropy and real-coded biogeography-based optimization. Fundam. Inform. 2017;151:275–291. doi: 10.3233/FI-2017-1492. [DOI] [Google Scholar]
7.Amin MN, et al. Wavelet-based computationally-efficient computer-aided characterization of liver steatosis using conventional B-mode ultrasound images. Biomed. Signal Process. Control. 2019;52:84–96. doi: 10.1016/j.bspc.2019.03.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Takruri M, Abu Mahmoud MK, Al-Jumaily A. PSO-SVM hybrid system for melanoma detection from histo-pathological images. Int. J. Electron. Comput. Eng. 2019;9:2941. [Google Scholar]
9.Kather JN, 2016. Collection of textures in colorectal cancer histology. Zenodo. [DOI]
10.Kather JN, et al. Multi-class texture analysis in colorectal cancer histology. Sci. Rep. 2016;6:1–11. doi: 10.1038/srep27988. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Cascianelli, S. et al. Dimensionality reduction strategies for CNN-based classification of histopathological images. in Smart Innovation, Systems and Technologies vol. 76, 21–30 (Springer, 2018).
12.Sarkar R, Acton ST. SDL: Saliency-based dictionary learning framework for image similarity. IEEE Trans. Image Process. 2018;27:749–763. doi: 10.1109/TIP.2017.2763829. [DOI] [PubMed] [Google Scholar]
13.Kontopodis, E. et al. DCE-MRI radiomics features for predicting breast cancer neoadjuvant therapy response. in IST 2018 IEEE International Conference on Imaging Systems and Techniques, Proceedings (Institute of Electrical and Electronics Engineers Inc., 2018). 10.1109/IST.2018.8577128.
14.Wilson R, Devaraj A. Radiomics of pulmonary nodules and lung cancer. Transl. Lung Cancer Res. 2017;6:86–91. doi: 10.21037/tlcr.2017.01.04. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Chaddad A, Daniel P, Niazi T. Radiomics evaluation of histological heterogeneity using multiscale textures derived from 3D wavelet transformation of multispectral images. Front. Oncol. 2018;8:1. doi: 10.3389/fonc.2018.00001. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Liu Y, et al. Radiomic features are associated with EGFR mutation status in lung adenocarcinomas. Clin. Lung Cancer. 2016;17:441–448.e6. doi: 10.1016/j.cllc.2016.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Coroller TP, et al. Radiomic phenotype features predict pathological response in non-small cell Radiomic predicts pathological response lung cancer. Radiother. Oncol. 2016;119:480–486. doi: 10.1016/j.radonc.2016.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Soufi M, Arimura H, Nagami N. Identification of optimal mother wavelets in survival prediction of lung cancer patients using wavelet decomposition-based radiomic features. Med. Phys. 2018;45:5116–5128. doi: 10.1002/mp.13202. [DOI] [PubMed] [Google Scholar]
19.Virtanen P, et al. SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nat. Methods. 2020;17:261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Lee G, Gommers R, Waselewski F, Wohlfahrt K, O’Leary A. PyWavelets: A Python package for wavelet analysis. J. Open Source Softw. 2019;4:1237. doi: 10.21105/joss.01237. [DOI] [Google Scholar]
21.Wujek, B., Hall, P. & Güneș, F. Best Practices for Machine Learning Applications. 1–23 (SAS Inst. Inc, 2016).
22.Mallat SG. A theory for multiresolution signal decomposition: The wavelet representation. IEEE Trans. Pattern Anal. Mach. Intell. 1989;11:674–693. doi: 10.1109/34.192463. [DOI] [Google Scholar]
23.Yang L, Tang YY, Sun Q. Implementation of 2D discrete wavelet transform by number theoretic transform and 2D overlap-save method. Math. Probl. Eng. 2014;2014:1–15. [Google Scholar]
24.Gabor D. Theory of communication. J. Inst. Electron. Eng. 1947;I(94):58–58. [Google Scholar]
25.Ojala T, Pietikäinen M, Mäenpää T. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 2002;24:971–987. doi: 10.1109/TPAMI.2002.1017623. [DOI] [Google Scholar]
26.Haralick RM, Dinstein I, Shanmugam K. Textural features for image classification. IEEE Trans. Syst. Man Cybern. 1973;3:610–621. doi: 10.1109/TSMC.1973.4309314. [DOI] [Google Scholar]
27.Pedregosa F, et al. Scikit-learn: Machine learning in {P}ython. J. Mach. Learn. Res. 2011;12:2825–2830. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

Kather JN, 2016. Collection of textures in colorectal cancer histology. Zenodo. [DOI]

[CR1] 1.Ferro CJS, Warner TA. Scale and texture in digital image classification. Photogramm. Eng. Remote Sensing. 2002;68:51–63. [Google Scholar]

[CR2] 2.de Siqueira FR, RobsonSchwartz W, Pedrini H. Multi-scale gray level co-occurrence matrices for texture description. Neurocomputing. 2013;120:336–345. doi: 10.1016/j.neucom.2012.09.042. [DOI] [Google Scholar]

[CR3] 3.Gao, R. X. & Yan, R. Wavelet Packet Transform. in Wavelets 69–81 (Springer, 2011). 10.1007/978-1-4419-1545-0_5.

[CR4] 4.Van Griethuysen JJM, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77:e104–e107. doi: 10.1158/0008-5472.CAN-17-0339. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Sharma, S., Jain, S. & Bhusri, S. Classification of breast lesions using gabor wavelet filter for three classes. in 4th International Conference on “Computing for Sustainable Global Development” 6282–6284 (2017).

[CR6] 6.Wang S, et al. Pathological brain detection via wavelet packet tsallis entropy and real-coded biogeography-based optimization. Fundam. Inform. 2017;151:275–291. doi: 10.3233/FI-2017-1492. [DOI] [Google Scholar]

[CR7] 7.Amin MN, et al. Wavelet-based computationally-efficient computer-aided characterization of liver steatosis using conventional B-mode ultrasound images. Biomed. Signal Process. Control. 2019;52:84–96. doi: 10.1016/j.bspc.2019.03.010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Takruri M, Abu Mahmoud MK, Al-Jumaily A. PSO-SVM hybrid system for melanoma detection from histo-pathological images. Int. J. Electron. Comput. Eng. 2019;9:2941. [Google Scholar]

[CR9] 9.Kather JN, 2016. Collection of textures in colorectal cancer histology. Zenodo. [DOI]

[CR10] 10.Kather JN, et al. Multi-class texture analysis in colorectal cancer histology. Sci. Rep. 2016;6:1–11. doi: 10.1038/srep27988. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Cascianelli, S. et al. Dimensionality reduction strategies for CNN-based classification of histopathological images. in Smart Innovation, Systems and Technologies vol. 76, 21–30 (Springer, 2018).

[CR12] 12.Sarkar R, Acton ST. SDL: Saliency-based dictionary learning framework for image similarity. IEEE Trans. Image Process. 2018;27:749–763. doi: 10.1109/TIP.2017.2763829. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Kontopodis, E. et al. DCE-MRI radiomics features for predicting breast cancer neoadjuvant therapy response. in IST 2018 IEEE International Conference on Imaging Systems and Techniques, Proceedings (Institute of Electrical and Electronics Engineers Inc., 2018). 10.1109/IST.2018.8577128.

[CR14] 14.Wilson R, Devaraj A. Radiomics of pulmonary nodules and lung cancer. Transl. Lung Cancer Res. 2017;6:86–91. doi: 10.21037/tlcr.2017.01.04. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Chaddad A, Daniel P, Niazi T. Radiomics evaluation of histological heterogeneity using multiscale textures derived from 3D wavelet transformation of multispectral images. Front. Oncol. 2018;8:1. doi: 10.3389/fonc.2018.00001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Liu Y, et al. Radiomic features are associated with EGFR mutation status in lung adenocarcinomas. Clin. Lung Cancer. 2016;17:441–448.e6. doi: 10.1016/j.cllc.2016.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Coroller TP, et al. Radiomic phenotype features predict pathological response in non-small cell Radiomic predicts pathological response lung cancer. Radiother. Oncol. 2016;119:480–486. doi: 10.1016/j.radonc.2016.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Soufi M, Arimura H, Nagami N. Identification of optimal mother wavelets in survival prediction of lung cancer patients using wavelet decomposition-based radiomic features. Med. Phys. 2018;45:5116–5128. doi: 10.1002/mp.13202. [DOI] [PubMed] [Google Scholar]

[CR19] 19.Virtanen P, et al. SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nat. Methods. 2020;17:261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Lee G, Gommers R, Waselewski F, Wohlfahrt K, O’Leary A. PyWavelets: A Python package for wavelet analysis. J. Open Source Softw. 2019;4:1237. doi: 10.21105/joss.01237. [DOI] [Google Scholar]

[CR21] 21.Wujek, B., Hall, P. & Güneș, F. Best Practices for Machine Learning Applications. 1–23 (SAS Inst. Inc, 2016).

[CR22] 22.Mallat SG. A theory for multiresolution signal decomposition: The wavelet representation. IEEE Trans. Pattern Anal. Mach. Intell. 1989;11:674–693. doi: 10.1109/34.192463. [DOI] [Google Scholar]

[CR23] 23.Yang L, Tang YY, Sun Q. Implementation of 2D discrete wavelet transform by number theoretic transform and 2D overlap-save method. Math. Probl. Eng. 2014;2014:1–15. [Google Scholar]

[CR24] 24.Gabor D. Theory of communication. J. Inst. Electron. Eng. 1947;I(94):58–58. [Google Scholar]

[CR25] 25.Ojala T, Pietikäinen M, Mäenpää T. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 2002;24:971–987. doi: 10.1109/TPAMI.2002.1017623. [DOI] [Google Scholar]

[CR26] 26.Haralick RM, Dinstein I, Shanmugam K. Textural features for image classification. IEEE Trans. Syst. Man Cybern. 1973;3:610–621. doi: 10.1109/TSMC.1973.4309314. [DOI] [Google Scholar]

[CR27] 27.Pedregosa F, et al. Scikit-learn: Machine learning in {P}ython. J. Mach. Learn. Res. 2011;12:2825–2830. [Google Scholar]

PERMALINK

A neural pathomics framework for classifying colorectal cancer histopathology images based on wavelet multi-scale texture analysis

Eleftherios Trivizakis

Georgios S Ioannidis

Ioannis Souglakos

Apostolos H Karantanas

Maria Tzardi

Kostas Marias

Abstract

Introduction

Results

Figure 4.

Table 1.

Figure 1.

Figure 2.

Discussion

Table 2.

Methods

Histopathology images dataset

Figure 3.

Experimental protocol

Decomposition and feature extraction

Wavelet packet transform (WPT)

Figure 5.

Gabor filters

Local binary patterns (LBP)

Statistical features

Feature selection

Classification

Performance evaluation

Author contributions

Funding

Competing interests

Footnotes

References

Associated Data

Data Citations

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases