Abstract
Twin support vector regression (TSVR) is generally employed with the ε-insensitive loss function, which is not well suited to handling noise and outliers. By definition, the Huber loss function behaves quadratically for small errors and linearly for larger ones, and it shows better performance than the Gaussian loss; hence it readily accommodates different types of noise and outliers. Recently, TSVR with Huber loss (HN-TSVR) has been suggested to handle noise and outliers. Like TSVR, it suffers from a possible singularity problem, which degrades the performance of the model. In this paper, a regularized version of HN-TSVR, termed regularization based twin support vector regression with Huber loss (RHN-TSVR), is proposed to avoid the singularity problem of HN-TSVR by applying the structural risk minimization principle, which makes our model convex and well-posed. The proposed RHN-TSVR model is well capable of handling noise as well as outliers and avoids the singularity issue. To show the validity and applicability of the proposed RHN-TSVR, various experiments are performed on several artificially generated datasets having uniform, Gaussian and Laplacian noise as well as on different benchmark real-world datasets, and the results are compared with support vector regression, TSVR, ε-insensitive asymmetric Huber SVR, ε-support vector quantile regression and HN-TSVR. All benchmark real-world datasets are embedded with different significant levels of noise, 0%, 5% and 10%, for the reported algorithms and the proposed approach. The proposed RHN-TSVR algorithm shows better prediction ability on artificial as well as real-world datasets with different significant levels of noise compared to the other reported models.
Keywords: Support vector regression, Twin support vector regression, Gaussian noise, Huber loss, Laplacian noise
Introduction
Over the last decade, the support vector machine (SVM) [13] has played a leading role in classification and regression problems, utilizing the gist of statistical learning theory. State-of-the-art methods like SVM attract many areas such as medical imaging [40, 69], financial time series [7, 9], cybercrime [18], remote sensing applications [10, 44], sediment prediction [29] and many more. The main aim of SVM is to obtain the optimal solution by solving a quadratic programming problem (QPP) (Cristianini and Shawe-Taylor [15]) under the structural risk minimization principle. SVMs are well suited to handling small-sized data samples. They have better generalization performance in binary classification but a high learning cost, i.e. O(m³) for m training samples, and are more sensitive to noise in the training data. To reduce these drawbacks of SVMs, a twin version of SVM for binary classification, named TWSVM, was formulated by Jayadeva et al. [36]. TWSVM is highly motivated by PSVM [23]: it generates two non-parallel hyperplanes, each nearer to the positive or negative class, respectively, and at least unit distance from the other class. Experimental results reported in the literature show that TWSVM is approximately four times faster than standard SVM because it solves a pair of small-sized QPPs rather than one complex QPP. The popularity of SVM based models has increased due to this low learning cost, e.g. Kumar and Gopal [39], Gupta and Gupta [26], Gupta et al. [28], Bai et al. [3] and Tang et al. [60, 61]. For the regression problem, SVR has been used for finding the optimal regressor and consists of two sets of constraints [19]. These constraints try to keep the data samples within the ε-insensitive zone [57]. Peng [50] formulated a twin version of SVR, termed the TSVR model, influenced by TWSVM.
TSVR generates two functions, which estimate the ε-insensitive down-bound and up-bound of the regressor. In the gist of TWSVM, TSVR also solves a pair of small-sized QPPs, unlike SVR, which solves one large QPP; this is responsible for the high computational speed of TSVR. The main remaining difficulty is to fit data having different kinds of noise and outliers, and to deal with the overfitting phenomenon.
To achieve robustness, several variants of SVR that deal with different types of noise and outliers are discussed in the literature, e.g. Hwang et al. [35], Cui and Yan [16], Chen et al. [8], Tang et al. [59], Chen et al. [7, 9] and Yang and Xu [79]. Other popular models assign different weight values to the samples to reduce the effect of outliers, as in Xu and Wang [75], Xu et al. [74], Tanveer et al. [62] and Mao et al. [43]. Some researchers include fuzzy-based membership values for the training points, assigning importance in terms of a penalty to the data samples [12, 22, 6]. For avoiding the overfitting problem, some valuable work has been added in the field of regression [37, 82], but some gaps are still present. Since the type of noise matters in computation, the right choice of loss function associated with the particular noise in the training data gives the proper direction towards optimal generalization. Several loss functions are generally considered in regression estimation problems, such as the ε-insensitive loss [13], Gaussian loss [31, 71, 73], Laplacian loss [46, 80], pinball loss [25, 54, 78], quadratic loss [30, 70], Huber loss [5, 8, 45, 52], asymmetric loss [32, 76], 1-norm loss [41], soft insensitive loss [11, 84], non-convex loss [83], and hinge loss [56].
Niu et al. [47] proposed a new approach termed TSVR with Huber loss (HN-TSVR), tested on real-world datasets and Gaussian-noise data. HN-TSVR performs well in comparison to TSVR with the Gaussian loss function (GN-TSVR) and standard TSVR. For further study, the reader can follow [33, 34, 42, 47, 72]. To robustly handle asymmetric noise distributions and outliers in observed real-world data, Balasundaram and Meena [5] proposed SVR with asymmetric Huber loss, whose solutions are attained using a functional iterative approach. In a similar vein, Balasundaram and Prasad [4] proposed a TSVR based variant, named robust TSVR with Huber loss, solved by a functional iterative approach and by a Newton iterative method with Armijo step size, to reduce sensitivity to noise and outliers. To improve the sparseness and prediction ability of the regression model, Anand et al. [1] improved the ε-support vector quantile regression model (ε-SVQR) by adding a regularization term for quantile estimation; the ε-SVQR model mainly focuses on the sparseness of the regression model. Gu et al. [24] investigated a fast clustering-based approach for TSVR to lessen the effect of outliers and noise in the observed data by following prior structural information and a successive over-relaxation algorithm. Due to the influence of noise and outliers in observed real-world data, SVR and TSVR based regression models have attracted attention in the literature [49, 51, 77, 81]; Wang et al. [66–68].
Our proposed approach, RHN-TSVR, is highly influenced by the HN-TSVR approach. Here, we further improve the HN-TSVR model through the addition of a regularization term, so as to follow the structural risk minimization principle; this yields more effective and efficient generalization performance and leads to a well-posed model. The advantage of our proposed algorithm is that it contains a strongly convex optimization function and works well for different types of noise and outliers. The efficacy and usability of RHN-TSVR are discussed on the basis of various numerical experiments using twenty-four artificially generated and forty-two real-world datasets; it shows better prediction performance on both types of datasets. Our proposed approach is also validated through standard statistical tests, which likewise signify that our approach performs better for different types of noisy data. The attractive characteristics of the proposed RHN-TSVR are as follows:
As TSVR and HN-TSVR suffer from a possible singularity problem, we reformulate the primal problem by adding a regularization term to the objective functions to resolve it.
Our proposed approach RHN-TSVR follows the structural risk minimization principle, which makes our model convex and well-posed.
By using the Huber loss function, the proposed approach RHN-TSVR effectively deals with noisy data.
Our proposed approach RHN-TSVR is tested and validated on various real-world datasets having different significant levels of noise (0%, 5% and 10%) as well as on artificial datasets.
The rest of this paper is organized as follows. In Sect. 2, related work, i.e. the SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR formulations, is stated for the non-linear case. The improved and regularized version of HN-TSVR (RHN-TSVR) is proposed in Sect. 3. In Sect. 4, numerical experiments on different benchmark real-world and several artificially generated datasets are conducted, while Sect. 5 dwells on the conclusion and future scope.
Related Work
In this related work section, the mathematical formulation of standard SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR is derived for the non-linear case. Assume A is the matrix of training samples whose i-th row is the input vector x_i^T, and y is the vector of desired outputs. For a non-linear kernel function, K(A, A^T) is defined entry-wise as (K(A, A^T))_{ij} = K(x_i, x_j), and K(x^T, A^T) is a row vector. The non-linear regression function on a training set is obtained through a mapping of the data to a higher-dimensional feature space.
| A | Training samples | e | Vector of ones |
| x | Input sample | (x)+ = max(0, x) | Plus function definition |
| y | Desired output | φ(·) | Higher-dimensional feature space |
Support Vector Regression (SVR)
According to Vapnik [64], standard SVR uses the ε-insensitive loss for finding the unknown values of w and b through the solution of the following QPP (Cristianini and Shawe-Taylor [15]):

min_{w, b, ξ1, ξ2} (1/2) w^T w + C e^T (ξ1 + ξ2)

subject to:

y − (K(A, A^T) w + e b) ≤ ε e + ξ1,
(K(A, A^T) w + e b) − y ≤ ε e + ξ2,
ξ1 ≥ 0, ξ2 ≥ 0, (1)

where ξ1 and ξ2 are slack variables, and C > 0 and ε > 0 are input parameters.
After introducing the Lagrange multipliers α1, α2 ≥ 0 and applying the sufficient Karush–Kuhn–Tucker conditions, the Wolfe dual of Eq. (1) is given as:

max_{α1, α2} −(1/2) (α1 − α2)^T K(A, A^T) (α1 − α2) + y^T (α1 − α2) − ε e^T (α1 + α2)

subject to:

e^T (α1 − α2) = 0, 0 ≤ α1 ≤ C e, 0 ≤ α2 ≤ C e. (2)

The non-linear regression function is calculated from the solution of the dual problem (2) (Cristianini and Shawe-Taylor [15]); for any new input x it is

f(x) = K(x^T, A^T)(α1 − α2) + b. (3)
Twin Support Vector Regression (TSVR)
The twin model of SVR, named TSVR, was proposed in [50]; it finds a pair of non-parallel kernel-generating functions corresponding to the ε-insensitive down-bound function f1(x) = K(x^T, A^T) w1 + b1 and up-bound function f2(x) = K(x^T, A^T) w2 + b2. The formulation of TSVR in primal form is given as:

min_{w1, b1, ξ1} (1/2) ||y − e ε1 − (K(A, A^T) w1 + e b1)||^2 + C1 e^T ξ1

subject to:

y − (K(A, A^T) w1 + e b1) ≥ e ε1 − ξ1, ξ1 ≥ 0, (4)

and

min_{w2, b2, ξ2} (1/2) ||y + e ε2 − (K(A, A^T) w2 + e b2)||^2 + C2 e^T ξ2

subject to:

(K(A, A^T) w2 + e b2) − y ≥ e ε2 − ξ2, ξ2 ≥ 0, (5)

where ξ1 and ξ2 are slack variables, and C1, C2 > 0 and ε1, ε2 > 0 are input parameters chosen a priori.
Similarly, forming the Lagrangians of the constrained optimization problems in Eqs. (4) and (5) and applying the Karush–Kuhn–Tucker (KKT) sufficient conditions, we get
[w1; b1] = (G^T G)^{-1} G^T (f − α) (6)

and

[w2; b2] = (G^T G)^{-1} G^T (h + γ), (7)

where G = [K(A, A^T) e] is the augmented matrix, f = y − e ε1, h = y + e ε2, and α, γ are the vectors of Lagrange multipliers of problems (4) and (5), respectively.

The down-bound and up-bound functions are then obtained from Eqs. (6) and (7) as follows:

f1(x) = K(x^T, A^T) w1 + b1, f2(x) = K(x^T, A^T) w2 + b2. (8)
In practice, a term δI with a small value δ > 0 is added to the matrix G^T G so that (G^T G + δI)^{-1} is well defined. The final predicted value for a new test sample x is obtained by taking the average of the functions f1(x) and f2(x).
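The role of this small regularization term can be illustrated with a minimal NumPy sketch; the matrix, target vector and δ value below are toy assumptions, not quantities from the paper:

```python
import numpy as np

def regularized_solve(G, target, delta=1e-6):
    """Compute u = (G^T G + delta*I)^{-1} G^T target, the regularized
    replacement for the possibly singular (G^T G)^{-1} G^T target."""
    GtG = G.T @ G
    return np.linalg.solve(GtG + delta * np.eye(GtG.shape[0]), G.T @ target)

# G^T G is singular here (G has linearly dependent columns), so the
# plain inverse fails, while the delta*I term restores invertibility.
G = np.array([[1.0, 2.0, 1.0],
              [2.0, 4.0, 1.0]])
target = np.array([1.0, 2.0])
u = regularized_solve(G, target)
```

Because δ is small, the regularized solution still reproduces the target almost exactly whenever the target lies in the column space of G.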
ε-Insensitive Asymmetric Huber Support Vector Regression (ε-AHSVR)
The ε-insensitive asymmetric Huber based SVR was proposed by Balasundaram and Meena [5]; it finds the unknown regressor through the solution of the following problem using the asymmetric Huber loss:
| 9 |
where the remaining quantities are input parameters.
One can rewrite problem (9) in the following manner:
| 10 |
Problem (10) is solved by following the functional iterative approach.
ε-Insensitive Support Vector Quantile Regression (ε-SVQR)
The novel ε-SVQR model uses the ε-insensitive pinball loss function for quantile estimation [1] to find the unknown values of w and b through the solution of the following QPP:
subject to:
and
| 11 |
where the slack variables, the quantile parameter τ and the remaining input parameters are chosen a priori.
After introducing the Lagrange multipliers and applying the sufficient KKT conditions, the Wolfe dual of Eq. (11) is given as:
subject to:
| 12 |
where K(·, ·) denotes the kernel function.
The non-linear regression function is calculated by solving the dual problem represented in Eq. (12); for any new input it is
| 13 |
Twin Support Vector Regression with Huber Loss (HN-TSVR)
In order to improve the prediction ability, a hybrid approach combining the Huber loss function with the twin model of SVR, termed TSVR with Huber loss (HN-TSVR), is described in this sub-section.
The Huber loss function [47] is quadratic for small residuals and linear beyond a threshold μ:

H_μ(t) = (1/2) t^2, if |t| ≤ μ,
H_μ(t) = μ|t| − (1/2) μ^2, if |t| > μ,

where μ > 0 is an input parameter.
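As a concrete illustration, the loss just defined can be written in a few lines of NumPy; the default μ = 1.345 is only an example value (it appears among the tuned parameter settings reported in Table 3), not a recommendation from the paper:

```python
import numpy as np

def huber_loss(t, mu=1.345):
    """Huber loss: quadratic for |t| <= mu, linear (slope mu) beyond."""
    t = np.asarray(t, dtype=float)
    quadratic = np.abs(t) <= mu
    return np.where(quadratic, 0.5 * t ** 2, mu * np.abs(t) - 0.5 * mu ** 2)
```

At |t| = μ both branches equal μ²/2, so the loss and its first derivative are continuous, which is what makes it less sensitive to large residuals than a purely quadratic (Gaussian) loss.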
The HN-TSVR regression involves the following optimization problems:
| 14 |
and
| 15 |
where the slack variables and input parameters are defined analogously to TSVR.
Deriving the dual formulations by applying the sufficient KKT conditions gives:
subject to:
| 16 |
and
subject to:
| 17 |
where G = [K(A, A^T) e] is the augmented matrix.
The values of [w1; b1] and [w2; b2] can be solved as
| 18 |
where , .
Further, the final prediction value is obtained in the same way as in TSVR. For more details, see Niu et al. [47].
Proposed Regularization Based Twin Support Vector Regression with Huber Loss (RHN-TSVR)
The twin model of SVR (TSVR) deals with the ε-insensitive loss but fails to address Gaussian-noise data. To handle this problem, Niu et al. [47] proposed an approach named HN-TSVR, which deals with the Huber loss function but fails to follow the structural risk minimization principle. To avoid the singularity problem of HN-TSVR, we add regularization terms to the primal problems (19) and (20), respectively, which leads to a stable and well-posed model, named regularization based twin support vector regression with Huber loss; it follows the gist of statistical learning theory. The mathematical formulation of RHN-TSVR is written as follows.
RHN-TSVR requires two kernel-generating functions, f1(x) = K(x^T, A^T) w1 + b1 and f2(x) = K(x^T, A^T) w2 + b2. The proposed approach involves the following optimization problems:
(19)

and

(20)

where the slack variables and input parameters, including the regularization parameters, are chosen a priori.
By introducing the Lagrange multipliers and applying the sufficient KKT conditions to Eqs. (19) and (20), we obtain (21) and (22).
Then, we find the gradient of (21), according to the KKT conditions, with respect to the primal variables:
where , .
Here, we have , thus . Also, , then we can get . Therefore we can conclude that .
Similar to (21), we find the gradient of Eq. (22) with respect to the primal variables:
where, .
Here, we have , thus . Also,, then we can get . Therefore we can conclude that .
By following the same approach as in [47], we get the dual formulation of (19) as (23), where G = [K(A, A^T) e] is the augmented matrix.
Similar to the above, we get the dual formulation of (20) as (24).
The values of [w1; b1] and [w2; b2] can then be obtained as in (25).
The final regressor value for a new test sample x is determined by taking the average of the functions f1(x) and f2(x):

f(x) = (1/2)(f1(x) + f2(x)). (26)
One can follow Algorithm 3.1 of the proposed RHN-TSVR approach.
Remark
One can observe that for TWSVM, the authors in [55] gave an improvement by adding a regularization term to the objective function, aiming at minimizing the structural risk by maximizing the margin. This method is called TBSVM, where the bias term is also penalized. However, penalizing the bias term does not affect the result significantly and changes the optimization problem only slightly. From a geometric point of view, it is sufficient to penalize the norm of w in order to maximize the margin [45].
Numerical Experiments and Results
In this section, we demonstrate a number of experiments to validate the efficacy of our proposed approach RHN-TSVR in comparison to existing approaches such as SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR on various artificial datasets having different types of noise and on real-world datasets at significant noise levels 0%, 5% and 10%.
Experimental Setup
The numerical experiments were executed on a desktop PC with an Intel i5 64-bit processor running at 3.20 GHz, 4 GB RAM, the Microsoft Windows 10 operating system and MATLAB. The MOSEK optimization toolbox (accessible from https://www.mosek.com) is used to solve the QPPs of the SVR, TSVR, ε-SVQR, HN-TSVR and RHN-TSVR models. The Gaussian kernel function with kernel parameter μ is considered for the non-linear case. All parameters with their sets of ranges for the concerned algorithms are defined in Table 1. Ten-fold cross-validation is applied for all concerned algorithms on the benchmark standard real-world as well as artificially generated datasets.
Table 1.
All parameters with their range and concerned algorithms
| Parameters | Range | Model |
|---|---|---|
| | | SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR, RHN-TSVR |
| | | SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR, RHN-TSVR |
| | | SVR |
| | | TSVR |
| | | ε-AHSVR |
| | | ε-SVQR |
| | | HN-TSVR, RHN-TSVR |
| | | ε-AHSVR |
| | | ε-SVQR |
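The 10-fold cross-validation and grid search over the parameter ranges can be sketched as below; `fit` and `predict` stand in for any of the compared regressors, and the grid passed in is illustrative rather than the exact ranges of Table 1:

```python
import numpy as np

def ten_fold_rmse(fit, predict, X, y, params, n_folds=10, seed=0):
    """Mean test RMSE of a regressor over 10 cross-validation folds."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    errs = []
    for k in range(n_folds):
        test = folds[k]
        train = np.hstack([folds[j] for j in range(n_folds) if j != k])
        model = fit(X[train], y[train], **params)
        pred = predict(model, X[test])
        errs.append(np.sqrt(np.mean((y[test] - pred) ** 2)))
    return float(np.mean(errs))

def grid_search(fit, predict, X, y, grid):
    """Return the parameter setting with the lowest mean CV RMSE."""
    scored = [(ten_fold_rmse(fit, predict, X, y, p), p) for p in grid]
    return min(scored, key=lambda s: s[0])[1]
```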
Here, the root mean square error (RMSE) is considered to compute the prediction error of all concerned algorithms on the test data. Its formula is

RMSE = sqrt( (1/N) Σ_{i=1}^{N} (y_i − ŷ_i)^2 ),

where y_i is the observed value, ŷ_i is the predicted value and N is the total number of test samples.
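A direct transcription of this error measure:

```python
import numpy as np

def rmse(y_obs, y_pred):
    """Root mean square error between observed and predicted values."""
    y_obs = np.asarray(y_obs, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_obs - y_pred) ** 2)))
```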
Artificial Data Set
Uniform and Gaussian Noise
We generate artificial datasets to test the effectiveness of our proposed RHN-TSVR against SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR; the function definitions are given in Table 2. Two types of noise are considered in Table 2: (I) uniform noise over a fixed interval; (II) Gaussian noise with given mean and variance. Functions 1 to 14 use uniform variability of noise with a symmetric distribution, while Functions 15 to 18 have a heteroscedastic noise structure in which the noise is computed on the basis of the value of the input sample.
Table 2.
Different artificially generated functions using Uniform noise and Gaussian noise with their definition and domain of definition
| Function name | Function definition | Domain of definition | Noise type |
|---|---|---|---|
| Function1, Function2 | | | Type A, Type B |
| Function3, Function4 | | | Type A, Type B |
| Function5, Function6 | | | Type A, Type B |
| Function7, Function8 | | | Type A, Type B |
| Function9, Function10 | | | Type A, Type B |
| Function11, Function12 | | | Type A, Type B |
| Function13, Function14 | | | Type A, Type B |
| Function15, Function16 | | | Type A, Type B |
| Function17, Function18 | | | Type A, Type B |
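A minimal sketch of how such noisy training targets can be generated; the base function (`np.sinc`), interval and scale below are illustrative placeholders, since the exact definitions of Table 2 are not reproduced here:

```python
import numpy as np

rng = np.random.default_rng(42)

def noisy_samples(f, x, noise="uniform", scale=0.2):
    """Add symmetric uniform or zero-mean Gaussian noise to f(x)."""
    if noise == "uniform":
        eps = rng.uniform(-scale, scale, size=len(x))
    else:
        eps = rng.normal(0.0, scale, size=len(x))
    return f(x) + eps

def heteroscedastic_samples(f, x, scale=0.2):
    """Noise whose standard deviation depends on the input value."""
    eps = rng.normal(0.0, scale * np.abs(x))
    return f(x) + eps

x = np.linspace(-4.0, 4.0, 150)
y_uniform = noisy_samples(np.sinc, x, noise="uniform")
y_hetero = heteroscedastic_samples(np.sinc, x)
```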
The performance analysis of the RHN-TSVR model against standard SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR using the Gaussian kernel on artificially generated datasets is tabulated in Table 3. Table 4 contains the average ranks of all concerned algorithms over the artificially generated datasets for the Gaussian kernel. The proposed algorithm RHN-TSVR has the lowest average rank among SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR in Table 4, which signifies that our approach outperforms the others. From Table 3, the performance of RHN-TSVR is better in 10 out of 18 artificial functions for both uniform and Gaussian noise. It also shows a prominent impact under both the uniform variability of noise and the heteroscedastic noise structure.
Table 3.
Performance comparison of RHN-TSVR with SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR using the Gaussian kernel on synthetic datasets with uniform and Gaussian noise
| Dataset (Train size, Test size) |
SVR RMSE Time |
TSVR RMSE Time |
ε-AHSVR RMSE Time |
ε-SVQR RMSE Time |
HN-TSVR RMSE Time |
RHN-TSVR RMSE Time |
|---|---|---|---|---|---|---|
|
Function1 (150X2,500X2) |
0.060254 (101,0.1,20) 0.24846 |
0.07662 (100,10−3,20) 0.07161 |
0.102363 (103,10−3, 1.345, 1.345, 21) 0.00237 |
0.08825 (101,0.1,0.3,21) 0.11429 |
0.07716 (101, 0.1, 0.1, 20) 0.13004 |
0.12935 (10−5,10−5, 0.9, 0.1, 20) 0.0068301 |
|
Function2 (150X2,500X2) |
0.084763 (105,0.1,2−1) 0.27253 |
0.02778 (100,10−1,20) 0.09476 |
0.025112 (105,10−3, 1, 1.345, 21) 0.00214 |
0.04269 (103,0.1,0.3,2−1) 0.13124 |
0.02775 (101, 0.9, 0.3, 20) 0.10073 |
0.02199 (101,10−5, 0.9, 0.9, 21) 0.00916 |
|
Function3 (150X6,500X6) |
0.01362 (105,0.1,2−3) 0.23213 |
0.01921 (103,10−3,2−2) 0.17381 |
0.01156 (105,10−1, 1, 1.345, 2−2) 0.00603 |
0.01356 (105,0.3,0.5,2−3) 0.06577 |
0.02379 (105, 0.5, 0.3, 20) 0.19114 |
0.00988 (105,10−5, 0.1, 0.3, 2−3) 0.0118904 |
|
Function4 (150X6,500X6) |
0.061408 (104,0.1,2−5) 0.23359 |
0.01346 (104,10−1,20) 0.10612 |
0.00632 (105,10−3, 0.1, 1.345, 23) 0.00569 |
0.02464 (105,0.1,0.5,2−3) 0.06941 |
0.01333 (105, 0.9, 0.5, 20) 0.19064 |
0.00602 (101,10−5, 0.9, 0.5, 23) 0.0154294 |
|
Function5 (150X3,500X3) |
0.051805 (104,0.1,25) 0.23124 |
0.03731 (103,10−1,25) 0.10908 |
0.031527 (103,10−1, 1, 1.345, 25) 0.01027 |
0.04023 (105,0.3,0.5,25) 0.07013 |
0.03731 (103, 0.9, 0.5, 25) 0.16043 |
0.03919 (105,10−5, 0.9, 0.5, 25) 0.0139697 |
|
Function6 (150X3,500X3) |
0.068543 (104,0.1,22) 0.24664 |
0.00725 (102,10−1,25) 0.10067 |
0.004275 (105,10−3, 0.1, 1.345, 25) 0.00794 |
0.03068 (105,0.1,0.2,23) 0.07966 |
0.00727 (103, 0.3, 0.9, 25) 0.12189 |
0.00406 (100,10−5, 0.1, 0.5, 25) 0.015654 |
|
Function7 (150X6,500X6) |
0.017707 (104,0.1,2−4) 0.21608 |
0.008 (103,10−3,21) 0.10053 |
0.010915 (105,10−1, 1, 1.345, 2−4) 0.01112 |
0.02645 (105,0.1,0.6,2−5) 0.07513 |
0.00799 (105, 0.1, 0.1, 21) 0.12071 |
0.00795 (105,10−5, 0.1, 0.7, 20) 0.0161369 |
|
Function8 (150X6,500X6) |
0.062695 (103,0.1,21) 0.23092 |
0.00418 (101,10−1,23) 0.10265 |
0.002216 (105,10−3, 0.1, 1.345, 22) 0.00669 |
0.02145 (105,0.1,0.1,2−1) 0.07909 |
0.00418 (105, 0.5, 0.3, 23) 0.10959 |
0.00169 (101,10−5, 0.1, 0.1, 20) 0.0169987 |
|
Function9 (150X2,500X2) |
0.088101 (102,0.1,20) 0.27421 |
0.08105 (100,10−1,20) 0.07858 |
0.083806 (102,10−3, 1.345, 1.345, 21) 0.00213 |
0.09462 (103,0.1,0.5,2−1) 0.12864 |
0.08129 (101, 0.9, 0.1, 20) 0.07415 |
0.08129 (101,10−3, 0.9, 0.1, 20) 0.0151367 |
|
Function10 (150X2,500X2) |
0.078086 (102,0.1,20) 0.48633 |
0.0158 (102,10−3,21) 0.07912 |
0.011333 (105,10−3, 0.1, 1.345, 21) 0.0013 |
0.04075 (105,0.1,0.2,2−1) 0.14998 |
0.01579 (105, 0.1, 0.9, 21) 0.06906 |
0.0112 (10−5,10−5, 0.1, 0.1, 21) 0.0126496 |
|
Function11 (150X3,500X3) |
0.027556 (105,0.1,22) 0.25697 |
0.01022 (105,10−1,24) 0.10644 |
0.016189 (105,10−1, 1, 1.345, 23) 0.01082 |
0.04191 (105,0.3,0.1,25) 0.07923 |
0.0138 (105, 0.1, 0.7, 23) 0.15925 |
0.01293 (105,10−5, 0.9, 0.3, 23) 0.017398 |
|
Function12 (150X3,500X3) |
0.061853 (104,0.1,23) 0.28748 |
0.00696 (100,10−1,25) 0.08644 |
0.004961 (105,10−3, 0.1, 1.345, 24) 0.00726 |
0.03326 (103,0.1,0.1,23) 0.07364 |
0.00689 (103, 0.9, 0.1, 25) 0.09879 |
0.00531 (100,10−5, 0.9, 0.3, 25) 0.0173283 |
|
Function13 (200X2,450X2) |
0.136885 (100,0.1,21) 0.26544 |
0.11783 (101,10−3,2−1) 0.12948 |
0.08155 (101,10−1, 1, 1.345, 21) 0.00388 |
0.10239 (101,0.9,0.2,21) 0.12211 |
0.11656 (105, 0.1, 0.5, 2−1) 0.10624 |
0.08749 (105,101, 0.3, 0.5, 21) 0.0225593 |
|
Function14 (200X2,500X2) |
0.186935 (100,0.1,24) 0.25514 |
0.12023 (105,10−3,20) 0.10377 |
0.184837 (101,10−1, 1, 1.345, 25) 0.00275 |
0.15779 (101,0.5,0.1,25) 0.10492 |
0.12034 (103, 0.1, 0.7, 20) 0.09102 |
0.10566 (105,101, 0.1, 0.9, 23) 0.0235153 |
|
Function15 (200X2,500X2) |
0.034488 (102,0.1,2−5) 0.46716 |
0.01879 (103,10−1,2−5) 0.16623 |
0.019379 (101,10−1, 1, 1.345, 2−4) 0.00225 |
0.02682 (103,0.3,0.5,2−5) 0.22456 |
0.01878 (105, 0.9, 0.7, 2−5) 0.30141 |
0.01878 (105,10−3, 0.9, 0.7, 2−5) 0.024471 |
|
Function16 (200X2,500X2) |
0.036077 (102,0.1,2−3) 0.47924 |
0.02116 (10−5,10−3,2 −3) 0.15487 |
0.0222 (103,10−2, 1, 1.345, 2−3) 0.0046 |
0.02166 (103,0.1,0.5,2−5) 0.22893 |
0.0219 (100, 0.9, 0.1, 2−3) 0.236 |
0.01955 (105,101, 0.1, 0.5, 2−3) 0.028278 |
|
Function17 (200X2,500X2) |
0.147868 (100,0.1,23) 0.46149 |
0.15262 (10−5,10−1,21) 0.143 |
0.100892 (101,10−1, 1.345, 1.345, 24) 0.00233 |
0.18809 (101,0.7,0.8,25) 0.24851 |
0.15262 (10−5, 0.1, 0.1, 21) 0.14581 |
0.07032 (101,10−1, 0.9, 0.5, 23) 0.0183895 |
|
Function18 (200X2,500X2) |
0.052586 (101,0.1,23) 0.44357 |
0.05026 (100,10−3,23) 0.14172 |
0.044265 (102,10−2, 1, 1.345, 23) 0.0035 |
0.03809 (101,0.1,0.5,23) 0.21918 |
0.05027 (101, 0.9, 0.1, 23) 0.2795 |
0.05027 (101,10−3, 0.9, 0.1, 23) 0.0184345 |
The best result is shown in boldface
Table 4.
Average ranks of the proposed RHN-TSVR with SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR on RMSE using the Gaussian kernel on synthetic datasets with uniform and Gaussian noise
| Datasets | SVR | TSVR | ε-AHSVR | ε-SVQR | HN-TSVR | RHN-TSVR |
|---|---|---|---|---|---|---|
| Function1 | 1 | 2 | 5 | 4 | 3 | 6 |
| Function2 | 6 | 4 | 2 | 5 | 3 | 1 |
| Function3 | 4 | 5 | 2 | 3 | 6 | 1 |
| Function4 | 6 | 4 | 2 | 5 | 3 | 1 |
| Function5 | 6 | 2.5 | 1 | 5 | 2.5 | 4 |
| Function6 | 6 | 3 | 2 | 5 | 4 | 1 |
| Function7 | 5 | 3 | 4 | 6 | 2 | 1 |
| Function8 | 6 | 3.5 | 2 | 5 | 3.5 | 1 |
| Function9 | 5 | 1 | 4 | 6 | 2.5 | 2.5 |
| Function10 | 6 | 4 | 2 | 5 | 3 | 1 |
| Function11 | 5 | 1 | 4 | 6 | 3 | 2 |
| Function12 | 6 | 4 | 1 | 5 | 3 | 2 |
| Function13 | 6 | 5 | 1 | 3 | 4 | 2 |
| Function14 | 6 | 2 | 5 | 4 | 3 | 1 |
| Function15 | 6 | 3 | 4 | 5 | 1.5 | 1.5 |
| Function16 | 6 | 2 | 5 | 3 | 4 | 1 |
| Function17 | 3 | 4.5 | 2 | 6 | 4.5 | 1 |
| Function18 | 6 | 3 | 2 | 1 | 4.5 | 4.5 |
| Average rank | 5.2777778 | 3.1388889 | 2.7777778 | 4.5555556 | 3.3333333 | 1.9166667 |
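The average ranks in the last row of Table 4 can be reproduced by ranking the methods on each dataset (rank 1 = lowest RMSE, ties sharing the mean rank) and averaging down each column; a small sketch, which recovers e.g. the Function5 row (6, 2.5, 1, 5, 2.5, 4) from the RMSE values of Table 3:

```python
import numpy as np

def average_ranks(rmse_matrix):
    """rmse_matrix: rows = datasets, columns = methods.
    Returns the average rank of each method; ties get the mean rank."""
    ranks = np.empty_like(rmse_matrix, dtype=float)
    for i, row in enumerate(rmse_matrix):
        order = np.argsort(np.argsort(row)) + 1.0  # provisional 1-based ranks
        for v in np.unique(row):                   # average ranks over ties
            tied = row == v
            order[tied] = order[tied].mean()
        ranks[i] = order
    return ranks.mean(axis=0)
```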
Figures 1 and 2 are plotted for artificial Functions 13 and 14 from Table 3 in order to show the uniform variability of noise with symmetric distribution. Figures 3 and 4 are plotted for artificial Functions 15 and 16 from Table 3, using the Gaussian kernel, in order to show the predictions under a heteroscedastic noise structure. One can see from Figs. 1, 2, 3 and 4 that our proposed approach RHN-TSVR tracks the original function more closely than the other reported approaches. Hence, one can say that RHN-TSVR is very well capable of dealing with uniform noise as well as a heteroscedastic noise structure, if present in the dataset.
Fig. 1.
Prediction over the testing dataset by SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR and RHN-TSVR on the artificially generated Function 13 dataset. The Gaussian kernel was used
Fig. 2.
Prediction over the testing dataset by SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR and RHN-TSVR on the artificially generated Function 14 dataset. The Gaussian kernel was used
Fig. 3.
Prediction over the testing dataset by SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR and RHN-TSVR on the artificially generated Function 15 dataset. The Gaussian kernel was used
Fig. 4.
Prediction over the testing dataset by SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR and RHN-TSVR on the artificially generated Function 16 dataset. The Gaussian kernel was used
Laplacian Noise
Further, we generate artificial datasets with another type of noise, i.e. Laplacian noise, to validate our RHN-TSVR approach against SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR; the function definitions are given in Table 5. The Laplacian noise is drawn over a fixed interval.
Table 5.
Different artificially generated functions having Laplacian noise with their definition and domain of definition
| Function name | Function definition | Domain of definition |
|---|---|---|
| Function19 | | |
| Function20 | | |
| Function21 | | |
| Function22 | | |
| Function23 | | |
| Function24 | | |
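Laplacian (double-exponential) noise of the kind used for Functions 19 to 24 can be sampled as below; the location and scale values are illustrative assumptions, not the paper's settings:

```python
import numpy as np

rng = np.random.default_rng(7)

def laplacian_noise(size, loc=0.0, scale=0.1):
    """Heavy-tailed Laplacian noise; its variance is 2 * scale**2."""
    return rng.laplace(loc, scale, size=size)

x = np.linspace(0.0, 1.0, 150)
y = np.sin(2.0 * np.pi * x) + laplacian_noise(x.shape)
```

The heavier tails of the Laplacian (relative to a Gaussian of the same variance) are exactly the regime in which a loss with bounded linear growth, such as the Huber loss, is expected to help.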
The prediction performance of RHN-TSVR relative to conventional SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR using the Gaussian kernel on artificially generated datasets is tabulated in Table 6. The proposed RHN-TSVR shows better prediction capability in 4 cases out of 6. Further, the average ranks of all models are computed in Table 7; the proposed RHN-TSVR has the lowest rank, which signifies that our approach outperforms the others. We have plotted Fig. 5 for Function 19 to illustrate the close relationship between the predicted and original values; in this case, our proposed RHN-TSVR performs similarly to HN-TSVR.
Table 6.
Performance comparison of RHN-TSVR with SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR using the Gaussian kernel on synthetic datasets with Laplacian noise
| Dataset (Train size, Test size) |
SVR RMSE Time |
TSVR RMSE Time |
ε-AHSVR RMSE Time |
ε-SVQR RMSE Time |
HN-TSVR RMSE Time |
RHN-TSVR RMSE Time |
|---|---|---|---|---|---|---|
|
Function19 (150X2,500X2) |
0.416258 (102,0.1,2−1) 0.55812 |
0.404259 (100,0.9,2−1) 0.03316 |
0.40645 (103,10−1, 1, 1, 2−1) 0.00513 |
0.47416 (103,0.7,0.5,2−1) 0.02272 |
0.404141 (101, 0.9, 0.1, 2−1) 0.02615 |
0.404141 (101,10−3, 0.9, 0.1, 2−1) 0.0083096 |
|
Function20 (150X3,500X3) |
0.146758 (100,0.1,20) 0.47995 |
0.217388 (100,0.9,2−1) 0.02454 |
0.252318 (10−2,10−1, 0.1, 1, 24) 0.01452 |
0.21538 (101,0.5,0.5,21) 0.02619 |
0.217806 (101, 0.9, 0.1, 2−1) 0.02312 |
0.138944 (101,102, 0.5, 0.1, 23) 0.0125658 |
|
Function21 (150X3,500X3) |
0.193024 (105,0.1,2−5) 0.46148 |
0.215517 (100,0.9,2−3) 0.02405 |
0.213251 (10−1,10−1, 1, 1.345, 20) 0.01195 |
0.24943 (10−1,0.1,0.8,25) 0.03729 |
0.215388 (101, 0.9, 0.1, 2−3) 0.02207 |
0.188105 (101,101, 0.9, 0.1, 20) 0.0189023 |
|
Function22 (150X2,500X2) |
0.107321 (100,0.1,24) 0.43419 |
0.116748 (100,0.9,22) 0.03422 |
0.081983 (100,10−1, 0.1, 0.1, 25) 0.00426 |
0.11815 (105,0.3,0.4,2−1) 0.0151 |
0.118841 (101, 0.1, 0.1, 23) 0.02572 |
0.104688 (101,101, 0.1, 0.1, 25) 0.0133986 |
|
Function23 (150X6,500X6) |
0.190689 (102,0.1,21) 0.38126 |
0.240025 (100,0.1,21) 0.02595 |
0.24585 (100,10−1, 1, 1, 22) 0.01102 |
0.28425 (101,0.3,0.3,23) 0.0199 |
0.244521 (100, 0.1, 0.7, 21) 0.02311 |
0.18934 (101,10−1, 0.9, 0.1, 21) 0.0136633 |
|
Function24 (200X2,500X2) |
0.026582 (102,0.1,21) 0.89983 |
0.021008 (100,0.9,21) 0.06646 |
0.027998 (104,10−1, 0.1, 0.1, 22) 0.00481 |
0.01086 (103,0.1,0.4,21) 0.02411 |
0.019294 (101, 0.1, 0.1, 21) 0.08291 |
0.012581 (100,10−4, 0.9, 0.3, 21) 0.0169828 |
The best result is shown in boldface
Table 7.
Average ranks of the proposed RHN-TSVR with SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR on RMSE using the Gaussian kernel on synthetic datasets with Laplacian noise
| Datasets | SVR | TSVR | ε-AHSVR | ε-SVQR | HN-TSVR | RHN-TSVR |
|---|---|---|---|---|---|---|
| Function19 | 5 | 3 | 4 | 6 | 1.5 | 1.5 |
| Function20 | 2 | 4 | 6 | 3 | 5 | 1 |
| Function21 | 2 | 5 | 3 | 6 | 4 | 1 |
| Function22 | 3 | 4 | 1 | 5 | 6 | 2 |
| Function23 | 2 | 3 | 5 | 6 | 4 | 1 |
| Function24 | 5 | 4 | 6 | 1 | 3 | 2 |
| Average rank | 3.1666667 | 3.8333333 | 4.1666667 | 4.5 | 3.9166667 | 1.4166667 |
Fig. 5.
Prediction over the testing dataset by SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR and RHN-TSVR on the artificially generated Function 19 dataset. The Gaussian kernel was used
Real World Datasets
Here, 42 standard benchmark real-world datasets at different significant noise levels (0%, 5% and 10%) are used to demonstrate the performance of RHN-TSVR. The details of the datasets are as follows:
| Real world datasets | Repositories |
|---|---|
| Forestfires, Machine_CPU, Auto-original, Winequality, Gas_furnace, Quake | (UCI datasets repositories, [63]) |
| SantafeA | [53] |
| The inverse dynamics of a Flexible robot arm | [21] |
| The financial time series datasets: S&P500, INFY, ONGC_NS, XOM, ATX, BSESN, DJI, GDAXI, MXX, N225 | [20, 27] |
| Space Ga | [58] |
| KEEL time-series datasets: NNGC1_dataset_E1_V1_001, NNGC1_dataset_F1_V1_008, NNGC1_dataset_F1_V1_009, NNGC1_dataset_F1_V1_010, NNGC1_dataset_F1_V1_006, NN5_Complete_109, NN5_Complete_104, NN5_Complete_106, NN5_Complete_103, NN5_Complete_101, NN5_Complete_105, NN5_Complete_111, D1dat_1_2000 | [38] |
| Wankara, Laser, Dee, Friedman, Mortgage | [38] |
| Roszman1, Gauss1, Chwirut2 | [48] |
| Vineyard | [65] |
| COVID-19_spain | [14] |
At Significant Noise Level 0%
The numerical results for SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR and the proposed approach RHN-TSVR on 42 standard real-world benchmark datasets at significant noise level 0% are tabulated in Table 8, which lists the prediction accuracy with optimal parameter values and the learning time of all reported approaches. The performance of our proposed RHN-TSVR over the benchmark real-world datasets is better in 22 cases compared to SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR. Further, the average ranks based on the RMSE values for the Gaussian kernel are determined for the statistical test in Table 9. The average rank of our proposed RHN-TSVR is the lowest among the reported approaches, so one can conclude that RHN-TSVR performs better on real-world datasets at significant noise level 0%. Figures 6 and 7 are plotted for the Machine CPU and Gas furnace datasets, respectively, to show the prediction performance graphically; both graphs clearly show the better prediction capability of RHN-TSVR.
Table 8.
Performance comparison of RHN-TSVR with SVR, TSVR, -AHSVR, -SVQR and HN-TSVR using Gaussian kernel with noise 0% on real world datasets
| Dataset (Train size, Test size) | SVR RMSE (params) Time | TSVR RMSE (params) Time | -AHSVR RMSE (params) Time | -SVQR RMSE (params) Time | HN-TSVR RMSE (params) Time | RHN-TSVR RMSE (params) Time |
|---|---|---|---|---|---|---|
| Forestfires (150X13,367X13) | 0.07059 (10^−3, 0.1, 2^5) 0.55583 | 0.07058 (10^−4, 0.1, 2^5) 0.063014 | 0.070592 (10^−5, 10^−3, 0.1, 0.1, 2^1) 0.09428 | 0.070476 (10^1, 0.1, 0.1, 2^−5) 0.02012 | 0.07058 (10^−5, 0.5, 0.5, 2^5) 0.25727 | 0.07059 (10^5, 10^5, 0.3, 0.3, 2^−5) 0.0125241 |
| Machine CPU (100X8,109X8) | 0.03506 (10^4, 0.1, 2^−2) 0.24525 | 0.02102 (10^3, 0.9, 2^−2) 0.041461 | 0.050225 (10^5, 10^−1, 0.1, 0.1, 2^−3) 0.332 | 0.036146 (10^1, 0.1, 0.7, 2^−5) 0.00977 | 0.03035 (10^3, 0.9, 0.5, 2^−1) 0.20161 | 0.01579 (10^3, 10^−5, 0.9, 0.3, 2^−3) 0.0107687 |
| Auto-original (100X8,292X8) | 0.33338 (10^0, 0.1, 2^1) 0.2708 | 0.29912 (10^−2, 0.1, 2^0) 0.033108 | 0.171313 (10^1, 10^−1, 0.1, 1, 2^−5) 0.06359 | 0.260394 (10^1, 0.1, 0.7, 2^−1) 0.01014 | 0.29899 (10^−1, 0.1, 0.1, 2^0) 0.04945 | 0.29899 (10^−1, 10^−3, 0.1, 0.1, 2^0) 0.0126264 |
| Winequality (500X12,4398X12) | 0.13377 (10^2, 0.1, 2^−3) 6.31838 | 0.13437 (10^−5, 0.7, 2^−1) 0.719632 | 0.130454 (10^5, 10^−1, 0.1, 1, 2^−3) 0.13756 | 0.132434 (10^3, 0.1, 0.5, 2^−5) 0.47543 | 0.135 (10^0, 0.1, 0.1, 2^−1) 0.61889 | 0.13486 (10^0, 10^−5, 0.1, 0.1, 2^−3) 0.0832587 |
| SantafeA (500X6,495X6) | 0.05654 (10^0, 0.1, 2^2) 5.59668 | 0.03811 (10^−1, 0.9, 2^3) 0.758993 | 0.081809 (10^3, 10^−2, 0.1, 1, 2^1) 0.1015 | 0.039983 (10^3, 0.1, 0.5, 2^−1) 0.86175 | 0.03922 (10^0, 0.9, 0.1, 2^3) 0.64863 | 0.0375 (10^5, 10^−5, 0.1, 0.7, 2^3) 0.113057 |
| Gas_furnace (150X7,143X7) | 0.07154 (10^5, 0.1, 2^−5) 0.47456 | 0.04394 (10^2, 0.1, 2^−4) 0.068103 | 0.084165 (10^1, 10^−1, 0.1, 1, 2^−3) 0.05107 | 0.05659 (10^3, 0.1, 0.2, 2^−3) 0.0444 | 0.04158 (10^1, 0.1, 0.1, 2^−3) 0.08725 | 0.0383 (10^0, 10^−5, 0.1, 0.3, 2^−5) 0.0174328 |
| Quake (500X4,1678X4) | 0.17349 (10^−1, 0.1, 2^−2) 9.48833 | 0.17227 (10^0, 0.9, 2^−5) 0.758743 | 0.17214 (10^−1, 10^−1, 0.1, 0.1, 2^−3) 0.14973 | 0.172642 (10^−3, 0.1, 0.7, 2^−3) 0.36931 | 0.17219 (10^0, 0.9, 0.3, 2^−5) 0.82559 | 0.17224 (10^0, 10^1, 0.9, 0.3, 2^−5) 0.0794874 |
| Flex_robotarm (500X10,519X10) | 0.04337 (10^3, 0.1, 2^−5) 9.5114 | 0.03595 (10^1, 0.1, 2^−1) 0.795891 | 0.056807 (10^5, 10^−1, 0.1, 0.1, 2^−5) 0.0878 | 0.025886 (10^5, 0.1, 0.4, 2^−5) 0.24667 | 0.03595 (10^3, 0.1, 0.9, 2^−1) 1.20655 | 0.02046 (10^3, 10^−5, 0.9, 0.7, 2^−1) 0.10096 |
| S&P500 (200X6,550X6) | 0.19605 (10^5, 0.1, 2^−1) 1.41245 | 0.13739 (10^−1, 0.9, 2^1) 0.113956 | 0.255194 (10^−1, 10^−1, 0.1, 1, 2^−1) 0.02659 | 0.095518 (10^3, 0.1, 0.8, 2^−3) 0.04174 | 0.13983 (10^1, 0.9, 0.1, 2^1) 0.33202 | 0.13355 (10^0, 10^−5, 0.1, 0.3, 2^−1) 0.0243076 |
| Space-Ga (500X7,2607X7) | 0.30306 (10^−1, 0.1, 2^−1) 9.72392 | 0.29628 (10^−3, 0.1, 2^5) 0.697419 | 0.30902 (10^1, 10^−1, 0.1, 1, 2^5) 0.09576 | 0.285357 (10^−1, 0.1, 0.6, 2^−1) 0.39695 | 0.29628 (10^−5, 0.1, 0.1, 2^5) 0.92673 | 0.29306 (10^0, 10^−1, 0.9, 0.1, 2^5) 0.107667 |
| Gauss1 (75X2,175X2) | 0.738274 (10^5, 0.1, 2^−5) 0.12142 | 0.479003 (10^3, 0.9, 2^3) 0.01401 | 0.738025 (10^3, 10^−1, 0.1, 0.1, 2^3) 0.00066 | 0.645752 (10^5, 0.1, 0.8, 2^−5) 0.00861 | 0.478936 (10^5, 0.9, 0.5, 2^3) 0.02833 | 0.619614 (10^1, 10^1, 0.9, 0.9, 2^3) 0.0061953 |
| Chwirut2 (17X2,37X2) | 0.120144 (10^0, 0.1, 2^5) 0.00941 | 0.078331 (10^−5, 0.5, 2^5) 0.01558 | 0.131455 (10^1, 10^−1, 0.1, 1, 2^5) 0.00014 | 0.10763 (10^1, 0.1, 0.3, 2^5) 0.00408 | 0.078332 (10^−5, 0.3, 0.7, 2^5) 0.02219 | 0.101378 (10^3, 10^−5, 0.9, 0.7, 2^3) 0.0053057 |
| Roszman1 (8X2,17X2) | 0.833004 (10^2, 0.1, 2^5) 0.00441 | 0.147794 (10^−5, 1, 2^0) 0.01062 | 0.704285 (10^2, 10^−1, 0.1, 1, 2^5) 0.00009 | 0.774556 (10^1, 0.1, 0.3, 2^5) 0.00621 | 0.147779 (10^−5, 1, 0.7, 2^0) 0.02386 | 0.955187 (10^3, 10^−5, 1, 0.7, 2^0) 0.0039767 |
| INFY (226X6,525X6) | 0.061211 (10^3, 0.1, 2^−5) 0.98812 | 0.040511 (10^−1, 0.9, 2^−3) 0.0482 | 0.047693 (10^5, 10^−1, 0.1, 1, 2^−5) 0.02781 | 0.041628 (10^3, 0.1, 0.5, 2^−5) 0.05389 | 0.040633 (10^0, 0.9, 0.1, 2^−3) 0.06636 | 0.040282 (10^−1, 10^−5, 0.9, 0.1, 2^−5) 0.0205775 |
| ONGC_NS (221X6,514X6) | 0.029434 (10^1, 0.1, 2^−5) 0.91666 | 0.028493 (10^0, 0.1, 2^−3) 0.05983 | 0.044317 (10^3, 10^−1, 0.1, 1, 2^−5) 0.02583 | 0.02502 (10^3, 0.1, 0.5, 2^−5) 0.05644 | 0.029634 (10^1, 0.9, 0.1, 2^−3) 0.06879 | 0.023333 (10^1, 10^−5, 0.1, 0.1, 2^−5) 0.0232562 |
| XOM (226X6,525X6) | 0.03609 (10^1, 0.1, 2^−3) 0.97152 | 0.033274 (10^−1, 0.1, 2^−3) 0.04656 | 0.05045 (10^5, 10^−1, 0.1, 1, 2^−3) 0.034 | 0.033264 (10^1, 0.1, 0.5, 2^−3) 0.05051 | 0.033344 (10^−1, 0.1, 0.7, 2^−3) 0.05523 | 0.032806 (10^−3, 10^−5, 0.1, 0.1, 2^−5) 0.0252035 |
| ATX (222X6,515X6) | 0.413258 (10^−1, 0.1, 2^0) 0.93081 | 0.155567 (10^−1, 0.1, 2^−1) 0.04733 | 0.475983 (10^−1, 10^−1, 0.1, 1, 2^1) 0.03145 | 0.129202 (10^3, 0.1, 0.2, 2^−3) 0.05126 | 0.161982 (10^0, 0.1, 0.3, 2^−1) 0.09139 | 0.067275 (10^−1, 10^−5, 0.9, 0.7, 2^−3) 0.0260819 |
| BSESN (220X6,513X6) | 0.197459 (10^0, 0.1, 2^−1) 0.92109 | 0.080313 (10^−1, 0.1, 2^−1) 0.05851 | 0.117697 (10^1, 10^−1, 0.1, 1, 2^−3) 0.0298 | 0.034469 (10^3, 0.1, 0.7, 2^−5) 0.04303 | 0.08503 (10^0, 0.1, 0.3, 2^−1) 0.0663 | 0.04276 (10^−1, 10^−5, 0.1, 0.1, 2^−3) 0.0254383 |
| DJI (226X6,525X6) | 0.437188 (10^−1, 0.1, 2^1) 0.98217 | 0.252219 (10^−1, 0.1, 2^−1) 0.05257 | 0.445899 (10^−1, 10^−1, 0.1, 0.1, 2^1) 0.02312 | 0.134225 (10^1, 0.1, 0.8, 2^−5) 0.04319 | 0.304002 (10^−1, 0.1, 0.1, 2^0) 0.06687 | 0.106785 (10^0, 10^−5, 0.1, 0.1, 2^−3) 0.0278918 |
| GDAXI (227X6,528X6) | 0.033826 (10^5, 0.1, 2^−5) 1.00256 | 0.032456 (10^0, 0.9, 2^−3) 0.06494 | 0.059894 (10^3, 10^−1, 0.1, 1, 2^−5) 0.03418 | 0.050758 (10^1, 0.1, 0.6, 2^−1) 0.05458 | 0.036447 (10^1, 0.9, 0.3, 2^−3) 0.07007 | 0.028567 (10^−1, 10^−5, 0.9, 0.1, 2^−5) 0.0259832 |
| MXX (225X6,525X6) | 0.096186 (10^5, 0.1, 2^−5) 0.98059 | 0.207771 (10^−3, 0.9, 2^−1) 0.0454 | 0.15942 (10^5, 10^−1, 0.1, 1, 2^−3) 0.02529 | 0.170982 (10^3, 0.1, 0.5, 2^−3) 0.05175 | 0.208147 (10^−1, 0.1, 0.1, 2^−1) 0.07338 | 0.070677 (10^−1, 10^−5, 0.1, 0.1, 2^−3) 0.0260515 |
| N225 (220X6,512X6) | 0.100205 (10^5, 0.1, 2^−3) 0.94593 | 0.0391 (10^−1, 0.9, 2^−3) 0.04503 | 0.215338 (10^−1, 10^−1, 0.1, 1, 2^1) 0.01587 | 0.032827 (10^3, 0.1, 0.7, 2^−5) 0.04882 | 0.040257 (10^0, 0.1, 0.1, 2^−3) 0.07263 | 0.032986 (10^−1, 10^−5, 0.1, 0.1, 2^−5) 0.0236554 |
| Wankara (97X10,224X10) | 0.056582 (10^3, 0.1, 2^0) 0.18213 | 0.025151 (10^−1, 0.9, 2^−3) 0.02217 | 0.059292 (10^3, 10^−1, 0.1, 0.1, 2^−5) 0.00379 | 0.031492 (10^3, 0.1, 0.2, 2^−3) 0.01671 | 0.025243 (10^1, 0.9, 0.7, 2^−3) 0.03012 | 0.024392 (10^1, 10^−5, 0.1, 0.1, 2^−3) 0.014395 |
| Laser (298X5,695X5) | 0.057145 (10^3, 0.1, 2^−3) 1.72804 | 0.036527 (10^−1, 0.9, 2^3) 0.08 | 0.076187 (10^5, 10^−2, 0.1, 0.1, 2^−3) 0.04045 | 0.039881 (10^1, 0.1, 0.4, 2^1) 0.18004 | 0.037124 (10^3, 0.9, 0.1, 2^3) 0.14206 | 0.037946 (10^3, 10^−5, 0.1, 0.9, 2^3) 0.0432735 |
| Dee (110X7,255X7) | 0.104984 (10^0, 0.1, 2^−3) 0.22914 | 0.142128 (10^0, 0.1, 2^−5) 0.02127 | 0.129019 (10^1, 10^−1, 1, 0.1, 2^−3) 0.00309 | 0.101998 (10^1, 0.1, 0.7, 2^−5) 0.02589 | 0.145534 (10^1, 0.1, 0.1, 2^−5) 0.03195 | 0.102515 (10^1, 10^−1, 0.9, 0.1, 2^−3) 0.0154981 |
| Friedman (360X6,840X6) | 0.058623 (10^3, 0.1, 2^−1) 2.57757 | 0.044491 (10^−5, 0.1, 2^0) 0.12724 | 0.067515 (10^3, 10^−1, 0.1, 1, 2^−1) 0.08704 | 0.045422 (10^5, 0.1, 0.5, 2^−5) 0.24806 | 0.044433 (10^0, 0.1, 0.5, 2^0) 0.18492 | 0.043238 (10^−5, 10^−5, 0.1, 0.3, 2^−1) 0.0423292 |
| Mortgage (315X16,734X16) | 0.047516 (10^3, 0.1, 2^−1) 1.97172 | 0.007537 (10^3, 0.9, 2^0) 0.11881 | 0.061293 (10^1, 10^−1, 0.1, 0.1, 2^−5) 0.05237 | 0.024022 (10^1, 0.1, 0.8, 2^−3) 0.2937 | 0.007581 (10^1, 0.9, 0.1, 2^0) 0.21581 | 0.00775 (10^−1, 10^−5, 0.9, 0.3, 2^1) 0.0464666 |
| NNGC1_dataset_E1_V1_001 (111X5,259X5) | 0.182661 (10^1, 0.1, 2^1) 0.2343 | 0.180173 (10^−1, 0.9, 2^1) 0.01953 | 0.172927 (10^1, 10^−1, 0.1, 1, 2^3) 0.00626 | 0.181204 (10^1, 0.1, 0.6, 2^1) 0.01454 | 0.180518 (10^0, 0.1, 0.1, 2^1) 0.02345 | 0.17386 (10^0, 10^0, 0.9, 0.1, 2^3) 0.0142513 |
| NNGC1_dataset_F1_V1_008 (269X5,626X5) | 0.123395 (10^1, 0.1, 2^5) 1.41931 | 0.073928 (10^3, 0.1, 2^5) 0.13796 | 0.114369 (10^1, 10^−1, 0.1, 1, 2^5) 0.02353 | 0.104162 (10^1, 0.1, 0.5, 2^5) 0.07411 | 0.073928 (10^5, 0.1, 0.5, 2^5) 0.10784 | 0.073928 (10^5, 10^−3, 0.1, 0.5, 2^5) 0.0428835 |
| NNGC1_dataset_F1_V1_009 (269X5,626X5) | 0.092724 (10^5, 0.1, 2^1) 1.44621 | 0.06457 (10^−3, 0.9, 2^5) 0.07942 | 0.092574 (10^5, 10^−1, 0.1, 1, 2^1) 0.02105 | 0.076728 (10^1, 0.1, 0.7, 2^3) 0.0875 | 0.064281 (10^−1, 0.9, 0.1, 2^5) 0.08413 | 0.064281 (10^−1, 10^−3, 0.9, 0.1, 2^5) 0.0320651 |
| NNGC1_dataset_F1_V1_010 (269X5,626X5) | 0.086069 (10^5, 0.1, 2^−3) 1.41684 | 0.044163 (10^−3, 0.9, 2^5) 0.10058 | 0.084937 (10^5, 10^−1, 0.1, 1, 2^1) 0.01928 | 0.06221 (10^1, 0.1, 0.7, 2^3) 0.07695 | 0.04426 (10^0, 0.1, 0.1, 2^5) 0.0761 | 0.04426 (10^0, 10^−3, 0.1, 0.1, 2^5) 0.0402401 |
| NNGC1_dataset_F1_V1_006 (521X5,1214X5) | 0.060847 (10^5, 0.1, 2^−1) 5.75965 | 0.037313 (10^−1, 0.9, 2^5) 0.31331 | 0.065632 (10^5, 10^−1, 0.1, 1, 2^1) 0.08754 | 0.045866 (10^1, 0.1, 0.6, 2^3) 0.3148 | 0.037242 (10^0, 0.9, 0.1, 2^5) 0.28168 | 0.037242 (10^0, 10^−3, 0.9, 0.1, 2^5) 0.10657 |
| NN5_Complete_109 (237X5,550X5) | 0.109583 (10^0, 0.1, 2^3) 1.09254 | 0.107237 (10^−1, 0.9, 2^3) 0.06317 | 0.110924 (10^1, 10^−1, 0.1, 0.1, 2^3) 0.01518 | 0.102385 (10^1, 0.1, 0.6, 2^3) 0.05181 | 0.106897 (10^0, 0.1, 0.1, 2^3) 0.07392 | 0.095636 (10^0, 10^0, 0.3, 0.1, 2^5) 0.026553 |
| NN5_Complete_104 (237X5,550X5) | 0.174167 (10^−1, 0.1, 2^3) 1.07709 | 0.144628 (10^−3, 0.1, 2^3) 0.05759 | 0.153973 (10^1, 10^−1, 0.1, 1, 2^5) 0.01743 | 0.154479 (10^1, 0.1, 0.8, 2^3) 0.05446 | 0.143688 (10^0, 0.9, 0.1, 2^3) 0.06773 | 0.142988 (10^0, 10^0, 0.1, 0.1, 2^5) 0.030209 |
| NN5_Complete_106 (237X5,550X5) | 0.189308 (10^0, 0.1, 2^3) 1.10129 | 0.174154 (10^−1, 0.1, 2^3) 0.06459 | 0.165296 (10^1, 10^−1, 0.1, 1, 2^5) 0.01407 | 0.187904 (10^1, 0.1, 0.6, 2^3) 0.05242 | 0.171795 (10^−1, 0.1, 0.1, 2^3) 0.06597 | 0.15994 (10^0, 10^0, 0.9, 0.1, 2^5) 0.0335387 |
| NN5_Complete_103 (237X5,550X5) | 0.128696 (10^0, 0.1, 2^−1) 1.08169 | 0.130192 (10^0, 0.1, 2^−1) 0.06974 | 0.131486 (10^1, 10^−1, 0.1, 0.1, 2^1) 0.02025 | 0.128318 (10^1, 0.1, 0.4, 2^−3) 0.06604 | 0.129899 (10^−5, 0.1, 0.3, 2^−1) 0.06392 | 0.129068 (10^0, 10^−1, 0.1, 0.3, 2^0) 0.027654 |
| NN5_Complete_101 (237X5,550X5) | 0.149576 (10^0, 0.1, 2^1) 1.08068 | 0.154734 (10^0, 0.9, 2^0) 0.06989 | 0.150469 (10^1, 10^−1, 0.1, 0.1, 2^3) 0.01455 | 0.14708 (10^1, 0.1, 0.6, 2^1) 0.05376 | 0.155191 (10^1, 0.1, 0.1, 2^0) 0.06886 | 0.144747 (10^0, 10^0, 0.9, 0.1, 2^3) 0.0264708 |
| NN5_Complete_105 (237X5,550X5) | 0.127845 (10^0, 0.1, 2^1) 1.08763 | 0.13341 (10^−1, 0.9, 2^1) 0.04819 | 0.130747 (10^1, 10^−1, 1, 0.1, 2^3) 0.0148 | 0.129324 (10^1, 0.1, 0.4, 2^1) 0.0635 | 0.134297 (10^0, 0.1, 0.1, 2^1) 0.09553 | 0.126683 (10^0, 10^0, 0.9, 0.1, 2^5) 0.031413 |
| NN5_Complete_111 (237X5,550X5) | 0.089431 (10^0, 0.1, 2^3) 1.09501 | 0.092115 (10^−5, 0.9, 2^3) 0.07277 | 0.087204 (10^1, 10^−1, 0.1, 0.1, 2^5) 0.0199 | 0.094835 (10^1, 0.1, 0.2, 2^3) 0.0468 | 0.092115 (10^−5, 0.1, 0.1, 2^3) 0.06021 | 0.08508 (10^−1, 10^0, 0.1, 0.1, 2^5) 0.0297235 |
| D1dat_1_2000 (599X5,1396X5) | 0.091946 (10^3, 0.1, 2^−1) 7.51017 | 0.135055 (10^−1, 0.9, 2^3) 0.51861 | 0.10349 (10^3, 10^−1, 0.1, 1, 2^−3) 0.10285 | 0.115531 (10^3, 0.1, 0.4, 2^1) 0.39425 | 0.145437 (10^0, 0.1, 0.9, 2^3) 0.32529 | 0.121417 (10^−1, 10^−5, 0.1, 0.1, 2^1) 0.117241 |
| Vineyard (16X4,36X4) | 0.141193 (10^0, 0.1, 2^0) 0.00798 | 0.298839 (10^0, 0.9, 2^−1) 0.01456 | 0.234064 (10^1, 10^−1, 0.1, 0.1, 2^1) 0.00015 | 0.188534 (10^1, 0.1, 0.1, 2^1) 0.00692 | 0.241009 (10^1, 0.9, 0.3, 2^−3) 0.01197 | 0.240423 (10^0, 10^1, 0.9, 0.5, 2^1) 0.0103753 |
| COVID-19_spain (251X5,585X5) | 0.167119 (10^3, 0.1, 2^1) 1.21284 | 0.154007 (10^−3, 0.5, 2^5) 0.07972 | 0.167119 (10^−5, 10^−3, 0.1, 0.1, 2^−5) 0.01419 | 0.156555 (10^5, 0.1, 0.7, 2^5) 0.04876 | 0.154173 (10^−1, 0.1, 0.9, 2^5) 0.04778 | 0.154061 (10^−5, 10^−5, 0.9, 0.1, 2^5) 0.024674 |
The best result is shown in boldface
Table 9.
Average ranks of SVR, TSVR, -AHSVR, -SVQR, HN-TSVR and RHN-TSVR based on RMSE values using the Gaussian kernel with noise 0% on real-world datasets
| Datasets | SVR | TSVR | -AHSVR | -SVQR | HN-TSVR | RHN-TSVR |
|---|---|---|---|---|---|---|
| Forestfires | 5.5 | 2.5 | 5.5 | 1 | 2.5 | 4 |
| Machine CPU | 4 | 2 | 6 | 5 | 3 | 1 |
| Auto-original | 6 | 5 | 1 | 2 | 3.5 | 3.5 |
| Winequality | 3 | 4 | 1 | 2 | 6 | 5 |
| SantafeA | 5 | 2 | 6 | 4 | 3 | 1 |
| Gas_furnace | 5 | 3 | 6 | 4 | 2 | 1 |
| Quake | 6 | 4 | 1 | 5 | 2 | 3 |
| Flex_robotarm | 5 | 3.5 | 6 | 2 | 3.5 | 1 |
| S&P500 | 5 | 3 | 6 | 1 | 4 | 2 |
| Space-Ga | 5 | 3.5 | 6 | 1 | 3.5 | 2 |
| Gauss1 | 6 | 2 | 5 | 4 | 1 | 3 |
| Chwirut2 | 5 | 1 | 6 | 4 | 2 | 3 |
| Roszman1 | 5 | 2 | 3 | 4 | 1 | 6 |
| INFY | 6 | 2 | 5 | 4 | 3 | 1 |
| ONGC_NS | 4 | 3 | 6 | 2 | 5 | 1 |
| XOM | 5 | 3 | 6 | 2 | 4 | 1 |
| ATX | 5 | 3 | 6 | 2 | 4 | 1 |
| BSESN | 6 | 3 | 5 | 1 | 4 | 2 |
| DJI | 5 | 3 | 6 | 2 | 4 | 1 |
| GDAXI | 3 | 2 | 6 | 5 | 4 | 1 |
| MXX | 2 | 5 | 3 | 4 | 6 | 1 |
| N225 | 5 | 3 | 6 | 1 | 4 | 2 |
| Wankara | 5 | 2 | 6 | 4 | 3 | 1 |
| Laser | 5 | 1 | 6 | 4 | 2 | 3 |
| Dee | 3 | 5 | 4 | 1 | 6 | 2 |
| Friedman | 5 | 3 | 6 | 4 | 2 | 1 |
| Mortgage | 5 | 1 | 6 | 4 | 2 | 3 |
| NNGC1_dataset_E1_V1_001 | 6 | 3 | 1 | 5 | 4 | 2 |
| NNGC1_dataset_F1_V1_008 | 6 | 3 | 5 | 4 | 1.5 | 1.5 |
| NNGC1_dataset_F1_V1_009 | 6 | 3 | 5 | 4 | 1.5 | 1.5 |
| NNGC1_dataset_F1_V1_010 | 6 | 1 | 5 | 4 | 2.5 | 2.5 |
| NNGC1_dataset_F1_V1_006 | 5 | 3 | 6 | 4 | 1.5 | 1.5 |
| NN5_Complete_109 | 5 | 4 | 6 | 2 | 3 | 1 |
| NN5_Complete_104 | 6 | 3 | 4 | 5 | 2 | 1 |
| NN5_Complete_106 | 6 | 4 | 2 | 5 | 3 | 1 |
| NN5_Complete_103 | 2 | 5 | 6 | 1 | 4 | 3 |
| NN5_Complete_101 | 3 | 5 | 4 | 2 | 6 | 1 |
| NN5_Complete_105 | 2 | 5 | 4 | 3 | 6 | 1 |
| NN5_Complete_111 | 3 | 4 | 2 | 6 | 5 | 1 |
| D1dat_1_2000 | 1 | 5 | 2 | 3 | 6 | 4 |
| Vineyard | 1 | 6 | 3 | 2 | 5 | 4 |
| COVID-19_spain | 5.5 | 1 | 5.5 | 4 | 3 | 2 |
| Average rank | 4.5952381 | 3.1309524 | 4.6666667 | 3.1666667 | 3.4285714 | 2.0119048 |
Fig. 6.
Prediction over the testing dataset by SVR, TSVR, -AHSVR, -SVQR, HN-TSVR and RHN-TSVR on the Machine CPU dataset with 0% noise. Gaussian kernel was used
Fig. 7.
Prediction over the testing dataset by SVR, TSVR, -AHSVR, -SVQR, HN-TSVR and RHN-TSVR on the Gas furnace dataset with 0% noise. Gaussian kernel was used
Now, a statistical analysis is carried out through the Friedman test (Demsar [17]) using the average ranks of the algorithms given in Table 9. Here, we take the null hypothesis : all the reported algorithms, i.e. SVR, TSVR, -AHSVR, -SVQR, HN-TSVR and RHN-TSVR, are equivalent.
Based on Table 9, both and are computed as shown below:
and
Here, the critical value , corresponding to degrees of freedom at the probability level , is smaller than . So the null hypothesis is rejected, and the pairwise Nemenyi post-hoc test is further performed. The critical difference at significance level is calculated as
Critical difference = ,
where and are the numbers of reported algorithms and datasets respectively, and (Demsar [17]).
The following points can be concluded from this Nemenyi test:
The differences between the average rank of the proposed RHN-TSVR and those of SVR and TSVR are and respectively. Both differences are greater than the critical difference, which shows that RHN-TSVR is superior to SVR and TSVR.
The average rank differences of RHN-TSVR with respect to -AHSVR and -SVQR are and respectively, which are also greater than the critical difference. Hence, this justifies the effectiveness of the proposed RHN-TSVR.
Finally, the difference of average ranks between RHN-TSVR and HN-TSVR is , which is also higher than the critical difference; this indicates that the proposed RHN-TSVR outperforms HN-TSVR.
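The statistics used above follow Demsar's procedure: for k algorithms and N datasets the Friedman statistic is chi2_F = 12N/(k(k+1)) * (sum_j R_j^2 - k(k+1)^2/4), the Iman-Davenport correction is F_F = (N-1) chi2_F / (N(k-1) - chi2_F), and the Nemenyi critical difference is CD = q_alpha * sqrt(k(k+1)/(6N)). A sketch applying these formulas to the average ranks of Table 9 (k = 6, N = 42; q_0.05 = 2.850 is the tabulated value for six algorithms in Demsar [17]):

```python
import math

def friedman_stats(avg_ranks, n_datasets):
    """Friedman chi-square and Iman-Davenport F statistics (Demsar, 2006)."""
    k, n = len(avg_ranks), n_datasets
    chi2 = 12.0 * n / (k * (k + 1)) * (
        sum(r * r for r in avg_ranks) - k * (k + 1) ** 2 / 4.0)
    ff = (n - 1) * chi2 / (n * (k - 1) - chi2)
    return chi2, ff

def nemenyi_cd(k, n_datasets, q_alpha):
    """Nemenyi critical difference for k algorithms over n datasets."""
    return q_alpha * math.sqrt(k * (k + 1) / (6.0 * n_datasets))

# Average ranks from Table 9 (noise level 0%), 42 datasets, 6 algorithms.
ranks = [4.5952381, 3.1309524, 4.6666667, 3.1666667, 3.4285714, 2.0119048]
chi2, ff = friedman_stats(ranks, 42)
cd = nemenyi_cd(6, 42, q_alpha=2.850)  # q_0.05 for k = 6 from Demsar's table
print(chi2, ff, cd)
```

Two algorithms are judged significantly different whenever their average ranks differ by more than CD, which is the comparison made in the bullet points above.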
At Significant Noise Level 5%
In this section, the applicability of the RHN-TSVR model is verified on the real-world datasets at noise level 5%. The results for all 42 benchmark datasets are reported in Table 10, which lists the RMSE values together with the computational time of every approach. The RHN-TSVR model shows the best performance in 18 cases. The average ranks based on the RMSE values are computed in Table 11 for the statistical test, and RHN-TSVR again has the lowest average rank. Overall, RHN-TSVR appears to be one of the better choices for noisy datasets. The prediction graphs for the Machine CPU and Gas furnace datasets at noise level 5% are shown in Figs. 8 and 9 respectively; they make it readily apparent that the output of RHN-TSVR stays closer to the desired output.
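This excerpt does not spell out the exact corruption mechanism behind the "significant noise level". A common scheme, assumed here purely for illustration, is to pick the stated fraction of training targets at random and perturb them with zero-mean Gaussian noise; the function name and parameters below are hypothetical:

```python
import random

def add_target_noise(y, level, sigma, seed=0):
    """Perturb a fraction `level` of the targets with zero-mean Gaussian
    noise of standard deviation `sigma`. Illustrative scheme only; the
    paper's exact corruption procedure may differ."""
    rng = random.Random(seed)
    y_noisy = list(y)
    n_corrupt = int(round(level * len(y)))
    for i in rng.sample(range(len(y)), n_corrupt):  # distinct indices
        y_noisy[i] += rng.gauss(0.0, sigma)
    return y_noisy

# Corrupt 5% of 200 synthetic targets.
y = [0.01 * t for t in range(200)]
y5 = add_target_noise(y, level=0.05, sigma=0.1)
changed = sum(1 for a, b in zip(y, y5) if a != b)
print(changed)
```

Under this scheme the 0%, 5% and 10% settings differ only in the `level` argument, so the same train/test split can be reused across all three noise settings.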
Table 10.
Performance comparison of SVR, TSVR, -AHSVR, -SVQR, HN-TSVR and RHN-TSVR using Gaussian kernel with noise 5% on real world datasets
| Dataset (Train size, Test size) | SVR RMSE (params) Time | TSVR RMSE (params) Time | -AHSVR RMSE (params) Time | -SVQR RMSE (params) Time | HN-TSVR RMSE (params) Time | RHN-TSVR RMSE (params) Time |
|---|---|---|---|---|---|---|
| Forestfires (150X13,367X13) | 0.070592 (10^−5, 0.1, 2^5) 0.229976 | 0.0842079 (10^−4, 0.9, 2^5) 0.09435 | 0.070588 (10^−5, 10^−3, 0.1, 0.1, 2^5) 0.0112 | 0.069557 (10^−1, 0.9, 0.1, 2^5) 0.08134 | 0.08422 (10^3, 0.9, 0.1, 2^5) 0.22418 | 0.07057 (10^−5, 10^5, 0.9, 0.1, 2^5) 0.012158 |
| Machine CPU (100X8,109X8) | 0.052104 (10^5, 0.1, 2^−5) 0.11723 | 0.0485319 (10^−1, 0.1, 2^−2) 0.04732 | 0.07941 (10^5, 10^−3, 0.1, 0.1, 2^−3) 0.0042 | 0.072152 (10^5, 0.1, 0.5, 2^−5) 0.04398 | 0.05934 (10^0, 0.1, 0.1, 2^−1) 0.05237 | 0.04173 (10^0, 10^−5, 0.1, 0.1, 2^−3) 0.0118531 |
| Auto-original (100X8,292X8) | 0.155361 (10^3, 0.1, 2^−5) 0.100113 | 0.16358 (10^−2, 0.9, 2^−1) 0.04578 | 0.156916 (10^3, 10^−1, 0.1, 1, 2^−1) 0.01078 | 0.157831 (10^1, 0.1, 0.5, 2^−3) 0.04071 | 0.1639 (10^−1, 0.1, 0.5, 2^−1) 0.13337 | 0.15585 (10^0, 10^−5, 0.1, 0.1, 2^−3) 0.0137573 |
| Winequality (500X12,4398X12) | 0.158162 (10^0, 0.1, 2^−1) 2.7085 | 0.181833 (10^−1, 0.9, 2^−2) 0.92887 | 0.160284 (10^3, 10^−1, 0.1, 0.1, 2^−3) 0.1069 | 0.225503 (10^3, 0.1, 0.5, 2^−5) 1.41081 | 0.17557 (10^0, 0.9, 0.3, 2^−3) 0.991 | 0.1538 (10^0, 10^−1, 0.9, 0.1, 2^0) 0.0646298 |
| SantafeA (500X6,495X6) | 0.10608 (10^0, 0.1, 2^3) 3.07441 | 0.0665399 (10^−1, 0.9, 2^2) 0.84628 | 0.061044 (10^5, 10^−2, 0.1, 1, 2^1) 0.13854 | 0.082625 (10^1, 0.1, 0.4, 2^3) 1.50956 | 0.06684 (10^−1, 0.9, 0.1, 2^3) 0.97786 | 0.06568 (10^0, 10^−5, 0.9, 0.1, 2^1) 0.0511625 |
| Gas_furnace (150X7,143X7) | 0.094555 (10^0, 0.1, 2^−3) 0.228517 | 0.0603093 (10^−1, 0.7, 2^−4) 0.09493 | 0.061808 (10^5, 10^−3, 0.1, 1, 2^−5) 0.01632 | 0.080781 (10^1, 0.1, 0.1, 2^−5) 0.09148 | 0.05582 (10^0, 0.1, 0.1, 2^−3) 0.11075 | 0.04891 (10^0, 10^−5, 0.1, 0.1, 2^−5) 0.0181955 |
| Quake (500X4,1678X4) | 0.186957 (10^−1, 0.1, 2^−1) 2.76314 | 0.185072 (10^0, 0.9, 2^−5) 0.82747 | 0.185139 (10^−1, 10^−1, 0.1, 0.1, 2^−3) 0.15029 | 0.19368 (10^−1, 0.5, 0.1, 2^−5) 1.63681 | 0.1851 (10^0, 0.9, 0.3, 2^−5) 0.91814 | 0.18506 (10^0, 10^1, 0.1, 0.3, 2^−5) 0.0648712 |
| Flex_robotarm (500X10,519X10) | 0.044356 (10^3, 0.1, 2^−5) 2.91903 | 0.0455536 (10^−1, 0.9, 2^−1) 0.88575 | 0.056341 (10^5, 10^−1, 0.1, 0.1, 2^−5) 0.11533 | 0.029939 (10^3, 0.1, 0.1, 2^−5) 1.56187 | 0.04608 (10^0, 0.9, 0.1, 2^−1) 0.92991 | 0.02582 (10^0, 10^−5, 0.9, 0.1, 2^−1) 0.0688442 |
| S&P500 (200X6,550X6) | 0.039455 (10^1, 0.1, 2^−5) 0.413842 | 0.0264072 (10^−1, 0.9, 2^−5) 0.15642 | 0.045908 (10^5, 10^−1, 0.1, 1, 2^−5) 0.02588 | 0.027269 (10^3, 0.1, 0.4, 2^−5) 0.191 | 0.02683 (10^0, 0.1, 0.1, 2^−5) 0.17724 | 0.02663 (10^−1, 10^−5, 0.1, 0.1, 2^−5) 0.0197694 |
| Space-Ga (500X7,2607X7) | 0.29754 (10^0, 0.1, 2^−5) 2.78144 | 0.327565 (10^0, 0.1, 2^5) 1.01007 | 0.292501 (10^1, 10^−1, 1, 0.1, 2^−5) 0.12059 | 0.273409 (10^−1, 0.1, 0.6, 2^−1) 1.32311 | 0.33016 (10^0, 0.1, 0.1, 2^5) 0.97394 | 0.28284 (10^0, 10^−1, 0.9, 0.1, 2^5) 0.0719253 |
| Gauss1 (75X2,175X2) | 0.399263 (10^1, 0.1, 2^−5) 0.062618 | 0.391528 (10^3, 0.9, 2^0) 0.01613 | 0.414253 (10^3, 10^−1, 0.1, 1.345, 2^−5) 0.0006 | 0.409457 (10^1, 0.5, 0.7, 2^1) 0.01021 | 0.391532 (10^5, 0.9, 0.1, 2^0) 0.02981 | 0.389198 (10^3, 10^−1, 0.1, 0.3, 2^1) 0.0072498 |
| Chwirut2 (17X2,37X2) | 0.097839 (10^3, 0.1, 2^3) 0.005741 | 0.116295 (10^−1, 0.9, 2^−3) 0.01351 | 0.093916 (10^1, 10^−3, 0.1, 1, 2^5) 0.0002 | 0.087731 (10^5, 0.3, 0.3, 2^3) 0.00406 | 0.114574 (10^0, 0.9, 0.3, 2^−3) 0.02547 | 0.108905 (10^1, 10^−5, 0.1, 0.1, 2^3) 0.004341 |
| Roszman1 (8X2,17X2) | 0.079154 (10^2, 0.0001, 2^3) 0.002892 | 0.049779 (10^−1, 1, 2^0) 0.01399 | 0.212352 (10^2, 10^−3, 0.1, 1, 2^5) 0.00008 | 0.292596 (10^5, 0.3, 0.3, 2^3) 0.00387 | 0.047258 (10^0, 1, 0.3, 2^0) 0.02349 | 0.048173 (10^1, 10^−5, 1, 0.1, 2^0) 0.004168 |
| INFY (226X6,525X6) | 0.048422 (10^3, 0.1, 2^−5) 0.512362 | 0.044445 (10^0, 0.3, 2^−5) 0.06776 | 0.050969 (10^5, 10^−1, 0.1, 1, 2^−5) 0.02874 | 0.046417 (10^3, 0.1, 0.6, 2^−3) 0.05323 | 0.043197 (10^0, 0.9, 0.1, 2^−5) 0.05714 | 0.043197 (10^0, 10^−3, 0.9, 0.1, 2^−5) 0.0147357 |
| ONGC_NS (221X6,514X6) | 0.040992 (10^3, 0.1, 2^−5) 0.545728 | 0.036287 (10^0, 0.1, 2^−5) 0.05809 | 0.045566 (10^5, 10^−1, 0.1, 0.1, 2^−5) 0.02695 | 0.035167 (10^3, 0.1, 0.4, 2^−5) 0.05503 | 0.036136 (10^1, 0.9, 0.1, 2^−5) 0.06659 | 0.035021 (10^0, 10^−5, 0.9, 0.1, 2^−5) 0.0200796 |
| XOM (226X6,525X6) | 0.044282 (10^1, 0.1, 2^−3) 0.52792 | 0.040461 (10^−3, 0.9, 2^−3) 0.05028 | 0.046998 (10^3, 10^−1, 0.1, 0.1, 2^−3) 0.01829 | 0.040378 (10^1, 0.1, 0.5, 2^−3) 0.05652 | 0.040412 (10^−1, 0.1, 0.1, 2^−3) 0.10638 | 0.040412 (10^−1, 10^−3, 0.1, 0.1, 2^−3) 0.0254784 |
| ATX (222X6,515X6) | 0.027291 (10^1, 0.1, 2^−3) 0.512387 | 0.025448 (10^0, 0.9, 2^−5) 0.05701 | 0.037052 (10^3, 10^−1, 0.1, 1, 2^−3) 0.03231 | 0.02157 (10^3, 0.1, 0.4, 2^−3) 0.04981 | 0.023878 (10^0, 0.5, 0.1, 2^−5) 0.06775 | 0.024036 (10^0, 10^−5, 0.9, 0.1, 2^−5) 0.020381 |
| BSESN (220X6,513X6) | 0.027462 (10^1, 0.1, 2^−3) 0.529594 | 0.02329 (10^−1, 0.9, 2^−5) 0.05242 | 0.030826 (10^5, 10^−1, 0.1, 1, 2^−5) 0.01959 | 0.025185 (10^1, 0.1, 0.6, 2^−5) 0.04508 | 0.023207 (10^0, 0.9, 0.1, 2^−5) 0.06566 | 0.022333 (10^0, 10^−5, 0.9, 0.1, 2^−5) 0.0285901 |
| DJI (226X6,525X6) | 0.024042 (10^1, 0.1, 2^−5) 0.541827 | 0.02023 (10^−1, 0.9, 2^−5) 0.04965 | 0.021978 (10^3, 10^−1, 0.1, 1, 2^−3) 0.01969 | 0.040544 (10^1, 0.1, 0.7, 2^−3) 0.0463 | 0.019661 (10^0, 0.1, 0.1, 2^−5) 0.06078 | 0.019661 (10^0, 10^−3, 0.1, 0.1, 2^−5) 0.0217884 |
| GDAXI (227X6,528X6) | 0.029271 (10^1, 0.1, 2^−5) 0.615071 | 0.028296 (10^−1, 0.1, 2^−5) 0.05101 | 0.032066 (10^3, 10^−1, 1, 1, 2^−5) 0.03326 | 0.026 (10^3, 0.1, 0.5, 2^−5) 0.05606 | 0.028228 (10^0, 0.9, 0.1, 2^−5) 0.07403 | 0.027102 (10^0, 10^−5, 0.9, 0.1, 2^−5) 0.0287863 |
| MXX (225X6,525X6) | 0.042338 (10^1, 0.1, 2^−3) 0.559407 | 0.03836 (10^−1, 0.9, 2^−3) 0.04882 | 0.044005 (10^3, 10^−1, 0.1, 1, 2^−3) 0.01992 | 0.040381 (10^1, 0.1, 0.6, 2^−3) 0.0505 | 0.038028 (10^0, 0.1, 0.1, 2^−3) 0.12634 | 0.036046 (10^0, 10^−5, 0.1, 0.1, 2^−5) 0.0187388 |
| N225 (220X6,512X6) | 0.03575 (10^1, 0.1, 2^−5) 0.497952 | 0.033073 (10^−1, 0.9, 2^−5) 0.0423 | 0.034927 (10^3, 10^−1, 1, 1, 2^−3) 0.02333 | 0.034832 (10^1, 0.1, 0.5, 2^−3) 0.0498 | 0.032898 (10^0, 0.1, 0.1, 2^−5) 0.04164 | 0.033887 (10^0, 10^−5, 0.1, 0.1, 2^−5) 0.0189589 |
| Wankara (97X10,224X10) | 0.064073 (10^5, 0.1, 2^−5) 0.095646 | 0.029472 (10^−1, 0.9, 2^−3) 0.01693 | 0.039629 (10^3, 10^−2, 0.1, 0.1, 2^−5) 0.00491 | 0.034031 (10^1, 0.1, 0.1, 2^−1) 0.01904 | 0.030836 (10^−1, 0.1, 0.1, 2^−3) 0.02922 | 0.030752 (10^0, 10^−5, 0.1, 0.1, 2^−5) 0.0168666 |
| Laser (298X5,695X5) | 0.081302 (10^0, 0.1, 2^1) 1.08662 | 0.04679 (10^−1, 0.1, 2^3) 0.10288 | 0.051436 (10^3, 10^−3, 1, 1, 2^1) 0.04178 | 0.058167 (10^5, 0.1, 0.4, 2^−5) 0.29224 | 0.042864 (10^0, 0.1, 0.1, 2^1) 0.08458 | 0.040318 (10^0, 10^−5, 0.1, 0.1, 2^1) 0.0288311 |
| Dee (110X7,255X7) | 0.107666 (10^0, 0.1, 2^0) 0.12765 | 0.108599 (10^0, 0.9, 2^−5) 0.01828 | 0.100024 (10^1, 10^−1, 1, 0.1, 2^−3) 0.0039 | 0.107159 (10^1, 0.1, 0.3, 2^−1) 0.01765 | 0.109692 (10^1, 0.9, 0.1, 2^−5) 0.02564 | 0.104123 (10^1, 10^−1, 0.9, 0.1, 2^−5) 0.0134618 |
| Friedman (360X6,840X6) | 0.061985 (10^1, 0.1, 2^−1) 1.33317 | 0.050917 (10^−1, 0.9, 2^−1) 0.12606 | 0.070737 (10^3, 10^−1, 1, 1, 2^−1) 0.05842 | 0.049461 (10^5, 0.1, 0.3, 2^−5) 0.42496 | 0.051177 (10^0, 0.1, 0.1, 2^−1) 0.1547 | 0.049195 (10^−5, 10^−5, 0.1, 0.1, 2^−1) 0.0283486 |
| Mortgage (315X16,734X16) | 0.056068 (10^0, 0.1, 2^−5) 1.05694 | 0.018067 (10^−1, 0.1, 2^−1) 0.10178 | 0.037756 (10^3, 10^−3, 0.1, 1, 2^−5) 0.05046 | 0.041226 (10^1, 0.1, 0.6, 2^1) 0.46162 | 0.018797 (10^0, 0.1, 0.1, 2^−1) 0.10658 | 0.01766 (10^0, 10^−5, 0.1, 0.1, 2^−3) 0.0292358 |
| NNGC1_dataset_E1_V1_001 (111X5,259X5) | 0.174273 (10^0, 0.1, 2^1) 0.119395 | 0.180527 (10^−1, 0.1, 2^1) 0.01801 | 0.174901 (10^1, 10^−1, 0.1, 1, 2^3) 0.00866 | 0.181674 (10^1, 0.1, 0.4, 2^1) 0.01282 | 0.180662 (10^0, 0.9, 0.1, 2^1) 0.0269 | 0.174368 (10^0, 10^0, 0.9, 0.1, 2^3) 0.017439 |
| NNGC1_dataset_F1_V1_008 (269X5,626X5) | 0.077357 (10^1, 0.1, 2^3) 0.737512 | 0.057912 (10^−1, 0.9, 2^5) 0.1161 | 0.069382 (10^1, 10^−2, 0.1, 1.345, 2^5) 0.02783 | 0.061483 (10^1, 0.1, 0.4, 2^5) 0.07307 | 0.057788 (10^−1, 0.9, 0.1, 2^5) 0.07614 | 0.057788 (10^−1, 10^−3, 0.9, 0.1, 2^5) 0.0272107 |
| NNGC1_dataset_F1_V1_009 (269X5,626X5) | 0.077045 (10^3, 0.1, 2^−1) 0.735128 | 0.050753 (10^−1, 0.9, 2^3) 0.10377 | 0.080904 (10^1, 10^−1, 0.1, 1, 2^5) 0.02837 | 0.055231 (10^1, 0.1, 0.5, 2^3) 0.06838 | 0.051426 (10^0, 0.9, 0.1, 2^3) 0.09655 | 0.048986 (10^0, 10^−1, 0.9, 0.1, 2^5) 0.022648 |
| NNGC1_dataset_F1_V1_010 (269X5,626X5) | 0.073579 (10^3, 0.1, 2^1) 0.745154 | 0.044397 (10^−1, 0.9, 2^3) 0.12389 | 0.068758 (10^3, 10^−2, 0.1, 1, 2^3) 0.03004 | 0.057324 (10^3, 0.1, 0.3, 2^1) 0.08042 | 0.045481 (10^0, 0.9, 0.1, 2^3) 0.07167 | 0.05176 (10^−1, 10^−5, 0.1, 0.1, 2^3) 0.0318337 |
| NNGC1_dataset_F1_V1_006 (521X5,1214X5) | 0.063795 (10^3, 0.1, 2^1) 3.01858 | 0.041367 (10^−1, 0.9, 2^5) 0.37961 | 0.06495 (10^3, 10^−1, 0.1, 0.1, 2^3) 0.08197 | 0.049231 (10^1, 0.1, 0.3, 2^3) 0.28967 | 0.042234 (10^−1, 0.9, 0.1, 2^5) 0.24892 | 0.047416 (10^0, 10^−5, 0.9, 0.1, 2^3) 0.0936319 |
| NN5_Complete_109 (237X5,550X5) | 0.160178 (10^0, 0.1, 2^1) 0.559677 | 0.169086 (10^−1, 0.1, 2^1) 0.07075 | 0.153803 (10^1, 10^−1, 0.1, 0.1, 2^3) 0.0163 | 0.167964 (10^1, 0.1, 0.5, 2^1) 0.06157 | 0.170163 (10^0, 0.1, 0.1, 2^1) 0.07691 | 0.159525 (10^0, 10^0, 0.1, 0.1, 2^3) 0.0250756 |
| NN5_Complete_104 (237X5,550X5) | 0.25227 (10^0, 0.1, 2^1) 0.557894 | 0.246728 (10^−1, 0.9, 2^0) 0.08172 | 0.223425 (10^1, 10^−1, 0.1, 1, 2^5) 0.01747 | 0.243549 (10^1, 0.1, 0.6, 2^1) 0.06105 | 0.245921 (10^0, 0.9, 0.1, 2^0) 0.08696 | 0.226614 (10^0, 10^0, 0.9, 0.1, 2^5) 0.0190927 |
| NN5_Complete_106 (237X5,550X5) | 0.212644 (10^1, 0.1, 2^1) 0.563886 | 0.227166 (10^−3, 0.3, 2^1) 0.05488 | 0.223541 (10^1, 10^−1, 0.1, 0.1, 2^3) 0.01608 | 0.203542 (10^1, 0.1, 0.5, 2^1) 0.05971 | 0.226991 (10^−1, 0.1, 0.3, 2^1) 0.06767 | 0.207904 (10^0, 10^0, 0.9, 0.1, 2^5) 0.0196831 |
| NN5_Complete_103 (237X5,550X5) | 0.128071 (10^0, 0.1, 2^−1) 0.55103 | 0.129677 (10^0, 0.9, 2^−3) 0.06733 | 0.130405 (10^1, 10^−1, 1, 0.1, 2^−1) 0.02415 | 0.129381 (10^1, 0.3, 0.4, 2^−3) 0.0482 | 0.129794 (10^1, 0.9, 0.1, 2^−3) 0.09831 | 0.129352 (10^1, 10^−1, 0.1, 0.1, 2^−1) 0.0197878 |
| NN5_Complete_101 (237X5,550X5) | 0.150229 (10^0, 0.1, 2^1) 0.566503 | 0.154038 (10^−1, 0.9, 2^1) 0.06516 | 0.154147 (10^1, 10^−1, 0.1, 0.1, 2^1) 0.02724 | 0.146225 (10^1, 0.1, 0.5, 2^1) 0.05404 | 0.153808 (10^−1, 0.9, 0.1, 2^1) 0.07726 | 0.147679 (10^0, 10^0, 0.9, 0.1, 2^3) 0.026214 |
| NN5_Complete_105 (237X5,550X5) | 0.206735 (10^0, 0.1, 2^1) 0.59797 | 0.20295 (10^−1, 0.9, 2^1) 0.07722 | 0.212735 (10^1, 10^−1, 1, 0.1, 2^3) 0.01905 | 0.215317 (10^1, 0.1, 0.3, 2^1) 0.0591 | 0.204263 (10^0, 0.9, 0.1, 2^1) 0.07342 | 0.202491 (10^0, 10^0, 0.1, 0.1, 2^3) 0.0223573 |
| NN5_Complete_111 (237X5,550X5) | 0.18836 (10^0, 0.1, 2^1) 0.555449 | 0.211531 (10^−5, 0.7, 2^1) 0.06467 | 0.218224 (10^1, 10^−1, 1, 0.1, 2^3) 0.01603 | 0.194376 (10^1, 0.1, 0.4, 2^1) 0.06702 | 0.211115 (10^−1, 0.1, 0.1, 2^1) 0.06854 | 0.202472 (10^0, 10^0, 0.1, 0.1, 2^3) 0.0195859 |
| D1dat_1_2000 (599X5,1396X5) | 0.056375 (10^3, 0.1, 2^−5) 3.88997 | 0.039592 (10^−1, 0.1, 2^1) 0.50661 | 0.056756 (10^5, 10^−1, 0.1, 1, 2^−3) 0.11601 | 0.04234 (10^1, 0.1, 0.3, 2^1) 0.37059 | 0.039961 (10^0, 0.1, 0.1, 2^1) 0.46588 | 0.038144 (10^0, 10^−5, 0.9, 0.1, 2^0) 0.0892223 |
| Vineyard (16X4,36X4) | 0.237345 (10^0, 0.1, 2^−1) 0.004569 | 0.236484 (10^1, 0.9, 2^−5) 0.01397 | 0.246205 (10^1, 10^−1, 0.1, 0.1, 2^−1) 0.00014 | 0.213259 (10^3, 0.5, 0.2, 2^−1) 0.00665 | 0.236413 (10^1, 0.9, 0.7, 2^−5) 0.01649 | 0.240828 (10^1, 10^1, 0.7, 0.1, 2^0) 0.0119425 |
| COVID-19_spain (251X5,585X5) | 0.118454 (10^1, 0.1, 2^−5) 0.650153 | 0.097704 (10^−1, 0.1, 2^1) 0.08398 | 0.08227 (10^3, 10^−3, 0.1, 1.345, 2^−3) 0.01543 | 0.081305 (10^1, 0.3, 0.3, 2^−3) 0.06166 | 0.097513 (10^0, 0.1, 0.1, 2^1) 0.0508 | 0.084161 (10^−1, 10^−5, 0.9, 0.1, 2^0) 0.0200923 |
The best result is shown in boldface
Table 11.
Average ranks of SVR, TSVR, -AHSVR, -SVQR, HN-TSVR and RHN-TSVR based on RMSE values using the Gaussian kernel with noise 5% on real-world datasets
| Datasets | SVR | TSVR | -AHSVR | -SVQR | HN-TSVR | RHN-TSVR |
|---|---|---|---|---|---|---|
| Forestfires | 4 | 5 | 3 | 1 | 6 | 2 |
| Machine CPU | 3 | 2 | 6 | 5 | 4 | 1 |
| Auto-original | 1 | 5 | 3 | 4 | 6 | 2 |
| Winequality | 2 | 5 | 3 | 6 | 4 | 1 |
| SantafeA | 6 | 3 | 1 | 5 | 4 | 2 |
| Gas_furnace | 6 | 3 | 4 | 5 | 2 | 1 |
| Quake | 5 | 2 | 4 | 6 | 3 | 1 |
| Flex_robotarm | 3 | 4 | 6 | 2 | 5 | 1 |
| S&P500 | 5 | 1 | 6 | 4 | 3 | 2 |
| Space-Ga | 4 | 5 | 3 | 1 | 6 | 2 |
| Gauss1 | 4 | 2 | 6 | 5 | 3 | 1 |
| Chwirut2 | 3 | 6 | 2 | 1 | 5 | 4 |
| Roszman1 | 4 | 3 | 5 | 6 | 1 | 2 |
| INFY | 5 | 3 | 6 | 4 | 1.5 | 1.5 |
| ONGC_NS | 5 | 4 | 6 | 2 | 3 | 1 |
| XOM | 5 | 4 | 6 | 1 | 2.5 | 2.5 |
| ATX | 5 | 4 | 6 | 1 | 2 | 3 |
| BSESN | 5 | 3 | 6 | 4 | 2 | 1 |
| DJI | 5 | 3 | 4 | 6 | 1.5 | 1.5 |
| GDAXI | 5 | 4 | 6 | 1 | 3 | 2 |
| MXX | 5 | 3 | 6 | 4 | 2 | 1 |
| N225 | 6 | 2 | 5 | 4 | 1 | 3 |
| Wankara | 6 | 1 | 5 | 4 | 3 | 2 |
| Laser | 6 | 3 | 4 | 5 | 2 | 1 |
| Dee | 4 | 5 | 1 | 3 | 6 | 2 |
| Friedman | 5 | 3 | 6 | 2 | 4 | 1 |
| Mortgage | 6 | 2 | 4 | 5 | 3 | 1 |
| NNGC1_dataset_E1_V1_001 | 1 | 4 | 3 | 6 | 5 | 2 |
| NNGC1_dataset_F1_V1_008 | 6 | 3 | 5 | 4 | 1.5 | 1.5 |
| NNGC1_dataset_F1_V1_009 | 5 | 2 | 6 | 4 | 3 | 1 |
| NNGC1_dataset_F1_V1_010 | 6 | 1 | 5 | 4 | 2 | 3 |
| NNGC1_dataset_F1_V1_006 | 5 | 1 | 6 | 4 | 2 | 3 |
| NN5_Complete_109 | 3 | 5 | 1 | 4 | 6 | 2 |
| NN5_Complete_104 | 6 | 5 | 1 | 3 | 4 | 2 |
| NN5_Complete_106 | 3 | 6 | 4 | 1 | 5 | 2 |
| NN5_Complete_103 | 1 | 4 | 6 | 3 | 5 | 2 |
| NN5_Complete_101 | 3 | 5 | 6 | 1 | 4 | 2 |
| NN5_Complete_105 | 4 | 2 | 5 | 6 | 3 | 1 |
| NN5_Complete_111 | 1 | 5 | 6 | 2 | 4 | 3 |
| D1dat_1_2000 | 5 | 2 | 6 | 4 | 3 | 1 |
| Vineyard | 4 | 3 | 6 | 1 | 2 | 5 |
| COVID-19_spain | 6 | 5 | 2 | 1 | 4 | 3 |
| Average rank | 4.333333 | 3.404762 | 4.547619 | 3.4523809 | 3.380952 | 1.880952 |
Fig. 8.
Prediction over the testing dataset by SVR, TSVR, -AHSVR, -SVQR, HN-TSVR and RHN-TSVR on the Machine CPU dataset with 5% noise. Gaussian kernel was used
Fig. 9.
Prediction over the testing dataset by SVR, TSVR, -AHSVR, -SVQR, HN-TSVR and RHN-TSVR on the Gas furnace dataset with 5% noise. Gaussian kernel was used
Further, the Friedman statistical test is performed on the results of Table 11 to validate the efficacy of the proposed RHN-TSVR on noisy data at noise level 5% against SVR, TSVR, -AHSVR, -SVQR and HN-TSVR:
and
Here, the critical value corresponding to degrees of freedom is smaller than , so the null hypothesis is rejected and the pairwise test is performed. The critical difference at significance level for the Nemenyi test is the same as in the previous case, i.e. . A few points can be observed:
The differences between the average rank of RHN-TSVR and those of SVR, TSVR, -AHSVR and -SVQR are all greater than , which shows that RHN-TSVR is superior.
The difference of average ranks between RHN-TSVR and HN-TSVR, i.e. , is also greater than the critical difference , which indicates that the proposed RHN-TSVR is better than HN-TSVR.
At Significant Noise Level 10%
Similar to the 5% case, more noise is added by increasing the noise level to 10% on the real-world datasets in order to test the proposed RHN-TSVR in a noisier environment. The results of SVR, TSVR, -AHSVR, -SVQR, HN-TSVR and the proposed RHN-TSVR at noise level 10% are reported in Table 12, from which it is clearly visible that RHN-TSVR performs best in 18 cases overall. The average ranks of all reported approaches are computed in Table 13, where RHN-TSVR again has the lowest average rank. As in the previous cases, the prediction graphs of the Machine CPU and Gas furnace datasets at noise level 10% are plotted in Figs. 10 and 11, and the same conclusion can be drawn.
Table 12.
Performance comparison of RHN-TSVR with SVR, TSVR, -AHSVR, -SVQR, and HN-TSVR using Gaussian kernel with noise 10% on real world datasets
| Dataset (Train size, Test size) | SVR RMSE (params) Time | TSVR RMSE (params) Time | -AHSVR RMSE (params) Time | -SVQR RMSE (params) Time | HN-TSVR RMSE (params) Time | RHN-TSVR RMSE (params) Time |
|---|---|---|---|---|---|---|
| Forestfires (150X13,367X13) | 0.070556 (10^−5, 0.1, 2^−5) 0.21571 | 0.0871489 (10^−4, 0.9, 2^5) 0.12293 | 0.070377 (10^−4, 10^−3, 0.1, 1.345, 2^5) 0.05959 | 0.068864 (10^−1, 0.9, 0.3, 2^−5) 0.09252 | 0.08716 (10^5, 0.9, 0.5, 2^5) 0.18846 | 0.07046 (10^1, 10^5, 0.1, 0.5, 2^5) 0.010531 |
| Machine CPU (100X8,109X8) | 0.091585 (10^1, 0.1, 2^−5) 0.10812 | 0.0505539 (10^−1, 0.1, 2^−5) 0.16241 | 0.058089 (10^2, 10^−3, 1, 1.345, 2^−5) 0.02408 | 0.035819 (10^3, 0.001, 0.2, 2^−5) 0.01103 | 0.05156 (10^−1, 0.1, 0.1, 2^−5) 0.04804 | 0.05156 (10^−1, 10^−3, 0.1, 0.1, 2^−5) 0.008127 |
| Auto-original (100X8,292X8) | 0.180002 (10^0, 0.1, 2^−1) 0.09423 | 0.16907 (10^−1, 0.9, 2^−1) 0.05376 | 0.172661 (10^5, 10^−2, 1, 1.345, 2^−2) 0.02505 | 0.17522 (10^5, 0.3, 0.2, 2^−1) 0.05034 | 0.16966 (10^0, 0.9, 0.1, 2^−1) 0.04592 | 0.15891 (10^0, 10^−5, 0.9, 0.1, 2^−3) 0.0085984 |
| Winequality (500X12,4398X12) | 0.194176 (10^3, 0.1, 2^−5) 2.74866 | 0.168979 (10^−1, 0.9, 2^−2) 0.88092 | 0.196418 (10^4, 10^−2, 0.1, 1.345, 2^−2) 0.60085 | 0.218789 (10^3, 0.1, 0.5, 2^−5) 1.39773 | 0.17353 (10^0, 0.9, 0.1, 2^−1) 1.26227 | 0.17405 (10^0, 10^−5, 0.1, 0.1, 2^−3) 0.0681787 |
| SantafeA (500X6,495X6) | 0.092708 (10^5, 0.1, 2^−5) 2.94318 | 0.0656795 (10^−1, 0.9, 2^2) 0.98025 | 0.069208 (10^4, 10^−3, 0.1, 1.345, 2^2) 0.45914 | 0.040906 (10^5, 0.1, 0.4, 2^−3) 1.68406 | 0.07466 (10^0, 0.9, 0.1, 2^3) 1.01333 | 0.06537 (10^0, 10^−5, 0.9, 0.1, 2^1) 0.0525275 |
| Gas_furnace (150X7,143X7) | 0.099002 (10^0, 0.1, 2^−5) 0.23578 | 0.0622738 (10^−1, 0.9, 2^−4) 0.10164 | 0.052053 (10^5, 10^−3, 0.1, 1.345, 2^−5) 0.05392 | 0.081241 (10^1, 0.1, 0.1, 2^−5) 0.08915 | 0.05777 (10^0, 0.9, 0.1, 2^−3) 0.0824 | 0.0516 (10^−1, 10^−5, 0.9, 0.1, 2^−5) 0.0185242 |
| Quake (500X4,1678X4) | 0.185914 (10^−1, 0.1, 2^−5) 2.84951 | 0.18575 (10^0, 0.1, 2^−5) 0.96815 | 0.186022 (10^−3, 10^−3, 1, 1.345, 2^−3) 0.35004 | 0.185954 (10^−3, 0.1, 0.3, 2^−3) 1.49636 | 0.18578 (10^0, 0.9, 0.3, 2^−5) 0.89834 | 0.18619 (10^1, 10^5, 0.3, 0.3, 2^−3) 0.0608908 |
|
Flex_robotarm (500X10,519X10) |
0.049069 (101, 0.1,2−3) 2.78802 |
0.0446311 (10−1, 0.9, 2−3) 1.18666 |
0.026773 (105,10−3, 0.1, 1.345, 2−2) 0.37359 |
0.029822 (103,0.1, 0.1, 2−5) 1.46689 |
0.04443 (100, 0.1, 0.1, 2−3) 1.23052 |
0.02832 (100,10−5, 0.9, 0.1, 2−1) 0.0653991 |
|
S&P500 (200X6,550X6) |
0.031468 (101, 0.1,2−5) 0.4072 |
0.0280403 (10−1, 0.9, 2−4) 0.30475 |
0.026838 (104,10−3, 0.1, 1.345, 2−5) 0.22688 |
0.054827 (103,0.1, 0.6, 2−3) 0.18523 |
0.02819 (100, 0.9, 0.1, 2−5) 0.11794 |
0.02665 (100,10−5, 0.9, 0.1, 2−5) 0.0243972 |
|
Space-Ga (500X7,2607X7) |
0.386259 (103, 0.1,21) 2.71523 |
0.322389 (10−5, 0.1, 25) 0.68511 |
0.297924 (102,10−3, 0.1, 1.345, 25) 0.38471 |
0.271389 (10−1,0.1, 0.6, 2−1) 1.26518 |
0.32239 (10−5, 0.1, 0.3, 25) 0.8876 |
0.28372 (101,10−1, 0.5, 0.1, 25) 0.0599814 |
|
Gauss1 (75X2,175X2) |
0.391899 (100,0.1,2−3) 0.06196 |
0.385943 (103, 0.9, 20) 0.01724 |
0.381646 (101,10−3, 0.1, 1.345, 2−4) 0.0329 |
0.420118 (101,0.3, 0.8, 2−1) 0.00907 |
0.385946 (105, 0.9, 0.3, 20) 0.02471 |
0.384733 (105,10 −1, 0.1, 0.5, 21) 0.0053786 |
|
Chwirut2 (17X2,37X2) |
0.095709 (100,0.1,25) 0.00532 |
0.111636 (100, 0.9, 2−3) 0.01408 |
0.094656 (104,10−3, 1, 1.345, 25) 0.00143 |
0.096011 (101,0.1, 0.4, 25) 0.00409 |
0.111998 (101, 0.9, 0.1, 2−3) 0.02217 |
0.093851 (100,10−5, 0.1, 0.1, 23) 0.0063419 |
|
Roszman1 (8X2,17X2) |
0.435543 (102,0.0001,25) 0.00308 |
0.048457 (100, 1, 20) 0.01349 |
0.156611 (102,10−3, 1, 1.345, 25) 0.00038 |
0.298359 (101,0.1, 0.4, 25) 0.00394 |
0.047929 (101, 1, 0.1, 20) 0.01341 |
0.045437 (100,10−5, 1, 0.1, 20) 0.0066292 |
|
INFY (226X6,525X6) |
0.048232 (103,0.1,2−5) 0.54122 |
0.045631 (10−3, 0.7, 2−5) 0.05414 |
0.047149 (105,10−3, 1, 1.345, 2−5) 0.14077 |
0.042672 (101,0.1, 0.4, 2−3) 0.04483 |
0.044563 (100, 0.9, 0.1, 2−5) 0.07999 |
0.046205 (100,10−5, 0.9, 0.1, 2−5) 0.0163618 |
|
ONGC_NS (221X6,514X6) |
0.040414 (101,0.1,2−5) 0.51419 |
0.037272 (10−1, 0.9, 2−5) 0.04756 |
0.035483 (104,10−3, 0.1, 1.345, 2−5) 0.10074 |
0.03802 (105,0.1, 0.2, 2−3) 0.05531 |
0.037329 (100, 0.1, 0.1, 2−5) 0.05609 |
0.0358 (10−1,10−5, 0.1, 0.1, 2−5) 0.0151793 |
|
XOM (226X6,525X6) |
0.043564 (101,0.1,2−5) 0.54421 |
0.040508 (10−3, 0.1, 2−3) 0.05942 |
0.040229 (104,10−3, 1, 1.345, 2−4) 0.17387 |
0.04221 (101,0.1, 0.4, 2−3) 0.04784 |
0.040477 (10−1, 0.9, 0.1, 2−3) 0.04748 |
0.040179 (10−1,10−5, 0.1, 0.1, 2−5) 0.0196942 |
|
ATX (222X6,515X6) |
0.031246 (101,0.1,2−5) 0.50765 |
0.030522 (10−1, 0.1, 2−1) 0.05096 |
0.035835 (105,10−3, 0.1, 1.345, 2−1) 0.09628 |
0.039418 (105,0.1, 0.6, 2−5) 0.05423 |
0.031933 (100, 0.9, 0.1, 2−1) 0.0676 |
0.031411 (100,10−5, 0.9, 0.1, 2−1) 0.0201047 |
|
BSESN (220X6,513X6) |
0.024455 (101,0.1,2−5) 0.50639 |
0.025717 (10−1, 0.3, 2−3) 0.04473 |
0.023904 (104,10−2, 0.1, 1.345, 2−5) 0.09047 |
0.023822 (103,0.1, 0.6, 2−5) 0.04013 |
0.024003 (100, 0.9, 0.1, 2−5) 0.11163 |
0.023631 (100,10−5, 0.9, 0.1, 2−5) 0.0189221 |
|
DJI (226X6,525X6) |
0.060085 (100,0.1,2−1) 0.56475 |
0.038362 (10−1, 0.9, 2−3) 0.04941 |
0.059886 (101,10−2, 0.1, 1.345, 2−1) 0.11015 |
0.050961 (101,0.1, 0.7, 2−1) 0.04794 |
0.038214 (100, 0.9, 0.1, 2−3) 0.06362 |
0.054533 (101,100, 0.1, 0.1, 2−1) 0.0188101 |
|
GDAXI (227X6,528X6) |
0.035349 (101,0.1,2−5) 0.55039 |
0.02908 (10−1, 0.1, 2−5) 0.04355 |
0.02882 (105,10−3, 1, 1.345, 2−5) 0.11852 |
0.029441 (103,0.1, 0.4, 2−5) 0.05947 |
0.029419 (100, 0.9, 0.1, 2−5) 0.06226 |
0.028775 (100,10−5, 0.9, 0.1, 2−5) 0.0240547 |
|
MXX (225X6,525X6) |
0.040878 (101,0.1,2−3) 0.52684 |
0.0394 (10−3, 0.3, 2−3) 0.04676 |
0.040972 (105,10−3, 1, 1.345, 2−3) 0.10279 |
0.034736 (103,0.1, 0.4, 2−5) 0.05477 |
0.040958 (100, 0.9, 0.3, 2−3) 0.06418 |
0.041988 (100,10−5, 0.9, 0.3, 2−3) 0.0191296 |
|
N225 (220X6,512X6) |
0.03902 (101,0.1,2−5) 0.51209 |
0.035597 (10−1, 0.1, 2−5) 0.04948 |
0.035549 (103,10−3, 1, 1.345, 2−4) 0.09 |
0.038293 (101,0.1, 0.5, 2−5) 0.03976 |
0.03552 (10−1, 0.9, 0.1, 2−5) 0.05325 |
0.03552 (10−1,10−3, 0.9, 0.1, 2−5) 0.0193947 |
|
Wankara (97X10,224X10) |
0.063142 (101,0.1,2−5) 0.09453 |
0.030462 (10−1, 0.1, 2−3) 0.01633 |
0.035809 (104,10−3, 0.1, 1.345, 2−5) 0.00244 |
0.044248 (101,0.1, 0.1, 2−3) 0.02061 |
0.03271 (100, 0.1, 0.1, 2−3) 0.02543 |
0.0349 (100,10−5, 0.1, 0.1, 2−5) 0.0199694 |
|
Laser (298X5,695X5) |
0.085416 (101,0.1,20) 1.02388 |
0.047765 (10−1, 0.1, 23) 0.10974 |
0.044438 (105,10−3, 0.1, 1.345, 21) 0.02504 |
0.070032 (101,0.1, 0.4, 2−1) 0.30005 |
0.049892 (100, 0.1, 0.1, 23) 0.09753 |
0.043683 (10−1,10−5, 0.1, 0.1, 21) 0.0249245 |
|
Dee (110X7,255X7) |
0.112566 (100,0.1,20) 0.13122 |
0.105096 (100, 0.9, 2−5) 0.01948 |
0.102947 (102,10−1, 1, 1.345, 2−5) 0.00378 |
0.107498 (101,0.1, 0.7, 2−5) 0.02758 |
0.105666 (101, 0.1, 0.1, 2−5) 0.02575 |
0.104154 (101,10−1, 0.9, 0.1, 2−5) 0.0162945 |
|
Friedman (360X6,840X6) |
0.069441 (100,0.1,2−1) 1.32811 |
0.051352 (10−1, 0.1, 2−1) 0.10366 |
0.053727 (103,10−3, 1, 1.345, 2−1) 0.04133 |
0.048789 (105,0.1, 0.3, 2−5) 0.40358 |
0.051929 (100, 0.1, 0.1, 2−1) 0.14075 |
0.051929 (100,10−3, 0.1, 0.1, 2−1) 0.0346828 |
|
Mortgage (315X16,734X16) |
0.059066 (100,0.1,2−5) 1.09222 |
0.015498 (10−1, 0.3, 2−3) 0.11248 |
0.017121 (105,10−3, 0.1, 1.345, 2−5) 0.02693 |
0.046823 (101,0.1, 0.1, 2−5) 1.69654 |
0.016332 (100, 0.9, 0.1, 2−3) 0.10476 |
0.016587 (100,10−5, 0.9, 0.1, 2−5) 0.0350727 |
|
NNGC1_dataset_E1_V1_001 (111X5,259X5) |
0.188128 (101,0.1,21) 0.12438 |
0.180734 (10−5, 0.1, 20) 0.01945 |
0.17804 (101,10−1, 0.1, 1.345, 22) 0.00346 |
0.179554 (101,0.3, 0.5, 21) 0.01199 |
0.185076 (100, 0.1, 0.3, 21) 0.03176 |
0.17484 (100,100, 0.9, 0.1, 23) 0.0158275 |
|
NNGC1_dataset_F1_V1_008 (269X5,626X5) |
0.076954 (100,0.1,25) 0.74556 |
0.060209 (10−1, 0.9, 25) 0.12007 |
0.060038 (103,10−3, 1, 1.345, 25) 0.02009 |
0.065316 (101,0.1, 0.1, 25) 0.07343 |
0.060003 (10−1, 0.9, 0.1, 25) 0.09681 |
0.060003 (10−1,10−3, 0.9, 0.1, 25) 0.0231774 |
|
NNGC1_dataset_F1_V1_009 (269X5,626X5) |
0.080366 (103,0.1,2−1) 0.74747 |
0.055165 (10−1, 0.7, 23) 0.10069 |
0.053169 (101,10−3, 1, 1.345, 25) 0.0182 |
0.061428 (103,0.1, 0.2, 21) 0.08028 |
0.055746 (100, 0.9, 0.1, 23) 0.09388 |
0.052221 (100,10−1, 0.1, 0.1, 25) 0.0302937 |
|
NNGC1_dataset_F1_V1_010 (269X5,626X5) |
0.069884 (103,0.1,20) 0.74931 |
0.044151 (10−1, 0.9, 23) 0.09345 |
0.043029 (101,10−3, 0.1, 1.345, 25) 0.01833 |
0.052085 (101,0.1, 0.2, 23) 0.08409 |
0.045211 (100, 0.9, 0.1, 23) 0.10088 |
0.042634 (100,10−1, 0.9, 0.1, 25) 0.0342732 |
|
NNGC1_dataset_F1_V1_006 (521X5,1214X5) |
0.06958 (101,0.1,23) 2.97391 |
0.047711 (10−1, 0.9, 25) 0.50363 |
0.046947 (101,10−3, 1, 1.345, 25) 0.07101 |
0.061655 (101,0.1, 0.3, 25) 0.3305 |
0.048554 (100, 0.9, 0.1, 25) 0.38209 |
0.046072 (100,10−1, 0.9, 0.1, 25) 0.0727278 |
|
NN5_Complete_109 (237X5,550X5) |
0.166291 (100,0.1,21) 0.55777 |
0.171426 (10−1, 0.1, 21) 0.09335 |
0.16871 (101,10−3, 0.1, 1.345, 23) 0.01475 |
0.155751 (101,0.1, 0.3, 21) 0.0501 |
0.172463 (100, 0.1, 0.1, 21) 0.08337 |
0.159981 (100,100, 0.1, 0.1, 23) 0.0246379 |
|
NN5_Complete_104 (237X5,550X5) |
0.248315 (100,0.1,21) 0.57739 |
0.255563 (10−1, 0.9, 20) 0.07181 |
0.233829 (10−1,10−3, 0.1, 1.345, 24) 0.01461 |
0.271226 (101,0.1, 0.7, 21) 0.05814 |
0.254416 (100, 0.9, 0.1, 20) 0.09904 |
0.211625 (101,101, 0.9, 0.1, 25) 0.0218523 |
|
NN5_Complete_106 (237X5,550X5) |
0.214481 (101,0.1,21) 0.56271 |
0.225332 (10−5, 0.7, 21) 0.0672 |
0.224239 (101,10−3, 0.1, 1.345, 24) 0.01511 |
0.206398 (101,0.1, 0.5, 21) 0.05335 |
0.225333 (10−5, 0.1, 0.5, 21) 0.06475 |
0.20665 (100,100, 0.9, 0.1, 25) 0.0196063 |
|
NN5_Complete_103 (237X5,550X5) |
0.128811 (100,0.1,2−1) 0.57129 |
0.129722 (100, 0.9, 2−3) 0.07779 |
0.12998 (101,10−3, 1, 1.345, 2−1) 0.01483 |
0.127552 (101,0.3, 0.4, 2−1) 0.05419 |
0.129951 (101, 0.1, 0.1, 2−3) 0.07177 |
0.129655 (101,10−1, 0.1, 0.1, 2−1) 0.0198243 |
|
NN5_Complete_101 (237X5,550X5) |
0.150035 (100,0.1,21) 0.55307 |
0.157233 (100, 0.1, 20) 0.07508 |
0.155235 (101,10−3, 1, 1.345, 22) 0.01382 |
0.15831 (101,0.3, 0.4, 21) 0.06478 |
0.157715 (101, 0.1, 0.1, 20) 0.07666 |
0.151755 (100,100, 0.5, 0.1, 23) 0.0237155 |
|
NN5_Complete_105 (237X5,550X5) |
0.207145 (100,0.1,21) 0.5438 |
0.199703 (10−1, 0.9, 21) 0.06793 |
0.200107 (101,10−3, 1, 1.345, 25) 0.0142 |
0.208579 (101,0.1, 0.3, 21) 0.0545 |
0.196538 (10−1, 0.9, 0.1, 25) 0.07154 |
0.19833 (100,100, 0.9, 0.1, 25) 0.0243086 |
|
NN5_Complete_111 (237X5,550X5) |
0.217826 (100,0.1,23) 0.55873 |
0.214238 (10−1, 0.9, 20) 0.05185 |
0.204766 (10−1,10−3, 0.1, 1.345, 25) 0.01406 |
0.204239 (101,0.1, 0.4, 21) 0.06088 |
0.21311 (100, 0.9, 0.1, 20) 0.0681 |
0.206733 (100,100, 0.1, 0.1, 23) 0.0197837 |
|
D1dat_1_2000 (599X5,1396X5) |
0.056778 (103,0.1,2−5) 4.0872 |
0.042117 (10−1, 0.9, 20) 0.71631 |
0.043586 (104,10−3, 1, 1.345, 2−1) 0.08671 |
0.043239 (103,0.1, 0.2, 2−1) 0.47343 |
0.043216 (100, 0.9, 0.1, 20) 0.4761 |
0.041703 (100,10−5, 0.9, 0.1, 2−1) 0.0961339 |
|
Vineyard (16X4,36X4) |
0.256848 (100,0.1,2−1) 0.00456 |
0.245166 (100, 0.9, 2−5) 0.01548 |
0.239557 (102,10−1, 0.1, 1.345, 2−1) 0.00014 |
0.242875 (101,0.1, 0.1, 2−1) 0.00743 |
0.239164 (101, 0.1, 0.1, 2−5) 0.02454 |
0.233692 (101,101, 0.1, 0.1, 20) 0.0122392 |
|
COVID-19_spain (251X5,585X5) |
0.11715 (101,0.1,2−5) 0.63258 |
0.104614 (10−1, 0.1, 21) 0.07824 |
0.113934 (104,10−3, 0.1, 1.345, 21) 0.01442 |
0.091393 (103,0.3, 0.5, 2−1) 0.04905 |
0.104057 (100, 0.1, 0.1, 21) 0.04803 |
0.098165 (10−1,10−5, 0.9, 0.1, 20) 0.0281752 |
The best result is shown as boldface
Table 13.
Average ranks of SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR, and RHN-TSVR on RMSE values using Gaussian kernel with noise 10% for real world dataset
| Datasets | SVR | TSVR | ε-AHSVR | ε-SVQR | HN-TSVR | RHN-TSVR |
|---|---|---|---|---|---|---|
| Forestfires | 4 | 5 | 2 | 1 | 6 | 3 |
| Machine_8 | 5 | 1 | 4 | 6 | 2.5 | 2.5 |
| Auto-original | 6 | 2 | 4 | 5 | 3 | 1 |
| Winequality | 4 | 1 | 5 | 6 | 2 | 3 |
| SantafeA | 6 | 3 | 4 | 1 | 5 | 2 |
| Gas_furnace | 6 | 4 | 2 | 5 | 3 | 1 |
| Quake | 3 | 1 | 5 | 4 | 2 | 6 |
| Flex_robotarm | 6 | 5 | 1 | 3 | 4 | 2 |
| S&P500 | 5 | 3 | 2 | 6 | 4 | 1 |
| Space-Ga | 6 | 4 | 3 | 1 | 5 | 2 |
| Gauss1 | 5 | 3 | 1 | 6 | 4 | 2 |
| Chwirut2 | 3 | 5 | 2 | 4 | 6 | 1 |
| Roszman1 | 6 | 3 | 4 | 5 | 2 | 1 |
| INFY | 6 | 3 | 5 | 1 | 2 | 4 |
| ONGC_NS | 6 | 3 | 1 | 5 | 4 | 2 |
| XOM | 6 | 4 | 2 | 5 | 3 | 1 |
| ATX | 2 | 1 | 5 | 6 | 4 | 3 |
| BSESN | 5 | 6 | 3 | 2 | 4 | 1 |
| DJI | 6 | 2 | 5 | 3 | 1 | 4 |
| GDAXI | 6 | 3 | 2 | 5 | 4 | 1 |
| MXX | 3 | 2 | 5 | 1 | 4 | 6 |
| N225 | 6 | 4 | 3 | 5 | 1.5 | 1.5 |
| Wankara | 6 | 1 | 4 | 5 | 2 | 3 |
| Laser | 6 | 3 | 2 | 5 | 4 | 1 |
| Dee | 6 | 3 | 1 | 5 | 4 | 2 |
| Friedman | 6 | 2 | 5 | 1 | 3.5 | 3.5 |
| Mortgage | 6 | 1 | 4 | 5 | 2 | 3 |
| NNGC1_dataset_E1_V1_001 | 6 | 4 | 2 | 3 | 5 | 1 |
| NNGC1_dataset_F1_V1_008 | 6 | 4 | 3 | 5 | 1.5 | 1.5 |
| NNGC1_dataset_F1_V1_009 | 6 | 3 | 2 | 5 | 4 | 1 |
| NNGC1_dataset_F1_V1_010 | 6 | 3 | 2 | 5 | 4 | 1 |
| NNGC1_dataset_F1_V1_006 | 6 | 3 | 2 | 5 | 4 | 1 |
| NN5_Complete_109 | 3 | 5 | 4 | 1 | 6 | 2 |
| NN5_Complete_104 | 3 | 5 | 2 | 6 | 4 | 1 |
| NN5_Complete_106 | 3 | 5 | 4 | 1 | 6 | 2 |
| NN5_Complete_103 | 2 | 4 | 6 | 1 | 5 | 3 |
| NN5_Complete_101 | 1 | 4 | 3 | 6 | 5 | 2 |
| NN5_Complete_105 | 5 | 3 | 4 | 6 | 1 | 2 |
| NN5_Complete_111 | 6 | 5 | 2 | 1 | 4 | 3 |
| D1dat_1_2000 | 6 | 2 | 5 | 4 | 3 | 1 |
| Vineyard | 6 | 5 | 3 | 4 | 2 | 1 |
| COVID-19_spain | 6 | 4 | 5 | 1 | 3 | 2 |
| Average rank | 5.047619 | 3.2619048 | 3.2142857 | 3.8333333 | 3.547619 | 2.0952381 |
Fig. 10.
Prediction over the testing dataset by SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR and RHN-TSVR on the Machine CPU dataset with 10% noise. Gaussian kernel was used
Fig. 11.
Prediction over the testing dataset by SVR, TSVR, ε-AHSVR, ε-SVQR, HN-TSVR and RHN-TSVR on the Gas furnace dataset with 10% noise. Gaussian kernel was used
As in the previous cases, the Friedman test statistics are computed using the average ranks in Table 13.
Here, the computed statistic is again greater than the critical value at the corresponding degrees of freedom, so the null hypothesis is also rejected in this case, which means the models have significant differences. We therefore perform the Nemenyi test on these algorithms, and a few points may be concluded from it. Similar to the previous two cases, RHN-TSVR still has the lowest average rank at significant noise level 10%, and its average-rank differences with the other models are always greater than the critical difference. Hence, one can conclude that RHN-TSVR overall outperforms the other models.
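The Friedman statistic above can be reproduced directly from the average ranks in Table 13. The sketch below follows the standard formulation in Demšar [17] (Friedman χ² plus the Iman–Davenport F correction); N = 42 datasets and k = 6 models come from the table itself, while the printed values are what this arithmetic yields from the reported ranks, not figures quoted from the paper.

```python
# Friedman test computed from average ranks (Demsar, 2006), Table 13 values.
ranks = {
    "SVR": 5.047619, "TSVR": 3.2619048, "e-AHSVR": 3.2142857,
    "e-SVQR": 3.8333333, "HN-TSVR": 3.547619, "RHN-TSVR": 2.0952381,
}
N, k = 42, len(ranks)  # 42 datasets, 6 compared models

sum_sq = sum(r * r for r in ranks.values())
# chi2_F = 12N / (k(k+1)) * [sum_j R_j^2 - k(k+1)^2 / 4]
chi2_f = 12.0 * N / (k * (k + 1)) * (sum_sq - k * (k + 1) ** 2 / 4.0)
# Iman-Davenport correction, distributed as F with (k-1, (k-1)(N-1)) dof
f_f = (N - 1) * chi2_f / (N * (k - 1) - chi2_f)

print(round(chi2_f, 3), round(f_f, 3))  # 55.442 14.707
```

A sanity check on the input: the average ranks must sum to k(k + 1)/2 = 21, which the Table 13 values do.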
Effect of Increasing Noise Percentage
In this section, we show the effect of increasing the significant noise level (0%, 5% and 10%) on the real-world datasets. The performance of the proposed RHN-TSVR at these noise levels has already been discussed in the previous sections. The prediction results for the Gas furnace dataset at noise levels 0%, 5% and 10% are plotted in Fig. 12.
Fig. 12.
Prediction/observed value over the testing dataset by RHN-TSVR on the Gas furnace dataset with 0%, 5% and 10% noise. Gaussian kernel was used
The test data samples are plotted as a black line, and the prediction results of RHN-TSVR at noise levels 0%, 5% and 10% are shown as brown, blue and pink dotted lines, respectively. From Fig. 12, one can analyze the impact of an increasing noise percentage on the proposed RHN-TSVR model. Even under noise, the predictions remain close to the desired output, which justifies the applicability and usability of the proposed RHN-TSVR model in noisy environments.
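The paper does not spell out the exact noise-injection mechanism, so the sketch below shows one common assumption: zero-mean Gaussian noise whose standard deviation is the chosen percentage (0%, 5%, 10%) of the target's standard deviation, with the regressor then scored by RMSE as in the tables. The sine target and the `add_noise` helper are illustrative only.

```python
import numpy as np

def add_noise(y: np.ndarray, level: float, rng: np.random.Generator) -> np.ndarray:
    """Add zero-mean Gaussian noise with std = level * std(y); level=0.05 means 5%."""
    return y + rng.normal(0.0, level * y.std(), size=y.shape)

def rmse(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Root mean squared error, the evaluation metric used in the tables."""
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

rng = np.random.default_rng(0)
y = np.sin(np.linspace(0, 4 * np.pi, 200))  # stand-in target series
for level in (0.0, 0.05, 0.10):             # 0%, 5% and 10% noise
    y_noisy = add_noise(y, level, rng)
    print(f"{level:.0%} noise: RMSE against clean target = {rmse(y, y_noisy):.4f}")
```

At 0% the corrupted series equals the original (RMSE 0), and the RMSE grows with the noise level, mirroring the degradation pattern discussed above.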
Conclusion and Future Scope
In this paper, a regularized version of TSVR with Huber loss, termed regularization based twin support vector regression with Huber loss (RHN-TSVR), is proposed to avoid the singularity problem of HN-TSVR by implementing the structural risk minimization principle; further, the noise insensitivity of the proposed model is justified under different significant noise levels of 0%, 5% and 10%. TSVR employs the ε-insensitive loss function, which is not well suited to deal with different types of noise and outliers. The classical Huber loss function behaves quadratically for small errors and linearly for the others; as a combination of the Laplacian and Gaussian loss functions, it gives better generalization performance for data containing Gaussian noise and outliers. The performance of the proposed approach is tested and validated on benchmark real-world datasets at different significant noise levels and on artificial datasets with different types of noise using the non-linear kernel. The proposed RHN-TSVR shows better prediction accuracy in most cases and takes less or comparable computational time in comparison to the existing approaches SVR, TSVR, ε-AHSVR, ε-SVQR and HN-TSVR. A comparative study based on the numerical experiments justifies the importance of the RHN-TSVR model over the reported approaches, especially for data containing noise and outliers. Financial time series forecasting is one possible application of the model. In future work, the computational cost may be reduced further by developing an iterative training approach.
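The quadratic-for-small, linear-for-large behaviour of the Huber loss [33] summarized above can be sketched as follows. The threshold value 1.345 mirrors the Huber constant that appears in the parameter tuples of the experiment tables; the sample residuals are illustrative only.

```python
import numpy as np

def huber_loss(r: np.ndarray, delta: float = 1.345) -> np.ndarray:
    """Huber loss: 0.5*r^2 for |r| <= delta, delta*(|r| - 0.5*delta) otherwise."""
    abs_r = np.abs(r)
    return np.where(abs_r <= delta,
                    0.5 * r ** 2,                 # quadratic (Gaussian-like) zone
                    delta * (abs_r - 0.5 * delta))  # linear (Laplacian-like) zone

residuals = np.array([0.5, 1.0, 3.0, 10.0])  # small errors vs outlier-sized errors
print(huber_loss(residuals))   # grows only linearly for large residuals
print(0.5 * residuals ** 2)    # squared (Gaussian) loss for comparison
```

For the outlier-sized residual 10.0 the Huber loss is about 12.55 while the squared loss is 50, which is exactly why the Huber-based models are less sensitive to outliers than a purely Gaussian loss.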
Compliance with Ethical Standards
Conflict of interest
Authors have no conflict of interest.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Umesh Gupta, Email: er.umeshgupta@gmail.com.
Deepak Gupta, Email: deepakjnu85@gmail.com, Email: deepak@nitap.ac.in.
References
- 1.Anand P, Rastogi R, Chandra S. A new asymmetric ϵ-insensitive pinball loss function based support vector quantile regression model. Appl Soft Comput. 2019;94:1–14. [Google Scholar]
- 2.Anand P, Rastogi R, Chandra S (2019) Support vector regression via a combined reward cum penalty loss function. arXiv: 1904.12331v2 [cs.LG] version: 2, pp 1–13
- 3.Bai L, Shao Y-H, Wang Z, Li C-N. Clustering by twin support vector machine and least square twin support vector classifier with uniform output coding. Knowl-Based Syst. 2019;163:227–240. doi: 10.1016/j.knosys.2018.08.034. [DOI] [Google Scholar]
- 4.Balasundaram S, Prasad SC. Robust twin support vector regression based on Huber loss function. Neural Comput Appl. 2019;32:1–25. [Google Scholar]
- 5.Balasundaram S, Meena Y. Robust support vector regression in primal with asymmetric Huber loss. Neural Process Lett. 2019;49(3):1399–1431. doi: 10.1007/s11063-018-9875-8. [DOI] [Google Scholar]
- 6.Chen S-G, Xiao-Jun W. A new fuzzy twin support vector machine for pattern classification. Int J Mach Learn Cybernet. 2018;9(9):1553–1564. doi: 10.1007/s13042-017-0664-x. [DOI] [Google Scholar]
- 7.Chen S, Liu X, Li B (2018) A cost-sensitive loss function for machine learning. In: International conference on database systems for advanced applications vol 10829. LNCS. Springer, Cham, pp 255–268
- 8.Chen C, Yan C, Zhao N, Guo B, Liu G. A robust algorithm of support vector regression with a trimmed Huber loss function in the primal. Soft Comput. 2017;21(18):5235–5243. doi: 10.1007/s00500-016-2229-4. [DOI] [Google Scholar]
- 9.Chen Z, Matousek R, Wanke P. Chinese bank efficiency during the global financial crisis: a combined approach using satisficing DEA and support vector machines. N Am J Econ Finance. 2018;43:71–86. doi: 10.1016/j.najef.2017.10.003. [DOI] [Google Scholar]
- 10.Chen C, Li Y, Yan C, Dai H, Liu G. A robust algorithm of multiquadric method based on an improved Huber loss function for interpolating remote-sensing-derived elevation data sets. Remote Sens. 2015;7(3):3347–3371. doi: 10.3390/rs70303347. [DOI] [Google Scholar]
- 11.Chu W, Sathiya Keerthi S, Ong CJ. Bayesian support vector regression using a unified loss function. IEEE Trans Neural Netw. 2004;15(1):29–44. doi: 10.1109/TNN.2003.820830. [DOI] [PubMed] [Google Scholar]
- 12.Chuang C-C. Fuzzy weighted support vector regression with a fuzzy partition. IEEE Trans Syst Man Cybern Part B (Cybern) 2007;37(3):630–640. doi: 10.1109/TSMCB.2006.889611. [DOI] [PubMed] [Google Scholar]
- 13.Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–297. doi: 10.1007/BF00994018. [DOI] [Google Scholar]
- 14.COVID19S (2020) [online]. https://dataverse.harvard.edu/dataset.xhtml/
- 15.Cristianini N, Shawe-Taylor J. An introduction to support vector machines and other kernel-based learning methods. Cambridge: Cambridge University Press; 2000. [Google Scholar]
- 16.Cui W, Yan X. Adaptive weighted least square support vector machine regression integrated with outlier detection and its application in QSAR. Chemometr Intell Lab Syst. 2009;98(2):130–135. doi: 10.1016/j.chemolab.2009.05.008. [DOI] [Google Scholar]
- 17.Demšar J. Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res. 2006;7(Jan):1–30. [Google Scholar]
- 18.Deylami H-M, PrasadSingh Y. Cybercrime detection techniques based on support vector machines. Artif Intell Res. 2012;2(1):1. doi: 10.5430/air.v2n1p1. [DOI] [Google Scholar]
- 19.Drucker H, Burges CJC, Kaufman L, Smola AJ, Vapnik V (1997) Support vector regression machines. In: Advances in neural information processing systems, vol 9. pp 155–161
- 20.Financial Dataset (2020) [online]. http://finance.yahoo.com
- 21.Flexible Robot Arm (2020) [online]. http://homes.esat.kuleuven.be/~smc/daisydata.html
- 22.Forghani Y, Sigari Tabrizi R, Sadoghi Yazdi H, Akbarzadeh-T M-R (2011) Fuzzy support vector regression. In: 2011 1st international eConference on computer and knowledge engineering (ICCKE). IEEE, pp 28–33
- 23.Fung GM, Mangasarian OL. Multicategory proximal support vector machine classifiers. Mach Learn. 2005;59(1-2):77–97. doi: 10.1007/s10994-005-0463-6. [DOI] [Google Scholar]
- 24.Gu B, Fang J, Pan F, Bai Z. Fast clustering-based weighted twin support vector regression. Soft Comput. 2020;24:1–17. doi: 10.1007/s00500-020-04746-6. [DOI] [Google Scholar]
- 25.Gupta U, Gupta D. An improved regularization based Lagrangian asymmetric ν-twin support vector regression using pinball loss function. Appl Intell. 2019;49(10):3606–3627. doi: 10.1007/s10489-019-01465-w. [DOI] [Google Scholar]
- 26.Gupta U, Gupta, D (2018) Lagrangian twin-bounded support vector machine based on L2-norm. In: Recent developments in machine learning and data analytics, vol 740. AISC. Springer, Singapore, pp 431–444
- 27.Gupta D, Pratama M, Ma Z, Li J, Prasad M. Financial time series forecasting using twin support vector regression. PLoS ONE. 2019;14(3):0211402. doi: 10.1371/journal.pone.0211402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Gupta U, Gupta D, Prasad M (2018) Kernel target alignment based fuzzy least square twin bounded support vector machine. In: 2018 IEEE symposium series on computational intelligence (SSCI). IEEE
- 29.Hazarika BB, Gupta D, Berlin M. Modeling suspended sediment load in a river using extreme learning machine and twin support vector regression with wavelet conjunction. Environ Earth Sci. 2020;79:234. doi: 10.1007/s12665-020-08949-w. [DOI] [Google Scholar]
- 30.Hong DH, Hwang C. Interval regression analysis using quadratic loss support vector machine. IEEE Trans Fuzzy Syst. 2005;13(2):229–237. doi: 10.1109/TFUZZ.2004.840133. [DOI] [Google Scholar]
- 31.Huang M-L. Intersection traffic flow forecasting based on ν-GSVR with a new hybrid evolutionary algorithm. Neurocomputing. 2015;147:343–349. doi: 10.1016/j.neucom.2014.06.054. [DOI] [Google Scholar]
- 32.Huang X, Shi L, Suykens JAK. Asymmetric least squares support vector machine classifiers. Comput Stat Data Anal. 2014;70:395–405. doi: 10.1016/j.csda.2013.09.015. [DOI] [Google Scholar]
- 33.Huber PJ. Robust estimation of a location parameter. Ann Math Stat. 1964;35(1):73–101. doi: 10.1214/aoms/1177703732. [DOI] [Google Scholar]
- 34.Huber PJ. Robust statistical procedures. Philadelphia: SIAM; 1996. [Google Scholar]
- 35.Hwang C, Hong DH, Seok KH. Support vector interval regression machine for crisp input and output data. Fuzzy Sets Syst. 2006;157(8):1114–1125. doi: 10.1016/j.fss.2005.09.008. [DOI] [Google Scholar]
- 36.Jayadeva, Khemchandani R, Chandra S. Twin support vector machines for pattern classification. IEEE Trans Pattern Anal Mach Intell. 2007;29(5):905–910. doi: 10.1109/TPAMI.2007.1068. [DOI] [PubMed] [Google Scholar]
- 37.Kaneko H, Funatsu K. Adaptive soft sensor based on online support vector regression and Bayesian ensemble learning for various states in chemical plants. Chemometr Intell Lab Syst. 2014;137:57–66. doi: 10.1016/j.chemolab.2014.06.008. [DOI] [Google Scholar]
- 38.KEEL (2020) [online]. https://sci2s.ugr.es/keel/html/
- 39.Kumar MA, Gopal M. Least squares twin support vector machines for pattern classification. Expert Syst Appl. 2009;36(4):7535–7543. doi: 10.1016/j.eswa.2008.09.066. [DOI] [Google Scholar]
- 40.Liu LL, Zhao Y, Kong L, Liu M, Dong L, Ma F, Pang Z. Robust real-time heart rate prediction for multiple subjects from facial video using compressive tracking and support vector machine. J Med Imaging. 2018;5(2):024503. doi: 10.1117/1.JMI.5.2.024503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Liu X, Zhu T, Zhai L, Liu J. Mass classification of benign and malignant with a new twin support vector machine joint l2,1 - norm. Int J Mach Learn Cybern. 2017;10:1–17. [Google Scholar]
- 42.Mangasarian OL, Musicant DR. Robust linear and support vector regression. IEEE Trans Pattern Anal Mach Intell. 2000;22(9):950–955. doi: 10.1109/34.877518. [DOI] [Google Scholar]
- 43.Mao X, Wang Y, Liu X, Guo Y. An adaptive weighted least square support vector regression for hysteresis in piezoelectric actuators. Sens Actuators A. 2017;263:423–429. doi: 10.1016/j.sna.2017.06.030. [DOI] [Google Scholar]
- 44.Maulik U, Chakraborty D. Remote sensing image classification: a survey of support-vector-machine-based advanced techniques. IEEE Geosci Remote Sens Mag. 2017;5(1):33–52. doi: 10.1109/MGRS.2016.2641240. [DOI] [Google Scholar]
- 45.Mehrkanoon S, Huang X, Suykens JAK. Non-parallel support vector classifiers with different loss functions. Neurocomputing. 2014;143:294–301. doi: 10.1016/j.neucom.2014.05.063. [DOI] [Google Scholar]
- 46.Melacci S, Belkin M. Laplacian support vector machines trained in the primal. J Mach Learn Res. 2011;12(Mar):1149–1184. [Google Scholar]
- 47.Niu J, Chen J, Yitian X. Twin support vector regression with Huber loss. J Intell Fuzzy Syst. 2017;32(6):4247–4258. doi: 10.3233/JIFS-16629. [DOI] [Google Scholar]
- 48.NLREG repositories (2020) [online]. http://www.nlreg.com/
- 49.Ouyang X, Zhao N, Gao C, Wang L. An efficient twin projection support vector machine for regression. Eng Lett. 2019;27(1):103–107. [Google Scholar]
- 50.Peng X. TSVR: an efficient twin support vector machine for regression. Neural Netw. 2010;23(3):365–372. doi: 10.1016/j.neunet.2009.07.002. [DOI] [PubMed] [Google Scholar]
- 51.Peng X, Chen D. An l1-norm loss based twin support vector regression and its geometric extension. Int J Mach Learn Cybernet. 2019;10(9):2573–2588. doi: 10.1007/s13042-018-0892-8. [DOI] [Google Scholar]
- 52.Puthiyottil A, Balasundaram S, Meena Y (2020) L1-norm support vector regression in primal based on Huber loss function. In: Proceedings of ICETIT 2019, vol 605. LNEE. Springer, Cham, pp 195–205
- 53.SantaFeA dataset (2020) [online]. http://lib.stat.cmu.edu/datasets
- 54.Shen X, Niu L, Qi Z, Tian Y. Support vector machine classifier with truncated pinball loss. Pattern Recognit. 2017;68:199–210. doi: 10.1016/j.patcog.2017.03.011. [DOI] [Google Scholar]
- 55.Shao YH, Zhang C, Wang X, Deng N. Improvements on twin support vector machines. IEEE Trans Neural Netw. 2011;22(6):962–968. doi: 10.1109/TNN.2011.2130540. [DOI] [PubMed] [Google Scholar]
- 56.Singla M, Ghosh D, Shukla KK, Pedrycz W. Robust twin support vector regression based on rescaled hinge loss. Pattern Recognit. 2020;105:107395. doi: 10.1016/j.patcog.2020.107395. [DOI] [Google Scholar]
- 57.Smola AJ, Schölkopf B. A tutorial on support vector regression. Stat Comput. 2004;14(3):199–222. doi: 10.1023/B:STCO.0000035301.49549.88. [DOI] [Google Scholar]
- 58.SpaceGa dataset (2020).[online]. http://lib.stat.cmu.edu/datasets
- 59.Tang L, Tian Y, Yang C, Pardalos PM. Ramp-loss nonparallel support vector regression: robust, sparse and scalable approximation. Knowl-Based Syst. 2018;147:55–67. doi: 10.1016/j.knosys.2018.02.016. [DOI] [Google Scholar]
- 60.Tang L, Tian Y, Pardalos PM. A novel perspective on multiclass classification: regular simplex support vector machine. Inf Sci. 2019;480:324–338. doi: 10.1016/j.ins.2018.12.026. [DOI] [Google Scholar]
- 61.Tang L, Tian Y, Li W, Pardalos PM. Structural improved regular simplex support vector machine for multiclass classification. Appl Soft Comput. 2020;91:106235. doi: 10.1016/j.asoc.2020.106235. [DOI] [Google Scholar]
- 62.Tanveer M, Shubham K, Aldhaifallah M, Ho SS. An efficient regularized K-nearest neighbor based weighted twin support vector regression. Knowl-Based Syst. 2016;94:70–87. doi: 10.1016/j.knosys.2015.11.011. [DOI] [Google Scholar]
- 63.UCI data repository (2020) [online]. https://archive.ics.uci.edu/ml/
- 64.Vapnik V. The nature of statistical learning theory. Berlin: Springer; 2000. [Google Scholar]
- 65.Vineyard dataset (2020) [online]. https://data.gov.au/dataset/
- 66.Wang L, Gao C, Zhao N, Chen X. Wavelet transform-based weighted ν-twin support vector regression. Int J Mach Learn Cybernet. 2020;11(1):95–110. doi: 10.1007/s13042-019-00957-y. [DOI] [Google Scholar]
- 67.Wang L, Gao C, Zhao N, Chen X. A projection wavelet weighted twin support vector regression and its primal solution. Appl Intell. 2019;49(8):3061–3081. doi: 10.1007/s10489-019-01422-7. [DOI] [Google Scholar]
- 68.Wang K, Pei H, Ding X, Zhong P. Robust proximal support vector regression based on maximum correntropy criterion. Sci Progr. 2019;2019:1–11. [Google Scholar]
- 69.Wang C, Li Z, Dey N, Li Z, Ashour AS, Fong SJ, Simon Sherratt R, Wu L, Shi F. Histogram of oriented gradient based plantar pressure image feature extraction and classification employing fuzzy support vector machine. J Med Imaging Health Inf. 2018;8(4):842–854. doi: 10.1166/jmihi.2018.2310. [DOI] [Google Scholar]
- 70.Wang K, Zhong P. Robust support vector regression with flexible loss function. Int J Signal Process Image Process Pattern Recognit. 2014;7(4):211–220. [Google Scholar]
- 71.Wu Q. A hybrid-forecasting model based on Gaussian support vector machine and chaotic particle swarm optimization. Expert Syst Appl. 2010;37(3):2388–2394. doi: 10.1016/j.eswa.2009.07.057. [DOI] [Google Scholar]
- 72.Wu Q, Yan H. Product sales forecasting model based on robust ν-support vector machine. Comput Integrated Manuf Syst. 2009;15(6):1081–1087. [Google Scholar]
- 73.Wu Q, Law R, Xin X. A sparse Gaussian process regression model for tourism demand forecasting in Hong Kong. Expert Syst Appl. 2012;39(5):4769–4774. doi: 10.1016/j.eswa.2011.09.159. [DOI] [Google Scholar]
- 74.Xu Q, Zhang J, Jiang C, Huang X, He Y. Weighted quantile regression via support vector machine. Expert Syst Appl. 2015;42(13):5441–5451. doi: 10.1016/j.eswa.2015.03.003. [DOI] [Google Scholar]
- 75.Xu Y, Wang L. K-nearest neighbor-based weighted twin support vector regression. Appl Intell. 2014;41(1):299–309. doi: 10.1007/s10489-014-0518-0. [DOI] [Google Scholar]
- 76.Xu Y, Li X, Pan X, Yang Z. Asymmetric ν-twin support vector regression. Neural Comput Appl. 2017;30:1–16. [Google Scholar]
- 77.Yang L, Ding G, Yuan C, Zhang M. Robust regression framework with asymmetrically analogous to correntropy-induced loss. Knowl-Based Syst. 2020;191:105211. doi: 10.1016/j.knosys.2019.105211. [DOI] [Google Scholar]
- 78.Yang L, Dong H. Support vector machine with truncated pinball loss and its application in pattern recognition. Chemometr Intell Lab Syst. 2018;177:89–99. doi: 10.1016/j.chemolab.2018.04.003. [DOI] [Google Scholar]
- 79.Yang Z, Xu Y. A safe sample screening rule for Laplacian twin parametric-margin support vector machine. Pattern Recognit. 2018;84:1–12. doi: 10.1016/j.patcog.2018.06.018. [DOI] [Google Scholar]
- 80.Yang L, Ren Z, Wang Y, Dong H. A robust regression framework with laplace kernel-induced loss. Neural Comput. 2017;29(11):3014–3039. doi: 10.1162/neco_a_01002. [DOI] [PubMed] [Google Scholar]
- 81.Ye Y, Gao J, Shao Y, Li C, Jin Y, Hua X. Robust support vector regression with generic quadratic nonconvex ε-insensitive loss. Appl Math Model. 2020;82:235–251. doi: 10.1016/j.apm.2020.01.053. [DOI] [Google Scholar]
- 82.Zhang J, Zheng C-H, Xia Y, Wang B, Chen P. Optimization enhanced genetic algorithm-support vector regression for the prediction of compound retention indices in gas chromatography. Neurocomputing. 2017;240:183–190. doi: 10.1016/j.neucom.2016.11.070. [DOI] [Google Scholar]
- 83.Zhao Y, Sun J. Robust support vector regression in the primal. Neural Netw. 2008;21(10):1548–1555. doi: 10.1016/j.neunet.2008.09.001. [DOI] [PubMed] [Google Scholar]
- 84.Zhu J, Hoi SCH, Rung-Tsong Lyu M. Robust regularized kernel regression. IEEE Trans Syst Man Cybern Part B (Cybern) 2008;38(6):1639–1644. doi: 10.1109/TSMCB.2008.927279. [DOI] [PubMed] [Google Scholar]