Skip to main content
Heliyon logoLink to Heliyon
. 2022 Nov 3;8(11):e11388. doi: 10.1016/j.heliyon.2022.e11388

Analysis of parametric and non-parametric option pricing models

Qiang Luo 1, Zhaoli Jia 1,, Hongbo Li 1, Yongxin Wu 1
PMCID: PMC9641221  PMID: 36387555

Abstract

In this paper, a closed-form analytical solution of option price under the Bi-Heston model is derived. Through empirical analysis, the advantages and disadvantages of the parametric pricing model are compared and analysed with those of the non-parametric model. The analysis shows that: (1) the parametric pricing model significantly outperforms the machine learning model in terms of in-sample pricing effects, while the Bi-Heston model slightly outperforms the Heston model. (2) In terms of out-of-sample pricing, the machine learning model is inferior to the parametric model for call options, while the Bi-Heston model is significantly better than the other two models for put options, and the other two models are similar. (3) In the robustness analysis of the three pricing models, the machine learning model shows strong instability, while the Bi-Heston model shows a more stable side.

Keywords: Heston model, Bi-Heston model, Option, XGBoost


Heston model; Bi-Heston model; Option; XGBoost

1. Introduction

In recent years, China's financial markets have developed at a rapid pace, and a large number of financial products have emerged as derivatives. Financial derivatives, which are financial contracts with functions such as risk management, price discovery and hedging, have become an integral part of the financial market as they can efficiently enhance market allocation efficiency. Options, as a financial derivative product and one of the most dominant products traded in the financial derivatives market, are valued by investors for their flexibility in risk management. While options can bring great convenience to investors, they also have some potential drawbacks if they are not priced correctly. Therefore, the correct pricing of options is critical to the risk control of investors. In the literature, there are two types of pricing models, parametric option pricing models and non-parametric option pricing models.

In all the parametric models, the most influential one was proposed by Black and Scholes [1] in 1973, from which an option pricing formula in the form of a closed solution can be derived. This formula was considered to be epoch-making and was well known among researchers and practitioners due to an easiness of use. Soon after, researchers discovered that it did not correspond well to some of the realities of the market due to its strict assumptions. And then, Heston [2] improved the Black-Scholes model by making a new assumption that volatility is not constant, but follows a CIR [3] process. This solves the volatility estimation problem. The Heston model is arguably one of the most widely used volatility models, but later scholars have found that the Heston model does not fit the implied volatility smile phenomenon very well. To address this problem, Christoffersen et al. [4] improved the Heston model by proposing the Bi-Heston stochastic volatility model which specifies a two-factor structure for volatility. Its additional parameters allow it to estimate option prices more accurately as well as fit implied volatility. Very recently, more scholars joined in the study of the model. Rouah [5] showed the implied volatility of the Bi-Heston model and found that it was closer to the actual market implied volatility than the Heston model. Fallah [6] first proved the existence and uniqueness of solutions of stochastic differential equations under the Bi-Heston model. Mehrdoust [7] proposed an extension of the Bi-Heston model to include jumps in the financial modelling of stock prices and found that it was an efficient tool for option pricing with the Fast Fourier Transform method. Their findings all suggest that the Bi-Heston model is highly relevant. Therefore, in this paper, we aim to explore the pricing performance of the Bi-Heston model, and the Heston model is used to compare with.

However, there is one more issue that needs to be addressed when using these parametric models in practice. As the name implies, to use the parametric model to price options, the model parameters need to be estimated first. Models usually have many parameters to be estimated, and the objective function is complex. To solve this problem, there are also many researchers who have given their solutions. Cui [8] changed the structure of the characteristic functions in the Heston model to give an analytic gradient for solving the problem of parameter estimation and proposed a new fast and efficient multi-parameter correction algorithm for the Heston model based on the analytic gradient. In recent studies, Cacace [9] developed a new polynomial filtering method to estimate parameters in the Heston model that relied on a linear filter using a polynomial state-space formulation of a discrete version of the continuous-time model. In this paper, we reduce this problem to an optimization problem and give a solution based on numerical calculations.

Even if the problem of parameter estimation is solved, traditional parametric models still have some drawbacks that they are strictly limited in some statistical or economic assumptions and do not facilitate the judgement of the merit of the model fit. Machine learning, as an alternative to non-parametric models, is also intensively developed recently. Ma [10] proposed a price forecasting method based on cost-sensitive deep forest, which maintains the high accuracy of deep forest and can significantly reduce the forecast cost error while maintaining high forecast accuracy. Hitam [11] proposed a support vector machine based on particle swarm algorithm optimization to predict the futures price of cryptocurrencies. It is found that the hybrid model can effectively predict the price of cryptocurrencies, and the prediction performance is better than that of a single support vector machine. Yang et al. [12] combined hybrid genetic algorithms with support vector machines and found that support vector machines constructed based on genetic algorithms and radial basis functions are more suitable for real values than other methods. Kumar [13] proposed four hybrid prediction models that are combinations of four different feature selection techniques with proximal support vector machine classifier. Due to the variety of machine learning algorithms, researchers have proposed a large number of related pricing models, and among these, the XGBoost model stands out. Le et al. [14] used an improved XGBoost algorithm to build a non-parametric machine learning model for analyzing corporate financial statements and demonstrated that the optimized XGBoost algorithm has a great advantage in processing time and has the best prediction results. More recently, Ivacu [15] analyzed the prediction performance of the XGBoost algorithm for option pricing and affirmed its feasibility. Chang [16] constructed a credit risk assessment model using an extreme gradient boosting machine classifier and empirically tested that the XGBoost classifier had better prediction results than logistic regression, clustered data processing techniques, and three support vector machine classifiers.

In this paper, we use traditional parametric models and non-parametric machine learning models to price 50ETF options respectively and compare the prediction accuracy of all models. Of these models, the Heston and Bi-Heston model represent the traditional parametric models, and the XGBoost model represents the non-parametric model. Our article is organized as follows: In Section 2, we give a brief description of the Heston model, and a detailed derivation of the Bi-Heston model is provided. In Section 3, the research methodology and data are presented. Firstly, the selected data sources are presented, followed by the machine learning model using the XGBoost algorithm. Besides, the methods for parameter calibration and the metrics for model evaluation are given. In Section 4, the detailed process of the empirical analysis is given. The prediction results of the traditional parametric model and the machine learning model are compared to provide a more comprehensive comparison of the predictive validity of them. Furthermore, the robustness tests are conducted for both. Some summaries are provided in Section 5.

2. Derivation of the bi-Heston model

It is well-known that the Heston model overcomes the Black-Scholes model's shortcoming of assuming constant volatility well and is one of the most popular models in the option valuation literature. It is given by

{dSt=rStdt+vtStdZ1,tdvt=κ(θvt)dt+σvtdZ2,tCov(Z1,t,Z2,t)=ρ,

where St denotes the underlying asset price, which is modelled by the square root process with a stochastic instantaneous variance vt, {Zi,t} (i=1,2) denote the standard Brownian motion, r is the risk-free rate, κ is the mean reversion speed, θ is the mean reversion of variance, σ is the volatility of variance, and ρ is the correlation coefficient. However, some scholars have pointed out that the Heston model sometimes fails to fit the implied volatility smile very well. To overcome this shortcoming, the Bi-Heston model is generalized from the Heston model, which is a straightforward way to incorporate a stochastic correlation by using multiple stochastic volatility factors. Assume that risky asset prices satisfy the Bi-Heston model as follows

{dSt=rStdt+v1tdZ1,t+v2tdZ2,tdv1t=κ1(θ1v1t)dt+σ1v1tdZ3,tdv2t=κ2(θ2v2t)dt+σ2v2tdZ4,t,

where St is modelled by two square root processes with two stochastic instantaneous variances v1t and v2t, {Zi,t} (i=1,2,3,4) denote the standard Brownian motion, v1t,v2t are extended from vt. In order to ensure that the square root processes are always positive, we require that 2κ1θ1(σ1)2,2κ2θ2(σ2)2 [2]. We assume that the covariances between these Brownian motions are equal to zero except Cov(Z1,t,Z3,t)=ρ1t, Cov(Z2,t,Z4,t)=ρ2t.

Consider undefined assets denoted by V(St,v1t,v2t,t). The maturity payout function for European options is (STK)+. By the Faman-Kac theorem, we have

{Vt+rSVS+κ1(θ1v1t)Vv1t+κ2(θ2v2t)Vv2t+12(v1t+v2t)S22VS2+Sσ1v1tρ12VSv1t+Sσ2v2tρ22VSv2t+12σ12v1t2Vv1t2+12σ22v2t2Vv2t2rV=0V(S,v1t,v2t,T)=(SK)+.

Decompose V(St,v1t,v2t,t) into two parts

V(S,v1t,v2t,t)=SP1(S,v1t,v2t,t)Ker(Tt)P2(S,v1t,v2t,t), (1)

where Pj(S,v1t,v2t,t) (j=1,2) satisfies the following equation at t=T

P1(S,v1t,v2t,T)=P2(S,v1t,v2t,T)=H(SK),

where H(x) is Heaviside function, H(x)=I{x>0}.

Let x=lnS, then Pj(x,v1t,v2t,t), (j=1,2) satisfies the following differential equation

{Pjt+[r+αj(v1t+v2t)Vx]+(κ1θ1βjv1t)Vv1t+(κ2θ2rjv2t)Vv2t+12(v1t+v2t)2Vx2+σ1v1tρ12Vv1tx+σ2v2tρ22Vv2tx+12σ12v1t2Vv1t2+12σ22v2t2Vv2t2=0Pj(x,v1t,v2t,T)=H(xlnK),

where

{α1=12,α2=12,β1=κ1+λ1ρ1σ1,β2=κ1+λ1,γ1=κ2+λ2ρ2σ2,γ2=κ2+λ2.

To solve the above system of differential equations, we first consider its characteristic functions, namely fj(x,v1t,v2t,t;φ), (j=1,2). When t=T, the termination condition is

fj(x,v1t,v2t,T;φ)=eiφx,(j=1,2).

Similar to the derivation of the Heston model, we assume that the characteristic function has the form of the following solution

fj(x,v1t,v2t,t;φ)=eC(Tt;φ)+D(Tt;φ)v1t+E(Tt;φ)v2t+iφx. (2)

Taking the partial derivatives separately into equation (2) yields the following three ODEs

{dDdtv1t+iφαjv1t12v1tφ2+(iφσ1ρ1βj)v1tD+12σ1v1tD2=0dEdtv2t+iφαjv2t12v2tφ2+(iφσ2ρ2γj)v2tE+12σ2v2tE2=0dCdt+γiφ+κ1θ1D+κ2θ2E=0

By solving the above ODE, we can easily obtain a display solution of the characteristic function, as follows

{C(τ,φ)=rφiτ+κ1θ1σ12{(βjρ1σ1φi+d1)τ2ln[1g1ed1τ1g1]}+κ2θ2σ22{(γjρ2σ2φi+d2)τ2ln[1g2ed2τ1g2]}D(τ,φ)=βjρ1σ1φi+d1σ12[1ed1τ1g1ed1τ]E(τ,φ)=γjρ2σ2φi+d2σ22[1ed2τ1g2ed2τ],

where

{g1=βjρ1σ1φi+d1βjρ1σ1φid1,d1=(ρ1σ1φiβj)2σ12(2αjφiφ2),g2=γjρ2σ2φi+d2γjρ2σ2φid2,d2=(ρ2σ2φiγj)2σ22(2αjφiφ)2.

By the Fourier inversion, Pj(S,v1t,v2t,t) (j=1,2) can be calculated in the following form.

Pj(S,v1t,v2t,t)=12+1π0Re(eiφlnKfj(x,v1t,v2t,φ)iφ)dφ. (3)

The above expressions can be obtained numerically by programming. As a result, substitute the equation (3) into (1), we obtain a closed-form solution of the Bi-Heston model.

3. Data and price by XGBoost

3.1. Data

In this paper, we use the 50ETF stock Index options data as the research data to calibrate the parameters of the parametric model and train the machine learning model. A total of 123,999 options were selected from 4 January 2016 to 31 December 2020, with the option code, trade date, option closing price, option volume, option trade amount, option type, option name, strike price, expiry date, remaining expiry time, 50ETF price, risk-free return and implied volatility recorded as the original data set (Table 9, Table 10).

Table 9.

The data of 50ETF Call options.

Panel A: Number of Call options1
Moneyness2 DTM3 <30 30≤DTM<90 90≤DTM<180 DTM≥180 ALL

DOTM(S/K<0.94) 4171 5826 4694 1966 16657
OTM(0.94≤S/K<0.97) 1693 2578 1992 1224 7487
ATM(0.97≤S/K<1.03) 3510 5298 4169 2709 15686
ITM(1.03≤S/K<1.06) 1198 2046 1789 1089 6122
DITM(S/K≥1.06) 2491 4169 3428 1629 11717
ALL 13063 19917 16072 8617 57669

Panel B: Average Call Price

Moneyness DTM<30 30≤DTM<90 90≤DTM<180 DTM≥180 ALL

DOTM(S/K<0.94) 0.003 0.016 0.044 0.083 0.029
OTM(0.94≤S/K<0.97) 0.014 0.039 0.080 0.117 0.057
ATM(0.97≤S/K<1.03) 0.055 0.086 0.131 0.166 0.105
ITM(1.03≤S/K<1.06) 0.146 0.166 0.205 0.238 0.186
DITM(S/K≥1.06) 0.400 0.392 0.369 0.363 0.383
ALL 0.108 0.132 0.158 0.187 0.142

Panel C: Average Implied Volatility from Call Options4

Moneyness DTM<30 30≤DTM<90 90≤DTM<180 DTM≥180 ALL

DOTM(S/K<0.94) 0.268 0.211 0.182 0.166 0.212
OTM(0.94≤S/K<0.97) 0.183 0.163 0.148 0.142 0.160
ATM(0.97≤S/K<1.03) 0.188 0.158 0.139 0.129 0.155
ITM(1.03≤S/K<1.06) 0.264 0.171 0.138 0.126 0.171
DITM(S/K≥1.06) 0.550 0.270 0.161 0.143 0.279
ALL 0.288 0.199 0.158 0.142 0.199
1

We use the closing price of options for each business day from 4 January 2016 to 31 December 2020.

2

The moneyness and maturity filters used by Bakshi [17] are applied here as well.

3

DTM represents the time to maturity of the option.

4

The implied volatilities are extracted using the Black-Scholes formula.

Table 10.

The data of 50ETF Put options.

Panel A: Number of Put options1
Moneyness2 DTM3 <30 30≤DTM<90 90≤DTM<180 DTM≥180 ALL

DOTM(S/K<0.94) 2735 4658 4694 1966 14053
OTM(0.94≤S/K<0.97) 1188 2363 1993 1224 6768
ATM(0.97≤S/K<1.03) 3384 5387 4281 2824 15876
ITM(1.03≤S/K<1.06) 1709 2482 2078 1321 7590
DITM(S/K≥1.06) 5096 7600 6715 2633 22044
ALL 14112 22490 19761 9968 66331

Panel B: Average Put Price

Moneyness DTM<30 30≤DTM<90 90≤DTM<180 DTM≥180 ALL

DOTM(S/K<0.94) 0.382 0.391 0.409 0.384 0.394
OTM(0.94≤S/K<0.97) 0.138 0.157 0.196 0.229 0.178
ATM(0.97≤S/K<1.03) 0.045 0.075 0.121 0.155 0.095
ITM(1.03≤S/K<1.06) 0.011 0.032 0.070 0.100 0.050
DITM(S/K≥1.06) 0.003 0.012 0.033 0.067 0.027
ALL 0.099 0.123 0.162 0.179 0.138

Panel C: Average Implied Volatility from Put Options4

Moneyness DTM<30 30≤DTM<90 90≤DTM<180 DTM≥180 ALL

DOTM(S/K<0.94) 0.432 0.303 0.267 0.231 0.306
OTM(0.94≤S/K<0.97) 0.224 0.192 0.203 0.212 0.205
ATM(0.97≤S/K<1.03) 0.164 0.170 0.188 0.198 0.178
ITM(1.03≤S/K<1.06) 0.173 0.168 0.180 0.189 0.176
DITM(S/K≥1.06) 0.264 0.201 0.192 0.199 0.215
ALL 0.259 0.212 0.209 0.205 0.220
1

We use the closing price of options for each business day from 4 January 2016 to 31 December 2020.

2

The moneyness and maturity filters used by Bakshi [17] are applied here as well.

3

DTM represents the time to maturity of the option.

4

The implied volatilities are extracted using the Black-Scholes formula.

3.2. The XGBoost model

The option pricing model is established based on XGBoost, where the feature variables (xi(m), m=1,...,7) are: option trading volume, execution price, remaining maturity time, 50ETF price, 50ETF daily return, risk-free return and implied volatility, and the target variable (p) is the closing price of the option. Let the training data set with n data sets and 7 feature indicators be, D={(x1,p1),...,(xn,pn)}, where xi={xi(1),...,xi(7)}. Because its essence is boosting iterative algorithm, which includes two key parts: addition model and forward distribution algorithm. According to the addition model, the predicted value of option price can be expressed as pˆi=t=1kft(xi), ft is the t-th tree model, and pˆi denotes the predicted price of the i-th option sample xi. At the same time, we need to minimize the objective function to learn the function in the supervision model. During the training of option pricing model, the objective function is:

Obj=i=1nl(pi,pˆi)+t=1kΩ(ft).

pi is the market price of the option. The objective function Obj consists of a traditional loss function l and a regularization term Ω that suppresses the complexity of the model and prevents the model from over fitting. The loss function reflects the difference between the predicted value and the real value. As the penalty term of tree model complexity, Ω should be regularized by the number of leaf nodes and the weight of leaf nodes. So we define the complexity function of the tree model

Ω(ft)=γT+12λj=1Tωj2,Ij={i|q(xi)=j},

where γT denotes the penalty term for the complexity of the number of leaf nodes T, ωj is the weight of the leaf node and also represents the predicted value on each leaf node, 12λj=1Tωj2 punishes the weight of leaf nodes in the form of norm. In addition, XGBoost, as the antecedent distribution algorithm, the learning process is to learn the first tree first, then learn the second tree based on the first, and so on. In the t-th iteration, the price prediction for the i-th option sample xi is

pˆit=pˆit1+ft(xi),

where ft(xi) is the tree model we are going to find in this round. By optimizing the above objective function, we can get ft(xi). Specifically, carry out the second-order Taylor expansion of the function l and remove the constant term l(pi,pˆit1) of the second iteration to obtain the simplified result:

Obj=i=1n(gift(xi)+12hift2(xi))+t=1kΩ(ft), (4)

where gi,hi are the first and second derivatives of the function l to the option price prediction value pˆit1 obtained by the passed t1 iteration. Next, define the sample set of leaf nodes

Ij={i|q(xi)=j},

q(x) represents the specific leaf node into which the sample falls, ωq(x) represents the predicted value of each sample, i.e., we can convert ft(x) to ωq(x). Rewriting the set of samples to the set of leaf nodes. In the equation (4), let Gj=iIjgi, Hj=iIjhi, the new objective function can be obtained

Obj=j=1T[Gjωj+12(Hj+λ)ωj2]+γT. (5)

The predicted value ωj is unknown. The problem of requiring the most value of the above function, i.e., making the value of the first-order derivative of the objective function (5), finding the value ωj=GjHj+λ corresponding to the leaf node j, and obtaining the minimum value of the objective function as

Obj=12j=1TGj2Hj+λ+γT.

The final sample prediction value we want to get is the sum of the weights of the leaf nodes Ω that the sample finally falls into in each tree model. When implementing XGBoost algorithm, we should consider the optimal parameter combination of the algorithm to ensure the optimality of the model. The super parameters of XGBoost algorithm are mainly divided into three categories: task parameters, conventional parameters and lifter parameters. The following is the range of a set of super parameters of XGBoost selected by grid search method combined with cross validation to find the best combination of super parameters (Table 1):

Table 1.

Algorithm Parameters of XGBoost Model.

Parameters of XGBoost Implication of parameter Range of parameter
n-estimators Number of decision trees 300~1000
max-depth Maximum depth of tree 5~10
min-child weight Sum of leaf node weights 5~10
learning rate Learning Rate 0.1~0.5
subsample Proportion of subsamples to training sample set 0.6~0.8
Colsample bytree Feature random sampling ratio 0.3~0.5

In the empirical analysis, we control the scope of the above hyperparameters within the above range to obtain multiple sets of prediction results and then compare the models.

3.3. Parameter calibration and model estimation

There is no doubt that, after observing the option pricing formulae described in the previous section, when we want to use the Heston and Bi-Heston models for option pricing, we should determine the values of the parameters in the models firstly. The objective of parameter determination is to minimise the error between the true price and the theoretical price, and the method of judgement used in this paper is non-linear least squares, so the parameter calibration problem is transformed into an optimisation problem as shown in the following equation.

argminΘMSE=1ni=1n(pipˆi(Θ))2,

where n denotes the number of samples used for parameter calibration, and pi denotes the market price of option i and pˆi denotes the price predicted by the Heston or Bi-Heston model of option i. The Heston model requires the determination of the values of 5 parameters Θ=(ν,θ,κ,σ,ρ), while the Bi-Heston requires 10 parameters Θ=(ν1,θ1,κ1,σ1,ρ1,ν2,θ2,κ2,σ2,ρ2). At the same time, the parameter calibration problem is transformed into the problem of finding the minimum value of a high-dimensional function. We use a particle swarm algorithm based on parallel computing to calibrate the parameters of the Heston and Bi-Heston models. To avoid the influence of the parameter calibration part on the empirical study of the models later on, we selected the same objective function and the same sample for the parameter calibration of the two models mentioned above.

Firstly, we use the root mean squared error (RMSE) as a way of assessing the in-sample pricing effectiveness of the model. A distinct advantage of the RMSE is that it squares the bias term, which allows it to give greater weight to error terms with larger weights, thus penalising models with larger error terms. We expect the better model to show a smaller RMSE, which in turn reflects the good fit of the model.

RMSE=1ni=1n(pipˆi)2.

Secondly, we use the implied volatility root mean square error (IVRMSE) as a way of assessing the out-of-sample pricing effectiveness of the model. IVRMSE is a frequently used method when evaluating the pricing effectiveness of options and is represented by the following equation

IVRMSE=1ni=1n(σiσˆi)2,

where σi denotes the implied volatility of the underlying asset as inverse solved from the B-S model using the price of option i on that day; and σˆi denotes the implied volatility of the underlying asset as inverse solved from the B-S model by using the price of option i obtained from the pricing model. As implied volatility is an important factor in option analysis, a smaller IVRMSE indicates a smaller deviation between the market price of the option and the theoretical price, and the model is better.

4. Empirical analysis

4.1. Experimental steps

In the first step, since the dataset has some invalid fields and some data are missing, we need to pre-process the dataset to facilitate the subsequent study; in the second step, the valid dataset is divided into experimental samples, part of which is used for in-sample parameter calibration and model training, and the other part is used for out-of-sample model prediction; in the third step, the parameters of the Heston and Bi-Heston models are calibrated and the XGBoost model is trained for the first part of the sample; in the fourth step, the parameters and the model obtained in the third step are used for out-of-sample pricing for the second part of the sample (Fig. 1).

Figure 1.

Figure 1

Experimental Flow Chart.

4.2. Analysis of in-sample pricing effects

Based on the processed dataset, we perform a hybrid parameter calibration using options data from 30 September 2020 to 17 December 2020, i.e., call and put options are placed in the same dataset. Finally, the calibrated parameters and the trained model are used to perform in-sample pricing on this dataset. As the PSO algorithm is an intelligent algorithm and XGBoost is a machine learning model, the results obtained are stochastic in nature. Therefore, for the in-sample pricing of the three models, we will perform 10 operations each and evaluate the in-sample pricing effectiveness of the three models using the evaluation metric RMSE described in the previous section.

As can be seen from Table 2, the Bi-Heston model outperforms the Heston model in terms of parameter calibration, although both models use the same calibration technique. In terms of in-sample pricing, the Bi-Heston model and the Heston model are significantly better than the XGBoost model, and the Heston model is more consistent with the Bi-Heston model in terms of pricing, but the Bi-Heston model is slightly better than the Heston model. It is only a coincidence that the in-sample pricing effect is not good enough, but the out-of-sample pricing effect is better, so we select the group with the best in-sample pricing effect as the parameters for out-of-sample pricing, and the specific parameter values are as follows (Table 3):

Table 2.

Comparison of Parametric Calibration Results.

Number of experiment Heston model Bi-Heston model XGBoost model
1 0.0207 0.0191 0.0375
2 0.0200 0.0190 0.0387
3 0.0199 0.0191 0.0454
4 0.0199 0.0193 0.0366
5 0.0200 0.0190 0.0375
6 0.0199 0.0190 0.0399
7 0.0209 0.0190 0.0366
8 0.0199 0.0191 0.0375
9 0.0199 0.0190 0.0451
10 0.0199 0.0190 0.0375

Table 3.

Model parameters.

Heston ν θ κ σ ρ
0.0086 0.7644 0.8579 1.1452 0.4069



Bi-Heston ν1 θ1 κ1 σ1 ρ1
0.0471 0.0010 0.8579 0.0025 -0.9997
ν2 θ2 κ2 σ2 ρ2
0.5038 1.0000 0.0422 0.0010 0.2901

4.3. Analysis of out-sample pricing effects

Based on the model parameters obtained as described previously, we can easily calculate the out-of-sample option price based on the option pricing formula. Fig. 2 shows a comparison of the actual results of the three models for call and put options respectively.

Figure 2.

Figure 2

Out of sample pricing effect comparison of the three models.

It is clear from the graphs that the pricing results of the Heston and Bi-Heston models match the actual option prices, while the XGBoost model is less effective than the first two models. As the strike price of the call option increases, the theoretical price of the option decreases for all models; as the strike price of the put option increases, the theoretical price of the option increases for all models. This phenomenon is consistent with objective reality. The Bi-Heston model is slightly more effective than the Heston model, as can be seen from the images, and specifically, the actual pricing results are shown in Table 4, Table 5.

Table 4.

Call options for Selected Days and Maturities.

K True Heston Bi-Heston XGBoost
2.514 1.031 1.026 1.026 0.855
2.612 0.933 0.928 0.928 0.783
2.711 0.835 0.830 0.830 0.747
2.809 0.737 0.732 0.732 0.671
2.908 0.639 0.633 0.633 0.558
3.100 0.449 0.444 0.444 0.399
3.200 0.348 0.341 0.348 0.317
3.300 0.252 0.242 0.258 0.220
3.400 0.166 0.148 0.178 0.142
3.500 0.096 0.071 0.115 0.094
3.600 0.053 0.030 0.071 0.050
3.700 0.028 0.012 0.042 0.031
3.800 0.015 0.005 0.025 0.029
3.900 0.010 0.002 0.014 0.026
4.000 0.007 0.001 0.008 0.026

Table 5.

Put options for Selected Days and Maturities.

K True Heston Bi-Heston XGBoost
2.563 0.004 0.001 0.004 0.004
2.661 0.006 0.002 0.007 0.007
2.711 0.006 0.002 0.008 0.011
2.809 0.010 0.005 0.013 0.014
2.908 0.016 0.009 0.020 0.020
3.056 0.036 0.025 0.038 0.042
3.154 0.058 0.045 0.057 0.064
3.253 0.091 0.077 0.088 0.103
3.351 0.136 0.122 0.134 0.152
3.450 0.194 0.179 0.193 0.205
3.549 0.263 0.248 0.263 0.277
3.647 0.340 0.324 0.339 0.371
3.746 0.428 0.406 0.421 0.468
3.844 0.518 0.492 0.506 0.540

In response to the above actual results, we can analyse that as the strike price (K) increases, the errors of all three models also increase, but the Bi-Heston model is more stable than the remaining two models. We can also see from the specific values that the Heston model does not differ significantly from the Bi-Heston model at low strike prices for call options, but as the strike price increases, the Bi-Heston model significantly outperforms the Heston model; for put options, the Bi-Heston model significantly outperforms the other two models.

On the other hand, we also examine the effect of out-of-sample pricing of options using IVRMSE. The out-of-sample pricing spans the period from 30 September 2020 to 17 December 2020, and the expiry date of the options is 23 December 2020. We can see that the data set has a relatively long time horizon and we can consider studies that address the stability of the option pricing model. The actual results are shown in Fig. 3.

Figure 3.

Figure 3

Comparison of the Stability of three models.

As can be seen from the graph, the Bi-Heston model is more stable than the other two models for both call and put options. All three models show an increase in IVRMSE as the expiry date approaches, and this is particularly evident in the XGBoost model. Otherwise, the IVRMSE reflects the deviation between the market price of the option and the theoretical price, and the smaller the IVRMSE, the better the option is priced. From the above graph, we can see that the IVRMSE of the Bi-Heston model is smaller than that of the Heston and XGBoost models for both call and put options when the expiry dates are the same, which means that the pricing effect of the Bi-Heston model is better than that of the other two models. For call options, the XGBoost model is significantly better than the Heston model when the expiry date is far away, while the XGBoost model is less stable than the Heston model as the expiry date approaches, and the pricing effect is inferior to that of the Heston model. For put options, both the Heston and Bi-Heston models are more stable and better priced than the XGBoost model.

According to the assumptions of the traditional B-S model, the implied volatility in the model should be a constant that is independent of the strike price. However, many scholars have pointed out that this assumption is not valid in the real market, and the most direct evidence of that is the volatility smile phenomenon. The volatility smile gets its name from the shape of this curve: when the option price deviates from the strike price, the implied volatility of the option rises, thus showing a low middle and high sides, in the form of a smiling mouth. We then also explored whether the three pricing models mentioned above could exhibit the phenomenon of a volatility smile, as shown in Fig. 4.

Figure 4.

Figure 4

Volatility Smile for Selected Days.

The first row shows the relationship between the implied volatility of the call option and the strike price, while the second row shows the relationship between the implied volatility of the put option and the strike price. We can see from the above graph that both the Heston model and the Bi-Heston model reflect the phenomenon of volatility smiles well, while the XGBoost model does not reflect the phenomenon well.

4.4. Analysis of the impact of dataset on the models

In order to explore the impact of the selected data on the models, we designed empirical analyses for different option datasets. In the process of the experiment, we only changed the types of datasets, and did not change the specific parameters of the three models and the experimental process, so as to exclude the adverse effects of other possible factors. In addition to the 50ETF, we also selected two other stock index options, namely, 300ETF(510300) and 300ETF(159919). The number of datasets and variables are basically consistent with the 50ETF described above. The experimental process is still divided into the mixed pricing of call and put options in the sample, and the separate pricing of call and put options out of the sample. The pricing effect is analyzed according to the evaluation metrics RMSE and IVRMSE. Detailed experimental data are shown in Table 6, Table 7, Table 8.

Table 6.

RMSE for in-sample pricing results.

Dataset Name Heston model Bi-Heston model XGBoost model
50ETF 0.0199 0.0190 0.0366
300ETF(510300) 0.0260 0.0250 0.0521
300ETF(159919) 0.0166 0.0162 0.0327

Table 7.

IVRMSE for pricing results of out-sample call option.

Dataset Name Heston model Bi-Heston model XGBoost model
50ETF 0.0268 0.0206 0.0894
300ETF(510300) 0.3479 0.3166 0.0800
300ETF(159919) 0.0524 0.0569 0.0830

Table 8.

IVRMSE for pricing results of out-sample put option.

Dataset Name Heston model Bi-Heston model XGBoost model
50ETF 0.0072 0.0146 0.0868
300ETF(510300) 0.0232 0.0207 0.2002
300ETF(159919) 0.0073 0.0068 0.1271

From the data in Table 6, Table 7, Table 8, it can be concluded that the pricing effect of the model has nothing to do with the type of dataset, and the relevant results are basically consistent with those above.

5. Conclusions

In this paper, the Bi-Heston pricing model is derived and a closed-form analytical solution is given. A relevant empirical analysis is also done to compare and analyse the advantages and disadvantages of the parametric pricing model with the machine learning pricing model. Due to the high latitude and complex nature of the estimated reference function of the parametric model, we use a particle swarm algorithm based on parallel computing for parameter calibration, and the actual results show that the parameter calibration is excellent. In the final empirical analysis, we obtain the following conclusions: (1) In terms of in-sample pricing effects, the parametric pricing model is significantly better than the machine learning model, while the Bi-Heston model is slightly better than the Heston model. (2) In terms of out-of-sample pricing, the machine learning model is inferior to the parametric model for call options, while the Bi-Heston model is significantly better than the other two models for put options, and the other two models are similar. (3) In the robustness analysis of the three pricing models, the machine learning model shows strong instability, while the Bi-Heston model shows a more stable side. (4) When plotting the implied volatility images of the three pricing models, the machine learning model fails to capture the phenomenon of volatility smiles.

Declarations

Author contribution statement

Zhaoli Jia, Ph.D.: Conceived and designed the experiments; Wrote the paper.

Qiang Luo: Conceived and designed the experiments; Performed the experiments; Contributed reagents, materials, analysis tools or data; Wrote the paper.

Hongbo Li: Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data.

Yongxin Wu: Analyzed and interpreted the data.

Funding statement

This work was supported by National Natural Science Foundation of China [72071068]; Innovation and entrepreneurship project for College Students [202110359075].

Data availability statement

The data that has been used is confidential.

Declaration of interests statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.

Footnotes

This paper was supported by the National Natural Science Foundation of China under Grant No. 72071068; Innovation and entrepreneurship project for College Students under Grant No. 202110359075.

Contributor Information

Qiang Luo, Email: luoqiang0930@163.com.

Zhaoli Jia, Email: jiazhaoli@hfut.edu.cn.

Hongbo Li, Email: lihongbo214274@163.com.

Yongxin Wu, Email: Wuyongxin0709@163.com.

References

  • 1.Black F., Scholes M. The pricing of options and corporate liabilities. J. Polit. Econ. 1973;81(3):637–654. [Google Scholar]
  • 2.Heston S.L. A closed-form solution for options with stochastic volatility with applications to bond and currency options. Rev. Financ. Stud. 1993;6(2):327–343. [Google Scholar]
  • 3.Cox J.C., Ingersoll J.E., Ross S.A. A theory of the term structure of interest rates. Econometrica. 1985;53:385–408. [Google Scholar]
  • 4.Christoffersen P., Heston S., Jacobs K. Review of the shape and term structure of the index option smirk: why multifactor stochastic volatility models work so well. Manag. Sci. 2009;55(12):1914–1932. [Google Scholar]
  • 5.Rouah F.D. John Wiley & Sons; 2013. The Heston Model and Its Extensions in Matlab and C. [Google Scholar]
  • 6.Fallah S., Mehrdoust F. On the existence and uniqueness of the solution to the double Heston model equation and valuing lookback option. J. Comput. Appl. Math. 2019;350:412–422. [Google Scholar]
  • 7.Mehrdoust F., Saber N., Najafi A.R. Modeling asset price under two-factor Heston model with jumps. Int. J. Appl. Comput. Math. 2017;3(4):3783–3794. [Google Scholar]
  • 8.Cui Y., del Bano Rollin S., Germano G. Full and fast calibration of the Heston stochastic volatility model. Eur. J. Oper. Res. 2017;263(2):625–638. [Google Scholar]
  • 9.Cacace F., Germani A., Papi M. On parameter estimation of Heston's stochastic volatility model: a polynomial filtering method. Decis. Econ. Finance. 2019;42(2):503–525. [Google Scholar]
  • 10.Ma C., Liu Z., Cao Z., Song W., Zhang J., Zeng W. Cost-sensitive deep forest for price prediction. Pattern Recognit. 2020;107 [Google Scholar]
  • 11.Hitam N.A., Ismail A.R., Saeed F. An optimized support vector machine (svm) based on particle swarm optimization (pso) for cryptocurrency forecasting. Proc. Comput. Sci. 2019;163:427–433. [Google Scholar]
  • 12.Yang R., Yu L., Zhao Y., Yu H., Xu G., Wu Y., Liu Z. Big data analytics for financial market volatility forecast based on support vector machine. Int. J. Inf. Manag. 2020;50:452–462. [Google Scholar]
  • 13.Kumar D., Meghwani S.S., Thakur M. Proximal support vector machine based hybrid prediction models for trend forecasting in financial markets. J. Comput. Sci. 2016;17:1–13. [Google Scholar]
  • 14.Le T., Vo B., Fujita H., Nguyen N.T., Baik S.W. A fast and accurate approach for bankruptcy forecasting using squared logistics loss with gpu-based extreme gradient boosting. Inf. Sci. 2019;494:294–310. [Google Scholar]
  • 15.Ivaşcu C.F. Option pricing using machine learning. Expert Syst. Appl. 2021;163 [Google Scholar]
  • 16.Chang Y.C., Chang K.H., Wu G.J. Application of extreme gradient boosting trees in the construction of credit risk assessment models for financial institutions. Appl. Soft Comput. 2018;73:914–920. [Google Scholar]
  • 17.Bakshi G., Cao C., Chen Z. Empirical performance of alternative option pricing models. J. Finance. 1997;52(5):2003–2049. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data that has been used is confidential.


Articles from Heliyon are provided here courtesy of Elsevier

RESOURCES