Deep Learning Approach for Robust Prediction of Reservoir Bubble Point Pressure

Fahd Saeed Alakbari; Mysara Eissa Mohyaldinn; Mohammed Abdalla Ayoub; Ali Samer Muhsan

doi:10.1021/acsomega.1c02376

. 2021 Aug 12;6(33):21499–21513. doi: 10.1021/acsomega.1c02376

Deep Learning Approach for Robust Prediction of Reservoir Bubble Point Pressure

Fahd Saeed Alakbari ^†,^*, Mysara Eissa Mohyaldinn ^†,^*, Mohammed Abdalla Ayoub ^†, Ali Samer Muhsan ^‡

PMCID: PMC8388097 PMID: 34471753

Abstract

graphic file with name ao1c02376_0018.jpg

The bubble point pressure (P_b) is a crucial pressure–volume–temperature (PVT) property and a primary input needed for performing many petroleum engineering calculations, such as reservoir simulation. The industrial practice of determining P_b is by direct measurement from PVT tests or prediction using empirical correlations. The main problems encountered with the published empirical correlations are their lack of accuracy and the noncomprehensive data set used to develop the model. In addition, most of the published correlations have not proven the relationships between the inputs and outputs as part of the validation process (i.e., no trend analysis was conducted). Nowadays, deep learning techniques such as long short-term memory (LSTM) networks have begun to replace the empirical correlations as they generate high accuracy. This study, therefore, presents a robust LSTM-based model for predicting P_b using a global data set of 760 collected data points from different fields worldwide to build the model. The developed model was then validated by applying trend analysis to ensure that the model follows the correct relationships between the inputs and outputs and performing statistical analysis after comparing the most published correlations. The robustness and accuracy of the model have been verified by performing various statistical analyses and using additional data that was not part of the data set used to develop the model. The trend analysis results have proven that the proposed LSTM-based model follows the correct relationships, indicating the model’s reliability. Furthermore, the statistical analysis results have shown that the lowest average absolute percent relative error (AAPRE) is 8.422% and the highest correlation coefficient is 0.99. These values are much better than those given by the most accurate models in the literature.

1. Introduction

Bubble point pressure (P_b) is known as the pressure at which the gas’s initial bubble emerges from the liquid oil solution.¹ Obtaining accurate P_b is essential for performing proper reservoir simulations, production optimization, and oil recovery calculations.²⁻⁵ Thus, accurate calculation of P_b is hugely significant in the petroleum industry. Therefore, enhancing P_b correlations and models for robust and accurate determination of reservoir P_b has been one of the hot topics of petroleum engineering research.⁶

In this regard, substantial research efforts have been devoted by many researchers leading to the availability of valuable prediction correlation and reported procedures for determining P_b. Standing⁷ used 105 data points from California crude oil, USA, to obtain the P_b correlation. Lasater⁸ published a correlation to obtain P_b based on 158 experimental data points from the USA and Canada. Glaso⁹ applied linear and nonlinear regressions to obtain a P_b correlation utilizing 41 data points from the North Sea to estimate the P_b. Vasquez and Beggs¹⁰ implemented regression techniques to find correlations for the P_b prediction utilizing 6004 data points worldwide. Al-Marhoun¹¹ used nonlinear multiple regression analysis and introduced a P_b correlation using 160 data points from the Middle East. Kartoatmodjo and Schmidt¹² applied nonlinear regression analysis to develop a bubble point prediction correlation using 5392 data points from the USA and South-East Asia. They formed a correlation to determine P_b.¹² Dokla and Osman¹³ utilized a regression method utilizing 51 data points from the UAE to build a P_b prediction correlation. Petrosky and Farshad¹⁴ proposed a bubble point pressure correlation utilizing 90 data points from the Gulf of Mexico. They applied regression methods using the statistical analysis system (SAS) software. Macary and El-Batanoney¹⁵ applied 90 data points from the Gulf of Suez to build a P_b correlation. Omar and Todd¹⁶ utilized a nonlinear regression method to obtain P_b utilizing 93 data points from Malaysia. De Ghetto et al.¹⁷ applied multiple, linear, and nonlinear regressions using the statistical analysis system (SAS) program to develop a bubble point pressure correlation utilizing 3700 data points from the Mediterranean Basin, Africa, Persian Gulf, and the North Sea. Frashad et al.¹⁸ applied multiple nonlinear regression techniques to develop a P_b correlation utilizing 43 data points from Colombia. Almehaideb¹⁹ proposed a P_b correlation utilizing 62 data points from the United Arab Emirates (UAE). He used regression analysis to obtain the correlation.¹⁹ Hanafy et al.²⁰ applied regression analysis to develop a P_b prediction using 324 data points from Egypt. Velarde et al.² applied nonlinear regression methods utilizing 2097 data points worldwide to create a P_b correlation. Gharbi et al.²¹ applied a neural network to find P_b. Al-Shammasi²² used linear and nonlinear regression to propose a P_b correlation utilizing 1661 data points worldwide. Dindoruk and Christman²³ developed the equation to estimate P_b utilizing Microsoft Excel. The correlation was based on 100 Gulf of Mexico pressure–volume–temperature (PVT) laboratory. Mehran et al.²⁴ and Bolondarzadeh et al.²⁵ proposed a P_b correlation using 387 data points and 166 data points, respectively, from Iran fields. Malallah et al.²⁶ used graphical alternating conditional expectation to calculate a P_b. Hemmati and Kharrat²⁷ proposed a P_b correlation using 287 data points from Iranian crude oils. They used the nonlinear multiple regression techniques to build the P_b correlation.²⁷ Mazandarani and Asghari²⁸ applied multiple regression analysis to develop a P_b correlation using 55 data points from different locations in Iran. Khamehchi et al.²⁹ applied a stepwise multiple linear regression analysis using a statistical software package Statistical Package for the Social Sciences (SPSS) for Windows to develop a P_b correlation, utilizing 94 crude oil data points.²⁹ Dutta and Gupta³⁰ used support vector regression to find a P_b calculation. Arabloo et al.⁶ introduced a P_b correlation by applying a multistart feature of LINGO software and using 750 data points worldwide. Patil et al.³¹ applied the multiple regression methods and utilized 23 data points from Niger Delta to calculate P_b. Gomaa³² used the multiple nonlinear regression analysis, utilizing 441 data points from the Middle East crude oil, to build a P_b correlation.³² Alakbari et al.³³ utilized an artificial neural network and the fuzzy logic method to obtain P_b models. Sharrad and Abd-Alrahman³⁴ presented a P_b correlation applying 35 data points from Libya crude oil, using linear regression analysis through EViews software. Seyyedattar et al.³⁵ used 569 data points from different places applying extra trees, a least square support vector machine, and an adaptive network-based fuzzy inference system to predict P_b. They showed that the extra trees model is a robust tool to forecast P_b.³⁵ Talebkeikhah et al.³⁶ utilized a radial basis function neural network, multilayer perceptron, support vector regression, adaptive neuro-fuzzy inference system, decision trees, and random forest to obtain P_b. They indicated that the decision trees model is the best model with an average absolute relative error of 3.379%.³⁶ Tariq et al.³⁷ used a functional network coupled with particle swarm optimization to determine P_b. However, the previous models lack accuracy and have not applied trend analysis to identify the correct relationships between the inputs and outputs as part of the validation process. To our knowledge, this paper is the first study to apply the long short-term memory (LSTM) network and trend analysis for developing a bubble point pressure model.

In this work, we developed an accurate and reliable LSTM-based model by implementing trend analysis to predict P_b. LSTM was applied in this study because it has surpassed other deep learning techniques such as recurrent neural network (RNN) and can capture the nonlinear trends in data and remember previous information for over a long time.³⁸ LSTM has self-connecting hidden layers and a gating structure to control long-sequence time-series data efficiently.³⁹ Therefore, LSTM has been successfully used to solve many problems.³⁸ Furthermore, the trend analysis is used to present the relationships between inputs (gas solubility (R_s), gas-specific gravity (γ_g), oil specific gravity (API), and reservoir temperature (T_f)) and output (bubble point pressure (P_b)) in the existing and LSTM models. The trend analysis defines errors in the existing and LSTM models to present unexpected relationships between inputs and output, highlighting the need to show the model’s reliability. Moreover, several statistical analyses such as average absolute percentage relative error (AAPRE), maximum absolute percent relative error (E_max), minimum absolute percent relative error (E_min), root mean square error (RMSE), standard deviation (SD), and correlation coefficient (R) have been performed to describe, validate, and compare the LSTM model with the previously published models.

2. Methodology

Figure 1 presents the workflow diagram of the previous and proposed LSTM models. The workflow mainly included six phases: data collection, testing of previous models, training the LSTM model, trend analysis, testing the LSTM model, and comparing the models.

2.1. Data Collection and Preprocessing

Seven hundred and sixty data points, collected from different published sources^11,13,16 relevant to this model, were utilized in this work. The preprocessing step involved data cleaning for outliers in the collected data. The collected data set was split into two subparts; 70% (532 data points) of the data was allocated for training. In contrast, 30% (228 data points) of the collected data was allocated for testing the model. Table 1 lists the statistical description of the collected training and testing data. The box and whisker plot was used to detect and remove outliers in the collected data to obtain normal data. The box and whisker plot was introduced by Tukey.⁴⁰Figure S1 (Supporting Information) presents the box and whisker plot that displays lower quartile Q1, upper quartile Q3, interquartile range (IQR), lowest observation (Q1 – 1.5 × IQR), and the highest observation (Q3 + 1.5 IQR). Outliers are any value less than (Q1 – 1.5 × IQR) or more than (Q3 + 1.5 IQR), Figure S1. The box and whisker plots for the inputs and output parameters for the collected and the normal training data are presented in Figures S2 and S3, respectively. The testing data does not have any outlier to make the box and whisker plots of the inputs and output parameters for the collected and normal testing data the same (refer to Figure S4). The statistical description of the normal data is shown in Table S1 (Supporting Information). Figures S5 and S6 show histograms of the inputs and output parameters for the collected and the normal training data in the Supporting Information, respectively. The histograms of the inputs and output parameters for the collected and the normal testing data are shown in Figure S7.

Table 1. Statistical Description of the Collected Data (760 Data Points).

	training data					testing data
parameters	P_b (psi)	R_s (SCF/STB)	γ_g	API (°API)	T_f (°F)	P_b (psi)	R_s (SCF/STB)	γ_g	API (°API)	T_f (°F)
minimum	126	9	0.589	15.3	74	130	26	0.589	19.4	74
maximum	7127	2637	1.367	59.5	294	4432	1850	1.367	51.7	271
standard deviation (SD)	1151	423	0.150	7.32	49.4	1135	424	0.160	6.4	45.3
mean	1790	536	0.905	36.05	171	1808	567	0.947	33.3	158
median	1697	433	0.876	36.7	169	1912	560	0.930	32.6	160
mode	500	585	0.802	33.3	100	4005	61	0.802	31.2	100
kurtosis	1.447	1.959	0.325	–0.260	–0.785	–1.134	0.188	–0.105	–0.470	–0.684
skewness	0.881	1.230	0.834	–0.011	0.097	0.107	0.766	0.734	0.182	0.224
range	7001	2628	0.778	44.2	220	4302	1824	0.778	32.3	197
interquartile range	1648	559	0.201	10.35	74	2086	731	0.230	9.9	67

Open in a new tab

We considered what is reported in the literature regarding the dependency of P_b on other fluid properties. We found that P_b highly depends on R_s, γ_g, API, and T_f, as reported by most previous studies; Table 2. Randomization of data was applied using the shuffle method that changes the original list of the data, and it does not return a new list of the data. Randomization can be used to assure that each data set does not memorize the pattern to avoid generalization and model overfitting. The correlation coefficient (R) was calculated to assess each feature’s significance (R_s, γ_g, API, and T_f) to the output P_b. It is found that P_b is a strong function of R_s (the correlation coefficient of 0.876 and 0.851 for collected and normal data, respectively) but γ_g is moderate (the correlation coefficient of −0.513 and −0.544 for collected and normal data, respectively). P_b is a weak function of API (correlation coefficient of 0.383 and 0.410 for collected and normal data, respectively). P_b is also a weak function of T_f (correlation coefficient of 0.315 and 0.265 for collected and normal data, respectively); Figure 2.

Table 2. Comparison of the Used Data and Accuracy for the Previous and the Proposed LSTM Models.

				input parameters
no.	model	location	no. of data points	B_ob (bbl/STB)	R_s (SCF/STB)	γ_g	API (°API)	T_f (°F)	average absolute error %	standard deviation
1	Standing⁷	California USA	105		√	√	√	√
2	Lasater⁸	Canada West and Mid-cont U.S. and SA	158		√	√	√	√
3	Glaso⁹	North Sea	41		√	√	√	√		6.98
4	Vazquez and Beggs¹⁰	Worldwide	6004		√	√	√	√
5	Al-Marhoun¹¹	Middle East	160		√	√	√	√	3.66	4.536
6	Kartoatmodjo and Schmit¹²	Worldwide	5392		√	√	√	√	20.17
7	Dokla and Osman¹³	UAE	51		√	√	√	√	7.61	10.378
8	Petrosky and Farshed¹⁴	Gulf of Mexico	90		√	√	√	√	3.28	2.56
9	Macary and El-Batanoney¹⁵	Gulf of Suez	90		√	√	√	√	7.04
10	Omar and Todd¹⁶	Malaysia	93	√	√	√	√	√	7.17	9.54
11	De Ghetto et al.¹⁷	Mediterranean Basin, Africa, Persian Gulf, and the North Sea	195		√	√	√	√	12.8	10
12	Frashad et al.¹⁸	Colombia	43		√	√	√	√		37.02
13	Almehaideb¹⁹	UAE	62	√	√	√	√	√	4.997	6.56
14	Hanafy et al.²⁰	Egypt	90		√
15	Velarde et al.²	USA	728		√	√	√	√	11.7
16	Al-Shammasi²²	Worldwide	1709		√	√	√	√	17.849	17.16
17	Dindoruk and Christman²³	Gulf of Mexico	100		√	√	√	√	5.7	7.51
18	Mehran et al.²⁴	Iran	387		√	√	√	√
19	Bolodazadeh et al.²⁵	Iran	166		√	√	√	√
20	Hemati and Kharrat²⁷	Iran	287	√	√	√	√	√	3.71	4.06
21	Mazandarani and Asghari²⁸	Iran	55		√	√	√	√	0.51
22	Khamechi et al.²⁹	Iran	106		√	√	√	√		0.061
23	Arabloo et al.⁶	Worldwide	756		√	√	√	√	18.9
24	Patil et al.³¹	Niger Delta	23		√	√	√	√
25	Gomaa³²	Middle East	441		√	√	√	√	8.12	10.69
26	Sharrad and Abd-Alrahman³⁴	Libya	35		√	√	√	√	8.7
27	Tariq et al.³⁷	Worldwide	760		√	√	√	√	0.9358
28	proposed LSTM	Worldwide	760		√	√	√	√	8.422	0.088

Open in a new tab

Relative importance of input parameters with the P_b output for the (a) collected and (b) normal data.

Before the proposed LSTM model was applied, all of the parameters (inputs and outputs) are normalized between −1 and 1 to scale the parameters within a specified range using the following equation

where Y is the parameter in the normalized form, Y_max is the maximum value of the normalized form (1), Y_min is the minimum value of the normalized form (−1), X is the parameter to be normalized, X_min is the minimum value of the parameter, and X_max is the maximum value of the parameter.

2.2. Proposed LSTM Model

LSTM is a special type of recurrent neural network (RNN) mostly applied to define the relationship between the current and previous input data.⁴¹ Its memory is utilized to save the internal information of the previously input data. The long short-term memory hidden state can contain some memory cells (recurrently connected cells). Each LSTM memory cell involves three gates, the forget gate, the input gate, and the output gate that maintains information movement, which moves through the given layer; Figure 3. A memory cell, identified as a cell state, aims to store all significant information that is to be passed in the next layer. Eq 3 is used to update the cell state to a new value relevant to the new cell state. The inputs are shown as x at the bottom, and the outputs are shown as h; Figure 4.⁴² The gates can learn and remember the input sequence data, which is significant.

On the other hand, the method can discard unnecessary information. There are some multiple activation functions applied in each gate, namely, tanh and sigmoid activation. The activation functions can be used to determine which information passes through the layer.

Hidden state of LSTM for the time step 0 to t + 1.

2.2.1. tanh Activation

tanh activation can control the data passing through the layer and squish data to ensure that the range of data falls between −1 and 1. Moreover, it can regulate the output to ensure that the value is between −1 and 1 or otherwise make the other insignificant value.

2.2.2. Sigmoid Activation

Sigmoid activation squishes values between one and zero and can update or forget the information. When it is zero, the information disappears or is forgotten, whereas the information is remembered when it is one. Therefore, the network can learn when the information can be significant and can be maintained. When the information is not significant, it can be forgotten.

2.2.3. Forget Gate

Forget gate can be used to decide on retaining or discarding the information. It can accept both the present sequence and the previous state information and decide to discard irrelevant data. The insignificant data is extracted, applying the multiplication of a filter. Sigmoid activation can be required here to make the information be between one and zero. The operation in the forget gate can be obtained using

The forget gate can accept data from the prior hidden state h_t–1 and the sequence input x_t. Then, the date can be added with a bias and multiplied by the weights. Next, the values are followed through the sigmoid activation to be kept in the cell state or removed.

2.2.4. Input Gate

Input gate obtains information from h_t–1 and x_t to update information in the cell state. The data is passed into two functions: sigmoid and tanh. Sigmoid activation can be used to decide significant information and can be updated. The values of x_t and h_t–1 are added with a bias and multiplied by the weight to obtain a vector from zero or one. The value of the sigmoid is utilized to determine significant data applying the following equation

Then, eq 8 can be used to obtain the tanh function used to convert the data between −1 and 1 using the following equation. This was to create a new vector that needed to be applied to the cell.

Before being applied to the present cell state, the sigmoid and tanh function values were multiplied together. This step is significant to ensure that the data added to the cell is essential and not redundant.

2.2.5. Output Gate

It is the last gate that relies on current inputs to obtain the next hidden state. Two steps are included in the output gate to complete the process.

The first step can be to obtain the current input x_t and the prior hidden state h_t–1. It then provides values into the sigmoid function. The sigmoid activation can be similar to the other two gates, where they can be multiplied by the weight before being added with bias.

The next step can be to obtain the cell state’s value and bring it into the tanh activation. Then, the tanh function’s value is multiplied with the sigmoid’s output to determine the information for the hidden state.⁴³ Our study focuses on applying LSTM to predict P_b (more about LSTM is found in different references, such as (44−46)).

where f_t is the output of the forget gate, i_t is the output of the input gate, C̅_t is the temporary memory cell in the memory block that is produced by a tanh layer, C_t is the temporary memory cell in the memory block for the current cell state, C_t–1 is the temporary memory cell in the memory block for the previous cell state, and o_t is the output of the output gate. W_f, W_i, W_c, W_o are the recurrent weight matrices; b_f, b_i, b_c are the related bias vectors; h_t–1 represents the previous results in each LSTM cell; h_t represents the results in each LSTM cell; x_t is the current input in each LSTM cell; and σ is the sigmoid function that determines the propagation of the information.

MATLAB software was used to develop the proposed LSTM model. The pseudocode of the proposed LSTM model is presented in Algorithm 1 (see the Supporting Information). Table 3 shows the hyperparameters of the LSTM MATLAB code applied to find P_b.

Table 3. Hyperparameters of the LSTM Model.

parameter	description/value
features (inputs)	4
responses (outputs)	1
hidden units	50
max epochs	256
mini-batch size	128
gradient threshold	0.2000
initial learn rate	0.3000
verbose (indicator to display training progress information)	1
training options	sgdm (stochastic gradient descent with momentum)
momentum	0.9000
L2 Regularization (factor for L2 regularization)	0.0001
OutputMode (format of output)	sequence
StateActivationFunction (activation function to update the cell and hidden state)	tanh
GateActivationFunction (activation function to apply to the gates)	sigmoid
shuffle	once
InputWeightsInitializer (function to initialize input weights)	Glorot (Glorot initializer)
LearnRateSchedule (option for dropping the learning rate during training)	none (the learning rate remains constant throughout the training)
RecurrentWeightsInitializer (function to initialize recurrent weights)	orthogonal
BiasInitializer (function to initialize bias)	unit-forget-gate
GradientThresholdMethod (gradient threshold method)	l2norm
SequenceLength (option to pad, truncate, or split input sequences)	longest
SequencePaddingDirection (direction of padding or truncation)	right
ExecutionEnvironment (hardware resource for the training network)	auto

Open in a new tab

3. Results and Discussion

3.1. P_b Prediction Model Using LSTM

The proposed LSTM model was validated using 228 unseen data points (30% testing data set) after being studied by applying trend analysis to ensure that the model was stable and had sound relationships between the inputs and the output (see Section 3.3). In addition, statistical error analysis was performed to evaluate the LSTM model, as seen in Table 4. As a result, the LSTM model has the lowest average absolute percentage relative error (AAPRE), maximum absolute percent relative error (E_max), minimum absolute percent relative error (E_min), root mean square error (RMSE), and standard deviation (SD) and the highest correlation coefficient (R). As per the tabulated results in Table 4, the accuracy of the LSTM model is outstanding.

Table 4. Statistical Error Analysis of the LSTM Model.

AAPRE (%)	E_max (%)	E_min (%)	RMSE	SD	R
8.422	45.409	0.040	12.20	0.088	0.9900

Open in a new tab

The cross-plotting analysis was applied to assist in visualizing the model’s accuracy. First, the measured P_b values are plotted vs the predicted P_b values. Then, a 45° straight line is plotted on the measured P_b values’ cross plot and the expected P_b values. The closer the plotted data points to the straight line, the higher the correlation or model’s accuracy.

Figures 5 and 6 demonstrate the cross-plotting of the training and testing data set of the proposed LSTM model with the coefficient of determination (R²) of 0.9837 and 9843 for the training and testing data sets, respectively.

Cross plot of the training data set using the LSTM model.

Cross plot of the testing data set using the LSTM model.

3.2. Proposed LSTM Model and Current Models’ Comparison

Some statistical analyses have been performed to describe, validate, and compare the proposed LSTM model with the previously published models that follow the correct relationships between the inputs and the output (see Section 3.3). The statistical error analysis used for the LSTM model and the previous models included cross plot, correlation coefficient, AAPRE, RMSE, SD, E_max, and E_min. In this study, AAPRE and R are used as key indicators.

3.2.1. Cross-Plotting Analysis

Figure 7 displays the cross-plotting and coefficient of determination (R²) of the LSTM model compared to the existing correlations. As presented in the figure, the LSTM model shows the highest accuracy and can predict P_b with the coefficient of determination (R²) of 0.9843.

Cross plot comparison of the LSTM model with the previously published models.

3.2.2. Statistical Error Analysis

Figure 8 shows the AAPRE and correlation coefficient (R) comparison of the LSTM model with the previously published models. As shown, the LSTM model indicates the lowest AAPRE of 8.422% and the highest correlation coefficient (R) of 0.99. On the other hand, the Velarde et al.² and Mehran et al.²⁴ models display AAPRE (%) of 9.0 and 9.75% and correlation coefficient (R) of 0.972 and 0.969, respectively. The Petrosky and Farshad¹⁴ model shows the highest AAPRE of 76.59%.

Correlation coefficient (R) and AAPRE (%) comparison of the LSTM model with previously published models.

The prediction performance of the published correlations was compared with the LSTM model. Table 5 shows the statistical parameters for the comparative evaluation of the existing correlations and the LSTM model. Statistical error analysis has been performed to describe the robustness of the LSTM model. AAPRE and correlation coefficient (R) were used as primary indicators of the model’s accuracy. As seen in the table, the LSTM model and the existing correlations are ranked based on AAPRE and the correlation coefficient (R). The LSTM model also has the lowest RMSE and SD compared to other models. This useful comparison of all correlations and the LSTM model forms an essential means of evaluating correlations and the LSTM performance. Investigation of these statistical error analyses proves that the LSTM model exceeds all of the current correlations.

Table 5. Statistical Error Analysis Comparison of the LSTM Model with the Previously Published Models.

rank	model	AAPRE (%)	E_max (%)	E_min (%)	RMSE	SD	R
1	The proposed LSTM (2021)	8.422	45.409	0.040	12.20	0.088	0.9900
2	Velarde et al. (1997)	9.00	62.47	0.039	13.04	0.095	0.9724
3	Mehran et al. (2006)	9.75	63.86	0.035	13.60	0.095	0.9699
4	Lasater (1958)	11.07	66.08	0.016	15.31	0.106	0.9742
5	Standing (1947)	12.35	69.28	0.032	16.26	0.106	0.9753
6	Arabloo et al. (2014)	12.66	72.98	0.000	17.12	0.116	0.9589
7	Hemati and Kharrat (2007)	13.76	85.01	0.026	22.13	0.174	0.9741
8	Vazquez and Beggs (1980)	16.88	74.79	0.493	21.65	0.136	0.9767
9	Kartoatmodjo and Schmit (1991)	16.94	78.37	0.085	22.74	0.152	0.9722
10	Al-Shammasi (1999)	17.33	62.95	0.205	22.60	0.145	0.9663
11	Frashad et al. (1996)	18.23	74.23	0.042	24.30	0.161	0.9621
12	De Ghetto et al. (1994)	18.37	73.97	0.007	24.83	0.167	0.9720
13	Dindoruk and Christman (2001)	20.89	77.83	0.432	25.81	0.152	0.9369
14	Glaso (1980)	23.02	79.52	0.281	27.70	0.154	0.9701
15	Mazandarani and Asghari (2007)	23.91	120.93	0.127	34.19	0.245	0.9462
16	Almehaideb (1997)	26.15	234.92	0.037	44.18	0.357	0.9482
17	Macary and El-Batanoney (1993)	31.20	149.75	0.111	42.62	0.291	0.9499
18	Khamechi et al. (2009)	31.24	97.52	0.059	37.27	0.204	0.9652
19	Bolodazadeh et al. (2006)	40.42	434.20	0.175	84.69	0.746	0.9694
20	Sharrad and Abd-Alrahman (2019)	45.93	72.46	0.346	47.96	0.139	0.8929
21	Al-Marhoun (1988)	54.06	79.22	27.176	54.40	0.148	0.9538
22	Petrosky and Farshed (1993)	76.59	784.59	0.295	159.87	1.406	0.9703

Open in a new tab

From Table 5, it can also be observed that the Velarde et al.² correlation is the second-rank correlation with AAPRE of 9% and the correlation coefficient (R) of 0.972. The Petrosky and Farshad¹⁴ model shows AAPRE of 76.59% and the correlation coefficient (R) of 0.970 in predicting P_b.

3.3. Sensitivity Analysis

Sensitivity analysis or trend analysis is the study of the uncertainty in the output of a model.⁴⁷ The trend analysis is utilized to determine the model’s reliability. Trend analysis may define the relationships in a model between output and input variables and look for model errors. Additionally, trend analysis can also identify and remove unnecessary parts of the model structure.⁴⁸ An input parameter can be changed between the minimum value and the maximum value, while the others are fixed at their constant mean values.^22,49−51 Graphs are plotted for the input parameter values as the x-axis against P_b as the y-axis for the previous models and the LSTM model.

Four input parameters (R_s, γ_g, API, and T_f) have been selected for the trend analysis. Oil formation volume factor at the bubble point pressure (B_ob) (bbl/STB) has been discarded from the model’s inputs as Omar and Todd,¹⁶ Almehaideb,¹⁹ and Hemati and Kharrat²⁷ only apply it. No apparent trend of P_b has been reported in the literature.

The trend of R_s is displayed in Figure 9. Petrosky and Farshad¹⁴ and Almehaideb¹⁹ developed correlations based on the R_s ranges of 217–1406 and 128–3871 SCF/STB, respectively. The Petrosky and Farshad¹⁴ and Almehaideb¹⁹ correlations show that P_b was negative, −812.6 and −207.5 psi, respectively, at R_s 26 SCF/STB (Figure 9). The LSTM model demonstrates that the trend obeys the trend of the correlations in the literature where R_s is directly proportional to P_b;Figure 10. Therefore, the LSTM model is successful in following the models’ trends.

Gas solubility trend analysis of the LSTM model and previously published models.

Gas solubility trend analysis of the LSTM model.

The trend of gas-specific gravity (γ_g) is demonstrated in Figure 11. As presented by the previous models, the gas-specific gravity (γ_g) is inversely proportional to P_b. Nevertheless, the Hanafy et al.²⁰ correlation indicates that P_b is constant as they did not include the gas-specific gravity in their correlation. The Goma³² correlation has shown that a slight increase in gas-specific gravity can increase P_b, contradicting the previous correlation behavior. Omar and Todd’s¹⁶ correlation showed different trends in which the gas-specific gravity shows a decreasing trend and then increases. Consequently, Hanafy et al.,²⁰ Omar and Todd,¹⁶ and Goma³² failed to represent the behavior accurately. The LSTM model trend followed the most current correlation trend; Figure 12 shows that the LSTM model follows the proper gas-specific gravity trend.

Gas-specific gravity trend analysis of the LSTM model and previously published models.

Gas-specific gravity trend analysis of the LSTM model.

The trend of oil API gravity (API) is displayed in Figure 13. The oil API gravity (API) is inversely proportional to P_b. Hanafy et al.²⁰ showed a horizontal line behavior, which indicates that their correlation does not consider the oil API gravity. Dokla and Osman¹³ and Gomaa³² correlations followed the trend of existing correlations, but increasing the oil API gravity slightly leads to a decrease in P_b. The Tariq et al.³⁷ model shows that increasing the oil API gravity leads to an increase of P_b.

Oil API gravity’s trend analysis of the LSTM model and previously published models.

Consequently, Hanafy et al.,²⁰ Dokla and Osman,¹³ Gomaa,³² and Tariq et al.³⁷ have not proven a proper trend for the P_b correlation. Petrosky and Farshad¹⁴ established their correlation based on the API oil gravity range of 16.3–45 °API. Thus, the Petrosky and Farshad¹⁴ correlation shows that P_b is negative, −37.35 and −145.91 psi at API oil gravity of 48.11 and 51.7 °API, respectively; Figure 13. Figure 14 displays that the oil API gravity trend of the LSTM model obeys the trend of the existing correlations.

Oil API gravity’s trend analysis of the LSTM model.

Figure 15 reveals the trend of the reservoir temperature. P_b is an increasing function regarding reservoir temperature. However, the Hanafy et al.²⁰ correlation shows a horizontal line, which indicates that their correlation too did not include the reservoir temperature parameter. Dindoruk and Christman²³ and Arabloo et al.⁶ correlations indicate that P_b is almost constant, including the reservoir temperature in their correlations. Dokla and Osman’s¹³ correlation suggests that P_b decreases with the increase of the reservoir temperature. Therefore, the Dokla and Osman’s¹³ correlation shows that the trend fails to show dependency adequately. The trend demonstrated by the LSTM model also follows the correct relationships between inputs and output, Figure 16, indicating that it represents the proper trend for the reservoir temperature.

Reservoir temperature trend analysis of the LSTM model and previously published models.

Reservoir temperature trend analysis of the LSTM model.

In conclusion, all of the input parameters (R_s, γ_g, API, and T_f) of the LSTM model follow the proper trends. Consequently, the trend analysis represented the reliability of the proposed LSTM model. In contrast, Hanafy et al.,²⁰ Goma,³² Omar and Todd,¹⁶ Dokla and Osman,¹³ and Tariq et al.³⁷ correlation trends fail to present the behavior correctly. Finally, Petrosky and Farshad¹⁴ and Almehaideb¹⁹ correlations show the predicted P_b’s negative values because the used values of R_s and oil API are out of their considered ranges.

4. Conclusions

The LSTM-based prediction model was developed, validated, and compared against the previous models available in the literature to predict P_b using 760 data points. As a result, the following conclusions can be drawn.

The model’s trend analysis results indicate that the model can accurately describe the profile P_b as a function of all of the considered independent variables following the actual trend expected from the physical relationship.

The LSTM model generates a substantially higher accuracy of P_b compared to all published models. This accuracy is proven by the highest correlation coefficient of 0.99, the lowest AAPRE of 8.422%, the smallest root mean squared error of 12.20, and the lowest standard error deviation (SD) of 0.088 compared to 27 published correlations. The model accuracy and reliability were further enhanced by data randomization to ensure that each data set does not memorize the pattern and avoid generalization and model overfitting.

5. Limitations and Recommendations

The LSTM model was built based on bulk properties (gas solubility (R_s), gas-specific gravity (γ_g), oil specific gravity (API), and reservoir temperature (T_f)) without considering the compositional data of the crude oil because of the data limitations. The proposed model provides high-accuracy results with the four input parameters considered (R_s, γ_g, API, and T_f), especially if the input values are within the data ranges used to develop the model. However, the proposed model’s benefits outweigh the limitations. The results presented here are self-explanatory. The proposed LSTM model was the most robust and accurate model compared to the published models. We recommend using more data to enhance the performance of LSTM for other applications with optimal hyperparameters. For example, deep learning (LSTM) can be applied for many petroleum industry calculations, such as production engineering and drilling engineering calculations.

Acknowledgments

The authors express their appreciation to the Universiti Teknologi PETRONAS for supporting this work under the YUTP-Grant cost center #015LC0-098. They also especially thank COREOR, Petroleum Engineering, Universiti Teknologi PETRONAS, for supporting this work. The first author, in particular, is grateful to Universiti Teknologi PETRONAS for supporting his Ph.D. study under the Graduate Assistance (GA) scheme.

Supporting Information Available

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acsomega.1c02376.

Appendix for statistical error analysis; pseudocode of the proposed LSTM model; statistical description of the normal data; box and whisker plot; box and whisker plots for the collected training data; box and whisker plots for the normal training data; box and whisker plots for the collected and normal testing data; histograms of the inputs and output parameters for the collected training data; histograms of the inputs and output parameters for the normal training data; and histograms of the inputs and output parameters for the collected and normal testing data (PDF)

Author Contributions

F.S.A. implemented the writing: original draft preparation, review and editing, methodology, software, and data collection. M.E.M. implemented the problem conceptualization, research supervision and funding, and drafting. M.A.A. implemented model development, result verification, and review and editing. A.S.M. implemented the review and editing.

The authors declare no competing financial interest.

Supplementary Material

ao1c02376_si_001.pdf^{(997.5KB, pdf)}

References

McCain W. D., Jr.The Properties of Petroleum Fluids, 2nd ed.; PennWell Publ. Co.: Tulsa, Oklahoma, 1990. [Google Scholar]
Velarde J.; Blasingame T. A.; McCain W. D. Jr. In Correlation of Black Oil Properties at Pressures Below Bubble Point Pressure-A New Approach, Annual Technical Meeting; Petroleum Society of Canada, 1997.
Valkó P. P.; McCain W. D. Jr. Reservoir Oil Bubblepoint Pressures Revisited; Solution Gas–Oil Ratios and Surface Gas Specific Gravities. J. Pet. Sci. Eng. 2003, 37, 153–169. 10.1016/S0920-4105(02)00319-4. [DOI] [Google Scholar]
Bandyopadhyay P.; Sharma A. Development of a New Semi Analytical Model for Prediction of Bubble Point Pressure of Crude Oils. J. Pet. Sci. Eng. 2011, 78, 719–731. 10.1016/j.petrol.2011.06.007. [DOI] [Google Scholar]
Farasat A.; Shokrollahi A.; Arabloo M.; Gharagheizi F.; Mohammadi A. H. Toward an Intelligent Approach for Determination of Saturation Pressure of Crude Oil. Fuel Process. Technol. 2013, 115, 201–214. 10.1016/j.fuproc.2013.06.007. [DOI] [Google Scholar]
Arabloo M.; Amooie M.-A.; Hemmati-Sarapardeh A.; Ghazanfari M.-H.; Mohammadi A. H. Application of Constrained Multi-Variable Search Methods for Prediction of PVT Properties of Crude Oil Systems. Fluid Phase Equilib. 2014, 363, 121–130. 10.1016/j.fluid.2013.11.012. [DOI] [Google Scholar]
Standing M. B.A Pressure-Volume-Temperature Correlation for Mixtures of California Oils and Gases. In Drilling and Production Practice; American Petroleum Institute, 1947. [Google Scholar]
Lasater J. A. Bubble Point Pressure Correlation. J. Pet. Technol. 1958, 10, 65–67. 10.2118/957-G. [DOI] [Google Scholar]
Glaso O. Generalized Pressure-Volume-Temperature Correlations. J. Pet. Technol. 1980, 32, 785–795. 10.2118/8016-PA. [DOI] [Google Scholar]
Vasquez M.; Beggs H. D. Correlations for Fluid Physical Property Prediction. J. Pet. Technol. 1980, 32, 968–970. 10.2118/6719-PA. [DOI] [Google Scholar]
Al-Marhoun M. A. PVT Correlations for Middle East Crude Oils. J. Pet. Technol. 1988, 40, 650–666. 10.2118/13718-PA. [DOI] [Google Scholar]
Kartoatmodjo R. S. T.; Schmidt Z.. New Correlations for Crude Oil Physical Properties. 1991.
Dokla M.; Osman M. Correlation of PVT Properties for UAE Crudes (Includes Associated Papers 26135 and 26316). SPE Form. Eval. 1992, 7, 41–46. 10.2118/20989-PA. [DOI] [Google Scholar]
Petrosky G. E.; Farshad F. F. In Pressure-Volume-Temperature Correlations for Gulf of Mexico Crude Oils, SPE Annual Technical Conference and Exhibition; Society of Petroleum Engineers, 1993.
Macary S. M.; El-Batanoney M. H. Derivation of PVT Correlations for the Gulf of Suez Crude Oils. Sekiyu Gakkaishi 1993, 36, 472–478. 10.1627/jpi1958.36.472. [DOI] [Google Scholar]
Omar M. I.; Todd A. C. In Development of New Modified Black Oil Correlations for Malaysian Crudes, SPE Asia Pacific Oil and Gas Conference; Society of Petroleum Engineers, 1993.
De Ghetto G.; Paone F.; Villa M.. Reliability Analysis on PVT Correlations. Presented at the European Petroleum Conference Held in London, U.K., October 25–27, 1994; Paper SPE-28904.
Frashad F.; LeBlanc J. L.; Garber J. D.; Osorio J. G. In Empirical PVT Correlations for Colombian Crude Oils, SPE Latin America/Caribbean Petroleum Engineering Conference; Society of Petroleum Engineers, 1996.
Almehaideb R. A. In Improved PVT Correlations for UAE Crude Oils, Middle East Oil Show and Conference; Society of Petroleum Engineers, 1997.
Hanafy H. H.; Macary S. M.; ElNady Y. M.; Bayomi A. A.; El Batanony M. H. In Empirical PVT Correlations Applied to Egyptian Crude Oils Exemplify Significance of Using Regional Correlations, International Symposium on Oilfield Chemistry; Society of Petroleum Engineers, 1997.
Gharbi R. B.; Elsharkawy A. M.; Karkoub M. Universal Neural-Network-Based Model for Estimating the PVT Properties of Crude Oil Systems. Energy Fuels 1999, 13, 454–458. 10.1021/ef980143v. [DOI] [Google Scholar]
Al-Shammasi A. A. In Bubble Point Pressure and Oil Formation Volume Factor Correlations, Middle East Oil Show and Conference; Society of Petroleum Engineers: Bahrain, 1999; p 17.
Dindoruk B.; Christman P. G. In PVT Properties and Viscosity Correlations for Gulf of Mexico Oils, SPE Annual Technical Conference and Exhibition; Society of Petroleum Engineers: New Orleans, Louisiana, 2001; p 14.
Mehran F.; Movagharnejad K.; Didanloo A. In New Correlation for Estimation of Formation Volume Factor and Bubblepoint Pressure for Iranian Oil Fields, 1st Iranian Petroleum Engineering Conference, 2006.
Bolondarzadeh A.; Hashemi S.; Solgani B. In The New PVT Generated Correlations of Iranian Oil Properties, 4th Iranian Petroleum Engineering Student Conference, 2006.
Malallah A.; Gharbi R.; Algharaib M. Accurate Estimation of the World Crude Oil PVT Properties Using Graphical Alternating Conditional Expectation. Energy Fuels 2006, 20, 688–698. 10.1021/ef0501750. [DOI] [Google Scholar]
Hemmati M. N.; Kharrat R. In A Correlation Approach for Prediction of Crude Oil PVT Properties, SPE Middle East Oil and Gas Show and Conference; Society of Petroleum Engineers, 2007.
Mazandarani M. T.; Asghari S. M. In Correlations for Predicting Solution Gas-Oil Ratio, Bubblepoint Pressure and Oil Formation Volume Factor at Bubblepoint of Iran Crude Oils, European Congress of Chemical Engineering, Copenhagen, 2007.
Khamehchi E.; Rashidi F.; Rasouli H.; Ebrahimian A. Novel Empirical Correlations for Estimation of Bubble Point Pressure, Saturated Viscosity and Gas Solubility of Crude Oils. Pet. Sci. 2009, 6, 86–90. 10.1007/s12182-009-0016-x. [DOI] [Google Scholar]
Dutta S.; Gupta J. P. PVT Correlations of Indian Crude Using Support Vector Regression. Energy Fuels 2009, 23, 5483–5490. 10.1021/ef900518f. [DOI] [Google Scholar]
Patil P. A.; Bai M. X.; Teodoriu C.; Reinicke K. M. Development of PVT Correlations According to Geography. Pet. Sci. Technol. 2014, 32, 991–999. 10.1080/10916466.2011.641653. [DOI] [Google Scholar]
Gomaa S. New Bubble Point Pressure Correlation for Middle East Crude Oils. Int. Adv. Res. J. Sci. Eng. Technol. 2016, 3, 1–9. [Google Scholar]
Alakbari F. S.; Elkatatny S.; Baarimah S. O. In Prediction of Bubble Point Pressure Using Artificial Intelligence AI Techniques, SPE Middle East Artificial Lift Conference and Exhibition; Society of Petroleum Engineers: Manama, Kingdom of Bahrain, 2016; p 9.
Sharrad M.; Abd-Alrahman H. H. New Derived Correlations for Libyan Crude Oil to Estimate Bubble-Point Pressure. Sci. J. Appl. Sci. Sabratha Univ. 2019, 2, 1–13. 10.47891/sabujas.v2i1.1-13. [DOI] [Google Scholar]
Seyyedattar M.; Ghiasi M. M.; Zendehboudi S.; Butt S. Determination of Bubble Point Pressure and Oil Formation Volume Factor: Extra Trees Compared with LSSVM-CSA Hybrid and ANFIS Models. Fuel 2020, 269, 116834 10.1016/j.fuel.2019.116834. [DOI] [Google Scholar]
Talebkeikhah M.; Amar M. N.; Naseri A.; Humand M.; Hemmati-Sarapardeh A.; Dabir B.; Seghier M. E. A. Ben. Experimental Measurement and Compositional Modeling of Crude Oil Viscosity at Reservoir Conditions. J. Taiwan Inst. Chem. Eng. 2020, 109, 35–50. 10.1016/j.jtice.2020.03.001. [DOI] [Google Scholar]
Tariq Z.; Mahmoud M.; Abdulraheem A. Machine Learning-Based Improved Pressure–Volume–Temperature Correlations for Black Oil Reservoirs. J. Energy Resour. Technol. 2021, 143, 113003 10.1115/1.4050579. [DOI] [Google Scholar]
Fan D.; Sun H.; Yao J.; Zhang K.; Yan X.; Sun Z. Well Production Forecasting Based on ARIMA-LSTM Model Considering Manual Operations. Energy 2021, 220, 119708 10.1016/j.energy.2020.119708. [DOI] [Google Scholar]
Iqbal N.; Rizwan A.; Khan A. N.; Ahmad R.; Kim B. W.; Kim K.; Kim D.-H. Boreholes Data Analysis Architecture Based on Clustering and Prediction Models for Enhancing Underground Safety Verification. IEEE Access 2021, 78428. 10.1109/ACCESS.2021.3083175. [DOI] [Google Scholar]
Tukey J. W.Exploratory Data Analysis; Addison-Wesley Publishing Company: Reading, MA, 1977; Vol. 2. [Google Scholar]
Karim F.; Majumdar S.; Darabi H.; Chen S. LSTM Fully Convolutional Networks for Time Series Classification. IEEE Access 2018, 6, 1662–1669. 10.1109/ACCESS.2017.2779939. [DOI] [Google Scholar]
Xiang Z.; Yan J.; Demir I. A Rainfall-runoff Model with LSTM-based Sequence-to-sequence Learning. Water Resour. Res. 2020, 56, e2019WR025326 10.1029/2019WR025326. [DOI] [Google Scholar]
Pysal D.; Abdulkadir S. J.; Shukri S. R. M.; Alhussian H. Classification of Children’s Drawing Strategies on Touch-Screen of Seriation Objects Using a Novel Deep Learning Hybrid Model. Alexandria Eng. J. 2020, 115–129. 10.1016/j.aej.2020.06.019. [DOI] [Google Scholar]
Hochreiter S.; Schmidhuber J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. 10.1162/neco.1997.9.8.1735. [DOI] [PubMed] [Google Scholar]
Schmidhuber J. Deep Learning in Neural Networks: An Overview. Neural Networks 2015, 61, 85–117. 10.1016/j.neunet.2014.09.003. [DOI] [PubMed] [Google Scholar]
LeCun Y.; Bengio Y.; Hinton G. Deep Learning. Nature 2015, 521, 436–444. 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]
Saltelli A. Sensitivity Analysis for Importance Assessment. Risk Anal. 2002, 22, 579–590. 10.1111/0272-4332.00040. [DOI] [PubMed] [Google Scholar]
Bahremand A.; De Smedt F. Distributed Hydrological Modeling and Sensitivity Analysis in Torysa Watershed, Slovakia. Water Resour. Manage. 2008, 22, 393–408. 10.1007/s11269-007-9168-x. [DOI] [Google Scholar]
Osman E. S. A.; Ayoub M. A.; Aggour M. A.. Artificial Neural Network Model for Predicting Bottomhole Flowing Pressure in Vertical Multiphase Flow; Society of Petroleum Engineers, 2005. [Google Scholar]
Ayoub M. A.; Zainal S. N.; Elhaj M. E.; Ku Ishak K. E. H.; Ahmed Q. In Revisiting the Coefficient of Isothermal Oil Compressibility Below Bubble Point Pressure and Formulation of a New Model Using Adaptive Neuro-Fuzzy Inference System Technique, International Petroleum Technology Conference; International Petroleum Technology Conference, 2020.
Alakbari F. S.; Mohyaldinn M. E.; Ayoub M. A.; Muhsan A. S.; Hussein I. A. A Robust Fuzzy Logic-Based Model for Predicting the Critical Total Drawdown in Sand Production in Oil and Gas Wells. PLoS One 2021, 16, e0250466 10.1371/journal.pone.0250466. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ao1c02376_si_001.pdf^{(997.5KB, pdf)}

[ref1] McCain W. D., Jr.The Properties of Petroleum Fluids, 2nd ed.; PennWell Publ. Co.: Tulsa, Oklahoma, 1990. [Google Scholar]

[ref2] Velarde J.; Blasingame T. A.; McCain W. D. Jr. In Correlation of Black Oil Properties at Pressures Below Bubble Point Pressure-A New Approach, Annual Technical Meeting; Petroleum Society of Canada, 1997.

[ref3] Valkó P. P.; McCain W. D. Jr. Reservoir Oil Bubblepoint Pressures Revisited; Solution Gas–Oil Ratios and Surface Gas Specific Gravities. J. Pet. Sci. Eng. 2003, 37, 153–169. 10.1016/S0920-4105(02)00319-4. [DOI] [Google Scholar]

[ref4] Bandyopadhyay P.; Sharma A. Development of a New Semi Analytical Model for Prediction of Bubble Point Pressure of Crude Oils. J. Pet. Sci. Eng. 2011, 78, 719–731. 10.1016/j.petrol.2011.06.007. [DOI] [Google Scholar]

[ref5] Farasat A.; Shokrollahi A.; Arabloo M.; Gharagheizi F.; Mohammadi A. H. Toward an Intelligent Approach for Determination of Saturation Pressure of Crude Oil. Fuel Process. Technol. 2013, 115, 201–214. 10.1016/j.fuproc.2013.06.007. [DOI] [Google Scholar]

[ref6] Arabloo M.; Amooie M.-A.; Hemmati-Sarapardeh A.; Ghazanfari M.-H.; Mohammadi A. H. Application of Constrained Multi-Variable Search Methods for Prediction of PVT Properties of Crude Oil Systems. Fluid Phase Equilib. 2014, 363, 121–130. 10.1016/j.fluid.2013.11.012. [DOI] [Google Scholar]

[ref7] Standing M. B.A Pressure-Volume-Temperature Correlation for Mixtures of California Oils and Gases. In Drilling and Production Practice; American Petroleum Institute, 1947. [Google Scholar]

[ref8] Lasater J. A. Bubble Point Pressure Correlation. J. Pet. Technol. 1958, 10, 65–67. 10.2118/957-G. [DOI] [Google Scholar]

[ref9] Glaso O. Generalized Pressure-Volume-Temperature Correlations. J. Pet. Technol. 1980, 32, 785–795. 10.2118/8016-PA. [DOI] [Google Scholar]

[ref10] Vasquez M.; Beggs H. D. Correlations for Fluid Physical Property Prediction. J. Pet. Technol. 1980, 32, 968–970. 10.2118/6719-PA. [DOI] [Google Scholar]

[ref11] Al-Marhoun M. A. PVT Correlations for Middle East Crude Oils. J. Pet. Technol. 1988, 40, 650–666. 10.2118/13718-PA. [DOI] [Google Scholar]

[ref12] Kartoatmodjo R. S. T.; Schmidt Z.. New Correlations for Crude Oil Physical Properties. 1991.

[ref13] Dokla M.; Osman M. Correlation of PVT Properties for UAE Crudes (Includes Associated Papers 26135 and 26316). SPE Form. Eval. 1992, 7, 41–46. 10.2118/20989-PA. [DOI] [Google Scholar]

[ref14] Petrosky G. E.; Farshad F. F. In Pressure-Volume-Temperature Correlations for Gulf of Mexico Crude Oils, SPE Annual Technical Conference and Exhibition; Society of Petroleum Engineers, 1993.

[ref15] Macary S. M.; El-Batanoney M. H. Derivation of PVT Correlations for the Gulf of Suez Crude Oils. Sekiyu Gakkaishi 1993, 36, 472–478. 10.1627/jpi1958.36.472. [DOI] [Google Scholar]

[ref16] Omar M. I.; Todd A. C. In Development of New Modified Black Oil Correlations for Malaysian Crudes, SPE Asia Pacific Oil and Gas Conference; Society of Petroleum Engineers, 1993.

[ref17] De Ghetto G.; Paone F.; Villa M.. Reliability Analysis on PVT Correlations. Presented at the European Petroleum Conference Held in London, U.K., October 25–27, 1994; Paper SPE-28904.

[ref18] Frashad F.; LeBlanc J. L.; Garber J. D.; Osorio J. G. In Empirical PVT Correlations for Colombian Crude Oils, SPE Latin America/Caribbean Petroleum Engineering Conference; Society of Petroleum Engineers, 1996.

[ref19] Almehaideb R. A. In Improved PVT Correlations for UAE Crude Oils, Middle East Oil Show and Conference; Society of Petroleum Engineers, 1997.

[ref20] Hanafy H. H.; Macary S. M.; ElNady Y. M.; Bayomi A. A.; El Batanony M. H. In Empirical PVT Correlations Applied to Egyptian Crude Oils Exemplify Significance of Using Regional Correlations, International Symposium on Oilfield Chemistry; Society of Petroleum Engineers, 1997.

[ref21] Gharbi R. B.; Elsharkawy A. M.; Karkoub M. Universal Neural-Network-Based Model for Estimating the PVT Properties of Crude Oil Systems. Energy Fuels 1999, 13, 454–458. 10.1021/ef980143v. [DOI] [Google Scholar]

[ref22] Al-Shammasi A. A. In Bubble Point Pressure and Oil Formation Volume Factor Correlations, Middle East Oil Show and Conference; Society of Petroleum Engineers: Bahrain, 1999; p 17.

[ref23] Dindoruk B.; Christman P. G. In PVT Properties and Viscosity Correlations for Gulf of Mexico Oils, SPE Annual Technical Conference and Exhibition; Society of Petroleum Engineers: New Orleans, Louisiana, 2001; p 14.

[ref24] Mehran F.; Movagharnejad K.; Didanloo A. In New Correlation for Estimation of Formation Volume Factor and Bubblepoint Pressure for Iranian Oil Fields, 1st Iranian Petroleum Engineering Conference, 2006.

[ref25] Bolondarzadeh A.; Hashemi S.; Solgani B. In The New PVT Generated Correlations of Iranian Oil Properties, 4th Iranian Petroleum Engineering Student Conference, 2006.

[ref26] Malallah A.; Gharbi R.; Algharaib M. Accurate Estimation of the World Crude Oil PVT Properties Using Graphical Alternating Conditional Expectation. Energy Fuels 2006, 20, 688–698. 10.1021/ef0501750. [DOI] [Google Scholar]

[ref27] Hemmati M. N.; Kharrat R. In A Correlation Approach for Prediction of Crude Oil PVT Properties, SPE Middle East Oil and Gas Show and Conference; Society of Petroleum Engineers, 2007.

[ref28] Mazandarani M. T.; Asghari S. M. In Correlations for Predicting Solution Gas-Oil Ratio, Bubblepoint Pressure and Oil Formation Volume Factor at Bubblepoint of Iran Crude Oils, European Congress of Chemical Engineering, Copenhagen, 2007.

[ref29] Khamehchi E.; Rashidi F.; Rasouli H.; Ebrahimian A. Novel Empirical Correlations for Estimation of Bubble Point Pressure, Saturated Viscosity and Gas Solubility of Crude Oils. Pet. Sci. 2009, 6, 86–90. 10.1007/s12182-009-0016-x. [DOI] [Google Scholar]

[ref30] Dutta S.; Gupta J. P. PVT Correlations of Indian Crude Using Support Vector Regression. Energy Fuels 2009, 23, 5483–5490. 10.1021/ef900518f. [DOI] [Google Scholar]

[ref31] Patil P. A.; Bai M. X.; Teodoriu C.; Reinicke K. M. Development of PVT Correlations According to Geography. Pet. Sci. Technol. 2014, 32, 991–999. 10.1080/10916466.2011.641653. [DOI] [Google Scholar]

[ref32] Gomaa S. New Bubble Point Pressure Correlation for Middle East Crude Oils. Int. Adv. Res. J. Sci. Eng. Technol. 2016, 3, 1–9. [Google Scholar]

[ref33] Alakbari F. S.; Elkatatny S.; Baarimah S. O. In Prediction of Bubble Point Pressure Using Artificial Intelligence AI Techniques, SPE Middle East Artificial Lift Conference and Exhibition; Society of Petroleum Engineers: Manama, Kingdom of Bahrain, 2016; p 9.

[ref34] Sharrad M.; Abd-Alrahman H. H. New Derived Correlations for Libyan Crude Oil to Estimate Bubble-Point Pressure. Sci. J. Appl. Sci. Sabratha Univ. 2019, 2, 1–13. 10.47891/sabujas.v2i1.1-13. [DOI] [Google Scholar]

[ref35] Seyyedattar M.; Ghiasi M. M.; Zendehboudi S.; Butt S. Determination of Bubble Point Pressure and Oil Formation Volume Factor: Extra Trees Compared with LSSVM-CSA Hybrid and ANFIS Models. Fuel 2020, 269, 116834 10.1016/j.fuel.2019.116834. [DOI] [Google Scholar]

[ref36] Talebkeikhah M.; Amar M. N.; Naseri A.; Humand M.; Hemmati-Sarapardeh A.; Dabir B.; Seghier M. E. A. Ben. Experimental Measurement and Compositional Modeling of Crude Oil Viscosity at Reservoir Conditions. J. Taiwan Inst. Chem. Eng. 2020, 109, 35–50. 10.1016/j.jtice.2020.03.001. [DOI] [Google Scholar]

[ref37] Tariq Z.; Mahmoud M.; Abdulraheem A. Machine Learning-Based Improved Pressure–Volume–Temperature Correlations for Black Oil Reservoirs. J. Energy Resour. Technol. 2021, 143, 113003 10.1115/1.4050579. [DOI] [Google Scholar]

[ref38] Fan D.; Sun H.; Yao J.; Zhang K.; Yan X.; Sun Z. Well Production Forecasting Based on ARIMA-LSTM Model Considering Manual Operations. Energy 2021, 220, 119708 10.1016/j.energy.2020.119708. [DOI] [Google Scholar]

[ref39] Iqbal N.; Rizwan A.; Khan A. N.; Ahmad R.; Kim B. W.; Kim K.; Kim D.-H. Boreholes Data Analysis Architecture Based on Clustering and Prediction Models for Enhancing Underground Safety Verification. IEEE Access 2021, 78428. 10.1109/ACCESS.2021.3083175. [DOI] [Google Scholar]

[ref40] Tukey J. W.Exploratory Data Analysis; Addison-Wesley Publishing Company: Reading, MA, 1977; Vol. 2. [Google Scholar]

[ref41] Karim F.; Majumdar S.; Darabi H.; Chen S. LSTM Fully Convolutional Networks for Time Series Classification. IEEE Access 2018, 6, 1662–1669. 10.1109/ACCESS.2017.2779939. [DOI] [Google Scholar]

[ref42] Xiang Z.; Yan J.; Demir I. A Rainfall-runoff Model with LSTM-based Sequence-to-sequence Learning. Water Resour. Res. 2020, 56, e2019WR025326 10.1029/2019WR025326. [DOI] [Google Scholar]

[ref43] Pysal D.; Abdulkadir S. J.; Shukri S. R. M.; Alhussian H. Classification of Children’s Drawing Strategies on Touch-Screen of Seriation Objects Using a Novel Deep Learning Hybrid Model. Alexandria Eng. J. 2020, 115–129. 10.1016/j.aej.2020.06.019. [DOI] [Google Scholar]

[ref44] Hochreiter S.; Schmidhuber J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. 10.1162/neco.1997.9.8.1735. [DOI] [PubMed] [Google Scholar]

[ref45] Schmidhuber J. Deep Learning in Neural Networks: An Overview. Neural Networks 2015, 61, 85–117. 10.1016/j.neunet.2014.09.003. [DOI] [PubMed] [Google Scholar]

[ref46] LeCun Y.; Bengio Y.; Hinton G. Deep Learning. Nature 2015, 521, 436–444. 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]

[ref47] Saltelli A. Sensitivity Analysis for Importance Assessment. Risk Anal. 2002, 22, 579–590. 10.1111/0272-4332.00040. [DOI] [PubMed] [Google Scholar]

[ref48] Bahremand A.; De Smedt F. Distributed Hydrological Modeling and Sensitivity Analysis in Torysa Watershed, Slovakia. Water Resour. Manage. 2008, 22, 393–408. 10.1007/s11269-007-9168-x. [DOI] [Google Scholar]

[ref49] Osman E. S. A.; Ayoub M. A.; Aggour M. A.. Artificial Neural Network Model for Predicting Bottomhole Flowing Pressure in Vertical Multiphase Flow; Society of Petroleum Engineers, 2005. [Google Scholar]

[ref50] Ayoub M. A.; Zainal S. N.; Elhaj M. E.; Ku Ishak K. E. H.; Ahmed Q. In Revisiting the Coefficient of Isothermal Oil Compressibility Below Bubble Point Pressure and Formulation of a New Model Using Adaptive Neuro-Fuzzy Inference System Technique, International Petroleum Technology Conference; International Petroleum Technology Conference, 2020.

[ref51] Alakbari F. S.; Mohyaldinn M. E.; Ayoub M. A.; Muhsan A. S.; Hussein I. A. A Robust Fuzzy Logic-Based Model for Predicting the Critical Total Drawdown in Sand Production in Oil and Gas Wells. PLoS One 2021, 16, e0250466 10.1371/journal.pone.0250466. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Deep Learning Approach for Robust Prediction of Reservoir Bubble Point Pressure

Fahd Saeed Alakbari

Mysara Eissa Mohyaldinn

Mohammed Abdalla Ayoub

Ali Samer Muhsan

Abstract

1. Introduction

2. Methodology

Figure 1.

2.1. Data Collection and Preprocessing

Table 1. Statistical Description of the Collected Data (760 Data Points).

Table 2. Comparison of the Used Data and Accuracy for the Previous and the Proposed LSTM Models.

Figure 2.

2.2. Proposed LSTM Model

Figure 3.

Figure 4.

2.2.1. tanh Activation

2.2.2. Sigmoid Activation

2.2.3. Forget Gate

2.2.4. Input Gate

2.2.5. Output Gate

Table 3. Hyperparameters of the LSTM Model.

3. Results and Discussion

3.1. Pb Prediction Model Using LSTM

Table 4. Statistical Error Analysis of the LSTM Model.

Figure 5.

Figure 6.

3.2. Proposed LSTM Model and Current Models’ Comparison

3.2.1. Cross-Plotting Analysis

Figure 7.

3.2.2. Statistical Error Analysis

Figure 8.

Table 5. Statistical Error Analysis Comparison of the LSTM Model with the Previously Published Models.

3.3. Sensitivity Analysis

Figure 9.

Figure 10.

Figure 11.

Figure 12.

Figure 13.

Figure 14.

Figure 15.

Figure 16.

4. Conclusions

5. Limitations and Recommendations

Acknowledgments

Supporting Information Available

Author Contributions

Supplementary Material

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

3.1. P_b Prediction Model Using LSTM