Scientific Reports. 2023 Feb 23;13:3164. doi: 10.1038/s41598-023-30127-8

Numerical solution of neutral delay differential equations using orthogonal neural network

Chavda Divyesh Vinodbhai 1, Shruti Dubey 1
PMCID: PMC9950134  PMID: 36823259

Abstract

In this paper, an efficient orthogonal neural network (ONN) approach is introduced to solve higher-order neutral delay differential equations (NDDEs) with variable coefficients and multiple delays. The method is implemented by replacing the hidden layer of a feed-forward neural network with an orthogonal polynomial-based functional expansion block, and the corresponding weights of the network are obtained using an extreme learning machine (ELM) approach. Starting with simple delay differential equations (DDEs), the method is extended to NDDEs and systems of NDDEs. Attention is given to consistency and convergence analysis, and it is shown that the method produces a uniform closed-form solution with an error of order $2^{-n}$, where $n$ is the number of neurons. The developed neural network method is validated on various types of example problems (DDEs, NDDEs, and systems of NDDEs) with four different types of special orthogonal polynomials.

Subject terms: Applied mathematics, Computational science, Information technology

Introduction

The delay differential equation (DDE) plays a crucial role in epidemiology, population growth, and many other mathematical modeling problems. In DDEs, the dependent variable depends not only on its current state but also on a specific past state. A DDE in which time delays also appear in the state derivative is called a neutral delay differential equation (NDDE). Delay terms are classified into three types: discrete, continuous, and proportional. In this paper, we focus on proportional DDEs and NDDEs. One famous example of a proportional delay differential equation is the pantograph differential equation, which was first introduced in [1].

Generally, the exact solution of a delay differential equation is complicated to find, and due to the model’s complexity, many DDEs do not have an exact solution. Various numerical schemes have been developed over the years to find approximate solutions of delay differential equations. Several articles [2–9] illustrate exact and numerical methods for approximate solutions of DDEs and NDDEs.

Artificial neural networks (ANNs) have been utilised to produce approximate solutions of differential equations for the past 22 years. A neural network approach for several ordinary and partial differential equations was first proposed by Lagaris et al. in [10]. The approximate solution delivered by an artificial neural network has a variety of advantages: (i) the derived approximation of the solution is in closed analytic form; (ii) the generalization ability of the approximation is excellent; (iii) discretization of derivatives is not required. Many articles on approximate artificial neural network solutions of different differential equations are available in the literature [11–20]. As far as we know, studies on obtaining approximate solutions of delay differential equations using artificial neural networks are limited, and very little literature is available on the topic. J. Fang et al. solved first-order delay differential equations with a single delay using an ANN [21]. In [22], Chih-Chun Hou et al. obtained approximate solutions of proportional delay differential equations using an ANN. All these artificial neural network approaches suffer from common problems: (1) they rely on time-consuming and computationally expensive numerical optimization algorithms; (2) they depend entirely on a trial solution, which is difficult to construct for higher dimensional problems. Recently, in [23], Manoj and Shagun obtained approximate solutions of differential equations using an optimization-free neural network approach in which the network weights are trained with the ELM algorithm [24]. In [25], the authors solved the first-order pantograph equation using the optimization-free ANN approach. Linear first-order delay differential-algebraic equations have been solved using a Legendre neural network in [26].

This work presents an orthogonal neural network with an extreme learning machine algorithm (ONN-ELM) to obtain approximate solutions of higher-order delay differential equations, neutral delay differential equations, and systems with multiple delays and variable coefficients. The ONN model is a particular case of the functional link neural network (FLNN) [12, 27–29]. It has the advantage of fast and very accurate learning. The entire procedure is much quicker than a traditional neural network because it removes the high-cost iteration procedure and trains the network weights using the Moore-Penrose generalized inverse. The following are the benefits of the proposed approach:

  • Since it is a single hidden layer neural network, we only need to train the output layer weights; the input layer weights are selected randomly.

  • We use the non-iterative extreme learning machine algorithm to train the output weights; no optimization technique is used in this procedure.

  • It is simple to implement, accurate compared to other numerical schemes mentioned in the literature, and runs quickly.

This work considers four different orthogonal polynomial-based neural networks: (i) Legendre neural network, (ii) Hermite neural network, (iii) Laguerre neural network, and (iv) Chebyshev neural network, each with ELM, for solving DDEs, NDDEs, and systems of NDDEs with multiple delays and variable coefficients. The interest is to find which of these four orthogonal neural networks produces the most accurate solutions.

The layout of this paper is as follows. In “Preliminaries” section, we present some definitions and properties of orthogonal polynomials and a description of the considered problems. In “Orthogonal neural network” section, we describe the architecture of the orthogonal neural network (ONN) with the extreme learning machine (ELM) algorithm. “Error analysis” section discusses the convergence and error analysis. The methodology of the proposed method is presented in “Methodology” section. Various numerical illustrations are presented in “Numerical illustrations” section, and a comparative study is given in “Comparative analysis” section.

Preliminaries

In this section, we first introduce basic definitions and some properties of orthogonal polynomials. Throughout the paper, we will use $P_n(x)$ to represent the orthogonal polynomial of order $n$.

Orthogonal polynomial

Definition 1

The orthogonal polynomials are a special class of polynomials $P_n(x)$ defined on $[a, b]$ that follow an orthogonality relation

$$\int_a^b g(x)\, P_m(x)\, P_n(x)\, dx = \delta_{m,n}\, k_n,$$

where $n, m \in \mathbb{N}$, $\delta_{m,n}$ is the Kronecker delta, $g(x)$ is a weight function, and $k_n = \int_a^b g(x)\,[P_n(x)]^2\, dx$.

Remark

  1. If the weight function is $g(x) = 1$, then the orthogonal polynomial $P_n(x)$ is called the Legendre polynomial.

  2. If the weight function is $g(x) = (1 - x^2)^{-1/2}$, then the orthogonal polynomial $P_n(x)$ is called the Chebyshev polynomial of the first kind.

  3. If the weight function is $g(x) = e^{-x^2}$, then the orthogonal polynomial $P_n(x)$ is called the Hermite polynomial.

  4. If the weight function is $g(x) = e^{-x}$, then the orthogonal polynomial $P_n(x)$ is called the Laguerre polynomial.

Properties of orthogonal polynomials

The following are some of the remarkable properties of a set of orthogonal polynomials:

  • Each polynomial $P_n(t)$ in a set of orthogonal polynomials $\{P_0(t), \ldots, P_n(t), \ldots\}$ is orthogonal to any polynomial of degree $< n$.

  • Any set of orthogonal polynomials has a recurrence formula connecting any three consecutive polynomials in the sequence; that is, the relation $P_{n+1}(t) = (a_n t + b_n)\, P_n(t) - c_n P_{n-1}(t)$ holds, with constants $a_n, b_n, c_n$ depending on $n$ (see the sketch after this list).

  • The zeroes of orthogonal polynomials are real numbers.

  • There is always a zero of the orthogonal polynomial $P_{n+1}(t)$ between two consecutive zeroes of $P_n(t)$.
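As a concrete instance of the recurrence property above, the Chebyshev polynomials of the first kind satisfy $T_{n+1}(t) = 2t\, T_n(t) - T_{n-1}(t)$ (i.e., $a_n = 2$, $b_n = 0$, $c_n = 1$). A minimal sketch of evaluating such a basis in Python (our own illustration, not code from the paper):

```python
import numpy as np

def chebyshev_basis(t, n):
    """Evaluate T_0(t), ..., T_n(t) with the three-term recurrence
    T_{k+1}(t) = 2*t*T_k(t) - T_{k-1}(t)."""
    t = np.asarray(t, dtype=float)
    basis = [np.ones_like(t), t]                 # T_0 = 1, T_1 = t
    for _ in range(2, n + 1):
        basis.append(2 * t * basis[-1] - basis[-2])
    return np.stack(basis[:n + 1])               # shape (n + 1, len(t))
```

The same loop works for the Legendre, Hermite, and Laguerre families after swapping in their respective recurrence coefficients.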

Moore-Penrose generalized inverse

In this section, the Moore-Penrose generalized inverse is introduced.

Obtaining the solution of a general linear system $Ax = y$ can be problematic when $A$ is a singular matrix or is not even square. The Moore-Penrose generalized inverse can be used to overcome such difficulties. The term generalized inverse is sometimes used as a synonym for pseudoinverse. More precisely, we define the Moore-Penrose generalized inverse as follows:

Definition 2

[30] A matrix $B$ of order $n \times m$ is the Moore-Penrose generalized inverse of a matrix $A$ of order $m \times n$ if the following hold:

$$ABA = A, \quad BAB = B, \quad (AB)^T = AB, \quad (BA)^T = BA,$$

where $A^T$ denotes the transpose of matrix $A$. The Moore-Penrose generalized inverse of matrix $A$ is denoted by $A^{\dagger}$.

Definition 3

$x_0 \in \mathbb{R}^n$ is said to be a minimum norm least-squares solution of a general linear system $Ax = y$ if, for any $y \in \mathbb{R}^m$,

$$\|x_0\| \le \|x\|, \quad \forall\, x \in \{x : \|Ax - y\| \le \|Az - y\|,\ \forall\, z \in \mathbb{R}^n\},$$

where $\|\cdot\|$ is the Euclidean norm.

In other words, if a solution x0 has the smallest norm among all the least-squares solutions, it is considered to be a minimum norm least-squares solution of the general linear system Ax=y.

Theorem 1

[30] Let $B$ be a matrix such that $By$ is a minimum norm least-squares solution of the linear system $Ax = y$ for every $y \in \mathbb{R}^m$. Then it is necessary and sufficient that $B = A^{\dagger}$, the Moore-Penrose generalized inverse of matrix $A$.
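Numerically, the Moore-Penrose generalized inverse is available as `numpy.linalg.pinv`; a small sanity check of Definition 2 and Theorem 1 on an arbitrary rectangular matrix (our own illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 3))      # rectangular, so no ordinary inverse exists
B = np.linalg.pinv(A)                # Moore-Penrose generalized inverse of A

# The four conditions of Definition 2:
assert np.allclose(A @ B @ A, A)
assert np.allclose(B @ A @ B, B)
assert np.allclose((A @ B).T, A @ B)
assert np.allclose((B @ A).T, B @ A)

# Theorem 1: B @ y is the minimum norm least-squares solution of Ax = y.
y = rng.standard_normal(5)
assert np.allclose(B @ y, np.linalg.lstsq(A, y, rcond=None)[0])
```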

Problem definition

In this subsection, we present the general forms of the pantograph equation, the higher order delay differential equation, the higher order neutral delay differential equation, and the system of higher order delay differential equations with variable coefficients and multiple delays.

The generalized Pantograph equation

Pantograph type equation arises as a mathematical model in the study of the wave motion of the overhead supply line to an electric locomotive. The following equation gives the generalized form of a pantograph type equation with multiple delays:

$$z'(t) = a(t)\, z(t) + \sum_{i=1}^{k} b_i(t)\, z(q_i t) + \sum_{j=1}^{l} c_j(t)\, z'(\hat q_j t) + g(t), \tag{1}$$

with initial condition

$$z(t_0) = z_0, \tag{2}$$

where $g(t)$, $a(t)$, $b_i(t)$, and $c_j(t)$ are continuous functions, $0 < q_i, \hat q_j < 1$ for some $k, l \in \mathbb{N}$, and $t \in [t_0, t_1]$ for some $t_0, t_1 \in \mathbb{R}$.

Higher order DDEs and NDDEs

  • Consider the general form of higher-order DDEs with multiple delays:
    $$z^{(k)}(t) = f\big(t, z(t), \ldots, z^{(k-1)}(t), z(q_1 t), \ldots, z(q_n t)\big), \tag{3}$$
    with initial conditions
    $$z(t_0) = z_0, \quad z'(t_0) = z_1, \quad \ldots, \quad z^{(k-1)}(t_0) = z_{k-1}, \tag{4}$$
    where $q_i \in (0, 1)$ for $i = 1, \ldots, n$ and $z^{(k)}$ denotes the $k$th derivative of $z(t)$.
  • Consider the general form of higher-order NDDEs with multiple delays:
    $$z^{(k)}(t) = f\big(t, z(t), \ldots, z^{(k-1)}(t), z(q_1^1 t), \ldots, z(q_{n_1}^1 t), z'(q_1^2 t), \ldots, z'(q_{n_2}^2 t), \ldots, z^{(k)}(q_1^{k+1} t), \ldots, z^{(k)}(q_{n_{k+1}}^{k+1} t)\big), \tag{5}$$
    with initial conditions
    $$z(t_0) = z_0, \quad z'(t_0) = z_1, \quad \ldots, \quad z^{(k-1)}(t_0) = z_{k-1}, \tag{6}$$
    where all $q_i^j \in (0, 1)$ for $j = 1, \ldots, k+1$, $i = 1, \ldots, n_j$, $n_j, k \in \mathbb{N}$, and $z^{(k)}$ denotes the $k$th derivative of $z(t)$.

Higher order system of DDE

Consider the general form of a higher order coupled neutral delay differential equation with multiple delays:

$$z_1^{(k)}(t) = f\big(t, z_1(t), \ldots, z_1^{(k-1)}(t), z_2(t), \ldots, z_2^{(k)}(t), z_1(q_1^1 t), \ldots, z_1(q_{n_1}^1 t), z_2(p_1^1 t), \ldots, z_2(p_{m_1}^1 t), z_1'(q_1^2 t), \ldots, z_1'(q_{n_2}^2 t), z_2'(p_1^2 t), \ldots, z_2'(p_{m_2}^2 t), \ldots, z_1^{(k)}(q_1^{k+1} t), \ldots, z_1^{(k)}(q_{n_{k+1}}^{k+1} t), z_2^{(k)}(p_1^{k+1} t), \ldots, z_2^{(k)}(p_{m_{k+1}}^{k+1} t)\big),$$
$$z_1(t_0) = z_0^1, \quad z_1'(t_0) = z_1^1, \quad \ldots, \quad z_1^{(k-1)}(t_0) = z_{k-1}^1, \tag{7}$$

$$z_2^{(k)}(t) = g\big(t, z_1(t), \ldots, z_1^{(k-1)}(t), z_2(t), \ldots, z_2^{(k)}(t), z_1(r_1^1 t), \ldots, z_1(r_{l_1}^1 t), z_2(s_1^1 t), \ldots, z_2(s_{h_1}^1 t), z_1'(r_1^2 t), \ldots, z_1'(r_{l_2}^2 t), z_2'(s_1^2 t), \ldots, z_2'(s_{h_2}^2 t), \ldots, z_1^{(k)}(r_1^{k+1} t), \ldots, z_1^{(k)}(r_{l_{k+1}}^{k+1} t), z_2^{(k)}(s_1^{k+1} t), \ldots, z_2^{(k)}(s_{h_{k+1}}^{k+1} t)\big),$$
$$z_2(t_0) = z_0^2, \quad z_2'(t_0) = z_1^2, \quad \ldots, \quad z_2^{(k-1)}(t_0) = z_{k-1}^2, \tag{8}$$

where $n_j, m_j, l_j, h_j \in \mathbb{N}$ and all $q_{i_1}^j, p_{i_2}^j, r_{i_3}^j, s_{i_4}^j \in (0, 1)$ for $j = 1, \ldots, k+1$, $i_1 = 1, \ldots, n_j$, $i_2 = 1, \ldots, m_j$, $i_3 = 1, \ldots, l_j$, $i_4 = 1, \ldots, h_j$.

Orthogonal neural network

In this section, we introduce the structure of a single-layered orthogonal neural network(ONN) model with an extreme learning machine(ELM) algorithm for training the network weights.

Structure of orthogonal neural network (ONN)

The orthogonal neural network (ONN) is a single-layered feed-forward neural network consisting of one input neuron $t$, one output neuron $N(t, a, w)$, and a hidden layer replaced by the orthogonal functional expansion block. The architecture of an orthogonal neural network is depicted in Fig. 1.

Figure 1. The structure of the orthogonal neural network.

Consider a 1-dimensional input neuron $t$. The enhanced pattern is obtained by the orthogonal functional expansion block as follows:

$$[P_0(a_0 t), P_1(a_1 t), \ldots, P_n(a_n t)].$$

The output of the orthogonal neural network is $N(t, a, w) = \sum_{i=0}^{n} w_i P_i(a_i t)$, where the $a_i$'s are randomly selected fixed weights and the $w_i$'s are the weights to be trained.
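A minimal sketch of this forward pass in Python, using Chebyshev polynomials for the expansion block (the choice of unit fixed weights $a_i$ below is our assumption; the paper does not prescribe one):

```python
import numpy as np
from numpy.polynomial import chebyshev as C

def onn_output(t, w, a):
    """N(t, a, w) = sum_i w_i * P_i(a_i * t), with P_i = T_i (Chebyshev)."""
    t = np.asarray(t, dtype=float)
    cols = [C.Chebyshev.basis(i)(a[i] * t) for i in range(len(w))]
    return np.stack(cols, axis=-1) @ np.asarray(w)

n = 8
a = np.ones(n + 1)        # fixed input weights (could also be drawn randomly)
w = np.zeros(n + 1)       # output weights, to be trained by the ELM step below
print(onn_output(np.linspace(0, 1, 5), w, a))
```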

Extreme learning machine (ELM) algorithm

For given sample points $(t_j, y_j)$, $t_j \in \mathbb{R}^n$ and $y_j \in \mathbb{R}$, $j = 0, 1, \ldots, m$, a single-layer feed-forward neural network with $(n+1)$ hidden neurons has the output

$$\sum_{i=0}^{n} w_i\, g_i(a_i t_j), \quad j = 0, 1, \ldots, m,$$

where $g_i$ is the activation function of the $i$-th neuron in the hidden layer, the $a_i$'s are the randomly selected fixed weights between the input layer and the hidden layer, and the $w_i$'s are the weights between the hidden layer and the output, which need to be trained.

When the neural network exactly reproduces the given data, i.e., the output of the neural network equals the actual data, the following relation holds:

$$\sum_{i=0}^{n} w_i\, g_i(a_i t_j) = y_j, \quad j = 0, 1, \ldots, m. \tag{9}$$

Equation (9) can be written in matrix form as

$$Aw = b, \tag{10}$$

where the hidden layer output matrix $A$ is defined as

$$A = \begin{bmatrix} g_0(a_0 t_0) & g_1(a_1 t_0) & \cdots & g_n(a_n t_0) \\ g_0(a_0 t_1) & g_1(a_1 t_1) & \cdots & g_n(a_n t_1) \\ \vdots & \vdots & & \vdots \\ g_0(a_0 t_m) & g_1(a_1 t_m) & \cdots & g_n(a_n t_m) \end{bmatrix}, \tag{11}$$

and $w = [w_0, w_1, \ldots, w_n]^T$, $b = [y_0, y_1, \ldots, y_m]^T$.

For the given training points $t_j \in \mathbb{R}^n$ and the fixed weights $a_i$, the matrix $A$ can be computed, and the weights $w_i$ are then obtained by solving the linear system $Aw = b$.

Theorem 2

The system $Aw = b$ is solvable in the following cases:

  1. If $A$ is a square, invertible matrix, then $w = A^{-1} b$.

  2. If $A$ is a rectangular matrix, then $w = A^{+} b$, and $w$ is the minimum norm least-squares solution of $Aw = b$. Here $A^{+}$ is the pseudoinverse of $A$.

  3. If $A$ is a singular matrix, then $w = A^{+} b$ with $A^{+} = A^T(\lambda I + A A^T)^{-1}$, where $\lambda$ is a regularization coefficient whose value can be set according to the specific instance.
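In code, all three cases of Theorem 2 collapse to a single non-iterative solve; a hedged sketch, where `A` and `b` are the matrices of Eq. (10) and `lam` implements the regularized pseudoinverse of case 3:

```python
import numpy as np

def elm_train(A, b, lam=0.0):
    """Train the output weights w of Eq. (10) in one shot, without iteration."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    if lam > 0.0:                        # case 3: A^+ = A^T (lam*I + A A^T)^(-1)
        m = A.shape[0]
        return A.T @ np.linalg.solve(lam * np.eye(m) + A @ A.T, b)
    # cases 1-2: minimum norm least-squares solution w = A^+ b
    return np.linalg.lstsq(A, b, rcond=None)[0]
```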

Error analysis

This section will discuss the convergence result and error analysis of the ONN-ELM method for solving the delay and neutral delay differential equations.

Theorem 3

[24] Let the single layer feed-forward orthogonal neural network $N(t, a, w)$ be an approximate solution of a one-dimensional neutral delay differential equation. Then, for $m + 1$ arbitrary distinct sample points $(t_j, y_j)$, $j = 0, 1, \ldots, m$, with $t_j, y_j \in \mathbb{R}$, the orthogonal expansion layer output matrix $A$ is invertible and $\|Aw - b\| = 0$.

Theorem 4

Let $z \in C^{\infty}(t_0, t_m)$, let $\hat z_n = N(t, a, w)$ be the orthogonal neural network with $n$ neurons in the hidden layer, and let $e_n$ be the absolute error with $n$ hidden neurons. Then $e_n \to 0$ as $n \to \infty$.

Proof

The Taylor expansion formula gives the following expression for $z(t)$ on $(t_0, t_m)$:

$$z(t) = z(t_0^+) + z'(t_0^+)(t - t_0) + \frac{z''(t_0^+)}{2!}(t - t_0)^2 + \cdots + \frac{z^{(n)}(c)}{n!}(t - t_0)^n, \quad c \in (t_0, t). \tag{12}$$

Let us define $z_n(t) = \sum_{i=0}^{n-1} \frac{z^{(i)}(t_0^+)}{i!}(t - t_0)^i$; then we get

$$z(t) - z_n(t) = \frac{1}{n!}\, z^{(n)}(c)(t - t_0)^n. \tag{13}$$

Let $L = \operatorname{span}\{P_0(t), P_1(t), \ldots, P_n(t)\}$ and let $\hat z_n(t)$ be the best approximation of $z(t)$ in $L$, given as $\hat z_n(t) = \sum_{i=0}^{n} w_i P_i(a_i t)$, where the $w_i$'s are the weights obtained by the ELM algorithm. We get

$$\|z(t) - \hat z_n(t)\| \le \|z(t) - \bar z(t)\|, \quad \forall\, \bar z(t) \in L. \tag{14}$$

In particular, taking $\bar z(t) = z_n(t)$ (which lies in $L$), we have

$$e_n(t) = \|z(t) - \hat z_n(t)\| \le \|z(t) - z_n(t)\| = \left\| \frac{1}{n!}\, z^{(n)}(c)(t - t_0)^n \right\|. \tag{15}$$

Thus,

$$e_n(t) \le \frac{\|z^{(n)}(c)\|}{n!}\, (t - t_0)^n \le \frac{M}{2^n}, \tag{16}$$

where $M = \max \|z^{(n)}(c)(t - t_0)^n\|$ for $t \in (t_0, t_m)$, and the last inequality uses the fact that $n! \ge 2^n$ for $n \ge 4$.

Moreover, from Eq. (16) we deduce that $e_n(t) \to 0$ for large values of $n$. This shows that the ONN has high representational ability and can approximate the exact solution with almost no error.
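The predicted decay can be checked empirically by fitting a smooth function in $\operatorname{span}\{P_0, \ldots, P_n\}$ by least squares and tracking the maximum error as $n$ grows; a small experiment of our own (not from the paper):

```python
import numpy as np
from numpy.polynomial import chebyshev as C

t = np.linspace(0.0, 1.0, 200)
z = np.exp(-t) * np.sin(t)                 # a smooth test function

for n in (2, 4, 6, 8, 10):
    zhat = C.Chebyshev.fit(t, z, deg=n)    # least-squares fit in span{T_0, ..., T_n}
    print(f"n = {n:2d}  max error = {np.max(np.abs(z - zhat(t))):.2e}")
# For this smooth z, the printed error decays at least as fast as 2^{-n},
# consistent with the bound in Eq. (16).
```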

Methodology

This section explains the method for obtaining an approximate solution of a second-order NDDE using the ONN-ELM algorithm. The procedure extends readily to higher-order NDDEs, and the higher-order DDE is a special case of the higher-order NDDE.

Consider the general form of a linear second-order NDDE:

$$z''(t) + a(t)\, z'(t) + b(t)\, z(t) + \sum_{j=1}^{m_1} c_j(t)\, z(\alpha_j t) + \sum_{k=1}^{m_2} d_k(t)\, z'(\beta_k t) + \sum_{l=1}^{m_3} e_l(t)\, z''(\gamma_l t) = f(t), \quad t \in (a, b), \tag{17}$$

with initial conditions $z(a) = z_0$ and $z'(a) = z_1$, or boundary conditions $z(a) = z_2$ and $z(b) = z_3$, where $z_0, z_1, z_2, z_3 \in \mathbb{R}$; $a(t), b(t), c_j(t), d_k(t), e_l(t), f(t)$ are continuously differentiable functions on $(a, b)$; and $m_1, m_2, m_3 \in \mathbb{N}$.

Using ONN-ELM with $n$ neurons, an approximate solution of Eq. (17) is obtained in the form

$$\hat z_n(t) = \sum_{i=0}^{n} w_i P_i(t), \tag{18}$$

where the $w_i$'s are the output weights that need to be trained and $P_i(t)$ is the $i$-th orthogonal polynomial.

Since the approximate solution obtained by the ONN-ELM algorithm is a linear combination of orthogonal polynomials, it is infinitely differentiable, and we have

$$\hat z_n'(t) = \sum_{i=0}^{n} w_i P_i'(t), \tag{19}$$
$$\hat z_n''(t) = \sum_{i=0}^{n} w_i P_i''(t), \tag{20}$$
$$\sum_{j=1}^{m_1} \hat z_n(\alpha_j t) = \sum_{j=1}^{m_1} \sum_{i=0}^{n} w_i P_i(\alpha_j t), \tag{21}$$
$$\sum_{k=1}^{m_2} \hat z_n'(\beta_k t) = \sum_{k=1}^{m_2} \sum_{i=0}^{n} \beta_k\, w_i P_i'(\beta_k t), \tag{22}$$
$$\sum_{l=1}^{m_3} \hat z_n''(\gamma_l t) = \sum_{l=1}^{m_3} \sum_{i=0}^{n} \gamma_l^2\, w_i P_i''(\gamma_l t). \tag{23}$$

Substituting Eqs. (18)–(23) into the second order neutral delay differential equation (17), we have

$$\sum_{i=0}^{n} w_i P_i''(t) + a(t) \sum_{i=0}^{n} w_i P_i'(t) + b(t) \sum_{i=0}^{n} w_i P_i(t) + \sum_{i=0}^{n} w_i \sum_{j=1}^{m_1} c_j(t)\, P_i(\alpha_j t) + \sum_{i=0}^{n} w_i \sum_{k=1}^{m_2} \beta_k\, d_k(t)\, P_i'(\beta_k t) + \sum_{i=0}^{n} w_i \sum_{l=1}^{m_3} \gamma_l^2\, e_l(t)\, P_i''(\gamma_l t) = f(t). \tag{24}$$

We can write Eq. (24) as

$$\sum_{i=0}^{n} w_i A_i(t) = f(t), \tag{25}$$

where

$$A_i(t) = P_i''(t) + a(t)\, P_i'(t) + b(t)\, P_i(t) + \sum_{j=1}^{m_1} c_j(t)\, P_i(\alpha_j t) + \sum_{k=1}^{m_2} \beta_k\, d_k(t)\, P_i'(\beta_k t) + \sum_{l=1}^{m_3} \gamma_l^2\, e_l(t)\, P_i''(\gamma_l t).$$

Using the discretization of the interval $[a, b]$ as $a = t_0 < t_1 < \cdots < t_m = b$ for $m \in \mathbb{N}$, define $f_j = f(t_j)$. Equation (25) is to be satisfied at these discretized points, that is,

$$\sum_{i=0}^{n} w_i A_i(t_j) = f(t_j), \quad j = 0, 1, \ldots, m. \tag{26}$$

Equation (26) can be written as the system

$$A_1 w = b_1,$$

where $w = [w_0, w_1, \ldots, w_n]^T$,

$$A_1 = \begin{bmatrix} A_0(t_0) & A_1(t_0) & \cdots & A_n(t_0) \\ A_0(t_1) & A_1(t_1) & \cdots & A_n(t_1) \\ \vdots & \vdots & & \vdots \\ A_0(t_m) & A_1(t_m) & \cdots & A_n(t_m) \end{bmatrix},$$

and $b_1 = [f(t_0), f(t_1), \ldots, f(t_m)]^T$.

Case 1: Consider Eq. (17) with the initial conditions. Appending the rows enforcing $z(a) = z_0$ and $z'(a) = z_1$ gives the linear system $Aw = b$:

$$\underbrace{\begin{bmatrix} A_0(t_0) & A_1(t_0) & \cdots & A_n(t_0) \\ \vdots & \vdots & & \vdots \\ A_0(t_m) & A_1(t_m) & \cdots & A_n(t_m) \\ P_0(a) & P_1(a) & \cdots & P_n(a) \\ P_0'(a) & P_1'(a) & \cdots & P_n'(a) \end{bmatrix}}_{A} \underbrace{\begin{bmatrix} w_0 \\ w_1 \\ \vdots \\ w_n \end{bmatrix}}_{w} = \underbrace{\begin{bmatrix} f_0 \\ f_1 \\ \vdots \\ f_m \\ z_0 \\ z_1 \end{bmatrix}}_{b}.$$

Case 2: Consider Eq. (17) with the boundary conditions. Appending the rows enforcing $z(a) = z_2$ and $z(b) = z_3$ gives the following linear system $Aw = b$ for the NDDE:

$$\underbrace{\begin{bmatrix} A_0(t_0) & A_1(t_0) & \cdots & A_n(t_0) \\ \vdots & \vdots & & \vdots \\ A_0(t_m) & A_1(t_m) & \cdots & A_n(t_m) \\ P_0(a) & P_1(a) & \cdots & P_n(a) \\ P_0(b) & P_1(b) & \cdots & P_n(b) \end{bmatrix}}_{A} \underbrace{\begin{bmatrix} w_0 \\ w_1 \\ \vdots \\ w_n \end{bmatrix}}_{w} = \underbrace{\begin{bmatrix} f_0 \\ f_1 \\ \vdots \\ f_m \\ z_2 \\ z_3 \end{bmatrix}}_{b}.$$

To calculate the weight vector $w$ of the network, we use the extreme learning machine algorithm, that is,

$$w = A^{\dagger} b, \tag{27}$$

where $A^{\dagger} = (A^T A)^{-1} A^T$, which yields the least-squares solution of the system $Aw = b$.

Note: A similar methodology can be used for higher order neutral delay differential equations and for systems of higher order neutral delay differential equations.

  • Steps for solving NDDEs using the ONN-ELM algorithm (a minimal code sketch follows these steps):

  1. Discretize the domain as $a = t_0 < t_1 < t_2 < \cdots < t_m = b$.

  2. Construct the approximate solution by using the orthogonal polynomials as activation functions, that is,
     $$N(t, w) = \sum_{i=0}^{n} w_i P_i(a_i t),$$
     where the $a_i$'s are the randomly generated fixed weights.

  3. At the discrete points, substitute the approximate solution and its derivatives into the differential equation and its initial or boundary conditions to obtain the system of equations $Aw = b$.

  4. Solve the system of equations $Aw = b$ by the ELM algorithm to obtain the network weights $w_i$.

  5. Substitute the values of the $w_i$'s to get the approximate solution of the NDDE.
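A minimal end-to-end sketch of the five steps above, applied to Example 6.2 below ($z''(t) = \frac{3}{4} z(t) + z(\frac{t}{2}) + z'(\frac{t}{2}) + 0.5\, z''(\frac{t}{2}) + f(t)$, $z(0) = z'(0) = 0$, exact solution $z = t^2$). The delayed derivatives are taken as $z^{(k)}$ evaluated at $t/2$, and the fixed weights are $a_i = 1$; both are our reading of the method, not code released by the authors:

```python
import numpy as np
from numpy.polynomial import chebyshev as C

def solve_example(n=9, m=10):
    t = np.linspace(0.0, 1.0, m)                    # step 1: collocation points
    f = -t**2 - t + 1.0

    # Steps 2-3: residual columns A_i(t) of Eq. (25) for this problem.
    cols = []
    for i in range(n + 1):
        P = C.Chebyshev.basis(i)                    # P_i = T_i, with a_i = 1
        dP, d2P = P.deriv(1), P.deriv(2)
        cols.append(d2P(t) - 0.75 * P(t) - P(t / 2) - dP(t / 2) - 0.5 * d2P(t / 2))
    A = np.column_stack(cols)

    # Append the two initial-condition rows (Case 1): z(0) = 0, z'(0) = 0.
    A = np.vstack([A,
                   [C.Chebyshev.basis(i)(0.0) for i in range(n + 1)],
                   [C.Chebyshev.basis(i).deriv(1)(0.0) for i in range(n + 1)]])
    b = np.concatenate([f, [0.0, 0.0]])

    w = np.linalg.lstsq(A, b, rcond=None)[0]        # step 4: ELM solve, w = A†b
    return C.Chebyshev(w)                           # step 5: closed-form approximation

zhat = solve_example()
ts = np.linspace(0.0, 1.0, 5)
print(np.abs(zhat(ts) - ts**2))                     # error against the exact z = t^2
```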

Numerical illustrations

This section considers higher order delay and neutral delay differential equations with multiple delays and variable coefficients, as well as systems of delay and neutral delay differential equations. In all the test examples, we use the special orthogonal polynomial-based neural networks, namely the Legendre neural network, Laguerre neural network, Chebyshev neural network, and Hermite neural network. Further, to show the reliability and power of the presented method, we compare the approximate solutions with the exact solution. All computations are carried out using Python 3.9.7 on an Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz 1.80 GHz and the Windows 10 operating system. We calculate the relative error, which is defined as follows:

$$\text{Relative error} = \left| \frac{\text{exact solution} - \text{numerical solution}}{\text{exact solution}} \right|.$$

Example 6.1

[22] Consider the second-order boundary value proportional delay differential equation with variable coefficients

$$z''(t) = 0.5\, z(t) + e^{-0.5t}\, z\!\left(\frac{t}{2}\right) - 2e^{-t}, \quad z(0) = 0, \quad z(1) = e^{-1}.$$

The exact solution of the given equation is $z(t) = t e^{-t}$.

We employ four ONNs to obtain the approximate solution of the given second-order DDE with variable coefficients. We choose ten uniformly distributed points in [0, 1]. The relative errors for all ONNs are shown in Fig. 3. Obtained relative errors for different orthogonal neural networks are reported in Table 1, and we compare the approximate solutions with the exact solution in Fig. 2.

Figure 3. Error graph for different orthogonal neural networks with different numbers of neurons for Example 6.1.

Table 1.

The relative error for Example 6.1 with different orthogonal neural networks.

t Legendre neural network Hermite neural network Laguerre neural network Chebyshev neural network
0.1 1.56e−08 2.26e−08 1.39e−08 5.61e−08
0.2 6.20e−08 5.89e−08 6.49e−08 4.54e−08
0.3 4.57e−08 4.39e−08 4.89e−08 4.12e−08
0.4 1.07e−08 9.71e−09 1.40e−08 2.09e−08
0.5 1.94e−08 1.88e−08 2.26e−08 4.57e−08
0.6 5.66e−08 5.64e−08 5.98e−08 1.06e−08
0.7 6.84e−08 6.86e−08 7.17e−08 2.73e−08
0.8 5.09e−08 5.14e−08 5.42e−08 1.32e−08
0.9 6.99e−08 7.08e−08 7.34e−08 1.79e−08
1 7.08e−10 4.55e−10 2.93e−09 4.63e−09

Figure 2. Comparison of the exact solution with the obtained approximate solutions of Example 6.1.

Table 1 and Fig. 3 clearly show that the Chebyshev polynomial-based ONN performs best, with maximum relative error $5.61 \times 10^{-8}$. Table 2 compares the maximum relative error for Example 6.1 using the Legendre, Laguerre, Hermite, and Chebyshev neural networks with various numbers of neurons ($n$ = 5, 8, and 11), along with the respective computational times. Additionally, Table 2 shows that all four neural networks satisfy Theorem 4; for $n = 5$ all four orthogonal neural networks show similar accuracy, while the Chebyshev neural network performs better for $n = 8$ and $11$.

Table 2.

Comparison of maximum relative error for Example 6.1 with different numbers of neurons.

n Legendre Laguerre Hermite Chebyshev
Time(s) Error Time(s) Error Time(s) Error Time(s) Error
5 0.004 0.0049 0.009 0.0049 0.002 0.0049 0.008 0.0049
8 0.012 7.07e−08 0.011 6.97e−08 0.003 7.07e−08 0.043 7.07e−08
11 0.019 1.82e−11 0.011 7.74e−06 0.003 6.81e−10 0.043 3.93e−12

Significant values are in bold.

Example 6.2

[2] Consider the second-order neutral delay differential equation with multiple delays

$$z''(t) = \frac{3}{4}\, z(t) + z\!\left(\frac{t}{2}\right) + z'\!\left(\frac{t}{2}\right) + 0.5\, z''\!\left(\frac{t}{2}\right) + f(t), \quad z(0) = 0, \quad z'(0) = 0,$$

where $f(t) = -t^2 - t + 1$, $t \in (0, 1)$.

The exact solution of the given equation is $z(t) = t^2$.

This equation is solved using the four ONN architectures with ten uniformly distributed training points and with 6, 8, and 9 neurons in the hidden layer. Relative errors for the different ONNs with 6, 8, and 9 neurons as activation functions are reported in Table 3. Figure 4 shows the error graphs of the different orthogonal neural networks, and a comparison of the approximate solutions with the exact solution is shown in Fig. 5.

Table 3.

Comparison of maximum relative error for Example 6.2 with different numbers of neurons.

n Legendre Laguerre Hermite Chebyshev
Time(s) Error Time(s) Error Time(s) Error Time(s) Error
6 0.007 8.25e−14 0.004 2.06e−12 0.002 2.87e−13 0.001 3.17e−14
8 0.009 4.94e−08 0.004 7.29e−09 0.004 7.73e−13 0.002 7.19e−14
9 0.019 2.22e−14 0.069 1.57e−08 0.017 6.13e−13 0.022 6.77e−15

Significant values are in bold.

Figure 4. Error graph for different orthogonal neural networks with different numbers of neurons for Example 6.2.

Figure 5. Comparison of the exact solution with the obtained approximate solutions of Example 6.2.

From Table 4 and Fig. 4 we conclude that, for the given second-order neutral delay differential equation, the Chebyshev polynomial-based ONN performs best, with maximum relative error $7.19 \times 10^{-14}$. Additionally, Table 3 shows that all four neural networks satisfy Theorem 4.

Table 4.

The relative error for Example 6.2 with different orthogonal neural networks.

t Legendre neural network Hermite neural network Laguerre neural network Chebyshev neural network
0.1 4.94e−08 7.73e−13 7.29e−09 7.19e−14
0.2 1.19e−08 1.90e−13 1.32e−09 1.00e−14
0.3 5.05e−09 7.61e−14 3.83e−10 9.25e−15
0.5 1.49e−09 1.26e−14 8.31e−12 5.55e−15
0.6 8.88e−10 3.08e−16 3.37e−11 6.32e−15
0.7 5.20e−10 7.81e−15 5.12e−11 7.13e−15
0.8 2.81e−10 1.37e−14 5.75e−11 8.15e−15
0.9 1.17e−10 1.80e−14 5.85e−11 9.45e−15
1 2.66e−15 2.17e−14 5.68e−11 1.11e−14

Example 6.3

[2] Consider the second-order neutral delay differential equation with variable coefficients

$$z''(t) = z'\!\left(\frac{t}{2}\right) - \frac{t}{2}\, z''\!\left(\frac{t}{2}\right) + 2, \quad t \in (0, 1), \quad z(0) = 1, \quad z'(0) = 0.$$

The exact solution of the given equation is $z(t) = t^2 + 1$.

To obtain the approximate solution of the given equation, we use four ONNs with ten uniformly distributed training points in [0, 1] and with 8, 9, and 11 neurons as activation functions in the hidden layer. Relative errors for the different ONNs with different numbers of neurons are reported in Table 6. The exact and approximate solutions are compared in Fig. 7, and Fig. 6 shows the absolute relative errors of the four special ONNs.

Table 6.

Comparison of maximum relative error for Example 6.3 with different numbers of neurons.

n Legendre Laguerre Hermite Chebyshev
Time(s) Error Time(s) Error Time(s) Error Time(s) Error
8 0.007 3.50e−09 0.004 1.52e−10 0.002 1.52e−13 0.001 2.29e−15
9 0.009 3.07e−13 0.004 8.80e−11 0.004 1.69e−05 0.002 2.29e−15
11 0.019 3.07e−13 0.069 3.16e−11 0.017 1.80e−05 0.022 1.32e−15

Significant values are in bold.

Figure 7. Comparison of the exact solution with the obtained approximate solutions of Example 6.3.

Figure 6. Error graph for different orthogonal neural networks with different numbers of neurons for Example 6.3.

From Table 5 and Fig. 6, we conclude that for the given second-order neutral delay differential equation, the Chebyshev polynomial-based ONN provides the most accurate solution, with maximum relative error $2.29 \times 10^{-15}$. Additionally, Table 6 shows that all four neural networks satisfy Theorem 4.

Table 5.

The relative error for Example 6.3 with different orthogonal neural networks.

t Legendre neural network Hermite neural network Laguerre neural network Chebyshev neural network
0.0 3.50e−09 1.17e−13 1.52e−10 7.77e−16
0.1 3.46e−09 1.14e−13 8.90e−11 0.0
0.2 3.34e−09 1.01e−13 4.18e−11 1.06e−15
0.3 3.16e−09 8.06e−14 1.01e−11 1.83e−15
0.4 2.94e−09 5.28e−14 8.50e−12 2.29e−15
0.5 2.70e−09 1.98e−14 1.72e−11 2.13e−15
0.6 2.44e−09 1.58e−14 1.90e−11 1.63e−15
0.7 2.18e−09 5.27e−14 1.66e−11 1.04e−15
0.8 1.93e−09 8.93e−14 1.21e−11 2.70e−16
0.9 1.70e−09 1.24e−13 7.09e−12 4.90e−16
1.0 1.50e−09 1.52e−13 2.35e−12 8.88e−16

Example 6.4

[31] Consider the third-order pantograph equation

$$z'''(t) = t\, z''(2t) - z'(t) - z\!\left(\frac{t}{2}\right) + t\cos(2t) + \cos\!\left(\frac{t}{2}\right), \quad t \in (0, 1), \quad z(0) = 1, \quad z'(0) = 0, \quad z''(0) = -1.$$

The exact solution of the given equation is $z(t) = \cos(t)$.

To obtain the approximate solution of the given equation, we use four ONNs with ten uniformly distributed training points in [0, 1] and with 8, 11, and 13 neurons as activation functions in the hidden layer. Relative errors for the different ONNs with different numbers of neurons as activation functions are reported in Table 7. The exact and approximate solutions are compared in Fig. 8, and Fig. 9 shows the maximum relative errors of the four special ONNs with different numbers of neurons.

Table 7.

Comparison of maximum relative error for Example 6.4 with different numbers of neurons.

n Legendre Laguerre Hermite Chebyshev
Time(s) Error Time(s) Error Time(s) Error Time(s) Error
8 0.007 1.11e−05 0.004 1.11e−05 0.002 1.11e−05 0.001 1.11e−05
11 0.019 3.86e−08 0.004 6.56e−08 0.004 6.06e−08 0.004 3.86e−08
13 0.019 1.12e−09 0.069 3.11e−09 0.017 1.20e−06 0.022 3.77e−10

Significant values are in bold.

Figure 8. Comparison of the exact solution with the obtained approximate solutions of Example 6.4.

Figure 9. Error graph for different orthogonal neural networks with different numbers of neurons for Example 6.4.

From Table 8 and Fig. 9, we conclude that for the given third-order pantograph equation, the Chebyshev polynomial-based ONN provides the most accurate solution, with maximum relative error $3.77 \times 10^{-10}$. Additionally, Table 7 shows that all four orthogonal neural networks satisfy Theorem 4.

Table 8.

The relative error for Example 6.4 with different orthogonal neural networks.

t Legendre neural network Hermite neural network Laguerre neural network Chebyshev neural network
0 3.79e−10 7.39e−07 5.91e−10 1.00e−11
0.1 3.82e−10 5.75e−07 6.05e−10 6.11e−12
0.2 3.89e−10 3.94e−07 6.33e−10 5.81e−12
0.3 4.02e−10 2.05e−07 6.75e−10 2.65e−11
0.4 4.21e−10 1.32e−08 7.33e−10 5.73e−11
0.5 4.49e−10 1.79e−07 8.11e−10 9.98e−11
0.6 4.88e−10 3.71e−07 9.22e−10 1.54e−10
0.7 5.49e−10 5.65e−07 1.10e−09 2.20e−10
0.8 6.49e−10 7.63e−07 1.41e−09 2.92e−10
0.9 8.21e−10 9.72e−07 2.00e−09 3.54e−10
1 1.12e−09 1.20e−06 3.11e−09 3.77e−10

Comparative analysis

This section presents a comparative study of the proposed approach on the first-order pantograph equation and systems of pantograph equations against other neural network approaches.

Example 7.1

[25] Consider the pantograph equation with variable coefficients and multiple delays

$$z'(t) = 0.5\, z(t) + 0.5\, e^{0.5t}\, z\!\left(\frac{t}{2}\right) + \frac{3}{8}\, t\, z\!\left(\frac{t}{3}\right) + g(t), \quad z(0) = 0,$$

where $g(t) = -\frac{1}{8}\, e^{-t} \left( 12\sin(t) + 4e^{t}\sin\!\left(\frac{t}{2}\right) - 8\cos(t) + 3t\, e^{\frac{2t}{3}} \sin\!\left(\frac{t}{3}\right) \right)$.

The exact solution of the given equation is $z(t) = \sin(t)\, e^{-t}$.

We employ four ONNs to obtain the approximate solution of the given pantograph equation with multiple delays. We choose eight uniformly distributed points in [0, 1], with 5, 8, and 11 neurons in the hidden layer. The relative errors for all four ONNs with different numbers of neurons are shown in Fig. 11. The obtained relative errors for the different orthogonal neural networks are reported in Table 9, and we compare the approximate solutions with the exact solution in Fig. 10.

Figure 11. Error graph for different orthogonal neural networks with different numbers of neurons for Example 7.1.

Table 9.

Comparison of maximum relative error for Example 7.1 with different numbers of neurons.

n Legendre Laguerre Hermite Chebyshev
Time(s) Error Time(s) Error Time(s) Error Time(s) Error
5 0.007 0.0014 0.004 0.0014 0.002 0.0014 0.001 0.0014
8 0.019 7.02e−08 0.004 6.56e−08 0.004 7.02e−08 0.004 7.02e−08
11 0.019 4.75e−11 0.069 1.06e−06 0.017 2.63e−09 0.022 3.40e−11

Significant values are in bold.

Figure 10. Comparison of the exact solution with the obtained approximate solutions of Example 7.1.

Table 9 and Fig. 11 clearly show that the Chebyshev polynomial-based ONN performs best, with maximum relative error $3.40 \times 10^{-11}$.

The maximum relative error of the simple feed-forward neural network (FNN) method in [25] is $4.05 \times 10^{-10}$, while the maximum relative error of the proposed FLNN-based ONN method is $3.40 \times 10^{-11}$. This comparison shows that the ONN method can obtain a more accurate solution than the simple FNN. Additionally, Table 9 shows that all four orthogonal neural networks satisfy Theorem 4.

Example 7.2

[25] Consider the system of pantograph equations

$$z_1'(t) = z_1(t) - z_2(t) + z_1\!\left(\frac{t}{2}\right) - e^{0.5t} + e^{-t}, \quad z_2'(t) = -z_1(t) - z_2(t) - z_2\!\left(\frac{t}{2}\right) + e^{-0.5t} + e^{t}, \quad z_1(0) = 1, \quad z_2(0) = 1.$$

The exact solutions of the given system of pantograph equations are $z_1(t) = e^{t}$ and $z_2(t) = e^{-t}$.

To obtain the approximate solutions of the given system of DDEs, we use four ONNs with twelve uniformly distributed training points in [0, 1] and with 5, 7, and 10 neurons in the orthogonal functional expansion block as activation functions. Relative errors for the different ONNs with 5, 7, and 10 neurons are reported in Tables 10 and 11. Comparisons between the exact and approximate solutions are presented in Figs. 12 and 13, while Figs. 14 and 15 show the absolute relative errors between the four special ONNs and the exact solutions.

Table 10.

Comparison of maximum relative error of z1(t) for Example 7.2 with different numbers of neurons.

n Legendre Laguerre Hermite Chebyshev
Time(s) Error Time(s) Error Time(s) Error Time(s) Error
5 0.005 0.06 0.004 0.0004 0.002 0.0004 0.001 0.0004
7 0.019 0.067 0.004 1.72e−06 0.004 1.93e−07 0.004 1.93e−07
10 0.019 3.23e−10 0.069 1.71e−06 0.017 1.81e−09 0.022 1.60e−10

Significant values are in bold.

Table 11.

Comparison of maximum relative error of z2(t) for Example 7.2 with different numbers of neurons.

n Legendre Laguerre Hermite Chebyshev
Time(s) Error Time(s) Error Time(s) Error Time(s) Error
5 0.005 0.312 0.004 0.0006 0.002 0.0006 0.001 0.0006
7 0.019 0.02 0.004 1.94e−06 0.004 2.00e−07 0.004 6.47e−09
10 0.019 1.42e−08 0.069 2.20e−08 0.017 5.91e−06 0.022 5.11e−10

Significant values are in bold.

Figure 14. Error graph of z1(t) for different orthogonal neural networks with different numbers of neurons for Example 7.2.

Figure 15. Error graph of z2(t) for different orthogonal neural networks with different numbers of neurons for Example 7.2.

Figure 12. Comparison of the exact solution z1(t) with the obtained approximate solutions of Example 7.2.

Figure 13. Comparison of the exact solution z2(t) with the obtained approximate solutions of Example 7.2.

From Tables 10 and 11, we conclude that for the given system of delay differential equations, the Chebyshev polynomial-based ONN provides the most accurate solutions for $z_1(t)$ and $z_2(t)$, with maximum relative errors $1.60 \times 10^{-10}$ and $5.11 \times 10^{-10}$, respectively.

The maximum relative errors of the simple feed-forward neural network (FNN) method in [25] for $z_1(t)$ and $z_2(t)$ with twelve training points are $1.93 \times 10^{-9}$ and $2.42 \times 10^{-9}$, respectively, whereas the maximum relative errors of the proposed FLNN-based ONN method for $z_1(t)$ and $z_2(t)$ with twelve training points are $1.60 \times 10^{-10}$ and $5.11 \times 10^{-10}$, respectively. This comparison shows that the ONN method can obtain more accurate solutions than the simple FNN. Additionally, Tables 10 and 11 show that all four orthogonal neural networks satisfy Theorem 4.

Example 7.3

[25] Consider the system of pantograph equations

$$z_1'(t) + z_2'(t) - 2 z_3'(t) = z_1(0.2t) + z_2(t) - z_2(0.3t) - 2 z_3(t) - z_3(0.3t) + z_3(0.5t) + f_1(t),$$
$$z_1'(t) - z_2'(t) = z_1(t) - z_3(t) + 3 z_1(0.5t) - z_2(0.5t) + z_2(0.3t) + z_3(0.7t) + f_2(t),$$
$$z_2'(t) - 2 z_3'(t) = z_1(t) + z_2(0.8t) + 3 z_2(t) - z_1(0.2t) - z_3(0.8t) + f_3(t),$$
$$z_1(0) = 0, \quad z_2(0) = 1, \quad z_3(0) = 1,$$

where $f_1(t) = \cos(0.3t) - \sin(0.2t) - \sin(t) + e^{0.3t} - e^{0.5t}$,

$f_2(t) = -\cos(0.3t) + \cos(0.5t) - 3\sin(0.5t) + \cos(t) - e^{0.7t} + e^{t}$,

$f_3(t) = -\cos(0.8t) + \sin(0.2t) - 3\cos(t) - 2\sin(t) + e^{0.8t} - 2e^{t}$.

The exact solutions of the given system of pantograph equations are $z_1(t) = \sin(t)$, $z_2(t) = \cos(t)$, and $z_3(t) = e^{t}$.

To obtain the approximate solution of the given system of DDEs, we use four ONNs with ten uniformly distributed training points in [0, 1] and with 7, 10, and 13 neurons in the orthogonal functional expansion block as activation functions. Relative errors for the different ONNs with 7, 10, and 13 neurons are reported in Tables 12, 13, and 14. Comparisons between the exact and approximate solutions are presented in Figs. 17, 18, and 19, while Figs. 16, 20, and 21 show the absolute relative errors between the four special ONNs and the exact solutions.

Table 12.

Comparison of maximum relative error of z1(t) for Example 7.3 with different numbers of neurons.

n Legendre Laguerre Hermite Chebyshev
Time(s) Error Time(s) Error Time(s) Error Time(s) Error
7 0.007 5.41e−07 0.004 6.20e−07 0.002 6.17e−07 0.001 6.16e−07
10 0.019 9.87e−08 0.004 6.97e−07 0.004 1.45e−09 0.004 1.53e−10
13 0.019 9.11e−11 0.069 5.79e−07 0.017 1.23e−09 0.022 1.98e−11

Significant values are in bold.

Table 13.

Comparison of maximum relative error of z2(t) for Example 7.3 with different numbers of neurons.

n Legendre Laguerre Hermite Chebyshev
Time(s) Error Time(s) Error Time(s) Error Time(s) Error
7 0.007 1.60e−07 0.004 3.18e−07 0.002 3.17e−05 0.001 2.93e−07
10 0.019 1.05e−09 0.004 5.71e−07 0.004 5.03e−10 0.004 3.70e−10
13 0.019 1.05e−09 0.069 5.24e−07 0.017 8.04e−09 0.022 3.11e−10

Significant values are in bold.

Table 14.

Comparison of maximum relative error of z3(t) for Example 7.3 with different numbers of neurons.

n Legendre Laguerre Hermite Chebyshev
Time(s) Error Time(s) Error Time(s) Error Time(s) Error
7 0.007 1.31e−07 0.004 1.76e−07 0.002 1.78e−07 0.001 1.74e−07
10 0.019 2.82e−08 0.004 4.05e−07 0.004 1.27e−08 0.004 3.91e−09
13 0.019 5.18e−08 0.069 5.65e−07 0.017 4.23e−08 0.022 5.74e−09

Significant values are in bold.

Figure 16. Error graph of z1(t) for different orthogonal neural networks with different numbers of neurons for Example 7.3.

Figure 17. Comparison of the exact solution z1(t) with the obtained approximate solutions of Example 7.3.

Figure 18. Comparison of the exact solution z2(t) with the obtained approximate solutions of Example 7.3.

Figure 19. Comparison of the exact solution z3(t) with the obtained approximate solutions of Example 7.3.

Figure 20. Error graph of z2(t) for different orthogonal neural networks with different numbers of neurons for Example 7.3.

Figure 21. Error graph of z3(t) for different orthogonal neural networks with different numbers of neurons for Example 7.3.

From Tables 12, 13 and 14, we conclude that for the given system of delay differential equations, the Chebyshev polynomial-based ONN provides the most accurate solutions for $z_1(t)$, $z_2(t)$, and $z_3(t)$, with maximum relative errors $1.98 \times 10^{-11}$, $3.11 \times 10^{-10}$, and $5.74 \times 10^{-9}$, respectively.

The maximum relative errors of the simple feed-forward neural network (FNN) method in [25] for $z_1(t)$, $z_2(t)$, and $z_3(t)$ with ten training points are $8.78 \times 10^{-8}$, $1.42 \times 10^{-8}$, and $1.93 \times 10^{-7}$, respectively, whereas the maximum relative errors of the proposed FLNN-based ONN method for $z_1(t)$, $z_2(t)$, and $z_3(t)$ with ten training points are $1.98 \times 10^{-11}$, $3.11 \times 10^{-10}$, and $5.74 \times 10^{-9}$, respectively. This comparison shows that the ONN method can obtain more accurate solutions than the simple FNN. Additionally, Tables 12, 13 and 14 show that all four orthogonal neural networks satisfy Theorem 4.

Conclusion

In this paper, we obtained approximate solutions of higher order NDDEs, as well as systems of DDEs with multiple delays and variable coefficients, using four single-layer orthogonal polynomial-based neural networks: (i) Legendre neural network, (ii) Chebyshev neural network, (iii) Hermite neural network, and (iv) Laguerre neural network. The ELM algorithm is used to train the network weights. It is proved that the relative error between the exact solution and the approximate solutions obtained by the ONNs is of order $2^{-n}$, where $n$ is the number of neurons. Further, it is shown that each orthogonal polynomial-based neural network provides an approximate solution that is in good agreement with the exact solution. However, it is observed that, among these four ONNs, the Chebyshev neural network provides the most accurate results.

The results in the “Numerical illustrations” and “Comparative analysis” sections demonstrate that the proposed method is simple to implement and is a powerful mathematical technique for obtaining approximate solutions of higher order NDDEs as well as systems of DDEs.

Acknowledgements

Chavda Divyesh Vinodbhai acknowledges the financial support provided by the MoE (Ministry of Education), Government of India, to carry out the work. The second author is thankful for the financial support received from the Indian Institute of Technology Madras.

Author contributions

The contributions of each author are equal.

Data availability

The data that support the findings of this investigation are available from the authors upon reasonable request. If necessary, contact sdubey@iitm.ac.in by email.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1. Ockendon JR, Tayler AB. The dynamics of a current collection system for an electric locomotive. Proc. R. Soc. Lond. A. 1971;322:447–468.
  • 2. Biazar J, Ghanbari B. The homotopy perturbation method for solving neutral functional-differential equations with proportional delays. J. King Saud Univ. Sci. 2012;24(1):33–37. doi: 10.1016/j.jksus.2010.07.026.
  • 3. Bahşi MM, Çevik M. Numerical solution of pantograph-type delay differential equations using perturbation-iteration algorithms. J. Appl. Math. 2015 (2015).
  • 4. Bahuguna D, Agarwal S. Approximations of solutions to neutral functional differential equations with nonlocal history conditions. J. Math. Anal. Appl. 2006;317(2):583–602. doi: 10.1016/j.jmaa.2005.07.010.
  • 5. Dubey SA. The method of lines applied to nonlinear nonlocal functional differential equations. J. Math. Anal. Appl. 2011;376(1):275–281. doi: 10.1016/j.jmaa.2010.10.024.
  • 6. Aibinu M, Thakur S, Moyo S. Exact solutions of nonlinear delay reaction-diffusion equations with variable coefficients. Partial Differ. Equ. Appl. Math. 2021;4:100170. doi: 10.1016/j.padiff.2021.100170.
  • 7. Mahata A, Paul S, Mukherjee S, Roy B. Stability analysis and Hopf bifurcation in fractional order SEIRV epidemic model with a time delay in infected individuals. Partial Differ. Equ. Appl. Math. 2022;5:100282. doi: 10.1016/j.padiff.2022.100282.
  • 8. Cakmak M, Alkan S. A numerical method for solving a class of systems of nonlinear pantograph differential equations. Alex. Eng. J. 2022;61(4):2651–2661. doi: 10.1016/j.aej.2021.07.028.
  • 9. Muslim M. Approximation of solutions to history-valued neutral functional differential equations. Comput. Math. Appl. 2006;51(3–4):537–550. doi: 10.1016/j.camwa.2005.07.013.
  • 10. Lagaris IE, Likas A, Fotiadis DI. Artificial neural networks for solving ordinary and partial differential equations. IEEE Trans. Neural Netw. 1998;9(5):987–1000. doi: 10.1109/72.712178.
  • 11. Aarts LP, Van Der Veer P. Neural network method for solving partial differential equations. Neural Process. Lett. 2001;14(3):261–271. doi: 10.1023/A:1012784129883.
  • 12. Mall S, Chakraverty S. Application of Legendre neural network for solving ordinary differential equations. Appl. Soft Comput. 2016;43:347–356. doi: 10.1016/j.asoc.2015.10.069.
  • 13. Raissi M, Perdikaris P, Karniadakis GE. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019;378:686–707. doi: 10.1016/j.jcp.2018.10.045.
  • 14. Panghal S, Kumar M. Multilayer perceptron and Chebyshev polynomials based neural network for solving Emden–Fowler type initial value problems. Int. J. Appl. Comput. Math. 2020;6(6):1–12. doi: 10.1007/s40819-020-00914-2.
  • 15. Ezadi S, Parandin N. An application of neural networks to solve ordinary differential equations (2013).
  • 16. Liu Z, Yang Y, Cai Q. Neural network as a function approximator and its application in solving differential equations. Appl. Math. Mech. 2019;40(2):237–248. doi: 10.1007/s10483-019-2429-8.
  • 17. Pakdaman M, Ahmadian A, Effati S, Salahshour S, Baleanu D. Solving differential equations of fractional order using an optimization technique based on training artificial neural network. Appl. Math. Comput. 2017;293:81–95.
  • 18. Nguyen L, Raissi M, Seshaiyer P. Efficient Physics Informed Neural Networks Coupled with Domain Decomposition Methods for Solving Coupled Multi-physics Problems. Springer; 2022. pp. 41–53.
  • 19. Mall S, Chakraverty S. Numerical solution of nonlinear singular initial value problems of Emden–Fowler type using Chebyshev neural network method. Neurocomputing. 2015;149:975–982. doi: 10.1016/j.neucom.2014.07.036.
  • 20. Dufera TT. Deep neural network for system of ordinary differential equations: Vectorized algorithm and simulation. Mach. Learn. Appl. 2021;5:100058.
  • 21. Fang J, Liu C, Simos T, Famelis IT. Neural network solution of single-delay differential equations. Mediterr. J. Math. 2020;17(1):1–15. doi: 10.1007/s00009-019-1452-5.
  • 22. Hou C-C, Simos TE, Famelis IT. Neural network solution of pantograph type differential equations. Math. Methods Appl. Sci. 2020;43(6):3369–3374. doi: 10.1002/mma.6126.
  • 23. Panghal S, Kumar M. Optimization free neural network approach for solving ordinary and partial differential equations. Eng. Comput. 2021;37(4):2989–3002. doi: 10.1007/s00366-020-00985-1.
  • 24. Huang G-B, Zhu Q-Y, Siew C-K. Extreme learning machine: Theory and applications. Neurocomputing. 2006;70(1–3):489–501. doi: 10.1016/j.neucom.2005.12.126.
  • 25. Panghal S, Kumar M. Neural network method: delay and system of delay differential equations. Eng. Comput. 1–10 (2021).
  • 26. Liu H, Song J, Liu H, Xu J, Li L. Legendre neural network for solving linear variable coefficients delay differential-algebraic equations with weak discontinuities. Adv. Appl. Math. Mech. 2021;13(1):101–118. doi: 10.4208/aamm.OA-2019-0281.
  • 27. Mall S, Chakraverty S. Artificial Neural Networks for Engineers and Scientists: Solving Ordinary Differential Equations. 1st ed., 168 (2017).
  • 28. Verma A, Kumar M. Numerical solution of third-order Emden–Fowler type equations using artificial neural network technique. Eur. Phys. J. Plus. 2020;135(9):1–14. doi: 10.1140/epjp/s13360-020-00780-3.
  • 29. Verma A, Kumar M. Numerical solution of Bagley–Torvik equations using Legendre artificial neural network method. Evol. Intell. 2021;14(4):2027–2037. doi: 10.1007/s12065-020-00481-x.
  • 30. Serre D. Matrices: Theory and Applications. Springer; 2002.
  • 31. Sezer M, Akyüz-Daşcıoğlu A. A Taylor method for numerical solution of generalized pantograph equations with linear functional argument. J. Comput. Appl. Math. 2007;200(1):217–225. doi: 10.1016/j.cam.2005.12.015.
