Journals & Magazines >IEEE Access >Volume: 9

Parameter Estimation in Neutral Delay Differential Equations Using Genetic Algorithm With Multi-Parent Crossover

Plots of the solution y(t) (cyan curve) to the NDDE model of E. coli population growth using the estimates from the 100 runs of Genetic Algorithm with Multi-Parent Crosso...

Abstract:

Neutral delay differential equations (NDDEs) are differential equations containing time lags not only in the states but also in the state derivatives. NDDEs have applicat...Show More

Metadata

Abstract:

Neutral delay differential equations (NDDEs) are differential equations containing time lags not only in the states but also in the state derivatives. NDDEs have applications in modeling physical and biological systems. An NDDE model may consist of several parameters, some of which can be determined using available data. Various numerical techniques have been studied to estimate parameters of mathematical models. In this study, parameter estimation is posed as an optimization problem. The use of heuristic algorithms in parameter estimation has gained popularity because of its ease of implementation, requiring only function evaluations. But to our knowledge, heuristic algorithms have never been employed in estimating parameters in NDDE models. In this work, we apply Genetic Algorithm with Multi-Parent Crossover (GA-MPC) to obtain parameter estimates of three NDDE models with a discrete delay. We compare the estimates to those obtained using standard heuristic algorithms. Results show that GA-MPC is capable of consistently identifying model parameters that provide a good fit of the model to the data.

Plots of the solution y(t) (cyan curve) to the NDDE model of E. coli population growth using the estimates from the 100 runs of Genetic Algorithm with Multi-Parent Crosso...

Published in: IEEE Access ( Volume: 9)

Page(s): 131348 - 131364

Date of Publication: 17 September 2021

Electronic ISSN: 2169-3536

DOI: 10.1109/ACCESS.2021.3113677

Funding Agency:

Contents

CCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.

SECTION I.

Introduction

Differential equations, where the derivatives at time $t$ depend on the solution and its derivatives at a previous time, are called delay differential equations (DDEs). A general form of a DDE is given by

$\begin{equation*} y^{(k)}(t)=f(t,y(t),\ldots,y^{(k-1)}(t),y(d_{0}),\ldots,y^{(k)}(d_{k});\mathbf {p}), \tag{1}\end{equation*}$ View Source

where

$d_{j}=d_{j}(t,y(t))$

$j=0,\ldots,k$

are the delays which satisfy

$d_{j} \leq t~\forall t \in [t_{0},t_{1}]$

for some

$t_{0},t_{1}\in \mathbb {R}^{+}$

. The function

$f$

is parametrized by

$\mathbf {p}\in \mathbb {R}^{L}$

, where

$L$

is the number of parameters. The solution to (1) is dependent on some initial function

$\phi (t)$

defined at a previous time. This function

$\phi (t)$

is referred to as the history or preshape function. A DDE is of retarded type (RDDE) when there is no delay in the derivative terms of the DDE. On the other hand, it is of neutral type (NDDE) if there are delays in the derivatives of the state variables.

Several studies have been done on the numerical approximations and asymptotic properties of solutions to NDDEs [1]–[7]. NDDEs have been used to develop models in engineering [8]–[12], biosciences [1], [13]–[17], and economics [18], [19]. A review of the applications of NDDEs in various fields is discussed in [20].

Here, we consider a single constant delay $\tau$ . That is,

$\begin{equation*} d_{j}(t,y(t))=t-\tau,\quad \forall j.\end{equation*}$ View Source

Note that the solution $y(t)$ to (1) depends on the parameter $\mathbf {p}$ and the delay $\tau$ . Because $\tau$ can also be treated as an additional parameter, we let

$\begin{equation*} \theta:=[\mathbf {p};\tau],\end{equation*}$ View Source

be the vector of parameters. We also denote by

$y_\theta (t)$

the solution to (1) to make the dependence of

$y(t)$

on the parameters and delay more obvious.

The values of the parameters in a model are often unknown. While some parameters are found in the literature, others need to be estimated. Ideally, we want the experimental data recorded at time $t_{i}$ , denoted by $D_{i}$ , to coincide with the solution to the model at $t_{i}$ . That is,

$\begin{equation*} D_{i} \approx y_\theta (t_{i}),\quad i=1,2,\ldots,N, \tag{2}\end{equation*}$ View Source

where

$N$

is the number of data points. Hence, we look for the value of

$\theta$

that will make the approximation in (2) as close as possible. This can be done by minimizing a least-squares formulation expressed as

$\begin{equation*} \min _{\theta } \dfrac {1}{M}\sum _{i=1}^{N} \left ({D_{i} - y_\theta (t_{i})}\right)^{2}, \tag{3}\end{equation*}$

View Source

where

$M$

is a normalizing constant. We denote the cost function to be minimized by

$\begin{equation*} J(\theta):=\dfrac {1}{M}\sum _{i=1}^{N} \left ({D_{i} - y_\theta (t_{i})}\right)^{2}.\tag{4}\end{equation*}$

View Source

There are many numerical optimization algorithms in the literature that can be used to solve (3). Classical techniques use the gradient of the cost function $J$ in (4) to estimate the minimizer. Gradient-based methods are fast but can be stuck to a local minimum or a saddle point [21]. Using derivative discontinuity tracking theory for DDEs, it was shown in [22] that jumps can arise in the gradient of $J$ . Hence, to solve (3), we rely on a derivative-free method.

Numerical algorithms that only use function values of $J$ are either deterministic or heuristic. Deterministic algorithms are computationally inexpensive and fast but tend to converge to local minimizers [21], [23]. Heuristic algorithms are often nature-inspired [24] and are computationally more expensive than deterministic algorithms but can converge to a global minimizer given enough number of function evaluations [24]–[26].

In [27], a modified particle swarm optimization (PSO) was used to estimate the parameters of dynamical systems involving ordinary differential equations. It was shown in [28] how a hybrid adaptive cuckoo search with simulated annealing could be effective in identifying parameters in chaotic systems. A hybrid Taguchi-differential evolution algorithm was proposed in [29] to estimate the parameters of an HIV model. Heuristic algorithms have also been shown to be effective in identifying parameters of S-system models [30]–[32]. Other recent works on the application of various heuristic optimization methods on estimating parameters of engineering systems are found in [33]–[41].

Heuristic algorithms such as genetic algorithm (GA) [42], which is a hybrid of an evolutionary search strategy with a local multiple-shooting approach [43], and differential evolution [44] have been applied to parameter estimation in RDDE models. In [45], Ant Colony Optimization was used to determine parameter values in RDDE models on human health. To determine the unknown parameters in chaotic systems with time-delays, a hybrid bio-geography optimization was used in [46], PSO was used in [47], and a hybrid cuckoo search algorithm was used in [48].

To the best of our knowledge, heuristic algorithms have never been employed in estimating parameters of models involving NDDEs. In this paper, we solve the least-squares formulation in (3) using Genetic Algorithm with Multi-Parent Crossover (GA-MPC), and compare the results to those obtained using standard global optimization algorithms: GA, pattern search algorithm (PS), simulated annealing (SA), and PSO. GA-MPC is a modified GA that proposes a new crossover method [49]. It has been used to effectively solve an economic dispatch problem [50] and an optimal power flow problem [51]. It has also been applied to solve an inverse problem in RDDEs [52].

In the next section, we discuss GA-MPC and a parameter bootstrapping method. In Section III, parameter estimation is done on three NDDE models arising from applications using the heuristic algorithms mentioned above. The first model describes the population growth of E. coli using a first-order NDDE. The second one is an age-structured model involving a system of first-order NDDEs. The third model is a second-order NDDE describing a pendulum-mass-spring-damper (PMSD) system. Parameter bootstrapping is done to assess uncertainty in the estimates. Furthermore, since the only stopping criterion is the maximum number of function evaluations, we investigate the effect of varying this quantity to the cost function value. In the last section, we state our conclusions and recommendations.

SECTION II.

Methods

A. Genetic Algorithm With Multi-Parent Crossover

Genetic Algorithm was formulated by John Holland in 1975 based on Darwin’s theory of evolution [53]. It has gained popularity for its ease of use and applicability in various fields of study. Some applications of GA can be found in [25], [26], [42], [54]–[59].

The general process of GA starts with a pool of randomly generated individuals that serve as members of the first generation. The members of the population act as estimates of the minimizer. The best members of the population are selected based on their survival fitness. In the case of minimization, the lower the function value of an individual, the higher is its chance of survival. The members of the population who did not survive will be replaced by the offspring of the surviving population. The offspring are created using a crossover method. To make sure that the solution does not converge to a local minimum, a mutation step is done at the end of the generation. In this step, a portion of the population is modified randomly. The process is repeated until a stopping criterion is satisfied. A detailed discussion on GA, as well as the MATLAB codes, are found in [26].

GA-MPC proposes a new crossover method that uses three parents from the selection pool to generate three offspring (refer to Algorithm 1). Two of the offspring ( $o_{1}$ and $o_{3}$ in Algorithm 1) are formulated in such a way that they are generated in the direction with less function value. These two offspring exploit the location of the best parents to attain better estimates. The third offspring ( $o_{2}$ in Algorithm 1) will move away from the best parent. Although this is counter-intuitive, this gives the algorithm the capability to escape local minima. Thus, GA-MPC also adopts a randomized operator to diversify the offspring. A detailed discussion on GA-MPC is found in [49]. This algorithm ranked first among the participating algorithms in the 2011 IEEE Congress of Evolutionary Computation Competition, which tests algorithms by solving real-world optimization problems [60].

Algorithm 1 Genetic Algorithm With Multi-Parent Crossover (GA-MPC) [49]

Require: Cost function $f(\mathbf {x})$ , $\mathbf {x}\in \mathbb {R}^{r}$ , $PopSize$ , $\mathbf {x}_{LB}$ , $\mathbf {x}_{UB}$ , $M$ , $\beta$ , $cr$ , $p$ , $MaxEval$ .

Ensure: $f$ is optimal at $\mathbf {x}^{*}$

Randomly generate the initial population size $PopSize$ . The initial population must stay within the bounds $x_{LB}$ and $x_{UB}$ .

while $FuncEvals < MaxEval$ do

Sort the population by their cost function value. Form the archive pool $\tilde {A}$ using the best $M$ members.

Generate tournament selection size $TC$ randomly between 2 and 3. Do a tournament selection and fill the selection pool.

Randomly generate a number $\bar {u}\in [{0,1}]$ .

for each three consecutive individuals in the selection pool do

if one of the selected individual is the same as another individual then

Replace one by a randomly-selected individual in the selection pool.

end if

10:

if $\bar {u} < cr$ then

11:

Sort the three individuals $x_{i}$ , $x_{i+1}$ , $x_{i+2}$ so that $f(\mathbf {x}_{i}) \leq f(\mathbf {x}_{i+1}) \leq f(\mathbf {x}_{i+2})$ .

12:

Compute $\beta = N(\tilde {\mu },\tilde {\sigma })$ .

13:

The three offspring are generated from the three parents using: $o_{1} = \mathbf {x}_{1} + \beta (\mathbf {x}_{2} - \mathbf {x}_{3}),\, o_{2} = \mathbf {x}_{2} + \beta (\mathbf {x}_{3} - \mathbf {x}_{1}), \, o_{3} = \mathbf {x}_{3} + \beta (\mathbf {x}_{1} - \mathbf {x}_{2})$ .

14:

end if

15:

for $i=1:3$ do

16:

Randomly generate $\tilde {u}\in [{0,1}]$ .

17:

if $\tilde {u} < p$ then

18:

The offspring undergoes mutation: $o_{i}^{j} = \mathbf {x}_{mut}^{j}$ , where $\mathbf {x}_{mut}\in \tilde {A}$ and $mut \in [1,m]$ .

19:

end if

20:

end for

21:

end for

22:

if a duplicate individual ${x}_{i}^{j}$ exists then

23:

Change the duplicate with $\mathbf {x}_{i}^{j} = \mathbf {x}_{i}^{j} + N(0.5*\bar {u},0.25*\bar {u}), \quad \bar {u} \in [{0,1}]$ .

24:

end if

25:

end while

In the following simulations, the parameter values of GA-MPC are set to their default values as stated in [49]. That is, the population size $PopSize = 90$ , selection rate $M=0.50\times PopSize$ , crossover rate $cr=1$ , crossover factor $\beta \sim \mathcal {N}(0.5,0.3)$ , and mutation rate $p=0.1$ .

The results obtained using GA-MPC are compared to those obtained using standard heuristic algorithms: GA, PS, SA, and PSO, which are implemented using the built-in functions ga, patternsearch, simulannealbnd, and particleswarm, respectively, in the global optimization toolbox of MATLAB. For consistency of comparison, all the parameters in these built-in functions are set to their default values. The only stopping criterion in all the algorithms is the maximum number of function evaluations. In the parameter estimation results in Section III, the maximum number of function evaluations is set to 2000 times the number of parameters to be estimated. Because heuristic algorithms are probabilistic, we run each of the algorithms 100 times and compute for the mean of the estimates. Box plots are used to illustrate the distribution of the parameter estimates. Furthermore, we explore the effect of the maximum number of function evaluations to the cost function value by setting this maximum number to the number of parameters to be estimated times 500, 1000, 1500, or 2000.

B. Parameter Uncertainty Analysis

In [61], a parameter bootstrapping approach is presented to quantify the standard error in the parameter estimates in models involving RDDEs. We adopt this strategy to analyze the uncertainty in parameter estimates in the NDDE models. The method is summarized in the following steps:

Solve the parameter estimation problem in (3) and denote the estimated parameter by $\hat {\theta }$ .
Let $y_{\hat {\theta }}(t)$ be the solution of the NDDE model. Set the maximum number $L$ of realizations to 100. Initialize the number of realizations $r$ to 1.
Using $(y_{\hat {\theta }}(t_{i}),\sigma ^{2})$ , $i=1,\ldots,N$ , generate a noisy data $\{\hat {D}_{i}\}_{i=1}^{N}$ based on an assumed probability distribution. Here, the noisy data are generated using the normal distribution Normal$(y_{\hat {\theta }}(t_{i}),\sigma ^{2})$ , $i=1,\ldots,N$ , where $\sigma$ is the noise set to 10% of the standard deviation of $\{y_{\hat {\theta }}(t_{i})\}_{i=1}^{N}$ (compare with [61]).
Get a new set of parameter estimates using the simulated noisy data $\{\hat {D}_{i}\}_{i=1}^{N}$ . This is done by replacing $\{D_{i}\}_{i=1}^{N}$ by $\{\hat {D}_{i}\}_{i=1}^{N}$ in (4) and minimizing the modified cost function. Store this parameter as $\bar {\theta }_{r}$ .
If $r < L$ , set $r\leftarrow r+1$ and go back to step 3. Otherwise, go to the next step.
The standard error of the parameter $\hat {\theta }$ , denoted by $Err(\hat {\theta })$ , is equal to the standard deviation of $\{\bar {\theta }_{r}\}_{r=1}^{100}$ . The 95% confidence interval for the parameter $\hat {\theta }$ is $[\hat {\theta }-1.96*Err(\hat {\theta }),\hat {\theta }+1.96*Err(\hat {\theta })]$ . The standard error and 95% confidence interval of the solution to the NDDE are computed similarly.

SECTION III.

Results and Discussion

A. E. Coli Population Growth Model

In [16], an NDDE model of E. coli population growth is proposed. The model assumes that all cells have the same division time and that they divide simultaneously. Furthermore, it is assumed that there is a prolonged initial step-like growth. The model is given by

$\begin{equation*} y'(t) =\rho _{0}y(t)+\rho _{1}y(t - \tau)+\rho _{2}y'(t - \tau), \tag{5}\end{equation*}$ View Source

where

$y(t)$

denotes the number of colonies at time

$t$

, and

$\rho _{0},\rho _{1}$

$\rho _{2}$

, and

$\tau$

are the model parameters. The parameter

$\rho _{0}$

is the rate of cell death,

$\rho _{1}$

is the rate of commitment to cell division,

$\rho _{2}$

represents the gradual dispersal of synchronization, and

$\tau$

is the average cell-division time [62]. The history function for this problem is given by

$\begin{equation*} \phi (t)=y_{0}+\beta \Psi (t),\quad t\in [-\tau,0],\end{equation*}$

View Source

where

$\begin{equation*} \Psi (t)=\dfrac {2y_{0}\beta \gamma }{\rho _{1}\tau }E\left ({\dfrac {2t}{\tau }+1}\right),\end{equation*}$

View Source

and

$\begin{align*} E(t)=\begin{cases} \exp [-1/(1-t^{2})],& \text {for }|t| < 1,\\ 0, & \text {otherwise.} \end{cases}\end{align*}$

View Source

The function $\phi (t)$ follows a bell-shaped distribution that corresponds to non-monotone cell growth [16]. The value of $\gamma$ is set to 2.25 and the initial population $y_{0}$ to 99 [16].

We wish to estimate the parameters $\rho _{0},\rho _{1},\rho _{2},\tau$ , and $\beta$ by comparing the numerical solution to (5) with the number of colonies $\{y^\star _{i}\}_{i=1}^{28}$ based on experimental data [16]. For brevity, we define

$\begin{equation*} \theta:=[\rho _{0},\rho _{1},\rho _{2},\tau,\beta].\end{equation*}$ View Source

Thus, the minimization problem to estimate the parameters of the NDDE model of E. coli population growth is given by

$\begin{equation*} \min _{\theta \in \mathbb {R}^{5}} \dfrac {\sum \limits _{i=1}^{28} \left ({y^\star _{i} - y_\theta (t_{i})}\right)^{2}}{\sum \limits _{i=1}^{28} \left ({y^\star _{i} }\right)^{2}}. \end{equation*}$ View Source

Here, $y_\theta (t)$ is the solution to (5) given an arbitrary parameter vector $\theta$ . To compute for $y_\theta (t)$ numerically, we use the MATLAB built-in function ddensd with step size 0.01. The maximum and minimum values of the parameters used in the simulations are shown in Table 1.

TABLE 1 Bounds for the Parameters in the NDDE Model of E. coli Population Growth

Figure 1 shows the box plots of the parameter estimates of the E. coli population growth model obtained using GA-MPC, GA, PS, SA, and PSO. Compared to the other algorithms, most of the estimates obtained using GA-MPC fall within the first and third quartiles of the 100 estimates. Furthermore, the narrow interquartile ranges suggest small variations among estimates. The cyan curves in Figure 2 are the plots of the solutions corresponding to the 100 sets of estimates. The data points are represented by the asterisks. Among the five algorithms, plots of the solution using GA-MPC produce the best fit to the experimental data.

FIGURE 1.

Box plots of the parameter estimates for the E. coli population growth model using GA-MPC, GA, PS, SA, and PSO. The outliers, i.e., the estimates that fall outside the lower and upper quartiles, are plotted in red (+). The median and mean are represented by a red line across the box plots and black dot, respectively.

MIT Libraries

MIT Libraries

Parameter Estimation in Neutral Delay Differential Equations Using Genetic Algorithm With Multi-Parent Crossover

Alerts

Abstract:

Metadata

Abstract:

Funding Agency:

Introduction

Methods

A. Genetic Algorithm With Multi-Parent Crossover

Algorithm 1 Genetic Algorithm With Multi-Parent Crossover (GA-MPC) [49]

B. Parameter Uncertainty Analysis

Results and Discussion

A. E. Coli Population Growth Model

B. Age-Structured Model

C. Coupled Pendulum-Mass-Spring-Damper System

D. Varying Maximum Number of Function Evaluations

Conclusion

Appendix AAdditional Figures

Additional Figures

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Appendix A
Additional Figures