Loading web-font TeX/Math/Italic
Parameter Estimation in Neutral Delay Differential Equations Using Genetic Algorithm With Multi-Parent Crossover | IEEE Journals & Magazine | IEEE Xplore

Parameter Estimation in Neutral Delay Differential Equations Using Genetic Algorithm With Multi-Parent Crossover


Plots of the solution y(t) (cyan curve) to the NDDE model of E. coli population growth using the estimates from the 100 runs of Genetic Algorithm with Multi-Parent Crosso...

Abstract:

Neutral delay differential equations (NDDEs) are differential equations containing time lags not only in the states but also in the state derivatives. NDDEs have applicat...Show More

Abstract:

Neutral delay differential equations (NDDEs) are differential equations containing time lags not only in the states but also in the state derivatives. NDDEs have applications in modeling physical and biological systems. An NDDE model may consist of several parameters, some of which can be determined using available data. Various numerical techniques have been studied to estimate parameters of mathematical models. In this study, parameter estimation is posed as an optimization problem. The use of heuristic algorithms in parameter estimation has gained popularity because of its ease of implementation, requiring only function evaluations. But to our knowledge, heuristic algorithms have never been employed in estimating parameters in NDDE models. In this work, we apply Genetic Algorithm with Multi-Parent Crossover (GA-MPC) to obtain parameter estimates of three NDDE models with a discrete delay. We compare the estimates to those obtained using standard heuristic algorithms. Results show that GA-MPC is capable of consistently identifying model parameters that provide a good fit of the model to the data.
Plots of the solution y(t) (cyan curve) to the NDDE model of E. coli population growth using the estimates from the 100 runs of Genetic Algorithm with Multi-Parent Crosso...
Published in: IEEE Access ( Volume: 9)
Page(s): 131348 - 131364
Date of Publication: 17 September 2021
Electronic ISSN: 2169-3536

Funding Agency:


CCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.
SECTION I.

Introduction

Differential equations, where the derivatives at time t depend on the solution and its derivatives at a previous time, are called delay differential equations (DDEs). A general form of a DDE is given by \begin{equation*} y^{(k)}(t)=f(t,y(t),\ldots,y^{(k-1)}(t),y(d_{0}),\ldots,y^{(k)}(d_{k});\mathbf {p}), \tag{1}\end{equation*}

View SourceRight-click on figure for MathML and additional features. where d_{j}=d_{j}(t,y(t)) , j=0,\ldots,k are the delays which satisfy d_{j} \leq t~\forall t \in [t_{0},t_{1}] for some t_{0},t_{1}\in \mathbb {R}^{+} . The function f is parametrized by \mathbf {p}\in \mathbb {R}^{L} , where L is the number of parameters. The solution to (1) is dependent on some initial function \phi (t) defined at a previous time. This function \phi (t) is referred to as the history or preshape function. A DDE is of retarded type (RDDE) when there is no delay in the derivative terms of the DDE. On the other hand, it is of neutral type (NDDE) if there are delays in the derivatives of the state variables.

Several studies have been done on the numerical approximations and asymptotic properties of solutions to NDDEs [1]–​[7]. NDDEs have been used to develop models in engineering [8]–​[12], biosciences [1], [13]–​[17], and economics [18], [19]. A review of the applications of NDDEs in various fields is discussed in [20].

Here, we consider a single constant delay \tau . That is, \begin{equation*} d_{j}(t,y(t))=t-\tau,\quad \forall j.\end{equation*}

View SourceRight-click on figure for MathML and additional features.

Note that the solution y(t) to (1) depends on the parameter \mathbf {p} and the delay \tau . Because \tau can also be treated as an additional parameter, we let \begin{equation*} \theta:=[\mathbf {p};\tau],\end{equation*}

View SourceRight-click on figure for MathML and additional features. be the vector of parameters. We also denote by y_\theta (t) the solution to (1) to make the dependence of y(t) on the parameters and delay more obvious.

The values of the parameters in a model are often unknown. While some parameters are found in the literature, others need to be estimated. Ideally, we want the experimental data recorded at time t_{i} , denoted by D_{i} , to coincide with the solution to the model at t_{i} . That is, \begin{equation*} D_{i} \approx y_\theta (t_{i}),\quad i=1,2,\ldots,N, \tag{2}\end{equation*}

View SourceRight-click on figure for MathML and additional features. where N is the number of data points. Hence, we look for the value of \theta that will make the approximation in (2) as close as possible. This can be done by minimizing a least-squares formulation expressed as \begin{equation*} \min _{\theta } \dfrac {1}{M}\sum _{i=1}^{N} \left ({D_{i} - y_\theta (t_{i})}\right)^{2}, \tag{3}\end{equation*}
View SourceRight-click on figure for MathML and additional features.
where M is a normalizing constant. We denote the cost function to be minimized by \begin{equation*} J(\theta):=\dfrac {1}{M}\sum _{i=1}^{N} \left ({D_{i} - y_\theta (t_{i})}\right)^{2}.\tag{4}\end{equation*}
View SourceRight-click on figure for MathML and additional features.

There are many numerical optimization algorithms in the literature that can be used to solve (3). Classical techniques use the gradient of the cost function J in (4) to estimate the minimizer. Gradient-based methods are fast but can be stuck to a local minimum or a saddle point [21]. Using derivative discontinuity tracking theory for DDEs, it was shown in [22] that jumps can arise in the gradient of J . Hence, to solve (3), we rely on a derivative-free method.

Numerical algorithms that only use function values of J are either deterministic or heuristic. Deterministic algorithms are computationally inexpensive and fast but tend to converge to local minimizers [21], [23]. Heuristic algorithms are often nature-inspired [24] and are computationally more expensive than deterministic algorithms but can converge to a global minimizer given enough number of function evaluations [24]–​[26].

In [27], a modified particle swarm optimization (PSO) was used to estimate the parameters of dynamical systems involving ordinary differential equations. It was shown in [28] how a hybrid adaptive cuckoo search with simulated annealing could be effective in identifying parameters in chaotic systems. A hybrid Taguchi-differential evolution algorithm was proposed in [29] to estimate the parameters of an HIV model. Heuristic algorithms have also been shown to be effective in identifying parameters of S-system models [30]–​[32]. Other recent works on the application of various heuristic optimization methods on estimating parameters of engineering systems are found in [33]–​[41].

Heuristic algorithms such as genetic algorithm (GA) [42], which is a hybrid of an evolutionary search strategy with a local multiple-shooting approach [43], and differential evolution [44] have been applied to parameter estimation in RDDE models. In [45], Ant Colony Optimization was used to determine parameter values in RDDE models on human health. To determine the unknown parameters in chaotic systems with time-delays, a hybrid bio-geography optimization was used in [46], PSO was used in [47], and a hybrid cuckoo search algorithm was used in [48].

To the best of our knowledge, heuristic algorithms have never been employed in estimating parameters of models involving NDDEs. In this paper, we solve the least-squares formulation in (3) using Genetic Algorithm with Multi-Parent Crossover (GA-MPC), and compare the results to those obtained using standard global optimization algorithms: GA, pattern search algorithm (PS), simulated annealing (SA), and PSO. GA-MPC is a modified GA that proposes a new crossover method [49]. It has been used to effectively solve an economic dispatch problem [50] and an optimal power flow problem [51]. It has also been applied to solve an inverse problem in RDDEs [52].

In the next section, we discuss GA-MPC and a parameter bootstrapping method. In Section III, parameter estimation is done on three NDDE models arising from applications using the heuristic algorithms mentioned above. The first model describes the population growth of E. coli using a first-order NDDE. The second one is an age-structured model involving a system of first-order NDDEs. The third model is a second-order NDDE describing a pendulum-mass-spring-damper (PMSD) system. Parameter bootstrapping is done to assess uncertainty in the estimates. Furthermore, since the only stopping criterion is the maximum number of function evaluations, we investigate the effect of varying this quantity to the cost function value. In the last section, we state our conclusions and recommendations.

SECTION II.

Methods

A. Genetic Algorithm With Multi-Parent Crossover

Genetic Algorithm was formulated by John Holland in 1975 based on Darwin’s theory of evolution [53]. It has gained popularity for its ease of use and applicability in various fields of study. Some applications of GA can be found in [25], [26], [42], [54]–​[59].

The general process of GA starts with a pool of randomly generated individuals that serve as members of the first generation. The members of the population act as estimates of the minimizer. The best members of the population are selected based on their survival fitness. In the case of minimization, the lower the function value of an individual, the higher is its chance of survival. The members of the population who did not survive will be replaced by the offspring of the surviving population. The offspring are created using a crossover method. To make sure that the solution does not converge to a local minimum, a mutation step is done at the end of the generation. In this step, a portion of the population is modified randomly. The process is repeated until a stopping criterion is satisfied. A detailed discussion on GA, as well as the MATLAB codes, are found in [26].

GA-MPC proposes a new crossover method that uses three parents from the selection pool to generate three offspring (refer to Algorithm 1). Two of the offspring (o_{1} and o_{3} in Algorithm 1) are formulated in such a way that they are generated in the direction with less function value. These two offspring exploit the location of the best parents to attain better estimates. The third offspring (o_{2} in Algorithm 1) will move away from the best parent. Although this is counter-intuitive, this gives the algorithm the capability to escape local minima. Thus, GA-MPC also adopts a randomized operator to diversify the offspring. A detailed discussion on GA-MPC is found in [49]. This algorithm ranked first among the participating algorithms in the 2011 IEEE Congress of Evolutionary Computation Competition, which tests algorithms by solving real-world optimization problems [60].

Algorithm 1 Genetic Algorithm With Multi-Parent Crossover (GA-MPC) [49]

Require: Cost function f(\mathbf {x}) , \mathbf {x}\in \mathbb {R}^{r} , PopSize , \mathbf {x}_{LB} , \mathbf {x}_{UB} , M , \beta , cr , p , MaxEval .

Ensure: f is optimal at \mathbf {x}^{*}

1:

Randomly generate the initial population size PopSize . The initial population must stay within the bounds x_{LB} and x_{UB} .

2:

while FuncEvals < MaxEval do

3:

Sort the population by their cost function value. Form the archive pool \tilde {A} using the best M members.

4:

Generate tournament selection size TC randomly between 2 and 3. Do a tournament selection and fill the selection pool.

5:

Randomly generate a number \bar {u}\in [{0,1}] .

6:

for each three consecutive individuals in the selection pool do

7:

if one of the selected individual is the same as another individual then

8:

Replace one by a randomly-selected individual in the selection pool.

9:

end if

10:

if \bar {u} < cr then

11:

Sort the three individuals x_{i} , x_{i+1} , x_{i+2} so that f(\mathbf {x}_{i}) \leq f(\mathbf {x}_{i+1}) \leq f(\mathbf {x}_{i+2}) .

12:

Compute \beta = N(\tilde {\mu },\tilde {\sigma }) .

13:

The three offspring are generated from the three parents using: o_{1} = \mathbf {x}_{1} + \beta (\mathbf {x}_{2} - \mathbf {x}_{3}),\, o_{2} = \mathbf {x}_{2} + \beta (\mathbf {x}_{3} - \mathbf {x}_{1}), \, o_{3} = \mathbf {x}_{3} + \beta (\mathbf {x}_{1} - \mathbf {x}_{2}) .

14:

end if

15:

for i=1:3 do

16:

Randomly generate \tilde {u}\in [{0,1}] .

17:

if \tilde {u} < p then

18:

The offspring undergoes mutation: o_{i}^{j} = \mathbf {x}_{mut}^{j} , where \mathbf {x}_{mut}\in \tilde {A} and mut \in [1,m] .

19:

end if

20:

end for

21:

end for

22:

if a duplicate individual {x}_{i}^{j} exists then

23:

Change the duplicate with \mathbf {x}_{i}^{j} = \mathbf {x}_{i}^{j} + N(0.5*\bar {u},0.25*\bar {u}), \quad \bar {u} \in [{0,1}] .

24:

end if

25:

end while

In the following simulations, the parameter values of GA-MPC are set to their default values as stated in [49]. That is, the population size PopSize = 90 , selection rate M=0.50\times PopSize , crossover rate cr=1 , crossover factor \beta \sim \mathcal {N}(0.5,0.3) , and mutation rate p=0.1 .

The results obtained using GA-MPC are compared to those obtained using standard heuristic algorithms: GA, PS, SA, and PSO, which are implemented using the built-in functions ga, patternsearch, simulannealbnd, and particleswarm, respectively, in the global optimization toolbox of MATLAB. For consistency of comparison, all the parameters in these built-in functions are set to their default values. The only stopping criterion in all the algorithms is the maximum number of function evaluations. In the parameter estimation results in Section III, the maximum number of function evaluations is set to 2000 times the number of parameters to be estimated. Because heuristic algorithms are probabilistic, we run each of the algorithms 100 times and compute for the mean of the estimates. Box plots are used to illustrate the distribution of the parameter estimates. Furthermore, we explore the effect of the maximum number of function evaluations to the cost function value by setting this maximum number to the number of parameters to be estimated times 500, 1000, 1500, or 2000.

B. Parameter Uncertainty Analysis

In [61], a parameter bootstrapping approach is presented to quantify the standard error in the parameter estimates in models involving RDDEs. We adopt this strategy to analyze the uncertainty in parameter estimates in the NDDE models. The method is summarized in the following steps:

  1. Solve the parameter estimation problem in (3) and denote the estimated parameter by \hat {\theta } .

  2. Let y_{\hat {\theta }}(t) be the solution of the NDDE model. Set the maximum number L of realizations to 100. Initialize the number of realizations r to 1.

  3. Using (y_{\hat {\theta }}(t_{i}),\sigma ^{2}) , i=1,\ldots,N , generate a noisy data \{\hat {D}_{i}\}_{i=1}^{N} based on an assumed probability distribution. Here, the noisy data are generated using the normal distribution Normal$(y_{\hat {\theta }}(t_{i}),\sigma ^{2})$ , i=1,\ldots,N , where \sigma is the noise set to 10% of the standard deviation of \{y_{\hat {\theta }}(t_{i})\}_{i=1}^{N} (compare with [61]).

  4. Get a new set of parameter estimates using the simulated noisy data \{\hat {D}_{i}\}_{i=1}^{N} . This is done by replacing \{D_{i}\}_{i=1}^{N} by \{\hat {D}_{i}\}_{i=1}^{N} in (4) and minimizing the modified cost function. Store this parameter as \bar {\theta }_{r} .

  5. If r < L , set r\leftarrow r+1 and go back to step 3. Otherwise, go to the next step.

  6. The standard error of the parameter \hat {\theta } , denoted by Err(\hat {\theta }) , is equal to the standard deviation of \{\bar {\theta }_{r}\}_{r=1}^{100} . The 95% confidence interval for the parameter \hat {\theta } is [\hat {\theta }-1.96*Err(\hat {\theta }),\hat {\theta }+1.96*Err(\hat {\theta })] . The standard error and 95% confidence interval of the solution to the NDDE are computed similarly.

SECTION III.

Results and Discussion

A. E. Coli Population Growth Model

In [16], an NDDE model of E. coli population growth is proposed. The model assumes that all cells have the same division time and that they divide simultaneously. Furthermore, it is assumed that there is a prolonged initial step-like growth. The model is given by \begin{equation*} y'(t) =\rho _{0}y(t)+\rho _{1}y(t - \tau)+\rho _{2}y'(t - \tau), \tag{5}\end{equation*}

View SourceRight-click on figure for MathML and additional features. where y(t) denotes the number of colonies at time t , and \rho _{0},\rho _{1} , \rho _{2} , and \tau are the model parameters. The parameter \rho _{0} is the rate of cell death, \rho _{1} is the rate of commitment to cell division, \rho _{2} represents the gradual dispersal of synchronization, and \tau is the average cell-division time [62]. The history function for this problem is given by \begin{equation*} \phi (t)=y_{0}+\beta \Psi (t),\quad t\in [-\tau,0],\end{equation*}
View SourceRight-click on figure for MathML and additional features.
where \begin{equation*} \Psi (t)=\dfrac {2y_{0}\beta \gamma }{\rho _{1}\tau }E\left ({\dfrac {2t}{\tau }+1}\right),\end{equation*}
View SourceRight-click on figure for MathML and additional features.
and \begin{align*} E(t)=\begin{cases} \exp [-1/(1-t^{2})],& \text {for }|t| < 1,\\ 0, & \text {otherwise.} \end{cases}\end{align*}
View SourceRight-click on figure for MathML and additional features.

The function \phi (t) follows a bell-shaped distribution that corresponds to non-monotone cell growth [16]. The value of \gamma is set to 2.25 and the initial population y_{0} to 99 [16].

We wish to estimate the parameters \rho _{0},\rho _{1},\rho _{2},\tau , and \beta by comparing the numerical solution to (5) with the number of colonies \{y^\star _{i}\}_{i=1}^{28} based on experimental data [16]. For brevity, we define \begin{equation*} \theta:=[\rho _{0},\rho _{1},\rho _{2},\tau,\beta].\end{equation*}

View SourceRight-click on figure for MathML and additional features.

Thus, the minimization problem to estimate the parameters of the NDDE model of E. coli population growth is given by \begin{equation*} \min _{\theta \in \mathbb {R}^{5}} \dfrac {\sum \limits _{i=1}^{28} \left ({y^\star _{i} - y_\theta (t_{i})}\right)^{2}}{\sum \limits _{i=1}^{28} \left ({y^\star _{i} }\right)^{2}}. \end{equation*}

View SourceRight-click on figure for MathML and additional features.

Here, y_\theta (t) is the solution to (5) given an arbitrary parameter vector \theta . To compute for y_\theta (t) numerically, we use the MATLAB built-in function ddensd with step size 0.01. The maximum and minimum values of the parameters used in the simulations are shown in Table 1.

TABLE 1 Bounds for the Parameters in the NDDE Model of E. coli Population Growth
Table 1- 
Bounds for the Parameters in the NDDE Model of E. coli Population Growth

Figure 1 shows the box plots of the parameter estimates of the E. coli population growth model obtained using GA-MPC, GA, PS, SA, and PSO. Compared to the other algorithms, most of the estimates obtained using GA-MPC fall within the first and third quartiles of the 100 estimates. Furthermore, the narrow interquartile ranges suggest small variations among estimates. The cyan curves in Figure 2 are the plots of the solutions corresponding to the 100 sets of estimates. The data points are represented by the asterisks. Among the five algorithms, plots of the solution using GA-MPC produce the best fit to the experimental data.

FIGURE 1. - Box plots of the parameter estimates for the E. coli population growth model using GA-MPC, GA, PS, SA, and PSO. The outliers, i.e., the estimates that fall outside the lower and upper quartiles, are plotted in red (+). The median and mean are represented by a red line across the box plots and black dot, respectively.
FIGURE 1.

Box plots of the parameter estimates for the E. coli population growth model using GA-MPC, GA, PS, SA, and PSO. The outliers, i.e., the estimates that fall outside the lower and upper quartiles, are plotted in red (+). The median and mean are represented by a red line across the box plots and black dot, respectively.

FIGURE 2. - Plots of the solution 
$y(t)$
 (cyan curve) to the NDDE model of E. coli population growth using the estimates from the 100 runs of GA-MPC, GA, PS, SA, and PSO. Experimental data points are represented by asterisks.
FIGURE 2.

Plots of the solution y(t) (cyan curve) to the NDDE model of E. coli population growth using the estimates from the 100 runs of GA-MPC, GA, PS, SA, and PSO. Experimental data points are represented by asterisks.

Next, we apply the bootstrapping method presented in Subsection II-B to assess the uncertainty in the parameter estimates obtained using GA-MPC. Table 2 shows the standard errors and mean of the estimated parameters, which all fall within the 95% confidence interval. In Figure 3, we see that the plot of the solution using the mean of the estimated parameters (blue curve) fits the experimental data (asterisks) well. The bounds for the 95% confidence interval of the solution y_{\hat {\theta }}(t) are represented by the red curves.

TABLE 2 Standard Error Err(\hat{\theta}) , 95% Confidence Interval, and Mean of the 100 Realizations Using GA-MPC for the NDDE Model of E. coli Population Growth
Table 2- 
Standard Error 
$Err(\hat{\theta})$
, 95% Confidence Interval, and Mean of the 100 Realizations Using GA-MPC for the NDDE Model of E. coli Population Growth
FIGURE 3. - The numerical solution 
$y(t)$
 of the NDDE model of E. coli population growth using the mean parameter estimates of GA-MPC is shown as the blue curve. The black asterisks are the experimental data points. The 95% confidence interval of the solutions is shown in gray and is bounded by the red curves.
FIGURE 3.

The numerical solution y(t) of the NDDE model of E. coli population growth using the mean parameter estimates of GA-MPC is shown as the blue curve. The black asterisks are the experimental data points. The 95% confidence interval of the solutions is shown in gray and is bounded by the red curves.

B. Age-Structured Model

Next, we consider a population divided into two age groups – juveniles and adults. This can be described by the following age-structured population growth model from [14]:\begin{align*} \dot {U}(t)=&\begin{cases} \displaystyle -\mu _{0}U(t)+b_{1}V(t)\\ \displaystyle +\,(b_{2}-1)n_{0}(\tau -t)e^{-\mu _{0}t},& 0\leq t\leq \tau,\\ \displaystyle c_{0}U(t)+c_{1}V(t)\\ \displaystyle +\,c_{3}V(t-\tau)+c_{4}\dot {V}(t-\tau),& t>\tau,\\ \displaystyle \end{cases}\\ \dot {V}(t)=&\begin{cases} \displaystyle n_{0}(\tau -t)e^{-\mu _{0}t}-\mu _{1}V(t),& 0\leq t\leq \tau,\\ \displaystyle a_{1}V(t-\tau)+a_{2}\dot {V}(t-\tau)-a_{3}V(t), & t>\tau, \end{cases}\end{align*}

View SourceRight-click on figure for MathML and additional features. where U(t) is the juvenile population, V(t) is the adult population, \begin{align*} c_{0}=&b_{0}-\mu _{0}, \\ c_{1}=&b_{1}, \\ c_{2}=&(b_{2}-1)b_{0}e^{-\mu _{0}\tau }, \\ c_{3}=&(b_{2}-1)(b_{1}e^{-\mu _{2}}+b_{2}\mu _{1})e^{\mu _{2}-\mu _{0}\tau }, \\ c_{4}=&(b_{2}-1)b_{2}e^{\mu _{2}-\mu _{0}\tau }, \\ a_{0}=&b_{0}e^{\mu _{2}-\mu _{0}\tau }, \\ a_{1}=&(b_{1}e^{-\mu _{2}}+b_{2}\mu _{1})e^{-\mu _{0}\tau }, \\ a _{2}=&b_{2}e^{-\mu _{0}\tau }, \\ a_{3}=&\mu _{1}, \tag{6}\end{align*}
View SourceRight-click on figure for MathML and additional features.
b_{j} , and \mu _{j} , j=0,1,2 , are different birth and death rates, respectively. We note from [14] that the coefficients a_{i} in (6) are non-negative and that \begin{equation*} a_{1}\geq a_{2}a_{3}.\end{equation*}
View SourceRight-click on figure for MathML and additional features.

We set the initial values for U(t) and V(t) as (compare with [15]) \begin{align*}\begin{bmatrix} U(0)\\ V (0) \end{bmatrix}=\begin{bmatrix} \int _{0}^{36}n_{0}(a)da\\[0.5pc] \int _{36}^{50}n_{0}(a)da\end{bmatrix}=\begin{bmatrix} \int _{0}^{36}1da\\[0.5pc] \int _{36}^{50}1da\end{bmatrix}=\begin{bmatrix} 36\\ 14 \end{bmatrix},\end{align*}

View SourceRight-click on figure for MathML and additional features. where the initial age distribution is given by \begin{align*} n_{0}(a)=\begin{cases} 1,& 0\leq a\leq 50,\\ 0,& \text {otherwise}. \end{cases}\end{align*}
View SourceRight-click on figure for MathML and additional features.

In this example, we use generated population data for U(t) and V(t) at n different time points. To generate noisy data, we follow the method used in DDE models presented in [61]. First, clean data are generated by solving the model given the true parameters (see Table 3). The solution of the NDDE model is computed numerically using the MATLAB function ddensd. Then noisy data are generated following a normal distribution, with standard deviation set to 10% of the standard deviation of the NDDE solution. We generate data for n=20 , 50, and 100 time points over a fixed interval. The goal is to estimate the parameters of the age-structured NDDE model from the generated noisy data \{U^{*}(t_{i}),V^{*}(t_{i})\},\,\,i=1,2,\ldots,n .

TABLE 3 True Values and Bounds for the Parameters in the Age-Structured NDDE Model
Table 3- 
True Values and Bounds for the Parameters in the Age-Structured NDDE Model

The minimization problem for the parameter estimation of this model is given by \begin{equation*} \min \limits _{\theta \in \mathbb {R}^{7}}\left ({\sum _{i=1}^{n} \dfrac {|U^{*}(t_{i})-U_\theta (t_{i})|^{2}}{|U^{*}(t_{i})|^{2}}+ \dfrac {|V^{*}(t_{i})-V_\theta (t_{i})|^{2}}{|V^{*}(t_{i})|^{2}}}\right)+\zeta g,\end{equation*}

View SourceRight-click on figure for MathML and additional features. where \begin{equation*} \theta =[\tau,b_{0},b_{1},b_{2},\mu _{0},\mu _{1},\mu _{2}],\end{equation*}
View SourceRight-click on figure for MathML and additional features.
\zeta is a penalty parameter, n is the number of data points, \{U_\theta (t_{i}), V_\theta (t_{i})\} are the solutions to the model at time t_{i} given an arbitrary parameter vector \theta , and \begin{align*} g=&-\min [a_{1}-a_{2}a_{3},0]\\=&-\min [(b_{1}e^{-\mu _{2}}+b_{2}\mu _{1})e^{-\mu _{0}\tau }-\mu _{1}b_{2}e^{-\mu _{0}\tau },0]\end{align*}
View SourceRight-click on figure for MathML and additional features.
is a penalty function to ensure that the parameter values satisfy the condition a_{1}\geq a_{2}a_{3} . The parameter \zeta is set to 1000. We solve the minimization problem using GA-MPC, GA, PS, SA, and PSO. The bounds for the parameters are given in Table 3.

We see in Figures 4 to 5 that in most cases, GA-MPC and PSO result in parameter estimates which are close to the true parameter values (black dotted line) and vary within small interquartile ranges. However, GA-MPC gives better estimates than PSO for the parameters b_{2} , \mu _{0},~\mu _{1} , and \mu _{2} . Using PSO, some estimates for the death rates \mu _{0} and \mu _{1} are zero, which fall outside the meaningful range for the death rate. Some parameter estimates using GA, PS, and SA vary on a larger range and a significant number of estimates fall outside the first and third quartiles. There is no notable difference in the parameter estimation results when n=20 , 50, or 100.

FIGURE 4. - Box plots of the estimates for 
$\tau $
, 
$b_{0}$
, 
$b_{1}$
, and 
$b_{2}$
 in the age-structured NDDE model using GA-MPC, GA, PS, SA, and PSO. The outliers, i. e., the estimates that fall outside the lower and upper quartiles, are plotted in red (+). The median and mean are represented by a red line across the box plots and black dot, respectively. The true values are shown as a black dotted line. The estimates are computed from a generated data set containing 20, 50, and 100 sample points.
FIGURE 4.

Box plots of the estimates for \tau , b_{0} , b_{1} , and b_{2} in the age-structured NDDE model using GA-MPC, GA, PS, SA, and PSO. The outliers, i. e., the estimates that fall outside the lower and upper quartiles, are plotted in red (+). The median and mean are represented by a red line across the box plots and black dot, respectively. The true values are shown as a black dotted line. The estimates are computed from a generated data set containing 20, 50, and 100 sample points.

FIGURE 5. - Box plots of the estimates for 
$\mu _{0}$
, 
$\mu _{1}$
, and 
$\mu _{2}$
 in the age-structured NDDE model using GA-MPC, GA, PS, SA, and PSO. The outliers, i. e., the estimates that fall outside the lower and upper quartiles, are plotted in red (+). The median and mean are represented by a red line across the box plots and black dot, respectively. The true values are shown as a black dotted line. The estimates are computed from a generated data set containing 20, 50, and 100 sample points.
FIGURE 5.

Box plots of the estimates for \mu _{0} , \mu _{1} , and \mu _{2} in the age-structured NDDE model using GA-MPC, GA, PS, SA, and PSO. The outliers, i. e., the estimates that fall outside the lower and upper quartiles, are plotted in red (+). The median and mean are represented by a red line across the box plots and black dot, respectively. The true values are shown as a black dotted line. The estimates are computed from a generated data set containing 20, 50, and 100 sample points.

Figure 6 shows the plots of the solutions U(t) and V(t) (cyan and magenta curves, respectively) using GA-MPC. We see that the plots have a good fit to the n data points (asterisks) and that the value of n does not affect the performance of GA-MPC. Figures 12 to 14 in the Appendix show the plots of the solutions to the age-structured NDDE model using the estimates from GA, PS, SA, and PSO.

FIGURE 6. - Plots of the solutions 
$U(t)$
 and 
$V(t)$
 (cyan and magenta curves, respectively) to the age-structured NDDE model using the estimates from the 100 runs of GA-MPC. The generated data sets with 20, 50, and 100 sample points are represented by asterisks.
FIGURE 6.

Plots of the solutions U(t) and V(t) (cyan and magenta curves, respectively) to the age-structured NDDE model using the estimates from the 100 runs of GA-MPC. The generated data sets with 20, 50, and 100 sample points are represented by asterisks.

FIGURE 7. - The numerical solutions 
$U(t)$
 (black curve) and 
$V(t)$
 (blue curve) of the age-structured NDDE model using the mean estimated parameter values of GA-MPC. The black asterisks are the 50 generated data points. The 95% confidence interval of the NDDE solutions is illustrated in gray and is bounded by the red curves.
FIGURE 7.

The numerical solutions U(t) (black curve) and V(t) (blue curve) of the age-structured NDDE model using the mean estimated parameter values of GA-MPC. The black asterisks are the 50 generated data points. The 95% confidence interval of the NDDE solutions is illustrated in gray and is bounded by the red curves.

FIGURE 8. - Box plots of the parameter estimates in the PMSD NDDE model using GA-MPC, GA, PS, SA, and PSO. The outliers, i.e., the estimates that fall outside the lower and upper quartiles, are plotted in red (+). The median and mean are represented by a red line across the box plots and black dot, respectively. The true values are shown as a black dotted line. The estimates are computed from a generated data set containing 20, 50, and 100 sample points.
FIGURE 8.

Box plots of the parameter estimates in the PMSD NDDE model using GA-MPC, GA, PS, SA, and PSO. The outliers, i.e., the estimates that fall outside the lower and upper quartiles, are plotted in red (+). The median and mean are represented by a red line across the box plots and black dot, respectively. The true values are shown as a black dotted line. The estimates are computed from a generated data set containing 20, 50, and 100 sample points.

FIGURE 9. - Plots of the solutions 
$y(t)$
 (cyan curves) to the PMSD NDDE model using the estimates from the 100 runs of GA-MPC. The generated data sets with 20, 50, and 100 sample points are represented by asterisks.
FIGURE 9.

Plots of the solutions y(t) (cyan curves) to the PMSD NDDE model using the estimates from the 100 runs of GA-MPC. The generated data sets with 20, 50, and 100 sample points are represented by asterisks.

FIGURE 10. - The numerical solution 
$y(t)$
 of the PMSD NDDE model using the estimated parameter values of GA-MPC is shown as the blue curve. The black asterisks are the 50 generated data points. The 95% confidence interval for the NDDE solutions is illustrated in gray and is bounded by the red curves.
FIGURE 10.

The numerical solution y(t) of the PMSD NDDE model using the estimated parameter values of GA-MPC is shown as the blue curve. The black asterisks are the 50 generated data points. The 95% confidence interval for the NDDE solutions is illustrated in gray and is bounded by the red curves.

FIGURE 11. - The mean cost function value in models 1 to 3 using GA-MPC, GA, PS, SA, and PSO for different maximum number of function evaluations. The maximum function evaluations is set to 
$500\cdot d$
 (blue), 
$1000\cdot d$
 (red), 
$1500\cdot d$
 (yellow), and 
$2000\cdot d$
 (violet), where 
$d$
 is the number of parameters to be estimated.
FIGURE 11.

The mean cost function value in models 1 to 3 using GA-MPC, GA, PS, SA, and PSO for different maximum number of function evaluations. The maximum function evaluations is set to 500\cdot d (blue), 1000\cdot d (red), 1500\cdot d (yellow), and 2000\cdot d (violet), where d is the number of parameters to be estimated.

FIGURE 12. - Plots of the solutions 
$U(t)$
 and 
$V(t)$
 (cyan and magenta curves, respectively) to the age-structured NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data sets with 20 data points are represented by asterisks.
FIGURE 12.

Plots of the solutions U(t) and V(t) (cyan and magenta curves, respectively) to the age-structured NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data sets with 20 data points are represented by asterisks.

FIGURE 13. - Plots of the solutions 
$U(t)$
 and 
$V(t)$
 (cyan and magenta curves, respectively) to the age-structured NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data set with 50 data points are represented by asterisks.
FIGURE 13.

Plots of the solutions U(t) and V(t) (cyan and magenta curves, respectively) to the age-structured NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data set with 50 data points are represented by asterisks.

FIGURE 14. - Plots of the solutions 
$U(t)$
 and 
$V(t)$
 (cyan and magenta curves, respectively) to the age-structured NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data set with 100 data points are represented by asterisks.
FIGURE 14.

Plots of the solutions U(t) and V(t) (cyan and magenta curves, respectively) to the age-structured NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data set with 100 data points are represented by asterisks.

To analyze the uncertainty in the estimates, we apply the bootstrapping method to the data with n=50 . Results in Table 4 show small standard errors of all the parameters. Furthermore, the mean values of the estimates all fall within the 95% confidence intervals. Figure 7 shows the good fit of the solutions to the data set using the re-estimates.

TABLE 4 Standard Error Err(\hat{\theta}) , 95% Confidence Interval, and Mean of the 100 Realizations Using GA-MPC for the Age-Structured NDDE Model
Table 4- 
Standard Error 
$Err(\hat{\theta})$
, 95% Confidence Interval, and Mean of the 100 Realizations Using GA-MPC for the Age-Structured NDDE Model

C. Coupled Pendulum-Mass-Spring-Damper System

Consider a mechanical system consisting of a mass M mounted on a linear spring, where a pendulum of mass m is attached to it via a hinged rod of length l [9]. We assume that the angular deflection of the pendulum from the downward position is very small, and is thus negligible. Moreover, we assume that there is a linear, viscous damping C on the mass M and no external forces are acting on the system. The equation of motion is given by the second-order NDDE \begin{equation*} M\ddot {x}(t) + C\dot {x}(t)+Kx(t)+m\ddot {x}(t-\tau)=0, \tag{7}\end{equation*}

View SourceRight-click on figure for MathML and additional features. where C and K are the damping and stiffness coefficients, respectively. This equation describes the vertical motion of a PMSD system, and the terms x(t),\dot {x}(t) and \ddot {x}(t) are the position, velocity, and acceleration of the PMSD at time t . In the following discussion, we consider the nondimensionalized version of (7) given by \begin{equation*} \ddot {y}+2\zeta \dot {y}+y+p\ddot {y}(t-\tau)=0,\tag{8}\end{equation*}
View SourceRight-click on figure for MathML and additional features.
with the preshape function \phi (t)=\cos (t/2) [9]. Note that if |p| < 1 and \zeta >1/\sqrt {2} , then the steady state of \eqref {pendulum} is locally asymptotically stable [9]. In particular, we are interested in the case when p>2\zeta \sqrt {1-\zeta ^{2}} , so that there is a succession of stability switches as the value of \tau increases.

We estimate the parameters of (8) from noisy data generated in the same way as in the age-structured NDDE model. First, clean data is obtained by numerically solving (8) using the MATLAB function ddensd. We use the following parameter values from [9]: \tau =1 , \zeta =0.05 , and p=0 . Then noisy data y^{*}(t_{i}) , i=1,2,\ldots,n , are generated following a normal distribution, with the standard deviation set to 10% of the standard deviation of the NDDE solution. We generate data sets containing n=20 , 50, and 100 time points divided over a fixed interval. We minimize the objective function given by \begin{equation*} \min _{\theta \in \mathbb {R}^{3}} \dfrac {\sum \limits _{i=1}^{n} \left ({y^\star _{i} - y_\theta (t_{i})}\right)^{2}}{\sum \limits _{i=1}^{n} \left ({y^\star _{i} }\right)^{2}}, \end{equation*}

View SourceRight-click on figure for MathML and additional features. where \begin{equation*} \theta:=[\tau,\zeta,p],\end{equation*}
View SourceRight-click on figure for MathML and additional features.
y_\theta (t_{i}) is the solution to (8) at time t_{i} using an arbitrary parameter vector \theta . The bounds for the parameters are shown in Table 5.

TABLE 5 True Values and Bounds for the Parameters in the PMSD NDDE Model
Table 5- 
True Values and Bounds for the Parameters in the PMSD NDDE Model

Figure 8 shows the zoomed-in box plots of the parameter estimates of the PMSD NDDE model. Aside from PS, the heuristic algorithms performed well in the parameter estimation. This might be attributed to having only 3 parameters being estimated. We note that there are many outliers in the estimates using PS with n=50 , which are not seen in Figure 8 but are seen in the plots of the solutions (see Figures 15 to 17 in the Appendix). The algorithms GA-MPC and GA both give good approximations to the true parameter values (black dotted line) in all 100 runs, but GA-MPC has fewer outliers. The plots of the solutions using parameter estimates of GA-MPC are shown in Figure 9. The number of data points used in the estimation does not appear to significantly affect the cost function value and the fit of the solutions to the data points.

FIGURE 15. - Plots of the solutions 
$y(t)$
 (cyan curves) to the PMSD NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data set with 20 sample points are represented by asterisks.
FIGURE 15.

Plots of the solutions y(t) (cyan curves) to the PMSD NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data set with 20 sample points are represented by asterisks.

FIGURE 16. - Plots of the solutions 
$y(t)$
 (cyan curves) to the PMSD NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data set with 50 sample points are represented by asterisks.
FIGURE 16.

Plots of the solutions y(t) (cyan curves) to the PMSD NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data set with 50 sample points are represented by asterisks.

FIGURE 17. - Plots of the solutions 
$y(t)$
 (cyan curves) to the PMSD NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data set with 100 sample points are represented by asterisks.
FIGURE 17.

Plots of the solutions y(t) (cyan curves) to the PMSD NDDE model using the estimates from 100 runs of GA, PS, SA, and PSO. The generated data set with 100 sample points are represented by asterisks.

Results of the parameter bootstrapping method show small standard errors of the parameter estimates in the PMSD NDDE model with mean estimates lying within the 95% confidence intervals (Table 6). Figure 10 shows a good fit to the data of most of the solutions using the re-estimated parameters.

TABLE 6 Standard Error Err(\hat{\theta}) and Mean of the 100 Realizations Using GA-MPC, and the Resulting 95% Confidence Intervals of the Parameter Estimates for the PMSD NDDE Model
Table 6- 
Standard Error 
$Err(\hat{\theta})$
 and Mean of the 100 Realizations Using GA-MPC, and the Resulting 95% Confidence Intervals of the Parameter Estimates for the PMSD NDDE Model

D. Varying Maximum Number of Function Evaluations

The only stopping criterion used in the previous simulations is the maximum number of function evaluations and this is set to 2000 times the number of parameters to be estimated. Results show that GA-MPC consistently obtained good parameter estimates in the three models. Now, we look at how varying the maximum number of function evaluations affect the performance of GA-MPC, GA, PS, SA, and PSO in parameter estimation.

We consider setting the stopping criterion to 500, 1000, 1500, or 2000 times the number of parameters to be estimated, which we denote by d . Figure 11 shows the mean cost function value of 20 independent runs performed using GA-MPC, GA, PS, SA, and PSO. In Model 1, we see that GA-MPC produces the least cost function values across the different maximum numbers of function evaluations. In Models 2 and 3, we observe that GA-MPC and PSO give parameter estimates with the lowest cost function values, even when the maximum numbers of function evaluations are 500d or 1000d . In all three models, GA-MPC consistently provides parameter estimates with low cost function values even for smaller numbers of maximum function evaluations. This is advantageous in instances when modeling using NDDEs requires parameter tuning at a faster time.

SECTION IV.

Conclusion

In this work, we examine the use of a modified genetic algorithm called GA-MPC in solving parameter estimation problems involving NDDEs. We test our method on three different NDDE models – a first-order NDDE, a system of first-order NDDEs, and a second-order NDDE. We look at the performance of GA-MPC in comparison to other heuristic algorithms and incorporate parameter uncertainty analysis to examine the extent of uncertainties in the estimates. For consistency of comparison, the default values of the parameters in the algorithms used are unchanged and the only stopping criterion is the maximum number of function evaluations. The simulations show that heuristic algorithms, particularly GA-MPC, can provide good estimates for the parameter values and a good fit to experimental data. The high accuracy and good consistency of the parameter estimates using GA-MPC are observed in all three different types of NDDE models we presented.

An advantage of using GA-MPC in solving the least-squares formulation arising from the parameter estimation problem is that it does not depend on the explicit formulation or the derivatives of the solution to the NDDE. Although heuristic algorithms are computationally expensive, they are robust and are easy to implement.

Many factors contribute to the performance of heuristic algorithms in solving optimization problems. For future research, one can explore modifying the default parameter values or the stopping criteria in the algorithms. One can also consider other recent heuristic algorithms and hybrids of heuristic algorithms in solving inverse problems in NDDEs. In this study, we have varied the number of data points and maximum number of function evaluations. For future work, one can use other metrics for comparison such as varying the noise level in the simulated data and adding different types of noise.

Appendix A

Additional Figures

Figures 12 to 14 show the plots of the solutions to the age-structured NDDE model using parameter estimates obtained from the data set containing 20, 50, and 100 points using GA, PS, SA, and PSO. The plots of U(t) and V(t) are the cyan and magenta curves, respectively, while the data points are represented by the asterisks. Figures 15 to 17 show the plots of the solution to the PMSD NDDE model obtained using the estimates from 20, 50, and 100 data points, using GA, PS, SA, and PSO. The plots of the solutions are in cyan and the data points are represented by the asterisks.

References

References is not available for this document.