Extracting LPV and qLPV Structures From State-Space Functions: A TP Model Transformation Based Framework

SECTION I.

Introduction

This paper proposes the tensor product (TP) model transformation [1]–[3]-based framework to numerically reconstruct linear time invariant (LTI), linear parameter varying (LPV), and quasi-LPV (qLPV) representations of state-space models such that the reconstructed models are in TP model form. (Note that in the present discourse the TP model form is equivalent to the T–S fuzzy model form [1]–[4], therefore, whenever this paper makes an assertion concerning the TP model, it will also be applicable to the T–S fuzzy model.) The proposed framework incorporates all advantageous properties of the TP model transformation framework, such as the ability to reconstruct canonical [5], [6] as well as various other types of convex TP model (or T–S fuzzy model) forms [1], [7], and the ability to find the minimal number of vertices (or fuzzy rules) corresponding to a specified approximation accuracy [1], [8]. The goal of this proposed framework is to replace the complicated (or even intractable) derivations of closed formulae with straightforward, tractable numerically appealing routine-like solutions that can be executed in a reasonable amount of time. A further goal is that it should be irrelevant how the state-space model is given (as a set of closed formulae, as a neural network, as a black-box model), and indeed, the numerical steps of the proposed framework can be executed in all of these cases.

The motivation behind exploring variations of the T–S fuzzy model structure can be outlined as follows. Polytopic model-based state-space control design has three key steps: A) defining the state-space model; B) finding the optimal polytopic model (TP model or T–S fuzzy model); and C) deriving the controller. The crucial point is that the structure of the model obtained after Steps A and B has a key role in determining the effectiveness of the controller design obtained in Step C. Thus, it is important to structure the elements of the system matrix in Step A well, since ultimately, the ordering of the elements influences the whole design process in a strong sense. The convex hull defined by the polytopic structure in Step B also directly influences the design. Therefore, the applied control design strategy must take into account these points in order to guarantee that the best controller is obtained for the task at hand [2], [9]–[11].

This paper proposes a TP model transformation-based solution that can be automatically executed and used to derive models for Steps A and B. More importantly, the key novelty of the paper is that it extends the TP model transformation-based control design framework to state-space models whose matrix structure is unknown. Further novelties of the proposed extension include its capability of dealing with the internal structure of the vertices or system matrix in a systematic way, that it allows for the flexible manipulation of this structure in a way that considerably improves the controller design, and that it allows for the specification and design of the external parameters and state vector element contained in the parameter vector.

Much work has been carried out in the recent past on the TP model transformation. Computational analyses and improvements to the original formulation were recently proposed in [12] and [13]. It was also proved in [2], [9]–[11] that linear matrix inequality (LMI)-based control design theories are very sensitive to the properties of the convex hulls (both the shape of the weighting functions and the number of vertices) defined by the TP models; hence, the convex hull manipulation capability of the TP model transformation is an important and necessary step in LMI-based control design. Very effective convex hull manipulation methods were incorporated into the TP model transformation in [7], [14], and [15]. Further useful control approaches and applications were published in the field of control theory [16]–[23], including in the area of sliding mode control in [18], [24], and [25]. For further theories and applications, readers are referred to [26]–[59].

The rest of this paper is organized as follows. Section II defines the notation and the basic concepts of the TP model transformation in a modified form to better fit the proposed framework. Section III outlines the key points of the paper. It highlights the main goal, namely, to extend the TP model transformation to state-space functions where the linear matrix product structure (product of the system matrix and the state vector) is unknown. Sections IV to VII present the novelty of this paper. These sections show how to numerically reconstruct various state-space forms in a reasonable amount of time, even in cases when the exact closed formulae are unknown, but the model can be sampled over a grid. Section IV shows how to numerically reconstruct a global linearization, and also shows how to make the linearization in a given point. Section V presents one of the key points of the paper and shows how to execute the extended TP model transformation even in cases when we do not know the inner linear parameter dependent matrix product-based structure of the given state-space model. Sections VI and VII give some idea of how to manipulate the parameter space and the structure of the system matrix. Section VIII presents some examples to show that the MATLAB implementations of the proposed framework are very simple. The performance of the algorithms are demonstrated on the example of the inverted pendulum. Section IX discusses the contrast between the proposed extension and previous solutions. Finally, Section X concludes this paper.

SECTION II.

Notations and Preliminary Concepts

This section provides the notations used in the paper and introduces the basic concepts of the TP model transformation in a modified and rather more compact form that is better suited to the proposed framework.

A. Notations

The following notations are used in this paper:

Scalar: $a$ ;
Vector: $\mathbf {a}$ contains elements $a_i$ ;
Matrix: $\mathbf {A}$ contains elements $a_{i,j}$ ;
Tensor: $\mathcal {A}$ contains elements $a_{i,j,k,\ldots }$ ;
$\mathbf {x}\in {\mathbb {R}}^N$ denotes that the vector $\mathbf {x}$ contains $N$ elements, and the values of these elements are real numbers;
$\mathcal {A}\in {\mathbb {R}}^{I_1\times I_2\times \cdots \times I_N}$ means that the values of the $N$ -dimensional tensor are real numbers. $I_n$ , $n=1\ldots N$ denotes the number of elements in the $n$ th dimension of tensor $\mathcal {A}$ .;
Index $i$ : The upper bounds of the indices are denoted by the uppercase letter, e.g., $i=1\ldots I$ ;
Interval: $\omega =[\omega _{\text{min}},\; \omega _{\text{max}}]$ ;
Space: $\Omega:\omega _1\times \omega _2\times \cdots \times \omega _N$ is an $N$ -dimensional hypercube;
$\mathbf {x}\in \Omega$ expresses the fact that vector $\mathbf {x}$ is within the space $\Omega$ . The dimensions of $\mathbf {x}$ and $\Omega$ are the same;
$\Omega ^x$ and $\Omega ^p$ : We frequently define vectors $\mathbf {x}\in \Omega$ and $\mathbf {p}\in \Omega$ . In order to denote that these spaces of $\mathbf {x}$ and $\mathbf {p}$ are different, we use superscript $x$ and $p$ as $\Omega ^x$ and $\Omega ^p$ . This simply means that $\mathbf {x}\in \Omega ^x$ and $\mathbf {p}\in \Omega ^p$ ;
$\sqsubset$ refers to a dimensionality reduced subset in general as:
1. in the case of spaces: $\Theta \sqsubset \Omega$ states that $\Theta$ is a hypercube with the same-sized intervals as $\Omega$ , but has a smaller number of dimensions;
2. in the case of vectors: $\mathbf {a}\sqsubset \mathbf {b}$ , where $\mathbf {a}\in \Theta \subset {\mathbb {R}}^N$ and $\mathbf {b}\in \Omega \subset {\mathbb {R}}^M$ means that $N< M$ and $\Theta \sqsubset \Omega$ ;
3. in the case of tensors, $\mathcal {A}\sqsubset \mathcal {B}$ means, for instance, that $\mathcal {A}$ is obtained by deleting complete dimensions from tensor $\mathcal {B}$ ;
$\mathbf {A}=\mathcal {A}_{[n]}$ denotes the layout of tensor $\mathcal {A}$ . $\mathbf {A}$ is a matrix with the size of $I_n \times \prod ^N_{i=1,i\ne n}I_i$ , where the size of $\mathcal {A}$ is $I_1 \times I_2 \times \cdots \times I_N$ . The column vectors of matrix $\mathbf {A}$ are the vectors of the $n$ th dimension of tensor $\mathcal {A}$ . For further details, see [60];
$\mathcal {S}\mathop {\boxtimes }\nolimits _{n=1}^N\mathbf {U}_n$ is the TP. It defines the tensor product between tensor $\mathcal {S}$ and matrices $\mathbf {U}$ . $\mathcal {S}$ is the $N$ -dimensional core tensor and $\mathbf {U}_n$ are matrices assigned to each dimension. For details refer to [1]–[3];
$f(\mathbf {x})=\mathcal {S}\mathop {\boxtimes }\nolimits _{n=1}^N\mathbf {w}_n(x_n)$ represents the TP function $\mathbf {x}\in {\mathbb {R}}^N$ where $\mathbf {w}_n(x_n)=[w_{n,1}(x_n) \;\; w_{n,2}(x_n) \;\; \ldots \;\; w_{n,I_n} (x_n)]$ is called the weighting function system. This is equivalent to the very frequently used transfer function of the T–S fuzzy model given in classical form such as
$\begin{align*} y=\sum _{i_1=1}^{I_1}\sum _{i_2=1}^{I_2} \ldots \sum _{i_N}^{I_N} \prod _{n=1}^N w_{n,i_n}(x_n)b_{i_1,i_2,\ldots,i_N} \tag{1} \end{align*}$ View Source
Types of the weighting functions:
1. SN: Sum normalized;
2. NN: Nonnegative;
3. NO: Normalized;
4. CNO: Close to normalized;
5. RNO: Relaxed normalized;
6. INO: Inverse normalized;
7. IRNO: Inverse relaxed normalized;
for further details refer to [1] and [2].

B. Concepts

In this section, slight modifications are proposed to the basic concepts used in the TP model transformation. This will allow us to more easily address the key steps of the proposed extensions.

Definition 1.

Grid: An equidistant hyper rectangular grid with $I$ dimensions is determined by space $\Omega ={\mathbb {R}}^{\omega _1\times \omega _2\times \cdots \times \omega _I}$ and the density of the grid denoted by $M:M_1\times M_2 \times \cdots \times M_I$ . Thus the grid, per dimension, is $[g_{i,1} \;\; g_{i,2} \;\; \ldots\;\; g_{i,M_i}]$ , where $\forall i: g_{i,m}\leq g_{i,m+1}, g_{i,1}=\omega _{\text{min}}, g_{i,M_i}=\omega _{\text{max}}$ .

Definition 2.

Grid tensor: A grid tensor, denoted as $\mathcal {G}$ , contains the coordinates of the hyper rectangular grid (given by $\Omega$ and $M$ ) in each entry. Each element of the tensor is a vector pointing to the location of one grid. Thus, $\mathbf {g}_{m_1,m_2,\ldots,m_I}=[g_{1,m_1} \;\; g_{2,m_2} \;\; \ldots \;\; g_{I,m_I}]$ , $m_i=1,\ldots,M_i$ . A grid tensor is obtained when we store $\mathbf {g}_{m_1,m_2,\ldots,m_I}$ in $\mathcal {G}$ . This means that $\mathcal {G}$ has $I+1$ dimensions and $\mathcal {G}\in {\mathbb {R}}^{M_1\times M_2 \times \cdots \times M_I \times I}$ .

Definition 3.

Multiple operations: Assume a given function $\mathbf {y}=f(\mathbf {x})$ , where $\mathbf {y}\in {\mathbb {R}}^O$ and $\mathbf {x}\in {\mathbb {R}}^I$ . The multiple operation

$\begin{align*} \mathcal {Y}=f(*\mathcal {X}) \tag{2} \end{align*}$ View Source

takes an input tensor

$\mathcal {X}\in {\mathbb {R}}^{M_1\times M_2 \times \cdots \times M_I \times I}$

, and returns an output tensor

$\mathcal {Y}\in {\mathbb {R}}^{M_1\times M_2 \times \cdots \times O}$

whose entries are

$\mathbf {y}_{m_1,m_2,\ldots,m_I}=f(\mathbf {x}_{m_1,m_2,\ldots,m_I})$

, where

$\mathbf {x}_{m_1,m_2,\ldots,m_I}$

are the elements of tensor

$\mathcal {X}$

.

Definition 4.

Discretization: The discretized variant $\mathcal {F}^{\mathcal {G}}$ of $f(\mathbf {x})$ , $\mathbf {x}\in \Omega$ is

$\begin{align*} \mathcal {F}^{\mathcal {G}}=f(*\mathcal {G}) \tag{3} \end{align*}$ View Source

where

$\mathcal {G}$

is a grid defined to

$\Omega$

with density

$M$

.

Definition 5.

TP model: The following TP function-based structure is referred to as the TP model:

$\begin{align*} \mathbf {y}=f(\mathbf {p})=\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {w}_n(p_n) \tag{4} \end{align*}$ View Source

where

$\mathbf {y}\in {\mathbb {R}}^{O_1 \times O_2 \times \cdots \times O_K}$

,

$\mathbf {p}\in {\mathbb {R}}^N$

, and core tensor

$\mathcal {S}\in {\mathbb {R}}^{R_1\times R_2 \times \cdots \times R_N\times O_1 \times O_2 \times \cdots \times O_K }$

contains the vertexes

$\mathbf {s}_{r_1,r_2,\ldots,r_N}\in {\mathbb {R}}^{O_1 \times O_2 \times \cdots \times O_K}$

and vectors

$\mathbf {w}_n(p_n)\in {\mathbb {R}}^{R_n}$

contain the weighting functions

$w_{n,r_n}(p_n)$

assigned to the vertexes. Several beneficial properties of TP models, as well as various types thereof have been studied in the past. Two important research areas relevant to the topic are as follows.

Higher order singular value (HOSVD)-based canonical form of TP models. Here, $R_n$ also expresses the rank of the function on each dimension. The TP model structure is based on the higher order singular values, the vertexes, and the weightings of orthonormed systems. For further details, see [5] and [6].
Convex TP models. Here, the $\mathbf {y}$ is within the convex hull defined by the vertexes. Various types (loose and tight) hulls were defined, such as SN, NN, NO, CNO, RNO, INO,..., see [1]–[3] and [7]. It was also proved that the manipulation of the convex hull has a crucial role in control design theory, see [2], [9]–[11].

Polytopic form in (4) is the higher structured variant of the frequently used formulae (in polytopic modeling) $\mathbf {S}(\mathbf {p})=\sum _{h=1}^H w_h(\mathbf {p}) \mathbf {S}_h$ , where vertexes $\mathbf {S}_h$ are the elements $\mathbf {S}_{r_1,r_2,\ldots,r_N}$ of the core tensor $\mathcal {S}$ , where the multidimensional index $r_1,r_2,\ldots,r_N$ is replaced with its linear equivalent $h=1\ldots H=\prod _1^N R_n$ . Accordingly, $w_h(\mathbf {p})=\prod _{n=1}^N w_{n,r_n}(p_n)$ .

Method 1.

TP model transformation: Assume a given function $\mathcal {Y}=f(\mathbf {p})$ , where $\mathcal {Y}\in \Omega ^y\subset {\mathbb {R}}^O$ and $\mathbf {p}\in \Omega ^p\subset {\mathbb {R}}^N$ . The TP model transformation numerically reconstructs the TP model of the given function as

$\begin{align*} f(\mathbf {p})\approx _\epsilon \mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {w}_n(p_n). \tag{5} \end{align*}$ View Source

The TP model transformation finds the exact TP structure, if it exists (

$\epsilon =0$

), with the minimal number of vertices and weighting functions. It has further variants such as the pseudo- and multi-TP transformation, as well as other generalized TP model transformations for different purposes [2], [3].

The TP model transformation has three steps.

Step 1: Discretization: Defining $\mathcal {F}^\mathcal {G}$ of $f(\mathbf {p})$ .
Step 2: Extracting the TP structure: $\mathcal {F}^{\mathcal {G}}=\mathcal {S}\mathop {\boxtimes }\nolimits _{n=1}^N \mathbf {U}_n$ and performing the necessary complexity tradeoff. This is done by executing the HOSVD [60] on $\mathcal {F}^{\mathcal {G}}$ . The HOSVD results in matrices $\mathbf {U}_n$ and core tensor $\mathcal {S}$ . The matrices $\mathbf {U}_n$ are the singular matrices by dimensions. Further convex hull manipulation can be executed by transforming $\mathbf {U}_n$ to specialized matrices $\mathbf {U}^*_n$ while $\mathcal {S}\mathop {\boxtimes }\nolimits _{n=1}^N \mathbf {U}_n=\mathcal {S}^*\mathop {\boxtimes }\nolimits _{n=1}^N \mathbf {U}_n^*$ as detailed in [1]. This will control the characteristics of the weighting functions determined in the next step. For instance, to have convex combination (i.e., to determine the antecedent membership function in the Ruspini partition), the singular matrices are transformed to CNO type, see [1].
Step 3: Defining weighting functions $\mathbf {w}_n(p_n)$ from $\mathbf {U}^*_n$ . There are two ways to reconstruct the weighting functions. One is to simply perform linear interpolation between the elements of the columns in matrix $\mathbf {U}_n$ , then each column defines one piecewise linear weighting function. This is referred to as the bilinear TP model transformation. The other way is to recalculate the weighting functions at any given $p_n$ (while all $p_i$ , $i\ne n$ are fixed). This requires the calculation of $f(\mathbf {p})$ for that point. Usually a combination of these two approaches is used. The weighting functions are recalculated over a large number of points “off-line,” then a piecewise linear function is used between the points.

SECTION III.

Problem Outline

Assume a state-space model

$\begin{align*} \left[{\begin{array}{c}\dot{\mathbf {x}} \\ \mathbf {y} \end{array}}\right]=f(\mathbf {x},\mathbf {u}) \tag{6} \end{align*}$ View Source

where

$\left[{\begin{array}{cc}\mathbf {x}& \mathbf {u} \end{array}}\right]^T\in \Omega ^x\subset {\mathbb {R}}^I$

contains the state vector

$\mathbf {x}$

and input

$\mathbf {u}$

. Vector

$\left[{\begin{array}{cc}\dot{\mathbf {x}} & \mathbf {y} \end{array}}\right]^T\in {\mathbb {R}}^O$

contains output

$\mathbf {y}$

.

The paper introduces a framework to reconstruct the LTI representation of (6) as

$\begin{align*} \left[{\begin{array}{c}\dot{\mathbf {x}} \\ \mathbf {y} \end{array}}\right]\approx _{\epsilon } \mathbf {S} \left[{\begin{array}{c}\mathbf {x}\\ \mathbf {u} \end{array}}\right]\tag{7} \end{align*}$ View Source

where

$\mathbf {S}\in {\mathbb {R}}^{O\times I}$

and

$\epsilon$

are minimized in the least-square sense. If it has no acceptable accuracy then the proposed framework is also capable of reconstructing the LPV representation

$\begin{align*} \left[{\begin{array}{c}\dot{\mathbf {x}} \\ \mathbf {y} \end{array}}\right]\approx _{\epsilon } \mathbf {S}(\mathbf {p}) \left[{\begin{array}{c}\mathbf {x}\\ \mathbf {u} \end{array}}\right]\tag{8} \end{align*}$

View Source

of parameter dependent state-space model

$\begin{align*} \left[{\begin{array}{c}\dot{\mathbf {x}} \\ \mathbf {y} \end{array}}\right]=f(\mathbf {x},\mathbf {u},\mathbf {p}) \tag{9} \end{align*}$

View Source

where parameter vector

$\mathbf {p}\in \Omega ^p\subset {\mathbb {R}}^N$

and

$\epsilon$

are minimized in the least-square sense. If the approximation error is not acceptable (because of the nonlinearity), the paper proposes a method to reconstruct the qLPV representation as

$\begin{align*} \left[{\begin{array}{c}\dot{\mathbf {x}} \\ \mathbf {y} \end{array}}\right]\approx _{\epsilon } \mathbf {S}(\mathbf {p}) \left[{\begin{array}{c}\mathbf {x}\\ \mathbf {u} \end{array}}\right]\tag{10} \end{align*}$

View Source

where vector

$\mathbf {p}$

contains elements of the state vector

$\mathbf {x}$

. Thus, the structure is still linear, however, the model belongs to the family of nonlinear systems.

In the abovementioned cases, the parameter-varying system matrix is given in TP model form as

$\begin{align*} \mathbf {S}(\mathbf {p})=\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {w}_n(p_n) \tag{11} \end{align*}$ View Source

such that tensor

$\mathcal {S}$

contains the vertex systems and vectors

$\mathbf {w}_n(p_n)$

contain the weighting functions by dimensions of

$\mathbf {p}$

. Once again, this is equivalent to the T–S fuzzy model, where

$\mathcal {S}$

contains the consequent system matrices and

$\mathbf {w}_n(p_n)$

contains the antecedent membership functions.

The control design is typically based on the linear matrix form (8) and 10) of the state-space transfer function (6). Since this representation is not unique, determining how the elements of system matrix $\mathbf {S}$ should be organized and structured is crucial. Furthermore, the TP model of $\mathbf {S}(\mathbf {p})$ is also not unique—an infinite number of different vertex and weighting function systems exist to represent the same $\mathbf {S}(\mathbf {p})$ . Since the further design steps (e.g., LMI-based design) are based on the vertexes, the appropriate selection of the vertexes is also a crucial point. As a result, both the internal structure of the system matrix and the vertexes of the models strongly influence the further design steps [2], [9]–[11].

The framework proposed in this paper provides various ways to incorporate the TP model transformation in the process of manipulating these important components of the TP model and the design (see also [2]). Using the TP model transformation to begin has the following advantages:

it is executable on models given by equations or soft computing-based representations, such as fuzzy rules, neural networks, or other black-box models. The only requirement is that the model must provide an output for each input (at least on a discrete scale);
it will find the minimal complexity, namely, the minimal number of components of the TP model (or rules of the T–S fuzzy model). If further complexity reduction is required, it provides one of the best tradeoffs between the number of components (fuzzy rules) and the approximation error;
it works like the principal component analysis in that it determines the order of the components (fuzzy rules) according to their importance;
it is capable of deriving the weighting functions or antecedent fuzzy sets according to various constraints. For instance, it can be used to define different convex hulls, a capability which has recently been shown to play an important role in control theory as mentioned above;
it is capable of transforming the given model to predefined weighting functions (antecedent fuzzy sets), using, i.e., the pseudo-TP model transformation;
it is capable of transforming a set of models simultaneously, while deriving common weighting functions (antecedent fuzzy sets) for all models.

SECTION IV.

Transformation to Linear Structure

The goal is to replace the function

$\begin{align*} \mathbf {y}=f(\mathbf {x}) \tag{12} \end{align*}$ View Source

with a linear mapping

$\begin{align*} \mathbf {y}=\mathbf {S}\mathbf {x}\tag{13} \end{align*}$

View Source

where

$\mathbf {x}\in \Omega \subset {\mathbb {R}}^I$

and

$\mathbf {y}\in {\mathbb {R}}^O$

(hence,

$\mathbf {S}\in {\mathbb {R}}^{O\times I}$

).

Method 2.

Linearization: The goal is to find a linear mapping $\mathbf {S}$ between a huge number of input–output pairs. Let these pairs be given by the grid tensor $\mathcal {G}$ defined over $\Omega$ with density $M$ and by the corresponding outputs in

$\begin{align*} \mathcal {F}^{\mathcal {G}}=f(*\mathcal {G}). \tag{14} \end{align*}$ View Source

Thus, we define

$\mathbf {S}$

as

$\begin{align*} \mathcal {F}^{\mathcal {G}}=\mathcal {G}\times _{I+1}\mathbf {S} \tag{15} \end{align*}$

View Source

therefore, using pseudoinverse we have

$\begin{align*} \mathbf {S}=\mathcal {F}^{\mathcal {G}}_{\left[{\begin{array}{c}I+1 \end{array}}\right]} \left(\mathcal {G}_{\left[{\begin{array}{c}I+1 \end{array}}\right]}\right)^+. \tag{16} \end{align*}$

View Source

At a more detailed level: (14) means

$\begin{align*} \mathbf {f}_{m_1,m_2,\ldots,m_I}=f(\mathbf {g}_{m_1,m_2,\ldots,m_I}) \tag{17} \end{align*}$

View Source

and (15) means

$\begin{align*} \mathbf {f}_{m_1,m_2,\ldots,m_I}=\mathbf {S}\mathbf {g}_{m_1,m_2,\ldots,m_I} \tag{18} \end{align*}$

View Source

for all vectors

$\mathbf {f}_{m_1,m_2,\ldots,m_I}$

and

$\mathbf {g}_{m_1,m_2,\ldots,m_I}$

stored in tensors

$\mathcal {F}^{\mathcal {G}}$

and

$\mathcal {G}$

, respectively.

Since these vectors are in the $(I+1)$ th dimension, we can layout these tensors in the $(I+1)$ th dimension and rewrite (15) in the following form:

$\begin{align*} \mathcal {F}^{\mathcal {G}}_{\left[{\begin{array}{c}I+1 \end{array}}\right]} = \mathbf {S}\left(\mathcal {G} \right)_{\left[{\begin{array}{c}I+1 \end{array}}\right]}. \tag{19} \end{align*}$ View Source

Thus we have arrived at (16).

As a result, we have

$\begin{align*} f(\mathbf {x})\approx _\epsilon \mathbf {S}\mathbf {x}. \tag{20} \end{align*}$ View Source

The error can be evaluated, for instance, over another dense grid

$\mathcal {H}$

defined over

$\Omega$

as

$\epsilon =\mathcal {F}^{H}-\mathcal {H}\times _{I+1}\mathbf {S}$

where

$\mathcal {F}^{H}=f(*\mathcal {H})$

.

If the error is not acceptable, one alternative approach is to try the quasi-linear structure detailed later.

SECTION V.

Transformation to Parameter-Varying Linear Structure

The goal is to replace the parameter-varying function

$\begin{align*} \mathbf {y}=f(\mathbf {x},\mathbf {p}) \tag{21} \end{align*}$ View Source

$\mathbf {y}\in {\mathbb {R}}^O$

,

$\mathbf {x}\in \Omega ^x\subset {\mathbb {R}}^I$

,

$\mathbf {p}\in \Omega ^p\subset {\mathbb {R}}^N$

, with

$\begin{align*} \mathbf {y}=\mathbf {S}(\mathbf {p})\mathbf {x}\tag{22} \end{align*}$

View Source

where

$\mathbf {S}(\mathbf {p})$

is given in TP model form

$\begin{align*} \mathbf {S}(\mathbf {p})=\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {w}_n(p_n). \tag{23} \end{align*}$

View Source

The ability to incorporate all of the advantageous properties of the TP model transformation is highly desirable here.

Method 3:

Parameter-varying linear structure

Step 1: Define discretization grid $\mathcal {G}^p$ to $\Omega ^p$ with density $M^p$ .
Step 2: Define a linearization grid $\mathcal {G}^x$ fit to $\Omega ^x$ with density $M^x$ .
Step 3: Linearize $f(\mathbf {x},\mathbf {p})$ for each $\mathbf {p}_{m_1,m_2,\ldots,m_N}$ of $\mathcal {G}^p$ by Method 2. Namely, define a linear mapping between a sufficiently large number of input–output pairs given by grid $\mathcal {G}^x$ and
$\begin{align*} \mathcal {F}^{\mathcal {G}^x}_{m_1,m_2,\ldots,m_N}=f(*\mathcal {G}^x,\mathbf {p}_{m_1,m_2,\ldots,m_N}) \tag{24} \end{align*}$ View Sourcein the form of $\begin{align*} \mathcal {F}^{\mathcal {G}^x}_{m_1,m_2,\ldots,m_N}=\mathcal {G}^x\times _{I+1}\mathbf {S}_{m_1,m_2,\ldots,m_N}. \tag{25} \end{align*}$ View SourceAs a result, we have $\begin{align*} \mathbf {S}_{m_1,m_2,\ldots,m_N}=\left(\mathcal {F}^{\mathcal {G}^x}_{m_1,m_2,\ldots,m_N} \right)_{\left[{\begin{array}{c}I+1 \end{array}}\right]} \Psi \tag{26} \end{align*}$ View Sourcewhere $\begin{align*} \Psi =\left(\mathcal {G}^x _{\left[{\begin{array}{c}I+1 \end{array}}\right]}\right)^+. \tag{27} \end{align*}$ View Source
Step 4: Since the tensor $\mathcal {S}^{\mathcal {G}^p}\in {\mathbb {R}}^{M^p_1\times M^p_2 \times \cdots \times M^p_N \times O \times I}$ constructed from $\mathbf {S}_{m_1,m_2,\ldots,m_N}$ is the discretized variant of $\mathbf {S}(\mathbf {p})$ over $\mathcal {G}^p$ , we can continue with the second step of the TP model transformation (incorporating all the beneficial properties of the TP model transformation), which results in
$\begin{align*} \mathcal {S}^{\mathcal {G}^p}=\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {U}_n \tag{28} \end{align*}$ View Sourceand then, the third step results in $\begin{align*} \mathbf {S}(\mathbf {p})=\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {w}_n(p_n) \tag{29} \end{align*}$ View Sourcethus, to summarize $\begin{align*} \mathbf {y}=\left(\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {w}_n(p_n) \right)\mathbf {x}. \tag{30} \end{align*}$ View Source

Remark 1.

A bilinear TP model can be derived directly from $\mathbf {U}_n$ , however, in order to reconstruct the weighting functions over any given $p_n$ , $\mathbf {S}$ must be linearized again for the appropriate $\mathbf {p}$ .

SECTION VI.

Variate the Parameter Dependency

In the previous section, the parameter dependence was defined by $\mathbf {y}=f(\mathbf {x},\mathbf {p})$ . The goal here is to modify this dependence and replace vector $\mathbf {p}\in \Omega ^p$ with $\mathbf {v}\in \Omega ^v$ . In many cases, it is desirable that a linear transformation $\mathbf {T}$ of the parameter space be used

$\begin{align*} \mathbf {p}=\mathbf {T}\mathbf {v} \tag{31} \end{align*}$ View Source

thus, the goal is to find the equivalent

$\begin{align*} \mathbf {y}=f(\mathbf {x},\mathbf {v}) \tag{32} \end{align*}$

View Source

in parameter-varying linear form. We may examine an even more general case and replace vector

$\mathbf {p}$

with another parameter vector

$\mathbf {v}\in {\mathbb {R}}^K$

, such as

$\begin{align*} \mathbf {p}=T(\mathbf {v}) \tag{33} \end{align*}$

View Source

that leads to

$\begin{align*} \mathbf {y}=f(\mathbf {x},\mathbf {v}) \tag{34} \end{align*}$

View Source

and execute Method 3 to reveal the parameter-varying linear structure. In this case the discretization grid

$\mathcal {G}^v$

is defined within

$\Omega ^v$

and Method 3 will numerically reconstruct

$\begin{align*} \mathbf {y}=\left(\mathcal {S}\mathop {\boxtimes }\limits _{k=1}^K \mathbf {w}_k(v_k) \right)\mathbf {x}. \tag{35} \end{align*}$

View Source

For instance, if we know that element

$p_1$

always acts in

$\mathbf {y}=f(\mathbf {x},\mathbf {p})$

as

$p^2_1$

, then, we may choose to use

$v_1=p^2_1$

instead. Consider the following example:

$\begin{align*} y=x_1p^2+x_2 \tag{36} \end{align*}$

View Source

that can be transformed to the parameter-dependent linear form as

$\begin{align*} y=\left[{\begin{array}{cc}p^2 & 1 \end{array}}\right]\left[{\begin{array}{c}x_1 \\ x_2 \end{array}}\right]=\mathbf {S}(p)\mathbf {x}. \tag{37} \end{align*}$

View Source

However, we may define a new parameter

$v=p^2$

as

$\begin{align*} y=x_1v+x_2=\left[{\begin{array}{cc}v & 1 \end{array}}\right]\left[{\begin{array}{c}x_1 \\ x_2 \end{array}}\right]=\mathbf {S}(v)\mathbf {x}. \tag{38} \end{align*}$

View Source

This idea can be extended to entire sets of parameters. For instance, consider the following function:

$\begin{align*} y=x_1\left(p_1^2+p_2 \right)+x_2p_2 \tag{39} \end{align*}$

View Source

one may choose to use the new parameters

$v_1=p_1^2+p_2$

and

$v_2=p_2$

and obtain

$\begin{align*} y=x_1v_1+x_2v_2=\left[{\begin{array}{cc}v_1 & v_2 \end{array}}\right]\left[{\begin{array}{c}x_1 \\ x_2 \end{array}}\right]=\mathbf {S}(\mathbf {v})\mathbf {x}. \tag{40} \end{align*}$

View Source

SECTION VII.

Transformation to Quasi-Linear Structure

If the above-discussed linearization leads to a larger than acceptable error $\epsilon$ , an alternative approach would be to search for a quasi-linear structure.

The goal, then, is to replace the function

$\begin{align*} \mathbf {y}=f(\mathbf {x}) \tag{41} \end{align*}$ View Source

with a linear mapping

$\begin{align*} \mathbf {y}=\mathbf {S}(\mathbf {p})\mathbf {x}\tag{42} \end{align*}$

View Source

where

$\mathbf {x}\in \Omega \subset {\mathbb {R}}^I$

and

$\mathbf {y}\in {\mathbb {R}}^O$

(hence

$\mathbf {S}\in {\mathbb {R}}^{O\times I}$

). Here, matrix

$\mathbf {S}(\mathbf {p})$

is a function of vector

$\mathbf {p}$

, where

$\mathbf {p}\in \Omega ^p\subset {\mathbb {R}}^{N}$

is constructed from elements of vector

$\mathbf {x}$

as

$\mathbf {p}\sqsubseteq \mathbf {x}$

, hence,

$\Omega ^p\sqsubseteq \Omega ^x$

. This is still a linear structure, however, it represents a nonlinear mapping between

$\mathbf {x}$

and

$\mathbf {y}$

—hence the name quasi-linear structure.

The following method approximates $\mathbf {S}(\mathbf {p})$ in a polytopic form as

$\begin{align*} \mathbf {S}(\mathbf {p})=\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {w}_n(p_n). \tag{43} \end{align*}$ View Source

It is a crucial point to define those variables in (41) which will be considered as parameters as well. This can usually be done in more than a single way. For instance, consider the following example:

$\begin{align*} y=x_1x_2+x_1 \tag{44} \end{align*}$

View Source

which can be given as

$\begin{align*} y=\left[{\begin{array}{cc}1 & p_1 \end{array}}\right]\left[{\begin{array}{c}x_1 \\ x_2 \end{array}}\right]=\mathbf {S}(x_1)\mathbf {x}\tag{45} \end{align*}$

View Source

where

$p_1=x_1$

. However, another solution (leading to a completely different structure) would be to write

$\begin{align*} y=\left[{\begin{array}{cc}p_2+1 & 0 \end{array}}\right]\left[{\begin{array}{c}x_1 \\ x_2 \end{array}}\right]=\mathbf {S}(x_2)\mathbf {x}\tag{46} \end{align*}$

View Source

where

$p_2=x_2$

. This example shows that one has to select the desired parameters before starting the numerical reconstruction.

Based on this, first we need to find the components $f_i(\mathbf {x}^p_i)$ of the function as

$\begin{align*} f(\mathbf {x})=\sum _{i=1}^{I} f_i(\mathbf {x}^p_i)x_i \tag{47} \end{align*}$ View Source

where vector

$\forall i: \mathbf {x}^p_i\sqsubseteq \mathbf {x}$

(

$p$

denotes that these values are in parameter vector as well). The product with one of the elements

$x_i$

of

$\mathbf {x}$

is necessary since each element of matrix

$\mathbf {S}$

will be multiplied with one element of the vector

$\mathbf {x}$

. Once the components are defined, the elements of vectors

$\mathbf {x}^p_i$

define the parameter

$p_n$

,

$n=1,\ldots,N$

of parameter vector

$\mathbf {p}$

. In the above-mentioned example:

$f(\mathbf {x})=f(x_1)x_2+x_1$

or

$f(\mathbf {x})=f(x_2)x_1 +x_1=(f(x_2)+1)x_1$

. Thus, the candidate for the parameter vector is

$p_1=x_1$

or

$p_2=x_2$

.

Assume that we have the parameter vector $\mathbf {p}\sqsubseteq \mathbf {x}$ and the components $f_i(\mathbf {p}^p_i)$ , $\mathbf {p}^p_i\sqsubseteq \mathbf {p}$ of

$\begin{align*} f(\mathbf {x},\mathbf {p})=\sum _{i=1}^{I} f_i(\mathbf {p}^p_i)x_i. \tag{48} \end{align*}$ View Source

Then, we execute the following method to numerically reconstruct

$\begin{align*} \mathbf {S}(\mathbf {p})=\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^{N}\mathbf {w}_n(p_n). \tag{49} \end{align*}$

View Source

For the sake of simplicity, we extend all components

$f_i(\mathbf {p}^p_i)$

with zero blocks of those parameters which are not in

$\mathbf {p}^p_i$

to obtain

$\begin{align*} f_i(\mathbf {p})=f_i(\mathbf {p}^p_i)+\sum _j 0p_j \tag{50} \end{align*}$

View Source

where

$\forall j:p_j$

is not included in

$\mathbf {p}^p_r$

, but is included in

$\mathbf {p}$

. Thus, we have

$\begin{align*} f(\mathbf {x},\mathbf {p})=\sum _{i=1}^{I} f_i(\mathbf {p})x_i \tag{51} \end{align*}$

View Source

where

$\mathbf {p}\sqsubseteq \mathbf {x}$

.

Method 4:

Reconstruct the quasi-linear structure

Step 1: Define a discretization grid $\mathcal {G}^p$ to $\Omega ^p\sqsubseteq \Omega ^x$ with density $M^p$ .
Step 2: Define a linearization grid $\mathcal {G}^x_{m_1,m_2,\ldots,m_N}$ to each element of $\mathcal {G}^p$ , where the linearization space $\Omega ^x_{m_1,m_2,\ldots,m_N}$ is around $\mathbf {g}^p_{m_1,m_2,\ldots,m_N}$ of $\mathcal {G}^p$ . Obviously, the interval $\omega ^x_i$ of those $x_i$ which are not selected to be a parameter can be set arbitrarily within $\Omega ^x$ .
Step 3: Execute linearization according to Method 2 for each $\mathbf {g}^p_{m_1,m_2,\ldots,m_N}\in \mathcal {G}^p$ as
$\begin{align*} \mathcal {F}^{\mathcal {G}^x_{m_1,m_2,\ldots,m_N}}=f(*\mathcal {G}^x_{m_1,m_2,\ldots,m_N},\mathbf {g}_{m_1,m_2,\ldots,m_N}^p) \tag{52} \end{align*}$ View Sourceand $\begin{align*} \mathbf {S}_{m_1,m_2,\ldots,m_N}=\left(\mathcal {F}^{\mathcal {G}^x_{m_1,m_2,\ldots,m_N}} \right)_{\left[{\begin{array}{c}I+1 \end{array}}\right]} \Psi \tag{53} \end{align*}$ View Sourcewhere $\begin{align*} \Psi =\left(\left(\mathcal {G}^x _{m_1,m_2,\ldots,m_N}\right)_{\left[{\begin{array}{c}I+1 \end{array}}\right]}\right)^+. \tag{54} \end{align*}$ View Source
Step 4: Since tensor $\mathcal {S}^{\mathcal {G}^p}\in {\mathbb {R}}^{M^p_1\times M^p_2 \times \cdots \times M^p_N \times O \times I}$ constructed from $\mathbf {S}_{m_1,m_2,\ldots,m_N}$ is the discretized variant of $\mathbf {S}(\mathbf {p})$ over $\mathcal {G}^p$ , we can continue with the second step of the TP model transformation (incorporating all the beneficial properties of the TP model transformation), that results in
$\begin{align*} \mathcal {S}^{\mathcal {G}^p}=\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {U}_n \tag{55} \end{align*}$ View Sourceand finally, we have $\begin{align*} \mathbf {S}(\mathbf {p})=\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {w}_n(p_n) \tag{56} \end{align*}$ View Sourcethus $\begin{align*} \mathbf {y}=\left(\mathcal {S}\mathop {\boxtimes }\limits _{n=1}^N \mathbf {w}_n(p_n) \right)\mathbf {x}\tag{57} \end{align*}$ View Sourcewhere $p_n$ are the elements of vector $\mathbf {x}$ .

Remark 2:

A bilinear TP model can be derived directly from $\mathbf {U}_n$ , however, in order to reconstruct the weighting functions over any given $p_n$ , $\mathbf {S}$ must be linearized for the appropriate $\mathbf {p}$ .

SECTION VIII.

Examples

This section demonstrates how the above-discussed theories may be applied to the model of the inverted pendulum. An implementation of the relevant procedures is provided in the MATLAB language using the TPtool toolbox (which can be downloaded through the Wikipedia page on TP model-based control). The variables in the MATLAB code use the same notation as was used in the previous sections.

A. Model of the Inverted Pendulum

Assume that we have an inverted pendulum, such that the mass of the pendulum is $m_p$ , the mass of the cart is $M_c$ , the length of the pendulum is $l$ , the motor force is $u$ , and the angle of the pendulum is $\theta$ . The inverted pendulum has the following state-space model representation:

$\begin{align*} \dot{\mathbf {x}}=f(\mathbf {x},u)=\left[{\begin{array}{c}x_2 \\ f_1(\mathbf {x},u) \\ x_4 \\ f_2(\mathbf {x},u) \end{array}}\right]\tag{58} \end{align*}$ View Source

where

$x_1=\theta$

,

$x_2=\dot{\theta }$

,

$x_3=s$

,

$x_4=\dot{s}$

, and

$\begin{align*} f_1 &= \frac{a_1+a_2}{l(M_c+m_p\sin ^2(x_1))} \tag{59} \\ f_2 &= \frac{a_3+a_4}{M_c+m_p\sin ^2(x_1)} \tag{60} \end{align*}$

View Source

where

$\begin{align*} a_1 &= (M_c+m_p)g\sin (x_1)+\cos (x_1)u \tag{61} \\ a_2 &= b\cos (x_1)x_4-m_pl\sin (x_1)\cos (x_1)x_2^2 \tag{62} \\ a_3 &= -m_pg\sin (x_1)\cos (x_1)-bx_4+u \tag{63} \\ a_4 &= m_pl\sin (x_1)x_2^2. \tag{64} \end{align*}$

View Source

Let

$M_c=3\text{ kg}$

,

$m_p=0.2\text{ kg}$

,

$l=0.31\text{ m}$

,

$b=0.1\text{ N/ms}^{-1}$

. The MATLAB code of the models is:

function DX=model(X)

th=X(1); dth=X(2); x=X(3); dx=X(4);

u=X(5); si=sin(th); co=cos(th);

Mc=3;mp=0.2;l=0.31;b=0.1;g=9.88; s1l=l*(Mc+mp*si*si); s2l=Mc+mp*si*si;

T1=(Mc+mp)*g*si/s1l; T2=b*co*dx/s1l;

T3=-mp*l*si*co*dth*dth/s1l;

T4=-co*u/s1l;

P1=-mp*g*si*co/s2l; P2=-b*dx/s2l;

P3=mp*l*si*dth*dth/s2l; P4=u/s2l;

DX=[dth T1+T2+T3+T4 dx P1+P2+P3+P4]’;

B. Example 1: Reconstruct the Linear Structure

Assume that the equations of the above model are unknown (the code is unknown), only the input $\mathbf {x},u$ and output $\dot{\mathbf {x}}$ pairs are available. In order to replace $\dot{\mathbf {x}}=f(\mathbf {x})$ with $\dot{\mathbf {x}}=\mathbf {S}\mathbf {x}$ , we execute Method 2. First, let us define a grid as

function g=Getgrid(m,M,Omega,O)

a=length(m); for i=1:a a=(m(i)-1)*2*Omega(i)/(M-1);

g(i)=O(i)-Omega(i)+a; end

Remark 3:

O defines the center of the grid. Omega defines the intervals around these center points (thus, the value in Omega is at the half of the interval) for each dimension.

Then, the linearization is

function S=linear(Omega,M,O)

for m1=1:M for m2=1:M for m3=1:M

for m4=1:M for m5=1:M

m=[m1 m2 m3 m4 m5];

g=(Getgrid(m,M,Omega,O))’;

G(m1,m2,m3,m4,m5,:)=g;

DXT(m1,m2,m3,m4,m5,:,:)=model(g);

end end end end end

Xu=ndim_unfold(G,6);

DXu=ndim_unfold(DXT,6);

S=DXu*pinv(Xu)

The overall algorithm can then be written as follows:

clear;

Omega=[1.6 1.6 1 1 1]; M=5;

O=[0 0 0 0 0];% center of Omega

S=linear(Omega,M,O);

The resulting $\mathbf {S}$ is the best approximation in least-square sense. We may increase the grid density M as high as possible. The linearization error is quite large over $\Omega$ . For control design purposes, let us linearize the model for $\theta$ and $\dot{\theta }$ $\approx 0$ (the typical stabilisation point). Let us execute the above algorithm with Omega=[1e-10 1e-10 1 1 1]; The resulting $\mathbf {S}$ is

$\begin{align*} \left[{\begin{array}{ccccc}0 & 1 & 0 & 0 & 0 \\ 33.75 & 0 & 0 & 0.11 & -1.08 \\ 0 & 0 & 0 & 1 & 0 \\ -0.65 & 0 & 0 & -0.03 & 0.33\end{array}}\right]. \tag{65} \end{align*}$ View Source

Indeed, we arrive at the same matrix when we substitute

$\sin (x_1)\approx 0$

and

$\cos (x_1)\approx 1$

into (58). The conclusion is that, using Method 2 we can numerically reconstruct the linearization of the model at a global scale or over any desired point, without analytical derivation and even in cases where the closed formulae of the model are not given.

C. Example 2: Extended TP Model Transformation

If the simple replacement of the model with $\dot{\mathbf {x}}=\mathbf {S}\mathbf {x}$ is not enough, because of strong nonlinearities, we may execute Method 3 to find the qLPV representation. Method 3 generates the TP model (or a T–S fuzzy model)-based qLPV representation. This example demonstrates how the proposed extension to the TP model transformation can be used in cases where the state vector $\mathbf {x}$ , input $\mathbf {u}$ , and $\dot{\mathbf {x}}$ , and hence, the size of the system matrix, are known as in Example 1; however, the components of the qLPV structure, such as the elements of the parameter-dependent system matrix $\mathbf {S}(\mathbf {p})$ and parameter vector $\mathbf {p}$ are not known, so that the matrix $\mathbf {S}(\mathbf {p})$ cannot be sampled directly.

Assume again that we have $\dot{\mathbf {x}}=f(\mathbf {x},u)$ , and the goal is to extract

$\begin{align*} \dot{\mathbf {x}}=\mathbf {S}(\mathbf {p})\left[{\begin{array}{c}\mathbf {x}\\ u \end{array}}\right]. \tag{66} \end{align*}$ View Source

Let us extend the algorithm so that we can perform linearization at each gridpoint over a linearization grid. In order to achieve more stable computation, we use the pseudoinverse at the gridpoint O. Thus, a small modification in the MATLAB function LINEAR is necessary: G(m1,m2,m3,m4,m5,:)=O; Then, the algorithm is:

D=3; r=1e-10; M=10; LOmega=[r r r r r];

Omega=[1.6 1.6 1 1 1]; O=[0 0 0 0 0];

for m1=1:M for m2=1:M for m3=1:M

for m4=1:M for m5=1:M

m=[m1 m2 m3 m4 m5];

g=(Getgrid(m,M,Omega,O))’;

S=linear(LOmega,D,g);

ST(m1,m2,m3,m4,m5,:,:)=S;

end end end end end

Remark 4:

LOmega and r define the linearization subspace for each grid. D is the sampling density for the linearization.

Next, we extract the convex TP model form with SNNN-type weighting functions and vertexes [S,U,sv]=hosvd(ST,[1 1 1 1 1 0 0],1e-11);

for i=1:5 Uc{i}=genhull(U{i},'snnn’);

figure(i); plot(Uc{i},’LineWidth’,3);

Ucp{i}=pinv(Uc{i});

end

Sc=tprods(S,Ucp);

The SNNN weighting functions are determined in cell U and the related vertexes are stored in Sc. Thus, finally we obtain

$\begin{align*} f(\mathbf {x},u)=f(\mathbf {x},u,\mathbf {p})\left[{\begin{array}{c}\mathbf {x}\\ u \end{array}}\right]= \mathcal {S}\mathop {\boxtimes }\limits _n \mathbf {W}_n(p_n)\left[{\begin{array}{c}\mathbf {x}\\ u \end{array}}\right]\tag{67} \end{align*}$ View Source

where

$p_1=x_1$

,

$p_2=x_2$

,

$p_3=x_3$

,

$p_4=x_4$

, and

$p_5=u$

.

D. Example 3: Varying the Components of the Vertexes

This example shows how we can easily modify the structure of the pendulum model and generate various alternative T–S fuzzy models in a few minutes without any analytical derivation of closed formulae. Exploring such variations will allow us to find the best option for further design steps. The state-space model of (58) is

$\begin{align*} \dot{\mathbf {x}}=\left[{\begin{array}{ccccc}0 & 1 & 0 & 0 & 0 \\ 0 & t_1/x_2+t_2 & 0 & bt_3 & t_3 \\ 0 & 0 & 0 & 1 & 0 \\ 0 & v_1/x_2+v_2 & 0 & -b/h_2 & 1/h_2 \end{array}}\right]\left[{\begin{array}{c}\dot{\mathbf {x}}\\ u \end{array}}\right]\tag{68} \end{align*}$ View Source

where

$\begin{align*} h_1 &= l(M_c+m_p\sin ^2(x_1)) \tag{69} \\ h_2 &= M_c+m_p\sin ^2(x_1) \tag{70} \\ t_1 &= (M_c+m_p)b\sin (x_1)/h_1 \tag{71} \\ t_2 &= m_pl\sin (x_1)\cos (x_1)x_2/h_1 \tag{72} \\ t_3 &= \cos (x_1)/h_1 \tag{73} \\ v_1 &= -m_pg\sin (x_1)\cos (x_1)/h_2 \tag{74} \\ v_2 &= m_pl\sin (x_1)x_2/h_2. \tag{75} \end{align*}$

View Source

Its MATLAB code is

function S=modelS(P)

dth=P(2); si=sin(P(1)); co=cos(P(1));

Mc=3;mp=0.2;l=0.31;b=0.1;g=9.88; s1l=l*(Mc+mp*si*si); s2l=Mc+mp*si*si;

T1=(Mc+mp)*g*si/s1l; T2=b*co/s1l;

T3=-mp*l*si*co/s1l; T4=-co/s1l;

P1=-mp*g*si*co/s2l; P2=-b/s2l;

P3=mp*l*si*dth*dth/s2l; P4=1/s2l;

S=[0 1 0 0 0; 0 T1/dth+T3 0 T2 T4;

0 0 0 1 0; 0 P1/dth+P3 0 p2 p4];

The nonlinearity is caused by $x_1$ and $x_2$ . We can execute the TP model transformation (in order to avoid division by zero, the number of grids are set to an even number) as follows:

M=50; Omega=[1.6 1.6]; O=[0 0];

for m1=1:M for m2=1:M

m=[m1 m2];

g=(Getgrid(m,M,Omega,O))’;

ST(m1,m2,:,:)=modelS(g);

end end

[S,U,sv]=hosvd(ST,[1 1 0 0],1e-11);

for i=1:2 Uc{i}=genhull(U{i},'snnn’);

figure(i); plot(Uc{i},’LineWidth’,3);

Ucp{i}=pinv(Uc{3});

end

Sc=tprods(S,Ucp);

In the end, we arrive at the weighting functions depicted in Fig. 1. Thus, the T–S fuzzy model of the pendulum is

$\begin{align*} \dot{\mathbf {x}}=\mathbf {S}(p_1,p_2) \left[{\begin{array}{c}\mathbf {x}\\ u \end{array}}\right]=\mathcal {S}\mathop {\boxtimes }\limits _n^2 \mathbf {W}_n(p_n)\left[{\begin{array}{c}\mathbf {x}\\ u \end{array}}\right]\tag{76} \end{align*}$ View Source

where

$p_1=x_1$

and

$p_2=x_2$

. The number of rules are

$3\times 5=15$

. Various alternative parameter-dependent system matrices can be derived, for instance

$\begin{align*} \dot{\mathbf {x}}=\left[{\begin{array}{ccccc}0 & 1 & 0 & 0 & 0 \\ t1/x_1 & t_2 & 0 & bt_3 & t_3 \\ 0 & 0 & 0 & 1 & 0 \\ v_1/x_1 & v_2 & 0 & -b/h_2 & 1/h_2 \end{array}}\right]\left[{\begin{array}{c}\dot{\mathbf {x}}\\ u \end{array}}\right]. \tag{77} \end{align*}$

View Source

By applying the TP model transformation, all the different T–S fuzzy models can easily be derived from the differently structured system matrices. The type of the convex hull defined by the vertices, namely the type of the weighting functions, can also be manipulated easily, as mentioned in Section II.

Fig. 1.

Weighting functions of $p_1$ and $p_2$ , see Example 3.

Show All

E. Example 4: Modifying the Parameter Space

This example shows that the parameter space of the model can also readily be changed. Let us have $p_1=\sin (x_1)$ , $p_2=\cos (x_2)$ , $p_3=x_2$ , and let us change the first line of function MODELS accordingly, as follows:

function S=modelS(P)

dth=P(3); si=P(1); co=P(2);

We then execute the TP model transformation to derive CNO-type weighting functions as follows:

M=50; Omega=[1.6 1.6 0.6]; O=[0 0 0];

for m1=1:M for m2=1:M for m3=1:M

m=[m1 m2 m3];

g=(Getgrid(m,M,Omega,O))’;

ST(m1,m2,:,:)=modelS(g);

end end

[S,U,sv]=hosvd(ST,[1 1 1 0 0],1e-11);

for i=1:3 Uc{i}=genhull(U{i}, ‘cno’);

figure(i); plot(Uc{i}, ‘LineWidth’,3);

Ucp{i}=pinv(Uc{3});

end

Sc=tprods(S,Ucp);

Finally, we obtain the weighting function depicted in Fig. 2. Thus, we can express the T–S fuzzy model as follows:

$\begin{align*} \dot{\mathbf {x}}=\mathbf {S}(p_1,p_2,p_3) \left[{\begin{array}{c}\mathbf {x}\\ u \end{array}}\right]=\mathcal {S}\mathop {\boxtimes }\limits _n^3 \mathbf {W}_n(p_n)\left[{\begin{array}{c}\mathbf {x}\\ u \end{array}}\right]. \tag{78} \end{align*}$ View Source

Indeed, the dimensionality of the model has increased, but at the same time the number of antecedents in the different dimensions are reduced, and only 18 rules remain. Therefore, it is much easier to manipulate the weighting functions for further control design purposes, i.e., to obtain a CNO-type derivation. We can go further and define the following setting

$p_1=\sin (x_1)$

,

$p_2=\cos (x_1)$

,

$p_3=x_2$

, and

$p_4=1/x_2$

. The resulting weighting functions are depicted in Fig. 3. The number of rules is 24, however, the most simple qLPV model is obtained. Again, we may define a number of variations easily using the presented framework and see which one leads to the best control performance.

Fig. 2.

Weighting functions of $p_1$ , $p_2$ , and $p_3$ , see Example 4.

Show All

Fig. 3.

Weighting functions of $p_1$ , $p_2$ , $p_3$ , and $p_4$ , see Example 4.

Show All

SECTION IX.

Comparison to Other Solutions

If we compare the proposed TP model transformation-based framework to its previous version, the most important difference to highlight is that the proposed version is applicable to a considerably larger class of models. Namely, the previous version was applicable to

$\begin{align*} \dot{\mathbf {x}}=\mathbf {S}(\mathbf {x},\mathbf {p},u) \left[{\begin{array}{c}\mathbf {x}\\ \mathbf {u} \end{array}}\right]\tag{79} \end{align*}$ View Source

where the linear matrix structure is already determined, but the current, extended version is applicable to

$\begin{align*} \dot{\mathbf {x}}=f(\mathbf {x},u,\mathbf {p}) \tag{80} \end{align*}$

View Source

or even to

$\begin{align*} \mathbf {x}=f(\mathbf {x},u) \tag{81} \end{align*}$

View Source

where the elements of

$\mathbf {x}$

,

$\dot{\mathbf {x}}$

,

$\mathbf {u}$

, and the dimensionality of

$\mathbf {S}(\mathbf {p})$

are available; however, the inner structure of the system matrix

$\mathbf {S}(\mathbf {p})$

is not known and the parameter vector is not revealed. This means that the proposed version of the TP model transformation offers a very powerful way to extract various alternative T–S fuzzy models of a given dynamic model.

One of the most frequently used approaches to convert a given dynamic model to a T–S fuzzy model is the sector nonlinearity method. Many of the publications dealing with T–S fuzzy control use the sector nonlinearity method to convert the dynamic equations to the T–S fuzzy model. If we compare the proposed method to the sector nonlinearity method, the most important difference is that the TP model transformation provides various features to manipulate all the parameters of the T–S fuzzy model, i.e., the shape of the antecedents, the number of fuzzy rules, and the convex hull defined by the vertexes. It provides an exact HOSVD-based canonical form and complexity tradeoff. The sector nonlinearity based solution is not equipped with such important features. Since the manipulation of the parameters of the T–S fuzzy model is a crucial step, as mentioned in the introduction, the manipulation power of the TP model transformation plays a crucial role in control design. A further important point is that the TP model transformation has a well developed, tractable numerical implementation based on the numerical implementations of HOSVD. Thus, in contrast to the sector nonlinearity method, it is executable automatically with minimal human interaction and has all the same benefits as the HOSVD does for tensors.

SECTION X.

Conclusion

This paper was based on the observation that an identified state-space model can have an infinite number of T–S fuzzy or TP model representations. Such representations can differ in the structure of the model, and in the construction of the parameter vector—especially if elements of the state vector are involved—and the location of the vertexes in the polytopic form. The selection of the representation strongly influences the control design. Therefore, we were required to derive a huge set of variations to see which one of them leads to the best solution. This paper proposed an effective tool to readily define such variations, and placed the tool into the context of the TP model transformation-based control design framework, which allows for the principled use of the tool as part of a broader toolset for controller design.

Extracting LPV and qLPV Structures From State-Space Functions: A TP Model Transformation Based Framework

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

Introduction

Notations and Preliminary Concepts

A. Notations

B. Concepts

Definition 1.

Definition 2.

Definition 3.

Definition 4.

Definition 5.

Method 1.

Problem Outline

Transformation to Linear Structure

Method 2.

Transformation to Parameter-Varying Linear Structure

Method 3:

Remark 1.

Variate the Parameter Dependency

Transformation to Quasi-Linear Structure

Method 4:

Remark 2:

Examples

A. Model of the Inverted Pendulum

B. Example 1: Reconstruct the Linear Structure

Remark 3:

C. Example 2: Extended TP Model Transformation

Remark 4:

D. Example 3: Varying the Components of the Vertexes

E. Example 4: Modifying the Parameter Space

Comparison to Other Solutions

Conclusion

ACKNOWLEDGMENT

Authors

Figures

Fig. 1.

Fig. 2.

Fig. 3.

References

Citations

Keywords

Metrics

References