Design of a Data-Driven Multi PID Controllers using Ensemble Learning and VRFT

Takuya Kinoshita; Yuma Morota; Toru Yamamoto

doi:10.2991/jrnal.k.200512.014

<Previous Article In Issue

Download article (PDF)

Next Article In Issue>

Volume 7, Issue 1, June 2020, Pages 68 - 72

Design of a Data-Driven Multi PID Controllers using Ensemble Learning and VRFT

Authors

Takuya Kinoshita^*, Yuma Morota, Toru Yamamoto

Graduate School of Engineering, Hiroshima University, 1-4-1 Kagamiyama, Higashi-hiroshima city, Hiroshima, Japan

^*Corresponding author. Email: kinoshita-takuya@hiroshima-u.ac.jp

Corresponding Author

Takuya Kinoshita

Received 6 November 2019, Accepted 17 March 2020, Available Online 20 May 2020.

DOI: 10.2991/jrnal.k.200512.014 How to use a DOI?
Keywords: Data-driven control; PID control; ensemble learning
Abstract: Data-driven control has been proposed for directly calculating control parameters using experimental data. Specifically, the Virtual Reference Feedback Tuning (VRFT) has been proposed for linear time-invariant systems. In the field of machine learning, the ensemble learning was proposed to improve the accuracy of prediction by using multiple learners. In this study, a design scheme of data-driven controllers using the ensemble learning and VRFT is newly proposed for linear time-varying systems. The ensemble learning can divide the linear time-varying system into some sections that can be regarded locally as linear time-invariant systems.
Copyright: © 2020 The Authors. Published by Atlantis Press SARL.
Open Access: This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

1. INTRODUCTION

Data-driven control schemes have been proposed for directly designing a controller using a set of experimental data. Virtual Reference Feedback Tuning (VRFT) [1] and Fictitious Reference Iterative Tuning (FRIT) [2] have been proposed as data-driven control schemes. According to these schemes, control parameters can be directly calculated using a set of experimental data. However, VRFT and FRIT work well for only linear time-invariant systems, and it is difficult to obtain good control performance for linear time-varying systems.

Meanwhile, the effectiveness of the deep and machine learning have been demonstrated in the field of image recognition. The ensemble learning such as random forest [3] and adaptive boosting [4] is a machine learning scheme, and it has been proposed to improve the accuracy of prediction by using multiple learners.

This paper proposes a design scheme of data-driven controllers using ensemble learning and VRFT for linear time-varying systems. Specifically, a linear time-invariant system is first divided into some linear systems by applying ensemble learning based on decision tree learning. Second, VRFT is applied to each divided linear system for obtaining multiple linear controllers.

2. DESIGN OF DATA-DRIVEN CONTROLLER USING ENSEMBLE LEARNING

Figure 1 shows the schematic figure of the proposed data-driven control system where multiple controllers (controller-1, controller-2,…, controller-n) are designed using ensemble learning using the initial closed-loop data. First, the closed-loop data is divided into some linear time-invariant systems based on time t (t₁, t₂,…, t_n₋₁). Next, control parameters are calculated using VRFT for each divided data (Divided data-1, Divided data-2, …, Divided data-n).

2.1. Virtual Reference Feedback Tuning

Figure 2 shows the block diagram of VRFT. P(z⁻¹) is a system plant and C(z⁻¹, θ) shows the controller. θ is a control parameters. The optimization problem is adjusting θ so that closed-loop transfer function W(z⁻¹) closed to the following desired reference model G_m(z⁻¹) [5].

$Gm(z−1)=z−1T(1)T(z−1)$ (1)

$T(z−1)=1+t1z−1+t2z−2,$ (2)

where

${t1=−2exp(−ρ2μ)cos(4μ−12μρ)t2=exp(−ρμ)ρ=Tsσμ=0.25(1−δ)+0.51δ.$ (3)

T_s is the sampling time, σ and δ are the user-specified parameters related to the rise characteristic and attenuation characteristic of control system.

In order to match W(z⁻¹) to G_m(z⁻¹), one-shot experimental input u₀(t) and output y₀(t) is obtained. Next, a pseudo reference input $r˜(t)$ is calculated as follows:

$r˜(t)=1Gm(z−1)y0(t).$ (4)

The output $u˜(t)$ is denoted as Equation (5).

$u˜(t)=C(z−1,θ)Δe˜(t)$ (5)

$e˜(t)=r˜(t)−y0(t)$ (6)

The following equation denotes the objective function J.

$J=12∑t=1N{ε(t,θ)}2$ (7)

$ε(t,θ)=u0(t)−u˜(t)$ (8)

In this paper, I − P controller is used and the control parameter θ which minimizes the evaluation function J. From Figure 2, $u˜(t)$ becomes given by

$u˜(t)=−KPy0(t)+KIe˜(t)Δ$ (9)

By putting $e˜(t)/Δ=x(t)$ , Equation (9) is rewritten as follows:

$u˜(t)=−KPy0(t)+KIx(t)=θφ(t)$ (10)

Here, the following equations denote θ and φ(t).

$θ=[KPKI]T$ (11)

$φ(t)=[−y0(t)x(t)]$ (12)

The control parameter θ^* which minimizes the objective function J is obtained as Equation (13).

$θ*=(ΦTΦ)−1ΦTU$ (13)

$Φ=[φ(1)φ(2)⋯φ(N)]T$ (14)

$U=[u0(1)u0(2)…u0(N)]T$ (15)

2.2. Design of Learner to Divide Closed-loop Data

In this paper, a learner is designed to divide closed-loop data into n systems in Figure 3 is designed. Decision tree learning is a scheme of assigning a data set allocated to a parent node to a child node according to a split function h^* and creating a decision tree such as Figure 4. The split function h^* is obtained as follows:

$h*=arg max I$ (16)

I is an evaluation function showing the degree of variation of the class in the child node which divided the parent node. Based on the division scheme of the decision tree, the optimum time to split closed-loop data

$t*(t1*,t2*,⋯,tn−1*)$ is obtained using the evaluation function I as follows:

$t*=arg max I(t)$ (17)

Here, the evaluation function I is defined as follows:

$I(t)=−∑k=1nJk*$ (18)

$Jk*$ is the minimum value in Equation (7) of the k^th system obtained by dividing the closed-loop data at time t.

2.2.1. Ensemble learning

Ensemble learning is a scheme of integrating multiple learners to generate one learner. If the accuracy of each learner is higher than 50%, it is known that higher accuracy is achieved than one learner since the learner that estimates erroneously becomes a minority.

In this paper, ‘fminsearch.m’ in MATLAB & Simulink Ver. 9.4.0813654 (R2018a), Optimization Toolbox is used to find t^*.

2.3. Controller Selection Method based on the Controller’s Plane

Equation (10) is as follows when the closed-loop transfer function at the top of Figure 2 and the transfer function G_m(z⁻¹) are equal.

$u0(t)=−KPy0(t)+KIe(t)Δ$ (19)

$−KPy0(t)+KIe(t)Δ−u0(t)=0$ (20)

The following equation defined the equation of the plane passing the origin with x, y, z as axes.

$ax+by+cz=0$ (21)

From Equations (20) and (21) can be regarded as an equation of a plane passing the origin with y, e/Δ, and u as axes. From Equation (20), a linear time-invariant system can be represented by a plane and the coefficients consist of PI gains. From this property, input and output data are plotted in a space with y, e/Δ, and u as axes, and a controller corresponding to a plane closest to the plotted points is sequentially selected.

3. NUMERICAL EXAMPLE

3.1. Control Object and Setting Parameters

In this numerical example, the following system is discussed.

$G(s)={51+20s(0≤t<150)31+50s(150≤t<350)11+10s(350≤t≤600)$ (22)

The reference signal r(t) is given as follows:

$r(t)={100(0≤t<200)150(200≤t<400)50(400≤t≤600)$ (23)

White Gaussian noise with zero mean and a variance of (1/3)² is added to the controlled object. The reference model G_m(z⁻¹) was set as Equation (24).

$Gm(z−1)=0.0392z−11−1.6057z−1+0.6449z−2$ (24)

3.2. Simulation Result

The control result with fixed PI gains which are calculated by VRFT is shown in Figure 5. PI gains are calculated as follows:

$KP=−0.45, KI=0.10$ (25)

K_P is negative gain because the system has three characteristics of Equation (22). In Figure 5, the overshoot has occurred on the output with fixed PI gains and good control results are not obtained. The sum of squared errors is 708.7 in Table 1.

	Σ e(t)²
Conventional scheme (Figure 5)	708.7
Proposed scheme (Figure 6)	653.9

Table 1

Sum of squared errors

Figure 6 shows the control result applying the proposed scheme. Here, the number of weak learners is 10. Figures 7 and 8 respectively show the estimation result of the system change step in the weak learners and the transition of PI gains in the proposed scheme. Figure 6 shows that good control result can be obtained using the proposed scheme. In addition, Figures 7 and 8 show that the system change step is estimated accurately and PI gains are changed at the proper timing. Finally, the sum of squared errors is 653.9 in Table 1, and the proposed scheme is better than the conventional scheme.

4. CONCLUSION

In this paper, a new control scheme has been proposed. It is a scheme of dividing a linear time-varying system into multiple linear time-invariant systems and applying VRFT to calculate multiple linear controllers. In the numerical example, the effectiveness of the proposed scheme has been shown.

In the future works, it is necessary to optimize the number of divisions for an unknown system. In addition, the least squares method is susceptible to noises other than normal distribution. Therefore, further consideration is needed such as incorporating bagging [6] to suppress the effects of noise.

CONFLICTS OF INTEREST

The authors declare they have no conflicts of interest.

AUTHORS INTRODUCTION

Dr. Takuya Kinoshita

He received his B.Eng., M.Eng. and D.Eng. from Hiroshima University in Japan in 2013, 2015 and 2017, respectively. He was postdoctoral fellow of JSPS (Japan Society for the Promotion of Science) in 2017. He is currently an Assistant Professor with the Department of System Cybernetics, Graduate School of Engineering, Hiroshima University, Japan. His research interests are performance driven control.

Mr. Yuma Morota

He received his B.Eng. From Hiroshima University in Japan in 2019. He is currently a Master course student in Hiroshima University, Japan.

Prof. Toru Yamamoto

He received the B.Eng. and M.Eng. degrees from the University of Tokushima, Tokushima, Japan, in 1984 and 1987, respectively, and the D.Eng. degree from Osaka University, Osaka, Japan, in 1994. He is currently a Professor with the Department of System Cybernetics, Graduate School of Engineering, Hiroshima University, Japan. He was a Visiting Researcher with the Department of Mathematical Engineering and Information Physics, University of Tokyo, Tokyo, Japan, in 1991. He was an Overseas Research Fellow of the Japan Society for Promotion of Science with the University of Alberta for 6 months in 2006. His current research interests are in the area of data-driven control, and process control. He was the recipient of the Commendation for Science and Technology by the Minister of Education, Culture, Sports and Technology in 2009.

REFERENCES

[1]MC Campi, A Lecchini, and SM Savaresi, Virtual reference feedback tuning: a direct method for the design of feedback controllers, Automatica, Vol. 38, 2002, pp. 1337-1346.

[2]S Soma, O Kaneko, and T Fujii, A new approach to parameter tuning of controllers by using one-shot experimental data - a proposal of fictitious reference iterative tuning, Trans. Inst. Syst. Control Inform. Eng., Vol. 17, 2004, pp. 528-536. (in Japanese).

[3]L Breiman, Random forests, Mach. Learn., Vol. 45, 2001, pp. 5-32.

[4]Y Freund and RE Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, J. Comput. Syst. Sci., Vol. 55, 1997, pp. 119-139.

[5]T Yamamoto and SL Shah, Design and experimental evaluation of a multivariable self-tuning PID controller, IEEE Proc. Control Theory Appl., Vol. 151, 2004, pp. 645-652.

[6]L Breiman, Bagging predictors, Mach. Learn., Vol. 24, 1996, pp. 123-140.

<Previous Article In Issue

Download article (PDF)

Next Article In Issue>

Journal: Journal of Robotics, Networking and Artificial Life
Volume-Issue: 7 - 1
Pages: 68 - 72
Publication Date: 2020/05/20
ISSN (Online): 2352-6386
ISSN (Print): 2405-9021
DOI: 10.2991/jrnal.k.200512.014 How to use a DOI?
Open Access: This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

ris enw bib

TY  - JOUR
AU  - Takuya Kinoshita
AU  - Yuma Morota
AU  - Toru Yamamoto
PY  - 2020
DA  - 2020/05/20
TI  - Design of a Data-Driven Multi PID Controllers using Ensemble Learning and VRFT
JO  - Journal of Robotics, Networking and Artificial Life
SP  - 68
EP  - 72
VL  - 7
IS  - 1
SN  - 2352-6386
UR  - https://doi.org/10.2991/jrnal.k.200512.014
DO  - 10.2991/jrnal.k.200512.014
ID  - Kinoshita2020
ER  -

download .riscopy to clipboard