Linearization - System Analysis and Control

Smooth functions look like straight lines when you zoom in close enough to any point. It turns out something similar is true of dynamical systems. If we “zoom in close enough,” it will look like a linear system. In this section, we will make this idea precise and show how to derive linear approximations of nonlinear dynamical systems.

Keep in mind that a linearized model is only accurate near the operating point about which we linearize. Any controller we later design from such a model must therefore tolerate this modeling error. This tolerance—robustness—is a theme we will return to throughout the course.

Linearizing a function¶

Linearization is an approximation method. Given a function $f(x)$ , consider a reference point $x_0$ . If $f$ is a smooth function, $f(x)$ near $x=x_0$ can be approximated by drawing a tangent line on the graph and using the line as the approximation. The line will have slope $f'(x_0)$ , so the linear approximation will have equation:

f(x) \approx f(x_0) + f'(x_0)(x-x_0)

(1)

We can write this another way. Define $y=f(x)$ and $y_0=f(x_0)$ . Then define the deviations from the reference point as

\delta x := x - x_0\qquad\textsf{and}\qquad \delta y := y - y_0

(2)

and we can write our linearized equation as:

\delta y = f'(x_0) \delta x

(3)

Now it truly looks like a linear equation $y=kx$ , but it is linear in the deviations from the reference point. It is not linear in the original coordinates $(x,y)$ .

Linearization of a function f(x) about a point x_0. — Figure 1:Linearization of a function $f(x)$ about a point $x_0$ .

Example: approximating the square root¶

Let’s approximate $\sqrt{4.01}$ via linearization.

We know that $4.01$ is close to $4$ and $\sqrt{4}=2$ , so let’s set $f(x) = \sqrt{x}$ and pick the reference point $x_0=4$ . Then we have $f(x_0) = \sqrt{4} = 2$ and $f'(x) = \frac{1}{2\sqrt{x}}$ , so $f'(x_0) = \frac{1}{2\cdot 2} = \frac{1}{4}$ . Therefore, our linearized approximation near $x_0=4$ is:

f(x) \approx 2 + \frac{1}{4}\delta x = 2 + \frac{1}{4}(x-4)

(4)

In our example, $\delta x = 0.01$ , so we have $\sqrt{4.01} \approx 2.0025$ .

The true value is $\sqrt{4.01} \approx 2.002498$ , so our approximation is quite accurate!

Multivariable functions¶

We can make an analogous approximation for multivariable functions, except we approximate the function using a tangent hyperplane rather than a tangent line. For example, for a function $f(x,y)$ , we can approximate it about the point $(x_0,y_0)$ as

f(x,y) \;\approx\; f(x_0,y_0) \;+\; \left.\frac{\partial f}{\partial x}\right|_0 (x-x_0) \;+\; \left.\frac{\partial f}{\partial y}\right|_0 (y-y_0)

(5)

where we use the notation $\left.\frac{\partial f}{\partial x}\right|_0$ to mean “take the partial derivative of $f(x,y)$ with respect to $x$ and then evaluate the result at $x=x_0$ and $y=y_0$ ”. Setting $z=f(x,y)$ and $z_0=f(x_0,y_0)$ and defining the deviations $\delta x$ , $\delta y$ , $\delta z$ analogously as before, our linearized equation becomes

\delta z \;=\; \left.\frac{\partial f}{\partial x}\right|_0 \delta x \;+\; \left.\frac{\partial f}{\partial y}\right|_0 \delta y

(6)

Linearization of a function f(x,y) about a point (x_0,y_0). — Figure 2:Linearization of a function $f(x,y)$ about a point $(x_0,y_0)$ .

Equilibrium points¶

For example, consider the pendulum of Eq. (3).

m\ell^2\ddot\theta + mg\ell\sin\theta = T

(7)

An equilibrium point $(\theta_0,T_0)$ is a constant angle $\theta_0$ and constant torque $T_0$ that satisfies (7). All time derivatives vanish and we are left with the equation of motion, we have

T_0=mg\ell\sin\theta_0

(8)

There are many ways to satisfy Eq. (8). Each is a valid equilibrium point.

Pick $\theta_0=0$ and $T_0=0$ . The pendulum is pointing down and we apply no torque.
Pick $\theta_0=\pi$ and $T_0=0$ . The pendulum is pointing up and we apply no torque.
Pick $\theta_0=\tfrac{\pi}{4}$ and $T_0=\tfrac{1}{\sqrt{2}}mg\ell$ . The pendulum is perfectly balanced at 45°.
Pick any $\theta_0$ and use $T_0=mg\ell\sin\theta_0$ .

In all cases, if we use $\theta(0)=\theta_0$ as an initialization and apply the equilibrium torque $T(t)=T_0$ for all $t\geq 0$ , the pendulum will remain at the angle $\theta(t)=\theta_0$ for all $t\geq 0$ .

We typically find equilibrium points in one of two ways. If the system in question has input $u$ and other signals $x$ , then:

We are interested in the zero-input equilibrium, so we use $u_0=0$ and we ask what values of $x_0$ are possible. This is what we did for the first two equilibrium examples for the pendulum above.
We pick a desired $x_0$ , and we ask what is the required $u_0$ that will lead to equilibrium. This is what we did for the last two equilibrium examples for the pendulum above.

Linearizing a dynamical system¶

To linearize a non-LTI system, we carry out the following steps. We will assume the system has inputs $\u = [u_1,\dots,u_m]$ and other signals $\x = [x_1,\dots,x_n]$ (we use the boldface letters to denote vectors).

Steps to linearize a dynamical system

Identify an equilibrium point $(\x_0,\u_0)$ of the system. This should be a choice of constant signals such that the system remains at rest if initialized there. In other words, all time derivatives should be zero.
Linearize nonlinear terms. For every nonlinear term that appears in the system’s equations of motion, apply the single-variable or multivariable linearization (as appropriate) about the reference point defined by the equilibrium point. For example, if the system contains a nonlinear function $f(\x,\u)$ , then we would linearize it about the point $(\x_0,\u_0)$ :

f(\x,\u) \approx f(\x_0,\u_0) + \sum_{j=1}^n\left.\frac{\partial f}{\partial x_j}\right|_0\!(x_j-x_{0j}) + \sum_{k=1}^m\left.\frac{\partial f}{\partial u_k}\right|_0\!(u_k-u_{0k})

(9)

where all derivatives are evaluated at the equilibrium point $(\x_0,\u_0)$ . 3. Shift coordinates to deviations from equilibrium. Replace $\x = \x_0 + \bdelta\x$ and $\u = \u_0 + \bdelta\u$ . Since $\x_0$ and $\u_0$ are constant, derivative terms are $\bdelta\dot\x = \dot\x$ , $\bdelta\ddot\x = \ddot\x$ , and so on. Likewise for derivatives of $\u$ . 4. Simplify the resulting equations. Since $(\x_0,\u_0)$ is an equilibrium point, all constant terms should cancel out and you should be left with an LTI system with input $\bdelta\u$ and other signals $\bdelta\x$ .

The linearized system is a system of ODEs in the variables $(\bdelta\x,\bdelta\u)$ . To recover the original signals, we can shift them back:

\x(t) = \x_0 + \bdelta\x(t) \qquad\textsf{and}\qquad \u(t) = \u_0 + \bdelta\u(t)

(10)

We will spend the rest of this section doing specific examples to understand how to derive linearized approximations in practice.

Example: gravity shift¶

Consider a spring-mass system under the effect of a force $F$ and also gravity.

If we imagine that the spring is at rest without gravity then the equations of motion are:

m\ddot x + kx = F + mg

(11)

This equation is not linear due to the affine term $mg$ . So let’s linearize!

We want the system to be at rest when $F=0$ (no applied force), so let’s pick $F_0=0$ and see what $x_0$ must be. Substituting into the equation of motion, we have

kx_0 = mg

(12)

Therefore $x_0=\frac{mg}{k}$ . This is where the mass will sit so that the spring force perfectly balances out the force of gravity.

Our equation of motion (11) is already affine, so we don’t need to take any derivatives. We can simply substitute our deviations from equilibrium: $x = x_0 + \delta x$ and $F = F_0 + \delta F = \delta F$ . Substituting into (11), we have

m\,\delta \ddot x + k(x_0 + \delta x) = \delta F + mg

(13)

The constant terms cancel out $(kx_0 = mg)$ because of how we chose $x_0$ in (12) and we are left with

\boxed{m\,\delta \ddot x + k\,\delta x = \delta F}

(14)

This is an LTI system in the variables $(\delta x, \delta F)$ . In fact, it looks very similar to our original equations of motion (11), except that the gravity term is gone.

What happened physically? We shifted our coordinate system down by $\frac{mg}{k}$ , so that the new “zero” position is where the mass naturally sits under gravity. In this new coordinate system, there is no gravity term, and if we apply a force from this new “zero” position, the mass will move up or down the same way as without gravity, except shifted down by $\frac{mg}{k}$ . The equations of motion (14) describe exactly this behavior.

Example: cruise control¶

Consider a car moving along a flat road. The motor provides a forward force $F_a$ and the aerodynamic drag provides an opposing force $F_d$ . Let’s derive linearized equations of motion linearized about a nominal speed $v_0$ .

Free body diagram of a car with drag force F_d. — Figure 4:Free body diagram of a car with drag force $F_d$ .

Aerodynamic drag is typically proportional to the square of the speed of the object. This drag force is given by the equation

F_d = \underbrace{\tfrac{1}{2} C_d \, \rho \, A }_{c}\, v^2,

(15)

where $C_d$ is the drag coefficient, $\rho$ is the air density, $A$ is the lateral surface area, and $v$ is the speed of the moving object. We replaced all the constant terms by $c$ for simplicity. The equation of motion for the car is therefore

m\dot v + cv^2 = F_a

(16)

This is not linear due to the $v^2$ term. Let’s linearize about the nominal speed $v_0$ . We will carefully follow the four steps outlined earlier.

Step 1: Let’s identify an equilibrium point. In order to maintain a constant speed $v_0$ , what must be the applied force $F_a$ ? We can substitute into the equation of motion to obtain:

F_0 = c\, v_0^2

(17)

Step 2: Now we linearize all nonlinear terms. The only nonlinear term is the drag force $cv^2$ . Linearize this about $v_0$ using Eq. (1):

\begin{aligned} cv^2 &\approx c v_0^2 + 2 c v_0 ( v - v_0 ) \end{aligned}

(18)

Substituting this into our equation of motion, we obtain

m\dot v + c v_0^2 + 2 c v_0 ( v - v_0 ) = F_a

(19)

Step 3: Now we replace all signals with their deviations from equilibrium: $v = v_0 + \delta v$ and $F_a = F_0 + \delta F$ . Also use the fact that $\dot v = \delta \dot v$ because $v_0$ is constant. Substituting, we have

m\delta \dot v + c v_0^2 + 2 c v_0 \,\delta v = F_0 + \delta F

(20)

Step 4: Finally, we can cancel out the nominal force terms using (17) and we obtain our linearized equation of motion.

\boxed{m\delta \dot v + 2 c v_0 \delta v = \delta F }

(21)

We can quickly check that this is indeed LTI: all terms are linear functions of the deviations $\delta v$ and $\delta F$ .

We can also interpret this result physically. Our linearized equation (21) relates the deviation from $F_0$ to the deviation from $v_0$ . If we let $\delta v = \delta \dot x$ , we can write it as an equation in terms of position:

m\delta \ddot x + 2 c v_0 \,\delta \dot x = \delta F

(22)

This is a spring-mass-damper equation with no spring! Moreover, the damping coefficient depends linearly on the nominal velocity: the faster you go, the more damping you have. Makes sense!

Example: pendulum¶

Returning to the pendulum example illustrated below, let’s find the linearized equations of motion about an arbitrary equilibrium angle $\theta_0$ .

Pendulum with an external torque T and gravity. — Figure 5:Pendulum with an external torque $T$ and gravity.

As mentioned earlier, the equation of motion is

m\ell^2\ddot\theta + mg\ell\sin\theta = T

(23)

Step 1: First, we find the equilibrium torque. As computed earlier,

T_0 = mg\ell\sin\theta_0

(24)

Step 2: Next, we linearize the nonlinear term about $\theta_0$ :

mg\ell\sin\theta \approx mg\ell\sin\theta_0 + mg\ell\cos\theta_0 (\theta-\theta_0)

(25)

Substituting into the equation of motion (23), we obtain

m\ell^2\ddot\theta + mg\ell\sin\theta_0 + mg\ell\cos\theta_0 (\theta-\theta_0) = T

(26)

Step 3: Now substitute $T = T_0 + \delta T$ and $\theta = \theta_0 + \delta\theta$ and obtain

m\ell^2\delta\ddot \theta + mg\ell\sin\theta_0 + mg\ell\cos\theta_0\, \delta\theta = T_0 + \delta T

(27)

Step 4: We can apply (24) to cancel the nominal torque and we get

\boxed{m\ell^2\delta\ddot \theta + mg\ell\cos\theta_0 \,\delta\theta = \delta T}

(28)

This is an LTI system in the variables $(\delta \theta, \delta T)$ , so our linearization is complete!

We can investigate what happens when we use different equilibrium points:

If $\theta_0=0$ (pendulum pointing down), we get $m\ell^2\delta\ddot \theta + mg\ell \delta\theta = \delta T$ . This is just what you get when you apply the “small angle formula” $\sin\theta \approx \theta$ . Indeed, the small angle formula is simply linearization about zero. The linearized equation of motion is the same as that of a standard spring-mass system.
If $\theta_0=\pi$ (pendulum pointing up), we get $m\ell^2\delta\ddot \theta - mg\ell \delta\theta = \delta T$ . This looks similar to the standard linearized pendulum, but with an important negative sign. This is like a spring-mass system with a negative spring constant. When you push the mass, it accelerates away from you rather than returning back.
If $\theta_0=\tfrac{\pi}{4}$ (pendulum at 45 degrees), we get $m\ell^2\delta\ddot \theta + \tfrac{1}{\sqrt{2}}mg\ell \delta\theta = \delta T$ . It’s a spring-mass system with a reduced spring constant.

Be mindful of coordinates!

This bears repeating! The linearized equations of motion uses different coordinates than the original system. This can be confusing because we often linearize about zero. For example, the first pendulum case used $\theta_0=0$ and $T_0=0$ , so we get $\delta\theta=\theta$ and $\delta T = T$ . This is the case where you can just “replace $\sin\theta$ by $\theta$ ”. The linearized equations of motion in the original coordinates become

m\ell^2\ddot \theta + mg\ell \theta = T

(29)

However, if we use a nonzero equilibrium point, then $T = T_0+\delta T$ and $\theta=\theta_0+\delta\theta$ . The system is only LTI when viewed as a perturbation from the equilibrium configuration. And that equilibrium configuration need not be zero. In the original coordinates, the dyamics will look affine, not linear:

m\ell^2\ddot \theta + mg\ell\cos\theta_0 (\theta - \theta_0) = T - T_0

(30)

Test your knowledge¶

Solution to Exercise 1 #

First, compute the partial derivatives:

\frac{\partial f}{\partial x} = 2xy \qquad\textsf{and}\qquad \frac{\partial f}{\partial y} = x^2 + \cos y

(31)

Next, evaluate everything at the reference point $(1,0)$

\begin{aligned} f(1,0) &= 1^2\cdot 0+\sin 0=0 \\ \left.\frac{\partial f}{\partial x}\right|_{(1,0)} &= 2(1)(0)=0 \\ \left.\frac{\partial f}{\partial y}\right|_{(1,0)} &= 1^2+\cos 0 = 1+1=2 \end{aligned}

(32)

The linearized approximation is

\begin{aligned} f(x,y) &\;\approx\; f(1,0) \;+\; \left.\frac{\partial f}{\partial x}\right|_{(1,0)} (x-1) \;+\; \left.\frac{\partial f}{\partial y}\right|_{(1,0)} y \;=\; 2y \end{aligned}

(33)

Therefore, using our linearization,

f(1.1,0.05)\approx 2 (0.05) = 0.10

(34)

We can confirm this is close by check the exact value:

f(1.1,0.05) = (1.1)^2(0.05) + \sin(0.05) = 0.110479...

(35)

Exercise 2

The Volterra model of predator-prey dynamics is a simple set of coupled nonlinear equations. The following set of equations is a simplified case of the full Volterra model:

\begin{aligned} \dot{x} &= \alpha x - xy\\ \dot{y} &= xy - \beta y. \end{aligned}

(36)

The state variable $x$ is the number of prey animals in the closed ecosystem and $y$ is the number of predator animals. $\alpha$ and $\beta$ are positive real constants.

Find the number of prey animals, $x^\star$ , and predator animals, $y^\star$ when the system is at an equilibrium. Note that you should ignore the equilibrium $x^\star=y^\star=0$ , which is the trivial case of both groups being extinct.
Linearize the nonlinear system about this equilibrium point. Write your linearized equations as a set of coupled ODEs in standard form, where $\delta x$ and $\delta y$ are the output variables.

Solution to Exercise 2 #

At equilibrium, $\dot x = 0$ and $\dot y = 0$ . Substituting into the dynamics, we see that this occurs when $\alpha x = \beta y = xy$ . Excluding the trivial solution $x=y=0$ , we see that

x^\star = \beta\qquad\textsf{and}\qquad y^\star = \alpha

(37)

To find the linearized equations of motion, we linearize each equation about the equilibrium point $(\beta,\alpha)$ . For the first equation, let $f(x,y) = \alpha x - xy$ . Evaluating partial derivatives:

\frac{\partial f}{\partial x} = \alpha-y, \qquad\textsf{and}\qquad \frac{\partial f}{\partial y} = -x

(38)

Linearizing about $(\beta,\alpha)$ , we obtain:

\begin{aligned} f(x,y) &\approx f(x^\star,y^\star) + \left.\frac{\partial f}{\partial x}\right|_{\star}(x-x^\star) + \left.\frac{\partial f}{\partial y}\right|_{\star}(y-y^\star) \\ &= 0 + 0(x-\beta) + (-\beta )(y-\alpha) \end{aligned}

(39)

Therefore $\dot x \approx -\beta (y-\alpha)$ , or $\delta\dot x \approx -\beta \delta y$ .

Similarly, we can linearize the second equation and obtain $\delta\dot y \approx \alpha \delta x$ . So our linearized equations of motion are

\begin{aligned} \delta \dot x &= -\beta \delta y \\ \delta \dot y &= \alpha \delta x \end{aligned}

(40)

And these states are measured relative to the equilibrium point, so

x(t) = \beta + \delta x(t) \qquad\textsf{and}\qquad y(t) = \alpha + \delta y(t)

(41)