Linear systems - System Analysis and Control

The systems we have seen so far have had equations of motions that look pretty similar to one another. Whether we were dealing with translational motion, rotational motion, or combinations of the two, our equations of motion always looked like some version of a spring-mass-damper equation at the end of the day. We will see that this pattern persists for electrical circuits, electromechanical systems such as motors and solenoids, and even for hydraulic and thermal systems.

The systems we have seen so far that share this common type of ODE are called linear time-invariant systems (LTI), and they are related to the familiar concept of a linear function. LTI systems are so common that it makes sense to take some time to study their properties in greater generality. This will be the focus of the present section.

Linear functions¶

The standard definition of a linear function is something of the form $f(u) = a u + b$ , because the graph of such a function is a straight line. From now on, we will only call a function “linear” if $b=0$ , so the function must satisfy $f(0)=0$ . If $b$ is not necessarily zero, we call the function “affine”. In summary:

Affine function: A function of the form $f(u) = a u + b$
Linear function: A function of the form $f(u) = a u$

Therefore, all linear functions are affine, but not all affine functions are linear. Our next task will be to generalize this concept to dynamical systems.

Left: A generic affine function. Right: A generic linear function. — Figure 1:**Left:** A generic affine function. **Right:** A generic linear function.

Linear systems¶

When dealing with systems, the inputs and outputs are now signals (functions of time) rather than just being numbers. So if our input is the signal $u(t)$ , our output is the signal $y(t)$ , and the system is $G$ , we will write $y = G(u)$ .

We can’t draw the graph of a system $G$ the same way we can with a function. All we can do (for now) is draw what the output looks like for one particular test input. A common test input is the step function, and when using such an input, the output is called the step response. We might represent this as follows:

Possible step response of a dynamical system G. — Figure 2:Possible step response of a dynamical system $G$ .

In general, the step response of a system only tells use how the system responds to this particular step function input. It doesn’t provide any information about other inputs. However, we will see that if our system is LTI, the step response tells us everything!

A system is LTI provided it satisfies: time-invariance, homogeneity, and additivity. We also add a fourth property called causality, which all physical/real systems must satisfy. Let’s now define these properties and discuss them in more detail.

Time-invariance¶

Time invariance says that the system does not explicitly depend on time. If I apply the same input now or tomorrow, the system will behave the same way. Mathematically, we can write it as follows.

Here is a graphical illustration of what time invariance looks like for the system of Figure 2.

Time invariance: shifting the input in time produces a corresponding shift in the output. — Figure 3:**Time invariance:** shifting the input in time produces a corresponding shift in the output.

Homogeneity¶

Homogeneity says that multiplying the input by a scalar leads to a corresponding scaling of the output. This property is also a property of linear functions. Mathematically, we can write it as follows.

Here is a graphical illustration of what homogeneity looks like for the system of Figure 2.

Homogeneity: scaling the input produces a correspondingly scaled output. — Figure 4:**Homogeneity:** scaling the input produces a correspondingly scaled output.

In particular, if we pick $\alpha=0$ , it means that for any linear system, $G(0)=0$ .

Additivity¶

Additivity says that adding two inputs together leads to an output that is the sum of the corresponding outputs. This property is also a property of linear functions. Mathematically, we can write it as follows.

Here is a graphical illustration of what additivity looks like for the system of Figure 2.

Additivity: adding two inputs adds the corresponding outputs. — Figure 5:**Additivity:** adding two inputs adds the corresponding outputs.

Causality¶

Time-invariance, homogeneity, and additivity are mathematical properties a system may or may not have. Causality is different: it is a basic “reality check.” A real system cannot react to an input before that input is applied. Intuitively:

what you do to the input in the future cannot affect the output now
the output at time $t$ is allowed to depend on $u(t)$ and on the past $u(\tau)$ for $\tau<t$ , but not on values $u(\tau)$ for $\tau>t$ .

Causality is illustrated in Figure 5. The blue and green inputs agree up to time $t=2$ , and so do the blue and green outputs.

Identifying LTI systems¶

Given a set of differential/algebraic equations describing a system, how can we tell if the underlying system is LTI? Here is a simple rule:

The three properties (causality, linearity, time-invariance) are independent. Here are several examples illustrating how this can happen.

Example 1

The simple pendulum of Figure 1 with equation of motion

m \ell^2 \ddot \theta + m g \ell \sin\theta = T

(1)

is causal and time-invariant, but nonlinear due to the $\sin\theta$ term. If we use the small-angle approximation and replace $\sin\theta$ by $\theta$ , it becomes LTI.

Example 2

A rocket in space (no air resistance) has an equation of motion

m(t) \ddot x = u

(2)

where $u$ is the thrust (input) and $m(t)$ is the rocket’s mass, which decays over time as fuel burns. This system is causal and linear, but time-varying. With a fixed mass, it becomes LTI.

Example 3

Consider a system whose output is a time shift of the input ( $\tau$ is a constant).

y(t) = u(t-\tau)

(3)

The system is linear and time-invariant. If $\tau < 0$ , it is non-causal because the output at time $t$ depends on future inputs. If $\tau \geq 0$ , it becomes LTI.

Example 4

Consider two masses with positions $x$ and $y$ , and an input $u$ . Suppose the equations of motion are:

\begin{aligned} m\ddot x + b(\dot x - \dot y) &= u \\ m\ddot y + b\dot y + kx &= \dot u \end{aligned}

(4)

This system is LTI, since each term consists of a constant times derivatives of one of the signals.

Notes of caution regarding linearity

Parameters vs. signals: Positions, angles, velocities, forces, etc. are signals; they determine the current state of the dynamical system. However, masses, spring constants, etc. are parameters. Changing them changes the system, but they are not system states (the equations of motions do not contain derivatives of parameters). For example, the gravity term $mg\ell\theta$ in a small-angle pendulum is a linear term, however if the pendulum also had a cart with position $x$ and there was a term like $b x \theta$ , then this term would be nonlinear.
Initial conditions: When discussing linearity, we assume the systems involved have zero initial conditions, meaning they begin at rest. If a system has a nonzero initial condition, then using a zero input would still produce a nonzero output. For example, imagine a pendulum that starts at $\theta(0) \neq 0$ . If we apply zero torque, the pendulum will still have a nonzero angle for $t>0$ .

Why LTI systems?¶

Most of this course will focus on LTI systems. There are two main reasons:

LTI systems are easy to analyze and understand. We can solve all the associated differential equations in closed form (no need for numerical approximation), and a number of control design techniques exist for such systems. LTI systems behave in intuitive, predictable ways.
Typically in control, our goal is to regulate a system (keep the speed of a motor constant, make sure the vibrations of a structure remain small, etc.). In such cases, the system only undergoes small motions (assuming the controller is well-designed), and we can approximate the system as LTI. For example, think about the pendulum; it was nonlinear but when $\theta\approx 0$ , it becomes LTI. This sort of approximation is called “linearization”, which we will cover in the next lecture.

So LTI systems are easy to work with, and most systems we want to control can be approximated as LTI. How convenient!

Pulse decomposition¶

We will now discuss one of the advantages of working with LTI systems: they obey superposition. We said earlier that we can’t plot the “graph” of a system $G$ the same way that we can plot the graph of a function. The output of a system depends on which input we choose! Instead, we plotted a step response, to show the output for one possible input. When a system is LTI, its step response contains enough information to derive how the system would respond to any input.

First, by adding two shifted step responses, we can figure out how the system would respond to a pulse input.

How the system of responds to a pulse input. — Figure 6:How the system of Figure 2 responds to a pulse input.

We can then take an arbitrary input signal, and decompose it as a sum of scaled and shifted pulses, and then perform the same scaling, shifting, and summing of the output to obtain the net output.

How the system of responds to an arbitrary input, found by decomposing the input as a sum of scaled and shifted pulses. — Figure 7:How the system of Figure 2 responds to an arbitrary input, found by decomposing the input as a sum of scaled and shifted pulses.

We can get increasingly better approximations of the true output by using progressively narrower pulses, similar to how we can approximate a definite integral using a Riemann sum. So simply knowing the step response of a system is enough to figure out how the system will respond to any input.

This is one of the many benefits of working with LTI systems. There is more to come!

Test your knowledge¶

Solution to Exercise 1 #

Yes. We can verify the properties one by one:

Time-invariance: If we use the delayed input $u_\alpha(t) = u(t-\alpha)$ , then we can apply a change of variables in the integral and obtain:

\int_{-\infty}^t u_\alpha(\tau)\,\dd\tau = \int_{-\infty}^t u(\tau-\alpha)\,\dd\tau = \int_{-\infty}^{t-\alpha} u(s)\,\dd s = y(t-\alpha)

(6)

In the last step, we used the fact that $y(t) = \int_{-\infty}^t u(\tau)\,\dd\tau$ . But $y(t-\alpha) = y_\alpha(t)$ so we have shown that delaying the input by $\alpha$ simply delays the output by $\alpha$ .

Linearity: We can verify homogeneity and additivity together. Suppose that $y_1(t) = \int_{-\infty}^t u_1(\tau)\,\dd\tau$ and $y_2(t) = \int_{-\infty}^t u_2(\tau)\,\dd\tau$ . Then we have:

\begin{aligned} \int_{-\infty}^t (\alpha_1 u_1(\tau) + \alpha_2 u_2(\tau))\,\dd\tau &= \alpha_1 \int_{-\infty}^t u_1(\tau) + \alpha_2\int_{-\infty}^t u_2(\tau)\,\dd\tau \\ &= \alpha_1 y_1(t) + \alpha_2 y_2(t) \end{aligned}

(7)

Causality: Based on the limits of the integration, the output $y(t)$ only depends on the values of $u(\tau)$ for $\tau \leq t$ . Therefore, the system is causal.

A more straightforward way to verify that this system is LTI is to take the time derivative of both sides of (5). By the fundamental theorem of calculus, we obtain $\dot y = u$ . This differential equation satisfies our requirements for LTI systems, so the system is LTI.