Section LT Linear Transformations

Early in Chapter VS we prefaced the definition of a vector space with the comment that it was “one of the two most important definitions in the entire course.” Here comes the other. Any capsule summary of linear algebra would have to describe the subject as the interplay of linear transformations and vector spaces. Here we go.

Subsection LT Linear Transformations

Definition LT Linear Transformation

A linear transformation, $\ltdefn{T}{U}{V}$, is a function that carries elements of the vector space $U$ (called the domain) to the vector space $V$ (called the codomain), and which has two additional properties

  1. $\lt{T}{\vect{u}_1+\vect{u}_2}=\lt{T}{\vect{u}_1}+\lt{T}{\vect{u}_2}$ for all $\vect{u}_1,\,\vect{u}_2\in U$
  2. $\lt{T}{\alpha\vect{u}}=\alpha\lt{T}{\vect{u}}$ for all $\vect{u}\in U$ and all $\alpha\in\complex{\null}$

The two defining conditions in the definition of a linear transformation should “feel linear,” whatever that means. Conversely, these two conditions could be taken as exactly what it means to be linear. As every vector space property derives from vector addition and scalar multiplication, so too, every property of a linear transformation derives from these two defining properties. While these conditions may be reminiscent of how we test subspaces, they really are quite different, so do not confuse the two.

Here are two diagrams that convey the essence of the two defining properties of a linear transformation. In each case, begin in the upper left-hand corner, and follow the arrows around the rectangle to the lower-right hand corner, taking two different routes and doing the indicated operations labeled on the arrows. There are two results there. For a linear transformation these two expressions are always equal.

SVG image not dispayed

Diagram DLTA Definition of Linear Transformation, Additive
SVG image not dispayed

Diagram DLTM Definition of Linear Transformation, Multiplicative

A couple of words about notation. $T$ is the name of the linear transformation, and should be used when we want to discuss the function as a whole. $\lt{T}{\vect{u}}$ is how we talk about the output of the function, it is a vector in the vector space $V$. When we write $\lt{T}{\vect{x}+\vect{y}}=\lt{T}{\vect{x}}+\lt{T}{\vect{y}}$, the plus sign on the left is the operation of vector addition in the vector space $U$, since $\vect{x}$ and $\vect{y}$ are elements of $U$. The plus sign on the right is the operation of vector addition in the vector space $V$, since $\lt{T}{\vect{x}}$ and $\lt{T}{\vect{y}}$ are elements of the vector space $V$. These two instances of vector addition might be wildly different.

Let us examine several examples and begin to form a catalog of known linear transformations to work with.

Example ALT A linear transformation

It can be just as instructive to look at functions that are not linear transformations. Since the defining conditions must be true for all vectors and scalars, it is enough to find just one situation where the properties fail.

Example NLT Not a linear transformation
Example LTPM Linear transformation, polynomials to matrices
Example LTPP Linear transformation, polynomials to polynomials

Linear transformations have many amazing properties, which we will investigate through the next few sections. However, as a taste of things to come, here is a theorem we can prove now and put to use immediately.

Theorem LTTZZ Linear Transformations Take Zero to Zero

Suppose $\ltdefn{T}{U}{V}$ is a linear transformation. Then $\lt{T}{\zerovector}=\zerovector$.

Return to Example NLT and compute $\lt{S}{\colvector{0\\0\\0}}=\colvector{0\\0\\-2}$ to quickly see again that $S$ is not a linear transformation, while in Example LTPM compute \begin{align*} \lt{S}{0+0x+0x^2+0x^3}&=\begin{bmatrix}0&0\\0&0\end{bmatrix} \end{align*} as an example of Theorem LTTZZ at work.

Sage LTS Linear Transformations, Symbolic

Subsection LTC Linear Transformation Cartoons

Throughout this chapter, and Chapter R, we will include drawings of linear transformations. We will call them “cartoons,” not because they are humorous, but because they will only expose a portion of the truth. A Bugs Bunny cartoon might give us some insights on human nature, but the rules of physics and biology are routinely (and grossly) violated. So it will be with our linear transformation cartoons. Here is our first, followed by a guide to help you understand how these are meant to describe fundamental truths about linear transformations, while simultaneously violating other truths.

SVG image not dispayed

Diagram GLT General Linear Transformation

Here we picture a linear transformation $\ltdefn{T}{U}{V}$, where this information will be consistently displayed along the bottom edge. The ovals are meant to represent the vector spaces, in this case $U$, the domain, on the left and $V$, the codomain, on the right. Of course, vector spaces are typically infinite sets, so you will have to imagine that characteristic of these sets. A small dot inside of an oval will represent a vector within that vector space, sometimes with a name, sometimes not (in this case every vector has a name). The sizes of the ovals are meant to be proportional to the dimensions of the vector spaces. However, when we make no assumptions about the dimensions, we will draw the ovals as the same size, as we have done here (which is not meant to suggest that the dimensions have to be equal).

To convey that the linear transformation associates a certain input with a certain output, we will draw an arrow from the input to the output. So, for example, in this cartoon we suggest that $\lt{T}{\vect{x}}=\vect{y}$. Nothing in the definition of a linear transformation prevents two different inputs being sent to the same output and we see this in $\lt{T}{\vect{u}}=\vect{v}=\lt{T}{\vect{w}}$. Similarly, an output may not have any input being sent its way, as illustrated by no arrow pointing at $\vect{t}$. In this cartoon, we have captured the essence of our one general theorem about linear transformations, Theorem LTTZZ, $\lt{T}{\zerovector_U}=\zerovector_V$. On occasion we might include this basic fact when it is relevant, at other times maybe not. Note that the definition of a linear transformation requires that it be a function, so every element of the domain should be associated with some element of the codomain. This will be reflected by never having an element of the domain without an arrow originating there.

These cartoons are of course no substitute for careful definitions and proofs, but they can be a handy way to think about the various properties we will be studying.

Subsection MLT Matrices and Linear Transformations

If you give me a matrix, then I can quickly build you a linear transformation. Always. First a motivating example and then the theorem.

Example LTM Linear transformation from a matrix

So the multiplication of a vector by a matrix “transforms” the input vector into an output vector, possibly of a different size, by performing a linear combination. And this transformation happens in a “linear” fashion. This “functional” view of the matrix-vector product is the most important shift you can make right now in how you think about linear algebra. Here is the theorem, whose proof is very nearly an exact copy of the verification in the last example.

Theorem MBLT Matrices Build Linear Transformations

Suppose that $A$ is an $m\times n$ matrix. Define a function $\ltdefn{T}{\complex{n}}{\complex{m}}$ by $\lt{T}{\vect{x}}=A\vect{x}$. Then $T$ is a linear transformation.

So Theorem MBLT gives us a rapid way to construct linear transformations. Grab an $m\times n$ matrix $A$, define $\lt{T}{\vect{x}}=A\vect{x}$ and Theorem MBLT tells us that $T$ is a linear transformation from $\complex{n}$ to $\complex{m}$, without any further checking.

We can turn Theorem MBLT around. You give me a linear transformation and I will give you a matrix.

Example MFLT Matrix from a linear transformation

Example MFLT was not an accident. Consider any one of the archetypes where both the domain and codomain are sets of column vectors (Archetype M through Archetype R) and you should be able to mimic the previous example. Here is the theorem, which is notable since it is our first occasion to use the full power of the defining properties of a linear transformation when our hypothesis includes a linear transformation.

Theorem MLTCV Matrix of a Linear Transformation, Column Vectors

Suppose that $\ltdefn{T}{\complex{n}}{\complex{m}}$ is a linear transformation. Then there is an $m\times n$ matrix $A$ such that $\lt{T}{\vect{x}}=A\vect{x}$.

So if we were to restrict our study of linear transformations to those where the domain and codomain are both vector spaces of column vectors (Definition VSCV), every matrix leads to a linear transformation of this type (Theorem MBLT), while every such linear transformation leads to a matrix (Theorem MLTCV). So matrices and linear transformations are fundamentally the same. We call the matrix $A$ of Theorem MLTCV the matrix representation of $T$.

We have defined linear transformations for more general vector spaces than just $\complex{m}$. Can we extend this correspondence between linear transformations and matrices to more general linear transformations (more general domains and codomains)? Yes, and this is the main theme of Chapter R. Stay tuned. For now, let us illustrate Theorem MLTCV with an example.

Example MOLT Matrix of a linear transformation
Sage LTM Linear Transformations, Matrices

Subsection LTLC Linear Transformations and Linear Combinations

It is the interaction between linear transformations and linear combinations that lies at the heart of many of the important theorems of linear algebra. The next theorem distills the essence of this. The proof is not deep, the result is hardly startling, but it will be referenced frequently. We have already passed by one occasion to employ it, in the proof of Theorem MLTCV. Paraphrasing, this theorem says that we can “push” linear transformations “down into” linear combinations, or “pull” linear transformations “up out” of linear combinations. We will have opportunities to both push and pull.

Theorem LTLC Linear Transformations and Linear Combinations

Suppose that $\ltdefn{T}{U}{V}$ is a linear transformation, $\vectorlist{u}{t}$ are vectors from $U$ and $\scalarlist{a}{t}$ are scalars from $\complex{\null}$. Then \begin{equation*} \lt{T}{\lincombo{a}{u}{t}} = a_1\lt{T}{\vect{u}_1}+ a_2\lt{T}{\vect{u}_2}+ a_3\lt{T}{\vect{u}_3}+\cdots+ a_t\lt{T}{\vect{u}_t} \end{equation*}

Some authors, especially in more advanced texts, take the conclusion of Theorem LTLC as the defining condition of a linear transformation. This has the appeal of being a single condition, rather than the two-part condition of Definition LT. (See Exercise LT.T20).

Our next theorem says, informally, that it is enough to know how a linear transformation behaves for inputs from any basis of the domain, and all the other outputs are described by a linear combination of these few values. Again, the statement of the theorem, and its proof, are not remarkable, but the insight that goes along with it is very fundamental.

Theorem LTDB Linear Transformation Defined on a Basis

Suppose $U$ is a vector space with basis $B=\set{\vectorlist{u}{n}}$ and the vector space $V$ contains the vectors $\vectorlist{v}{n}$ (which may not be distinct). Then there is a unique linear transformation, $\ltdefn{T}{U}{V}$, such that $\lt{T}{\vect{u}_i}=\vect{v}_i$, $1\leq i\leq n$.

You might recall facts from analytic geometry, such as “any two points determine a line” and “any three non-collinear points determine a parabola.” Theorem LTDB has much of the same feel. By specifying the $n$ outputs for inputs from a basis, an entire linear transformation is determined. The analogy is not perfect, but the style of these facts are not very dissimilar from Theorem LTDB.

Notice that the statement of Theorem LTDB asserts the existence of a linear transformation with certain properties, while the proof shows us exactly how to define the desired linear transformation. The next two examples show how to compute values of linear transformations that we create this way.

Example LTDB1 Linear transformation defined on a basis
Example LTDB2 Linear transformation defined on a basis

Here is a third example of a linear transformation defined by its action on a basis, only with more abstract vector spaces involved.

Example LTDB3 Linear transformation defined on a basis

Informally, we can describe Theorem LTDB by saying “it is enough to know what a linear transformation does to a basis (of the domain).”

Sage LTB Linear Transformations, Bases

Subsection PI Pre-Images

The definition of a function requires that for each input in the domain there is exactly one output in the codomain. However, the correspondence does not have to behave the other way around. An output from the codomain could have many different inputs from the domain which the transformation sends to that output, or there could be no inputs at all which the transformation sends to that output. To formalize our discussion of this aspect of linear transformations, we define the pre-image.

Definition PI Pre-Image

Suppose that $\ltdefn{T}{U}{V}$ is a linear transformation. For each $\vect{v}$, define the pre-image of $\vect{v}$ to be the subset of $U$ given by \begin{equation*} \preimage{T}{\vect{v}}=\setparts{\vect{u}\in U}{\lt{T}{\vect{u}}=\vect{v}} \end{equation*}

In other words, $\preimage{T}{\vect{v}}$ is the set of all those vectors in the domain $U$ that get “sent” to the vector $\vect{v}$.

Example SPIAS Sample pre-images, Archetype S

The preimage is just a set, it is almost never a subspace of $U$ (you might think about just when $\preimage{T}{\vect{v}}$ is a subspace, see Exercise ILT.T10). We will describe its properties going forward, and it will be central to the main ideas of this chapter.

Sage PI Pre-Images

Subsection NLTFO New Linear Transformations From Old

We can combine linear transformations in natural ways to create new linear transformations. So we will define these combinations and then prove that the results really are still linear transformations. First the sum of two linear transformations.

Definition LTA Linear Transformation Addition

Suppose that $\ltdefn{T}{U}{V}$ and $\ltdefn{S}{U}{V}$ are two linear transformations with the same domain and codomain. Then their sum is the function $\ltdefn{T+S}{U}{V}$ whose outputs are defined by \begin{equation*} \lt{(T+S)}{\vect{u}}=\lt{T}{\vect{u}}+\lt{S}{\vect{u}} \end{equation*}

Notice that the first plus sign in the definition is the operation being defined, while the second one is the vector addition in $V$. (Vector addition in $U$ will appear just now in the proof that $T+S$ is a linear transformation.) Definition LTA only provides a function. It would be nice to know that when the constituents ($T$, $S$) are linear transformations, then so too is $T+S$.

Theorem SLTLT Sum of Linear Transformations is a Linear Transformation

Suppose that $\ltdefn{T}{U}{V}$ and $\ltdefn{S}{U}{V}$ are two linear transformations with the same domain and codomain. Then $\ltdefn{T+S}{U}{V}$ is a linear transformation.

Example STLT Sum of two linear transformations
Definition LTSM Linear Transformation Scalar Multiplication

Suppose that $\ltdefn{T}{U}{V}$ is a linear transformation and $\alpha\in\complex{\null}$. Then the scalar multiple is the function $\ltdefn{\alpha T}{U}{V}$ whose outputs are defined by \begin{equation*} \lt{(\alpha T)}{\vect{u}}=\alpha\lt{T}{\vect{u}} \end{equation*}

Given that $T$ is a linear transformation, it would be nice to know that $\alpha T$ is also a linear transformation.

Theorem MLTLT Multiple of a Linear Transformation is a Linear Transformation

Suppose that $\ltdefn{T}{U}{V}$ is a linear transformation and $\alpha\in\complex{\null}$. Then $\ltdefn{(\alpha T)}{U}{V}$ is a linear transformation.

Example SMLT Scalar multiple of a linear transformation

Now, let us imagine we have two vector spaces, $U$ and $V$, and we collect every possible linear transformation from $U$ to $V$ into one big set, and call it $\vslt{U}{V}$. Definition LTA and Definition LTSM tell us how we can “add” and “scalar multiply” two elements of $\vslt{U}{V}$. Theorem SLTLT and Theorem MLTLT tell us that if we do these operations, then the resulting functions are linear transformations that are also in $\vslt{U}{V}$. Hmmmm, sounds like a vector space to me! A set of objects, an addition and a scalar multiplication. Why not?

Theorem VSLT Vector Space of Linear Transformations

Suppose that $U$ and $V$ are vector spaces. Then the set of all linear transformations from $U$ to $V$, $\vslt{U}{V}$, is a vector space when the operations are those given in Definition LTA and Definition LTSM.

Definition LTC Linear Transformation Composition

Suppose that $\ltdefn{T}{U}{V}$ and $\ltdefn{S}{V}{W}$ are linear transformations. Then the composition of $S$ and $T$ is the function $\ltdefn{(\compose{S}{T})}{U}{W}$ whose outputs are defined by \begin{equation*} \lt{(\compose{S}{T})}{\vect{u}}=\lt{S}{\lt{T}{\vect{u}}} \end{equation*}

Given that $T$ and $S$ are linear transformations, it would be nice to know that $\compose{S}{T}$ is also a linear transformation.

Theorem CLTLT Composition of Linear Transformations is a Linear Transformation

Suppose that $\ltdefn{T}{U}{V}$ and $\ltdefn{S}{V}{W}$ are linear transformations. Then $\ltdefn{(\compose{S}{T})}{U}{W}$ is a linear transformation.

Example CTLT Composition of two linear transformations

Here is an interesting exercise that will presage an important result later. In Example STLT compute (via Theorem MLTCV) the matrix of $T$, $S$ and $T+S$. Do you see a relationship between these three matrices?

In Example SMLT compute (via Theorem MLTCV) the matrix of $T$ and $2T$. Do you see a relationship between these two matrices?

Here is the tough one. In Example CTLT compute (via Theorem MLTCV) the matrix of $T$, $S$ and $\compose{S}{T}$. Do you see a relationship between these three matrices???

Sage OLT Operations on Linear Transformations