From A First Course in Linear Algebra
Version 1.04
© 2004.
Licensed under the GNU Free Documentation License.
http://linear.ups.edu/
Here comes a key definition.
Definition LT
Linear Transformation
A linear transformation, T: U → V, is a function that carries elements of the vector space U (called the domain) to the vector space V (called the codomain), and which has two additional properties:
1. T(u + v) = T(u) + T(v) for all u, v in U
2. T(αu) = αT(u) for all u in U and all scalars α
(This definition contains Notation LT.)
The two defining conditions in the definition of a linear transformation should “feel linear,” whatever that means. Conversely, these two conditions could be taken as exactly what it means to be linear. As every vector space property derives from vector addition and scalar multiplication, so too, every property of a linear transformation derives from these two defining properties. While these conditions may be reminiscent of how we test subspaces, they really are quite different, so do not confuse the two.
Here are two diagrams that convey the essence of the two defining properties of a linear transformation. In each case, begin in the upper left-hand corner, and follow the arrows around the rectangle to the lower-right hand corner, taking two different routes and doing the indicated operations labeled on the arrows. There are two results there. For a linear transformation these two expressions are always equal.
2006/11/15: Commutative diagrams not available in XML version. See PDF versions.
A couple of words about notation. T is the name of the linear transformation, and should be used when we want to discuss the function as a whole. T(u) is how we talk about the output of the function; it is a vector in the vector space V. When we write T(u + v) = T(u) + T(v), the plus sign on the left is the operation of vector addition in the vector space U, since u and v are elements of U. The plus sign on the right is the operation of vector addition in the vector space V, since T(u) and T(v) are elements of the vector space V. These two instances of vector addition might be wildly different.
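The two defining properties can also be checked numerically. The sketch below is a minimal illustration with a made-up linear transformation from R^3 to R^2 and made-up test vectors (none of this data comes from the text):

```python
# A numerical sketch of the two defining properties of a linear
# transformation.  The map T and the test vectors are hypothetical,
# chosen only for illustration.

def T(x):
    # T sends (x1, x2, x3) to (2*x1 + x3, -4*x2): each output entry is a
    # linear expression in the input entries with no constant term.
    x1, x2, x3 = x
    return [2 * x1 + x3, -4 * x2]

def vec_add(u, v):
    return [a + b for a, b in zip(u, v)]

def scal_mult(alpha, u):
    return [alpha * a for a in u]

u = [1.0, -2.0, 3.0]
v = [4.0, 0.5, -1.0]
alpha = 7.0

# Property 1: T(u + v) = T(u) + T(v).  The plus on the left happens in
# the domain R^3, the plus on the right in the codomain R^2.
assert T(vec_add(u, v)) == vec_add(T(u), T(v))

# Property 2: T(alpha * u) = alpha * T(u).
assert T(scal_mult(alpha, u)) == scal_mult(alpha, T(u))
```

Checking two sample vectors proves nothing, of course; the point of the algebraic verifications in the examples below is that the properties hold for every choice of inputs.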
Let’s examine several examples and begin to form a catalog of known linear transformations to work with.
Example ALT
A linear transformation
Define
by describing the output of the function for a generic input with the formula
and check the two defining properties.
So by Definition LT, the function is a linear transformation.
It can be just as instructive to look at functions that are not linear transformations. Since the defining conditions must be true for all vectors and scalars, it is enough to find just one situation where the properties fail.
Example NLT
Not a linear transformation
Define
by
This function “looks” linear, but consider
So the second required property fails for this choice of inputs, and by Definition LT, the function is not a linear transformation. It is just about as easy to find an example where the first defining property fails (try it!). Notice that it is the “-2” in the third component of the definition of the function that prevents it from being a linear transformation.
Example LTPM
Linear transformation, polynomials to matrices
Define a linear transformation
by
We verify the two defining conditions of a linear transformation.
So by Definition LT, the function is a linear transformation.
Example LTPP
Linear transformation, polynomials to polynomials
Define a function
by
Then
So by Definition LT, the function is a linear transformation.
Linear transformations have many amazing properties, which we will investigate through the next few sections. However, as a taste of things to come, here is a theorem we can prove now and put to use immediately.
Theorem LTTZZ
Linear Transformations Take Zero to Zero
Suppose T: U → V is a linear transformation. Then T(0) = 0.
Proof The two zero vectors in the conclusion of the theorem are different. The first is the zero vector in the domain, while the second is the zero vector in the codomain. We will subscript the zero vectors in this proof to highlight the distinction. Think about your objects. (This proof is contributed by Mark Shoemaker).
Return to Example NLT and compute the output of that function at the zero vector to quickly see again that it is not a linear transformation, while in Example LTPM you can compute the output at the zero vector as an example of Theorem LTTZZ at work.
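This quick check is easy to sketch in code. The function below is a hypothetical stand-in (not the book's exact formula) that, like the function of Example NLT, has a “-2” lurking in its third component; both the failure of additivity and the failure of Theorem LTTZZ show up immediately:

```python
# A sketch of why a constant term ruins linearity.  The map S below is
# invented for illustration: it subtracts 2 in its third component.

def S(x):
    x1, x2, x3 = x
    return [x1 + x2, x2, x3 - 2]

def vec_add(u, v):
    return [a + b for a, b in zip(u, v)]

u = [1, 2, 3]
v = [4, 5, 6]

# Additivity fails: the "-2" is applied once on the left-hand side but
# twice on the right-hand side.
assert S(vec_add(u, v)) != vec_add(S(u), S(v))

# Theorem LTTZZ gives an even faster test: a linear transformation must
# send the zero vector to the zero vector, and S does not.
assert S([0, 0, 0]) == [0, 0, -2]
```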
If you give me a matrix, then I can quickly build you a linear transformation. Always. First a motivating example and then the theorem.
Example LTM
Linear transformation from a matrix
Let
and define a function by
So we are using an old friend, the matrix-vector product (Definition MVP) as a way to convert a vector with 4 components into a vector with 3 components. Applying Definition MVP allows us to write the defining formula for in a slightly different form,
So we recognize the action of the function as using the components of the input vector as scalars to form the output as a linear combination of the four columns of the matrix, which are all members of the codomain, so the result is a vector in the codomain. We can rearrange this expression further, using our definitions of operations on column vectors (Section VO).
You might recognize this final expression as being similar in style to some previous examples (Example ALT) and some linear transformations defined in the archetypes (Archetype M through Archetype R). But the expression that says the output of this linear transformation is a linear combination of the columns of is probably the most powerful way of thinking about examples of this type.
Almost forgot — we should verify that the function is indeed a linear transformation. This is easy with two matrix properties from Section MM.
So by Definition LT, the function is a linear transformation.
So the multiplication of a vector by a matrix “transforms” the input vector into an output vector, possibly of a different size, by performing a linear combination. And this transformation happens in a “linear” fashion. This “functional” view of the matrix-vector product is the most important shift you can make right now in how you think about linear algebra. Here’s the theorem, whose proof is very nearly an exact copy of the verification in the last example.
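This “functional” view can be made concrete with a small sketch, using made-up data: the same matrix-vector product computed once as dot products with the rows and once as a linear combination of the columns.

```python
# Two views of the matrix-vector product Ax.  The matrix and vector are
# invented data, not taken from the example in the text.

A = [[1, 2, 0, -1],
     [3, 0, 1,  2],
     [0, 1, 1,  1]]   # a 3 x 4 matrix, stored as a list of rows

x = [2, -1, 3, 1]      # a vector with 4 components

# Row view: each output entry is a dot product of a row of A with x.
row_view = [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in A]

# Column view: scale column j of A by x[j] and add up the results,
# i.e. a linear combination of the columns with the entries of x as
# the scalars.
n_rows, n_cols = len(A), len(A[0])
col_view = [0] * n_rows
for j in range(n_cols):
    for i in range(n_rows):
        col_view[i] += x[j] * A[i][j]

assert row_view == col_view   # same vector, two interpretations
print(row_view)               # prints [-1, 11, 3]
```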
Theorem MBLT
Matrices Build Linear Transformations
Suppose that A is an m × n matrix. Define a function T: C^n → C^m by T(x) = Ax. Then T is a linear transformation.
Proof
So by Definition LT, the function is a linear transformation.
So Theorem MBLT gives us a rapid way to construct linear transformations. Grab an m × n matrix A, define T(x) = Ax, and Theorem MBLT tells us that T is a linear transformation from C^n to C^m, without any further checking.
We can turn Theorem MBLT around. You give me a linear transformation and I will give you a matrix.
Example MFLT
Matrix from a linear transformation
Define the function
by
You could verify that is a linear transformation by applying the definition, but we will instead massage the expression defining a typical output until we recognize the form of a known class of linear transformations.
So if we define the matrix
then the function is given by a matrix-vector product. By Theorem MBLT, we can easily recognize it as a linear transformation since it has the form described in the hypothesis of the theorem.
Example MFLT was no accident. Consider any one of the archetypes where both the domain and codomain are sets of column vectors (Archetype M through Archetype R) and you should be able to mimic the previous example. Here’s the theorem, which is notable since it is our first occasion to use the full power of the defining properties of a linear transformation when our hypothesis includes a linear transformation.
Theorem MLTCV
Matrix of a Linear Transformation, Column Vectors
Suppose that T: C^n → C^m is a linear transformation. Then there is an m × n matrix A such that T(x) = Ax.
Proof The conclusion says a certain matrix exists. What better way to prove something exists than to actually build it? So our proof will be constructive, and the procedure that we will use abstractly in the proof can be used concretely in specific examples.
Let e_1, e_2, e_3, …, e_n be the columns of the identity matrix of size n (Definition SUV). Evaluate the linear transformation T with each of these standard unit vectors as an input, and record the result. In other words, define vectors A_i in C^m, 1 ≤ i ≤ n, by A_i = T(e_i).
Then package up these vectors as the columns of a matrix A = [A_1 | A_2 | A_3 | … | A_n].
Does A have the desired properties? First, A is clearly an m × n matrix. Then
as desired.
So if we were to restrict our study of linear transformations to those where the domain and codomain are both vector spaces of column vectors (Definition VSCV), every matrix leads to a linear transformation of this type (Theorem MBLT), while every such linear transformation leads to a matrix (Theorem MLTCV). So matrices and linear transformations are fundamentally the same. We call the matrix of Theorem MLTCV the matrix representation of .
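The constructive procedure in the proof of Theorem MLTCV is easy to mechanize. The sketch below assumes a hypothetical linear transformation (not one from the text): evaluate it at each standard unit vector, and use the outputs as the columns of the matrix representation.

```python
# Building the matrix representation of a linear transformation, as in
# the proof of Theorem MLTCV.  The map T here is a made-up example.

def T(x):
    x1, x2, x3 = x
    return [x1 - x2, 2 * x2 + x3]    # a linear map from R^3 to R^2

n = 3  # dimension of the domain

# Columns of the matrix representation: A_j = T(e_j), where e_j is the
# j-th standard unit vector.
columns = []
for j in range(n):
    e_j = [1 if i == j else 0 for i in range(n)]
    columns.append(T(e_j))

# Store the matrix A by rows, transposing the list of columns.
A = [list(row) for row in zip(*columns)]
print(A)   # prints [[1, -1, 0], [0, 2, 1]]

# Check that T(x) = Ax on a sample input.
x = [5, -2, 4]
Ax = [sum(a * xi for a, xi in zip(row, x)) for row in A]
assert Ax == T(x)
```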
We have defined linear transformations for more general vector spaces than just column vectors. Can we extend this correspondence between linear transformations and matrices to more general linear transformations (more general domains and codomains)? Yes, and this is the main theme of Chapter R. Stay tuned. For now, let’s illustrate Theorem MLTCV with an example.
Example MOLT
Matrix of a linear transformation
Suppose
is defined by
Then
so define
and Theorem MLTCV guarantees that .
As an illuminating exercise, choose an input vector and compute its image two different ways. First, return to the definition of the transformation and evaluate directly. Then do the matrix-vector product with the matrix just constructed. In both cases you should obtain the same vector.
It is the interaction between linear transformations and linear combinations that lies at the heart of many of the important theorems of linear algebra. The next theorem distills the essence of this. The proof is not deep, the result is hardly startling, but it will be referenced frequently. We have already passed by one occasion to employ it, in the proof of Theorem MLTCV. Paraphrasing, this theorem says that we can “push” linear transformations “down into” linear combinations, or “pull” linear transformations “up out” of linear combinations. We’ll have opportunities to both push and pull.
Theorem LTLC
Linear Transformations and Linear Combinations
Suppose that T: U → V is a linear transformation, u_1, u_2, u_3, …, u_t are vectors from U and a_1, a_2, a_3, …, a_t are scalars from C. Then
T(a_1 u_1 + a_2 u_2 + a_3 u_3 + ⋯ + a_t u_t) = a_1 T(u_1) + a_2 T(u_2) + a_3 T(u_3) + ⋯ + a_t T(u_t)
Proof
Our next theorem says, informally, that it is enough to know how a linear transformation behaves for inputs from a basis of the domain, and all other outputs are described by a linear combination of these values. Again, the theorem and its proof are not remarkable, but the insight that goes along with it is fundamental.
Theorem LTDB
Linear Transformation Defined on a Basis
Suppose that T: U → V is a linear transformation, B = {u_1, u_2, u_3, …, u_t} is a basis for U and w is a vector from U. Let a_1, a_2, a_3, …, a_t be the scalars from C such that
w = a_1 u_1 + a_2 u_2 + a_3 u_3 + ⋯ + a_t u_t
Then
T(w) = a_1 T(u_1) + a_2 T(u_2) + a_3 T(u_3) + ⋯ + a_t T(u_t)
Proof For any w in U, Theorem VRRB says there are (unique) scalars such that w is a linear combination of the basis vectors in B. The result then follows from a straightforward application of Theorem LTLC to this linear combination.
Example LTDB1
Linear transformation defined on a basis
Suppose you are told that
is a linear transformation and given the three values,
Because
is a basis for the domain (Theorem SUVB), Theorem LTDB says we can compute any output of the transformation with just this information. For example, consider,
so
Doing it again,
so
Any other value of the transformation could be computed in a similar manner. So rather than being given a formula for its outputs, the requirement that it behave as a linear transformation, along with its values on a handful of vectors (the basis), is sufficient for computing any value of the function. You might notice some parallels between this example and Example MOLT or Theorem MLTCV.
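The computation pattern of Example LTDB1 can be sketched as code. The three prescribed outputs below are invented for illustration; with the standard basis, the scalars in the linear combination are simply the components of the input vector.

```python
# A sketch of Theorem LTDB with the standard basis of R^3, in the style
# of Example LTDB1.  The prescribed values T(e_1), T(e_2), T(e_3) are
# hypothetical data, not the ones from the text.

basis_values = [[1, 4], [0, -2], [3, 3]]   # T(e_1), T(e_2), T(e_3)

def T(w):
    # T(w) = w_1 * T(e_1) + w_2 * T(e_2) + w_3 * T(e_3)  (Theorem LTLC);
    # with the standard basis the scalars are just the entries of w.
    out = [0] * len(basis_values[0])
    for w_i, Te_i in zip(w, basis_values):
        for k in range(len(out)):
            out[k] += w_i * Te_i[k]
    return out

print(T([2, -3, 1]))   # 2*[1,4] - 3*[0,-2] + 1*[3,3], prints [5, 17]
```

With a non-standard basis, as in Example LTDB2, there would be an extra step: solve a linear system to express the input in terms of the basis before forming this combination.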
Example LTDB2
Linear transformation defined on a basis
Suppose you are told that
is a linear transformation and given the three values,
You can check that
is a basis for the domain (make the vectors the columns of a square matrix and check that the matrix is nonsingular, Theorem CNMB). By Theorem LTDB we can compute any output of the transformation with just this information. However, we have to work just a bit harder to take an input vector and express it as a linear combination of the vectors in the basis. For example, consider,
Then we must first write the input as a linear combination of the vectors in the basis and solve for the unknown scalars, to arrive at
Then Theorem LTDB gives us
Any other value of the transformation could be computed in a similar manner.
Here is a third example of a linear transformation defined by its action on a basis, only with more abstract vector spaces involved.
Example LTDB3
Linear transformation defined on a basis
The set
is a subspace of the vector space of polynomials. This subspace has
as a basis (check this!). Suppose we define a linear transformation
by
the values
To illustrate a sample computation of the transformation, consider a specific polynomial. Verify that it is an element of the subspace (does it have roots at the required values?), then find the scalars needed to write it as a linear combination of the basis vectors. Because
Theorem LTDB gives us
And all the other outputs of the transformation could be computed in the same manner. Every output will have a zero in the second row, second column. Can you see why this is so?
The definition of a function requires that for each input in the domain there is exactly one output in the codomain. However, the correspondence does not have to behave the other way around. A member of the codomain might have many inputs from the domain that create it, or it may have none at all. To formalize our discussion of this aspect of linear transformations, we define the pre-image.
Definition PI
Pre-Image
Suppose that T: U → V is a linear transformation. For each v ∈ V, define the pre-image of v to be the subset of U given by
T^{-1}(v) = { u ∈ U | T(u) = v }
In other words, the pre-image of v is the set of all those vectors in the domain that get “sent” to the vector v.
TODO: All the preimages together form a partition of the domain, which is what an equivalence relation is about. Maybe move to exercises.
Example SPIAS
Sample pre-images, Archetype S
Archetype S is the linear transformation defined by
We could compute a pre-image for every element of the codomain. However, even in a free textbook, we do not have the room to do that, so we will compute just two.
Choose
for no particular reason. What is the pre-image of this element of the codomain? Suppose it is the output of the transformation at some unknown input. That becomes
Using matrix equality (Definition ME), we arrive at a system of four equations in the three unknowns with an augmented matrix that we can row-reduce in the hunt for solutions,
We recognize this system as having infinitely many solutions, described by a single free variable. Eventually obtaining the vector form of the solutions (Theorem VFSLS), we can describe the preimage precisely as,
This last line is merely a suggestive way of describing the set on the previous line. You might create three or four vectors in the preimage, and evaluate the transformation with each. Was the result what you expected? For a hint of things to come, you might try evaluating the transformation with just the lone vector in the spanning set above. What was the result? Now take a look back at Theorem PSPHS. Hmmmm.
OK, let’s compute another preimage, but with a different outcome this time. Choose
What is the pre-image this time? Suppose again that the chosen element is the output of the transformation at some unknown input. That becomes
Using matrix equality (Definition ME), we arrive at a system of four equations in the three unknowns with an augmented matrix that we can row-reduce in the hunt for solutions,
By Theorem RCLS we recognize this system as inconsistent. So no vector is a member of the pre-image, and so the pre-image is the empty set.
The preimage is just a set; it is almost never a subspace of the domain (you might think about just when the pre-image is a subspace; see Exercise ILT.T10). We will describe its properties going forward, and it will be central to the main ideas of this chapter.
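Pre-images can also be explored by brute force when the domain is restricted to a small grid of points. The map below is hypothetical; it illustrates how one element of the codomain can have many pre-image elements while another has none, just as in the two computations above.

```python
# A brute-force sketch of pre-images (Definition PI), using a made-up
# map on integer pairs and a small search grid.

def T(x):
    x1, x2 = x
    # Every output of T has second entry twice the first, so any target
    # vector violating that pattern has an empty pre-image.
    return (x1 + x2, 2 * x1 + 2 * x2)

def preimage(v, grid=range(-3, 4)):
    """All grid points the map sends to v."""
    return [(a, b) for a in grid for b in grid if T((a, b)) == v]

print(preimage((1, 2)))   # many inputs: every (a, b) on the grid
                          # with a + b == 1
print(preimage((1, 3)))   # empty list: 3 is not twice 1, so nothing
                          # maps here
```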
We can combine linear transformations in natural ways to create new linear transformations. So we will define these combinations and then prove that the results really are still linear transformations. First the sum of two linear transformations.
Definition LTA
Linear Transformation Addition
Suppose that T: U → V and S: U → V are two linear transformations with the same domain and codomain. Then their sum is the function T + S: U → V whose outputs are defined by
(T + S)(u) = T(u) + S(u)
Notice that the first plus sign in the definition is the operation being defined, while the second one is the vector addition in the codomain V. (Vector addition in the domain U will appear just now in the proof that T + S is a linear transformation.) Definition LTA only provides a function. It would be nice to know that when the constituents (T, S) are linear transformations, then so too is T + S.
Theorem SLTLT
Sum of Linear Transformations is a Linear Transformation
Suppose that T: U → V and S: U → V are two linear transformations with the same domain and codomain. Then T + S: U → V is a linear transformation.
Proof We simply check the defining properties of a linear transformation (Definition LT). This is a good place to consistently ask yourself which objects are being combined with which operations.
Example STLT
Sum of two linear transformations
Suppose that
and
are defined by
Then by Definition LTA, we have
and by Theorem SLTLT we know the sum is also a linear transformation, with the same domain and codomain.
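As a sketch of Definition LTA with two hypothetical linear maps (invented for illustration): the sum is computed output-by-output, and, foreshadowing an exercise below, the matrix of the sum (built as in Theorem MLTCV) turns out to be the entrywise sum of the two matrices.

```python
# Definition LTA in code, with two made-up linear maps from R^2 to R^2.

def T(x):
    return [x[0] + 2 * x[1], 3 * x[0]]

def S(x):
    return [4 * x[0] - x[1], x[1]]

def add_maps(f, g):
    # (f + g)(u) = f(u) + g(u): the addition happens in the codomain.
    return lambda u: [a + b for a, b in zip(f(u), g(u))]

def matrix_of(f, n):
    # Columns are the images of the standard unit vectors (Thm MLTCV).
    cols = [f([1 if i == j else 0 for i in range(n)]) for j in range(n)]
    return [list(row) for row in zip(*cols)]

TS = add_maps(T, S)
print(matrix_of(T, 2))    # prints [[1, 2], [3, 0]]
print(matrix_of(S, 2))    # prints [[4, -1], [0, 1]]
print(matrix_of(TS, 2))   # prints [[5, 1], [3, 1]] -- the entrywise sum
```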
Definition LTSM
Linear Transformation Scalar Multiplication
Suppose that T: U → V is a linear transformation and α ∈ C. Then the scalar multiple αT: U → V is the function whose outputs are defined by
(αT)(u) = αT(u)
Given that T is a linear transformation, it would be nice to know that αT is also a linear transformation.
Theorem MLTLT
Multiple of a Linear Transformation is a Linear Transformation
Suppose that T: U → V is a linear transformation and α ∈ C. Then αT: U → V is a linear transformation.
Proof We simply check the defining properties of a linear transformation (Definition LT). This is another good place to consistently ask yourself which objects are being combined with which operations.
Example SMLT
Scalar multiple of a linear transformation
Suppose that
is defined by
For the sake of an example, choose a scalar, and by Definition LTSM we have
and by Theorem MLTLT we know the scalar multiple is also a linear transformation, with the same domain and codomain.
Now, let’s imagine we have two vector spaces, U and V, and we collect every possible linear transformation from U to V into one big set. Definition LTA and Definition LTSM tell us how we can “add” and “scalar multiply” two elements of this set. Theorem SLTLT and Theorem MLTLT tell us that if we do these operations, then the resulting functions are linear transformations that are also in the set. Hmmmm, sounds like a vector space to me! A set of objects, an addition and a scalar multiplication. Why not?
Theorem VSLT
Vector Space of Linear Transformations
Suppose that U and V are vector spaces. Then the set of all linear transformations from U to V is a vector space when the operations are those given in Definition LTA and Definition LTSM.
Proof Theorem SLTLT and Theorem MLTLT provide two of the ten properties in Definition VS. However, we still need to verify the remaining eight properties. By and large, the proofs are straightforward and rely on concocting the obvious object, or on reducing the question to the same vector space property in the codomain V.
The zero vector is of some interest, though. What linear transformation would we add to any other linear transformation, so as to keep the second one unchanged? The answer is the zero transformation, which sends every vector of the domain to the zero vector of the codomain. Notice how we do not need to know any specifics about the two vector spaces to make this definition.
Definition LTC
Linear Transformation Composition
Suppose that T: U → V and S: V → W are linear transformations. Then the composition of S and T is the function (S ∘ T): U → W whose outputs are defined by
(S ∘ T)(u) = S(T(u))
Given that T and S are linear transformations, it would be nice to know that S ∘ T is also a linear transformation.
Theorem CLTLT
Composition of Linear Transformations is a Linear Transformation
Suppose that T: U → V and S: V → W are linear transformations. Then (S ∘ T): U → W is a linear transformation.
Proof We simply check the defining properties of a linear transformation (Definition LT).
Example CTLT
Composition of two linear transformations
Suppose that
and
are defined by
Then by Definition LTC
and by Theorem CLTLT the composition is a linear transformation, from the domain of the first map to the codomain of the second.
Here is an interesting exercise that will presage an important result later. In Example STLT compute (via Theorem MLTCV) the matrices of the two linear transformations and of their sum. Do you see a relationship between these three matrices?
In Example SMLT compute (via Theorem MLTCV) the matrices of the linear transformation and of its scalar multiple. Do you see a relationship between these two matrices?
Here’s the tough one. In Example CTLT compute (via Theorem MLTCV) the matrices of the two linear transformations and of their composition. Do you see a relationship between these three matrices?
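You can run this experiment numerically with hypothetical maps of your own. In the sketch below (made-up maps, not those of Example CTLT), the matrix-building function follows the proof of Theorem MLTCV, and the final assertion records the relationship you should discover for yourself on paper first.

```python
# Matrices of two made-up linear maps and of their composition.

def T(x):   # T: R^2 -> R^2
    return [x[0] - x[1], 2 * x[1]]

def S(x):   # S: R^2 -> R^2
    return [3 * x[0], x[0] + x[1]]

def compose(f, g):
    # (f o g)(u) = f(g(u))
    return lambda u: f(g(u))

def matrix_of(f, n):
    # Columns are the images of the standard unit vectors (Thm MLTCV).
    cols = [f([1 if i == j else 0 for i in range(n)]) for j in range(n)]
    return [list(row) for row in zip(*cols)]

def mat_mult(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

A = matrix_of(S, 2)
B = matrix_of(T, 2)

# The matrix of the composition S o T equals the product of the
# individual matrices, in the same order.
assert matrix_of(compose(S, T), 2) == mat_mult(A, B)
```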
C15 The archetypes below are all linear transformations whose domains and
codomains are vector spaces of column vectors (Definition VSCV). For
each one, compute the matrix representation described in the proof of
Theorem MLTCV.
Archetype M
Archetype N
Archetype O
Archetype P
Archetype Q
Archetype R
Contributed by Robert Beezer
C20 Referring to Example MOLT, compute the image of a chosen input vector two different ways. First use the definition of the transformation, then compute the matrix-vector product (Definition MVP).
Contributed by Robert Beezer Solution [1306]
C25 Define the linear transformation
Verify that it is a linear transformation.
Contributed by Robert Beezer Solution [1306]
C26 Verify that the function below is a linear transformation.
Contributed by Robert Beezer Solution [1306]
C30 Define the linear transformation
Compute the two indicated pre-images.
Contributed by Robert Beezer Solution [1307]
C31 For the given linear transformation, compute the pre-images.
M10 Define two linear transformations by the formulas given. Using the proof of Theorem MLTCV compute the matrix representations of the two linear transformations and of their composition.
Discover and comment on the relationship between these three matrices.
Contributed by Robert Beezer Solution [1312]
C20 Contributed by Robert Beezer Statement [1302]
In both cases the result will be the same vector.
C25 Contributed by Robert Beezer Statement [1302]
We can rewrite
as follows:
and Theorem MBLT tells us that any function of this form is a linear transformation.
C26 Contributed by Robert Beezer Statement [1303]
Check the two conditions of Definition LT.
So the function is indeed a linear transformation.
C30 Contributed by Robert Beezer Statement [1303]
For the first pre-image, we want the vectors that the transformation sends to the first given element.
This becomes,
Vector equality gives a system of two linear equations in three variables, represented by the augmented matrix
so the system is inconsistent and the pre-image is the empty set. For the second pre-image the same procedure leads to an augmented matrix with a different vector of constants
This system is consistent and has infinitely many solutions, as we can see from the presence of two free variables. We apply Theorem VFSLS to obtain
C31 Contributed by Robert Beezer Statement [1304]
We work from the definition of the pre-image, Definition PI. Setting
we arrive at a system of three equations in three variables, with an augmented matrix that we row-reduce in a search for solutions,
With a leading 1 in the last column, this system is inconsistent (Theorem RCLS), and there are no values of the variables that will create an element of the pre-image. So the preimage is the empty set.
We work from the definition of the pre-image, Definition PI. Setting
we arrive at a system of three equations in three variables, with an augmented matrix that we row-reduce in a search for solutions,
The solution set to this system, which is also the desired pre-image, can be expressed using the vector form of the solutions (Theorem VFSLS)
Does the final expression for this set remind you of Theorem KPI?
M10 Contributed by Robert Beezer Statement [1304]