Section B Bases
A basis of a vector space is one of the most useful concepts in linear algebra. It often provides a concise, finite description of an infinite vector space.
Subsection B Bases
We now have all the tools in place to define a basis of a vector space.
Definition B Basis
Suppose $V$ is a vector space. Then a subset $S\subseteq V$ is a basis of $V$ if it is linearly independent and spans $V$.
So, a basis is a linearly independent spanning set for a vector space. The requirement that the set spans $V$ ensures that $S$ has enough raw material to build $V$, while the linear independence requirement ensures that we do not have any more raw material than we need. As we will see in Section D, a basis is a minimal spanning set.
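As an informal check of both defining properties, here is a minimal Sage sketch in the spirit of this book's Sage cells. The three vectors are invented purely for illustration, and we work over the rationals for simplicity.

    # Candidate basis for QQ^3: verify span and linear independence
    V = QQ^3
    S = [vector(QQ, [1, 0, 1]), vector(QQ, [0, 1, 1]), vector(QQ, [1, 1, 0])]
    print(V.span(S) == V)          # True: S spans the whole space
    A = matrix(QQ, S)              # rows of A are the vectors of S
    print(A.rank() == len(S))      # True: S is linearly independent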
You may have noticed that we used the term basis in the titles of several previous theorems (e.g. Theorem BNS, Theorem BCS, Theorem BRS), and if you review each of these theorems you will see that their conclusions provide linearly independent spanning sets for what we now recognize as subspaces of $\complex{m}$. Examples associated with these theorems include Example NSLIL, Example CSOCD and Example IAS. As we will see, these three theorems will continue to be powerful tools, even in the setting of more general vector spaces.
Furthermore, the archetypes contain an abundance of bases. For each coefficient matrix of a system of equations, and for each archetype defined simply as a matrix, there is a basis for the null space, three bases for the column space, and a basis for the row space. For this reason, our subsequent examples will concentrate on bases for vector spaces other than $\complex{m}$.
Notice that Definition B does not preclude a vector space from having many bases, and this is the case, as hinted above by the statement that the archetypes contain three bases for the column space of a matrix. More generally, we can grab any basis for a vector space, multiply any one basis vector by a nonzero scalar and create a slightly different set that is still a basis. For “important” vector spaces, it will be convenient to have a collection of “nice” bases. When a vector space has a single particularly nice basis, it is sometimes called the standard basis though there is nothing precise enough about this term to allow us to define it formally — it is a question of style. Here are some nice bases for important vector spaces.
Theorem SUVB Standard Unit Vectors are a Basis
The set of standard unit vectors for $\complex{m}$ (Definition SUV), $B=\setparts{\vect{e}_i}{1\leq i\leq m}$, is a basis for the vector space $\complex{m}$.
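Sage attaches the standard unit vectors to every vector space as its default basis, so Theorem SUVB is easy to see computationally. A small sketch with $m=4$, over the rationals for display purposes:

    V = QQ^4
    for e in V.basis():            # the standard unit vectors e_1, ..., e_4
        print(e)
    # these vectors are the rows of the identity matrix, which is nonsingular
    print(identity_matrix(QQ, 4).is_invertible())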
Example BP Bases for $P_n$
Example BM A basis for the vector space of matrices
The bases described above will often be convenient ones to work with. However, a set can be a basis without obviously looking like one, as the next three examples show.
Example BSP4 A basis for a subspace of $P_4$
Example BSM22 A basis for a subspace of $M_{22}$
Example BC Basis for the crazy vector space
We have seen that several of the sets associated with a matrix are subspaces of vector spaces of column vectors. Specifically these are the null space (Theorem NSMS), column space (Theorem CSMS), row space (Theorem RSMS) and left null space (Theorem LNSMS). As subspaces they are vector spaces (Definition S) and it is natural to ask about bases for these vector spaces. Theorems BNS, BCS and BRS each have conclusions that provide linearly independent spanning sets for (respectively) the null space, column space, and row space. Notice that each of these theorems contains the word “basis” in its title, even though we did not know the precise meaning of the word at the time. To find a basis for a left null space, we can use the definition of this subspace as a null space (Definition LNS) and apply Theorem BNS. Alternatively, Theorem FS tells us that the left null space can be expressed as a row space, and we can then use Theorem BRS.
Theorem BS is another early result that provides a linearly independent spanning set (i.e. a basis) as its conclusion. If a vector space of column vectors can be expressed as a span of a set of column vectors, then Theorem BS can be employed in a straightforward manner to quickly yield a basis.
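All four of these subspaces, along with bases for them, are available directly in Sage. A sketch with a small invented matrix; Sage's choice of basis vectors may differ from those produced by the cited theorems, but they span the same subspaces:

    A = matrix(QQ, [[1, 2, 1, 1],
                    [2, 4, 0, 6],
                    [1, 2, 0, 3]])
    print(A.right_kernel().basis())    # basis for the null space
    print(A.column_space().basis())    # basis for the column space
    print(A.row_space().basis())       # basis for the row space
    print(A.left_kernel().basis())     # basis for the left null space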
Subsection BSCV Bases for Spans of Column Vectors
We have seen several examples of bases in different vector spaces. In this subsection, and the next (Subsection B.BNM), we will consider building bases for $\complex{m}$ and its subspaces.
Suppose we have a subspace of $\complex{m}$ that is expressed as the span of a set of vectors, $S$, and $S$ is not necessarily linearly independent, or perhaps not very attractive. Theorem REMRS says that row-equivalent matrices have identical row spaces, while Theorem BRS says the nonzero rows of a matrix in reduced row-echelon form are a basis for the row space. These theorems together give us a great computational tool for quickly finding a basis for a subspace that is expressed originally as a span.
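In outline: place the spanning vectors as rows of a matrix, row-reduce, and keep the nonzero rows. A Sage sketch, with a deliberately redundant spanning set invented for illustration:

    S = [vector(QQ, [1, 1, 0, 2]),
         vector(QQ, [2, 2, 0, 4]),    # twice the first vector
         vector(QQ, [0, 1, 1, 1]),
         vector(QQ, [1, 2, 1, 3])]    # first vector plus third vector
    A = matrix(QQ, S)                 # rows are the spanning vectors
    B = A.rref()                      # Theorem REMRS: row space is unchanged
    basis = [r for r in B.rows() if not r.is_zero()]   # Theorem BRS
    print(basis)                      # two vectors: a basis for span(S)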
Example RSB Row space basis
Example IAS provides another example of this flavor, though now we can notice that $X$ is a subspace, and that the resulting set of three vectors is a basis. This is such a powerful technique that we should do one more example.
Example RS Reducing a span
Sage B Bases
Sage SUTH0 Sage Under The Hood, Round 0
Subsection BNM Bases and Nonsingular Matrices
A quick source of diverse bases for $\complex{m}$ is the set of columns of a nonsingular matrix.
Theorem CNMB Columns of Nonsingular Matrix are a Basis
Suppose that $A$ is a square matrix of size $m$. Then the columns of $A$ are a basis of $\complex{m}$ if and only if $A$ is nonsingular.
Example CABAK Columns as Basis, Archetype K
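Theorem CNMB is equally easy to probe computationally. A Sage sketch with a small invented nonsingular matrix; every test should report True:

    A = matrix(QQ, [[1, 2, 0],
                    [0, 1, 1],
                    [1, 0, 1]])
    print(A.is_invertible())                  # nonsingular (Theorem NI)
    print((QQ^3).span(A.columns()) == QQ^3)   # columns span QQ^3
    print(A.rank() == 3)                      # columns linearly independent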
Perhaps we should view the fact that the standard unit vectors are a basis (Theorem SUVB) as just a simple corollary of Theorem CNMB? (See Proof Technique LC.)
With a new equivalence for a nonsingular matrix, we can update our list of equivalences.
Theorem NME5 Nonsingular Matrix Equivalences, Round 5
Suppose that $A$ is a square matrix of size $n$. The following are equivalent.
- $A$ is nonsingular.
- $A$ row-reduces to the identity matrix.
- The null space of $A$ contains only the zero vector, $\nsp{A}=\set{\zerovector}$.
- The linear system $\linearsystem{A}{\vect{b}}$ has a unique solution for every possible choice of $\vect{b}$.
- The columns of $A$ are a linearly independent set.
- $A$ is invertible.
- The column space of $A$ is $\complex{n}$, $\csp{A}=\complex{n}$.
- The columns of $A$ are a basis for $\complex{n}$.
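As a quick sanity check, several of these equivalences can be tested side by side in Sage; the matrix below is invented and happens to be nonsingular, so every test should report True:

    A = matrix(QQ, [[1, 2], [3, 5]])
    print(A.rref() == identity_matrix(QQ, 2))
    print(A.right_kernel().dimension() == 0)     # null space is {0}
    print(A.is_invertible())
    print(A.column_space() == QQ^2)
    print((QQ^2).span(A.columns()) == QQ^2 and A.rank() == 2)  # columns a basis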
Sage NME5 Nonsingular Matrix Equivalences, Round 5
Subsection OBC Orthonormal Bases and Coordinates
We learned about orthogonal sets of vectors in $\complex{m}$ back in Section O, and we also learned that orthogonal sets are automatically linearly independent (Theorem OSLI). When an orthogonal set also spans a subspace of $\complex{m}$, then the set is a basis. And when the set is orthonormal, then the set is an incredibly nice basis. We will back up this claim with a theorem, but first consider how you might manufacture such a set.
Suppose that $W$ is a subspace of $\complex{m}$ with basis $B$. Then $B$ spans $W$ and is a linearly independent set of nonzero vectors. We can apply the Gram-Schmidt Procedure (Theorem GSP) and obtain a linearly independent set $T$ such that $\spn{T}=\spn{B}=W$ and $T$ is orthogonal. In other words, $T$ is a basis for $W$, and is an orthogonal set. By scaling each vector of $T$ to norm 1, we can convert $T$ into an orthonormal set, without destroying the properties that make it a basis of $W$. In short, we can convert any basis into an orthonormal basis. Example GSTV, followed by Example ONTV, illustrates this process.
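Sage implements the Gram-Schmidt Procedure directly. Over an exact field containing the required square roots, such as the real algebraic numbers AA, the keyword orthonormal=True performs the scaling step as well; the starting basis below is invented for illustration:

    A = matrix(AA, [[1, 1, 0],
                    [1, 0, 1],
                    [0, 1, 1]])               # rows: a linearly independent set
    G, M = A.gram_schmidt(orthonormal=True)   # rows of G are orthonormal
    print(G * G.transpose() == identity_matrix(AA, 3))
    print(G.row_space() == A.row_space())     # the same subspace is spanned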
Unitary matrices (Definition UM) are another good source of orthonormal bases (and vice versa). Suppose that $Q$ is a unitary matrix of size $n$. Then the $n$ columns of $Q$ form an orthonormal set (Theorem CUMOS) that is therefore linearly independent (Theorem OSLI). Since $Q$ is invertible (Theorem UMI), we know $Q$ is nonsingular (Theorem NI), and then the columns of $Q$ span $\complex{n}$ (Theorem CSNM). So the columns of a unitary matrix of size $n$ are an orthonormal basis for $\complex{n}$.
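A compact Sage illustration of this chain of reasoning, using a small real unitary matrix built from a 3-4-5 triangle so that all of the arithmetic stays rational:

    Q = matrix(QQ, [[3/5, -4/5],
                    [4/5,  3/5]])
    print(Q.conjugate_transpose() * Q == identity_matrix(QQ, 2))  # Q is unitary
    print(Q.is_invertible())                  # so Q is nonsingular
    print((QQ^2).span(Q.columns()) == QQ^2)   # columns span, hence are a basis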
Why all the fuss about orthonormal bases? Theorem VRRB told us that any vector in a vector space could be written, uniquely, as a linear combination of basis vectors. For an orthonormal basis, finding the scalars for this linear combination is extremely easy, and this is the content of the next theorem. Furthermore, with vectors written this way (as linear combinations of the elements of an orthonormal set) certain computations and analysis become much easier. Here is the promised theorem.
Theorem COB Coordinates and Orthonormal Bases
Suppose that $B=\set{\vectorlist{v}{p}}$ is an orthonormal basis of the subspace $W$ of $\complex{m}$. For any $\vect{w}\in W$, \begin{equation*} \vect{w}= \innerproduct{\vect{v}_1}{\vect{w}}\vect{v}_1+ \innerproduct{\vect{v}_2}{\vect{w}}\vect{v}_2+ \innerproduct{\vect{v}_3}{\vect{w}}\vect{v}_3+ \cdots+ \innerproduct{\vect{v}_p}{\vect{w}}\vect{v}_p \end{equation*}
Example CROB4 Coordinatization relative to an orthonormal basis, $\complex{4}$
A slightly less intimidating example follows, in three dimensions and with just real numbers.
Example CROB3 Coordinatization relative to an orthonormal basis, $\complex{3}$
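The coordinate computations in these examples are easy to automate. A minimal sketch over the rationals with an invented orthonormal basis; since all of the entries are real, the inner product of Theorem COB reduces to the ordinary dot product:

    v1 = vector(QQ, [3/5, 4/5])          # v1, v2: an orthonormal basis of QQ^2
    v2 = vector(QQ, [-4/5, 3/5])
    w  = vector(QQ, [1, 2])
    a1 = v1.dot_product(w)               # Theorem COB: the coordinates are
    a2 = v2.dot_product(w)               # just inner products
    print(a1*v1 + a2*v2 == w)            # True: w is recovered exactly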
Not only do the columns of a unitary matrix form an orthonormal basis, but there is a deeper connection between orthonormal bases and unitary matrices. Informally, the next theorem says that if we transform each vector of an orthonormal basis by multiplying it by a unitary matrix, then the resulting set will be another orthonormal basis. And more remarkably, any matrix with this property must be unitary! As an equivalence (Proof Technique E) we could take this as our defining property of a unitary matrix, though it might not have the same utility as Definition UM.
Theorem UMCOB Unitary Matrices Convert Orthonormal Bases
Let $A$ be an $n\times n$ matrix and $B=\set{\vectorlist{x}{n}}$ be an orthonormal basis of $\complex{n}$. Define \begin{align*} C&=\set{A\vect{x}_1,\,A\vect{x}_2,\,A\vect{x}_3,\,\dots,\,A\vect{x}_n} \end{align*}
Then $A$ is a unitary matrix if and only if $C$ is an orthonormal basis of $\complex{n}$.
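Both directions of Theorem UMCOB can be probed computationally. Here is a minimal Sage sketch of the forward direction, reusing the rational unitary matrix from above and taking the standard basis as the orthonormal basis $B$:

    Q = matrix(QQ, [[3/5, -4/5],
                    [4/5,  3/5]])              # unitary, as checked earlier
    B = (QQ^2).basis()                         # an orthonormal basis
    C = [Q*x for x in B]                       # the transformed set
    G = matrix(QQ, C)                          # rows of G are the vectors of C
    print(G * G.conjugate_transpose() == identity_matrix(QQ, 2))  # orthonormal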