Change of basis

Section 4.3 Change of basis

Coordinate vectors and matrix representations work in tandem to model vectors in an abstract vector space \(V\) as column vectors in \(\R^n\text{,}\) and linear transformations \(T\colon V\rightarrow W\) as \(m\times n\) matrices. In both cases the model depends on our choice of basis. In this section we investigate how different basis choices affect these various models. Specifically, we consider the two questions below.

Given \(V\) and two ordered bases \(B\) and \(B'\text{,}\) what is the algebraic relation between \([\boldv]_B\) and \([\boldv]_{B'}\text{?}\)
Given \(T\colon V\rightarrow V\) and two ordered bases \(B\) and \(B'\text{,}\) what is the relation between \([T]_{B}\) and \([T]_{B'}\text{?}\)

We will tackle each question in turn. Both answers rely on something called a change of basis matrix \(\underset{B\rightarrow B'}{P}\text{.}\)

Subsection 4.3.1 Change of basis matrix

We define change of basis matrices via a column-by-column formula and motivate the definition retroactively with Theorem 4.3.2.

Definition 4.3.1. Change of basis matrix.

Let \(B=(\boldv_1, \boldv_2, \dots, \boldv_n)\) and \(B'\) be two ordered bases for the vector space \(V\text{.}\) The change of basis from \(B\) to \(B'\) is the \(n\times n\) matrix \(\underset{B\rightarrow B'}{P}\) defined as

\begin{equation} \underset{B\rightarrow B'}{P}= \begin{bmatrix} \vert \amp \vert \amp \amp \vert \\ \phantom{v}[\boldv_1]_{B'} \amp \phantom{v}[\boldv_2]_{B'}\amp \dots \amp \phantom{v}[\boldv_n]_{B}\\ \vert \amp \vert \amp \amp \vert \end{bmatrix}\text{.}\tag{4.3.1} \end{equation}

In other words, the \(j\)-th column of \(\underset{B\rightarrow B'}{P}\) is obtained by computing the coordinate vector of the \(j\)-th element of the original basis \(B\) with respect to the new basis \(B'\text{.}\)

Theorem 4.3.2. Change of basis for coordinate vectors.

Let \(B\) and \(B'\) be two ordered bases of the \(n\)-dimensional vector space \(V\text{.}\)

Recall that \(\id_V\colon V\rightarrow V\) is the identity transformation (Definition 3.2.3), defined as \(\id_V(\boldv)=\boldv\) for all \(\boldv\in V\text{.}\) We have

\begin{equation*} \underset{B\rightarrow B'}{P}=[\id_V]_{B}^{B'}\text{.} \end{equation*}
For all \(\boldv\in V\) we have

\begin{equation} \underset{B\rightarrow B'}{P}[\boldv]_B=[\boldv]_{B'}\text{.}\tag{4.3.2} \end{equation}

In other words, to convert the \(B\)-coordinates of a vector \(\boldv\in V\) to \(B'\)-coordinates, simply multiply on the left by the matrix \(\underset{B\rightarrow B'}{P}\text{.}\)
Property (4.3.2) defines \(\underset{B\rightarrow B'}{P}\) uniquely: i.e., if \(A\) satisfies \(A[\boldv]_B=[\boldv]_{B'}\) for all \(\boldv\in \R^n\text{,}\) then \(A=\underset{B\rightarrow B'}{P}\text{.}\)

Proof.

Let \(B=(\boldv_1, \boldv_2,\dots, \boldv_n)\text{.}\) From formula (4.2.1) applied to \(I_v\text{,}\) we see that the \(j\)-th column of \([I_V]_B^{B'}\) is

\begin{equation*} [\boldv_j]_{B'} \end{equation*}

for all \(1\leq j\leq n\text{.}\) Using formula (4.3.1) from Definition 4.3.1 this is precisely the \(j\)-th column of \(\underset{B\rightarrow B'}{P}\) for all \(1\leq j\leq n\text{.}\) We conclude that

\begin{equation*} [I_V]_B^{B'} = \underset{B\rightarrow B'}{P}\text{.} \end{equation*}
This follows from (1) and Theorem 4.2.6:

\begin{align*} \underset{B\rightarrow B'}{P}[\boldv]_B \amp = [\id_V]_{B}^{B'}[\boldv]_B \\ \amp = [\id_V(\boldv)]_{B'} \amp (\knowl{./knowl/th_matrixrep.html}{\text{Theorem 4.2.6}})\\ \amp =[\boldv]_{B'} \text{.} \end{align*}
By (2) of Theorem 4.2.6 (the uniqueness claim), if \(A\) satisfies \(A[\boldv]_B=[\boldv]_{B'}\) for all \(\boldv\in \R^n\text{,}\) then \(A=[\id_V]_{B}^{B'}\text{.}\) Since \([\id_V]_{B}^{B'}=\underset{B\rightarrow B'}{P}\text{,}\) we conclude \(A=\underset{B\rightarrow B'}{P}\text{.}\)

Example 4.3.3.

Let \(V=\R^2\text{,}\) \(B=(\boldv_1=(1,1),\boldv_2=(1,-1))\text{,}\) \(B'=(\boldw_1=(1,2), \boldw_2=(2,-1))\text{.}\)

Compute \(\underset{B\rightarrow B'}{P}\text{.}\)
Let \(\boldx=(4,-2)\text{.}\) Compute \([\boldx]_{B'}\) using the change of basis formula (4.3.2).

Solution.

Using Definition 4.3.1, we have

\begin{align*} \underset{B\rightarrow B'}{P}\amp = \begin{bmatrix}\vert \amp \vert \\ \hspace{7pt}[\boldv_1]_{B'} \amp \hspace{7pt}[\boldv_2]_{B'}\\ \vert \amp \vert \end{bmatrix}\\ \amp = \begin{bmatrix}\frac{3}{5}\amp -\frac{1}{5}\\ \frac{1}{5}\amp \frac{3}{5} \end{bmatrix} \text{.} \end{align*}

The two coordinate vector computations \([(1,1)]_{B'}=(3/5, 1/5)\) and \([(1,-1)]_{B'}=(-1/5,3/5)\) were done as usual using Procedure 4.1.4: that is, by setting up in turn the vector equations

\begin{align*} (1,1) \amp = c_1(1,2)+c_2(2,-1) \amp (1,-1)\amp =c_1(1,2)+c_2(2,-1) \end{align*}

and solving for \(c_1,c_2\) using Gaussian elimination.
The usual application of Procedure 4.1.4 produces the coordinate vector \([\boldx]_{B}=(1, 3)\text{.}\) We leave the details to you. To compute \([\boldv]_{B'}\text{,}\) we use the change of basis formula (4.3.2):

\begin{align*} [\boldx]_{B'} \amp =\underset{B\rightarrow B'}{P}[\boldx]_B \\ \amp = \begin{bmatrix}\frac{3}{5}\amp -\frac{1}{5}\\ \frac{1}{5}\amp \frac{3}{5} \end{bmatrix} \begin{amatrix}[c]1 \\ 3 \end{amatrix}\\ \amp = \begin{amatrix}[r]0\\ 2 \end{amatrix}\text{.} \end{align*}

This should come as now surprise since

\begin{equation*} \boldx=(4,-2)=2(2,-1)=0\boldw_1+2\boldw_2\text{.} \end{equation*}

Example 4.3.4.

Let \(V=P_2\text{,}\) \(B=(x^2,x,1)\text{,}\) \(B'=(p_1(x)=(x-2)^2, p_2(x)=x-2, p_3(x)=1)\text{.}\)

Compute \(\underset{B\rightarrow B'}{P}\text{.}\)
Compute \([x^2+x+1]_{B'}\) using (4.3.2).

Solution.

We have

\begin{align*} \underset{B\rightarrow B'}{P} \amp = \begin{bmatrix} \vert \amp \vert \amp \vert \\ [x^2]_{B'}\amp [x]_{B'}\amp [1]_{B'} \\ \vert \amp \vert \amp \vert \end{bmatrix} \\ \amp = \begin{bmatrix} 1\amp 0\amp 0 \\ 4\amp 1\amp 0\\ 4\amp 2\amp 1 \end{bmatrix} \text{.} \end{align*}

The first two coordinate vector computations are nontrivial; you can verify for yourself that \(x^2=1(x-2)^2+4(x-2)+4\) and \(x=0(x-2)^2+1(x-2)+2\text{.}\) Alternatively, see Remark 4.3.5) for a neat trick for computing these coordinate vectors.
Since \(B\) is the standard basis, we see easily that \([x^2+x+1]_{B}=(1,1,1)\text{.}\) Using (4.3.2) we have

\begin{align*} [x^2+x+1]_{B'} \amp = \underset{B\rightarrow B'}{P}\begin{bmatrix} 1\\ 1\\ 1 \end{bmatrix}\\ \amp =\begin{bmatrix} 1\\ 5\\ 7 \end{bmatrix} \text{.} \end{align*}

Verify for yourself that we do indeed have

\begin{equation*} x^2+x+1=1(x-2)^2+5(x-2)+7\text{.} \end{equation*}

Remark 4.3.5. Taylor’s formula and change of basis.

Let \(B=(x^n, x^{n-1}, \dots, x, 1)\) be the standard basis of \(P_n\text{.}\) Fix any constant \(a\in \R\text{,}\) and let \(B'=((x-a)^n, (x-a)^{n-1}, \dots, (x-a), 1)\text{.}\) It is easy to see that \(B'\) is also an ordered basis: a simple degree argument shows that the polynomials \(p_k(x)=(x-a)^k\) are linearly independent. It follows from Taylor’s theorem (from single-variable calculus) that given any polynomial \(p\in P_n\) we have

\begin{equation*} p(x)=p(a)+p'(a)(x-a)+\frac{p''(x)}{2}(x-a)^2+\cdots \frac{p^{(n)}(a)}{n!}(x-a)^n\text{.} \end{equation*}

We call this expression the expansion of \(p(x)\) about \(x=a\text{.}\) In terms of coordinate vectors, this means that

\begin{equation} [p]_{B'}=\left(\frac{p^{(n)}(a)}{n!}, \frac{p^{(n-1)}(a)}{(n-1)!}, \dots, p'(a), p(a)\right)\text{.}\tag{4.3.3} \end{equation}

In other words, Taylor’s theorem provides a simple derivative formula for computing coordinate vectors with respect to the basis \(B'\text{.}\)

The following properties are often useful when computing various change of basis matrices.

Theorem 4.3.6. Change of basis matrix properties.

Let \(B, B', B''\) be ordered bases for the \(n\)-dimensional vector space \(V\text{.}\)

We have

\begin{equation*} \underset{B\rightarrow B}{P}=I\text{.} \end{equation*}
The matrix \(\underset{B\rightarrow B'}{P}\) is invertible. In fact, we have

\begin{equation} \left(\underset{B\rightarrow B'}{P}\right)^{-1}=\underset{B'\rightarrow B}{P}\tag{4.3.4} \end{equation}
We have

\begin{equation*} \underset{B\rightarrow B''}{P}=\underset{B'\rightarrow B''}{P}\, \underset{B\rightarrow B'}{P}\text{.} \end{equation*}

Proof.

Let \(B=(\boldv_1, \boldv_2,\dots, \boldv_n)\text{.}\) From formula (4.2.1) applied to \(I_v\text{,}\) we see that the \(j\)-th column of \([I_V]_B^{B'}\) is

\begin{equation*} [\boldv_j]_{B'} \end{equation*}

for all \(1\leq j\leq n\text{.}\) Using formula (4.3.1) from Definition 4.3.1 this is precisely the \(j\)-th column of \(\underset{B\rightarrow B'}{P}\) for all \(1\leq j\leq n\text{.}\) We conclude that

\begin{equation*} [I_V]_B^{B'} = \underset{B\rightarrow B'}{P}\text{.} \end{equation*}
This follows from (1) and Theorem 4.2.6:

\begin{align*} \underset{B\rightarrow B'}{P}[\boldv]_B \amp = [\id_V]_{B}^{B'}[\boldv]_B \\ \amp = [\id_V(\boldv)]_{B'} \amp (\knowl{./knowl/th_matrixrep.html}{\text{Theorem 4.2.6}})\\ \amp =[\boldv]_{B'} \text{.} \end{align*}
By (2) of Theorem 4.2.6 (the uniqueness claim), if \(A\) satisfies \(A[\boldv]_B=[\boldv]_{B'}\) for all \(\boldv\in \R^n\text{,}\) then \(A=[\id_V]_{B}^{B'}\text{.}\) Since \([\id_V]_{B}^{B'}=\underset{B\rightarrow B'}{P}\text{,}\) we conclude \(A=\underset{B\rightarrow B'}{P}\text{.}\)

Example 4.3.7. \(V=\R^n\text{,}\) \(B\) standard basis.

Consider the special situation where \(V=\R^n\text{,}\) \(B\) is the standard basis, and \(B'=\{\boldv_1,\dots,\boldv_n\}\) is some nonstandard basis. In this case we have

\begin{align*} \underset{B'\rightarrow B}{P}\amp =\begin{bmatrix}\vert\amp\vert\amp \amp \vert \\ [\boldv_1]_B\amp [\boldv_2]_B \amp \cdots\amp [\boldv_n]_B\\ \vert\amp \vert\amp \amp \vert \end{bmatrix}\\ \amp = \begin{bmatrix}\vert\amp\vert\amp \amp \vert \\ \boldv_1\amp \boldv_2\amp\cdots\amp \boldv_n\\ \vert\amp \vert\amp \amp \vert \end{bmatrix} \amp (B \text{ standard basis})\text{.} \end{align*}

In other words, \(\underset{B'\rightarrow B}{P}\) is the matrix whose \(j\)-th column is just the \(j\)-th element of \(B'\text{.}\) Thus, in this situation we can compute \(\underset{B'\rightarrow B}{P}\) by placing the elements of \(B'\) as columns of a matrix, and then use (2) of Theorem 4.3.6 to compute \(\underset{B\rightarrow B'}{P}=\left(\underset{B'\rightarrow B}{P}\right)^{-1}\text{.}\)

Example 4.3.8.

Let \(V=\R^2\text{,}\) \(B=((1,0),(0,1))\text{,}\) \(B'=\{(1,\sqrt{3}),(-\sqrt{3},1)\}\text{.}\) Compute \(\underset{B\rightarrow B'}{P}\) and \(\underset{B'\rightarrow B}{P}\text{.}\)

Solution.

According to Example 4.3.7 we have

\begin{equation*} \underset{B'\rightarrow B}{P}=\begin{bmatrix}1\amp -\sqrt{3}\\ \sqrt{3}\amp 1 \end{bmatrix}\text{.} \end{equation*}

We then compute

\begin{equation*} \underset{B\rightarrow B'}{P}=(\underset{B'\rightarrow B}{P})^{-1}=\left(\begin{bmatrix}1\amp -\sqrt{3}\\ \sqrt{3}\amp 1 \end{bmatrix} \right)^{-1}=\frac{1}{4}\begin{bmatrix}1\amp \sqrt{3}\\ -\sqrt{3}\amp 1 \end{bmatrix}\text{.} \end{equation*}

Remark 4.3.9. \(B\) standard basis of \(V\).

The observation from Example 4.3.7 applies more generally when \(B\) is the standard basis of the given vector space \(V\) and \(B'=(\boldv_1, \boldv_2, \dots, \boldv_n)\) is nonstandard. In this case computing \(\underset{B'\rightarrow B}{P}\) will be easy as the coordinate vectors \([\boldv_j]_{B}\) can be produced by inspection. See Example 4.3.10.

Example 4.3.10.

Let \(V=M_{22}\text{,}\) \(B=(E_{11}, E_{12}, E_{21}, E_{22})\) (standard basis) and \(B'=(A_1,A_2,A_3,A_4)\text{,}\) where

\begin{equation*} A_1=\begin{amatrix}[rr] 1\amp 1\\ 1\amp 1 \end{amatrix}, A_2=\begin{amatrix}[rr] 1\amp -1\\ 1\amp -1 \end{amatrix}, A_3=\begin{amatrix}[rr] 1\amp 1\\ -1\amp -1 \end{amatrix}, A_4=\begin{amatrix}[rr] -1\amp 1\\ 1\amp -1 \end{amatrix}\text{.} \end{equation*}

Compute \(\underset{B'\rightarrow B}{P}\text{.}\)

Solution.

We have

\begin{align*} \underset{B'\rightarrow B}{P}\amp = \begin{bmatrix} \vert\amp \vert\amp \vert\amp \vert\\ [A_1]_{B}\amp [A_2]_B\amp [A_3]_B\amp [A_4]_B\\ \vert\amp \vert\amp \vert\amp \vert \end{bmatrix}\\ \amp = \begin{amatrix}[rrrr] 1\amp 1\amp 1\amp -1\\ 1\amp -1\amp 1\amp 1\\ 1\amp 1\amp -1\amp 1\\ 1\amp -1\amp -1\amp -1 \end{amatrix} \text{.} \end{align*}

Here the coordinate vectors \([A_i]_B\) are easily computed by inspection since \(B\) is the standard basis.

It turns out that \(\underset{B\rightarrow B'}{P}=(\underset{B'\rightarrow B}{P})^{-1}\) is not so difficult to compute in this case since the columns \(\boldc_j\) of \(\underset{B'\rightarrow B}{P}\) satisfy

\begin{equation*} \boldc_i\cdot\boldc_j=\begin{cases} 4\amp \text{if } i=j\\ 0\amp \text{if } i\ne j \end{cases}\text{.} \end{equation*}

From this observation and Theorem 2.1.22 it is easy to see that

\begin{equation*} \underset{B\rightarrow B'}{P}=(\underset{B'\rightarrow B}{P})^{-1}=\frac{1}{4} \begin{amatrix}[rrrr] 1\amp 1\amp 1\amp 1\\ 1\amp -1\amp 1\amp -1\\ 1\amp 1\amp -1\amp -1\\ -1\amp -1\amp 1\amp -1 \end{amatrix}\text{.} \end{equation*}

Video example: change of basis matrix.

Figure 4.3.11. Video: change of basis matrix

Before connecting change of basis matrices with matrix representations of linear transformations, it is worth gathering some the different techniques for computing change of basis matrices we have discussed so far.

Procedure 4.3.12. Change of basis computational tips.

Let \(B=(\boldv_1, \boldv_2, \dots, \boldv_n)\) and \(B'\) be ordered bases of the vector space \(V\text{.}\) Below you find a variety of techniques for computing \(\underset{B\rightarrow B'}{P}\) and \(\underset{B'\rightarrow B}{P}\text{.}\)

To compute \(\underset{B\rightarrow B'}{P}\) directly, we must compute \([\boldv_j]_{B'}\) for each \(1\leq j\leq n\text{.}\) This typically involves setting up and solving a linear system.
We have \(\underset{B'\rightarrow B}{P}=(\underset{B\rightarrow B'}{P})^{-1}\text{.}\) This observation is useful in situations where (a) one change of basis matrix is easier to compute than the other and (b) computing inverse matrices is not too onerous.
If \(B\) is the standard basis of \(V\text{,}\) then \(\underset{B'\rightarrow B}{P}\) is easy to compute. (See Remark 4.3.9.)

Subsection 4.3.2 Change of basis for transformations

We now investigate how our choice of basis affects matrix representations of linear transformations. We will only consider the special case where \(T\colon V\rightarrow V\) and we are comparing matrix representations \([T]_B\) and \([T]_{B'}\) for two different ordered bases of \(V\text{.}\)

Theorem 4.3.13. Change of basis for transformations.

Let \(V\) be finite-dimensional, let \(T\colon V\rightarrow V\) be linear, and let \(B\) and \(B'\) be two ordered bases for \(V\text{.}\) We have

\begin{equation} [T]_{B'}=\underset{B\rightarrow B'}{P}\, [T]_B\, \underset{B'\rightarrow B}{P}\text{,}\tag{4.3.5} \end{equation}

or equivalently

\begin{equation} [T]_{B'}=\underset{B'\rightarrow B}{P}^{-1}\, [T]_B\, \underset{B'\rightarrow B}{P}\text{.}\tag{4.3.6} \end{equation}

Proof.

First observe that (4.3.6) follows from (4.3.5) and (2) of Theorem 4.3.6. Next, to prove (4.3.5), it suffices by (2) of Theorem 4.2.6 to show that the matrix \(A=\underset{B\rightarrow B'}{P}\, [T]_B\, \underset{B'\rightarrow B}{P}\) satisfies

\begin{equation*} A[\boldv]_{B'}=[T(\boldv)]_{B'} \end{equation*}

for all \(\boldv\in V\text{.}\) To this end, given any \(\boldv\in V\text{,}\) we have

\begin{align*} A[\boldv]_{B'}=\underset{B\rightarrow B'}{P}\, [T]_B\, \underset{B'\rightarrow B}{P}[\boldv]_{B'} \amp= \underset{B\rightarrow B'}{P}\, [T]_B [\boldv]_B \amp (\knowl{./knowl/th_change_of_basis_coordinates.html}{\text{Theorem 4.3.2}})\\ \amp= \underset{B\rightarrow B'}{P}[T(\boldv)]_{B} \amp (\knowl{./knowl/th_matrixrep.html}{\text{Theorem 4.2.6}}, (1)) \\ \amp = [\boldv]_{B'} \amp (\knowl{./knowl/th_change_of_basis_coordinates.html}{\text{Theorem 4.3.2}})\text{.} \end{align*}

Remark 4.3.14. Getting change of basis formulas correct.

It is easy to get the various details of the change of basis formula wrong. Here is a potential way to keep things organized in your mind.

We wish to relate \([T]_{B'}\) and \([T]_B\) with an equation of the form \([T]_{B'}=*[T]_B*\text{,}\) where the asterisks are to be replaced with change of basis matrices or their inverses. Think of the three matrices on the right-hand side of this equation as a sequence of three things done to coordinate vectors, reading from right to left.
\([T]_{B'}\) takes as inputs \(B'\)-coordinates of vectors, and outputs \(B'\)-coordinates. Thus the same should be true for \(*[T]_B*\text{.}\)
Since \([T]_B\) takes as inputs \(B\)-coordinates, we must first convert from \(B'\)-coordinates to \(B\)-coordinates. So we should have \([T]_{B'}=*[T]_B\underset{B'\rightarrow B}{P}\text{.}\)
Since \([T]_B\) outputs \(B\)-coordinates, we need to then convert back to \(B'\)-coordinates. Thus \([T]_{B'}=\underset{B\rightarrow B'}{P}[T]_B\underset{B'\rightarrow B}{P}\text{.}\)
If desired you may replace \(\underset{B\rightarrow B'}{P}\) with \(\underset{B'\rightarrow B}{P}^{-1}\text{.}\)

Example 4.3.15.

Let \(T\colon P_2\rightarrow P_2\) be defined as \(T(p(x))=p(x)+2p'(x)+xp''(x)\text{.}\)

Let \(B=(x^2, x, 1)\text{.}\) Compute \([T]_B\text{.}\)
Let \(B'=(x^2+x+1, x^2+1, x+1)\text{.}\) Use the change of basis formula to compute \([T]_{B'}\text{.}\)

Solution.

We easily compute \([T]_B=\begin{bmatrix}1\amp 0\amp 0\\ 6\amp 1\amp 0\\ 0\amp 2\amp 1 \end{bmatrix}\) using our usual recipe.
We need to compute both change of basis matrices. Since \(B\) is standard we compute

\begin{equation*} \underset{B'\rightarrow B}{P}=\begin{bmatrix}1\amp 1\amp 0\\ 1\amp 0\amp 1\\ 1\amp 1\amp 1 \end{bmatrix} \end{equation*}

essentially by inspection. It follows that

\begin{equation*} \underset{B\rightarrow B'}{P}=(\underset{B'\rightarrow B}{P})^{-1}=\begin{amatrix}[rrr] 1\amp 1\amp -1\\ 0\amp -1\amp 1\\ -1\amp 0\amp 1 \end{amatrix}\text{.} \end{equation*}

Lastly, using (4.3.5) we have

\begin{align*} [T]_{B'}\amp =\underset{B\rightarrow B'}{P}[T]_B\underset{B'\rightarrow B}{P} \\ \amp = \begin{amatrix}[rrr] 1\amp 1\amp -1\\ 0\amp -1\amp 1\\ -1\amp 0\amp 1 \end{amatrix} \begin{bmatrix}1\amp 0\amp 0\\ 6\amp 1\amp 0\\ 0\amp 2\amp 1 \end{bmatrix} \begin{bmatrix}1\amp 1\amp 0\\ 1\amp 0\amp 1\\ 1\amp 1\amp 1 \end{bmatrix}\\ \amp = \begin{amatrix}[rrr] 5\amp 6\amp -2\\ -4\amp -5\amp 2\\ 2\amp 0\amp 3 \end{amatrix}\text{.} \end{align*}

Consider the special case where \(T\colon \R^n\rightarrow \R^n\text{:}\) that is, when \(V=\R^n\) is a space of \(n\)-tuples. We know from Corollary 3.6.18 that \(T=T_A\) for a unique \(n\times n\) matrix \(A\text{.}\) Recall that \(A\) is called the standard matrix of \(T\) (3.6.19), and satisfies \(T(\boldx)=A\boldx\) for all \(\boldx\in \R^n\text{.}\) We often wish to compute \(A\text{,}\) as it provides a convenient matrix formula for \(T\text{.}\)

To compute \(A\) directly using the recipe in 3.6.18, we must compute \(T(\bolde_j)\) for each of the standard basis elements \(\bolde_j\text{.}\) For many naturally occurring transformations \(T\text{,}\) this is often not so easy to do. Theorem 4.3.13 provides an indirect method in such cases.

According to Theorem 4.2.3 we have \(A=[T]_B\text{:}\) i.e., the standard matrix of \(T\) is none other than the matrix representing \(T\) with respect to the standard basis. This connection allows us to compute \(A=[T]_B\) by first computing \([T]_{B'}\) for some more convenient basis \(B'\text{,}\) and then using the change of basis formula.

Procedure 4.3.16. Computing the standard matrix using change of basis.

Let \(T\colon \R^n\rightarrow \R^n\) be a linear transformation, and let \(A\) be its standard matrix. To compute \(A\) using the change of basis formula (4.3.5), proceed as follows.

Find a convenient basis \(B'\) for which the action of \(T\) is easily understood.
Compute \(A'=[T]_{B'}\text{.}\)
Let \(B\) be the standard basis of \(\R^n\text{.}\) Recall that \(A=[T]_B\text{.}\) Now compute \(A\) using the change of basis formula as

\begin{equation*} A=\underset{B'\rightarrow B}{P}\, A'\, \underset{B\rightarrow B'}{P}\text{.} \end{equation*}

Procedure 4.3.16 is a powerful technique for computing matrix formulas for many interesting geometric linear transformations of \(\R^n\text{:}\) e.g., rotations, reflections, and orthogonal projections. Often the very definition of such transformations will suggest a more convenient nonstandard basis \(B'\text{:}\) one that reflects the geometry involved. The next example illustrates this nicely.

Example 4.3.17. Orthogonal projection (again).

Consider \(V=\R^3\) together with the dot product. Let’s derive (once again) a matrix formula for orthogonal projection \(\operatorname{proj}_W\colon \R^3\rightarrow \R^3\text{,}\) where \(W=\{(x,y,z)\colon x+y+z=0\}\text{.}\) In other words we want to compute \(A=[\operatorname{proj}_W]_B\text{,}\) where \(B=((1,0,0), (0,1,0), (0,0,1))\) is the standard basis. We will do so indirectly by first computing \([\operatorname{proj}_W]_{B'}\) with respect to a more convenient basis: namely, \(B'=((1,-1,0),(1,0,-1), (1,1,1))\text{.}\) This is the same basis from Example 4.2.12, and was selected deliberateley so that the first two vectors form a basis of \(W\text{,}\) and the third vector spans the normal line to \(W\text{.}\) As in Example 4.2.12 we then easily compute

\begin{equation*} [\operatorname{proj}_W]_{B'}=\begin{bmatrix} 1\amp 0\amp 0\\ 0\amp 1\amp 0\\ 0\amp 0\amp 0 \end{bmatrix}\text{.} \end{equation*}

Now use (4.3.6) to compute

\begin{align*} A=[\operatorname{proj}_W]_{B}\amp = \underset{B'\rightarrow B}{P}[\operatorname{proj}_W]_{B'}\underset{B\rightarrow B'}{P} \\ \amp = \underset{B'\rightarrow B}{P}[\operatorname{proj}_W]_{B'}\left(\underset{B'\rightarrow B}{P}\right)^{-1}\\ \amp= \begin{amatrix}[rrr] 1\amp 1\amp 1\\ -1\amp 0\amp 1\\ 0\amp -1\amp 1 \end{amatrix} \begin{bmatrix} 1\amp 0\amp 0\\ 0\amp 1\amp 0\\ 0\amp 0\amp 0 \end{bmatrix} \begin{amatrix}[rrr] \frac{1}{3} \amp -\frac{2}{3} \amp \frac{1}{3} \\ \frac{1}{3} \amp \frac{1}{3} \amp -\frac{2}{3} \\ \frac{1}{3} \amp \frac{1}{3} \amp \frac{1}{3} \end{amatrix}\\ \amp = \frac{1}{3}\begin{amatrix}[rrr] 2\amp -1\amp -1\\ -1\amp 2\amp -1\\ -1\amp -1\amp 2 \end{amatrix}\text{.} \end{align*}

Lo and behold, we’ve discovered our matrix formula for projection onto \(W\) once again! (Compare with Example 5.3.17 and Example 4.2.12.)

Video example: change of basis for transformations.

Figure 4.3.18. Video: change of basis for transformations

Video example: change of basis and reflection.

Figure 4.3.19. Video: computing reflection via change of basis

Subsection 4.3.3 Similarity and the holy commutative tent of linear algebra

Theorem 4.3.13 supplies an algebraic answer to the question: What is the relation between two matrix representations \(A=[T]_B\) and \(A'=[T]_{B'}\text{?}\) Letting \(P=\underset{B'\rightarrow B}{P}\text{,}\) equation (4.3.6) becomes \(A'=P^{-1}AP\text{.}\) Matrices satisfying such a relation are said to be similar.

Definition 4.3.20. Similar matrices.

Matrices \(A, A'\in M_{nn}\) are similar if there is an invertible matrix \(P\) such that \(A'=P^{-1}AP\text{.}\)

So any two matrix representations of a linear transformation \(T\colon V\rightarrow V\) are similar in the technical sense of Definition 4.3.20. In fact, a converse of sorts is also true, as articulated in the theorem below.

Theorem 4.3.21. Similarity and matrix representations.

Two \(n\times n\) matrices \(A\) and \(A'\) are similar if and only if there is a linear transformation \(T\colon V\rightarrow V\) and bases \(B, B'\) of \(V\) satisfying \(A=[T]_B\) and \(A'=[T]_{B'}\text{.}\)

Proof.

The discussion above shows that if \(A=[T]_B\) and \(A'=[T]_{B'}\text{,}\) then \(A'=P^{-1}AP\text{,}\) where \(P=\underset{B'\rightarrow B}{P}\text{;}\) thus \(A\) and \(A'\) are similar in this case.

Now assume that \(A\) and \(A'\) are similar. By definition this means there is an invertible matrix \(P\) such that \(A'=P^{-1}AP\text{.}\) Define \(T\colon \R^n\rightarrow \R^n\) as the matrix transformation \(T=T_A\text{.}\) According to Theorem 4.2.3 we have \(A=[T]_B\) where \(B\) is the standard basis of \(\R^n\text{.}\) Next, letting \(B'\) be the ordered basis whose \(j\)-th element is the \(j\)-th column of \(P\text{,}\) we have \(P=\underset{B'\rightarrow B}{P}\) (Example 4.3.7), and hence

\begin{equation*} A'=P^{-1}AP=\underset{B\rightarrow B'}{P}\, [T]_B\, \underset{B'\rightarrow B}{P}=[T]_{B'}\text{,} \end{equation*}

as desired.

We will see in Section 4.4 that similar matrices are indeed similar algebraically speaking: i.e., they share many of the same properties. Theorem 4.3.21 provides the theoretical foundation to understand why this should be so: if \(A\) and \(A'\) are similar, then they are two matrix representations of a common linear transformation \(T\text{;}\) their many shared properties are simply inherited from the single overlying linear transformation that they both represent! This circle of ideas is neatly encompassed by Figure 4.3.22.

Figure 4.3.22. The holy commutative tent of linear algebra. Here we have \(P=\underset{B'\rightarrow B}{P}\) and \(A'=P^{-1}AP\text{.}\)

Perhaps a little exegesis is in order here. Think of the map \(T\colon V\rightarrow V\) as a linear transformation up in abstract heaven; and think of the two matrices \(A=[T]_B\) and \(A'=[T]_{B'}\) as two earthly shadows of \(T\text{.}\) OK, this gets at the holy bit somewhat, but why commutative? Each face of the tent is a commutative diagram, as we now explain.

Slanted sides of the tent.

The commutativity of the two slanted sides of the tent is a consequence of Theorem 4.2.9:

\begin{align*} [T(\boldv)]_B[\boldv]_B \amp = [T(\boldv)]_B \amp [T]_{B'}[\boldv]_{B'}\amp =[T(\boldv)]_{B'}\text{.} \end{align*}

Triangular ends of the tent.

Let \(P=\underset{B'\rightarrow B}{P}\text{,}\) so that \(P^{-1}=\underset{B\rightarrow B'}{P}\text{.}\) The commutativity of the two triangular ends of the tent are consequences of Theorem 4.3.2:

\begin{align*} P[\boldv]_{B'} \amp=[\boldv]_B \amp P^{-1}[\boldv]_B\amp=[\boldv]_{B'} \text{.} \end{align*}

Base of tent.

Lastly the commutativity of the base of the tent is a consequence of Theorem 4.3.13:

\begin{equation*} [T]_{B'}=\underset{B\rightarrow B'}{P}[T]_B\underset{B'\rightarrow B}{P}, \end{equation*}

or equivalently,

\begin{equation*} A'=P^{-1}AP\text{.} \end{equation*}

In summary, the holy commutative tent conveys a close connection between the three maps

\begin{equation*} \R^n\xrightarrow{A}\R^n, \R^n\xrightarrow{A'}\R^n, V\xrightarrow{T}V\text{.} \end{equation*}

Since the base of the tent is commutative, and since the maps given by \(P\) and \(P^{-1}\) are invertible, we can translate back and forth between the matrices \(A\) and \(A'\text{.}\) Furthermore, since the two slanted sides of the tent are commutative, and since the coordinate vector transformations are invertible, we can translate up and down between our two matrix representations \(A\) and \(A'\) and the overlying linear transformation \(T\text{.}\) There is one true \(T\text{!}\)

Mantra 4.3.23. Similar matrices mantra.

Similar matrices are but two shadows of a single overlying linear transformation.

Exercises 4.3.4 Exercises

WeBWork Exercises

1.

Consider the ordered bases \(B=((-4,1),(7,-2))\) and \(C=((4,-1),(-1,2))\) for the vector space \({\mathbb R}^2\text{.}\)

a. Find the transition matrix from \(C\) to the standard ordered basis \(E=((1,0),(0,1))\text{.}\)

\(T_C^E =\) (2 × 2 array)

b. Find the transition matrix from \(B\) to \(E\text{.}\)

\(T_B^E =\) (2 × 2 array)

c. Find the transition matrix from \(E\) to \(B\text{.}\)

\(T_E^B =\) (2 × 2 array)

d. Find the transition matrix from \(C\) to \(B\text{.}\)

\(T_C^B =\) (2 × 2 array)

e. Find the coordinates of \(u= (-1,3)\) in the ordered basis \(B\text{.}\) Note that \([u]_B = T_E^B [u]_E\text{.}\)

\([u]_B =\) (2 × 1 array)

f. Find the coordinates of \(v\) in the ordered basis \(B\) if the coordinate vector of \(v\) in \(C\) is \([v]_C = (-1,-1)\text{.}\)

\([v]_B =\) (2 × 1 array)

2.

Consider the ordered bases \(B=(\left[\begin{array}{cc} 3 \amp 0\cr 0 \amp -3 \end{array}\right],\left[\begin{array}{cc} 0 \amp 0\cr -1 \amp 0 \end{array}\right])\) and \(C=(\left[\begin{array}{cc} -5 \amp 0\cr -1 \amp 5 \end{array}\right],\left[\begin{array}{cc} -1 \amp 0\cr -3 \amp 1 \end{array}\right])\) for the vector space \(V\) of lower triangular \(2\times 2\) matrices with zero trace.

a. Find the transition matrix from \(C\) to \(B\text{.}\)

\(T_C^B =\) (2 × 2 array)

b. Find the coordinates of \(M\) in the ordered basis \(B\) if the coordinate vector of \(M\) in \(C\) is \([M]_C = \left[\begin{array}{c} -1\cr 3 \end{array}\right]\text{.}\)

\([M]_B =\) (2 × 1 array)

c. Find \(M\text{.}\)

\(M =\) (2 × 2 array)

3.

Let \(f : \mathbb{R}^{2} \to \mathbb{R}^{3}\) be the linear transformation defined by

\begin{equation*} f(\vec{x}) = \left[\begin{array}{cc} 1 \amp 0\cr -2 \amp 1\cr -1 \amp 1 \end{array}\right] \vec{x}. \end{equation*}

Let

\begin{equation*} \begin{array}{lcl} \mathcal{B} \amp = \amp \lbrace \left\lt 1,2\right>, \left\lt 2,5\right> \rbrace, \\ \mathcal{C} \amp = \amp \lbrace \left\lt 1,1,-1\right>, \left\lt 1,0,-1\right>, \left\lt 1,1,-2\right> \rbrace, \end{array} \end{equation*}

be bases for \(\mathbb{R}^{2}\) and \(\mathbb{R}^{3}\text{,}\) respectively. Find the matrix \(\lbrack f \rbrack_{\mathcal{B}}^{\mathcal{C}}\) for \(f\) relative to the basis \(\mathcal{B}\) in the domain and \(\mathcal{C}\) in the codomain.

\(\lbrack f \rbrack_{\mathcal{B}}^{\mathcal{C}} =\) (3 × 2 array)

4.

Let \(\mathcal{P}_{n}\) be the vector space of all polynomials of degree \(n\) or less in the variable \(x\text{.}\) Let \(D : \mathcal{P}_{3} \to \mathcal{P}_{2}\) be the linear transformation defined by \(D(p(x)) = p'(x)\text{.}\) That is, \(D\) is the derivative operator. Let

\begin{equation*} \begin{array}{lcl} \mathcal{B} \amp = \amp \lbrace 1,x,x^2,x^3 \rbrace, \\ \mathcal{C} \amp = \amp \lbrace 1,x,x^2 \rbrace, \end{array} \end{equation*}

be ordered bases for \(\mathcal{P}_{3}\) and \(\mathcal{P}_{2}\text{,}\) respectively. Find the matrix \(\lbrack D \rbrack_{\mathcal{B}}^{\mathcal{C}}\) for \(D\) relative to the basis \(\mathcal{B}\) in the domain and \(\mathcal{C}\) in the codomain.

\(\lbrack D \rbrack_{\mathcal{B}}^{\mathcal{C}} =\) (3 × 4 array)

5.

\begin{equation*} \begin{array}{lcl} \mathcal{B} \amp = \amp \lbrace 1, x, x^{2}, x^{3} \rbrace, \\ \mathcal{C} \amp = \amp \lbrace 2+x-x^{2}, -2-2x+x^{2}, -1-2x+x^{2} \rbrace, \end{array} \end{equation*}

\(\lbrack D \rbrack_{\mathcal{B}}^{\mathcal{C}} =\) (3 × 4 array)

Change of basis matrix.

In each exercise a vector space \(V\) is given along with two ordered bases \(B\) and \(B'\text{.}\) Compute \(\underset{B\rightarrow B'}{P}\) and \(\underset{B'\rightarrow B}{P}\text{.}\)

6.

\(V=\R^2\text{,}\) \(B=(\bolde_1, \bolde_2)\text{,}\) \(B'=\left((3,4),(1,1)\right)\)

7.

\(V=\R^2\text{,}\) \(B=\left((1,2), (2,1)\right)\text{,}\) \(B'=\left((-1,1),(1,1)\right)\)

8.

\(V=\R^3\text{,}\) \(B=(\bolde_1, \bolde_2, \bolde_3)\text{,}\) \(B'=\left( (1,0,0),(1,1,0),(1,1,1) \right)\)

9.

\(V=P_2\text{,}\) \(B=(x^2, x, 1)\text{,}\) \(B'=((x-2)^2, (x-2), 1)\)

10.

Let \(V=\R^3\text{,}\) \(B=(\bolde_1, \bolde_2, \bolde_3)\text{,}\) \(B'=\left( (1,0,0),(1,1,0),(1,1,1) \right)\text{,}\) as in Exercise 4.3.4.8.

Compute \([(1,2,3)]_{B'}\) directly using Definition 4.1.3.
Compute \([(1,2,3)]_{B'}\) using the change of basis matrix \(\underset{B\rightarrow B'}{P}\) and (4.3.2).

11.

Let \(V=P_2\text{,}\) \(B=(x^2, x, 1)\text{,}\) \(B'=((x-2)^2, (x-2), 1)\text{,}\) as in Exercise 4.3.4.9.

Compute \([x^2-x+4]_{B'}\) directly using Definition 4.1.3.
Compute \([x^2-x+4]_{B'}\) using the change of basis matrix \(\underset{B\rightarrow B'}{P}\) and the change of basis formula (4.3.2).

12.

Let \(B\) be the standard basis of \(\R^2\text{.}\) Find the ordered basis \(B'\) for which the change of basis matrix \(\underset{B\rightarrow B'}{P}\) is given by

\begin{equation*} \underset{B\rightarrow B'}{P}=\begin{amatrix}[rr]5\amp 1\\ -3\amp 2 \end{amatrix}\text{.} \end{equation*}

13.

Let \(B\) be the standard basis of \(P_2\text{.}\) Find the ordered basis \(B'\) for which the change of basis matrix \(\underset{B'\rightarrow B}{P}\) is given by

\begin{equation*} \underset{B'\rightarrow B}{P}=\begin{amatrix}[rrr]1\amp 1\amp 2\\ -3\amp 2\amp 0\\ 0\amp -1\amp 1 \end{amatrix}\text{.} \end{equation*}

14.

Suppose \(B=(\boldv_1, \boldv_2)\) and \(B'=(\boldw_1, \boldw_2)\) are two bases for the space \(V\) related by the change of basis matrix

\begin{equation*} \underset{B\rightarrow B'}{P}=\begin{amatrix}[rr]1\amp -2 \\ 3\amp 1 \end{amatrix}\text{.} \end{equation*}

Let \(\boldv=-3\boldv_1+2\boldv_2\text{.}\) Compute \([\boldv]_B\) and \([\boldv]_{B'}\text{.}\)
Let \(\boldw=\boldw_1+2\boldw_2\text{.}\) Compute \([\boldw]_B\) and \([\boldw]_{B'}\text{.}\)

15.

Let \(B\text{,}\) \(B'\text{,}\) and \(B''\) be three ordered bases of the vector space \(V\text{.}\)

Show that

\begin{equation} \underset{B\rightarrow B''}{P}=\underset{B'\rightarrow B''}{P}\underset{B\rightarrow B'}{P}\text{.}\tag{4.3.7} \end{equation}

To do so, set \(A=\underset{B'\rightarrow B''}{P}\) and \(B=\underset{B\rightarrow B'}{P}\) and show that the matrix \(AB\) satisfies the defining property of \(\underset{B\rightarrow B''}{P}\text{:}\) i.e.,

\begin{equation*} AB[\boldv]_{B}=[\boldv]_{B''} \end{equation*}

for all \(\boldv\in V\text{.}\)
Using (a), show that

\begin{equation*} \underset{B'\rightarrow B''}{P}=\underset{B\rightarrow B''}{P}\underset{B'\rightarrow B}{P}\text{.} \end{equation*}

Change of basis methods.

In each exercise a vector space \(V\) is given along with two ordered bases \(B'\) and \(B''\text{.}\)

Compute \(\underset{B'\rightarrow B''}{P}\) directly using Definition 4.3.1
Let \(B\) be the standard basis for \(V\text{.}\) Compute \(\underset{B'\rightarrow B''}{P}\) using formula (4.3.7) from Exercise 4.3.4.15.

16.

\(V=\R^2\text{,}\) \(B'=\left((1,1),(1,-1)\right)\text{,}\) \(B''=\left((2,1),(-1,2)\right)\)

17.

\(V=P_2\text{,}\) \(B'=(x^2+1, x+1, 1)\text{,}\) \(B''=(x^2+x+1, x^2+x, x^2)\)

18.

Let \(T\colon \R^3\rightarrow \R^3\) be the linear transformation defined as \(T(x,y,z)=(x+2y+z, -y, x+7z)\text{.}\) Let \(B\) be the standard basis of \(\R^3\text{,}\) and let \(B'=\left((1,0,0), (1,1,0), (1,1,1)\right)\text{.}\)

Compute \([T]_B\text{.}\)
Compute \([T]_{B'}\) using Theorem 4.3.13.

19.

Let \(T\colon P_1\rightarrow P_1\) be the linear transformation defined as \(T(p(x))=(x+1)p'(x)\text{.}\) Let \(B\) be the standard basis of \(P_1\text{,}\) and let \(B'=\left(2x+1, x-1\right)\text{.}\)

Compute \([T]_B\text{.}\)
Compute \([T]_{B'}\) using Theorem 4.3.13.

20. Reflection in \(\R^2\).

Let \(\boldv=(a,b)\in \R^2\) be nonzero and define \(\ell=\Span\{\boldv\}\text{,}\) the line passing through the origin with direction vector \(\boldv\text{.}\) Let \(T\colon \R^2\rightarrow \R^2\) be reflection through \(\ell\text{.}\) (See Definition 3.2.16.) In this exercise we will use a change of basis argument to find a formula for the standard matrix of \(T\text{:}\) i.e., the matrix \(A\) satisfying \(T(\boldx)=A\boldx\) for all \(\boldx\in \R^2\text{.}\) Our answer will be expressed in terms of \(a\) and \(b\text{.}\)

Pick a basis \(B'=\{\boldv_1, \boldv_2\}\) where \(\boldv_1\) points along \(\ell\) and \(\boldv_2\) is orthogonal to \(\boldv_1\text{.}\) (Both vectors will be expressed in terms of \(a\) and \(b\text{.}\)) Compute \([T]_{B'}\text{.}\)
Let \(B\) be the standard basis of \(\R^2\text{.}\) Use Theorem 4.3.13 to compute \(A=[T]_{B}\text{.}\)
How do we know that \(A\) is the standard matrix of \(T\text{?}\)
Explain why your matrix \(A\text{,}\) expressed in terms of \(a\) and \(b\) for \(T\) agrees with the matrix formula provided in Theorem 3.2.17, which is expressed in terms of the angle \(\alpha\) that \(\ell\) makes with the \(x\)-axis.