Matrix Representation Theorem
We noted that multiplication by an m x n matrix defines a transformation from 𝔽n into 𝔽m. Matrix transformations are linear transformations, since the properties in Definition 2.1.1 are simply restatements of the linearity properties of matrix multiplication. The matrix representation theorem says that, conversely, every linear transformation from 𝔽n into 𝔽m is a matrix transformation.
Just as every vector in a finite-dimensional vector space can be associated with a vector in 𝔽n, every linear transformation between finite-dimensional vector spaces can be associated with a matrix in Mm,n(𝔽):
L: 𝔽n ⟶ 𝔽m
L: x ⟼ Ax
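As an informal check, the linearity of a matrix transformation can be verified numerically. The sketch below uses an arbitrary 2 x 3 matrix A (chosen for illustration, not taken from the text) and checks the two properties of Definition 2.1.1:

```python
import numpy as np

# An arbitrary 2 x 3 matrix: L(x) = Ax maps F^3 into F^2 (here F = R).
A = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

x = np.array([1.0, -1.0, 2.0])
y = np.array([0.5, 3.0, -2.0])
c = 7.0

# Additivity: L(x + y) = L(x) + L(y)
assert np.allclose(A @ (x + y), A @ x + A @ y)
# Homogeneity: L(c x) = c L(x)
assert np.allclose(A @ (c * x), c * (A @ x))
```

Both assertions pass because matrix multiplication distributes over vector addition and commutes with scalar multiplication.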
Theorem 3.3.1. (Matrix Representation Theorem) Let V be an n-dimensional vector space over 𝔽 and let W be an m-dimensional vector space over 𝔽. Let B = {u1, u2, ..., un} and B' = {w1, w2, ..., wm} be ordered bases for V and W respectively, and let T: V ⟶ W be any linear transformation. Then there is a unique m x n matrix A such that [T(x)]B' = A[x]B for all x ∈ V, and the map J: L(V,W) ⟶ Mm,n(𝔽) given by J(T) = A is a natural isomorphism.
Proof. Let x ∈ V. Then x = x1 u1 + x2 u2 + ... + xn un, where uj is the j-th element of the basis B of V.
Applying the additivity and homogeneity properties of T, we obtain
T(x) = x1 T(u1) + x2 T(u2) + ...+ xn T(un)
The vectors T(uj) ∈ W can be written as linear combinations of the vectors of the basis B' of W:
T(u1) = A11 w1 + A21 w2 + ... + Am1 wm
T(u2) = A12 w1 + A22 w2 + ... + Am2 wm
...
T(un) = A1n w1 + A2n w2 + ... + Amn wm
Clearly these mn scalars Aij, 1 ≤ i ≤ m, 1 ≤ j ≤ n, determine T completely.
The m x n matrix A = (Aij), whose j-th column is the coordinate matrix of T(uj) relative to the ordered basis B', is called the matrix of T relative to the pair of ordered bases B and B'. It is denoted by [T]B,B'. Thus

[T]B,B' = ( A11 A12 ... A1n )
          ( A21 A22 ... A2n )
          ( ...             )
          ( Am1 Am2 ... Amn )
Then we can express the linear transformation in coordinates as [T(x)]B' = [T]B,B' [x]B = A[x]B, since

T(x) = x1 T(u1) + x2 T(u2) + ... + xn T(un)
     = (A11 x1 + A12 x2 + ... + A1n xn) w1 + ... + (Am1 x1 + Am2 x2 + ... + Amn xn) wm,

from which it is evident that the scalar components of T(x) with respect to the basis B' are given by

(T(x))i = Ai1 x1 + Ai2 x2 + ... + Ain xn,   1 ≤ i ≤ m,

or in matrix form as

[T(x)]B' = A [x]B.
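This column-by-column construction of the matrix of a linear transformation can be sketched numerically. The map T below is an illustrative example chosen for this sketch (not from the text), with the standard bases of ℝ2 and ℝ3 playing the roles of B and B', so that coordinate vectors coincide with the vectors themselves:

```python
import numpy as np

# Illustrative linear map T: R^2 -> R^3 (chosen for this sketch).
def T(v):
    x, y = v
    return np.array([x, x + y, 2.0 * y])

# Standard ordered basis B of R^2; with standard bases, [v]_B = v.
basis_B = [np.array([1.0, 0.0]), np.array([0.0, 1.0])]

# Build A column by column: the j-th column is [T(u_j)]_{B'}.
A = np.column_stack([T(u) for u in basis_B])

# Check [T(x)]_{B'} = A [x]_B for a sample vector.
v = np.array([3.0, -1.0])
assert np.allclose(T(v), A @ v)
```

The matrix A built here is 3 x 2, matching the dimensions m = 3 and n = 2 of the codomain and domain.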
Conversely, suppose that A = (Aij) is any given m x n matrix. Define T: V ⟶ W by

T(uj) = A1j w1 + A2j w2 + ... + Amj wm,   1 ≤ j ≤ n,

extended linearly to all of V.
Then T is a linear transformation whose matrix relative to B and B' is exactly A, so J is surjective. Moreover, J respects the operations of Definition 2.1.1:

Let a ∈ 𝔽. Then

[(aT)(x)]B' = a[T(x)]B' = a[T]B,B' [x]B,

hence J(aT) = aJ(T).

Let S be any other linear transformation from V to W. Then

[(T + S)(x)]B' = ([T]B,B' + [S]B,B') [x]B,

hence J(T + S) = J(T) + J(S). Since the matrix of T is uniquely determined by the values T(uj), J is also injective, and therefore an isomorphism. This completely proves the theorem. □
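The linearity of J can likewise be checked numerically. In the sketch below, two hypothetical linear maps T, S: ℝ3 ⟶ ℝ2 are represented directly by illustrative matrices J(T) and J(S) relative to the standard bases:

```python
import numpy as np

# Matrices of two hypothetical linear maps T, S: R^3 -> R^2
# relative to the standard bases (illustrative values).
J_T = np.array([[1.0, 2.0, 0.0],
                [0.0, 1.0, 3.0]])
J_S = np.array([[2.0, 0.0, 1.0],
                [1.0, 1.0, 0.0]])
a = 5.0
x = np.array([1.0, -2.0, 4.0])

# J(aT) = a J(T): scaling the map scales its matrix.
assert np.allclose((a * J_T) @ x, a * (J_T @ x))
# J(T + S) = J(T) + J(S): adding the maps adds their matrices.
assert np.allclose((J_T + J_S) @ x, J_T @ x + J_S @ x)
```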
Example 3.3.2. Consider the linear transformation T: ℝ3 ⟶ ℝ2 given by
T(x,y,z) = (x + y + z, x − y)
Let B = {(1,0,0),(0,1,0),(0,0,1)} and B' = {(1,0), (0,1)} be the standard bases of ℝ3 and ℝ2 respectively. Then
T(1,0,0) = (1,1) = 1(1,0) + 1(0,1)
T(0,1,0) = (1,−1) = 1(1,0) − 1(0,1)
T(0,0,1) = (1,0) = 1(1,0) + 0(0,1)
Hence [T(1,0,0)]B' = (1 1)T, [T(0,1,0)]B' = (1 −1)T, [T(0,0,1)]B' = (1 0)T. Thus the matrix of T relative to B and B' is

[T]B,B' = ( 1  1  1 )
          ( 1 −1  0 )
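The example can be checked numerically: the matrix relative to the standard bases should reproduce T on any input vector.

```python
import numpy as np

# Matrix of T(x, y, z) = (x + y + z, x - y) relative to the standard bases.
A = np.array([[1.0,  1.0, 1.0],
              [1.0, -1.0, 0.0]])

def T(v):
    x, y, z = v
    return np.array([x + y + z, x - y])

v = np.array([2.0, 3.0, 5.0])
assert np.allclose(A @ v, T(v))   # both equal (10., -1.)
```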