Matrix Reference Manual
Proofs Section 1: Basic Theorems
Go to: Introduction, Notation, Index
| 1.1 | If
A[m#n]=F[m#r]G[r#n]
has linearly independent columns then r >= n
and we can find B[r#r] and a subset of
F's columns, D[m#r-n], such that
F = [D A]B.
To prove this, we replace the columns of F one at a time with columns taken from A and show that we can do this for all of A's columns.
|
| 1.2 | For any non-zero
A[m#n] we can find a
linearly independent subset of A's columns, F such that
A=F[m#r]G[r#n]
with G upper triangular and r<=n.
We prove this by induction on n. If n=1, set F=A and G=1. Now suppose A, F and G satisfy the theorem and consider the m#n+1 matrix A+=[A x]. If x=Fy for some y then set F+=F and G+=[G y], otherwise set F+=[F x] and G+=[G 0; 0T 1]. It is straightforward to show in both cases that A+=F+G+ that G+ is upper triangular and that r+<=n+1. We must also show that in the second case (i.e. if x!=Fy for any y) that the columns of [F x] are linearly independent. If [F x][a; b] = 0, then Fa=-bx. We must have b=0 since otherwise x =F(-a/b) which contradicts the assumption that x != Fy. Hence Fa = 0 which implies that a=0 since the columns of F are linearly independent. Thus the columns of [F x] are also linearly independent. |
| 1.3 | For any matrix
A[m#n] we have
rank(A) <= m and
rank(A) <= n.
|
| 1.4 | rank(AB) <= rank(A) and
rank(AB) <= rank(B)
|
| 1.5 | rank(A)=n iff
A[m#n] has linearly
independent columns.
|
| 1.6 | rank(I[n#n])=n
|
| 1.7 | If A has linearly independent columns then we can extend it to a square matrix [D[m#m-n] A[m#n]] with linearly independent columns. |
| 1.8 | If
A[m#n] has linearly independent columns, then
there exists a Q[m#n] and upper triangular
R[n#n] and
S[n#n] such that A=QR and
QHQ = RS = I. If A is real
then Q, R and S may be chosen to be real.
We prove this by induction on n. If n=1, we set Q=a/||a||, R=||a|| and S=||a||-1 where a=A. Note that ||a|| cannot equal 0 since this would violate the assumption that the columns of A are linearly independent. Now assume the theorem to be true for A[m#n] with Q, R and S defined as in the theorem. Given an m#(n+1) matrix A+ = [A x] with linearly independent columns, we define y = x - QQHx and note that y cannot equal 0 since we would then have x = QQHx = ASQHx contradicting the assumption that the columns of A+ are linearly independent. We set Q+ = [Q y/||y|| ], R+ = [R QHx; 0T ||y|| ] and S+ = [S -SQHx/||y||; 0T ||y||-1 ]. It is straightforward to show that A+, Q+, R+ and S+ satisfy the theorem by performing the appropriate multiplications. Note that if A is real then all the above calculations give real results. This recursive procedure is called Gram-Schmitt orthogonalization but is not recommended because of its poor numerical properties. |
| 1.9 | For any
A[m#n] there exists a
Q[m#m] and upper triangular
R[m#n] such that A=QR and
QHQ = I. If A is real, then
Q and R may be taken as real.
We now have A = FG = [F D] [I; 0] G = Q U [I; 0] G = QR where R = U [I; 0] G is the product of three upper triangular matrices and is therefore upper triangular itself. If A is real then all of thee operations preserve realness. If m>=n, the lower m-n rows of R are all zero and so the product A=QR is unaffected if we delete the last n-r columns of Q and the last n-r rows of R. |
| 1.10 | The diagonal elements of a skew-symmetric matrix are all 0. [Provided that the underlying field does not have characteristic 2]
|
| 1.11 | The rank of a skew-symmetric matrix is even. [Provided that the underlying field does not have characteristic 2]
|
| 1.12 | If D is diagonal then AD =
DA iff ai,j=0 whenever di,i !=
dj,j.
|
| 1.13 | If D =
DIAG(c1I1,
c2I2, ...,
cMIM) where the
ck are distinct scalars and the
Ij are identity matrices, then AD = DA
iff A = DIAG(A1, A2, ...,
AM) where each Ak is the same
size as the corresponding Ik.
|
| 1.14 | A=UDVH=LDMH
are alternative singular value decompositions of A iff
UHL = DIAG(Q1,
Q2, ..., Qk, R) and
VHM = DIAG(Q1,
Q2, ..., Qk, S) where
Q1, Q2, ..., Qk
are unitary matrices whose sizes are given by the multiplicities of the
corresponding distinct non-zero singular values and R and S are
unitary matrices whose size equals the number of zero singular values.
[R.11]
|
| 1.15 | If D is diagonal then
XDXT = sumi(di
× xixiT) and
XDXH = sumi(di
×
xixiH)
|
| 1.16 | If D is diagonal then
tr(XDXT) = sumi(di
× xiTxi) and
tr(XDXH) = sumi(di
×
xiHxi) =
sumi(di ×
|xi|2)
|
| 1.17 | tr(AB) = tr(BA)
|
| 1.18 | tr([A
B]T [C D]) =
tr(ATC) +
tr(BTD)
|
| 1.19 | The pseudoinverse is
unique
|