Skip to main content

Section 4.5 Determinants

As invertibility plays a central role in this chapter, we need a criterion that tells us when a matrix is invertible. We already know that a square matrix is invertible if and only if it is row equivalent to the identity matrix. In this section, we will develop a second, numerical criterion that tells us when a square matrix is invertible.
To begin, let’s consider a \(2\by2\) matrix \(A\) whose columns are vectors \(\vvec_1\) and \(\vvec_2\text{.}\) We have frequently drawn the vectors and studied the linear combinations they form using a figure such as Figure 4.5.1.
Figure 4.5.1. Linear combinations of two vectors \(\vvec_1\) and \(\vvec_2\) form a collection of congruent parallelograms.
Notice how the linear combinations form a set of congruent parallelograms in the plane. In this section, we will use the area of these parallelograms to define a numerical quantity called the determinant that tells us whether the matrix \(A\) is invertible.
To recall, the area of parallelogram is found by multiplying the length of one side by the perpendicular distance to its parallel side. Using the notation in the figure, the area of the parallelogram is \(bh\text{.}\)

Preview Activity 4.5.1.

We will explore the area formula in this preview activity.
  1. Find the area of the following parallelograms.
    1.
    2.
    3.
    4.
    5.
  2. Explain why the area of the parallelogram formed by the vectors \(\vvec\) and \(\wvec_1\) is the same as that formed by \(\vvec\) and \(\wvec_2\text{.}\)
Solution.
  1. We find the following areas.
    1. A \(1\by1\) square has area 1.
    2. A \(2\by3\) rectangle has area 6.
    3. The square has side length \(\sqrt{2}\) giving an area of 2.
    4. If we consider the horizontal length as the base, we see that \(b=h=2\) so that the area is 4.
    5. In the same way, we can consider both the base and height to be 2 so that the area is 4.
  2. If we consider the base to be the length of \(\vvec\text{,}\) then the height, which is the perpendicular distance to its parallel side, is the same in both parallelograms.

Subsection 4.5.1 Determinants of \(2\by2\) matrices

Determinants are defined for any square matrix, but we will begin by defining the determinant of a \(2\by2\) matrix \(A = \left[\begin{array}{rr} \vvec_1 \amp \vvec_2 \end{array}\right]\text{.}\) First, however, we need to define the orientation of an ordered pair of vectors. As shown in Figure 4.5.2, an ordered pair of vectors \(\vvec_1\) and \(\vvec_2\) is called positively oriented if the angle, measured in the counterclockwise direction, from \(\vvec_1\) to \(\vvec_2\) is less than \(180^\circ\text{;}\) we say the pair is negatively oriented if it is more than \(180^\circ\text{.}\)
Figure 4.5.2. The vectors on the left are positively oriented while the ones on the right are negatively oriented.

Definition 4.5.3.

Suppose a \(2\by2\) matrix \(A\) has columns \(\vvec_1\) and \(\vvec_2\text{.}\) If the pair of vectors is positively oriented, then the determinant of \(A\text{,}\) denoted \(\det(A)\text{,}\) is the area of the parallelogram formed by \(\vvec_1\) and \(\vvec_2\text{.}\) If the pair is negatively oriented, then \(\det(A)\) is minus the area of the parallelogram.

Example 4.5.4.

Consider the determinant of the identity matrix
\begin{equation*} I = \left[\begin{array}{rr} 1\amp 0 \\ 0 \amp 1 \\ \end{array}\right] = \left[\begin{array}{rr} \evec_1 \amp \evec_2 \\ \end{array}\right]\text{.} \end{equation*}
As seen on the left of Figure 4.5.5, the vectors \(\vvec_1 = \evec_1\) and \(\vvec_2=\evec_2\) form a positively oriented pair. Since the parallelogram they form is a \(1\by1\) square, we have \(\det(I) = 1.\)
Figure 4.5.5. The determinant \(\det(I) = 1\text{,}\) as seen on the left. On the right, we see that \(\det(A) = -2\) where \(A\) is the matrix whose columns are shown.
Now consider the matrix
\begin{equation*} A = \left[\begin{array}{rr} -2\amp 0 \\ 0 \amp 1 \\ \end{array}\right] = \left[\begin{array}{rr} \vvec_1 \amp \vvec_2 \\ \end{array}\right]\text{.} \end{equation*}
As seen on the right of Figure 4.5.5, the vectors \(\vvec_1\) and \(\vvec_2\) form a negatively oriented pair. The parallelogram they define is a \(2\by1\) rectangle so we have \(\det(A) = -2\text{.}\)

Activity 4.5.2.

In this activity, we will find the determinant of some simple \(2\by2\) matrices and discover some important properties of determinants.
Instructions.
The sliders in the diagram below allow you to choose a matrix \(A=\begin{bmatrix}a \amp b \\ c \amp d \\ \end{bmatrix}\text{.}\) The two vectors representing the columns of the matrix, along with the parallelograms they define, are shown below.
Figure 4.5.6. The geometric meaning of the determinant of a matrix.
  1. Use the diagram to find the determinant of the matrix \(\left[\begin{array}{rr} -\frac12 \amp 0 \\ 0 \amp 2 \end{array}\right]\text{.}\) Along with Example 4.5.4, what does this lead you to believe is generally true about the determinant of a diagonal matrix?
  2. Use the diagram to find the determinant of the matrix \(\left[\begin{array}{rr} 0 \amp 1 \\ 1 \amp 0 \\ \end{array}\right]\text{.}\) What is the geometric effect of the matrix transformation defined by this matrix?
  3. Use the diagram to find the determinant of the matrix \(\left[\begin{array}{rr} 2 \amp 1 \\ 0 \amp 1 \\ \end{array}\right]\text{.}\) More generally, what do you notice about the determinant of any matrix of the form \(\left[\begin{array}{rr} 2 \amp k \\ 0 \amp 1 \\ \end{array}\right]\text{?}\) What does this say about the determinant of an upper triangular matrix?
  4. Use the diagram to find the determinant of any matrix of the form \(\left[\begin{array}{rr} 2 \amp 0 \\ k \amp 1 \\ \end{array}\right]\text{.}\) What does this say about the determinant of a lower triangular matrix?
  5. Use the diagram to find the determinant of the matrix \(\left[\begin{array}{rr} 1 \amp -1 \\ -2 \amp 2 \\ \end{array}\right]\text{.}\) In general, what is the determinant of a matrix whose columns are linearly dependent?
  6. Consider the matrices
    \begin{equation*} A = \left[\begin{array}{rr} 2 \amp 1 \\ 2 \amp -1 \\ \end{array}\right],~~~ B = \left[\begin{array}{rr} 1 \amp 0 \\ 0 \amp 2 \\ \end{array}\right]\text{.} \end{equation*}
    Use the diagram to find the determinants of \(A\text{,}\) \(B\text{,}\) and \(AB\text{.}\) What does this suggest is generally true about the relationship of \(\det(AB)\) to \(\det(A)\) and \(\det(B)\text{?}\)
Solution.
  1. The determinant is \(-1\) because the vectors are negatively oriented and the rectangle has sides of length \(\frac12\) and \(2\text{.}\) The determinant of a diagonal matrix seems to be the product of the diagonal entries.
  2. The matrix transformation is a reflection over the line \(y=x\) and we see that the determinant is \(-1\text{.}\)
  3. The determinant will continue to be \(2\) for any value of \(k\text{.}\) This illustrates the fact that the determinant of an upper triangular matrix equals the product of its diagonal entries.
  4. The same reasoning tells us that this determinant is \(2\) and, in fact, the determinant of a lower triangular matrix equals the product of its diagonal entries.
  5. The determinant of this matrix is \(0\) because the parallelogram formed by the vector has no area. This suggests that the determinant of a matrix whose columns are linearly dependent is \(0\text{.}\)
  6. We find that \(\det(A) = -4\text{,}\) \(\det(B) = 2\text{,}\) and \(\det(AB) = -8\text{.}\) This suggests that \(\det(AB) = \det(A) \det(B)\text{.}\)
Activity 4.5.2 illustrates some important properties of determinants that are true more generally, including for larger matrices.
  • If \(A\) is a diagonal matrix, then \(\det(A)\) equals the product of the entries on the diagonal. A \(2 \by 2\) diagonal matrix represents a transformation that scales horizontally and vertically by the scaling factors on the diagonal. The area of the transformed unit square is the product of the diagonal entries. If exactly one of the two is negative, the orientation is reversed and the determinant will be negative. We saw an example of this in Figure 4.5.5.
  • If \(A\) is a triangular matrix, then \(\det(A)\) equals the product of the entries on the diagonal. A \(2 \by 2\) triangular matrix represents a shear transformation. The diagonal entries represent the horizontal and vertical scaling. The non-zero off-diagonal entry represents the amount of shear. This distorts a rectangle into a parallelogram, but leaves the area unchanged. For example,
    \begin{equation*} \det\left[\begin{array}{rr} 2 \amp 2 \\ 0 \amp 3 \\ \end{array}\right] = 2\cdot 3 = 6\text{,} \end{equation*}
    since the two parallelograms in Figure 4.5.7 have equal area.
    Figure 4.5.7. The determinant of a triangular matrix equals the product of its diagonal entries.
  • We also saw that
    \begin{equation*} \det \left[\begin{array}{rr} 0 \amp 1 \\ 1 \amp 0 \\ \end{array}\right] = -1 \end{equation*}
    because the columns form a negatively oriented pair. You may remember from Section 4.2 that a matrix such as this is obtained by interchanging two rows of the identity matrix.
  • The determinant satisfies a multiplicative property, which says that
    \begin{equation*} \det(AB) = \det(A)\det(B). \end{equation*}
    Rather than simply thinking of the determinant as the area of a parallelogram, we may also think of it as a factor by which areas are scaled under the matrix transformation defined by the matrix. Applying the matrix transformation defined by \(B\) will scale area by \(\det(B)\text{.}\) If we then compose \(B\) with the matrix transformation defined by \(A\text{,}\) area will scale a second time by the factor \(\det(A)\text{.}\) The net effect is that the matrix transformation defined by \(AB\) scales area by \(\det(A)\det(B)\) so that \(\det(AB)=\det(A)\det(B)\text{.}\)
As we will see, these properties hold for larger matrices as well.

Subsection 4.5.2 Determinants of larger matrices

We can define determinants for \(n\by n\) matrices by measuring the "volume" of a "box" (technically a parallelepiped) defined by the columns of the matrix, even if this box resides in \(\real^n\) for some very large \(n\text{.}\)
For example, the columns of a \(3\by3\) matrix \(A\) will form a parallelpiped, like the one shown here, and there is a means by which we can classify sets of such vectors as either positively or negatively oriented. Therefore, we can define the determinant \(\det(A)\) in terms of the volume of the parallelpiped, but we will not worry about the details here.
Soon we will learn general methods for computing the determinant of a matrix, but some determinants are easy to calculate directly from the definition.
  1. The identity transformation doesn’t change anything, so the scaling factor is 1.
  2. If row \(r\) of \(A\) consists entirely of 0s, then the \(r\)th component of \(A \xvec\) will be 0. This means that the transformation associated with \(A\) collapses into a smaller space, and so the volume of any transformed shape will be 0.
Several properties established for \(2\by2\) matrices hold for larger matrices as well. In particular the following proposition holds by the same argument given for \(2 \by 2\) matrices.
Proposition 4.5.9 allows us to establish a relationship between the determinant of an invertible matrix and the determinant of its inverse. For an invertible matrix \(A\) we know that \(\det(A) \det(A^{-1}) = \det(A A^{-1}) = \det(I) = 1\text{.}\) This leads to the following proposition.
Perhaps the most important property of determinants also appeared in Activity 4.5.2. We saw that when the columns of the matrix \(A\) are linearly dependent, the parallelogram formed by those vectors folds down onto a line. For instance, if \(A=\begin{bmatrix} 1 \amp 2 \\ -1 \amp -2 \\ \end{bmatrix}\text{,}\) then the resulting parallelogram, as shown in Figure 4.5.11, has zero area, which means that \(\det(A)=0\text{.}\)
Figure 4.5.11. When the columns of \(A\) are linearly dependent, we find that \(\det(A) = 0\text{.}\)
The condition that the columns of \(A\) are linearly dependent is equivalent same to the condition that \(A\) is not invertible. This leads us to believe that \(A\) is not invertible if and only if its determinant is zero. The following proposition expresses this thought.
To understand this proposition more fully, let’s remember that the matrix \(A\) is invertible if and only if it is row equivalent to the identity matrix \(I\text{.}\) We will therefore consider how the determinant changes when we perform row operations on a matrix. Along the way, we will discover an effective means to compute the determinant.

Subsubsection 4.5.2.1 Determinants of elementary matrices

In Section 4.2, we saw how to describe the three row operations, scaling, interchange, and replacement, using matrix multiplication. Now we would like to compute the determinants of each of these matrices.
  • Scaling.
    Scalings are performed by multiplying a matrix \(A\) by a diagonal matrix, such as
    \begin{equation*} S = \left[\begin{array}{rrr} 1 \amp 0 \amp 0 \\ 0 \amp 3 \amp 0 \\ 0 \amp 0 \amp 1 \\ \end{array}\right], \end{equation*}
    which has the effect of multiplying the second row of \(A\) by \(3\) to obtain \(A'\text{.}\) The associated transformation scales by a factor of three in one dimension, so it scales the volumne by 3 as well. Thus \(\det(S) = 3\text{.}\) More generally, if we scale any row of \(A\) \(k\text{,}\) we have
    \begin{equation*} \det(S) = k\text{.} \end{equation*}
    Note that when \(k\) is negative, the orientation is reversed by scaling.
  • Interchange.
    Interchanges are performed by matrices \(P\) which result from interchanging two rows of the identity matrix. For example,
    \begin{equation*} P = \left[\begin{array}{rrr} 0 \amp 1 \amp 0 \\ 1 \amp 0 \amp 0 \\ 0 \amp 0 \amp 1 \\ \end{array}\right], \end{equation*}
    interchanges the first and second rows of \(I_3\text{.}\) Note that for any such matrix \(P\text{,}\) \(P P = I\text{,}\) so \(\det(P) \det(P) = 1\text{.}\) This means that \(\det(P)\) must either be \(1\) or \(-1\text{.}\) As a transformation, an interchange "swaps" two axes. This reverses the orientation, so
    \begin{equation*} \det(P) = -1 \end{equation*}
    for any interchange matrix \(P\text{.}\)
  • Replacement.
    Row replacement operations are performed by matrices such as
    \begin{equation*} R = \left[\begin{array}{rrr} 1 \amp 0 \amp 0 \\ 0 \amp 1 \amp 0 \\ -2 \amp 0 \amp 1 \\ \end{array}\right], \end{equation*}
    which multiplies the first row by \(-2\) and adds the result to the third row. Each such matrix has exactly one off-diagonal entry that is non-zero. This represents a shear along one axis of \(\real^n\text{,}\) so
    \begin{equation*} \det(R) = 1 \end{equation*}
    for any replacement matrix \(R\text{.}\)

Subsubsection 4.5.2.2 Using RREF to compute determinants

Proposition 4.5.13 allows us to compute the determinant of any matrix. If \(A\) is invertible, then \(A\sim I\text{.}\) But this means that
\begin{equation*} E_k E_{k-1} \cdots E_2 E_1 A = I \end{equation*}
for some elementary matrices \(E_1, E_2, \dots, E_k\text{.}\) By Proposition 4.5.9 this implies
\begin{equation*} \det(E_k) \det(E_{k-1}) \cdots \det(E_2) \det(E_1) \det(A) = \det(I) = 1\text{,} \end{equation*}
so
\begin{equation*} \det(A) = \frac{1}{\det(E_k) \det(E_{k-1}) \cdots \det(E_2) \det(E_1)}\text{.} \end{equation*}
Recall that \(\det(R) = 1\) for an replacement matrices, so most of these terms are 1. Also \(\det(P) = -1\text{,}\) so we only need to know if we have an even or odd number of interchanges to know if the determinant is positive or negative. The abosulte value of the determinant of \(A\) is the reciprocal of the product of the scaling factors used in any scaling operations involved in computing the RREF.
If \(A\) is not invertible, then the RREF of \(A\) will have a row of zeros, so by Proposition 4.5.8,
\begin{equation*} \det(E_k) \det(E_{k-1}) \cdots \det(E_2) \det(E_1) \det(A) = 0\text{,} \end{equation*}
from which we see that \(\det(A) = 0\text{.}\)
So with just a little bookkeeping, we can compute the determinant of a matrix while we are calculting its reduced row echelon form.
Elemenatary matrices also provide us a tool for demonstrating Proposition 4.5.15 and Proposition 4.5.12.
If \(A\) is an \(n \by n\) triangular matrix, then we can reduce \(A\) to an \(n \by n\) diagonal matrix \(D\) using only replacement operations. Thus \(\det(A) = \det(D)\text{.}\) Also, the diagonal entries of \(D\) are the same as the diagonal entries of \(A\text{.}\)
A square diagonal matrix \(D\) can be reduced to \(I\) by a sequence of scaling operations. If the diagonal entires of \(D\) are \(d_1, d_2, \dots d_n\text{,}\) then we must scale by \(1/d_1, 1/d_2, \dots 1/d_n\text{,}\) so
\begin{equation*} \frac{1}{d_1} \cdot \frac{1}{d_2} \dots \frac{1}{d_n} \det(D) = \det(I) = 1 \end{equation*}
and
\begin{equation*} \det(D) = d_1 d_2 \cdots d_n\text{.} \end{equation*}
If \(A\) is invertible, then \(A \sim I\text{.}\) So \(\det(A)\) is a nonzero multiple of \(\det(I) = 1\text{.}\) Thus \(\det(A) \neq 0\text{.}\)
If \(A\) is not invertible, then \(A \sim B\) for some matrix \(B\) that contains a row of 0s. This means \(\det(A)\) is a nonzero multiple of \(\det(B) = 0\text{.}\) Thus \(\det(A) = 0\text{.}\)
Activity 4.5.3.
We will investigate the connection between the determinant of a matrix and its invertibility using Gaussian elimination.
  1. Consider the two upper triangular matrices
    \begin{equation*} U_1 = \left[\begin{array}{rrr} 1 \amp -1 \amp 2 \\ 0 \amp 2 \amp 4 \\ 0 \amp 0 \amp -2 \\ \end{array}\right],~~~ U_2 = \left[\begin{array}{rrr} 1 \amp -1 \amp 2 \\ 0 \amp 2 \amp 4 \\ 0 \amp 0 \amp 0 \\ \end{array}\right]. \end{equation*}
    Remembering Proposition 4.2.2, which of the matrices \(U_1\) and \(U_2\) are invertible? What are the determinants \(\det(U_1)\) and \(\det(U_2)\text{?}\)
  2. Explain why an upper triangular matrix is invertible if and only if its determinant is not zero.
  3. Let’s now consider the matrix
    \begin{equation*} A = \left[\begin{array}{rrr} 1 \amp -1 \amp 2 \\ -2 \amp 2 \amp -6 \\ 3 \amp -1 \amp 10 \\ \end{array}\right] \end{equation*}
    and begin the Gaussian elimination process with a row replacement operation
    \begin{equation*} A = \left[\begin{array}{rrr} 1 \amp -1 \amp 2 \\ -2 \amp 2 \amp -6 \\ 3 \amp -1 \amp 10 \\ \end{array}\right] \sim \left[\begin{array}{rrr} 1 \amp -1 \amp 2 \\ 0 \amp 0 \amp -2 \\ 3 \amp -1 \amp 10 \\ \end{array}\right] = A_1\text{.} \end{equation*}
    What is the relationship between \(\det(A)\) and \(\det(A_1)\text{?}\)
  4. Next we perform another row replacement operation:
    \begin{equation*} A_1= \left[\begin{array}{rrr} 1 \amp -1 \amp 2 \\ 0 \amp 0 \amp -2 \\ 3 \amp -1 \amp 10 \\ \end{array}\right] \sim \left[\begin{array}{rrr} 1 \amp -1 \amp 2 \\ 0 \amp 0 \amp -2 \\ 0 \amp 2 \amp 4 \\ \end{array}\right] = A_2\text{.} \end{equation*}
    What is the relationship between \(\det(A)\) and \(\det(A_2)\text{?}\)
  5. Finally, we perform an interchange:
    \begin{equation*} A_2 = \left[\begin{array}{rrr} 1 \amp -1 \amp 2 \\ 0 \amp 0 \amp -2 \\ 0 \amp 2 \amp 4 \\ \end{array}\right] \sim \left[\begin{array}{rrr} 1 \amp -1 \amp 2 \\ 0 \amp 2 \amp 4 \\ 0 \amp 0 \amp -2 \\ \end{array}\right] = U \end{equation*}
    to arrive at an upper triangular matrix \(U\text{.}\) What is the relationship between \(\det(A)\) and \(\det(U)\text{?}\)
  6. Since \(U\) is upper triangular, we can compute its determinant, which allows us to find \(\det(A)\text{.}\) What is \(\det(A)\text{?}\) Is \(A\) invertible?
  7. Now consider the matrix
    \begin{equation*} A = \left[\begin{array}{rrr} 1 \amp -1 \amp 3 \\ 0 \amp 2 \amp -2 \\ 2 \amp 1 \amp 3 \\ \end{array}\right]. \end{equation*}
    Perform a sequence of row operations to find an upper triangular matrix \(U\) that is row equivalent to \(A\text{.}\) Use this to determine \(\det(A)\) and whether \(A\) invertible?
  8. Suppose we apply a sequence of row operations on a matrix \(A\) to obtain \(A'\text{.}\) Explain why \(\det(A) \neq 0\) if and only if \(\det(A') \neq 0\text{.}\)
  9. Explain why an \(n\by n\) matrix \(A\) is invertible if and only if \(\det(A) \neq 0\text{.}\)
Solution.
  1. The matrix \(U_1\) is invertible because we see there is a pivot position in every row and column. The matrix \(U_2\text{,}\) however, is not invertible because there is not a pivot position in the third row.
  2. The determinant of an upper triangular matrix equals the product of its diagonal entries. Consequently, if the determinant of an upper triangular matrix is not zero, then each of its diagonal entries must be nonzero. In this case, there is a pivot position in every row and every column so that the matrix is invertible.
  3. Row replacement operations do not change the determinant so \(\det(A) = \det(A_1)\text{.}\)
  4. In the same way, \(\det(A) = \det(A_2)\text{.}\)
  5. Interchanges change the sign of the determinant so \(\det(A) = -\det(U)\text{.}\)
  6. The determinant \(\det(U) = -4\) since it is the product of the diagonal entries of \(U\text{.}\) This means that \(\det(A) = 4\text{.}\) We see that \(A\) is invertible because \(U\text{,}\) which has a pivot position in every row and every column, is invertible.
  7. Beginning with a row replacement operation, we arrive at
    \begin{equation*} A_1=\left[\begin{array}{rrr} 1 \amp - 1 \amp 3 \\ 0 \amp 2 \amp -2 \\ 0 \amp -3 \amp 3 \\ \end{array}\right]\text{.} \end{equation*}
    We next scale the second row by \(\frac12\) to obtain
    \begin{equation*} A_2=\left[\begin{array}{rrr} 1 \amp - 1 \amp 3 \\ 0 \amp 1 \amp -1 \\ 0 \amp -3 \amp 3 \\ \end{array}\right]\text{.} \end{equation*}
    Another row replacement operation gives
    \begin{equation*} A_3=\left[\begin{array}{rrr} 1 \amp - 1 \amp 3 \\ 0 \amp 1 \amp -1 \\ 0 \amp 0 \amp 0 \\ \end{array}\right]\text{.} \end{equation*}
    Putting these operations together, we see that \(\det(A) = 2\det(U) = 0\text{.}\) In this case, \(A\) is not invertible because \(U\text{,}\) which has a row without a pivot position, is not invertible.
  8. Performing one of the three row operations either leaves the determinant unchanged (row replacement), changes its sign (interchange), or multiplies it by a nonzero number (scaling). Therefore, if we begin with a matrix whose determinant is not zero, the determinant remains not zero after any row operation is applied.
  9. If we apply a sequence of row operations to \(A\) to find a row equivalent matrix \(U\) that is upper triangular, we know that \(\det(A)\neq0\) if and only if \(\det(U) \neq 0\text{.}\) We also know that \(A\) is invertible if and only if \(U\) is invertible. Putting these facts together, we conclude that \(\det(A)\neq 0\) if and only if \(A\) is invertible.
As seen in this activity, row operations can be used to compute the determinant of a matrix. More specifically, applying the forward substitution phase of Gaussian elimination to the matrix \(A\) requires only replacement operations and interchanges and leads us to an upper triangular matrix \(U\) so that \(A\sim U\) and \(\det(A) = \pm \det(U)\text{.}\) And \(\det(U)\) is easily calculated using Proposition 4.5.15. So we see that we can calculate determinates without completing the RREF algorithm. If suffices to do the forward substitutions phase.

Subsubsection 4.5.2.3 Cofactor expansions

We now have a technique for computing the determinant of a matrix using row operations. There is another way to compute determinants, using what are called cofactor expansions, that will be important for us in the next chapter. We will describe this method here.
To begin, let’s show how to compute the determinant of a \(2\by2\) matrix.
Determinant of a \(2 \by 2 \) matrix.
Let \(A = \left[\begin{array}{rr} a \amp b \\ c \amp d \\ \end{array}\right] \text{.}\) We can create an equivalent triangular matrix using \(-\frac{c}{a} R_1 + R_2 \to R_2\text{.}\) This gives us
\begin{equation*} \det A = \det \left[\begin{array}{rr} a \amp b \\ c \amp d \\ \end{array}\right] = \det \left[\begin{array}{rr} a \amp b \\ 0 \amp -\frac{c}{a} b + d \\ \end{array}\right] = a (-\frac{c}{a} b + d) = ad-bc\text{.} \end{equation*}
Note that the derivation of a formula for the determinant of a \(2 \by 2 \) matrix relied on two facts: (a) replacement doesn’t change the determinant, and (b) the determinant of a triangular matrix is the product of the diagonal elements.
Now that we have a formula for the determinant of a \(2 \by 2 \) matrix in hand, we can develop the method of cofactor expansion. Using a cofactor expansion to find the determinant of a more general \(n\by n\) matrix is a little more work so we will demonstrate it with an example.
Example 4.5.16.
We illustrate how to use a cofactor expansion to find the determinant of \(A\) where
\begin{equation*} A = \left[\begin{array}{rrr} 1 \amp -1 \amp 2 \\ -2 \amp 2 \amp -6 \\ 3 \amp -1 \amp 10 \\ \end{array}\right]. \end{equation*}
To begin, we choose one row or column. It doesn’t matter which we choose because the result will be the same in any case. Here, we choose the second row
\begin{equation*} \left[\begin{array}{rrr} \lgray{1} \amp \lgray{-1} \amp \lgray{2} \\ -2 \amp 2 \amp -6 \\ \lgray{3} \amp \lgray{-1} \amp \lgray{10} \\ \end{array}\right]\text{.} \end{equation*}
The determinant will be found by creating a sum of terms, one for each entry in the row we have chosen. For each entry in the row, we form its term by multiplying
  • \((-1)^{i+j}\) where \(i\) and \(j\) are the row and column numbers, respectively, of the entry,
  • the entry itself, and
  • the determinant of the entries left over when we have crossed out the row and column containing the entry.
Since we are computing the determinant of this matrix
\begin{equation*} \left[\begin{array}{rrr} \gray{1} \amp \gray{-1} \amp \gray{2} \\ -2 \amp 2 \amp -6 \\ \gray{3} \amp \gray{-1} \amp \gray{10} \\ \end{array}\right] \end{equation*}
using the second row, the entry in the first column of this row is \(-2\text{.}\) Let’s see how to form the term from this entry.
The term itself is \(-2\text{,}\) and the matrix that is left over when we cross out the second row and first column is
\begin{equation*} \left[\begin{array}{rrr} \gray{1} \amp {-1} \amp {2} \\ \gray{-2} \amp \gray{2} \amp \gray{-6} \\ \gray{3} \amp {-1} \amp {10} \\ \end{array}\right] \end{equation*}
whose determinant is
\begin{equation*} \det\left[\begin{array}{rr} -1 \amp 2 \\ -1 \amp 10 \\ \end{array}\right] = -1(10) - 2 (-1) = -8\text{.} \end{equation*}
Since this entry is in the second row and first column, the term we construct is \((-1)^{2+1}(-2)(-8) = -16 \text{.}\)
Putting this together, we find the determinant to be
\begin{equation*} \begin{aligned} \left[\begin{array}{rrr} {1} \amp {-1} \amp {2} \\ -2 \amp {2} \amp {-6} \\ {3} \amp {-1} \amp {10} \\ \end{array}\right] {}={} \amp (-1)^{2+1}(-2)\det\left[\begin{array}{rr} -1 \amp 2 \\ -1 \amp 10 \\ \end{array}\right] \\ \amp {}+{} (-1)^{2+2}(2)\det\left[\begin{array}{rr} 1 \amp 2 \\ 3 \amp 10 \\ \end{array}\right] \\ \amp {}+{} (-1)^{2+3}(-6)\det\left[\begin{array}{rr} -1 \amp -1 \\ 3 \amp -1 \\ \end{array}\right] \\ \\ {}={} \amp (-1)(-2)(-1(10)-2(-1)) \\ \amp + (1)(2)(1(10)-2(3)) \\ \amp + (-1)(-6)((-1)(-1)-(-1)3) \\ \\ {}={} \amp -16 + 8 + 12 \\ {}={} \amp 4 \\ \end{aligned}\text{.} \end{equation*}
Notice that this agrees with the determinant that we found for this matrix using row operations in the Activity 4.5.3.
Activity 4.5.4.
We will explore cofactor expansions through some examples.
  1. Using a cofactor expansion, show that the determinant of the following matrix
    \begin{equation*} \det \left[\begin{array}{rrr} 2 \amp 0 \amp -1 \\ 3 \amp 1 \amp 2 \\ -2 \amp 4 \amp -3 \\ \end{array}\right] = -36\text{.} \end{equation*}
    Remember that you can choose any row or column to create the expansion, but the choice of a particular row or column may simplify the computation.
  2. Use a cofactor expansion to find the determinant of
    \begin{equation*} \left[\begin{array}{rrrr} -3 \amp 0 \amp 0 \amp 0 \\ 4 \amp 1 \amp 0 \amp 0 \\ -1 \amp 4 \amp -4 \amp 0\\ 0 \amp 3 \amp 2 \amp 3 \\ \end{array}\right]\text{.} \end{equation*}
    (Which row should you choose to make your work especially easy?) Explain how the cofactor expansion technique shows that the determinant of a triangular matrix is equal to the product of its diagonal entries.
  3. Use a cofactor expansion to determine whether the following vectors form a basis of \(\real^3\text{:}\)
    \begin{equation*} \threevec{2}{-1}{-2}, \threevec{1}{-1}{2}, \threevec{1}{0}{-4}\text{.} \end{equation*}
  4. NumPy or SciPy will compute the determinant of a matrix A with the command numpy.linalg.det() or scipy.linalg.det. Use Python to find the determinant of the matrix
    \begin{equation*} \left[\begin{array}{rrrr} 2 \amp 1 \amp -2 \amp -3 \\ 3 \amp 0 \amp -1 \amp -2 \\ -3 \amp 4 \amp 1 \amp 2\\ 1 \amp 3 \amp 3 \amp -1 \\ \end{array}\right]\text{.} \end{equation*}
Solution.
  1. We will using a cofactor expansion along the first row so that
    \begin{equation*} \begin{array}{rl} \det\left[\begin{array}{rrr} 2 \amp 0 \amp -1 \\ 3 \amp 1 \amp 2 \\ -2 \amp 4 \amp -3 \\ \end{array}\right] = \amp (-1)^{1+1}\cdot 2 \det\left[\begin{array}{rr} 1 \amp 2 \\ 4 \amp -3 \\ \end{array}\right] \\ \amp + (-1)^{1+3}\cdot (-1) \det\left[\begin{array}{rr} 3 \amp 1 \\ -2 \amp 4 \\ \end{array}\right] \\ = \amp 2(-11) - 14 = -36 \end{array}\text{.} \end{equation*}
  2. Expanding along the first row gives
    \begin{equation*} \begin{array}{rl} \det \left[\begin{array}{rrrr} -3 \amp 0 \amp 0 \amp 0 \\ 4 \amp 1 \amp 0 \amp 0 \\ -1 \amp 4 \amp -4 \amp 0\\ 0 \amp 3 \amp 2 \amp 3 \\ \end{array}\right] \amp = 3\det \left[\begin{array}{rrr} 1 \amp 0 \amp 0 \\ 4 \amp -4 \amp 0\\ 3 \amp 2 \amp 3 \\ \end{array}\right] \\ \amp =-3(1)\det \left[\begin{array}{rr} -4 \amp 0 \\ 2 \amp 3 \\ \end{array}\right] \\ \amp =-3(1)(-4)(3) = 36 \end{array}\text{.} \end{equation*}
  3. We form the matrix \(A\) whose columns are the three given vectors. Expanding along either the second row or third column to take advantage of the zero in the \((2,3)\) entry, we see that \(\det(A) = 0\text{,}\) which means that \(A\) is not invertible. Therefore, the vectors do not form a basis for \(\real^3\text{.}\)
  4. Python tells us that \(\det(A) = 72\text{.}\)

Subsection 4.5.3 Summary

In this section, we associated a numerical quantity, the determinant, to a square matrix and showed how it tells us whether the matrix is invertible.
  • The determinant of a matrix has a geometric interpretation. In particular, when \(n=2\text{,}\) the determinant is the signed area of the parallelogram formed by the two columns of the matrix.
  • The determinant satisfies many properties, including the facts that
    • \(\det(AB) = \det(A) \det(B)\text{,}\) and
    • the determinant of a triangular matrix is equal to the product of its diagonal entries.
  • These properties helped us compute the determinant of a matrix using row operations. This also led to the important observation that the determinant of a matrix is nonzero if and only if the matrix is invertible.
  • Finally, we learned how to compute the determinant of a matrix using cofactor expansions, which will be a valuable tool for us in the next chapter.
We have seen three ways to compute the determinant: by interpreting the determinant as a signed area or volume; by applying appropriate row operations; and by using a cofactor expansion. It’s worth spending a moment to think about the relative merits of these approaches.
The geometric definition of the determinant tells us that the determinant is measuring a natural geometric quantity, an insight that does not easily come through the other two approaches. The intuition we gain by thinking about the determinant geometrically makes it seem reasonable that the determinant should be zero for matrices that are not invertible: if the columns are linearly dependent, the vectors cannot create a positive volume.
Approaching the determinant through row operations provides an effective means of computing the determinant. In fact, this is what most computer programs do behind the scenes when they compute a determinant. This approach is also a useful theoretical tool for explaining why the determinant tells us whether a matrix is invertible.
The cofactor expansion method will be useful to us in the next chapter when we look at eigenvalues and eigenvectors. It is not, however, a practical way to compute a determinant. To see why, consider the fact that the determinant of a \(2\by2\) matrix, written as \(ad-bc\text{,}\) requires us to compute two terms, \(ad\) and \(bc\text{.}\) To compute the determinant of a \(3\by3\) matrix, we need to compute three \(2\by2\) determinants, which involves \(3\cdot 2 = 6\) terms. For a \(4\by4\) matrix, we need to compute four \(3\by3\) determinants, which produces \(4\cdot3\cdot2 = 24\) terms. Continuing in this way, we see that the cofactor expansion of a \(10\by10\) matrix would involve \(10\cdot9\cdot8\ldots3\cdot2 = 10! = 3628800\) terms.
By contrast, we have seen that the number of steps required to perform Gaussian elimination on an \(n\by n\) matrix is proportional to \(n^3\text{.}\) When \(n=10\text{,}\) we have \(n^3 = 1000\text{,}\) which points to the fact that finding the determinant using Gaussian elimination is considerably less work.

Exercises 4.5.4 Exercises

1.

Consider the matrices
\begin{equation*} A = \left[\begin{array}{rrr} 2 \amp 1 \amp 0 \\ -4 \amp -4 \amp 3 \\ 2 \amp 1 \amp -3 \\ \end{array}\right], \qquad B = \left[\begin{array}{rrrr} -2 \amp 3 \amp 0 \amp 0 \\ 0 \amp 4 \amp 2 \amp 0 \\ 4 \amp -6 \amp -1 \amp 2 \\ 0 \amp 4 \amp 2 \amp -3 \\ \end{array}\right]\text{.} \end{equation*}
  1. Find the determinants of \(A\) and \(B\) using row operations.
  2. Now find the determinants of \(A\) and \(B\) using cofactor expansions to verify your results

2.

This exercise concerns rotations and reflections in \(\real^2\text{.}\)
  1. Suppose that \(A\) is the matrix that performs a counterclockwise rotation in \(\real^2\text{.}\) Draw a typical picture of the vectors that form the columns of \(A\) and use the geometric definition of the determinant to determine \(\det(A)\text{.}\)
  2. Suppose that \(B\) is the matrix that performs a reflection in a line passing through the origin. Draw a typical picture of the columns of \(B\) and use the geometric definition of the determinant to determine \(\det(B)\text{.}\)
  3. As we saw in Section 3.4, the matrices have the form
    \begin{equation*} A = \left[\begin{array}{rr} \cos \theta \amp -\sin\theta \\ \sin \theta \amp \cos \theta \\ \end{array}\right], \qquad B = \left[\begin{array}{rr} \cos(2\theta) \amp \sin(2\theta) \\ \sin(2\theta) \amp -\cos(2\theta) \\ \end{array}\right]\text{.} \end{equation*}
    Compute the determinants of \(A\) and \(B\) and verify that they agree with what you found in the earlier parts of this exercise.

3.

In the next chapter, we will say that matrices \(A\) and \(B\) are similar if there is a matrix \(P\) such that \(A= PBP^{-1}\text{.}\)
  1. Suppose that \(A\) and \(B\) are matrices and that there is a matrix \(P\) such that \(A=PBP^{-1}\text{.}\) Explain why \(\det(A) = \det(B)\text{.}\)
  2. Suppose that \(A\) is a \(3\by3\) matrix and that there is a matrix \(P\) such that
    \begin{equation*} A = P \left[\begin{array}{rrr} 2 \amp 0 \amp 0 \\ 0 \amp -5 \amp 0 \\ 0 \amp 0 \amp -3 \\ \end{array}\right] P^{-1}\text{.} \end{equation*}
    Find \(\det(A)\text{.}\)

4.

Consider the matrix
\begin{equation*} A = \left[\begin{array}{rrr} -2 \amp 1 \amp k \\ 2 \amp 3 \amp 0 \\ 1 \amp 2 \amp 2 \\ \end{array}\right] \end{equation*}
where \(k\) is a parameter.
  1. Find an expression for \(\det(A)\) in terms of the parameter \(k\text{.}\)
  2. Use your expression for \(\det(A)\) to determine the values of \(k\) for which the vectors
    \begin{equation*} \threevec{-2}{2}{1}, \threevec{1}{3}{2}, \threevec{k}{0}{2} \end{equation*}
    are linearly independent.

5.

Determine whether the following statements are true or false and explain your response.
  1. If we have a square matrix \(A\) and multiply the first row by \(5\) and add it to the third row to obtain \(A'\text{,}\) then \(\det(A') = 5\det(A)\text{.}\)
  2. If we interchange two rows of a matrix, then the determinant is unchanged.
  3. If we scale a row of the matrix \(A\) by \(17\) to obtain \(A'\text{,}\) then \(\det(A') = 17\det(A)\text{.}\)
  4. If \(A\) and \(A'\) are row equivalent and \(\det(A') = 0\text{,}\) then \(\det(A) = 0\) also.
  5. If \(A\) is row equivalent to the identity matrix, then \(\det(A) = \det(I) = 1\text{.}\)

6.

Suppose that \(A\) and \(B\) are \(5\by5\) matrices such that \(\det(A) = -2\) and \(\det(B) = 5\text{.}\) Find the following determinants:
  1. \(\det(2A)\text{.}\)
  2. \(\det(A^3)\text{.}\)
  3. \(\det(AB)\text{.}\)
  4. \(\det(-A)\text{.}\)
  5. \(\det(AB^{-1})\text{.}\)

7.

Suppose that \(A\) and \(B\) are \(n\by n\) matrices.
  1. If \(A\) and \(B\) are both invertible, use determinants to explain why \(AB\) is invertible.
  2. If \(AB\) is invertible, use determinants to explain why both \(A\) and \(B\) are invertible.

8.

Provide a justification for your responses to the following questions.
  1. If every entry in one row of a matrix is zero, what can you say about the determinant?
  2. If two rows of a square matrix are identical, what can you say about the determinant?
  3. If two columns of a square matrix are identical, what can you say about the determinant?
  4. If one column of a matrix is a linear combination of the others, what can you say about the determinant?

9.

Consider the matrix
\begin{equation*} A = \left[\begin{array}{rrr} 0 \amp 1 \amp x \\ 2 \amp 2 \amp y \\ -1 \amp 0 \amp z \\ \end{array}\right]\text{.} \end{equation*}
  1. Assuming that \(\det(A) = 0\text{,}\) rewrite the equation in terms of \(x\text{,}\) \(y\text{,}\) and \(z\text{.}\)
  2. Explain why \(\vvec_1\) and \(\vvec_2\text{,}\) the first two columns of \(A\text{,}\) satisfy the equation you found in the previous part.
  3. Explain why the solution space of this equation is the plane spanned by \(\vvec_1\) and \(\vvec_2\text{.}\)

10.

In this section, we studied the effect of row operations on the matrix \(A\text{.}\) In this exercise, we will study the effect of analogous column operations.
Suppose that \(A\) is the \(3\by3\) matrix \(A= \left[\begin{array}{rrr} \vvec_1 \amp \vvec_2 \amp \vvec_3 \end{array}\right]\text{.}\) Also consider elementary matrices
\begin{equation*} R = \left[\begin{array}{rrr} 1 \amp 0 \amp 0 \\ 0 \amp 1 \amp 0 \\ -3 \amp 0 \amp 1 \\ \end{array}\right],~~~ S = \left[\begin{array}{rrr} 1 \amp 0 \amp 0 \\ 0 \amp 3 \amp 0 \\ 0 \amp 0 \amp 1 \\ \end{array}\right],~~~ P = \left[\begin{array}{rrr} 0 \amp 0 \amp 1 \\ 0 \amp 1 \amp 0 \\ 1 \amp 0 \amp 0 \\ \end{array}\right]\text{.} \end{equation*}
  1. Explain why the matrix \(AR\) is obtained from \(A\) by replacing the first column \(\vvec_1\) by \(\vvec_1 - 3\vvec_3\text{.}\) We call this a column replacement operation. Explain why column replacement operations do not change the determinant.
  2. Explain why the matrix \(AS\) is obtained from \(A\) by multiplying the second column by \(3\text{.}\) Explain the effect that scaling a column has on the determinant of a matrix.
  3. Explain why the matrix \(AP\) is obtained from \(A\) by interchanging the first and third columns. What is the effect of this operation on the determinant?
  4. Use column operations to compute the determinant of
    \begin{equation*} A=\left[\begin{array}{rrr} 0 \amp -3 \amp 1 \\ 1 \amp 1 \amp 4 \\ 1 \amp 1 \amp 0 \\ \end{array} \right]\text{.} \end{equation*}

11.

Consider the matrices
\begin{equation*} A = \left[\begin{array}{rrrr} 0 \amp 1 \amp 0 \amp 0 \\ 0 \amp 0 \amp 1 \amp 0 \\ 0 \amp 0 \amp 0 \amp 1 \\ 1 \amp 0 \amp 0 \amp 0 \\ \end{array}\right],~~~ B = \left[\begin{array}{rrrr} 0 \amp 1 \amp 0 \amp 0 \\ 1 \amp 0 \amp 0 \amp 0 \\ 0 \amp 0 \amp 0 \amp 1 \\ 0 \amp 0 \amp 1 \amp 0 \\ \end{array}\right],~~~ C = \left[\begin{array}{rrrr} 0 \amp 0 \amp 0 \amp a \\ 0 \amp 0 \amp b \amp 0 \\ 0 \amp c \amp 0 \amp 0 \\ d \amp 0 \amp 0 \amp 0 \\ \end{array}\right]\text{.} \end{equation*}
Use row operations to find the determinants of these matrices.

12.

Consider the matrices
\begin{equation*} \begin{aligned} A = \left[\begin{array}{rr} 0 \amp 1 \\ 1 \amp 0 \\ \end{array}\right], \qquad \amp B = \left[\begin{array}{rrr} 0 \amp 1 \amp 0 \\ 1 \amp 0 \amp 1 \\ 0 \amp 1 \amp 0 \\ \end{array}\right], \\ \\ C = \left[\begin{array}{rrrr} 0 \amp 1 \amp 0 \amp 0 \\ 1 \amp 0 \amp 1 \amp 0 \\ 0 \amp 1 \amp 0 \amp 1 \\ 0 \amp 0 \amp 1 \amp 0 \\ \end{array}\right], \qquad \amp D = \left[\begin{array}{rrrrr} 0 \amp 1 \amp 0 \amp 0 \amp 0 \\ 1 \amp 0 \amp 1 \amp 0 \amp 0 \\ 0 \amp 1 \amp 0 \amp 1 \amp 0 \\ 0 \amp 0 \amp 1 \amp 0 \amp 1 \\ 0 \amp 0 \amp 0 \amp 1 \amp 0 \\ \end{array}\right] \end{aligned} \end{equation*}
  1. Use row (and/or column) operations to find the determinants of these matrices.
  2. Write the \(6\by6\) and \(7\by7\) matrices that follow in this pattern and state their determinants based on what you have seen.

13.

The following matrix is called a Vandermond matrix:
\begin{equation*} V = \left[\begin{array}{rrr} 1 \amp a \amp a^2 \\ 1 \amp b \amp b^2 \\ 1 \amp c \amp c^2 \\ \end{array}\right]\text{.} \end{equation*}
  1. Use row operations to explain why \(\det(V) = (b-a)(c-a)(c-b)\text{.}\)
  2. Explain why \(V\) is invertible if and only if \(a\text{,}\) \(b\text{,}\) and \(c\) are all distinct real numbers.
  3. There is a natural way to generalize this to a \(4\by4\) matrix with parameters \(a\text{,}\) \(b\text{,}\) \(c\text{,}\) and \(d\text{.}\) Write this matrix and state its determinant based on your previous work.
This matrix appeared in Exercise 2.4.4.9 when we were found a polynomial that passed through a given set of points.