Elementary matrix: Difference between revisions

Content deleted Content added

Inline

Latest revision as of 14:04, 29 June 2024

In mathematics, an elementary matrix is a matrix which differs from the identity matrix by one single elementary row operation. The elementary matrices generate the general linear group $GL n (F)$ when $F$ is a field. Left multiplication (pre-multiplication) by an elementary matrix represents elementary row operations, while right multiplication (post-multiplication) represents elementary column operations.

Elementary row operations are used in Gaussian elimination to reduce a matrix to row echelon form. They are also used in Gauss–Jordan elimination to further reduce the matrix to reduced row echelon form.

Elementary row operations

There are three types of elementary matrices, which correspond to three types of row operations (respectively, column operations):

Row switching: A row within the matrix can be switched with another row.; $R_{i}\leftrightarrow R_{j}$

Row multiplication: Each element in a row can be multiplied by a non-zero constant. It is also known as scaling a row.; $kR_{i}\rightarrow R_{i},\ {\mbox{where }}k\neq 0$

Row addition: A row can be replaced by the sum of that row and a multiple of another row.; $R_{i}+kR_{j}\rightarrow R_{i},{\mbox{where }}i\neq j$

If $E$ is an elementary matrix, as described below, to apply the elementary row operation to a matrix $A$ , one multiplies $A$ by the elementary matrix on the left, $EA$ . The elementary matrix for any row operation is obtained by executing the operation on the identity matrix. This fact can be understood as an instance of the Yoneda lemma applied to the category of matrices.^[1]

Row-switching transformations

The first type of row operation on a matrix $A$ switches all matrix elements on row $i$ with their counterparts on a different row $j$ . The corresponding elementary matrix is obtained by swapping row $i$ and row $j$ of the identity matrix.

T_{i,j}={\begin{bmatrix}1&&&&&&\\&\ddots &&&&&\\&&0&&1&&\\&&&\ddots &&&\\&&1&&0&&\\&&&&&\ddots &\\&&&&&&1\end{bmatrix}}

So $T i,j A$ is the matrix produced by exchanging row $i$ and row $j$ of $A$ .

Coefficient wise, the matrix $T i,j$ is defined by :

[T_{i,j}]_{k,l}={\begin{cases}0&k\neq i,k\neq j,k\neq l\\1&k\neq i,k\neq j,k=l\\0&k=i,l\neq j\\1&k=i,l=j\\0&k=j,l\neq i\\1&k=j,l=i\\\end{cases}}

Properties

The inverse of this matrix is itself: $T_{i,j}^{-1}=T_{i,j}.$
Since the determinant of the identity matrix is unity, $\det(T_{i,j})=-1.$ It follows that for any square matrix $A$ (of the correct size), we have $\det(T_{i,j}A)=-\det(A).$
For theoretical considerations, the row-switching transformation can be obtained from row-addition and row-multiplication transformations introduced below because $T_{i,j}=D_{i}(-1)\,L_{i,j}(-1)\,L_{j,i}(1)\,L_{i,j}(-1).$

Row-multiplying transformations

The next type of row operation on a matrix $A$ multiplies all elements on row $i$ by $m$ where $m$ is a non-zero scalar (usually a real number). The corresponding elementary matrix is a diagonal matrix, with diagonal entries 1 everywhere except in the $i$ th position, where it is $m$ .

D_{i}(m)={\begin{bmatrix}1&&&&&&\\&\ddots &&&&&\\&&1&&&&\\&&&m&&&\\&&&&1&&\\&&&&&\ddots &\\&&&&&&1\end{bmatrix}}

So $D i (m) A$ is the matrix produced from $A$ by multiplying row $i$ by $m$ .

Coefficient wise, the $D i (m)$ matrix is defined by :

[D_{i}(m)]_{k,l}={\begin{cases}0&k\neq l\\1&k=l,k\neq i\\m&k=l,k=i\end{cases}}

Properties

The inverse of this matrix is given by $D_{i}(m)^{-1}=D_{i}\left({\tfrac {1}{m}}\right).$
The matrix and its inverse are diagonal matrices.
$\det(D_{i}(m))=m.$ Therefore, for a square matrix $A$ (of the correct size), we have $\det(D_{i}(m)A)=m\det(A).$

Row-addition transformations

The final type of row operation on a matrix $A$ adds row $j$ multiplied by a scalar $m$ to row $i$ . The corresponding elementary matrix is the identity matrix but with an $m$ in the $(i, j)$ position.

L_{ij}(m)={\begin{bmatrix}1&&&&&&\\&\ddots &&&&&\\&&1&&&&\\&&&\ddots &&&\\&&m&&1&&\\&&&&&\ddots &\\&&&&&&1\end{bmatrix}}

So $L ij (m) A$ is the matrix produced from $A$ by adding $m$ times row $j$ to row $i$ . And $A L ij (m)$ is the matrix produced from $A$ by adding $m$ times column $i$ to column $j$ .

Coefficient wise, the matrix $L i,j (m)$ is defined by :

[L_{i,j}(m)]_{k,l}={\begin{cases}0&k\neq l,k\neq i,l\neq j\\1&k=l\\m&k=i,l=j\end{cases}}

Properties

These transformations are a kind of shear mapping, also known as a transvections.
The inverse of this matrix is given by $L_{ij}(m)^{-1}=L_{ij}(-m).$
The matrix and its inverse are triangular matrices.
$\det(L_{ij}(m))=1.$ Therefore, for a square matrix $A$ (of the correct size) we have $\det(L_{ij}(m)A)=\det(A).$
Row-addition transforms satisfy the Steinberg relations.

References

^ Perrone (2024), pp. 119–120

Axler, Sheldon Jay (1997), Linear Algebra Done Right (2nd ed.), Springer-Verlag, ISBN 0-387-98259-0
Lay, David C. (August 22, 2005), Linear Algebra and Its Applications (3rd ed.), Addison Wesley, ISBN 978-0-321-28713-7
Meyer, Carl D. (February 15, 2001), Matrix Analysis and Applied Linear Algebra, Society for Industrial and Applied Mathematics (SIAM), ISBN 978-0-89871-454-8, archived from the original on 2009-10-31
Perrone, Paolo (2024), Starting Category Theory, World Scientific, doi:10.1142/9789811286018_0005, ISBN 978-981-12-8600-1
Poole, David (2006), Linear Algebra: A Modern Introduction (2nd ed.), Brooks/Cole, ISBN 0-534-99845-3
Anton, Howard (2005), Elementary Linear Algebra (Applications Version) (9th ed.), Wiley International
Leon, Steven J. (2006), Linear Algebra With Applications (7th ed.), Pearson Prentice Hall
Strang, Gilbert (2016), Introduction to Linear Algebra (5th ed.), Wellesley-Cambridge Press, ISBN 978-09802327-7-6

[1] Perrone (2024), pp. 119–120

[1]

@@ Line 1: / Line 1: @@
+{{short description|Matrix which differs from the identity matrix by one elementary row operation}}
-In [[mathematics]], an '''elementary matrix''' is a [[matrix (mathematics)|matrix]] which differs from the [[identity matrix]] by one single elementary row operation. The elementary matrices generate the [[general linear group]] of [[invertible matrix|invertible matrices]].  Left multiplication (pre-multiplication) by an elementary matrix represents '''elementary row operations''', while right multiplication (post-multiplication) represents '''elementary column operations'''.
+In [[mathematics]], an '''elementary matrix''' is a [[matrix (mathematics)|matrix]] which differs from the [[identity matrix]] by one single elementary row operation. The elementary matrices generate the [[general linear group]] {{math|GL<sub>''n''</sub>('''F''')}} when {{math|'''F'''}} is a [[Field (mathematics)|field]]. Left multiplication (pre-multiplication) by an elementary matrix represents '''elementary row operations''', while right multiplication (post-multiplication) represents '''elementary column operations'''.
-Elementary row operations are used in [[Gaussian elimination]] to reduce a matrix to [[row echelon form]].  They are also used in [[Gauss-Jordan elimination]] to further reduce the matrix to [[reduced row echelon form]].
+Elementary row operations are used in [[Gaussian elimination]] to reduce a matrix to [[row echelon form]].  They are also used in [[Gauss–Jordan elimination]] to further reduce the matrix to [[reduced row echelon form]].
 ==Elementary row operations==
@@ Line 9: / Line 11: @@
 : <math>R_i \leftrightarrow R_j</math>
-;Row multiplication: Each element in a row can be multiplied by a non-zero constant.
+;Row multiplication: Each element in a row can be multiplied by a non-zero constant. It is also known as ''scaling'' a row.
 : <math>kR_i \rightarrow R_i,\ \mbox{where } k \neq 0</math>
@@ Line 15: / Line 17: @@
 : <math>R_i + kR_j \rightarrow R_i, \mbox{where } i \neq j </math>
-If ''E'' is an elementary matrix, as described below, to apply the elementary row operation to a matrix ''A'', one multiplies ''A'' by the elementary matrix on the left, ''E⋅A''. The elementary matrix for any row operation is obtained by executing the operation on the [[identity matrix]].
+If {{mvar|E}} is an elementary matrix, as described below, to apply the elementary row operation to a matrix {{mvar|A}}, one multiplies {{mvar|A}} by the elementary matrix on the left, {{mvar|EA}}. The elementary matrix for any row operation is obtained by executing the operation on the [[identity matrix]]. This fact can be understood as an instance of the [[Yoneda lemma]] applied to the category of matrices.<ref>{{harvp|Perrone|2024|pages=119-120}}</ref>
 ===Row-switching transformations===
+{{See also|Permutation matrix}}
-The first type of row operation on a matrix ''A'' switches all matrix elements on row ''i'' with their counterparts on row ''j''. The corresponding elementary matrix is obtained by swapping row ''i'' and row ''j'' of the [[identity matrix]].
+The first type of row operation on a matrix {{mvar|A}} switches all matrix elements on row {{mvar|i}} with their counterparts on a different row {{mvar|j}}. The corresponding elementary matrix is obtained by swapping row {{mvar|i}} and row {{mvar|j}} of the [[identity matrix]].
 :<math>T_{i,j} = \begin{bmatrix}
@@ Line 30: / Line 33: @@
 \end{bmatrix}</math>
-So ''T<sub>ij</sub>⋅A'' is the matrix produced by exchanging row ''i'' and row ''j'' of ''A''.
+So {{mvar|T<sub>i,j</sub> A}} is the matrix produced by exchanging row {{mvar|i}} and row {{mvar|j}} of {{mvar|A}}.
+Coefficient wise, the matrix {{mvar|T<sub>i,j</sub>}} is defined by :
+:<math>
+[T_{i,j}]_{k,l} =
+\begin{cases}
+& k \neq i, k \neq j ,k \neq l \\
+& k \neq i, k \neq j, k = l\\
+& k = i, l \neq j\\
+& k = i, l = j\\
+& k = j, l \neq i\\
+& k = j, l = i\\
+\end{cases}</math>
 ====Properties====
-* The inverse of this matrix is itself: ''T<sub>ij</sub><sup>&minus;1</sup>=T<sub>ij</sub>''.
+* The inverse of this matrix is itself: <math>T_{i,j}^{-1} = T_{i,j}.</math>
-* Since the [[determinant]] of the identity matrix is unity, det[''T''<sub>''ij''</sub>] = &minus;1.  It follows that for any square matrix ''A'' (of the correct size), we have det[''T''<sub>''ij''</sub>''A''] = &minus;det[''A''].
+* Since the [[determinant]] of the identity matrix is unity, <math>\det(T_{i,j}) = -1.</math>  It follows that for any square matrix {{mvar|A}} (of the correct size), we have <math>\det(T_{i,j}A) = -\det(A).</math>
+* For theoretical considerations, the row-switching transformation can be obtained from row-addition and row-multiplication transformations introduced below because <math>T_{i,j}=D_i(-1)\,L_{i,j}(-1)\,L_{j,i}(1)\,L_{i,j}(-1).</math>
 ===Row-multiplying transformations===
-The next type of row operation on a matrix ''A'' multiplies all elements on row ''i'' by ''m'' where ''m'' is a non-zero [[scalar (mathematics)|scalar]] (usually a real number). The corresponding elementary matrix is a diagonal matrix, with diagonal entries 1 everywhere except in the ''i''th position, where it is ''m''.
+The next type of row operation on a matrix {{mvar|A}} multiplies all elements on row {{mvar|i}} by {{mvar|m}} where {{mvar|m}} is a non-zero [[scalar (mathematics)|scalar]] (usually a real number). The corresponding elementary matrix is a diagonal matrix, with diagonal entries 1 everywhere except in the {{mvar|i}}th position, where it is {{mvar|m}}.
 :<math>D_i(m) = \begin{bmatrix}
@@ Line 49: / Line 66: @@
 \end{bmatrix}</math>
-So ''D<sub>i</sub>(m)⋅A'' is the matrix produced from ''A'' by multiplying row ''i'' by ''m''.
+So {{math|''D<sub>i</sub>''(''m'')''A''}} is the matrix produced from {{mvar|A}} by multiplying row {{mvar|i}} by {{mvar|m}}.
+Coefficient wise, the {{math|''D<sub>i</sub>''(''m'')}} matrix is defined by :
+:<math>
+[D_i(m)]_{k,l} = \begin{cases}
+& k \neq l \\
+& k = l, k \neq i \\
+m & k = l, k= i
+\end{cases}</math>
 ====Properties====
-* The inverse of this matrix is: D''<sub>i</sub>''(''m'')<sup>&minus;1</sup> = D''<sub>i</sub>''(1/''m'').
+* The inverse of this matrix is given by <math>D_i(m)^{-1} = D_i \left(\tfrac 1 m \right).</math>
 * The matrix and its inverse are [[diagonal matrix|diagonal matrices]].
-* det[D<sub>''i''</sub>(m)] = ''m''. Therefore for a square matrix ''A'' (of the correct size), we have det[D<sub>''i''</sub>(''m'')''A''] = ''m'' det[''A''].
+* <math>\det(D_i(m)) = m.</math> Therefore, for a square matrix {{mvar|A}} (of the correct size), we have <math>\det(D_i(m)A) = m\det(A).</math>
 ===Row-addition transformations===
-The final type of row operation on a matrix ''A'' adds row ''i'' multiplied by a scalar ''m'' to row ''j''. The corresponding elementary matrix is the identity matrix but with an ''m'' in the (''j,i'') position.
+The final type of row operation on a matrix {{mvar|A}} adds row {{mvar|j}} multiplied by a scalar {{mvar|m}} to row {{mvar|i}}. The corresponding elementary matrix is the identity matrix but with an {{mvar|m}} in the {{math|(''i, j'')}} position.
-:<math>L_{i,j}(m) = \begin{bmatrix}
+:<math>L_{ij}(m) = \begin{bmatrix}
-&        &   &        &   & &        &   \\
+&        &   &        &   &        &   \\
     & \ddots &   &        &   &        &   \\
     &        & 1 &        &   &        &   \\
@@ Line 68: / Line 94: @@
 \end{bmatrix}</math>
-So ''L<sub>i,j</sub>(m)⋅A'' is the matrix produced from ''A'' by adding ''m'' times row ''i'' to row ''j''.
+So {{math|''L<sub>ij</sub>''(''m'')''A''}} is the matrix produced from {{mvar|A}} by adding {{mvar|m}} times row {{mvar|j}} to row {{mvar|i}}.
+And {{math|''A L<sub>ij</sub>''(''m'')}} is the matrix produced from {{mvar|A}} by adding {{mvar|m}} times column {{mvar|i}} to column {{mvar|j}}.
+Coefficient wise, the matrix {{math|''L{{sub|i,j}}''(''m'')}} is defined by :
+:<math>[L_{i,j}(m)]_{k,l} = \begin{cases}
+& k \neq l, k \neq i, l \neq j \\
+& k = l \\
+m & k = i, l = j
+\end{cases}</math>
 ====Properties====
 * These transformations are a kind of [[shear mapping]], also known as a ''transvections''.
+* The inverse of this matrix is given by <math>L_{ij}(m)^{-1} = L_{ij}(-m).</math>
-* ''L<sub>ij</sub>''(''m'')<sup>&minus;1</sup> = ''L<sub>ij</sub>''(&minus;''m'') (inverse matrix).
 * The matrix and its inverse are [[triangular matrix|triangular matrices]].
-* det[''L<sub>ij</sub>''(''m'')] = 1. Therefore, for a square matrix ''A'' (of the correct size) we have det[''L''<sub>''ij''</sub>(''m'')''A''] = det[''A''].
+* <math>\det(L_{ij}(m)) = 1.</math> Therefore, for a square matrix {{mvar|A}} (of the correct size) we have <math>\det(L_{ij}(m)A) = \det(A).</math>
 * Row-addition transforms satisfy the [[Steinberg relations]].
@@ Line 86: / Line 121: @@
 ==References==
+{{reflist}}
 {{See also|Linear algebra#Further reading}}
 * {{Citation
@@ Line 113: / Line 149: @@
  |isbn        = 978-0-89871-454-8
  |url         = http://www.matrixanalysis.com/DownloadChapters.html
- |deadurl     = yes
+ |url-status     = dead
- |archiveurl  = https://web.archive.org/web/20091031193126/http://matrixanalysis.com/DownloadChapters.html
+ |archive-url  = https://web.archive.org/web/20091031193126/http://matrixanalysis.com/DownloadChapters.html
- |archivedate = 2009-10-31
+ |archive-date = 2009-10-31
+}}
- |df          =
+* {{Citation
+|last = Perrone |first = Paolo
+|title = Starting Category Theory
+|date = 2024
+|publisher = World Scientific
+|doi = 10.1142/9789811286018_0005
+|isbn =  978-981-12-8600-1
+|url = https://www.worldscientific.com/worldscibooks/10.1142/13670
 }}
 * {{Citation
@@ Line 143: / Line 187: @@
  | edition = 7th
 }}
+* {{Citation
+ | last = Strang
+ | first = Gilbert
+ | author-link = Gilbert Strang
+ | year = 2016
+ | title = Introduction to Linear Algebra
+ | publisher = Wellesley-Cambridge Press
+ | edition = 5th|isbn=978-09802327-7-6
+}}
+{{Matrix classes}}
 {{DEFAULTSORT:Elementary Matrix}}

v t e Matrix classes
Explicitly constrained entries	Alternant Anti-diagonal Anti-Hermitian Anti-symmetric Arrowhead Band Bidiagonal Bisymmetric Block-diagonal Block Block tridiagonal Boolean Cauchy Centrosymmetric Conference Complex Hadamard Copositive Diagonally dominant Diagonal Discrete Fourier Transform Elementary Equivalent Frobenius Generalized permutation Hadamard Hankel Hermitian Hessenberg Hollow Integer Logical Matrix unit Metzler Moore Nonnegative Pentadiagonal Permutation Persymmetric Polynomial Quaternionic Signature Skew-Hermitian Skew-symmetric Skyline Sparse Sylvester Symmetric Toeplitz Triangular Tridiagonal Vandermonde Walsh Z
Constant	Exchange Hilbert Identity Lehmer Of ones Pascal Pauli Redheffer Shift Zero
Conditions on eigenvalues or eigenvectors	Companion Convergent Defective Definite Diagonalizable Hurwitz Positive-definite Stieltjes
Satisfying conditions on products or inverses	Congruent Idempotent or Projection Invertible Involutory Nilpotent Normal Orthogonal Unimodular Unipotent Unitary Totally unimodular Weighing
With specific applications	Adjugate Alternating sign Augmented Bézout Carleman Cartan Circulant Cofactor Commutation Confusion Coxeter Distance Duplication and elimination Euclidean distance Fundamental (linear differential equation) Generator Gram Hessian Householder Jacobian Moment Payoff Pick Random Rotation Seifert Shear Similarity Symplectic Totally positive Transformation
Used in statistics	Centering Correlation Covariance Design Doubly stochastic Fisher information Hat Precision Stochastic Transition
Used in graph theory	Adjacency Biadjacency Degree Edmonds Incidence Laplacian Seidel adjacency Tutte
Used in science and engineering	Cabibbo–Kobayashi–Maskawa Density Fundamental (computer vision) Fuzzy associative Gamma Gell-Mann Hamiltonian Irregular Overlap S State transition Substitution Z (chemistry)
Related terms	Jordan normal form Linear independence Matrix exponential Matrix representation of conic sections Perfect matrix Pseudoinverse Row echelon form Wronskian
Mathematics portal List of matrices Category:Matrices