Conjugate transpose

Complex matrix A* obtained from a matrix A by transposing it and conjugating each entry

"Adjoint matrix" redirects here. For the transpose of cofactor, see Adjugate matrix.

In mathematics, the conjugate transpose, also known as the Hermitian transpose, of an $m\times n$ {\displaystyle m\times n} complex matrix $\mathbf {A}$ {\displaystyle \mathbf {A} } is an $n\times m$ {\displaystyle n\times m} matrix obtained by transposing $\mathbf {A}$ {\displaystyle \mathbf {A} } and applying complex conjugation to each entry (the complex conjugate of $a+ib$ {\displaystyle a+ib} being $a-ib$ {\displaystyle a-ib}, for real numbers $a$ {\displaystyle a} and $b$ {\displaystyle b}). There are several notations, such as $\mathbf {A} ^{\mathrm {H} }$ {\displaystyle \mathbf {A} ^{\mathrm {H} }} or $\mathbf {A} ^{*}$ {\displaystyle \mathbf {A} ^{*}},^[1] $\mathbf {A} '$ {\displaystyle \mathbf {A} '},^[2] or (often in physics) $\mathbf {A} ^{\dagger }$ {\displaystyle \mathbf {A} ^{\dagger }}.

For real matrices, the conjugate transpose is just the transpose, $\mathbf {A} ^{\mathrm {H} }=\mathbf {A} ^{\operatorname {T} }$ {\displaystyle \mathbf {A} ^{\mathrm {H} }=\mathbf {A} ^{\operatorname {T} }}.

Definition

[edit ]

The conjugate transpose of an $m\times n$ {\displaystyle m\times n} matrix $\mathbf {A}$ {\displaystyle \mathbf {A} } is formally defined by

\left(\mathbf {A} ^{\mathrm {H} }\right)_{ij}={\overline {\mathbf {A} _{ji}}}

{\displaystyle \left(\mathbf {A} ^{\mathrm {H} }\right)_{ij}={\overline {\mathbf {A} _{ji}}}} Eq.1

where the subscript $ij$ {\displaystyle ij} denotes the $(i,j)$ {\displaystyle (i,j)}-th entry (matrix element), for $1\leq i\leq n$ {\displaystyle 1\leq i\leq n} and $1\leq j\leq m$ {\displaystyle 1\leq j\leq m}, and the overbar denotes a scalar complex conjugate.

This definition can also be written as

\mathbf {A} ^{\mathrm {H} }=\left({\overline {\mathbf {A} }}\right)^{\operatorname {T} }={\overline {\mathbf {A} ^{\operatorname {T} }}}

{\displaystyle \mathbf {A} ^{\mathrm {H} }=\left({\overline {\mathbf {A} }}\right)^{\operatorname {T} }={\overline {\mathbf {A} ^{\operatorname {T} }}}}

where $\mathbf {A} ^{\operatorname {T} }$ {\displaystyle \mathbf {A} ^{\operatorname {T} }} denotes the transpose and ${\overline {\mathbf {A} }}$ {\displaystyle {\overline {\mathbf {A} }}} denotes the matrix with complex conjugated entries.

Other names for the conjugate transpose of a matrix are Hermitian transpose, Hermitian conjugate, adjoint matrix or transjugate. The conjugate transpose of a matrix $\mathbf {A}$ {\displaystyle \mathbf {A} } can be denoted by any of these symbols:

$\mathbf {A} ^{*}$ {\displaystyle \mathbf {A} ^{*}}, commonly used in linear algebra
$\mathbf {A} ^{\mathrm {H} }$ {\displaystyle \mathbf {A} ^{\mathrm {H} }}, commonly used in linear algebra
$\mathbf {A} ^{\dagger }$ {\displaystyle \mathbf {A} ^{\dagger }} (sometimes pronounced as A dagger ), commonly used in quantum mechanics
$\mathbf {A} ^{+}$ {\displaystyle \mathbf {A} ^{+}}, although this symbol is more commonly used for the Moore–Penrose pseudoinverse

In some contexts, $\mathbf {A} ^{*}$ {\displaystyle \mathbf {A} ^{*}} denotes the matrix with only complex conjugated entries and no transposition.

Example

[edit ]

Suppose we want to calculate the conjugate transpose of the following matrix $\mathbf {A}$ {\displaystyle \mathbf {A} }.

\mathbf {A} ={\begin{bmatrix}1&-2-i&5\1円+i&i&4-2i\end{bmatrix}}

{\displaystyle \mathbf {A} ={\begin{bmatrix}1&-2-i&5\1円+i&i&4-2i\end{bmatrix}}}

We first transpose the matrix:

\mathbf {A} ^{\operatorname {T} }={\begin{bmatrix}1&1+i\\-2-i&i\5円&4-2i\end{bmatrix}}

{\displaystyle \mathbf {A} ^{\operatorname {T} }={\begin{bmatrix}1&1+i\\-2-i&i\5円&4-2i\end{bmatrix}}}

Then we conjugate every entry of the matrix:

\mathbf {A} ^{\mathrm {H} }={\begin{bmatrix}1&1-i\\-2+i&-i\5円&4+2i\end{bmatrix}}

{\displaystyle \mathbf {A} ^{\mathrm {H} }={\begin{bmatrix}1&1-i\\-2+i&-i\5円&4+2i\end{bmatrix}}}

Basic remarks

[edit ]

A square matrix $\mathbf {A}$ {\displaystyle \mathbf {A} } with entries $a_{ij}$ {\displaystyle a_{ij}} is called

Hermitian or self-adjoint if $\mathbf {A} =\mathbf {A} ^{\mathrm {H} }$ {\displaystyle \mathbf {A} =\mathbf {A} ^{\mathrm {H} }}; i.e., $a_{ij}={\overline {a_{ji}}}$ {\displaystyle a_{ij}={\overline {a_{ji}}}}.
Skew Hermitian or antihermitian if $\mathbf {A} =-\mathbf {A} ^{\mathrm {H} }$ {\displaystyle \mathbf {A} =-\mathbf {A} ^{\mathrm {H} }}; i.e., $a_{ij}=-{\overline {a_{ji}}}$ {\displaystyle a_{ij}=-{\overline {a_{ji}}}}.
Normal if $\mathbf {A} ^{\mathrm {H} }\mathbf {A} =\mathbf {A} \mathbf {A} ^{\mathrm {H} }$ {\displaystyle \mathbf {A} ^{\mathrm {H} }\mathbf {A} =\mathbf {A} \mathbf {A} ^{\mathrm {H} }}.
Unitary if $\mathbf {A} ^{\mathrm {H} }=\mathbf {A} ^{-1}$ {\displaystyle \mathbf {A} ^{\mathrm {H} }=\mathbf {A} ^{-1}}, equivalently $\mathbf {A} \mathbf {A} ^{\mathrm {H} }={\boldsymbol {I}}$ {\displaystyle \mathbf {A} \mathbf {A} ^{\mathrm {H} }={\boldsymbol {I}}}, equivalently $\mathbf {A} ^{\mathrm {H} }\mathbf {A} ={\boldsymbol {I}}$ {\displaystyle \mathbf {A} ^{\mathrm {H} }\mathbf {A} ={\boldsymbol {I}}}.

Even if $\mathbf {A}$ {\displaystyle \mathbf {A} } is not square, the two matrices $\mathbf {A} ^{\mathrm {H} }\mathbf {A}$ {\displaystyle \mathbf {A} ^{\mathrm {H} }\mathbf {A} } and $\mathbf {A} \mathbf {A} ^{\mathrm {H} }$ {\displaystyle \mathbf {A} \mathbf {A} ^{\mathrm {H} }} are both Hermitian and in fact positive semi-definite matrices.

The conjugate transpose "adjoint" matrix $\mathbf {A} ^{\mathrm {H} }$ {\displaystyle \mathbf {A} ^{\mathrm {H} }} should not be confused with the adjugate, $\operatorname {adj} (\mathbf {A} )$ {\displaystyle \operatorname {adj} (\mathbf {A} )}, which is also sometimes called adjoint.

The conjugate transpose of a matrix $\mathbf {A}$ {\displaystyle \mathbf {A} } with real entries reduces to the transpose of $\mathbf {A}$ {\displaystyle \mathbf {A} }, as the conjugate of a real number is the number itself.

The conjugate transpose can be motivated by noting that complex numbers can be usefully represented by $2\times 2$ {\displaystyle 2\times 2} real matrices, obeying matrix addition and multiplication:^[3]

a+ib\equiv {\begin{bmatrix}a&-b\\b&a\end{bmatrix}}.

{\displaystyle a+ib\equiv {\begin{bmatrix}a&-b\\b&a\end{bmatrix}}.}

That is, denoting each complex number $z$ {\displaystyle z} by the real $2\times 2$ {\displaystyle 2\times 2} matrix of the linear transformation on the Argand diagram (viewed as the real vector space $\mathbb {R} ^{2}$ {\displaystyle \mathbb {R} ^{2}}), affected by complex $z$ {\displaystyle z}-multiplication on $\mathbb {C}$ {\displaystyle \mathbb {C} }.

Thus, an $m\times n$ {\displaystyle m\times n} matrix of complex numbers could be well represented by a $2m\times 2n$ {\displaystyle 2m\times 2n} matrix of real numbers. The conjugate transpose, therefore, arises very naturally as the result of simply transposing such a matrix—when viewed back again as an $n\times m$ {\displaystyle n\times m} matrix made up of complex numbers.

For an explanation of the notation used here, we begin by representing complex numbers $e^{i\theta }$ {\displaystyle e^{i\theta }} as the rotation matrix, that is,

$e^{i\theta }={\begin{pmatrix}\cos \theta &-\sin \theta \\\sin \theta &\cos \theta \end{pmatrix}}=\cos \theta {\begin{pmatrix}1&0\0円&1\end{pmatrix}}+\sin \theta {\begin{pmatrix}0&-1\1円&0\end{pmatrix}}.$ {\displaystyle e^{i\theta }={\begin{pmatrix}\cos \theta &-\sin \theta \\\sin \theta &\cos \theta \end{pmatrix}}=\cos \theta {\begin{pmatrix}1&0\0円&1\end{pmatrix}}+\sin \theta {\begin{pmatrix}0&-1\1円&0\end{pmatrix}}.}

Since $e^{i\theta }=\cos \theta +i\sin \theta$ {\displaystyle e^{i\theta }=\cos \theta +i\sin \theta }, we are led to the matrix representations of the unit numbers as

$1={\begin{pmatrix}1&0\0円&1\end{pmatrix}},\quad i={\begin{pmatrix}0&-1\1円&0\end{pmatrix}}.$ {\displaystyle 1={\begin{pmatrix}1&0\0円&1\end{pmatrix}},\quad i={\begin{pmatrix}0&-1\1円&0\end{pmatrix}}.}

A general complex number $z=x+iy$ {\displaystyle z=x+iy} is then represented as $z={\begin{pmatrix}x&-y\\y&x\end{pmatrix}}.$ {\displaystyle z={\begin{pmatrix}x&-y\\y&x\end{pmatrix}}.}

The complex conjugate operation, where i→−i, is seen to be just the matrix transpose.

Properties

[edit ]

$(\mathbf {A} +{\boldsymbol {B}})^{\mathrm {H} }=\mathbf {A} ^{\mathrm {H} }+{\boldsymbol {B}}^{\mathrm {H} }$ {\displaystyle (\mathbf {A} +{\boldsymbol {B}})^{\mathrm {H} }=\mathbf {A} ^{\mathrm {H} }+{\boldsymbol {B}}^{\mathrm {H} }} for any two matrices $\mathbf {A}$ {\displaystyle \mathbf {A} } and ${\boldsymbol {B}}$ {\displaystyle {\boldsymbol {B}}} of the same dimensions.
$(z\mathbf {A} )^{\mathrm {H} }={\overline {z}}\mathbf {A} ^{\mathrm {H} }$ {\displaystyle (z\mathbf {A} )^{\mathrm {H} }={\overline {z}}\mathbf {A} ^{\mathrm {H} }} for any complex number $z$ {\displaystyle z} and any $m\times n$ {\displaystyle m\times n} matrix $\mathbf {A}$ {\displaystyle \mathbf {A} }.
$(\mathbf {A} {\boldsymbol {B}})^{\mathrm {H} }={\boldsymbol {B}}^{\mathrm {H} }\mathbf {A} ^{\mathrm {H} }$ {\displaystyle (\mathbf {A} {\boldsymbol {B}})^{\mathrm {H} }={\boldsymbol {B}}^{\mathrm {H} }\mathbf {A} ^{\mathrm {H} }} for any $m\times n$ {\displaystyle m\times n} matrix $\mathbf {A}$ {\displaystyle \mathbf {A} } and any $n\times p$ {\displaystyle n\times p} matrix ${\boldsymbol {B}}$ {\displaystyle {\boldsymbol {B}}}. Note that the order of the factors is reversed.^[1]
$\left(\mathbf {A} ^{\mathrm {H} }\right)^{\mathrm {H} }=\mathbf {A}$ {\displaystyle \left(\mathbf {A} ^{\mathrm {H} }\right)^{\mathrm {H} }=\mathbf {A} } for any $m\times n$ {\displaystyle m\times n} matrix $\mathbf {A}$ {\displaystyle \mathbf {A} }, i.e. Hermitian transposition is an involution.
If $\mathbf {A}$ {\displaystyle \mathbf {A} } is a square matrix, then $\det \left(\mathbf {A} ^{\mathrm {H} }\right)={\overline {\det \left(\mathbf {A} \right)}}$ {\displaystyle \det \left(\mathbf {A} ^{\mathrm {H} }\right)={\overline {\det \left(\mathbf {A} \right)}}} where $\operatorname {det} (A)$ {\displaystyle \operatorname {det} (A)} denotes the determinant of $\mathbf {A}$ {\displaystyle \mathbf {A} } .
If $\mathbf {A}$ {\displaystyle \mathbf {A} } is a square matrix, then $\operatorname {tr} \left(\mathbf {A} ^{\mathrm {H} }\right)={\overline {\operatorname {tr} (\mathbf {A} )}}$ {\displaystyle \operatorname {tr} \left(\mathbf {A} ^{\mathrm {H} }\right)={\overline {\operatorname {tr} (\mathbf {A} )}}} where $\operatorname {tr} (A)$ {\displaystyle \operatorname {tr} (A)} denotes the trace of $\mathbf {A}$ {\displaystyle \mathbf {A} }.
$\mathbf {A}$ {\displaystyle \mathbf {A} } is invertible if and only if $\mathbf {A} ^{\mathrm {H} }$ {\displaystyle \mathbf {A} ^{\mathrm {H} }} is invertible, and in that case $\left(\mathbf {A} ^{\mathrm {H} }\right)^{-1}=\left(\mathbf {A} ^{-1}\right)^{\mathrm {H} }$ {\displaystyle \left(\mathbf {A} ^{\mathrm {H} }\right)^{-1}=\left(\mathbf {A} ^{-1}\right)^{\mathrm {H} }}.
The eigenvalues of $\mathbf {A} ^{\mathrm {H} }$ {\displaystyle \mathbf {A} ^{\mathrm {H} }} are the complex conjugates of the eigenvalues of $\mathbf {A}$ {\displaystyle \mathbf {A} }.
$\left\langle \mathbf {A} x,y\right\rangle _{m}=\left\langle x,\mathbf {A} ^{\mathrm {H} }y\right\rangle _{n}$ {\displaystyle \left\langle \mathbf {A} x,y\right\rangle _{m}=\left\langle x,\mathbf {A} ^{\mathrm {H} }y\right\rangle _{n}} for any $m\times n$ {\displaystyle m\times n} matrix $\mathbf {A}$ {\displaystyle \mathbf {A} }, any vector in $x\in \mathbb {C} ^{n}$ {\displaystyle x\in \mathbb {C} ^{n}} and any vector $y\in \mathbb {C} ^{m}$ {\displaystyle y\in \mathbb {C} ^{m}}. Here, $\langle \cdot ,\cdot \rangle _{m}$ {\displaystyle \langle \cdot ,\cdot \rangle _{m}} denotes the standard complex inner product on $\mathbb {C} ^{m}$ {\displaystyle \mathbb {C} ^{m}}, and similarly for $\langle \cdot ,\cdot \rangle _{n}$ {\displaystyle \langle \cdot ,\cdot \rangle _{n}}.

Generalizations

[edit ]

The last property given above shows that if one views $\mathbf {A}$ {\displaystyle \mathbf {A} } as a linear transformation from Hilbert space $\mathbb {C} ^{n}$ {\displaystyle \mathbb {C} ^{n}} to $\mathbb {C} ^{m},$ {\displaystyle \mathbb {C} ^{m},} then the matrix $\mathbf {A} ^{\mathrm {H} }$ {\displaystyle \mathbf {A} ^{\mathrm {H} }} corresponds to the adjoint operator of $\mathbf {A}$ {\displaystyle \mathbf {A} }. The concept of adjoint operators between Hilbert spaces can thus be seen as a generalization of the conjugate transpose of matrices with respect to an orthonormal basis.

Another generalization is available: suppose $A$ {\displaystyle A} is a linear map from a complex vector space $V$ {\displaystyle V} to another, $W$ {\displaystyle W}, then the complex conjugate linear map as well as the transposed linear map are defined, and we may thus take the conjugate transpose of $A$ {\displaystyle A} to be the complex conjugate of the transpose of $A$ {\displaystyle A}. It maps the conjugate dual of $W$ {\displaystyle W} to the conjugate dual of $V$ {\displaystyle V}.

References

[edit ]

^ ^a ^b Weisstein, Eric W. "Conjugate Transpose". mathworld.wolfram.com. Retrieved 2020年09月08日.
^ H. W. Turnbull, A. C. Aitken, "An Introduction to the Theory of Canonical Matrices," 1932.
^ Chasnov, Jeffrey R. (4 February 2022). "1.6: Matrix Representation of Complex Numbers". Applied Linear Algebra and Differential Equations. LibreTexts.

External links

[edit ]

"Adjoint matrix", Encyclopedia of Mathematics , EMS Press, 2001 [1994]

Retrieved from "https://en.wikipedia.org/w/index.php?title=Conjugate_transpose&oldid=1270586942"

Definition

Example

Basic remarks

Properties

Generalizations

See also

References

External links