양의 정부호 행렬

Section 7.2 양의 정부호 행렬

이 절에서 $S$는 대칭행렬이다.

Subsection 7.2.1 양의 정부호 행렬 - 정의

정의 7.2.1. 양의 정부호 행렬.

$n \times n$대칭행렬 $S$가 조건

\begin{equation*} \boldsymbol x^T S\boldsymbol x > 0 \Leftrightarrow \boldsymbol x \not = 0,\hspace{10pt}\text{(단, $\boldsymbol x\in\mathbf R^n$)} \end{equation*}

을 만족하면, $S$를 양의 정부호 또는 양정부호행렬이라 한다. 기호로

\begin{equation*} S>0 \end{equation*}

이라 나타낸다.

간단한 경우를 살펴보자. $1 \times 1 행렬$ $\begin{bmatrix}a\end{bmatrix}$가 양정부호일 필요충분조건은 $a>0$이다. 영 아닌 모든 실수 $x$에 대하여

\begin{equation*} ax^2 > 0 \Leftrightarrow x\not =0 \end{equation*}

이 성립하기 위한 필요충분조건은 $a>0$이다. 즉, 이 경우 양정부호 조건은 실수의 양부호 개념과 동치이다. 따라서, 양정부호 개념은 실수의 부호 개념을 대칭행렬로 확장했다고 볼 수 있다.

예시 7.2.2. $2\times2$ 양정부호 행렬과 완전제곱식.

양정부호 행렬의 조건

\begin{equation*} \begin{bmatrix}x\amp y\end{bmatrix}\begin{bmatrix}a \amp b \\ b \amp c\end{bmatrix}\begin{bmatrix}x\\y\end{bmatrix}>0 \end{equation*}

를 풀어쓰면

\begin{equation*} ax^2+2bxy+cy^2>0 \end{equation*}

이다. 완전제곱으로 변형하면

\begin{align*} ax^2+2bxy+cy^2 \amp = \left(ax^2+2bxy+\frac{b^2}{a}y^2\right)-\frac{b^2}{a}y^2+cy^2\\ \amp = a\left(x + \frac{b}{a}y\right)^2+\frac{ac-b^2}{a}y^2 \end{align*}

이다. 모든 $x,y$에 대하여 위 식이 양수이려면, 제곱항의 계수인 $a$와 $\frac{ac-b^2}{a}$가 둘 다 양수이어야 한다. 즉, 행렬 $\begin{bmatrix}a \amp b \\ b \amp c\end{bmatrix}$이 양정부호일 필요충분조건은 $a>0$이고 동시에 $ac-b^2>0$인 것이다.

예시 7.2.3. 성분의 부호와 양정부호 행렬.

양정부호 개념은 실수의 부호와 관련이 있으나 행렬

\begin{equation*} \begin{bmatrix}1 \amp 1 \\ 1 \amp 1\end{bmatrix} \end{equation*}

경우 모든 성분이 양수라 해도

\begin{equation*} \begin{bmatrix}1\amp-1\end{bmatrix}\begin{bmatrix}1 \amp 1 \\ 1 \amp 1\end{bmatrix}\begin{bmatrix}1\\-1\end{bmatrix}=0 \end{equation*}

이므로 양정부호는 아니다.

대각행렬의 경우 양정부호 성질은 대각성분의 부호로 귀결된다.

명제 7.2.4.

대각행렬 $D$가 양정부호이면 $D$의 모든 대각성분이 양수이다. 역도 성립한다.

증명.

$n \times n$ 대각행렬 $D$가 양정부호행렬이면 표준기저벡터 $\boldsymbol e_i$에 대하여 $\boldsymbol e_i^\operatorname TD\boldsymbol e_i$가 양수임을 안다. 역으로, $D$의 대각성분 $d_1,\cdots,d_n$가 모두 양수이면

\begin{equation*} \begin{bmatrix}a_1\amp a_2 \amp \cdots \amp a_n \end{bmatrix}D \begin{bmatrix} a_1 \\ a_2 \\ \vdots \\ a_n \end{bmatrix} = d_1a_1^2+d_2a_2^2+\cdots +d_na_n^2 \ge 0 \end{equation*}

이고 등호 조건이 $a_1=\cdots=a_n=0$이므로, $D$는 양정부호이다.

Subsection 7.2.2 고윳값 조건

명제 7.2.5.

$S$가 양정부호이면, 모든 고윳값은 양수이다. 역도 성립한다.

증명.

스펙트랄 정리를 이용하자. $n \times n$대칭행렬 $S$의 고유벡터로 이루어진 정규직교기저를 $\boldsymbol x_1,\cdots,\boldsymbol x_n$이라 하고 대응되는 교윳값을 $\lambda_1,\cdots,\lambda_n$이라 하자. $\boldsymbol x\not=0$이면

\begin{equation*} \boldsymbol x= a_1\boldsymbol x_1+\cdots+a_n\boldsymbol x_n \end{equation*}

으로 쓸 수 있으며, $\boldsymbol x=0$일 필요충분조건은 $a_1=\cdots=a_n=0$이다.

각 $i=1,\cdots,n$마다

\begin{equation*} \boldsymbol x_i^{\operatorname T} S\boldsymbol x_i = \boldsymbol x_i^{\operatorname T} (\lambda\boldsymbol x_i)= \lambda_i \end{equation*}

이고, $i\not = j$이면 직교성에 의하여

\begin{equation*} \boldsymbol x_i^{\operatorname T} S\boldsymbol x_j = \boldsymbol x_i^{\operatorname T} (\lambda\boldsymbol x_j)=0 \end{equation*}

이다. 이를 이용하여

\begin{equation*} \left(a_1\boldsymbol x_1+\cdots+a_n\boldsymbol x_n\right)^\operatorname T S \left(a_1\boldsymbol x_1+\cdots+a_n\boldsymbol x_n\right) \end{equation*}

을 전개하면

\begin{align} \amp \left(a_1\boldsymbol x_1+\cdots+a_n\boldsymbol x_n\right)^\operatorname T S \left(a_1\boldsymbol x_1+\cdots+a_n\boldsymbol x_n\right)\tag{7.2.1}\\ = \,\amp \lambda_1a_1^2+\cdots+\lambda_na_n^2\tag{7.2.2} \end{align}

을 얻는다.

스펙트랄 정리로부터 얻은 (7.2.2)으로부터 명제를 증명해보자. 만약 $S$가 양정부호라면 (7.2.2)의 우변은 임의의 영 아닌 벡터 $(a_1,\cdots,a_n)$에 대하여 양수이어야 한다. 표준기저벡터를 대입해보면 각 $\lambda_i$가 양수임을 안다. 역으로, 모든 $\lambda_i$가 양수라면 동치관계

\begin{equation*} \lambda_1a_1^2+\cdots+\lambda_na_n^2=0 \Leftrightarrow (a_1,\cdots,a_n)=0 \end{equation*}

를 얻고, 이로부터 $S$가 양정부호임을 안다.

Subsection 7.2.3 축 조건

$S=LDL^\operatorname T$분해를 생각하자. 여기서 $D$의 대각성분은 축이다. 축과 $S$의 양정부호 성질의 관계를 논의하기에 앞서, 유용한 사실을 먼저 소개한다.

명제 7.2.6.

$A$가 가역이면

\begin{equation*} S>0 \Leftrightarrow A^\operatorname TSA>0 \end{equation*}

이다.

증명.

$A$가 가역이므로 동치관계 $A\boldsymbol x =0 \Leftrightarrow \boldsymbol x=0$가 성립한다. $(A\boldsymbol x)^\operatorname T=\boldsymbol x^\operatorname TA^\operatorname T$와 결합법칙을 적용하면

\begin{equation*} (A\boldsymbol x)^\operatorname TS(A\boldsymbol x)=\boldsymbol x^\operatorname T\left(A^\operatorname TSA\right)\boldsymbol x \end{equation*}

이다. 종합하여 원하던 동치관계 $S>0 \Leftrightarrow A^\operatorname TSA>0$를 얻는다.

명제 7.2.7.

$S$가 양정부호이면, 모든 축은 양수이다. 역도 성립한다.

증명.

분해 $A=LDL^\operatorname T$에서 $D$의 대각성분이 $A$의 축이고, $D$의 축은 $D$의 대각성분이므로, 명제 7.2.6와 분해 $S=LDL^\operatorname T$에 의하여 $S$가 대각행렬인 경우만 고려하면 충분하다. 대각행렬의 경우는 명제 7.2.4에 의하여 명제가 증명된다.

Subsection 7.2.4 분해 조건

명제 7.2.8.

$m \times n $행렬 $A$의 열이 선형독립이면 $S=A^\operatorname{T}A$는 양정부호이다. 역으로, $n \times n $ 행렬 $S$가 양정부호이면, 열벡터가 독립인 $m \times n $행렬 $A$가 존재하여 대하여 $S=A^\operatorname{T}A$가 성립한다.

증명.

$m \times n $행렬 $A$의 열이 선형독립이라고 하자. $A\boldsymbol x=0 \Leftrightarrow \boldsymbol x=0$이 성립한다.

\begin{equation*} \boldsymbol x=0 \Leftrightarrow A\boldsymbol x=0 \Leftrightarrow (A\boldsymbol x) \bullet (A\boldsymbol x)=0 \end{equation*}

인데

\begin{equation*} (A\boldsymbol x) \bullet (A\boldsymbol x)=(A\boldsymbol x)^\operatorname T (A\boldsymbol x)=\boldsymbol x^\operatorname T (A^\operatorname TA)\boldsymbol x \end{equation*}

이다. 종합하면 $A^\operatorname TA$는 양정부호이다.

역으로, $S$가 양정부호라고 하자. 명제 7.2.7에 따르면, $S=LDL^\operatorname T$분해에서 $D$는 대각행렬이고 모든 대각성분이 양수이다. $D$의 대각성분을 $d_1,\cdots,d_n$이라 하고 $\sqrt{D}$를 대각성분이 $\sqrt{d_1},\cdots,\sqrt{d_n}$인 대각행렬이라고 하자. 정의로부터 $D=\sqrt D \sqrt D^\operatorname T$이다. $A=L\sqrt D$라 하면 원하던 분해 $S=A^\operatorname TA$를 얻는다.

Subsection 7.2.5 행렬식 조건

정의 7.2.9.

$n \times n $정사각행렬 $A$의 $k$번째 주부분행렬이란, $k$개 행과 첫 $k$개 열의 공통 성분으로 구성된 $k \times k$행렬을 말한다.

예시 7.2.10. 주부분행렬.

행렬

\begin{equation*} \begin{bmatrix}a\amp b \amp c \\ d \amp e \amp f \\ g \amp h \amp i\end{bmatrix} \end{equation*}

의 $2 \times 2$ 주부분행렬은

\begin{equation*} \begin{bmatrix}a\amp b \\ d \amp e\end{bmatrix} \end{equation*}

이다.

명제 7.2.11.

$S$가 양정부호이면 모든 주부분행렬의 행렬식이 양수이다. 역도 성립한다.

증명.

어떤 $A$의 $k$번째 주부분행렬을 $A_k$로 잠시 나타내자. $S=LDL^\operatorname T$분해로부터 주부분행렬의 분해 $S_k=L_kD_kL_k^\operatorname T$를 얻는다. 각 $S_k$가 양정부호인 것은 $D_k$의 대각성분이 모두 양수인 것과 동치임을 명제 7.2.7으로부터 안다. 한편, $\det S_k=\det L_k \det D_k \det L_k^\operatorname T=\det D_k$이므로 논의를 종합하면 원하던 동치관계를 얻는다.

Subsection 7.2.6 볼록성

여기서는 볼록성을 다룬다. 가령, 평면에서 1사분면은 볼록집합인데, 양수 곱과 두 원소의 덧셈에 대하여 닫혀있기 때문이다. 한편, 세 사분면의 합집합은 덧셈에 닫혀있지 않다.

예제 7.2.12.

예시 7.2.2에서 행렬 $\begin{bmatrix}a \amp b \\ b \amp c\end{bmatrix}$ 가 양정부호인 조건을 성분에 관한 부등식으로 표현하였다. 해당 영역을 $(a,b,c)$를 축으로 하는 좌표공간에서 기술하여라. 해당 영역 두 벡터 $\boldsymbol v_1,\boldsymbol v_2$가 포함되어 있으면 그 합 $\boldsymbol v_1+\boldsymbol v_2$도 포함되어 있음을 보여라.

명제 7.2.13.

$S$가 양정부호이면, 임의의 양수 $r$에 대하여 $rS$는 양정부호이다. 그리고, $S_1$과 $S_2$가 양정부호이면 $S_1+S_2$도 양정부호이다.

증명.

정의 7.2.1로부터 알 수 있다.

Subsection 7.2.7 극값

여기서는 다변수함수

\begin{equation*} \phi \colon \mathbf R^n \to \mathbf R \end{equation*}

의 극값에 대해 논의하려 한다. 우선 $n=2$이고 $\phi$가 2차 다항함수인 경우를 살피자. $\phi$가 다항식

\begin{equation*} \phi(x,y)=a + bx + cy + dx^2+exy+fy^2 \end{equation*}

으로 주어졌다고 하자. 언제 $\phi(x,y)$는 $(x,y)=0$에서 극값을 가질까?

$\phi(x,y)$가 $(x,y)=0$에서 극값을 가지면 $\phi(x,0)$은 $x=0$에서 극값을 가진다. 이로부터 $b=0$임을 안다. 비슷하게 $c=0$이다. 따라서, $\phi(x,y)=a+\text{(이차항의 합)}$ 꼴이다. 이차항들을 모아 행렬로 표현하면

\begin{align} \phi(x,y) \amp =a + dx^2+exy+fy^2\tag{7.2.3}\\ \amp =a + \begin{bmatrix}x \amp y \end{bmatrix} \begin{bmatrix}d \amp \frac e2\\ \frac e2 \amp f \end{bmatrix} \begin{bmatrix}x \\ y \end{bmatrix}\tag{7.2.4} \end{align}

명제 7.2.14.

$\begin{bmatrix}d \amp \frac e2\\ \frac e2 \amp f \end{bmatrix}$이 양정부호이면 (7.2.4)가 $(x,y)=0$에서 유일한 최솟값을 가진다.

증명.