本文为《Linear algebra and its applications》的读书笔记

Quadratic forms

A quadratic form on $R^n$ is a function $Q$ defined on $R^n$ whose value at a vector $\boldsymbol x$ in $R^n$ can be computed by an expression of the form $Q(\boldsymbol x)=\boldsymbol x^TA\boldsymbol x$ , where $A$ is an $\times n$ symmetric matrix. The matrix $A$ is called the matrix of the quadratic form (关于二次型的矩阵).

The simplest example of a nonzero quadratic form is $Q(\boldsymbol x)=\boldsymbol x^TI\boldsymbol x=\left\|\boldsymbol x\right\|^2$ . Examples 1 and 2 show the connection between any symmetric matrix $A$ and the quadratic form $\boldsymbol x^TA\boldsymbol x$ .

EXAMPLE 1
Let $\boldsymbol x =\begin{bmatrix}x_1\\x_2\end{bmatrix}$ . Compute $\boldsymbol x^TA\boldsymbol x$ for the following matrices:

在这里插入图片描述
SOLUTION
a.

b. There are two $- 2$ entries in $A$ . Watch how they enter the calculations.

在这里插入图片描述
The presence of $4x_1x_2$ in the quadratic form in Example 1(b) is due to the $- 2$ entries off the diagonal in the matrix $A$ . In contrast, the quadratic form associated with the diagonal matrix $A$ in Example 1(a) has no $x_1x_2$ $c r o s s$ - $p r o d u c t (交叉乘积)$ term.

EXAMPLE 2
For $\boldsymbol x$ in $R^3$ , let $Q(\boldsymbol x)= 5x_1^2+ 3x_2^2+ 2x_3^2- x_1x_2 + 8x_2x_3$ . Write this
quadratic form as $\boldsymbol x^TA\boldsymbol x$ .
SOLUTION
The coefficients of $\boldsymbol x_1^2,\boldsymbol x_2^2 , \boldsymbol x_3^2$ go on the diagonal of $A$ . To make $A$ symmetric, the coefficient of $x_ix_j$ for $i\neq j$ must be split evenly between the $(i, j)$ - and $(j, i)$ -entries in $A$ . It is readily checked that

在这里插入图片描述
In some cases, quadratic forms are easier to use when they have no cross-product terms—that is, when the matrix of the quadratic form is a diagonal matrix. Fortunately, the cross-product term can be eliminated by making a suitable change of variable.

Change of Variable in a Quadratic Form 二次型的变量代换

If $\boldsymbol x$ represents a variable vector in $R^n$ , then a change of variable is an equation of the form

在这里插入图片描述

where $P$ is an invertible matrix and $\boldsymbol y$ is a new variable vector in $R^n$ . Here $\boldsymbol y$ is the coordinate vector of $\boldsymbol x$ relative to the basis of $R^n$ determined by the columns of $P$ .

If the change of variable (1) is made in a quadratic form $\boldsymbol x^TA\boldsymbol x$ , then

在这里插入图片描述
and the new matrix of the quadratic form is $P^TAP$ . Since $A$ is symmetric, there is an orthogonal matrix $P$ such that $P^TAP$ is a diagonal matrix $D$ , and the quadratic form in (2) becomes $\boldsymbol y^TD\boldsymbol y$ .

在这里插入图片描述

主轴定理

The columns of $P$ in the theorem are called the principal axes (主轴) of the quadratic form $\boldsymbol x^TA\boldsymbol x$ . The vector $\boldsymbol y$ is the coordinate vector of $\boldsymbol x$ relative to the orthonormal basis of $R^n$ given by these principal axes.

A Geometric View of Principal Axes

Suppose $Q(\boldsymbol x)=\boldsymbol x^TA\boldsymbol x$ , where $A$ is an invertible $\times 2$ symmetric matrix, and let $c$ be a constant. It can be shown that the set of all $\boldsymbol x$ in $R^2$ that satisfy

在这里插入图片描述
either corresponds to an ellipse (or circle), a hyperbola(双曲线), two intersecting lines, or a single point, or contains no points at all. If $A$ is a diagonal matrix, the graph is in standard position, such as in Figure 2.

在这里插入图片描述
If $A$ is not a diagonal matrix, the graph of equation (3) is rotated out of standard position, as in Figure 3. Finding the principal axes (determined by the eigenvectors of $A$ ) amounts to finding a new coordinate system with respect to which the graph is in standard position.

在这里插入图片描述
The positive $y_1$ -axis in Figure 3(b) is in the direction of the first column of the matrix $P$ , and the positive $y_2$ -axis is in the direction of the second column of $P$ .

Classifying Quadratic Forms 二次型的分类

When $A$ is an $n\times n$ matrix, the quadratic form $Q(\boldsymbol x)=\boldsymbol x^TA\boldsymbol x$ is a real-valued function with domain $R^n$ . Figure 4 displays the graphs of four quadratic forms with domain $R^2$ . For each point $\boldsymbol x=(x_1, x_2)$ in the domain of a quadratic form $Q$ , the graph displays the point $x_1, x_2,z)$ where $Q(\boldsymbol x)$ . Notice that except at $\boldsymbol x=\boldsymbol 0$ , the values of $Q(\boldsymbol x)$ are all positive in Figure 4(a) and all negative in Figure 4(d). The horizontal cross-sections(水平截面) of the graphs are ellipses in Figures 4(a) and 4(d) and hyperbolas in Figure 4 $(c)$ .

在这里插入图片描述

The simple $\times 2$ examples in Figure 4 illustrate the following definitions

在这里插入图片描述

positive definite (正定的)
negative definite (负定的)
indefinite (不定的)

Also, $Q$ is said to be positive semidefinite (半正定的) if $Q(\boldsymbol x)\geq0$ for all $\boldsymbol x$ , and to be negative semidefinite if $Q(\boldsymbol x)\leq 0$ for all $\boldsymbol x$ .

Theorem 5 characterizes some quadratic forms in terms of eigenvalues.

在这里插入图片描述
PROOF
By the Principal Axes Theorem, there exists an rthogonal change of variable $\boldsymbol x = P\boldsymbol y$ such that

在这里插入图片描述
where $\lambda_1,...,\lambda_n$ are the eigenvalues of $A$ . Since $P$ is invertible, there is a one-to-one correspondence between all nonzero $\boldsymbol x$ and all nonzero $\boldsymbol y$ . Thus the values of $Q(\boldsymbol x)$ for $\boldsymbol x\neq \boldsymbol 0$ coincide with the values of the expression on the right side of (4), which is obviously controlled by the signs of the eigenvalues $\lambda_1,...,\lambda_n$ , in the three ways described in the theorem.

The classification of a quadratic form is often carried over to the matrix of the form. Thus a positive definite matrix(正定矩阵) $A$ is a symmetric matrix for which the quadratic form $\boldsymbol x^TA\boldsymbol x$ is positive definite. Other terms, such as positive semidefinite matrix, are defined analogously.

Another useful way to characterize quadratic forms, often used in multivariable calculus courses.
Let $A=\begin{bmatrix}a&b\\c&d\end{bmatrix}$ . If $\lambda_1$ and $\lambda_2$ are the eigenvalues of $A$ , then the characteristic polynomial is $det(A-\lambda I)=\lambda^2-(a+d)\lambda+ad-b^2$ . Thus $\lambda_1+\lambda_2=a +d$ and $\lambda_1\lambda_2= detA$ .
Then the following statements can be easily verified:
$a$ . $Q$ is positive definite if $d e t A > 0$ and $a > 0$ .
$b$ . $Q$ is negative definite if $d e t A > 0$ and $a < 0$ .
$c$ . $Q$ is indefinite if $d e t A < 0$ .
(The $\times 2$ case can be generalized to $n\times n$ matrices.)

EXERCISE 25
Show that if $B$ is $\times n$ , then $B^TB$ is positive semidefinite; and if $B$ is $n\times n$ and invertible, then $B^TB$ is positive definite.
SOLUTION
[Hint: $\boldsymbol x^TB^TB\boldsymbol x$ ]

EXERCISE 26
Show that if an $\times n$ matrix $A$ is positive definite, then there exists a positive definite matrix $B$ such that $A = B^TB$ .
SOLUTION
[Hint: Use the orthogonal decomposition]

EXERCISE 27
Let $A$ and $B$ be symmetric $n\times n$ matrices whose eigenvalues are all positive. Show that the eigenvalues of $A + B$ are all positive.
SOLUTION
[Hint: Consider quadratic forms.]

EXERCISE
If $A$ is $\times n$ , then the matrix $G = A^TA$ is called the $G r a m$ $m a t r i x$ of $A$ . Show that the Gram matrix of any matrix $A$ is positive semidefinite, with the same rank as $A$ .
SOLUTION
[Hint: Section 6.5 Theorem 14]

在这里插入图片描述

楚列斯基分解

EXERCISE
Prove that an $n\times n$ matrix $A$ is positive definite if and only if $A$ admits a Cholesky factorization, namely, $A= R^TR$ for some invertible upper triangular matrix $R$ whose diagonal entries are all positive.
SOLUTION
[Hint: Use a QR factorization and Exercise 26.]
If $A = R^TR$ , where R is invertible, then $\boldsymbol x^TA\boldsymbol x=(R\boldsymbol x)^T(R\boldsymbol x)\geq 0$ when $\boldsymbol x\neq\boldsymbol 0$ . Thus $A$ is positive definite.

Conversely, suppose that $A$ is positive definite. Then by Exercise 26, $A = B^TB$ for some positive definite matrix $B$ . Since the eigenvalues of $B$ are positive, 0 is not an eigenvalue and so $B$ is invertible. In particular, the columns of $B$ are linearly independent. By Theorem 12 in Section 6.4, $B = Q R$ for some $n\times n$ matrix $Q$ with orthonormal columns and some upper triangular matrix $R$ with positive elements on its diagonal. Since $Q$ is square, $Q^TQ = I$ . So

在这里插入图片描述

7.2 Quadratic forms (二次型)

目录

Quadratic forms

Change of Variable in a Quadratic Form 二次型的变量代换

A Geometric View of Principal Axes

Classifying Quadratic Forms 二次型的分类

猜你喜欢