This post contains reading notes on *Linear Algebra and Its Applications*.
Orthogonal sets
A set of vectors $\{\boldsymbol u_1,...,\boldsymbol u_p\}$ in $\mathbb R^n$ is said to be an orthogonal set if each pair of distinct vectors from the set is orthogonal, that is, if $\boldsymbol u_i \cdot \boldsymbol u_j = 0$ whenever $i \neq j$.
THEOREM 4
If $S = \{\boldsymbol u_1,...,\boldsymbol u_p\}$ is an orthogonal set of nonzero vectors in $\mathbb R^n$, then $S$ is linearly independent and hence is a basis for the subspace spanned by $S$.

PROOF
If $\boldsymbol 0 = c_1\boldsymbol u_1 + \cdots + c_p\boldsymbol u_p$ for some scalars $c_1,...,c_p$, then
$$0 = \boldsymbol 0 \cdot \boldsymbol u_1 = (c_1\boldsymbol u_1 + c_2\boldsymbol u_2 + \cdots + c_p\boldsymbol u_p)\cdot\boldsymbol u_1 = c_1(\boldsymbol u_1\cdot\boldsymbol u_1)$$
because $\boldsymbol u_1$ is orthogonal to $\boldsymbol u_2,...,\boldsymbol u_p$. Since $\boldsymbol u_1$ is nonzero, $\boldsymbol u_1 \cdot \boldsymbol u_1$ is not zero and so $c_1 = 0$. Similarly, $c_2,...,c_p$ must be zero. Thus $S$ is linearly independent.
The next theorem suggests why an orthogonal basis is much nicer than other bases: the weights in a linear combination can be computed easily.

THEOREM 5
Let $\{\boldsymbol u_1,...,\boldsymbol u_p\}$ be an orthogonal basis for a subspace $W$ of $\mathbb R^n$. For each $\boldsymbol y$ in $W$, the weights in the linear combination
$$\boldsymbol y = c_1\boldsymbol u_1 + \cdots + c_p\boldsymbol u_p$$
are given by
$$c_j = \frac{\boldsymbol y \cdot \boldsymbol u_j}{\boldsymbol u_j \cdot \boldsymbol u_j}\qquad (j = 1,...,p)$$
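As a quick numerical sanity check, the weights of a vector relative to an orthogonal basis can be read off with dot products alone, with no linear system to solve. The basis vectors and weights below are made-up example values:

```python
import numpy as np

# A made-up orthogonal basis for a subspace of R^3 (u1 . u2 = 0).
u1 = np.array([3.0, 1.0, 1.0])
u2 = np.array([-1.0, 2.0, 1.0])
assert np.isclose(u1 @ u2, 0.0)  # the set is orthogonal

# A vector constructed to lie in Span{u1, u2}, with known weights 2 and -3.
y = 2 * u1 - 3 * u2

# Each weight is (y . uj) / (uj . uj); cross terms vanish by orthogonality.
c1 = (y @ u1) / (u1 @ u1)
c2 = (y @ u2) / (u2 @ u2)
print(c1, c2)  # → 2.0 -3.0
```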
An Orthogonal Projection
Given a nonzero vector $\boldsymbol u$ in $\mathbb R^n$, consider the problem of decomposing a vector $\boldsymbol y$ in $\mathbb R^n$ into the sum of two vectors, one a multiple of $\boldsymbol u$ and the other orthogonal to $\boldsymbol u$. We wish to write
$$\boldsymbol y = \hat{\boldsymbol y} + \boldsymbol z \qquad (1)$$
where $\hat{\boldsymbol y} = \alpha\boldsymbol u$ for some scalar $\alpha$ and $\boldsymbol z$ is some vector orthogonal to $\boldsymbol u$. See Figure 2.
Given any scalar $\alpha$, let $\boldsymbol z = \boldsymbol y - \alpha\boldsymbol u$, so that (1) is satisfied. Then $\boldsymbol y - \hat{\boldsymbol y}$ is orthogonal to $\boldsymbol u$ if and only if
$$0 = (\boldsymbol y - \alpha\boldsymbol u)\cdot\boldsymbol u = \boldsymbol y\cdot\boldsymbol u - (\alpha\boldsymbol u)\cdot\boldsymbol u = \boldsymbol y\cdot\boldsymbol u - \alpha(\boldsymbol u\cdot\boldsymbol u)$$
That is, (1) is satisfied with $\boldsymbol z$ orthogonal to $\boldsymbol u$ if and only if $\alpha = \frac{\boldsymbol y\cdot\boldsymbol u}{\boldsymbol u\cdot\boldsymbol u}$ and $\hat{\boldsymbol y} = \frac{\boldsymbol y\cdot\boldsymbol u}{\boldsymbol u\cdot\boldsymbol u}\boldsymbol u$. The vector $\hat{\boldsymbol y}$ is called the orthogonal projection of $\boldsymbol y$ onto $\boldsymbol u$, and the vector $\boldsymbol z$ is called the component of $\boldsymbol y$ orthogonal to $\boldsymbol u$.
This projection is determined by the subspace $L$ spanned by $\boldsymbol u$ (the line through $\boldsymbol u$ and $\boldsymbol 0$). Sometimes $\hat{\boldsymbol y}$ is denoted by $\mathrm{proj}_L\,\boldsymbol y$ and is called the orthogonal projection of $\boldsymbol y$ onto $L$. That is,
$$\hat{\boldsymbol y} = \mathrm{proj}_L\,\boldsymbol y = \frac{\boldsymbol y\cdot\boldsymbol u}{\boldsymbol u\cdot\boldsymbol u}\boldsymbol u$$
The orthogonal projection also gives the distance from a point to a line: the distance from $\boldsymbol y$ to $L$ is $\left\|\boldsymbol y - \hat{\boldsymbol y}\right\|$.
$\boldsymbol x \mapsto \mathrm{proj}_L\,\boldsymbol x$ is a linear transformation.
A Geometric Interpretation of Theorem 5
The formula for the orthogonal projection $\hat{\boldsymbol y}$ above has the same appearance as each of the terms in Theorem 5. Thus Theorem 5 decomposes each $\boldsymbol y$ in $\mathrm{Span}\{\boldsymbol u_1,...,\boldsymbol u_p\}$ into the sum of $p$ projections onto one-dimensional subspaces that are mutually orthogonal.
Orthonormal Sets
A set $\{\boldsymbol u_1,...,\boldsymbol u_p\}$ is an orthonormal set if it is an orthogonal set of unit vectors. If $W$ is the subspace spanned by such a set, then $\{\boldsymbol u_1,...,\boldsymbol u_p\}$ is an orthonormal basis for $W$, since the set is automatically linearly independent.
The simplest example of an orthonormal set is the standard basis $\{\boldsymbol e_1,...,\boldsymbol e_n\}$ for $\mathbb R^n$. Any nonempty subset of $\{\boldsymbol e_1,...,\boldsymbol e_n\}$ is orthonormal, too.
When the vectors in an orthogonal set of nonzero vectors are normalized to have unit length, the new vectors will still be orthogonal, and hence the new set will be an orthonormal set.
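A small sketch of this normalization step, using a made-up orthogonal set in $\mathbb R^3$:

```python
import numpy as np

# A made-up orthogonal set of nonzero vectors in R^3
# (each pair of distinct vectors has dot product 0).
vs = [np.array([3.0, 1.0, 1.0]),
      np.array([-1.0, 2.0, 1.0]),
      np.array([-1.0, -4.0, 7.0])]

# Dividing each vector by its length preserves pairwise orthogonality,
# so the normalized set is orthonormal.
us = [v / np.linalg.norm(v) for v in vs]

for i in range(3):
    for j in range(3):
        expected = 1.0 if i == j else 0.0  # unit length and orthogonality
        assert np.isclose(us[i] @ us[j], expected)
```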
Matrices whose columns form an orthonormal set are important in applications and in computer algorithms for matrix computations. Their main properties are given in Theorems 6 and 7.

THEOREM 6
An $m \times n$ matrix $U$ has orthonormal columns if and only if $U^TU = I$.

THEOREM 7
Let $U$ be an $m \times n$ matrix with orthonormal columns, and let $\boldsymbol x$ and $\boldsymbol y$ be in $\mathbb R^n$. Then
(a) $\left\|U\boldsymbol x\right\| = \left\|\boldsymbol x\right\|$
(b) $(U\boldsymbol x)\cdot(U\boldsymbol y) = \boldsymbol x\cdot\boldsymbol y$
(c) $(U\boldsymbol x)\cdot(U\boldsymbol y) = 0$ if and only if $\boldsymbol x\cdot\boldsymbol y = 0$
Properties (a) and (c) say that the linear mapping $\boldsymbol x \mapsto U\boldsymbol x$ preserves lengths and orthogonality.
Theorems 6 and 7 are particularly useful when applied to square matrices. An orthogonal matrix is a square invertible matrix $U$ such that $U^{-1} = U^T$. It is easy to see that any square matrix with orthonormal columns is an orthogonal matrix. Surprisingly, such a matrix must have orthonormal rows, too, since $U^T$ is itself orthogonal: $(U^T)^{-1} = (U^T)^T$.
Also, $\det U = \pm 1$.
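These properties of an orthogonal matrix can be checked on a standard example, a rotation matrix (the angle below is arbitrary):

```python
import numpy as np

# A 2x2 rotation matrix: the textbook example of an orthogonal matrix.
theta = 0.3  # arbitrary angle
U = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])

# Orthonormal columns: U^T U = I, i.e. U^{-1} = U^T ...
assert np.allclose(U.T @ U, np.eye(2))
# ... and the rows are orthonormal too: U U^T = I.
assert np.allclose(U @ U.T, np.eye(2))
# det U = +1 or -1 (here +1, since rotations preserve orientation).
assert np.isclose(np.linalg.det(U), 1.0)
```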
EXERCISES
Show that if an $n \times n$ matrix $U$ satisfies $(U\boldsymbol x)\cdot(U\boldsymbol y) = \boldsymbol x\cdot\boldsymbol y$ for all $\boldsymbol x$ and $\boldsymbol y$ in $\mathbb R^n$, then $U$ is an orthogonal matrix.
SOLUTION
$U\boldsymbol e_j$ is the $j$-th column of $U$. Since $\left\|U\boldsymbol e_j\right\|^2 = (U\boldsymbol e_j)\cdot(U\boldsymbol e_j) = \boldsymbol e_j\cdot\boldsymbol e_j = 1$, the columns of $U$ are unit vectors. For $j \neq k$, $(U\boldsymbol e_j)\cdot(U\boldsymbol e_k) = \boldsymbol e_j\cdot\boldsymbol e_k = 0$. Thus each pair of distinct columns of $U$ is orthogonal, so the columns of $U$ are orthonormal and $U$ is an orthogonal matrix.
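The column-by-column argument in the solution can be traced numerically. The matrix below is a made-up example: a random orthogonal $U$ obtained from a QR factorization, which preserves dot products and so satisfies the exercise's hypothesis:

```python
import numpy as np

rng = np.random.default_rng(0)
# A random orthogonal matrix from a QR factorization (example choice of U
# satisfying (Ux).(Uy) = x.y for all x, y).
U, _ = np.linalg.qr(rng.standard_normal((4, 4)))

# U e_j is the j-th column of U, so preserving dot products on the
# standard basis forces the columns of U to be orthonormal.
I = np.eye(4)
for j in range(4):
    for k in range(4):
        ej, ek = I[:, j], I[:, k]
        assert np.isclose((U @ ej) @ (U @ ek), ej @ ek)

# Orthonormal columns of a square matrix: U^T U = I, i.e. U is orthogonal.
assert np.allclose(U.T @ U, I)
```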