w_lstsqs

Next: w_lstsqs Up: w_lstsqs Previous: w_lstsqs

5. A geometrical interpretation

Recall that the distance between two vectors is the length of their difference. Therefore $\vert \epsilon \vert$ is the distance from x to b. Moreover, x $)= \epsilon _ 1 ^ 2 + \dots + \epsilon _ m ^ 2 = \vert \epsilon \vert ^ 2$ . For a nonnegative function, minimizing the square of the function is the same as minimizing the function. Therefore

$\fbox{\parbox{2.6in}{ These statements are equivalent: \vspace{2mm} \par $A \mb... ...x{\bf x}$is as close as possible to $\mbox{\bf b}$in $\mbox{\bf R} ^ m$. }}$

Moreover, the vectors of the form x, for all possible column vectors x, form a subspace of R. Therefore the problem is really one of finding the point in a subspace that is closest to a given point; the error vector $\epsilon$ is the vector from the given point to the point in the subspace. In Figure , the parallelogram represents part of the subspace . The error vector ``err'' goes from to x.

**Figure:** Geometrical interpretation
book/12dir/geometric.eps

To see why is a subspace, notice that is just the image of the linear transformation given by xx. Or, notice that x can be expanded as x $= A _ 1 x _ 1 + \dots + A _ n x _ n$ , where means the -th column of ; therefore is the column space of , i.e., the subspace spanned by the columns of . In Figure , the columns of are the sides of the parallelogram that touch the origin.

You might think that the way to get from b to the closest point of is to go along a normal to --i.e., a line perpendicular to --and that's exactly right, as indicated by the right angle in Figure . First let's discuss the computational method that produces the answer, and then then derivation of the method.

Next: w_lstsqs Up: w_lstsqs Previous: w_lstsqs

Kirby A. Baker 2003-05-13