QR Factorization

CPSC 406 – Computational Optimization

Overview

  • Orthogonality
  • QR properties
  • Solution of linear least-squares problems

Orthogonal vectors

For two vectors \(x\) and \(y\) in \(\Rn\):

  • recall the cosine identity \[ x\T y = \|x\|_2\|y\|_2\cos\theta \]

  • \(x\) and \(y\) in \(\mathbb{R}^n\) are orthogonal if \[ x\T y = 0 \quad (\cos\theta = 0) \]

  • \(x\) and \(y\) are orthonormal if \[ x\T y = 0, \quad x\T x = 1, \quad y\T y = 1 \]

  • a set of nonzero, pairwise orthogonal \(n\)-vectors \(\{q_1,\ldots,q_m\}\) is linearly independent

    • if \(m=n\) then it’s a basis for \(\Rn\)
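
A quick numerical check of these facts in Julia (a minimal sketch; the vectors are arbitrary illustrative examples):

using LinearAlgebra
x = [1.0, 2.0, 2.0]
y = [2.0, 1.0, -2.0]                      # chosen so that xᵀy = 2 + 2 - 4 = 0
@show dot(x, y)                           # 0.0  ⇒  x and y are orthogonal
θ = acos(dot(x, y) / (norm(x)*norm(y)))   # cosine identity  ⇒  θ = π/2
qx, qy = x/norm(x), y/norm(y)             # normalize to get an orthonormal pair
@show dot(qx, qx) dot(qy, qy)             # both 1.0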

Orthogonal matrices

An \(n\times r\) matrix \(Q\) is orthonormal if its columns are pairwise orthonormal: \[ Q = [q_1\ | \cdots |\ q_r], \qquad Q\T Q = \begin{bmatrix} q_1\T q_1 & q_2\T q_1 & \cdots & q_r\T q_1 \\ q_1\T q_2 & q_2\T q_2 & \cdots & q_r\T q_2 \\ \vdots & \vdots & \ddots & \vdots \\ q_1\T q_r & q_2\T q_r & \cdots & q_r\T q_r \end{bmatrix} = I_r \]

  • if \(r=n\) (i.e., \(Q\) is square) then \(Q\) is orthogonal \[ Q^{-1} = Q\T \quad\text{and}\quad Q\T Q = QQ\T = I_n \]

  • orthogonal transformations preserve lengths and angles \[ \|x\|_2 = \|Qx\|_2, \qquad x\T y = x\T Q\T Qy = (Qx)\T(Qy), \qquad \det(Q) = \pm 1 \]
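
These properties are easy to verify numerically; a sketch using the Q factor of a random matrix as a concrete orthogonal matrix:

using LinearAlgebra
n = 5
Q = Matrix(qr(randn(n, n)).Q)    # Q factor of a random square matrix is orthogonal
x, y = randn(n), randn(n)
@show norm(Q*x) ≈ norm(x)        # lengths preserved
@show (Q*x)'*(Q*y) ≈ x'y         # inner products (hence angles) preserved
@show abs(det(Q)) ≈ 1            # det(Q) = ±1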

Question: Orthogonal matrices

Let \(Q\) be an \(n\times n\) orthogonal matrix (so \(Q^T Q=I_n\)). Which of the following statements is always true for any vectors \(x,y\in\Rn\)?

  1. \(\|Qx\| = \|x\|\), but \(\|Qy\| \neq \|y\|\)
  2. \((Qx)\T(Qy) = x\T y\)
  3. \(\det(Q) = +1\)
  4. \(\|Qx\| \ne \|x\|\), and angles are distorted by \(Q\)

QR Factorization

Every \(m\times n\) matrix \(A\) with \(m\geq n\) and full column rank has a (full) QR factorization \[ A = QR = \begin{bmatrix} \hat Q & \bar Q \end{bmatrix} \begin{bmatrix} \hat R \\ 0 \end{bmatrix} = \hat Q \hat R, \qquad \hat Q \in \mathbb{R}^{m\times n}, \quad \bar Q \in \mathbb{R}^{m\times (m-n)}, \quad \hat R \in \mathbb{R}^{n\times n} \]

where

  1. \(Q\) is orthogonal (\(Q\T Q = Q Q\T = I_m\))
  2. \(\hat R\) is upper triangular (\(\hat R_{ij} = 0\) for \(i > j\))
  3. \(\range(\hat{Q})=\range(A)\)
  4. \(\range(\bar{Q})=\range(A)^\perp \equiv \Null(A\T)\)
using LinearAlgebra
Q, R = qr(A)   # Q is the (implicit) m×m orthogonal factor, R = R̂ is the n×n upper triangular block
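
To recover the blocks \(\hat Q\) and \(\bar Q\) explicitly, a sketch (assumes `A` is a given \(m\times n\) matrix with \(m\ge n\) and full column rank):

using LinearAlgebra
m, n = size(A)
F = qr(A)
Qhat  = Matrix(F.Q)                # thin m×n block: columns span range(A)
Qfull = F.Q * Matrix(I, m, m)      # full m×m orthogonal factor
Qbar  = Qfull[:, n+1:end]          # columns span range(A)⊥ = Null(Aᵀ)
@show norm(A - Qhat * F.R)         # ≈ 0, since A = Q̂R̂
@show norm(A' * Qbar)              # ≈ 0, columns of Q̄ are orthogonal to range(A)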

 

Reduced QR Factorization

For an \(m\times n\) matrix \(A\) with \(m\geq n\) and full column rank

\[ A = \hat Q \hat R, \qquad \hat Q \in \mathbb{R}^{m\times n} \text{ with orthonormal columns } (\hat Q\T \hat Q = I_n), \qquad \hat R \in \mathbb{R}^{n\times n} \text{ upper triangular and nonsingular} \]

taking column by column, with \(\hat Q=[q_1\ |\ \cdots\ |\ q_n]\) \[ \begin{align*} a_1 &= r_{11}q_1 \\ a_2 &= r_{12}q_1 + r_{22}q_2 \\ a_3 &= r_{13}q_1 + r_{23}q_2 + r_{33}q_3 \\ &\vdots \\ a_n &= r_{1n}q_1 + r_{2n}q_2 + \cdots + r_{nn}q_n \end{align*} \]
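
These column-by-column relations are exactly what classical Gram–Schmidt computes; a minimal sketch (for illustration only — library routines such as Julia's `qr` use Householder reflections, which are more numerically stable):

using LinearAlgebra
# classical Gram–Schmidt: A = Q̂R̂, built one column at a time
function cgs(A)
    m, n = size(A)
    Q, R = zeros(m, n), zeros(n, n)
    for j in 1:n
        v = A[:, j]
        for i in 1:j-1
            R[i, j] = Q[:, i]' * A[:, j]   # rᵢⱼ = qᵢᵀaⱼ
            v -= R[i, j] * Q[:, i]         # remove component along qᵢ
        end
        R[j, j] = norm(v)                  # rⱼⱼ = ‖v‖, nonzero if A has full rank
        Q[:, j] = v / R[j, j]              # normalize to get qⱼ
    end
    return Q, R
end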

Question: Columns of \(Q\)

Let \(A\) be an \(m \times n\) matrix (\(m \ge n\)) of full column rank, and suppose \(A = QR\) is its reduced QR factorization. The columns of \(Q\) form an orthonormal basis for which of the following subspaces?

  • A. The row space of \(A\)
  • B. The column space of \(A\)
  • C. The null space of \(A\)
  • D. The orthogonal complement of the column space of \(A\)

Nonsingular equations with QR

Given a nonsingular \(n\times n\) matrix \(A\), solve \[ A x = b \] via the QR factorization \(A = QR\), using \(Q\T Q = I\)

mathematically

\[ x = A^{-1}b = (QR)^{-1}b = R^{-1}Q^{-1}b = R^{-1}Q\T b \]

computationally

 

using LinearAlgebra
Q, R = qr(A)   # O(n^3) <-- dominant cost
y = Q'b        # O(n^2) <-- matrix-vector multiply 
x = R \ y      # O(n^2) <-- triangular solve

 

Geometry of Least-Squares via QR

\[ \min_{x\in\Rn}\ \|Ax-b\|^2, \qquad A = QR \]

\[ \begin{align} \|Ax-b\|^2 &= (Ax-b)\T(Ax-b) \\ &= (Ax-b)\T Q Q\T(Ax-b) \\ &= \|Q\T(Ax-b)\|^2 \\ &= \left\|\begin{bmatrix} \hat R \\ 0 \end{bmatrix}x - \begin{bmatrix} \hat Q\T \\ \bar Q\T \end{bmatrix}b\right\|^2 \\ &= \underbrace{\color{DarkRed}{\|\hat R x - \hat Q\T b\|^2}}_{\small (1)} + \underbrace{\color{Green}{\|\bar Q\T b\|^2}}_{\small (2)} \end{align} \]

where (1) can be driven to zero by solving \(\hat Rx=\hat Q\T b\), and (2) is independent of \(x\): it is the squared norm of the least-squares residual
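
A sketch verifying this split numerically (assumes `A` and `b` are a given full-rank \(m\times n\) matrix and \(m\)-vector):

using LinearAlgebra
m, n = size(A)
F = qr(A)
Qhat = Matrix(F.Q)                        # thin m×n factor
Qbar = (F.Q * Matrix(I, m, m))[:, n+1:end]
x = F.R \ (Qhat'b)                        # minimizer: R̂x = Q̂ᵀb, so term (1) = 0
@show norm(A*x - b) ≈ norm(Qbar'b)        # residual norm equals the constant ‖Q̄ᵀb‖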

Question: Least-Squares and Orthogonal Projections

Consider the least-squares problem \[ \min_{x \in \mathbb{R}^n} \|A x - b\|. \] Let \(A = QR\) and \(c = Q\T b\). Which of the following best describes the meaning of \(c\)?

  • A. \(c\) is the orthogonal projection of \(b\) in the original space.
  • B. \(c\) is the coordinate vector of the projection of \(b\) onto the column space of \(Q\).
  • C. \(c\) is orthogonal to every column of \(Q\).
  • D. \(c\) has no geometric interpretation for the least-squares problem.

Solving Least-Squares via QR

\[ \min_{x\in\Rn}\ \|Ax-b\|^2, \qquad A = QR \]

mathematically

\[ \begin{align} A\T A x &= A\T b \\ R\T Q\T Q R x &= R\T Q\T b \\ R\T R x &= R\T Q\T b && (Q\T Q = I) \\ Rx &= Q\T b && (R\T \text{ nonsingular}) \\ x &= R^{-1}Q\T b \end{align} \]

computationally

 

using LinearAlgebra
F = qr(A)                # O(n^3) <-- dominant cost
Q, R = Matrix(F.Q), F.R  # extract _thin_ Q, and R
y = Q'b                  # O(n^2) <-- matrix-vector multiply
x = R \ y                # O(n^2) <-- triangular solve

 

more numerically stable than solving the normal equations \(A\T Ax = A\T b\) directly, because forming \(A\T A\) squares the condition number: \(\kappa(A\T A) = \kappa(A)^2\)

Question: Geometric Interpretation of \(R\)

In the factorization \(A = QR\), with \(Q\) having orthonormal columns, what is the geometric interpretation of the triangular matrix \(R\)?

  1. \(R\) is an orthogonal matrix that preserves angles and lengths.
  2. \(R\) describes how the columns of \(A\) can be expressed as linear combinations of the columns of \(Q\), capturing their coordinates in the orthonormal basis.
  3. \(R\) is the null-space basis of \(A\).
  4. \(R\) is a diagonal matrix containing the singular values of \(A\).

Accuracy of QR vs Normal Equations

For \(\epsilon\) positive this matrix has full rank, but it is nearly rank-deficient: since \(\sin^2(\theta)+\cos^2(\theta)=1\), the first two columns nearly sum to the third \[ A = \begin{bmatrix} \sin^2(\theta_1) & \cos^2(\theta_1+\epsilon) & 1 \\ \sin^2(\theta_2) & \cos^2(\theta_2+\epsilon) & 1 \\ \vdots & \vdots & \vdots \\ \sin^2(\theta_m) & \cos^2(\theta_m+\epsilon) & 1 \\ \end{bmatrix} \]

using LinearAlgebra
θ = LinRange(0,3,400)
ε = 1e-7
A = @. [sin(θ)^2   cos(θ+ε)^2   θ^0]
xᵉ = [1., 2., 1.]
b  = A*xᵉ

xn = A'A \ A'b               # Compute xn via normal equations

Q, R = qr(A); Q = Matrix(Q)  # Compute xr via QR
xr = R \ (Q'b)

xb = A \ b                   # Compute xb via backslash

@show xn xr xb;
xn = [0.9195604391114242, 1.9195604388945233, 1.0804395609418005]
xr = [1.000000007591375, 2.000000007591374, 0.9999999924086256]
xb = [0.9999999998426099, 1.99999999984261, 1.0000000001573903]
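
The loss of accuracy in `xn` reflects the conditioning of the two formulations; a quick check (same `A` as above):

@show cond(A)      # condition number the QR approach works with
@show cond(A'A)    # ≈ cond(A)², the condition number the normal equations work with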