Gram - Schmidt Orthogonalization Process

Orthogonal vectors are by definition independent, but independent vectors are not necessarily orthogonal . A set of independent vectors, however, can be rendered orthogonal. In this section we focus on orhonormal bases of a vector space V and examine a procedure whereby we can transform or convert any basis B of V into an orthonormal basis.

Descriviamo ora in dettaglio un metodo, noto come procedimento di ortonormalizzazione di Gram–Schmidt, che permette di costruire una base ortonormale partendo da una base qualsiasi di V. Consideriamo quindi uno spazio vettoriale V di dimensione n sul campo dei numeri reali, dotato di una forma bilineare simmetrica definita positiva ⟨ ⋅ ⟩. Sia v = {v₁, . . . , v_n} una base qualunque di V . Poniamo w₁ = v₁ e cerchiamo un vettore w₂, ortogonale a w₁, della forma w₂ = α₁w₁ + v₂ (notiamo che, in questo modo, il sottospazio vettoriale generato da w₁ e w₂ coincide con quello generato da v₁ e v₂). La condizione di ortogonalità tra w₁ e w₂ si esprime ponendo ⟨w₁, w₂⟩ = 0. Si ha pertanto

⟨w₁, w₂⟩ = ⟨w₁, α₁w₁ + v₂⟩ = α₁ ⟨w₁, w₁⟩ + ⟨w₁, w₂⟩ = 0

da cui si ottiene

α₁ = −⟨w₁, w₂⟩ / ⟨w₁, w₁⟩

Il vettore cercato è quindi

w₂ = v₂ −⟨w₁, w₂⟩ / ⟨w₁, w₂⟩w₁

Cerchiamo ora un vettore w₃ ortogonale al sottospazio generato da w₁ e w₂, della forma w₃ = α₁w₁ + α₂w₂ + v₃. Imponendo che w₃ sia ortogonale a w₁ e w₂, si ottengono le equazioni⟨w₁, w₃⟩ = 0 e ⟨w₂, w₃⟩ = 0. Sviluppando i calcoli, si trova:

⟨w₁, w₃⟩ = ⟨w₁, α₁w₁ + α₂w₂ + v₃⟩
= α₁ ⟨w₁, w₁⟩ + α₂⟨w₁, w₂⟩ + ⟨w₁, w₃⟩
= α₁ ⟨w₁, w₁⟩ + ⟨w₁, w₃⟩ = 0

⟨w₂, w₃⟩ = ⟨w₂, α₁w₁ + α₂w₂ + v₃⟩
= α₁ ⟨w₁, w₁⟩ + α₂⟨w₂, w₂⟩ + ⟨w₂, w₃⟩
= α₂ ⟨w₂, w₂⟩ + ⟨w₂, w₃⟩ = 0

da cui si ottiene

α₁ = −⟨w₁, w₃⟩ / ⟨w₁, w₁⟩
α₂ = −⟨w₂, w₃⟩ / ⟨w₂, w₃⟩

Il vettore cercato è quindi

w₃ = v₃ − ⟨w₁, w₃⟩w₁ / ⟨w₁, w₁⟩− ⟨w₂, w₃⟩ w₂/ ⟨w₂, w₂⟩

Constructing an Orthogonal basis for ℝ²

The transformation of a basis B = {u₁, u₂} for ℝ² into an orthogonal basis B' = {v₁, v₂} consists of two steps. See Fig.1. The first step is simple, we merely choose one of the vectors in B, say, u₁ and rename it v₁. Next as shown in Fig.1b, we project the remaining vector u₂ in B onto the vector v₁ and define a second vector to be v₂ = u₂ − proj_{v₁ u₂.}

Gram Schmidt process — The orthogonal vectors v₁ and v₂ are defined in terms of u₁ and u₂.

As seen in Fig.1(c), the vectors

v₁ = u₁
v₂ = u₂ − proj_{v₁ u₂}

Example 1. Consideriamo, in ℝ³, i due vettori indipendenti:

v₁ = (1,0,−1) v₂ = (0,1,−1)

detto V il sottospazio vettoriale di ℝ³, generato da v₁, v₂ (non è altro che il piano passante per l'origine di equazione x + y + z = 0), proponiamoci di costruire una base ortonormale di V, ossia di ortonormalizzare la base v₁, v₂. I passi sono i seguenti

Normalizziamo v₁:

u₁ = v₁/|v₁| = (1/√2, 0, −1/√2)
Calcoliamo la componente di v₂ nella direzione di u₁, data da:

(v₂ ⋅ u₁) u₁ = 1/√2(1/√2,0,−1/√2) = (1/2, 0,1/2)
Sottraiamo a v₂ la sua componente nella direzione di u₁, ottenendo così un vettore ortogonale a u₁ e che insieme ad esso genera V:

v₂ − (v₂ ⋅ u₁) u₁ = (0,1,−1) −(1/2, 1, −1/2) = (−1/2, 1, −1/2)
Normalizziamo quest'ultimo vettore,

u₂ = (−1/2, 1, −1/2)/sqrt(1/4 + 1 + 1/4) = sqrt(2/3) (−1/2,1,−1/2) = (−1/6,sqrt(2/3),−1/sqrt(6))
Scegliamo un vettori di ℝ³ indipendente da u₁,u₂, per esempio

v₃ = (0,0,1)
Sottraiamo a u₃ la sua proiezione su V

u₃' = v₃ − {(u₃ ⋅ u₁) u₁ + (u₃ ⋅ u₂)u₂} = (0,0,1) −{ −1/2 (1/√2,0,−(1/√2) − 1/√6(1/√6, sqrt(2/3), −1/√6) } = (1/3,1/3,1/2)
Normalizziamo u₃' ottenendo

u₃ = (1/3,1/3,1/3) /sqrt(1/3) = ( (1/√3,1/√3,1/√3) ■

(Gram-Schmidt process). Let {v₁, ..., v_n} ⊂ 𝔽ⁿ be linearly independent. Construct {v₁, ..., v_p} as follows:

\begin{aligned} z_{1} & = v_{1} \\ z_{k} & = v_{k} - \frac{⟨ v_{k}, z_{k - 1} ⟩}{⟨ z_{k - 1}, z_{k - 1} z_{k - 1} ⟩} - \dots - \frac{⟨ v_{k}, z_{1} ⟩}{⟨ z_{1}, z_{1} ⟩} z_{1}, k = 2, \dots, p \end{aligned}

Then for k = 1, ..., p, we have that {z₁, ..., z_p} is an orthogonal linearly independent set satisfying

Span {v₁, ..., v_k} = Span {z₁, ..., z_k} = Span {z₁/||z₁||, ..., z_p/||z_p||}

The set {z₁/||z₁||, ..., z_p/||z_p||} is an orthonormal set.

Proof. We prove this by induction. Clearly when p = 1, then z₁ = v₁, and the statements are trivial.

Next suppose that the theorem has been proven for sets with up to p - 1 vectors. Next, we are given {v₁, ..., v_p} and we construct {z₁, ..., z_p}. Notice that {z₁, ..., z_p-1} are obtained by applying the Gram-Schmidt process to {v₁, ..., v_p-1}, and thus by the induction assumption {z₁, ..., z_p-1} is a linearly independent set and holds for k = 1, ..., p -1. Let z_p be defined via the GS orthogonalization process. Observe that for k ≤ p -1.

⟨ z_{p}, z_{k} ⟩ = ⟨ v_{p}, z_{k} ⟩ - ⟨ \sum_{j = 1}^{p - 1} \frac{⟨ v_{p}, z_{j} ⟩}{⟨ z_{j}, z_{j} ⟩} z_{j}, z_{k} ⟩ = ⟨ v_{p}, z_{k} ⟩ - \frac{⟨ v_{p}, z_{k} ⟩}{⟨ z_{k}, z_{k} ⟩} ⟨ z_{k}, z_{k} ⟩ = 0

where we used that ⟨z_j, z_k⟩ = 0 for j ≠ k, 1 ≤, k ≤ p - 1. This proves the orthogonality. Also, we see that

z_p ∈ Span {z_p} + Span {z₁, ..., z_p-1} = Span {v_p} + Span {v₁, ..., v_p-1} + Span{v₁, ..., v_p}

Next, since

v_{p} = z_{p} + \sum_{j = 1}^{p - 1} \frac{⟨ v_{p}, z_{j} ⟩}{z_{j}, z_{j}} z_{j}

We have that

v_p ∈ Span {z₁, ..., z_p}

Combining these observations with the induction assumption yields

Span {v₁, ..., v_p} = Span {z₁, ..., z_p}

Since {v₁, ..., v_p} is linearly independent, they span a p dimensional space. Then {z₁, ..., z_p} also span a p dimensional space (the same one), and thus this set of vectors is also linearly independent. Finally, dividing each z_i by its length does not change the span, and makes the vectors orthonormal. □

«Epoxides Index Sulfides»

Gram - Schmidt Orthogonalization Process

Constructing an Orthogonal basis for ℝ2

Constructing an Orthogonal basis for ℝ²