If you see this, something is wrong

Collapse and expand sections

To get acquainted with the document, the best thing to do is to select the "Collapse all sections" item from the "View" menu. This will leave visible only the titles of the top-level sections.

Clicking on a section title toggles the visibility of the section content. If you have collapsed all of the sections, this will let you discover the document progressively, from the top-level sections to the lower-level ones.

Cross-references and related material

Generally speaking, anything that is blue is clickable.

Clicking on a reference link (like an equation number, for instance) will display the reference as close as possible, without breaking the layout. Clicking on the displayed content or on the reference link hides the content. This is recursive: if the content includes a reference, clicking on it will have the same effect. These "links" are not necessarily numbers, as it is possible in LaTeX2Web to use full text for a reference.

Clicking on a bibliographical reference (i.e., a number within brackets) will display the reference.

Speech bubbles indicate a footnote. Click on the bubble to reveal the footnote (there is no page in a web document, so footnotes are placed inside the text flow). Acronyms work the same way as footnotes, except that you have the acronym instead of the speech bubble.

Discussions

By default, discussions are open in a document. Click on the discussion button below to reveal the discussion thread. However, you must be registered to participate in the discussion.

If a thread has been initialized, you can reply to it. Any modification to any comment, or a reply to it, in the discussion is signified by email to the owner of the document and to the author of the comment.

Publications

The blue button below that says "table of contents" is your tool to navigate in a publication.

The left arrow brings you to the previous document in the publication, and the right one brings you to the next. Both cycle over the publication list.

The middle button that says "table of contents" reveals the publication table of contents. This table is hierarchical structured. It has sections, and sections can be collapsed or expanded. If you are a registered user, you can save the layout of the table of contents.

Publication content

Around the Solution of a Puzzle

Solve Systems of 2 equations of 2 variables

The Canonical Vector Plane

The Dot Product and the Norm in the Euclidean Plane

Angles in Radians and Trigonometric Functions

The Polar Coordinates and the Rotations

The Complex Numbers as Vectors

The Linear Mappings and their Matrices

Bases in the Plane and Change of Basis

Symmetric, Skew Symmetric and Unit Matrices

Eigenvalues and Eigenvectors

Diagonalisation in the Field of Complex Numbers

Linear Algebra in the Euclidean Plane Section 2 Test

Linear Algebra in the Euclidean Plane Section 3 Test

Linear Algebra in the Euclidean Plane Section 4 Test

Linear Algebra in the Euclidean Plane Section 5 Test

Linear Algebra in the Euclidean Plane Section 6 Test

Linear Algebra in the Euclidean Plane Section 7 Test

Linear Algebra in the Euclidean Plane Section 8 Test

Linear Algebra in the Euclidean Plane Section 9 Test

Linear Algebra in the Euclidean Plane Section 10 Test

Linear Algebra in the Euclidean Plane Section 11 Test

Linear Algebra in the Euclidean Plane Section 12 Test

Linear Algebra in the Euclidean Plane Section 13 Test

Linear Algebra in the Euclidean Plane Final Assessment

First published on Saturday, Jul 6, 2024 and last modified on Thursday, Apr 10, 2025

Bases in the Plane and Change of Basis

Fabienne Chaplais Mathedu SAS

1 Introduction

We shall now discover and use important ordered sets if vectors, the ones that constitute a basis of the plane.

2 The Bases in \( \mathbb{P}\)

A basis in \( \mathbb{P}\) is a generalisation of the canonical basis \( (\overrightarrow{i},\overrightarrow{j})\) .

We may define, as for the canonical basis, the cartesian coordinates of any vector in that basis.

2.1 The Canonical Basis

Consider the canonical basis \( (\overrightarrow{i},\overrightarrow{j})\) in the vector plane \( \mathbb{P}\) .

Then it is indeed a basis of , because it is an ordered set:

that is linearly independent:
\( \forall\;(a,b)\in\mathbb{R}^2,\;\;a\overrightarrow{i}+b\overrightarrow{j}=\overrightarrow{0}\Rightarrow a=b=0\) ,
and that is spanning for the vector plane :
\( \forall\;\overrightarrow{u}\in\mathbb{P},\;\;\exists\; (a,b)\in\mathbb{R}^2|\; \overrightarrow{u}=a\overrightarrow{i}+b\overrightarrow{j}\)

In the last item, \( a\) and \( b\) are the coordinates of \( \overrightarrow{u}\) in the canonical basis.

2.2 Linearly Independent and Linearly Dependent Ordered Sets

2.2.1 Definitions

Definition 1

Assume that \( (\overrightarrow{u}_1,\overrightarrow{u}_2,…,\overrightarrow{u}_n)\in\mathbb{P}^n\) are vectors, and consider the ordered set \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2,…,\overrightarrow{u}_n)\) .

Then the following definitions are given.

\( S\) is linearly independent if and only if the only linear combination of the elements of \( S\) that is equal to the null vector is the one wirh all coefficients equal to\( 0\) :
\( \forall\;(a_1,a_2…,a_n)\in\mathbb{R}^n\;\; a_1\overrightarrow{u}_1+a_2\overrightarrow{u}_2+…+ a_n\overrightarrow{u}_n=\overrightarrow{0} \Rightarrow a_1=a_2=…=a_n=0\)
And \( S\) is linearly dependent if and only if it is not linearly independent.

2.2.2 First Example: An Ordered Set Made of One Non Zero Vector

Assume that \( \overrightarrow{u}\in\mathbb{P}^{*}\) is a non zero vector.

Then, as \( \overrightarrow{u}=\overrightarrow{0}\) implies \( a=0\) , the ordered set \( S=(\overrightarrow{u})\) is linearly independent.

2.2.3 Second Example: An Ordered Set Made of Two Non Aligned Non Zero Vectors

Assume that \( (\overrightarrow{u}_1,\overrightarrow{u}_2)\in(\mathbb{P}^*)^2\) are non aligned non zero vectors.

Then, assume that \( a_1\overrightarrow{u}_1+a_2\overrightarrow{u}_2=\overrightarrow{0}\) for some \( (a_1,a_2)\) that are not zero together.

Let’s say that \( a_{2}\ne 0\) , so that \( \overrightarrow{u}_2=\frac{a_{1}}{a_{2}}\overrightarrow{u}_1\) .

That would imply that \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) are aligned, on the contrary of the hypothesis.

Consequently, the ordered set \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is linearly independent.

2.2.4 Third Example: An Ordered Set Made of Two Aligned Non Zero Vectors

Assume that \( (\overrightarrow{u}_1,\overrightarrow{u}_2)\in(\mathbb{P}^*)^2\) are aligned non zero vectors.

Consider the coefficient \( k\in\mathbb{R}^*\) such that \( \overrightarrow{u}_2=k\overrightarrow{u}_1\) .

Then \( a_1\overrightarrow{u}_1+a_2\overrightarrow{u}_2=\overrightarrow{0}\) for \( a_{1}=k\) and \( a_{2}=-1\ne 0\) .

Consequently, the ordered set \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is linearly dependent.

2.2.5 Fourth Example: An Ordered Set Made of Three not all Aligned Non Zero Vectors

Assume that \( (\overrightarrow{u}_1,\overrightarrow{u}_2,\overrightarrow{u}_3)\in(\mathbb{P}^*)^3\) are non zero vectors such that, say, \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) are not aligned.

Consider the cartesian coordinates of \( \overrightarrow{u}_1\) , \( \overrightarrow{u}_2\) and \( \overrightarrow{u}_3\) , \( \overrightarrow{u}_1=\begin{bmatrix} x_{1}\\y_{1} \end{bmatrix}\) , \( \overrightarrow{u}_2=\begin{bmatrix} x_{2}\\y_{2} \end{bmatrix}\) and \( \overrightarrow{u}_3=\begin{bmatrix} x_{3}\\y_{3} \end{bmatrix}\) .

Then the sytem of linear equations in \( (a_{2},a_{3})\) :

\[ \left\{\begin{matrix}x_{2}a_{2}&+&x_{3}a_{3}&=&x_{1}\\y_{2}a_{2}&+&y_{3}a_{3}&=&y_{1}\end{matrix}\right. \]

has exactly one solution \( (a_{2},a_{3})\) .

This is because \( (\overrightarrow{u}_1,\overrightarrow{u}_2)\) is a linearly independant ordered set, so that the system of linear equations in \( (a_{2},a_{3})\) :

\[ \left\{\begin{matrix}x_{2}a_{2}&+&x_{3}a_{3}&=&0\\y_{2}a_{2}&+&y_{3}a_{3}&=&0\end{matrix}\right. \]

has exactly one solution \( (0,0)\), and thus its determinant \( x_2y_3-x_3y_2\) is non zero.

Consequently, the linear combination of the vectors in \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2,\overrightarrow{u}_3)\) with a non zero coefficient \( 1\) is equal to

\( 1\times\overrightarrow{u}_1+(-a_{2})\overrightarrow{u}_2+(-a_{3})\overrightarrow{u}_3 =\overrightarrow{0}\) .

Hence \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2,\overrightarrow{u}_3)\) is linearly dependent.

2.2.6 Characteristic Properties of the Linearly Dependent and Linearly Independent Ordered Sets

Theorem 1

Then the following assertions hold.

\( S\) is linearly independent if and only if it is either made of a unique non zero vector, or of two non zero vectors that are not aligned.
And \( S\) is linearly dependent if and only if it is either the singleton with the null vector, or at least one of the vectors in \( S\) is a linear combination of the other vectors in \( S\) .

Proof

Let’s consider the following cases for \( S\) .

\( S\) is made of a unique vector \( \overrightarrow{u}_1\) and

\( \overrightarrow{u}_1=\overrightarrow{0}\) ,
or \( \overrightarrow{u}_1\ne\overrightarrow{0}\) .

\( S\) is made of two vectors \( (\overrightarrow{u}_1,\overrightarrow{u}_2)\) and

at least one of these vectors is the null vector,
or they are both non zero and,

they are aligned,
or they are non aligned.

\( S\) contains at lest 3 vectors \( (\overrightarrow{u}_1\) , \( (\overrightarrow{u}_2\) and \( (\overrightarrow{u}_3\) .

We have to prove that:

in the cases or a non zero unique vector and of non aligned two vectors, \( S\) is linearly independent and that no vector in \( S\) is a linear combination of the other vectors in \( S\) ,
and in all the other cases, \( S\) is linearly dependent and it is either the singleton with the null vector, or at lest one of the vectors in \( S\) is a linear combination of the other vectors in \( S\) .

Assume that \( S\) is made of a unique vector \( \overrightarrow{u}_1\) .

Assume that \( \overrightarrow{u}_1=\overrightarrow{0}\) ,
Then the linear combination of the vectors in \( S=(\overrightarrow{0})\) with a non zero coefficient \( 1\) is equal to \( 1\times\overrightarrow{0}=\overrightarrow{0}\) .
Hence \( S=(\overrightarrow{0})\) is linearly dependent and it is the singleton with the null vector.
Assume that or \( \overrightarrow{u}_1\ne\overrightarrow{0}\) .
Then, if \( a_{1}\overrightarrow{u}_1=\overrightarrow{0}\) , then \( a_{1}=0\) .
Hence \( S=(\overrightarrow{u}_{1})\) is linearly independent and it is a singleton with a non zero vector.

Assume that \( S\) is made of two vectors \( (\overrightarrow{u}_1,\overrightarrow{u}_2)\) and

Assume that at least one of these vectors is the null vector,
With no loss of generallity, we may assume that \( \overrightarrow{u}_1\ne\overrightarrow{0}\) .
Then the linear combination of the vectors in \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) with a non zero coefficient \( 1\) is equal to \( 1\times\overrightarrow{u}_{1}+0\times\overrightarrow{u}_{2}=\overrightarrow{0}\) .
Hence \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is linearly dependent.
Moreover, \( \overrightarrow{u}_{1}=0\times\overrightarrow{u}_{2}\) , so that \( \overrightarrow{u}_{1}\) is a linear combination of \( \overrightarrow{u}_{2}\) .
Assume that they are both non zero.

Assume that \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) are aligned,
Consider the real number \( k\) such that \( \overrightarrow{u}_2=k\overrightarrow{u}_1\) .
Then the linear combination of the vectors in \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) with a non zero coefficient \( -1\) is equal to \( k\overrightarrow{u}_{1}+(-1)\times\overrightarrow{u}_{2}=\overrightarrow{0}\) .
Hence \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is linearly dependent.
Moreover, \( \overrightarrow{u}_2=k\overrightarrow{u}_1\) , so that \( \overrightarrow{u}_{2}\) is a linear combination of \( \overrightarrow{u}_{1}\) .
Assume that \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) are non aligned.
Then \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is linearly independent because, if there would exist real numbers \( a_{1}\) and \( a_{2}\) with, say, \( a_1\ne 0\) , such that \( a_{1}\overrightarrow{u}_1+a_{2}\overrightarrow{u}_2=\overrightarrow{0}\) , then we would have \( \overrightarrow{u}_2=\frac{a_{1}}{a_{2}}\overrightarrow{u}_1\) , and \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) would be aligned.
Moreover, \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is made of two vectors that are nont aligned.

Assume that \( S\) contains at least 3 vectors \( \overrightarrow{u}_1\) , \( \overrightarrow{u}_2\) and \( \overrightarrow{u}_3\) .
Consider the cartesian coordinates of \( \overrightarrow{u}_1\) , \( \overrightarrow{u}_2\) and \( \overrightarrow{u}_3\) , \( \overrightarrow{u}_1=\begin{bmatrix} x_{1}\\y_{1} \end{bmatrix}\) , \( \overrightarrow{u}_2=\begin{bmatrix} x_{2}\\y_{2} \end{bmatrix}\) and \( \overrightarrow{u}_3=\begin{bmatrix} x_{3}\\y_{3} \end{bmatrix}\) .
Then the sytem of linear equations in \( (a_{2},a_{3})\) :

\[ \left\{ \begin{matrix} X_{2}a_{2}&+&x_{3}a_{3}&=&x_{1}\newline y_{2}a_{2}&+&y_{3}a_{3}&=&y_{1} \end{matrix} \right. \]

has at least one solution \( (a_{2},a_{3})\) .
Consequently, the linear combination of the vectors in \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2,…,\overrightarrow{u}_n)\) with a non zero coefficient \( 1\) is equal to
\( 1\times\overrightarrow{u}_1+(-a_{2})\overrightarrow{u}_2+(-a_{3})\overrightarrow{u}_3 +0\times\overrightarrow{u}_4+…+0\times\overrightarrow{u}_n=\overrightarrow{0}\) .
Hence \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2,…,\overrightarrow{u}_n)\) is linearly dependent.
Moreover, \( \overrightarrow{u}_1=a_{2}\overrightarrow{u}_2+a_{3}\overrightarrow{u}_3 +0\times\overrightarrow{u}_4+…+0\times\overrightarrow{u}_n\) ,so that \( \overrightarrow{u}_{1}\) is a linear combination of \( \overrightarrow{u}_2,…,\overrightarrow{u}_n\)

2.3 Spanning and Non Spanning Ordered Sets

2.3.1 Definitions

Definition 2

Then the following definitions are given.

\( S\) is a spanning ordered set if and only if any vector in the plane is a linear combination of the vectors in \( S\) :
\( \forall\;\overrightarrow{v}\in\mathbb{P},\; \exists (a_1,a_2…,a_n)\in\mathbb{R}^n\;|\; \overrightarrow{v}=a_1\overrightarrow{u}_1+a_2\overrightarrow{u}_2+…+a_n\overrightarrow{u}_n\)
And \( S\) is a non spanning ordered set otherwise.

2.3.2 First Example: An Ordered Set Made of One Non Zero Vector

Assume that \( \overrightarrow{u}\in\mathbb{P}^{*}\) is a non zero vector.

Then, as none of the non zero vectors that are not aligned with \( \overrightarrow{u}\) is a scalar multiple of \( \overrightarrow{u}\) , the ordered set \( S=(\overrightarrow{u})\) is a non spanning ordered set.

2.3.3 Second Example: An Ordered Set Made of Two Non Aligned Non Zero Vectors

Assume that \( (\overrightarrow{u}_1,\overrightarrow{u}_2)\in(\mathbb{P}^*)^2\) are non aligned non zero vectors.

Assume that \( \overrightarrow{v}\in\mathbb{P}\) is any vector in the plane.

Consider the cartesian coordinates of \( \overrightarrow{u}_1\) , \( \overrightarrow{u}_2\) and \( \overrightarrow{v}\) , \( \overrightarrow{u}_1=\begin{bmatrix} x_{1}\\y_{1} \end{bmatrix}\) , \( \overrightarrow{u}_2=\begin{bmatrix} x_{2}\\y_{2} \end{bmatrix}\) and \( \overrightarrow{v}=\begin{bmatrix} z\\t \end{bmatrix}\) .

Then the sytem of linear equations in \( (a_{1},a_{2})\) :

\[ \left\{\begin{matrix}x_{1}a_{1}&+&x_{2}a_{2}&=&z\\y_{1}a_{1}&+&y_{2}a_{2}&=&t\end{matrix}\right. \]

has at exactly one solution \( (a_{1},a_{2})\) .

This is because \( (\overrightarrow{u}_1,\overrightarrow{u}_2)\) is a linearly independant ordered set, so that the system of linear equations in \( (a_{2},a_{3})\) :

\[ \left\{\begin{matrix}x_{2}a_{2}&+&x_{3}a_{3}&=&0\\y_{2}a_{2}&+&y_{3}a_{3}&=&0\end{matrix}\right. \]

has exactly one solution \( (0,0)\), and thus its determinant \( x_2y_3-x_3y_2\) is non zero.

Consequently, \( \overrightarrow{v}=a_1\overrightarrow{u}_1+a_2\overrightarrow{u}_2\) , so that \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is a spanning ordered set.

2.3.4 Third Example: An Ordered Set Made of Two Aligned Non Zero Vectors

Assume that \( (\overrightarrow{u}_1,\overrightarrow{u}_2)\in(\mathbb{P}^*)^2\) are aligned non zero vectors.

Then, as any linear combination of \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) is aligned with \( \overrightarrow{u}_1\) , no vector that is not aligned with \( \overrightarrow{u}_1\) may be written as a linear combination of \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) .

Consequently, \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is a non spanning ordered set.

2.3.5 Fourth Example: An Ordered Set Made of Three not all Aligned Non Zero Vectors

Then, as \( S'=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is a spanning ordered set, any vector in the plane is a linear combination of \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) , and consequently of \( \overrightarrow{u}_1\) , \( \overrightarrow{u}_2\) and \( \overrightarrow{u}_3\) , with a zero coefficient for \( \overrightarrow{u}_3\) .

Consequently, \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2,…,\overrightarrow{u}_n)\) is a spanning ordered set.

2.3.6 Properties of the Spanning Ordered Sets

Theorem 2

Then the following assertions hold.

\( S\) is a spanning ordered set if and only if it contains at least two non aligned non zero vectors.
If an ordered set contains some vectors forming a spanning ordered set, then it is spanning as well.

Proof

\( S\) is in one of the following cases.

All the vectors in \( S\) are equal to the null vector.
Then, as any linear combination of the vectors in \( S\) is equal to the null vector, no non zero vector may be written as a linear combination of the vectors in \( S\) .
All the non zero vectors in \( S\) are aligned.
Then, as any linear combination of the vectors in \( S\) is aligned with these vectors or is the null vector, no vector that is not aligned with the non zero vectors in \( S\) may be written as a linear combination of the vectors in \( S\) .
\( S\) contains at least two non aligned non zero vectors, say \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) .
Then, as \( S'=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is a spanning ordered set, any vector in the plane is a linear combination of \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) , and consequently of the vectors in \( S\) , with zero coefficients for the possible other vectors in \( S\) .

If an ordered set contains some vectors forming a spanning ordered set, then it is spanning as well.
Indeed, it is sufficient to fill the linear combination of the spanning subset with zero coefficients for the other vectors in \( S\) .

2.4 Definition of a Basis in \( \mathbb{P}\)

Definition 3

A basis of the vector plane \( \mathbb{P}\) is a spanning ordered set that is linearly independent.

Let’s denote that he canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) is indeed a basis of the plane, because it is a spanning ordered set that is linearly independent.

The theorems 1 and 2 give us which kind of ordered sets are bases in \( \mathbb{P}\) .

Theorem 3

A basis of the plane \( \mathbb{P}\) is an ordered set composed of 2 non aligned non zero vectors.

As any basis of the plane is made of 2 vectors, we say that the vector plane \( \mathbb{P}\) is of dimension 2.

2.5 The Coordinates of a Vector in any Basis

Theorem 4

If \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis of the plane, then for any vector \( \overrightarrow{v}\in\mathbb{P}\) , there exists a unique couple of scalars \( (x,y)\in\mathbb{R}^2\) such that \( \overrightarrow{v}=x\overrightarrow{I}+y\overrightarrow{J}\) .

These scalars \( (x,y)\) are called the coordinates of \( \overrightarrow{v}\) in the basis \( B\) , and the column vector of these coordinates is denoted \( [\overrightarrow{v}]_B=\begin{bmatrix}x\\y\end{bmatrix}\) .

Note that, if \( \overrightarrow{v}\) is the column vector \( \overrightarrow{v}=\begin{bmatrix}x_0\\y_0\end{bmatrix}\) , then \( [\overrightarrow{v}]_{B_0}=\begin{bmatrix}x_0\\y_0\end{bmatrix}\) . This is because \( \overrightarrow{v}=x_0\overrightarrow{i}+y_0\overrightarrow{j}\) .

Proof (of the theorem 4)

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis of the plane, and that \( \overrightarrow{v}\in\mathbb{P}\) is a vector.

Then, as \( B=(\overrightarrow{I},\overrightarrow{J})\) is a spanning ordered set, we may consider scalars \( (x,y)\in\mathbb{R}^2\) such that \( \overrightarrow{v}=x\overrightarrow{I}+y\overrightarrow{J}\) .

But these scalars are unique.

Indeed, assume that \( (x',y')\in\mathbb{R}^2\) are scalars such that \( \overrightarrow{v}=x'\overrightarrow{I}+y'\overrightarrow{J}\) as well.

Then \( (x-x')\overrightarrow{I}+(y-y')\overrightarrow{J}=\overrightarrow{0}\) .

So that, as \( B=(\overrightarrow{I},\overrightarrow{J})\) is a linearly independent ordered set,

\( x-x'=y-y'=0\) .

Consequently, \( x'=x\) and \( y'=y\) .

2.6 Geometrical View of a Linear Combination of 2 Non Aligned Non Zero Vectors

2.6.1 First Case: 2 Positive Coefficients

Geometrical view of a linear combination with positive
coefficients of 2 non aligned non zero vectors — Figure 9. Geometrical view of a linear combination with positive coefficients of 2 non aligned non zero vectors

Assume that \( (\overrightarrow{I},\overrightarrow{J})\in(\mathbb{P}^*)^2\) are non aligned non zero vectors.

Assume that \( (a,b)\in(\mathbb{R}^*_+)^2\) are scalars such that \( a>0\) and \( b>0\) .

Then the following assertions hold.

The vector \( a\overrightarrow{I}\) is positively aligned with \( \overrightarrow{I}\) and its norm is \( \left\|a\overrightarrow{I}\right\|=a\left\|\overrightarrow{I}\right\|\) .
The vector \( b\overrightarrow{J}\) is positively aligned with \( \overrightarrow{J}\) and its norm is \( \left\|b\overrightarrow{J}\right\|=b\left\|\overrightarrow{J}\right\|\) .
Because of the geometrical view of the sum of two non zero vectors studied in lecture 15 of section 4, the vector \( a\overrightarrow{I}+b\overrightarrow{J}\) is built by drawing the vector \( b\overrightarrow{J}\) at the end of the vector \( a\overrightarrow{I}\) .

2.6.2 Second Case: 1 Positive Coefficient, then 1 Negative Coefficient

Geometrical view of a linear combination, with 1 positive coefficient
then 1 negative coefficient, of 2 non aligned non zero vectors — Figure 10. Geometrical view of a linear combination, with 1 positive coefficient then 1 negative coefficient, of 2 non aligned non zero vectors

\( (\overrightarrow{I},\overrightarrow{J})\in(\mathbb{P}^*)^2\) are non aligned non zero vectors.

Assume that \( (a,b)\in\mathbb{R}^*_+\times\mathbb{R}^*_-\) are scalars such that \( a>0\) and \( b<0\) .

Then the following assertions hold.

The vector \( a\overrightarrow{I}\) is positively aligned with \( \overrightarrow{I}\) and its norm is \( \left\|a\overrightarrow{I}\right\|=a\left\|\overrightarrow{I}\right\|\) .
The vector \( b\overrightarrow{J}\) is negatively aligned with \( \overrightarrow{J}\) and its norm is \( \left\|b\overrightarrow{J}\right\|=|b|\left\|\overrightarrow{J}\right\|\) .
The vector \( a\overrightarrow{I}+b\overrightarrow{J}\) is built by drawing the vector \( b\overrightarrow{J}\) at the end of the vector \( a\overrightarrow{I}\) .

2.6.3 Third Case: 1 Negative Coefficient, then 1 Positive Coefficient

Geometrical view of a linear combination, with 1 negative coefficient
then 1 positive coefficient, of 2 non aligned non zero vectors — Figure 11. Geometrical view of a linear combination, with 1 negative coefficient then 1 positive coefficient, of 2 non aligned non zero vectors

Assume that \( (\overrightarrow{I},\overrightarrow{J})\in(\mathbb{P}^*)^2\) are non aligned non zero vectors.

Assume that \( (a,b)\in\mathbb{R}^*_-\times\mathbb{R}^*_+\) are scalars such that \( a<0\) and \( b>0\) .

Then the following assertions hold.

The vector \( a\overrightarrow{I}\) is negatively aligned with \( \overrightarrow{I}\) and its norm is \( \left\|a\overrightarrow{I}\right\|=|a|\left\|\overrightarrow{I}\right\|\) .
The vector \( b\overrightarrow{J}\) is positively aligned with \( \overrightarrow{J}\) and its norm is \( \left\|b\overrightarrow{J}\right\|=b\left\|\overrightarrow{J}\right\|\) .
The vector \( a\overrightarrow{I}+b\overrightarrow{J}\) is built by drawing the vector \( b\overrightarrow{J}\) at the end of the vector \( a\overrightarrow{I}\) .

2.6.4 Fourth Case: 2 Negative Coefficients

Geometrical view of a linear combination with negative
coefficients of 2 non aligned non zero vectors — Figure 12. Geometrical view of a linear combination with negative coefficients of 2 non aligned non zero vectors

Assume that \( (\overrightarrow{I},\overrightarrow{J})\in(\mathbb{P}^*)^2\) are non aligned non zero vectors.

Assume that \( (a,b)\in(\mathbb{R}^*_-)^2\) are scalars such that \( a<0\) and \( b<0\) .

Then the following assertions hold.

The vector \( a\overrightarrow{I}\) is negatively aligned with \( \overrightarrow{I}\) and its norm is \( \left\|a\overrightarrow{I}\right\|=|a|\left\|\overrightarrow{I}\right\|\) .
The vector \( b\overrightarrow{J}\) is negatively aligned with \( \overrightarrow{J}\) and its norm is \( \left\|b\overrightarrow{J}\right\|=|b|\left\|\overrightarrow{J}\right\|\) .
Because of the geometrical view of the sum of two non zero vectors studied in lecture 15 of section 4, the vector \( a\overrightarrow{I}+b\overrightarrow{J}\) is built by drawing the vector \( b\overrightarrow{J}\) at the end of the vector \( a\overrightarrow{I}\) .

2.7 Geometrical View of the Coordinates of a Vector in any Basis

2.7.1 Geometrical View of a Basis of the Plane

Figure 13. Geometrical view of a basis in the plane

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis in the plane.

Then the following geometrical facts occur.

The \( X\) axis is leaded by the vector \( \overrightarrow{I}\) .
The \( Y\) axis is leaded by the vector \( \overrightarrow{J}\) .
The coordinates network is made of parallels to the \( X\) axis and to the \( Y\) axis.
This is because we will have to draw the projections:

on the \( X\) axis along the \( Y\) axis,
and on the \( Y\) axis along the \( X\) axis.

2.7.2 Geometrical View of the Coordinates in \( B\) of a Vector Positively along the \( X\) Axis

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis in the plane.

Assume that \( x\in\mathbb{R_+^*}\) is a real number such that \( x>0\) , and consider the vector \( \overrightarrow{v}\in\mathbb{P}\) such that \( [\overrightarrow{v}]_B=\begin{bmatrix}x\\0\end{bmatrix}\) .

Then the following geometrical facts occur.

\( \overrightarrow{v}\) is positively along the \( X\) axis.
Its length is \( x\) times the length of \( \overrightarrow{I}\) .

2.7.3 Geometrical View of the Coordinates in \( B\) of a Vector in the First Quadrant of \( B\)

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis in the plane.

Assume that \( (x,y)\in(\mathbb{R_+^*)^{2}}\) are real numbers such that \( x>0\) and \( y>0\) , and consider the vector \( \overrightarrow{v}\in\mathbb{P}\) such that \( [\overrightarrow{v}]_B=\begin{bmatrix}x\\y\end{bmatrix}\) .

Then the following geometrical facts occur.

The projection of the end of \( \overrightarrow{v}\) on the \( X\) axis along the \( Y\) axis is at a distance \( x\left\|\overrightarrow{I}\right\|\) of the origin positively along the \( X\) axis.
The projection of the end of \( \overrightarrow{v}\) on the \( Y\) axis along the \( X\) axis is at a distance \( y\left\|\overrightarrow{J}\right\|\) of the origin positively along the \( Y\) axis.

2.7.4 Geometrical View of the Coordinates in \( B\) of a Vector Positively along the \( Y\) Axis

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis in the plane.

Assume that \( y\in\mathbb{R_+^*}\) is a real number such that \( y>0\) , and consider the vector \( \overrightarrow{v}\in\mathbb{P}\) such that \( [\overrightarrow{v}]_B=\begin{bmatrix}0\\y\end{bmatrix}\) .

Then the following geometrical facts occur.

\( \overrightarrow{v}\) is positively along the \( Y\) axis.
Its length is \( y\) times the length of \( \overrightarrow{J}\) .

2.7.5 Geometrical View of the Coordinates in \( B\) of a Vector in the Second Quadrant of \( B\)

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis in the plane.

Assume that \( (x,y)\in\mathbb{R_-^*}\times\mathbb{R_+^*}\) are real numbers such that \( x<0\) and \( y>0\) , and consider the vector \( \overrightarrow{v}\in\mathbb{P}\) such that \( [\overrightarrow{v}]_B=\begin{bmatrix}x\\y\end{bmatrix}\) .

Then the following geometrical facts occur.

The projection of the end of \( \overrightarrow{v}\) on the \( X\) axis along the \( Y\) axis is at a distance \( |x|\left\|\overrightarrow{I}\right\|\) of the origin negatively along the \( X\) axis.
The projection of the end of \( \overrightarrow{v}\) on the \( Y\) axis along the \( X\) axis is at a distance \( y\left\|\overrightarrow{J}\right\|\) of the origin positively along the \( Y\) axis.

2.7.6 Geometrical View of the Coordinates in \( B\) of a Vector Negatively along the \( X\) Axis

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis in the plane.

Assume that \( x\in\mathbb{R_-^*}\) is a real number such that \( x<0\) , and consider the vector \( \overrightarrow{v}\in\mathbb{P}\) such that \( [\overrightarrow{v}]_B=\begin{bmatrix}x\\0\end{bmatrix}\) .

Then the following geometrical facts occur.

\( \overrightarrow{v}\) is negatively along the \( X\) axis.
Its length is \( |x|\) times the length of \( \overrightarrow{I}\) .

2.7.7 Geometrical View of the Coordinates in \( B\) of a Vector in the Third Quadrant of \( B\)

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis in the plane.

Assume that \( (x,y)\in(\mathbb{R_-^*)^{2}}\) are real numbers such that \( x<0\) and \( y<0\) , and consider the vector \( \overrightarrow{v}\in\mathbb{P}\) such that \( [\overrightarrow{v}]_B=\begin{bmatrix}x\\y\end{bmatrix}\) .

Then the following geometrical facts occur.

The projection of the end of \( \overrightarrow{v}\) on the \( X\) axis along the \( Y\) axis is at a distance \( |x|\left\|\overrightarrow{I}\right\|\) of the origin negatively along the \( X\) axis.
The projection of the end of \( \overrightarrow{v}\) on the \( Y\) axis along the \( X\) axis is at a distance \( |y|\left\|\overrightarrow{J}\right\|\) of the origin negatively along the \( Y\) axis.

2.7.8 Geometrical View of the Coordinates in \( B\) of a Vector Negatively along the \( Y\) Axis

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis in the plane.

Assume that \( y\in\mathbb{R_-^*}\) is a real number such that \( y<0\) , and consider the vector \( \overrightarrow{v}\in\mathbb{P}\) such that \( [\overrightarrow{v}]_B=\begin{bmatrix}0\\y\end{bmatrix}\) .

Then the following geometrical facts occur.

\( \overrightarrow{v}\) is negatively along the \( Y\) axis.
Its length is \( |y|\) times the length of \( \overrightarrow{J}\) .

2.7.9 Geometrical View of the Coordinates in \( B\) of a Vector in the Fourth Quadrant of \( B\)

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis in the plane.

Assume that \( (x,y)\in\mathbb{R_+^*}\times\mathbb{R_-^*}\) are real numbers such that \( x>0\) and \( y<0\) , and consider the vector \( \overrightarrow{v}\in\mathbb{P}\) such that \( [\overrightarrow{v}]_B=\begin{bmatrix}x\\y\end{bmatrix}\) .

Then the following geometrical facts occur.

The projection of the end of \( \overrightarrow{v}\) on the \( X\) axis along the \( Y\) axis is at a distance \( x\left\|\overrightarrow{I}\right\|\) of the origin positively along the \( X\) axis.
The projection of the end of \( \overrightarrow{v}\) on the \( Y\) axis along the \( X\) axis is at a distance \( |y|\left\|\overrightarrow{J}\right\|\) of the origin negatively along the \( Y\) axis.

3 Change of Basis and Transition Matrix

With the help of the inverse of so called transition matrix, we will be able to calculate the coordinates in a new basis when the coordinates in an old basis are known.

3.1 Introduction to the Transition Matrix

3.1.1 Example of Two Aligned Non Zero Vectors

Consider the aligned non zero vectors in the plane:

\( \overrightarrow{u}_1\) with column vector of coordinates in the canonical basis \( X_1=\begin{bmatrix}2\\1\end{bmatrix}\) ,
and \( \overrightarrow{u}_2=-\frac{1}{2}\overrightarrow{u}_1\) with column vector of coordinates in the canonical basis \( X_2=\begin{bmatrix}-1\\{-\frac{1}{2}}\end{bmatrix}\) .

Consider the matrix \( P\) with columns the column vectors \( X_{1}\) and \( X_{2}\) :

\( P=\begin{bmatrix}2&-1\\1&-\frac{1}{2}\end{bmatrix}\)

Then the following assertions hold:

The determinant of \( P\) is \( \det(P)=2\times\left(-\frac{1}{2}\right)-1\times(-1)=-1+1=0\) .
Consequently, \( P\) is not invertible.
And, as \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) are aligned, the ordered set \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is not a basis of the plane.

3.1.2 Example of Two Non Aligned Non Zero Vectors

Consider the non aligned non zero vectors in the plane:

\( \overrightarrow{u}_1\) with column vector of coordinates in the canonical basis \( X_1=\begin{bmatrix}2\\1\end{bmatrix}\) ,
and \( \overrightarrow{u}_2\) with column vector of coordinates in the canonical basis \( X_2=\begin{bmatrix}1\\{-1}\end{bmatrix}\) .

Consider the matrix \( P\) with columns the column vectors \( X_{1}\) and \( X_{2}\) :

\( P=\begin{bmatrix}2&1\\1&-1\end{bmatrix}\) .

Then the following assertions hold:

The determinant of \( P\) is \( \det(P)=2\times(-1)-1\times 1=-2-1=-3\ne 0\) .
Consequently, \( P\) is invertible.
And, as \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) are non aligned non zero vectors, the ordered set \( S=(\overrightarrow{u}_1,\overrightarrow{u}_2)\) is a basis of the plane.

3.2 Equivalent Conditions for Two Vectors to be a Basis of the Plane

Theorem 5

Assume that \( (x_I,y_I,x_J,y_J)\in\mathbb{R}^4\) are real numbers and consider the two vectors in the plane:

\( \overrightarrow{I}\) with column vector of coordinates in the canonical basis \( X_I=\begin{bmatrix}x_{I}\\y_{I}\end{bmatrix}\) ,
and \( \overrightarrow{J}\) with column vector of coordinates in the canonical basis \( X_J=\begin{bmatrix}x_{J}\\y_{J}\end{bmatrix}\) .

Consider the matrix \( P\) with columns the column vectors \( X_{I}\) and \( X_{J}\) :

\( P=\begin{bmatrix}x_I&x_J\\y_I&y_J\end{bmatrix}\) .

Then the following assertions are equivalent.

The ordered set \( S=(\overrightarrow{I},\overrightarrow{J})\) is a basis of the plane.
\( \overrightarrow{I}\) and \( \overrightarrow{J}\) are non aligned non zero vectors.
The determinant of \( P\) is non zero.
The matrix \( P\) is invertible.

Proof

Assume that \( (x_I,y_I,x_J,y_J)\in\mathbb{R}^4\) are real numbers and consider the two vectors in the plane:

\( \overrightarrow{I}\) with column vector of coordinates in the canonical basis \( X_I=\begin{bmatrix}x_{I}\\y_{I}\end{bmatrix}\) ,
and \( \overrightarrow{J}\) with column vector of coordinates in the canonical basis \( X_J=\begin{bmatrix}x_{J}\\y_{J}\end{bmatrix}\) .

Consider the matrix \( P\) with columns the column vectors \( X_{I}\) and \( X_{J}\) :

\( P=\begin{bmatrix}x_I&x_J\\y_I&y_J\end{bmatrix}\) .

Then the following equivalences are already known:

(I) and (II) are equivalent.
(III) and (IV) are equivalent.

We have now to prove that (II) and (III) are equivalent.

But this is equivalent to the proof of the equivalence of the negation of the assertions (II) and (III), that are:

\( \overrightarrow{I}\) and \( \overrightarrow{J}\) are aligned non zero vectors, or at least one of them is the zero vector.
\( \det(P)=0\) .

Assume that (V) is verified.

If one at least of the vectors \( \overrightarrow{I}\) and \( \overrightarrow{J}\) , say \( \overrightarrow{I}\) , is the zero vector, then \( P=\begin{bmatrix}0&x_J\\0&y_J\end{bmatrix}\) , so that: \( \det(P)=0\times y_{J} - 0\times x_{J}=0\) .

And if \( \overrightarrow{I}\) and \( \overrightarrow{J}\) are aligned non zero vectors, then we may consider the scalar \( k\) such that \( \overrightarrow{J}=k\overrightarrow{I}\) .

In that case, \( P=\begin{bmatrix}x_I&kx_I\\y_I&ky_I\end{bmatrix}\) , so that \( \det(P)=x_{I}(ky_{I})-y_{I}(kx_{I})=0\) .

Consequently, (VI) is verified.

Assume now that (VI) is verified.

Then \( x_{I}y_{J}=y_{I}x_{J}\) .

If \( x_{I}=0\) , then either \( y_{J}=0\) and thus \( \overrightarrow{I}\) is the zero vector, of \( x_{J}=0\) and thus \( \overrightarrow{I}\) and \( \overrightarrow{J}\) are both aligned with \( \overrightarrow{j}\) or one of them is the zero vector.

And if \( x_{I}\ne 0\) , then either \( x_{J}=y_{J}=0\) and thus \( \overrightarrow{J}\) is the zero vector, or \( \frac{y_{J}}{y_{I}}=\frac{x_{J}}{x_{I}}\ne 0\) , that we denote \( k\) that fraction, then \( \overrightarrow{J}=k\overrightarrow{I}\) , and \( \overrightarrow{I}\) and \( \overrightarrow{J}\) are aligned non zero vectors.

Consequently, (V) is verified.

3.3 The Transition Matrix from the New Basis to the Canonical Basis

Theorem 6

Consider the canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) of the plane.

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is another basis of the plane, where:

the column vector of coordinates of \( \overrightarrow{I}\) in the canonical basis is \( X_I=\begin{bmatrix}x_I\\y_I\end{bmatrix}\) ,
and the column vector of coordinates of \( \overrightarrow{J}\) in the canonical basis is \( X_J=\begin{bmatrix}x_J\\y_J\end{bmatrix}\) .

Consider the matrix \( P\) with columns the column vectors \( X_{I}\) and \( X_{J}\) :

\( P=\begin{bmatrix}x_I&x_J\\y_I&y_J\end{bmatrix}\) .

Assume that \( \overrightarrow{u}\in\mathbb{P}\) is a vector such that:

its column vector of coordinates in the canonical basis is \( [\overrightarrow{u}]_{B_0}=X=\begin{bmatrix}x_1\\x_2\end{bmatrix}\) ,
and its column vector of coordinates in the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) is \( [\overrightarrow{u}]_{B}=Y=\begin{bmatrix}y_1\\y_2\end{bmatrix}\)

Then \( P\) is invertible and \( X=PY\) so that \( Y=P^{-1}X\) .

Definition 4

Consider the canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) of the plane.

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is another basis of the plane, where:

the column vector of coordinates of \( \overrightarrow{I}\) in the canonical basis is \( X_I=\begin{bmatrix}x_I\\y_I\end{bmatrix}\) ,
and the column vector of coordinates of \( \overrightarrow{J}\) in the canonical basis is \( X_J=\begin{bmatrix}x_J\\y_J\end{bmatrix}\) .

Consider the matrix \( P\) with columns the column vectors \( X_{I}\) and \( X_{J}\) :

\( P=\begin{bmatrix}x_I&x_J\\y_I&y_J\end{bmatrix}\) .

Then, by definition, \( P\) is the transition matrix from the “new” basis \( B=(\overrightarrow{I},\overrightarrow{J})\) to the canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) , and and its inverse \( P^{-1}\) is the transition matrix from the canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) to the “new” basis \( B=(\overrightarrow{I},\overrightarrow{J})\) .

Proof (of the theorem 6)

Consider the canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) of the plane.

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is another basis of the plane, where:

the column vector of coordinates of \( \overrightarrow{I}\) in the canonical basis is \( X_I=\begin{bmatrix}x_I\\y_I\end{bmatrix}\) ,
and the column vector of coordinates of \( \overrightarrow{J}\) in the canonical basis is \( X_J=\begin{bmatrix}x_J\\y_J\end{bmatrix}\) .

Then \( \overrightarrow{I}=x_I\overrightarrow{i}+y_I\overrightarrow{j}\) and \( \overrightarrow{J}=x_J\overrightarrow{i}+y_J\overrightarrow{j}\) .

Consider the matrix \( P\) with columns the column vectors \( X_{I}\) and \( X_{J}\) :

\( P=\begin{bmatrix}x_I&x_J\\y_I&y_J\end{bmatrix}\) .

Then, as \( B\) is a basis of the plane, \( P\) is invertible.

Assume that \( \overrightarrow{u}\in\mathbb{P}\) is a vector such that:

its column vector of coordinates in the canonical basis is \( [\overrightarrow{u}]_{B_0}=X=\begin{bmatrix}x_1\\x_2\end{bmatrix}\) ,
and its column vector of coordinates in the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) is \( [\overrightarrow{u}]_{B}=Y=\begin{bmatrix}y_1\\y_2\end{bmatrix}\) .

Then \( \overrightarrow{u}=x_1\overrightarrow{i}+x_2\overrightarrow{j}\) and \( \overrightarrow{u}=y_1\overrightarrow{I}+y_2\overrightarrow{J}\) .

This implies that:

\( \overrightarrow{u}=y_1(x_I\overrightarrow{i}+y_I\overrightarrow{j}) +y_2(x_J\overrightarrow{i}+y_J\overrightarrow{j}) =(y_{1}x_{I}+y_{2}x_{J})\overrightarrow{i}+(y_{1}y_{I}+y_{2}y_{J})\overrightarrow{j}\) .

Consequently,

\( X=\begin{bmatrix}x_1\\x_2\end{bmatrix}=\begin{bmatrix}x_{I}y_{1}+x_{J}y_{2}\\y_{I}y_{1}+y_{J}y_{2}\end{bmatrix} =PY\) , so that \( Y=P^{-1}X\) .

Let’s generalize that to any two basis.

3.4 The Transition Matrix from the New Basis to the Old Basis

Consider the following three bases of the plane.

The canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) .
The basis \( B_1=(\overrightarrow{i}_1,\overrightarrow{j}_1)\) with transition matrix \( P_{1}\) to the canonical basis.
The basis \( B_2=(\overrightarrow{i}_2,\overrightarrow{j}_2)\) with transition matrix \( P_{2}\) to the canonical basis.

Assume that \( \overrightarrow{u}\in\mathbb{P}\) is a vector such that:

its column vector of coordinates in the canonical basis is \( [\overrightarrow{u}]_{B_0}=X\) ,
its column vector of coordinates in the basis \( B_1=(\overrightarrow{i}_1,\overrightarrow{j}_1)\) is \( [\overrightarrow{u}]_{B_1}=X_1\) ,
and its column vector of coordinates in the basis \( B_2=(\overrightarrow{i}_2,\overrightarrow{j}_2)\) is \( [\overrightarrow{u}]_{B_2}=X_2\)

Then the following equalities hold.

As \( X=P_2X_2\) and \( X_1=P_1^{-1}X\) , we have \( X_1=P_1^{-1}P_2X_2\) .
Consequently, \( X_2=(P_1^{-1}P_2)^{-1}X_1=P_2^{-1}P_1X_1\) .

That’s why we say that:

\( P=P_1^{-1}P_2\) is the transition matrix from the “new” basis \( B_2=(\overrightarrow{i}_2,\overrightarrow{j}_2)\) to the “old” basis \( B_1=(\overrightarrow{i}_1,\overrightarrow{j}_1)\) ,
and its inverse \( P^{-1}=P_2^{-1}P_1\) is the transition matrix from the “old” basis \( B_1=(\overrightarrow{i}_1,\overrightarrow{j}_1)\) to the “new” basis \( B_2=(\overrightarrow{i}_2,\overrightarrow{j}_2)\) .

Moreover, the columns of the transition matrix \( P\) are the column vectors corresponding to the coordinates of the vectors \( \overrightarrow{i_2}\) and \( \overrightarrow{j_2}\) of the new basis \( B_{2}\) , in the old basis \( B_{1}\) .

Proof (of the last assertion)

Consider the following three bases of the plane.

The canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) .
The basis \( B_1=(\overrightarrow{i}_1,\overrightarrow{j}_1)\) with transition matrix \( P_{1}\) to the canonical basis.
The basis \( B_2=(\overrightarrow{i}_2,\overrightarrow{j}_2)\) with transition matrix \( P_{2}\) to the canonical basis.

Consider the transition matrix \( P=P_1^{-1}P_2\) from the “new” basis \( B_2=(\overrightarrow{i}_2,\overrightarrow{j}_2)\) to the “old” basis \( B_1=(\overrightarrow{i}_1,\overrightarrow{j}_1)\) .

Let’s denote it \( P=\begin{bmatrix} a&b\\c&d \end{bmatrix}\) .

Then the column vector of coordinates of \( \overrightarrow{i}_2\) in the basis \( B_{1}\) is

\( [\overrightarrow{i_{2}}]_{B_1}=P[\overrightarrow{i_{2}}]_{B_2} =\begin{bmatrix} a&b\\c&d \end{bmatrix}\begin{bmatrix} 1\\0 \end{bmatrix} =\begin{bmatrix} a\\c \end{bmatrix}\) ,

the first column of the matrix \( P\) .

And the column vector of coordinates of \( \overrightarrow{j}_2\) in the basis \( B_{1}\) is

\( [\overrightarrow{j_{2}}]_{B_1}=P[\overrightarrow{j_{2}}]_{B_2} =\begin{bmatrix} a&b\\c&d \end{bmatrix}\begin{bmatrix} 0\\1 \end{bmatrix} =\begin{bmatrix} b\\d \end{bmatrix}\) ,

the second column of the matrix \( P\) .

4 Change of Basis for the Matrix of a Linear Mapping

We shall now use the transition matrix and its inverse to transform the matrix of a linear mapping in some base to that matrix in another base.

4.1 Definition of the Matrix of a Linear Mapping in any Basis

Definition 5

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis of the plane \( \mathbb{P}\) .

Assume that \( f:\;\mathbb{P}\;\rightarrow\;\mathbb{P}\) is a linear mapping in the plane, and consider the matrix \( A=\begin{bmatrix} a_{11}&a_{12}\\a_{21}&a_{22} \end{bmatrix}\) such that:

the first column \( C_1=\begin{bmatrix} a_{11}\\a_{21} \end{bmatrix}\) of \( A\) is the column vector of the cartesian coordinates in the basis \( B\) of the image \( f(\overrightarrow{I})\) of the first vector \( f(\overrightarrow{I})\) of the basis \( B\) : \( C_1=[f(\overrightarrow{I})]_B\) .
and the second column \( C_2=\begin{bmatrix} a_{12}\\a_{22} \end{bmatrix}\) of \( A\) is the column vector of the cartesian coordinates in the basis \( B\) of the image \( f(\overrightarrow{J})\) of the second vector \( f(\overrightarrow{J})\) of the basis \( B\) : \( C_2=[f(\overrightarrow{J})]_B\) .

Then the matrix \( A\) is called the matrix of the linear mapping \( f\) in the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) .

Theorem 7

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis of the plane \( \mathbb{P}\) .

Assume that \( f:\;\mathbb{P}\;\rightarrow\;\mathbb{P}\) is a linear mapping in the plane, and consider the matrix \( A\) of \( f\) in the basis \( B\) .

Assume that \( \overrightarrow{u}\in\mathbb{P}\) is a vector, and consider the column vectors of coordinates \( X=[\overrightarrow{u}]_B\) and \( Y=[f(\overrightarrow{u})]_B\) .

Then we have \( Y=AX\) .

That’s why the matrix \( A\) is called the matrix of the linear mapping \( f\) in the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) .

Proof (of the theorem 7)

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis of the plane \( \mathbb{P}\) .

Assume that \( f:\;\mathbb{P}\;\rightarrow\;\mathbb{P}\) is a linear mapping in the plane, and consider the matrix \( A\) of \( f\) in the basis \( B\) .

Assume that \( \overrightarrow{u}\in\mathbb{P}\) is a vector, and consider the column vectors of coordinates \( [\overrightarrow{u}]_B=X=\begin{bmatrix}x_1\\x_2 \end{bmatrix}\) and \( [f(\overrightarrow{u})]_B=Y=\begin{bmatrix}y_1\\y_2 \end{bmatrix}\) .

Then \( \overrightarrow{u}=x_{1}\overrightarrow{I}+x_{2}\overrightarrow{J}\) .

As \( f\) is a linear mapping, we have:

\( f(\overrightarrow{u})=f(x_{1}\overrightarrow{I}+x_{2}\overrightarrow{J}) =x_{1}f(\overrightarrow{I})+x_{2}f(\overrightarrow{J})\) .

Consequenly, with the column vectors of coordinates:

\( Y=x_{1}C_{1}+x_{2}C_{2} =x_{1}\begin{bmatrix} a_{11}\\a_{21} \end{bmatrix} +x_{2}\begin{bmatrix} a_{12}\\a_{22} \end{bmatrix} =\begin{bmatrix} x_{1}a_{11}+x_{2}a_{12}\\x_{1}a_{21}+x_{2}a_{22} \end{bmatrix}\)

\( =\begin{bmatrix} a_{11}&a_{12}\\a_{21}&a_{22} \end{bmatrix} \begin{bmatrix}x_1\\x_2 \end{bmatrix}=AX\) .

4.2 Particular Cases of Linear Mappings

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis of the plane \( \mathbb{P}\) .

Then the matrices of the following linear mappings in the basis \( B\) are the following.

The matrix of the null mapping \( o\) in the basis \( B\) is the null matrix .
This is because \( o(\overrightarrow{I})=o(\overrightarrow{J})=\overrightarrow{0}\) .
The matrix of the identity mapping \( \text{Id}_{\mathbb{P}}\) in the basis \( B\) is the identity matrix \( I=\begin{bmatrix} 1&0\\0&1 \end{bmatrix}\) .
This is because \( \text{Id}_{\mathbb{P}}(\overrightarrow{I})=\overrightarrow{I}\) and \( \text{Id}_{\mathbb{P}}(\overrightarrow{I})=\overrightarrow{I}\) .
The matrix of the homothety \( h_{\lambda}\) of factor \( \lambda\) in the basis \( B\) is the scalar matrix \( H_{\lambda}=\begin{bmatrix}\lambda&0\\0&\lambda\end{bmatrix}=\lambda I\) .
This is because \( h_{\lambda}(\overrightarrow{I})=\lambda\overrightarrow{I}\) and \( h_{\lambda}(\overrightarrow{J})=\lambda\overrightarrow{J}\) .

4.3 Change of Basis for a Linear Mapping

Assume that \( B_1=(\overrightarrow{i}_1,\overrightarrow{j}_1)\) and \( B_2=(\overrightarrow{i}_2,\overrightarrow{j}_2)\) are two bases of the plane \( \mathbb{P}\) .

Assume that \( f:\;\mathbb{P}\;\rightarrow\;\mathbb{P}\) is a linear mapping in the plane.

Consider the following matrices:

the matrix \( A_{1}\) of \( f\) in the basis \( B_{1}\) ,
the matrix \( A_{2}\) of \( f\) in the basis \( B_{2}\) ,
and the transition matrix \( P\) from the basis \( B_{2}\) to the basis \( B_{1}\) .

Assume that \( \overrightarrow{u}\in\mathbb{P}\) is a vector, with column vectors of coordinates:

\( X_{1}\) in the basis \( B_{1}\) : \( X_1=[\overrightarrow{u}]_{B_1}\) ,
\( X_{2}\) in the basis \( B_{2}\) : \( X_2=[\overrightarrow{u}]_{B_2}\)

Consider the vector \( \overrightarrow{v}=f(\overrightarrow{u})\), with column vectors of coordinates:

\( Y_{1}\) in the basis \( B_{1}\) : \( Y_1=[f(\overrightarrow{u})]_{B_1}\) ,
\( Y_{2}\) in the basis \( B_{2}\) : \( Y_2=[f(\overrightarrow{u})]_{B_2}\)

Then the following equalities hold.

\( Y_2=P^{-1}Y_1\) ,
\( Y_1=A_1X_1\) ,
\( X_1=PX_2\) ,
and \( Y_2=A_2X_2\) .

So that the following calculations may be performed.

\( Y_2=P^{-1}Y_1=P^{-1}A_1X_1=P^{-1}A_1PX_2\) .

Consequently,

\( A_2=P^{-1}A_1P\) .

4.4 Application to the Calculation of the Matrix in the Canonical Basis of a Projection

Figure 24. Projection on a line along another line

Assume that \( (D)\) and \( (D')\) are straight lines secant on the origin.

Assume that \( \overrightarrow{u}\in\mathbb{P}\) is a vector in the plane.

Then the end of the projected \( p(\overrightarrow{u})\) of the vector \( \overrightarrow{u}\) on the line \( (D)\) along the line \( (D')\) , is defined as the intersection of \( (D)\) with the parallel of \( (D')\) that passes through the end of \( \overrightarrow{u}\) .

Assume that the line \( (D)\) is directed by the vector \( \overrightarrow{I}=\begin{bmatrix} I_1\\I_2 \end{bmatrix}\) .

Assume that the line \( (D')\) is directed by the vector \( \overrightarrow{J}=\begin{bmatrix} J_1\\J_2 \end{bmatrix}\) .

Then, as the lines \( (D)\) and \( (D')\) are secant, so that the vectors \( \overrightarrow{I}\) and \( \overrightarrow{J}\) are not aligned, \( B=(\overrightarrow{I},\overrightarrow{J})\) is a basis of the plane, with transition matrix to the canonical basis \( P=\begin{bmatrix} I_1&J_1\\I_2&J_2 \end{bmatrix}\) .

The transition matrix from the canonical basis to the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) is the inverse of \( P\) : \( P^{-1}=\frac{1}{I_1J_2-I_2J_1} \begin{bmatrix} J_2&-J_1\\{-I_2}&I_1 \end{bmatrix}\) .

Consider the column vector \( X=\begin{bmatrix}x\\y\end{bmatrix}\) corresponding to the cartesian coordinates of the vector \( \overrightarrow{u}\) in the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) .

Then the column vector corresponding to the cartesian coordinates of the vector \( p(\overrightarrow{u})\) in the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) is equal to \( Y=\begin{bmatrix}x\\0\end{bmatrix}\) .

So that \( p\) is a linear mapping, with matrix in the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) equal to \( \Pi_0=\begin{bmatrix}1&0\\0&0 \end{bmatrix}\) .

Let’s denote \( \Pi\) the matrix of the projection \( p\) is the canonical basis.

Then \( \Pi_0=P^{-1}\Pi P\) , so that \( \Pi=P\Pi_0P^{-1}\) .

Let’s calculate \( \Pi\) with that formula.

\( \Pi=P\Pi_0P^{-1}\)

\( =\frac{1}{I_1J_2-I_2J_1} \begin{bmatrix} I_1&J_1\\I_2&J_2 \end{bmatrix} \begin{bmatrix}1&0\\0&0 \end{bmatrix} \begin{bmatrix} J_2&-J_1\\{-I_2}&I_1 \end{bmatrix}\)

\( =\frac{1}{I_1J_2-I_2J_1}\begin{bmatrix} I_1&0\\I_2&0 \end{bmatrix} \begin{bmatrix} J_2&-J_1\\{-I_2}&I_1 \end{bmatrix}\)

\( \Pi=\frac{1}{I_1J_2-I_2J_1}\begin{bmatrix} I_1J_2&-I_1J_1\\ I_2J_2&-I_2J_1 \end{bmatrix}\)

Assume now that the line \( (D)\) has an equation \( ax+by=0\) .

then it is directed by the vector \( \overrightarrow{I}=\begin{bmatrix}b\\{-a}\end{bmatrix}\)

Assume now that the line \( (D')\) has an equation \( cx+dy=0\) .

then it is directed by the vector \( \overrightarrow{J}=\begin{bmatrix}d\\{-c}\end{bmatrix}\)

So that, if we replace the coordonates of \( \overrightarrow{I}\) and \( \overrightarrow{J}\) , we obtain:

\( \Pi=\frac{1}{ad-bc} \begin{bmatrix} -bc&-bd\\ac&ad \end{bmatrix} =\begin{bmatrix} -\frac{bc}{ad-bc}&-\frac{bd}{ad-bc}\\ \frac{ac}{ad-bc}&\frac{ad}{ad-bc} \end{bmatrix}\)

5 Direct and Reverse Orthonormal Bases

The orthonormal bases are useful bases in the euclidean plane, for which the coordinates of any vector are simply the dot products of the vectors of the basis with that vector.

Moreover, the orthonormal bases preserve the dot product and thus the norm of vectors.

And last but not least, he matrices of rotation in the the direct orhtonormal bases tare all the same.

And last but not least, the direct orhtonormal bases the matrices of rotation.

5.1 Examples of Direct and Reverse Orthonormal Basis

5.1.1 First Example

Figure 25. The canonical basis is a direct orthonormal basis

The canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) is a direct orthonormal basis because of the following facts.

The two vectors are orthogonal to each other (\( \overrightarrow{i}\bot\overrightarrow{j}\) ), that is \( \overrightarrow{i}\cdot\overrightarrow{j}=0\) .
The two vectors are unitary: \( \left\| \overrightarrow{i} \right\|=\left\| \overrightarrow{j} \right\|=1\) .
We go from \( \overrightarrow{i}\) to \( \overrightarrow{j}\) in the direct direction, that is the counter clockwise direction: \( \widehat{(\overrightarrow{i},\overrightarrow{j})}=+\frac{\pi}{2}\) .

5.1.2 Second Example

Figure 26. An example of reverse orthonormal basis

The basis \( B_1=(\overrightarrow{j},\overrightarrow{i})\) is a reverse orthonormal basis because of the following facts.

The two vectors are orthogonal to each other (\( \overrightarrow{j}\bot\overrightarrow{i}\) ), that is \( \overrightarrow{j}\cdot\overrightarrow{i}=0\) .
The two vectors are unitary: \( \left\| \overrightarrow{j} \right\|=\left\| \overrightarrow{i} \right\|=1\) .
We go from \( \overrightarrow{j}\) to \( \overrightarrow{i}\) in the reverse direction, that is the clockwise direction: \( \widehat{(\overrightarrow{j},\overrightarrow{i})}=-\frac{\pi}{2}\) .

5.1.3 Third Example

Figure 27. Another example of direct orthonormal basis

Consider the vectors \( \overrightarrow{I}=\begin{bmatrix}\frac{\sqrt{3}}{2}\\\frac{1}{2}\end{bmatrix} =\begin{bmatrix}\cos\left(\frac{\pi}{6}\right)\\\sin\left(\frac{\pi}{6}\right)\end{bmatrix}\) and \( \overrightarrow{J}=\begin{bmatrix}-\frac{1}{2}\\\frac{\sqrt{3}}{2}\end{bmatrix} =\begin{bmatrix}\cos\left(\frac{2\pi}{3}\right)\\\sin\left(\frac{2\pi}{3}\right)\end{bmatrix}\) .

Then the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) is a direct orthonormal basis because of the following facts.

The two vectors are orthogonal to each other (\( \overrightarrow{I}\bot\overrightarrow{J}\) ), because \( \overrightarrow{I}\cdot\overrightarrow{J}=0\) .
The two vectors are unitary, because \( \left\| \overrightarrow{I} \right\|=\left\| \overrightarrow{J} \right\|=1\) .
We go from \( \overrightarrow{i}\) to \( \overrightarrow{j}\) in the direct direction, because
\( \widehat{(\overrightarrow{I},\overrightarrow{J})}= \widehat{(\overrightarrow{i},\overrightarrow{J})}-\widehat{(\overrightarrow{i},\overrightarrow{I})} =\frac{2\pi}{3}-\frac{\pi}{6}=+\frac{\pi}{2}\) .

5.1.4 Fourth Example

Figure 28. Another example of reverse orthonormal basis

Consider the vectors \( \overrightarrow{I}=\begin{bmatrix}\frac{\sqrt{2}}{2}\\\frac{\sqrt{2}}{2}\end{bmatrix} =\begin{bmatrix}\cos\left(\frac{\pi}{4}\right)\\\sin\left(\frac{\pi}{4}\right)\end{bmatrix}\) and \( \overrightarrow{J}=\begin{bmatrix}\frac{\sqrt{2}}{2}\\{-\frac{\sqrt{2}}{2}}\end{bmatrix} =\begin{bmatrix}\cos\left(-\frac{\pi}{4}\right)\\\sin\left(-\frac{\pi}{4}\right)\end{bmatrix}\) .

Then the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) is a reverse orthonormal basis because of the following facts.

The two vectors are orthogonal to each other (\( \overrightarrow{I}\bot\overrightarrow{J}\) ), because \( \overrightarrow{I}\cdot\overrightarrow{J}=0\) .
The two vectors are unitary, because \( \left\| \overrightarrow{I} \right\|=\left\| \overrightarrow{J} \right\|=1\) .
We go from \( \overrightarrow{i}\) to \( \overrightarrow{j}\) in the reverse direction, because
\( \widehat{(\overrightarrow{I},\overrightarrow{J})}= \widehat{(\overrightarrow{i},\overrightarrow{J})}-\widehat{(\overrightarrow{i},\overrightarrow{I})} =-\frac{\pi}{4}-\frac{\pi}{4}=-\frac{\pi}{2}\) .

5.2 Definition of an Orthonormal Basis

Definition 6

A basis \( B=(\overrightarrow{I},\overrightarrow{J})\) is said to be orthonormal if and only if the following assertions are fulfilled.

The two vectors are orthogonal to each other (\( \overrightarrow{I}\bot\overrightarrow{J}\) ), that is \( \overrightarrow{I}\cdot\overrightarrow{J}=0\) .
The two vectors are unitary, that is \( \left\| \overrightarrow{I} \right\|=\left\| \overrightarrow{J} \right\|=1\) .

Moreover, the orthonormal basis \( B=(\overrightarrow{I},\overrightarrow{J})\) is said to be:

a direct orthonormal basis if we go from \( \overrightarrow{I}\) to \( \overrightarrow{J}\) in the direct direction, that is \( \widehat{(\overrightarrow{I},\overrightarrow{J})}=+\frac{\pi}{2}\) ,
and a reverse orthonormal basis if we go from \( \overrightarrow{I}\) to \( \overrightarrow{J}\) in the reverse direction, that is \( \widehat{(\overrightarrow{I},\overrightarrow{J})}=-\frac{\pi}{2}\)

A direct consequence of these definition is the following theorem.

Theorem 8

If \( B=(\overrightarrow{I},\overrightarrow{J})\) is a direct orthonormal basis, then \( B'=(\overrightarrow{J},\overrightarrow{I})\) is a reverse orthonormal basis.

And if \( B=(\overrightarrow{I},\overrightarrow{J})\) is a reverse orthonormal basis, then \( B'=(\overrightarrow{J},\overrightarrow{I})\) is a direct orthonormal basis.

This is because \( \widehat{(\overrightarrow{J},\overrightarrow{I})} =-\widehat{(\overrightarrow{I},\overrightarrow{J})}\) .

5.3 Transition Matrix from an Orthonormal Basis to the Canonical Basis

5.3.1 Transition Matrix from a Direct Orthonormal Basis to the Canonical Basis

Figure 29. Generic direct orthonormal basis

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a direct orthonormal basis and consider the angle \( \theta=\widehat{(\overrightarrow{i},\overrightarrow{I})}\) .

Then the coordinates of \( \overrightarrow{I}\) and \( \overrightarrow{J}\) in the canonical basis are the following:

\( [\overrightarrow{I}]_{B_0}=\begin{bmatrix}\cos(\theta)\\\sin(\theta)\end{bmatrix}\) ,
and \( [\overrightarrow{J}]_{B_0}=\begin{bmatrix} \cos\left(\theta+\frac{\pi}{2}\right)\\ \sin\left(\theta+\frac{\pi}{2}\right)\end{bmatrix}= \begin{bmatrix}-\sin(\theta)\\\cos(\theta)\end{bmatrix}\) .

Consequently, the following assertions are fulfilled.

The transition matrix from the basis \( B\) to the canonical basis is equal to \( P=\begin{bmatrix} \cos(\theta)&-\sin(\theta)\\ \sin(\theta)&\cos(\theta) \end{bmatrix}\) , the matrix of the rotation of angle \( \theta\) .
And the transition matrix from the canonical basis to the basis \( B\) is equal to:
\( P^{-1} =\frac{1}{\cos^{2}(\theta)+\sin^{2}(\theta)} \begin{bmatrix} \cos(\theta)&\sin(\theta)\\{-\sin(\theta)}&\cos(\theta) \end{bmatrix} =\begin{bmatrix} \cos(\theta)&\sin(\theta)\\{-\sin(\theta)}&\cos(\theta) \end{bmatrix}\)
It is the matrix of the rotation of angle \( -\theta\) .

5.3.2 Transition Matrix from a Reverse Orthonormal Basis to the Canonical Basis

Figure 30. Generic reverse orthonormal basis

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a reverse orthonormal basis and consider the angle \( \theta=\widehat{(\overrightarrow{i},\overrightarrow{I})}\) .

Then the coordinates of \( \overrightarrow{I}\) and \( \overrightarrow{J}\) in the canonical basis are the following:

\( [\overrightarrow{I}]_{B_0}=\begin{bmatrix}\cos(\theta)\\\sin(\theta)\end{bmatrix}\) ,
and \( [\overrightarrow{J}]_{B_0}=\begin{bmatrix} \cos\left(\theta-\frac{\pi}{2}\right)\\ \sin\left(\theta-\frac{\pi}{2}\right)\end{bmatrix}= \begin{bmatrix}\sin(\theta)\\{-\cos(\theta)}\end{bmatrix}\) .

Consequently, the following assertions are fulfilled.

The transition matrix from the basis \( B\) to the canonical basis is equal to \( P=\begin{bmatrix} \cos(\theta)&\sin(\theta)\\ \sin(\theta)&-\cos(\theta) \end{bmatrix}\) .
And the transition matrix from the canonical basis to the basis \( B\) is equal to:
\( P^{-1} =\frac{1}{\cos^{2}(\theta)+\sin^{2}(\theta)} \begin{bmatrix} \cos(\theta)&\sin(\theta)\\\sin(\theta)&-\cos(\theta) \end{bmatrix} =\begin{bmatrix} \cos(\theta)&\sin(\theta)\\\sin(\theta)&-\cos(\theta) \end{bmatrix} =P\)
\( P\) is thus idempotent, and \( P^{2}=I\) the identity matrix.

5.4 Properties of Orthonormal Bases

5.4.1 The Coordinates of a Vector in the Canonical Basis

Theorem 9

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is an orthonormal basis and consider the vector \( \overrightarrow{u}\in\mathbb{P}\) .

Then the coordinates of \( \overrightarrow{u}\) is the basis \( B\) are \( [\overrightarrow{u}]_B=\begin{bmatrix} \overrightarrow{I}\cdot\overrightarrow{u}\\ \overrightarrow{J}\cdot\overrightarrow{u} \end{bmatrix}\) .

Proof

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is an orthonormal basis and consider the vector \( \overrightarrow{u}\in\mathbb{P}\) .

Consider the coordinates of \( \overrightarrow{u}\) is the basis \( B\) : \( [\overrightarrow{u}]_B=\begin{bmatrix}x\\y\end{bmatrix}\) .

Then \( \overrightarrow{u}=x\overrightarrow{I}+y\overrightarrow{J}\) .

Consequently, because of the bilinearity of the dot product and the fact that \( B\) is an orthonormal basis, we have:

\( \overrightarrow{I}\cdot\overrightarrow{u} =x(\overrightarrow{I}\cdot\overrightarrow{I}) +y(\overrightarrow{I}\cdot\overrightarrow{J})=x\) and \( \overrightarrow{J}\cdot\overrightarrow{u} =x(\overrightarrow{J}\cdot\overrightarrow{I}) +y(\overrightarrow{J}\cdot\overrightarrow{J})=y\) .

5.4.2 The Dot Product and the Norm do not Depend on the Orthonormal Basis

Theorem 10

Consider the canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) .

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is an orthonormal basis.

Consider the vectors \( (\overrightarrow{u}_1,\overrightarrow{u}_2)\in\mathbb{P}^2\) , with coordinates in the canonical basis and in the basis \( B\) :

\( [\overrightarrow{u}_1]_{B_0}=\begin{bmatrix}x_1\\y_1\end{bmatrix}\) and \( [\overrightarrow{u}_1]_{B}=\begin{bmatrix}x'_1\\y'_1\end{bmatrix}\) ,
and \( [\overrightarrow{u}_2]_{B_0}=\begin{bmatrix}x_2\\y_2\end{bmatrix}\) and \( [\overrightarrow{u}_2]_{B}=\begin{bmatrix}x'_2\\y'_2\end{bmatrix}\) .

Consider the dot product of \( \overrightarrow{u}_1\) and \( \overrightarrow{u}_2\) :

\( \overrightarrow{u}_1\cdot\overrightarrow{u}_2=x_1x_2+y_1y_2\)

Then we have also:

\( \overrightarrow{u}_1\cdot\overrightarrow{u}_2=x'_1x'_2+y'_1y'_2\)

Corollary 1

Consider the canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) .

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is an orthonormal basis.

Consider the vector \( \overrightarrow{u}\in\mathbb{P}\) , with coordinates in the canonical basis and in the basis \( B\) :

\( [\overrightarrow{u}]_{B_0}=\begin{bmatrix}x\\y\end{bmatrix}\) and \( [\overrightarrow{u}]_{B}=\begin{bmatrix}x'\\y'\end{bmatrix}\) ,

Consider the norm of \( \overrightarrow{u}\) and \( \overrightarrow{u}\) :

\( \left\| \overrightarrow{u}\right\|=\sqrt{x^2+y^2}\)

Then we have also:

\( \left\| \overrightarrow{u}\right\|=\sqrt{x'^2+y'^2}\)

This is because \( \left\| \overrightarrow{u}\right\|=\sqrt{\overrightarrow{u}\cdot\overrightarrow{u}}\) .

Proof (of the theorem 10)

Consider the canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) .

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is an orthonormal basis.

Consider the vectors \( (\overrightarrow{u}_1,\overrightarrow{u}_2)\in\mathbb{P}^2\) , with coordinates in the basis \( B\) :

\( [\overrightarrow{u}_1]_{B}=\begin{bmatrix}x'_1\\y'_1\end{bmatrix}\) , so that \( \overrightarrow{u}_{1}=x'_{1}\overrightarrow{I}+y'_{1}\overrightarrow{J}\) ,
and \( [\overrightarrow{u}_2]_{B}=\begin{bmatrix}x'_2\\y'_2\end{bmatrix}\) , so that \( \overrightarrow{u}_{2}=x'_{2}\overrightarrow{I}+y'_{2}\overrightarrow{J}\) .

Consequently, because of the bilinearity of the dot product and the fact that \( B\) is an orthonormal basis, we have:

\( \overrightarrow{u}_{1}\cdot\overrightarrow{u}_{2} =x'_{1}x'_{2}(\overrightarrow{I}\cdot\overrightarrow{I}) +x'_{1}y'_{2}(\overrightarrow{I}\cdot\overrightarrow{J}) +y'_{1}x'_{2}(\overrightarrow{J}\cdot\overrightarrow{I}) +y'_{1}y'_{2}(\overrightarrow{I}\cdot\overrightarrow{J})\)

\( =x'_{1}x'_{2}+y'_{1}y'_{2}\) .

5.4.3 Application to the Formula of the Dot Product with the Cosine

We shall apply the theorem 10 to the formula \( \overrightarrow{u}\cdot\overrightarrow{v}= \left\| \overrightarrow{u} \right\| \left\| \overrightarrow{v} \right\|\cos(\theta)\) .

Assume that \( (\overrightarrow{u},\overrightarrow{v})\in(\mathbb{P}^*)^2\) are non zero vectors and consider their angle \( \theta=\widehat{(\overrightarrow{u},\overrightarrow{v})}\) .

Consider the orthonormal basis \( B=(\overrightarrow{I},\overrightarrow{J})\) such that:

\( \overrightarrow{I}=\frac{\overrightarrow{u}}{\left\| \overrightarrow{u} \right\|}\) ,
and, if \( \overrightarrow{I}=\begin{bmatrix}I_1\\I_2\end{bmatrix}\) , then \( \overrightarrow{J}=\begin{bmatrix}-I_2\\I_1\end{bmatrix}\) .

Then, in the basis \( B\) , we have:

\( [\overrightarrow{u}]_B=\begin{bmatrix}\left\| \overrightarrow{u} \right\|\\ \\0\end{bmatrix}\) ,
and \( [\overrightarrow{v}]_B=\begin{bmatrix} \left\| \overrightarrow{v} \right\|\cos(\theta)\\ \\ \left\| \overrightarrow{v}\right\|\sin(\theta)\end{bmatrix}\) .

Consequently, \( \overrightarrow{u}\cdot\overrightarrow{v}= \left\| \overrightarrow{u} \right\| \left\| \overrightarrow{v} \right\|\cos(\theta)\) . QED

5.4.4 The Matrix of a Rotation does not Depend on the Direct Orthonormal Basis

Assume that \( \alpha\in\mathbb{R}\) is a real number, and consider the rotation \( \rho_{\alpha}\) of angle \( \alpha\) .

Then the matrix of \( \rho_{\alpha}\) in the canonical basis \( B_{0}\) is \( A_{\alpha}= \begin{bmatrix}\cos(\alpha)&-\sin(\alpha)\\ \sin(\alpha)&\cos(\alpha) \end{bmatrix}\) .

Assume that \( B=(\overrightarrow{I},\overrightarrow{J})\) is a direct orthonormal basis, and consider the angle \( \theta=\widehat{(\overrightarrow{i},\overrightarrow{I})}\) between the first vectors of the bases \( B_{0}\) and \( B\) .

Then the transition matrix from the basis \( B\) to the canonical basis \( B_{0}\) is \( P=\begin{bmatrix} \cos(\theta)&-\sin(\theta)\\\sin(\theta)&\cos(\theta) \end{bmatrix}\) , and its inverse is \( P^{-1}=\begin{bmatrix} \cos(\theta)&\sin(\theta)\\{-\sin(\theta)}&\cos(\theta) \end{bmatrix}\) .

Consequently, the matrix of \( \rho_{\alpha}\) in the basis \( B\) is equal to \( B_{\alpha}=P^{-1}A_{\alpha}P\) .

Let’s prove that \( B_{\alpha}=A_{\alpha}\) .

\( B_{\alpha}=\begin{bmatrix} \cos(\theta)&\sin(\theta)\\{-\sin(\theta)}&\cos(\theta) \end{bmatrix} \begin{bmatrix}\cos(\alpha)&{-\sin(\alpha)} \\\sin(\alpha)&\cos(\alpha) \end{bmatrix} \begin{bmatrix} \cos(\theta)&-\sin(\theta)\\\sin(\theta)&\cos(\theta) \end{bmatrix}\)

\( =\begin{bmatrix} \cos(\theta)&\sin(\theta)\\{-\sin(\theta)}&\cos(\theta) \end{bmatrix}\) \( \begin{bmatrix} \cos(\alpha)\cos(\theta)-\sin(\alpha)\sin(\theta)& -\cos(\alpha)\sin(\theta)-\sin(\alpha)\cos(\theta) \\ \sin(\alpha)\cos(\theta)+\cos(\alpha)\sin(\theta)& -\sin(\alpha)\sin(\theta) +\cos(\alpha)\cos(\theta) \end{bmatrix}\)

\( =\begin{bmatrix} \cos(\theta)&\sin(\theta)\\{-\sin(\theta)}&\cos(\theta) \end{bmatrix} \begin{bmatrix} \cos(\alpha+\theta)&-\sin(\alpha+\theta) \\ \sin(\alpha+\theta)&\cos(\alpha+\theta) \end{bmatrix}\)

\( =\begin{bmatrix} \cos(\theta)\cos(\alpha+\theta) +\sin(\theta)\sin(\alpha+\theta)& -\cos(\theta)\sin(\alpha+\theta) +\sin(\theta)\cos(\alpha+\theta) \\ -\sin(\theta)\cos(\alpha+\theta) +\cos(\theta)\sin(\alpha+\theta)& \sin(\theta)\sin(\alpha+\theta) +\cos(\theta)\cos(\alpha+\theta) \end{bmatrix}\)

\( =\begin{bmatrix} \cos(\theta-(\alpha+\theta)) & \sin(\theta-(\alpha+\theta) \\ \sin((\alpha+\theta)-\theta) & \cos(\theta-(\alpha+\theta)) \end{bmatrix}\)

\( =\begin{bmatrix}\cos(-\alpha)&\sin(-\alpha) \\\sin(\alpha)&\cos(-\alpha) \end{bmatrix} =\begin{bmatrix}\cos(\alpha)&-\sin(\alpha) \\\sin(\alpha)&\cos(\alpha) \end{bmatrix}=A_{\alpha}\) QED

5.5 Matrix of a Reflexion in a Straight Line in the Canonical Basis

5.5.1 Matrix of the Reflexion in the Bisector of the Axes

Figure 32. The bisector of the axes of the canonical basis

Consider the canonical basis \( B_0=(\overrightarrow{i},\overrightarrow{j})\) , and consider the bisector \( (B)\) of the axes, of equation \( y=x\) in \( B_{0}\) .

Consider the reflexion \( \sigma_0\) in the line \( (B)\) .

Then \( \sigma_0\) exchanges \( \overrightarrow{i}\) and \( \overrightarrow{j}\) .

Consequently, the matrix of \( \sigma_0\) in the canonical basis is \( S_0=\begin{bmatrix}0&1\\1&0 \end{bmatrix}\) .

5.5.2 Matrix of a Reflexion in a Straight Line in a Special Basis

The bisector of the axes of a specialy built direct orthonormal
basis — Figure 33. The bisector of the axes of a specialy built direct orthonormal basis

Consider the straight line \( (D)\) directed by the unit vector \( \overrightarrow{u}_0\) , of angle \( \theta\) with the vector \( \overrightarrow{i}\) .

Consider the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) where:

\( \overrightarrow{I}\) is the unit vector with angle \( \theta-\frac{\pi}{4}\) with the vector \( \overrightarrow{i}\) ,
and \( \overrightarrow{J}\) is the unit vector with angle \( \theta+\frac{\pi}{4}\) with the vector \( \overrightarrow{i}\)

Then the basis \( B=(\overrightarrow{I},\overrightarrow{J})\) is a direct orthonormal basis, and the transition matrix from the basis \( B\) to the canonical basis is \( P=\begin{bmatrix} \cos(\theta-\frac{\pi}{4})&-\sin(\theta-\frac{\pi}{4})\\ \sin(\theta-\frac{\pi}{4})&\cos(\theta-\frac{\pi}{4}) \end{bmatrix}\) , with inverse \( P^{-1}=\begin{bmatrix} \cos(\theta-\frac{\pi}{4})&\sin(\theta-\frac{\pi}{4})\\ -\sin(\theta-\frac{\pi}{4})&\cos(\theta-\frac{\pi}{4}) \end{bmatrix}\) .

Moreover, the line \( (D)\) is the bisector of the axes of the orthonormal basis \( B=(\overrightarrow{I},\overrightarrow{J})\) , and consequently the matrix, in the basis \( B\) , of the reflection \( \sigma\) in the line \( (D)\) is \( S_0=\begin{bmatrix}0&1\\1&0 \end{bmatrix}\) .

5.5.3 Matrix of the Reflection \( \sigma\) in the Canonical Basis

Let’s denote \( S\) the matrix in the canonical basis of the reflection \( \sigma\) in the line \( (D)\) .

Then \( S_0=P^{-1}SP\) , so that \( S=PS_0P^{-1}\) .

Let’s prove that \( S=\begin{bmatrix} -\cos(2\theta)&\sin(2\theta)\\ \sin(2\theta)&\cos(2\theta) \end{bmatrix}\) .

\( S=PS_0P^{-1}\)

\( =\begin{bmatrix} \cos(\theta-\frac{\pi}{4})&\sin(\theta-\frac{\pi}{4})\\ -\sin(\theta-\frac{\pi}{4})&\cos(\theta-\frac{\pi}{4}) \end{bmatrix} \begin{bmatrix}0&1\\1&0 \end{bmatrix} \begin{bmatrix} \cos(\theta-\frac{\pi}{4})&-\sin(\theta-\frac{\pi}{4})\\ \sin(\theta-\frac{\pi}{4})&\cos(\theta-\frac{\pi}{4}) \end{bmatrix}\)

\( =\begin{bmatrix} \cos(\theta-\frac{\pi}{4})&\sin(\theta-\frac{\pi}{4})\\ -\sin(\theta-\frac{\pi}{4})&\cos(\theta-\frac{\pi}{4}) \end{bmatrix} \begin{bmatrix} \sin(\theta-\frac{\pi}{4})&\cos(\theta-\frac{\pi}{4})\\ \cos(\theta-\frac{\pi}{4})&-\sin(\theta-\frac{\pi}{4}) \end{bmatrix}\)

Let’s denote \( \alpha=\theta-\frac{\pi}{4}\) .

Then we have:

\( S=\begin{bmatrix} \cos(\alpha)&\sin(\alpha)\\ -\sin(\alpha)&\cos(\alpha)) \end{bmatrix} \begin{bmatrix} \sin(\alpha)&\cos(\alpha)\\ \cos(\alpha)&-\sin(\alpha) \end{bmatrix}\)

\( =\begin{bmatrix} 2\cos(\alpha)\sin(\alpha)&\cos^2(\alpha)-\sin^2(\alpha)\\ \cos^2(\alpha)-\sin^2(\alpha)&-2\cos(\alpha)\sin(\alpha) \end{bmatrix}\)

\( =\begin{bmatrix} \sin(2\alpha)&\cos(2\alpha)\\ \cos(2\alpha)&-\sin(2\alpha) \end{bmatrix}\)

But \( 2\alpha=2\theta-\frac{\pi}{2}\) , so that:

\( S=\begin{bmatrix} \sin(2\theta-\frac{\pi}{2})&\cos(2\theta-\frac{\pi}{2})\\ \cos(2\theta-\frac{\pi}{2})&-\sin(2\theta-\frac{\pi}{2}) \end{bmatrix} =\begin{bmatrix} -\cos(2\theta)&\sin(2\theta)\\ \sin(2\theta)&\cos(2\theta) \end{bmatrix}\) QED

5.5.4 Matrix in the Canonical Basis of a Reflection Along a Line Given by its Equation

Assume now that the line \( (D)\) has equation \( ax+by=0\) .

Then it is directed by the unit vector \( \overrightarrow{u_0} =\begin{bmatrix} \frac{b}{\sqrt{a^2+b^2}}\\ -\frac{a}{\sqrt{a^2+b^2}}\end{bmatrix}\) , and the angle \( \theta\) is such that \( \overrightarrow{u_0}=\begin{bmatrix}\cos(\theta)\\\sin(\theta)\end{bmatrix}\) .

Consequently, \( \cos(\theta)=\frac{b}{\sqrt{a^2+b^2}}\) and \( \sin(\theta)=-\frac{a}{\sqrt{a^2+b^2}}\) .

But with the trigonometric formulae:

\( S=\begin{bmatrix} -\cos^2(\theta)+\sin^2(\theta)&2\sin(\theta)\cos(\theta)\\ 2\sin(\theta)\cos(\theta)&\cos^2(\theta)-\sin^2(\theta) \end{bmatrix}\) .

So that \( S=\begin{bmatrix} \frac{-{a^2+b^2}}{a^2+b^2}&\frac{2ab}{a^2+b^2}\\ \\ \frac{2ab}{a^2+b^2}&\frac{{a^2-b^2}}{a^2+b^2} \end{bmatrix}\) .

QED

6 Conclusion

The bases in the vector plane \( \mathbb{P}\) are a generalisation of the canonical basis that allow like the latter to reference the vectors with their coordinates.

Important particular cases are the orthonormal bases in the euclidean plane, in which the coordinates of any vector are easy to calculate with simple dot products.

With the matrices tool, we may calculate the coordinates of any vector of the plane in any basis given its coordinates in any other basis.

We may also calculate the matrix of a linear mapping in any basis given the matrix of that linear mapping in any other basis.

And we gave in that text some examples of application of that.

1 Linear Algebra Courses

1.1 Linear Algebra in the Euclidean Plane

1.1.1 Documents

1.1.2 Tests

Collapse and expand sections

Cross-references and related material

Discussions

Publications

1 Linear Algebra Courses

1.1 Linear Algebra in the Euclidean Plane

1.1.1 Documents

1.1.2 Tests

Table of contents

1 Introduction

2 The Bases in \( \mathbb{P}\)

2.1 The Canonical Basis

2.2 Linearly Independent and Linearly Dependent Ordered Sets

2.2.1 Definitions

2.2.2 First Example: An Ordered Set Made of One Non Zero Vector

2.2.3 Second Example: An Ordered Set Made of Two Non Aligned Non Zero Vectors

2.2.4 Third Example: An Ordered Set Made of Two Aligned Non Zero Vectors

2.2.5 Fourth Example: An Ordered Set Made of Three not all Aligned Non Zero Vectors

2.2.6 Characteristic Properties of the Linearly Dependent and Linearly Independent Ordered Sets

2.3 Spanning and Non Spanning Ordered Sets

2.3.1 Definitions

2.3.2 First Example: An Ordered Set Made of One Non Zero Vector

2.3.3 Second Example: An Ordered Set Made of Two Non Aligned Non Zero Vectors

2.3.4 Third Example: An Ordered Set Made of Two Aligned Non Zero Vectors

2.3.5 Fourth Example: An Ordered Set Made of Three not all Aligned Non Zero Vectors

2.3.6 Properties of the Spanning Ordered Sets

2.4 Definition of a Basis in \( \mathbb{P}\)

2.5 The Coordinates of a Vector in any Basis

2.6 Geometrical View of a Linear Combination of 2 Non Aligned Non Zero Vectors

2.6.1 First Case: 2 Positive Coefficients

2.6.2 Second Case: 1 Positive Coefficient, then 1 Negative Coefficient

2.6.3 Third Case: 1 Negative Coefficient, then 1 Positive Coefficient

2.6.4 Fourth Case: 2 Negative Coefficients

2.7 Geometrical View of the Coordinates of a Vector in any Basis

2.7.1 Geometrical View of a Basis of the Plane

2.7.2 Geometrical View of the Coordinates in \( B\) of a Vector Positively along the \( X\) Axis

2.7.3 Geometrical View of the Coordinates in \( B\) of a Vector in the First Quadrant of \( B\)

2.7.4 Geometrical View of the Coordinates in \( B\) of a Vector Positively along the \( Y\) Axis

2.7.5 Geometrical View of the Coordinates in \( B\) of a Vector in the Second Quadrant of \( B\)

2.7.6 Geometrical View of the Coordinates in \( B\) of a Vector Negatively along the \( X\) Axis

2.7.7 Geometrical View of the Coordinates in \( B\) of a Vector in the Third Quadrant of \( B\)

2.7.8 Geometrical View of the Coordinates in \( B\) of a Vector Negatively along the \( Y\) Axis

2.7.9 Geometrical View of the Coordinates in \( B\) of a Vector in the Fourth Quadrant of \( B\)

3 Change of Basis and Transition Matrix

3.1 Introduction to the Transition Matrix

3.1.1 Example of Two Aligned Non Zero Vectors

3.1.2 Example of Two Non Aligned Non Zero Vectors

3.2 Equivalent Conditions for Two Vectors to be a Basis of the Plane

3.3 The Transition Matrix from the New Basis to the Canonical Basis

3.4 The Transition Matrix from the New Basis to the Old Basis

4 Change of Basis for the Matrix of a Linear Mapping

4.1 Definition of the Matrix of a Linear Mapping in any Basis

4.2 Particular Cases of Linear Mappings

4.3 Change of Basis for a Linear Mapping

4.4 Application to the Calculation of the Matrix in the Canonical Basis of a Projection

5 Direct and Reverse Orthonormal Bases

5.1 Examples of Direct and Reverse Orthonormal Basis

5.1.1 First Example

5.1.2 Second Example

5.1.3 Third Example

5.1.4 Fourth Example

5.2 Definition of an Orthonormal Basis

5.3 Transition Matrix from an Orthonormal Basis to the Canonical Basis

5.3.1 Transition Matrix from a Direct Orthonormal Basis to the Canonical Basis

5.3.2 Transition Matrix from a Reverse Orthonormal Basis to the Canonical Basis

5.4 Properties of Orthonormal Bases

5.4.1 The Coordinates of a Vector in the Canonical Basis

5.4.2 The Dot Product and the Norm do not Depend on the Orthonormal Basis

5.4.3 Application to the Formula of the Dot Product with the Cosine

5.4.4 The Matrix of a Rotation does not Depend on the Direct Orthonormal Basis

5.5 Matrix of a Reflexion in a Straight Line in the Canonical Basis

5.5.1 Matrix of the Reflexion in the Bisector of the Axes

5.5.2 Matrix of a Reflexion in a Straight Line in a Special Basis

5.5.3 Matrix of the Reflection \( \sigma\) in the Canonical Basis

5.5.4 Matrix in the Canonical Basis of a Reflection Along a Line Given by its Equation

6 Conclusion