feature dimensions

Feature Dimensions

Overview

Feature dimensions refers to the number of degrees of freedom that a sample datapoint has. In the standard case, a data point is represented by a vector

{% \vec{v} \in \mathbb{R}^n %}

That is, the vector would look like a column vector

{% \begin{bmatrix} a \\ c \\ e \\ \end{bmatrix} %}

Projection

An example of a dataset that spans 3 dimensions.

The dataset can be reduced to two dimensions by just ignoring one dimension. The following shows the same dataset but graphed in only 2 dimensions

The result is effectively to remove one feature, or alternatively, setting it to zero. In this case, this is just projecting each point down to its shadow on the x-y plane. This type of transformation is often called a projection.

{% f(\begin{bmatrix} x \\ y \\ z \\ \end{bmatrix}) \rightarrow \begin{bmatrix} x \\ y \\ 0 \\ \end{bmatrix} %}

Alternative Algorithms

Just removing dimensions is often not the best algorithm, as it tends to discard valuable information. Other algorithms exist which are designed to retain as much information as possible when pruning dimensions.

Overview

Projection

Alternative Algorithms

Sparsity

Contents