Feature Dimensions

Overview


Feature dimensions refers to the number of degrees of freedom that a sample datapoint has. In the standard case, a data point is represented by a vector
{% \vec{v} \in \mathbb{R}^n %}
That is, the vector would look like a column vector
{% \begin{bmatrix} a \\ c \\ e \\ \end{bmatrix} %}

Projection


An example of a dataset that spans 3 dimensions.


The dataset can be reduced to two dimensions by just ignoring one dimension. The following shows the same dataset but graphed in only 2 dimensions



The result is effectively to remove one feature, or alternatively, setting it to zero. In this case, this is just projecting each point down to its shadow on the x-y plane. This type of transformation is often called a projection.
{% f(\begin{bmatrix} x \\ y \\ z \\ \end{bmatrix}) \rightarrow \begin{bmatrix} x \\ y \\ 0 \\ \end{bmatrix} %}

Alternative Algorithms


Just removing dimensions is often not the best algorithm, as it tends to discard valuable information. Other algorithms exist which are designed to retain as much information as possible when pruning dimensions.

Sparsity


Contents