Clustering

Overview


The machine learning community uses a number of standard sample datasets to test models and to teach concepts. The davinci site hosts some of these datasets so that the various machine learning algorithms can be run on them easily.

Distance Measures


A key concept in clustering is the distance between two objects. Three kinds of distance arise here: the distance between two points in the dataset, the distance between two clusters, and the distance between a point and a cluster. The concept of distance is formalized by the notion of a metric, as used in topology: a function that is non-negative, zero only when the two objects are identical, symmetric, and subject to the triangle inequality.
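The three kinds of distance can be illustrated with a small sketch. The text does not say which measures are used, so the choices below are assumptions made only for illustration: Euclidean distance between points, distance to the centroid for point-to-cluster, and the closest pair of points (single linkage) for cluster-to-cluster. The function names are hypothetical and not taken from any particular library.

```python
import math


def euclidean(p, q):
    """Euclidean distance between two points given as equal-length sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def centroid(cluster):
    """Coordinate-wise mean of the points in a cluster."""
    n = len(cluster)
    return tuple(sum(coords) / n for coords in zip(*cluster))


def point_to_cluster(p, cluster):
    """Point-to-cluster distance, assumed here to be the distance to the centroid."""
    return euclidean(p, centroid(cluster))


def single_linkage(c1, c2):
    """Cluster-to-cluster distance, assumed here to be the closest pair of points."""
    return min(euclidean(p, q) for p in c1 for q in c2)


# Two toy clusters of 2-D points (made-up values, for illustration only).
a = [(0.0, 0.0), (0.0, 2.0)]
b = [(3.0, 4.0), (6.0, 8.0)]

d_points = euclidean((0.0, 0.0), (3.0, 4.0))       # point to point
d_point_cluster = point_to_cluster((3.0, 1.0), a)  # point to cluster
d_clusters = single_linkage(a, b)                  # cluster to cluster
```

Other choices are equally reasonable: Manhattan distance in place of Euclidean, or complete or average linkage in place of single linkage. Which one is appropriate depends on the algorithm and the data.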

Algorithms


Contents