Discriminative Cluster Analysis - Robotics Institute Carnegie Mellon University

Discriminative Cluster Analysis

Conference Paper, Proceedings of (ICML) International Conference on Machine Learning, Vol. 148, pp. 241 - 248, June, 2006

Abstract

Clustering is one of the most widely used statistical tools for data analysis. Among all existing clustering techniques, k-means is a very popular method because of its ease of programming and because it accomplishes a good trade-off between achieved performance and computational complexity. However, k-means is prone to local minima problems, and it does not scale too well with high dimensional data sets. A common approach to dealing with high dimensional data is to cluster in the space spanned by the principal components (PC). In this paper, we show the benefits of clustering in a low dimensional discriminative space rather than in the PC space (generative). In particular, we propose a new clustering algorithm called Discriminative Cluster Analysis (DCA). DCA jointly performs dimensionality reduction and clustering. Several toy and real examples show the benefits of DCA versus traditional PCA+k-means clustering. Additionally, a new matrix formulation is proposed and connections with related techniques such as spectral graph methods and linear discriminant analysis are provided.

BibTeX

@conference{Frade-2006-9514,
author = {Fernando De la Torre Frade and Takeo Kanade},
title = {Discriminative Cluster Analysis},
booktitle = {Proceedings of (ICML) International Conference on Machine Learning},
year = {2006},
month = {June},
volume = {148},
pages = {241 - 248},
publisher = {ACM Press},
address = {New York, NY, USA},
keywords = {Clustering, Linear Discriminant Analysis, Component Analysis},
}