SVM Decision Boundary Based Discriminative Subspace Induction - Robotics Institute Carnegie Mellon University

SVM Decision Boundary Based Discriminative Subspace Induction

Jiayong Zhang and Yanxi Liu
Tech. Report, CMU-RI-TR-02-15, Robotics Institute, Carnegie Mellon University, June, 2002

Abstract

Dimensionality reduction is widely accepted as an analysis and modeling tool to deal with high-dimensional spaces, although researches from different disciplines have different interpretations of what properties should be preserved in the reduction process. We study the problem of linear dimension reduction for classification, with a focus on sufficient dimension reduction, i.e., inducing subspaces without loss of discriminative information. Decision boundary analysis (DBA), originally proposed by Lee & Landgrebe (1993), can directly find the smallest subspace with such property. However, existing DBA implementations are computationally expensive and sensitive to sample size. In this paper, we first formulate the problem of sufficient dimension reduction for classification in parallel terms as for regression. Disclosures of these connections lead to several meaningful observations. Then we present a novel space reduction algorithm that combines SVM and DBA, thus inheriting several appealing properties from kernel machines such as good generalization, weak assumption, and efficient computation. In addition, the proposed method provides a natural way to reduce the complexity, and even improve the accuracy, of SVM itself. We demonstrate its superiority by comparative experiments on one simulated and four real-world benchmark datasets.

BibTeX

@techreport{Zhang-2002-8469,
author = {Jiayong Zhang and Yanxi Liu},
title = {SVM Decision Boundary Based Discriminative Subspace Induction},
year = {2002},
month = {June},
institute = {Carnegie Mellon University},
address = {Pittsburgh, PA},
number = {CMU-RI-TR-02-15},
keywords = {classification, linear dimension reduction, sufficient dimension reduction, decision boundary analysis, support vector machine, regression},
}