The Robotics Institute

Classification in Very High Dimensional Problems with Handfuls of Examples
M. Palatucci and T. Mitchell
Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), Springer-Verlag, September, 2007.

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract

Modern classification techniques perform well when the number of training examples exceeds the number of features. If, however, the number of features greatly exceeds the number of training examples, then these same techniques can fail. To address this problem, we present a hierarchical Bayesian framework that shares information between features by modeling similarities between their parameters. We believe this approach is applicable to many sparse, high dimensional problems and especially relevant to those with both spatial and temporal components. One such problem is fMRI time series, and we present a case study that shows how we can successfully classify in this domain with 80,000 original features and only 2 training examples per class.
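
The general idea of sharing information between features can be illustrated with a small sketch. This is not the model from the paper, whose details are not given on this page. It assumes a much simpler normal-normal setup in which each feature's class mean is drawn from a prior centered on the mean pooled over all features of that class, so that each per-feature estimate is shrunk toward the pooled value. The function names and the `prior_strength` parameter (the assumed ratio of noise variance to prior variance) are hypothetical.

```python
import numpy as np

def shrunken_class_means(X, y, prior_strength=1.0):
    """Estimate per-class, per-feature means with shrinkage.

    Assumption (not from the paper): feature means within a class share a
    Gaussian prior centered on the class's mean pooled over all features.
    The posterior mean is then a weighted average of the per-feature sample
    mean and the pooled mean.
    """
    means = {}
    for c in np.unique(y):
        Xc = X[y == c]                     # examples of class c, shape (n_c, d)
        n_c = Xc.shape[0]
        sample_mean = Xc.mean(axis=0)      # per-feature sample mean, shape (d,)
        pooled_mean = sample_mean.mean()   # scalar shared across features
        w = n_c / (n_c + prior_strength)   # weight on the data; grows with n_c
        means[c] = w * sample_mean + (1.0 - w) * pooled_mean
    return means

def predict(means, X):
    """Assign each row of X to the class with the nearest shrunken mean.

    This corresponds to a Gaussian naive Bayes rule with equal, fixed
    variances and equal class priors (again an assumption of this sketch).
    """
    classes = sorted(means)
    # Squared Euclidean distance from every example to every class mean.
    d2 = np.stack([((X - means[c]) ** 2).sum(axis=1) for c in classes], axis=1)
    return np.array(classes)[d2.argmin(axis=1)]

if __name__ == "__main__":
    # Two training examples per class and many features, echoing the
    # regime described in the abstract (synthetic data, for illustration).
    rng = np.random.default_rng(0)
    d = 1000
    X_train = np.vstack([rng.normal(0.0, 1.0, (2, d)),
                         rng.normal(0.5, 1.0, (2, d))])
    y_train = np.array([0, 0, 1, 1])
    class_means = shrunken_class_means(X_train, y_train, prior_strength=2.0)
    X_new = rng.normal(0.5, 1.0, (5, d))
    print(predict(class_means, X_new))
```

With few examples per class the weight `w` is small, so the estimates lean on the pooled value. As the number of examples grows, they approach the ordinary per-feature sample means.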

Notes

Number of pages: 12

Text Reference

M. Palatucci and T. Mitchell, "Classification in Very High Dimensional Problems with Handfuls of Examples," Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), Springer-Verlag, September, 2007.

BibTeX Reference

@inproceedings{Palatucci_2007_5786,
   author = "Mark Palatucci and Tom Mitchell",
   title = "Classification in Very High Dimensional Problems with Handfuls of Examples",
   booktitle = "Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD)",
   month = "September",
   year = "2007",
   publisher = "Springer-Verlag"
}


The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University.