Search

Navigator: RI | Publications | Scalable and robust group discovery on large transactional data

Graphics enhanced version of this site

Scalable and robust group discovery on large transactional data
P. Choi, A. Moore, and J.M. Kubica
tech. report CMU-RI-TR-05-60, Robotics Institute, Carnegie Mellon University, December, 2005.

Jump to: Download | Abstract | Notes | Text Reference | BibTeX Reference


Download [Help]

Adobe portable document format (pdf) [753 KB]

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.


Abstract

The need for time-critical analysis and understanding of the underlying group structure from transactional data has been growing in domains such as law enforcement and customs. Kubica et al. (2003) proposed k-groups, an algorithm based on probabilistic generative model for discovering underlying groups in data. Even though k-groups is reported to be signficantly faster than its predecessor GDA (Kubica et al., 2002), k-groups is too slow and memory-intensive for large data in practice. This paper presents XGDA, a framework for scalable and robust group discovery. Evaluation of the performances of XGDA and k-groups shows that XGDA can handle extremely large datasets in reasonable time and yields more robust solutions than k-groups.


Notes

Associated lab/group: Auton Lab
Associated project: Auton Project

Number of pages: 29


Text Reference

P. Choi, A. Moore, and J.M. Kubica, Scalable and robust group discovery on large transactional data, tech. report CMU-RI-TR-05-60, Robotics Institute, Carnegie Mellon University, December, 2005.


BibTeX Reference

@techreport{Choi_2005_5297,
   author = "Pak Yan Choi and Andrew Moore and Jeremy Martin Kubica",
   title = "Scalable and robust group discovery on large transactional data",
   institution = "Robotics Institute, Carnegie Mellon University",
   month = "December",
   year = "2005",
   number = "CMU-RI-TR-05-60",
   address = "Pittsburgh, PA"
}


The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University.
For updates and comments, please see these instructions.
This page maintained by robotwebmaster@ri.cmu.edu