A Brief Introduction to Bayesian Nonparametric Methods for Clustering and Time Series Analysis

Scott Niekum

Tech. Report, CMU-RI-TR-15-02, Robotics Institute, Carnegie Mellon University, 2015

View Publication

Abstract

Many descriptions of Bayesian nonparametric methods assume advanced mathematical and statistical proficiency. The goal of this tutorial is to provide a conceptual introduction to Bayesian nonparametrics that assumes only basic knowledge of standard Bayesian statistics, while also containing a few key derivations that provide mathematical insight into the presented methods. We begin by reviewing the motivation for Bayesian nonparametric methods, including DeFinetti’s theorem. The Dirichlet process and the Chinese restaurant process (and their hier- archical counterparts) are then introduced in a clustering scenario that provides a mathematical and conceptual foundation for understanding more complex models. After reviewing the basics of Hidden Markov Models, these ideas are extended to time series analysis and augmented with priors that enable partial sharing of structure across multiple time series—the Beta process and the Indian buffet process. Finally, we close with a brief discussion of inference via Markov Chain Monte Carlo sampling methods.

BibTeX

@techreport{Niekum-2015-5900,
author = {Scott Niekum},
title = {A Brief Introduction to Bayesian Nonparametric Methods for Clustering and Time Series Analysis},
year = {2015},
month = {January},
institute = {Carnegie Mellon University},
address = {Pittsburgh, PA},
number = {CMU-RI-TR-15-02},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.