The Robotics Institute
Search the site
RI | Publications | Semantic Learning for Audio Applications: A Computer Vision Approach

Text only version of this site

Semantic Learning for Audio Applications: A Computer Vision Approach
R. Sukthankar, Y. Ke, and D. Hoiem
2006 Conference on Computer Vision and Pattern Recognition Workshop, June, 2006, pp. 112.

Jump to: Download | Abstract | Notes | Text Reference | BibTeX Reference

Download [Help]

Adobe portable document format (pdf) [398 KB]

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Abstract

Recent work in machine learning has significantly benefited semantic extraction tasks in computer vision, particularly for object recognition and image retrieval. We argue that the computer vision techniques that have been successfully applied in those settings can effectively be translated to other domains, such as audio. This claim is supported by recent results in music vs. speech classification, structure from sound, robust music identification and sound object recognition. This paper focuses on two such audio applications and demonstrates how ideas from computer vision map naturally to these problems.

Notes

Associated center: VASC
Associated lab/group: Face Group

Number of pages: 1

Text Reference

R. Sukthankar, Y. Ke, and D. Hoiem, "Semantic Learning for Audio Applications: A Computer Vision Approach," 2006 Conference on Computer Vision and Pattern Recognition Workshop, June, 2006, pp. 112.

BibTeX Reference

@inproceedings{Sukthankar_2006_5618,
   author = "Rahul Sukthankar and Y. Ke and Derek Hoiem",
   title = "Semantic Learning for Audio Applications: A Computer Vision Approach",
   booktitle = "2006 Conference on Computer Vision and Pattern Recognition Workshop",
   month = "June",
   year = "2006",
   pages = "112"
}


The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University.
For updates and comments, please see these instructions.
This page maintained by robotwebmaster@ri.cmu.edu