Linear Discriminant – A New Method for Speaker Normalization

Martin Westphal, Tanja Schultz, and Alex Waibel

Conference Paper, Proceedings of 5th International Conference on Spoken Language Processing (ICSLP '98), December, 1998

View Publication

Abstract

In Vocal Tract Length Normalization (VTLN) a linear or nonlinear frequency transformation compensates for different vocal tract lengths. Finding good estimates for the speaker specific warp parameters is a critical issue. Despite good results using the Maximum Likelihood criterion to find parameters for a linear warping, there are concerns using this method. We searched for a new criterion that enhances the interclass separability in addition to optimizing the distribution of each phonetic class. Using such a criterion Linear Discriminant Analysis determines a linear transformation in a lower dimensional space. For VTLN, we keep the dimension constant and warp the training samples of each speaker such that the Linear Discriminant is optimized. Although that criterion depends on all training samples of all speakers it can iteratively provide speaker specific warp factors. We discuss how this approach can be applied in speech recognition and present first results on two different recognition tasks.

BibTeX

@conference{Westphal-1998-14814,
author = {Martin Westphal and Tanja Schultz and Alex Waibel},
title = {Linear Discriminant - A New Method for Speaker Normalization},
booktitle = {Proceedings of 5th International Conference on Spoken Language Processing (ICSLP '98)},
year = {1998},
month = {December},
keywords = {Speech Recognition, Speaker Normalization, VTLN, Linear Discriminant, German Spontaneous Scheduling Task, Chinese GlobalPhone},
}

Copyright notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.