Discriminative and Bayesian techniques for hidden Markov model speech recognition systems

Show simple item record

dc.contributor.advisor Botha, Elizabeth C. en
dc.contributor.postgraduate Purnell, Darryl William en
dc.date.accessioned 2013-09-07T15:00:35Z
dc.date.available 2005-11-01 en
dc.date.available 2013-09-07T15:00:35Z
dc.date.created 2001-04-01 en
dc.date.issued 2006-11-01 en
dc.date.submitted 2005-10-31 en
dc.description Thesis (PhD (Electronic Engineering))--University of Pretoria, 2006. en
dc.description.abstract The collection of large speech databases is not a trivial task (if done properly). It is not always possible to collect, segment and annotate large databases for every task or language. It is also often the case that there are imbalances in the databases, as a result of little data being available for a specific subset of individuals. An example of one such imbalance is the fact that there are often more male speakers than female speakers (or vice-versa). If there are, for example, far fewer female speakers than male speakers, then the recognizers will tend to work poorly for female speakers (as compared to performance for male speakers). This thesis focuses on using Bayesian and discriminative training algorithms to improve continuous speech recognition systems in scenarios where there is a limited amount of training data available. The research reported in this thesis can be divided into three categories: • Overspecialization is characterized by good recognition performance for the data used during training, but poor recognition performance for independent testing data. This is a problem when too little data is available for training purposes. Methods of reducing overspecialization in the minimum classification error algo¬rithm are therefore investigated. • Development of new Bayesian and discriminative adaptation/training techniques that can be used in situations where there is a small amount of data available. One example here is the situation where an imbalance in terms of numbers of male and female speakers exists and these techniques can be used to improve recognition performance for female speakers, while not decreasing recognition performance for the male speakers. • Bayesian learning, where Bayesian training is used to improve recognition perfor¬mance in situations where one can only use the limited training data available. These methods are extremely computationally expensive, but are justified by the improved recognition rates for certain tasks. This is, to the author's knowledge, the first time that Bayesian learning using Markov chain Monte Carlo methods have been used in hidden Markov model speech recognition. The algorithms proposed and reviewed are tested using three different datasets (TIMIT, TIDIGITS and SUNSpeech), with the tasks being connected digit recognition and con¬tinuous speech recognition. Results indicate that the proposed algorithms improve recognition performance significantly for situations where little training data is avail¬able. en
dc.description.availability unrestricted en
dc.description.department Electrical, Electronic and Computer Engineering en
dc.identifier.citation Purnell, DW 2001, Discriminative and Bayesian techniques for hidden Markov model speech recognition systems, PhD thesis, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/29158 > en
dc.identifier.other H543/ag en
dc.identifier.upetdurl http://upetd.up.ac.za/thesis/available/etd-10312005-142207/ en
dc.identifier.uri http://hdl.handle.net/2263/29158
dc.language.iso en
dc.publisher University of Pretoria en_ZA
dc.rights © 2001, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. en
dc.subject Automatic speech recognition en
dc.subject Bayesian adaptation en
dc.subject Hidden markov model training en
dc.subject UCTD en_US
dc.title Discriminative and Bayesian techniques for hidden Markov model speech recognition systems en
dc.type Thesis en


Files in this item

This item appears in the following Collection(s)

Show simple item record