Discriminative and Bayesian techniques for hidden Markov model speech recognition systems

Purnell, Darryl William

dc.contributor.advisor	Botha, Elizabeth C.	en
dc.contributor.postgraduate	Purnell, Darryl William	en
dc.date.accessioned	2013-09-07T15:00:35Z
dc.date.available	2005-11-01	en
dc.date.available	2013-09-07T15:00:35Z
dc.date.created	2001-04-01	en
dc.date.issued	2006-11-01	en
dc.date.submitted	2005-10-31	en
dc.description	Thesis (PhD (Electronic Engineering))--University of Pretoria, 2006.	en
dc.description.abstract	The collection of large speech databases is not a trivial task (if done properly). It is not always possible to collect, segment and annotate large databases for every task or language. Databases are also often imbalanced because little data is available for a specific subset of individuals. One example of such an imbalance is that there are often more male speakers than female speakers (or vice versa). If there are, for example, far fewer female speakers than male speakers, then the recognizers will tend to work poorly for female speakers (as compared to performance for male speakers). This thesis focuses on using Bayesian and discriminative training algorithms to improve continuous speech recognition systems in scenarios where there is a limited amount of training data available. The research reported in this thesis can be divided into three categories: • Overspecialization is characterized by good recognition performance for the data used during training, but poor recognition performance for independent testing data. This is a problem when too little data is available for training purposes. Methods of reducing overspecialization in the minimum classification error algorithm are therefore investigated. • Development of new Bayesian and discriminative adaptation/training techniques that can be used in situations where there is a small amount of data available. One example is an imbalance in the numbers of male and female speakers, where these techniques can be used to improve recognition performance for female speakers without decreasing recognition performance for male speakers. • Bayesian learning, where Bayesian training is used to improve recognition performance in situations where one can only use the limited training data available.
These methods are extremely computationally expensive, but are justified by the improved recognition rates for certain tasks. This is, to the author's knowledge, the first time that Bayesian learning using Markov chain Monte Carlo methods has been used in hidden Markov model speech recognition. The algorithms proposed and reviewed are tested using three different datasets (TIMIT, TIDIGITS and SUNSpeech), with the tasks being connected digit recognition and continuous speech recognition. Results indicate that the proposed algorithms improve recognition performance significantly for situations where little training data is available.	en
dc.description.availability	unrestricted	en
dc.description.department	Electrical, Electronic and Computer Engineering	en
dc.identifier.citation	Purnell, DW 2001, Discriminative and Bayesian techniques for hidden Markov model speech recognition systems, PhD thesis, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/29158 >	en
dc.identifier.other	H543/ag	en
dc.identifier.upetdurl	http://upetd.up.ac.za/thesis/available/etd-10312005-142207/	en
dc.identifier.uri	http://hdl.handle.net/2263/29158
dc.language.iso		en
dc.publisher	University of Pretoria	en_ZA
dc.rights	© 2001, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.	en
dc.subject	Automatic speech recognition	en
dc.subject	Bayesian adaptation	en
dc.subject	Hidden Markov model training	en
dc.subject	UCTD	en_US
dc.title	Discriminative and Bayesian techniques for hidden Markov model speech recognition systems	en
dc.type	Thesis	en