A cost, complexity and performance comparison of two automatic language identification architectures

Show simple item record

dc.contributor.advisor Botha, Elizabeth C. en
dc.contributor.postgraduate Combrinck, Hendrik Petrus en
dc.date.accessioned 2013-09-07T19:12:21Z
dc.date.available 2006-12-21 en
dc.date.available 2013-09-07T19:12:21Z
dc.date.created 1999-11-01 en
dc.date.issued 2006-12-21 en
dc.date.submitted 2006-12-21 en
dc.description Dissertation (M Eng (Computer Engineering))--University of Pretoria, 2006. en
dc.description.abstract This dissertation investigates the cost-complexity-performance relationship between two automatic language identification systems. The first is a state-of-the-art archi¬tecture, trained on about three hours of phonetically hand-labelled telephone speech obtained from the recognised OGLTS corpus. The second system, introduced by our¬selves, is a simpler design with a smaller, less complex parameter space. It is a vector quantisation-based approach which bears some resemblance to a system suggested by Sugiyama. Though trained on the same data, it has no need for any labels and is therefore less costly. A number of experiments are performed to find quasi-optimal parameters for the two systems. In further experiments the systems are evaluated and compared on a set of ten two-language tasks, spanning five languages. The more com¬plex system is shown to have a substantial performance advantage over the simpler design - 81% versus 65% on 40 seconds of speech. However, both results are well under reported state-of-the-art performance of 94% and would suggest that our systems can benefit from additional attention to implementation detail and optimisation of various parameters. Given the above, our suggested architecture may potentially provide an adequate solution where the high development cost associated with state-of-the-art technology and the necessary training corpora are prohibitive. en
dc.description.availability unrestricted en
dc.description.department Electrical, Electronic and Computer Engineering en
dc.identifier.citation Combrinck, HP 1999, A cost, complexity and performance comparison of two automatic language identification architectures, MEng dissertation, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/30492 > en
dc.identifier.other H418/ag en
dc.identifier.upetdurl http://upetd.up.ac.za/thesis/available/etd-12212006-141335/ en
dc.identifier.uri http://hdl.handle.net/2263/30492
dc.language.iso en
dc.publisher University of Pretoria en_ZA
dc.rights © 1999, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. en
dc.subject Speech processing systems en
dc.subject Pattern recognition systems en
dc.subject UCTD en_US
dc.title A cost, complexity and performance comparison of two automatic language identification architectures en
dc.type Dissertation en


Files in this item

This item appears in the following Collection(s)

Show simple item record