A cost, complexity and performance comparison of two automatic language identification architectures

dc.contributor.advisorBotha, Elizabeth C.en
dc.contributor.emailupetd@up.ac.zaen
dc.contributor.postgraduateCombrinck, Hendrik Petrusen
dc.date.accessioned2013-09-07T19:12:21Z
dc.date.available2006-12-21en
dc.date.available2013-09-07T19:12:21Z
dc.date.created1999-11-01en
dc.date.issued2006-12-21en
dc.date.submitted2006-12-21en
dc.descriptionDissertation (M Eng (Computer Engineering))--University of Pretoria, 2006.en
dc.description.abstractThis dissertation investigates the cost-complexity-performance relationship between two automatic language identification systems. The first is a state-of-the-art archi¬tecture, trained on about three hours of phonetically hand-labelled telephone speech obtained from the recognised OGLTS corpus. The second system, introduced by our¬selves, is a simpler design with a smaller, less complex parameter space. It is a vector quantisation-based approach which bears some resemblance to a system suggested by Sugiyama. Though trained on the same data, it has no need for any labels and is therefore less costly. A number of experiments are performed to find quasi-optimal parameters for the two systems. In further experiments the systems are evaluated and compared on a set of ten two-language tasks, spanning five languages. The more com¬plex system is shown to have a substantial performance advantage over the simpler design - 81% versus 65% on 40 seconds of speech. However, both results are well under reported state-of-the-art performance of 94% and would suggest that our systems can benefit from additional attention to implementation detail and optimisation of various parameters. Given the above, our suggested architecture may potentially provide an adequate solution where the high development cost associated with state-of-the-art technology and the necessary training corpora are prohibitive.en
dc.description.availabilityunrestricteden
dc.description.departmentElectrical, Electronic and Computer Engineeringen
dc.identifier.citationCombrinck, HP 1999, A cost, complexity and performance comparison of two automatic language identification architectures, MEng dissertation, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/30492 >en
dc.identifier.otherH418/agen
dc.identifier.upetdurlhttp://upetd.up.ac.za/thesis/available/etd-12212006-141335/en
dc.identifier.urihttp://hdl.handle.net/2263/30492
dc.language.isoen
dc.publisherUniversity of Pretoriaen_ZA
dc.rights© 1999, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.en
dc.subjectSpeech processing systemsen
dc.subjectPattern recognition systemsen
dc.subjectUCTDen_US
dc.titleA cost, complexity and performance comparison of two automatic language identification architecturesen
dc.typeDissertationen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
dissertation.pdf
Size:
2.28 MB
Format:
Adobe Portable Document Format