Modelling of a speech-to-text recognition system for air traffic control and NATO air command

dc.contributor.authorZietsman, Grant
dc.contributor.authorMalekian, Reza
dc.contributor.emailgrant.zietsman@tuks.co.zaen_US
dc.date.accessioned2023-02-22T13:50:01Z
dc.date.available2023-02-22T13:50:01Z
dc.date.issued2022-12
dc.description.abstractAccent invariance in speech recognition is a challenging problem especially in the are of aviation. In this paper a speech recognition system is developed to transcribe accented speech between pilots and air traffic controllers. The system allows handling of accents in continuous speech by modelling phonemes using Hidden Markov Models (HMMs) with Gaussian mixture model (GMM) probability density functions for each state. These phonemes are used to build word models of the NATO phonetic alphabet as well as the numerals 0 to 9 with transcriptions obtained from the Carnegie Mellon University (CMU) pronouncing dictionary. Mel-Frequency Cepstral Co-efficients (MFCC) with delta and delta-delta coefficients are used for the feature extraction process. Amplitude normalisation and covariance scaling is implemented to improve recognition accuracy. A word error rate (WER) of 2% for seen speakers and 22% for unseen speakers is obtained.en_US
dc.description.departmentElectrical, Electronic and Computer Engineeringen_US
dc.description.librarianhj2023en_US
dc.description.urihttp://jit.ndhu.edu.twen_US
dc.identifier.citationZietsman, G. & Malekian, R. 2022, 'Modelling of a speech-to-text recognition system for air traffic control and NATO air command', Journal of Internet Technology, vol. 23, no. 7, pp. 1527-1539, doi : 10.53106/160792642022122307008.en_US
dc.identifier.issn1607-9264 (print)
dc.identifier.issn2079-4029 (online)
dc.identifier.other10.53106/160792642022122307008
dc.identifier.urihttps://repository.up.ac.za/handle/2263/89771
dc.language.isoenen_US
dc.publisherTaiwan Academic Network Management Committeeen_US
dc.rightsTaiwan Academic Network Management Committeeen_US
dc.subjectAutomatic speech recognition (ASR)en_US
dc.subjectHidden Markov model (HMM)en_US
dc.subjectGaussian mixture model (GMM)en_US
dc.subjectMel-frequency cepstral coefficients (MFCC)en_US
dc.subjectCovariance scalingen_US
dc.titleModelling of a speech-to-text recognition system for air traffic control and NATO air commanden_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Zietsman_Modelling_2022.pdf
Size:
1.6 MB
Format:
Adobe Portable Document Format
Description:
Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: