IoT-enabled WBAN and machine learning for speech emotion recognition in patients

dc.contributor.authorOlatinwo, Damilola D.
dc.contributor.authorAbu-Mahfouz, Adnan Mohammed
dc.contributor.authorHancke, Gerhard P.
dc.contributor.authorMyburgh, Hermanus Carel
dc.date.accessioned2024-02-20T12:24:08Z
dc.date.available2024-02-20T12:24:08Z
dc.date.issued2023-03-08
dc.descriptionDATA AVAILABILITY STATEMENT : The dataset we used is available at https://pubmed.ncbi.nlm.nih.gov/29768426/ (accessed on 10 December 2022).en_US
dc.description.abstractInternet of things (IoT)-enabled wireless body area network (WBAN) is an emerging technology that combines medical devices, wireless devices, and non-medical devices for healthcare management applications. Speech emotion recognition (SER) is an active research field in the healthcare domain and machine learning. It is a technique that can be used to automatically identify speakers’ emotions from their speech. However, the SER system, especially in the healthcare domain, is confronted with a few challenges. For example, low prediction accuracy, high computational complexity, delay in real-time prediction, and how to identify appropriate features from speech. Motivated by these research gaps, we proposed an emotion-aware IoT-enabled WBAN system within the healthcare framework where data processing and long-range data transmissions are performed by an edge AI system for real-time prediction of patients’ speech emotions as well as to capture the changes in emotions before and after treatment. Additionally, we investigated the effectiveness of different machine learning and deep learning algorithms in terms of performance classification, feature extraction methods, and normalization methods. We developed a hybrid deep learning model, i.e., convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM), and a regularized CNN model. We combined the models with different optimization strategies and regularization techniques to improve the prediction accuracy, reduce generalization error, and reduce the computational complexity of the neural networks in terms of their computational time, power, and space. Different experiments were performed to check the efficiency and effectiveness of the proposed machine learning and deep learning algorithms. The proposed models are compared with a related existing model for evaluation and validation using standard performance metrics such as prediction accuracy, precision, recall, F1 score, confusion matrix, and the differences between the actual and predicted values. The experimental results proved that one of the proposed models outperformed the existing model with an accuracy of about 98%.en_US
dc.description.departmentElectrical, Electronic and Computer Engineeringen_US
dc.description.librarianam2024en_US
dc.description.sdgSDG-09: Industry, innovation and infrastructureen_US
dc.description.sponsorshipThe Council for Scientific and Industrial Research, Pretoria, South Africa through the Smart Networks collaboration initiative and IoT-Factory Program (Funded by the Department of Science and Innovation (DSI), South Africa).en_US
dc.description.urihttps://www.mdpi.com/journal/sensorsen_US
dc.identifier.citationOlatinwo, D.D.; Abu-Mahfouz, A.; Hancke, G.; Myburgh, H. IoT-Enabled WBAN and Machine Learning for Speech Emotion Recognition in Patients. Sensors 2023, 23, 2948. https://DOI.org/10.3390/s23062948.en_US
dc.identifier.issn1424-8220 (online)
dc.identifier.other10.3390/s23062948
dc.identifier.urihttp://hdl.handle.net/2263/94761
dc.language.isoenen_US
dc.publisherMDPIen_US
dc.rights© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.en_US
dc.subjectIoT WBANen_US
dc.subjectMachine learningen_US
dc.subjectDeep learningen_US
dc.subjectEdge AIen_US
dc.subjectSpeech emotionen_US
dc.subjectStandard scaleren_US
dc.subjectMin–max scaleren_US
dc.subjectRobust scaleren_US
dc.subjectData augmentationen_US
dc.subjectSpectrogramsen_US
dc.subjectRegularization techniquesen_US
dc.subjectMel spectrogramen_US
dc.subjectInternet of Things (IoT)en_US
dc.subjectWireless body area network (WBAN)en_US
dc.subjectSpeech emotion recognition (SER)en_US
dc.subjectArtificial intelligence (AI)en_US
dc.subjectConvolutional neural network (CNN)en_US
dc.subjectBidirectional long short-term memory (BiLSTM)en_US
dc.subjectMel frequency Ccepstral coefficient (MFCC)en_US
dc.subjectSDG-09: Industry, innovation and infrastructureen_US
dc.titleIoT-enabled WBAN and machine learning for speech emotion recognition in patientsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Olatinwo_IoTEnabled_2023.pdf
Size:
1.56 MB
Format:
Adobe Portable Document Format
Description:
Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: