Sentiment analysis using unsupervised learning for local government elections in South Africa

Matloga, Mokgadi Penelope

UPSpace Home
→
University of Pretoria: Research Output
→
Theses and Dissertations (University of Pretoria)
→
View Item

We are excited to announce that the repository will soon undergo an upgrade, featuring a new look and feel along with several enhanced features to improve your experience. Please be on the lookout for further updates and announcements regarding the launch date. We appreciate your support and look forward to unveiling the improved platform soon.

Show simple item record

dc.contributor.advisor	Marivate, Vukosi
dc.contributor.coadvisor	Olaleye, Kayode
dc.contributor.postgraduate	Matloga, Mokgadi Penelope
dc.date.accessioned	2024-09-13T11:57:27Z
dc.date.available	2024-09-13T11:57:27Z
dc.date.created	2024-04
dc.date.issued	2023-11
dc.description	Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2023.	en_US
dc.description.abstract	Understanding public sentiment is vital for political parties in order for them to be able to structure their election campaigns around voter expectations. The study focuses on unsupervised learning to assess the variation of polarity sentiment in tweets during the 2021 South African local government election campaign. The study uses a pre-trained twitter-roberta-base-sentiment-latest model from Hugging Face and unsupervised lexicon based pre-trained approaches, namely: VADER and TextBlob to determine the polarity sentiment in order to gain insight that could be applied towards informing political campaigns and to see if there are any distinct sentiment patterns or shifts during different phases of the 2021 local government elections campaigns. Furthermore, the study applies the use of suspicious patterns and K-Means methods to classify the users as either bots and human using to be able to identify the user behind the keyboard. The study also make use of OpenAI GPT model to label the dataset for fine-tuning and addresses the issue of class imbalance. VADER and TextBlob results show a significant difference from that of the twitter-roberta-base-sentiment-latest models when comparing the statistical distribution based on the sentiment results and the user classification results. Based on the results, there is a significant variation across all sentiment classes and they vary over time. Furthermore, the results revealed TRBSL and TRBSL** outperforms VADER and TextBlob based on the scores for weighted accuracy and F1-scores. It was discovered that most of the tweets were generated by humans, with only few being identified as bot-generated and having a negative sentiments.	en_US
dc.description.availability	Unrestricted	en_US
dc.description.degree	MIT (Big Data Science)	en_US
dc.description.department	Computer Science	en_US
dc.description.faculty	Faculty of Engineering, Built Environment and Information Technology	en_US
dc.identifier.citation	*	en_US
dc.identifier.other	A2024	en_US
dc.identifier.uri	http://hdl.handle.net/2263/98196
dc.language.iso	en	en_US
dc.publisher	University of Pretoria
dc.rights	© 2021 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject	UCTD	en_US
dc.subject	Sentiment analysis	en_US
dc.subject	OpenAI	en_US
dc.subject	Fine-tuning	en_US
dc.subject	Suspicious patterns	en_US
dc.subject	User classification	en_US
dc.subject	Local government election	en_US
dc.title	Sentiment analysis using unsupervised learning for local government elections in South Africa	en_US
dc.type	Mini Dissertation	en_US