We are excited to announce that the repository will soon undergo an upgrade, featuring a new look and feel along with several enhanced features to improve your experience. Please be on the lookout for further updates and announcements regarding the launch date. We appreciate your support and look forward to unveiling the improved platform soon.
dc.contributor.advisor | Marivate, Vukosi | |
dc.contributor.coadvisor | Olaleye, Kayode | |
dc.contributor.postgraduate | Matloga, Mokgadi Penelope | |
dc.date.accessioned | 2024-09-13T11:57:27Z | |
dc.date.available | 2024-09-13T11:57:27Z | |
dc.date.created | 2024-04 | |
dc.date.issued | 2023-11 | |
dc.description | Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2023. | en_US |
dc.description.abstract | Understanding public sentiment is vital for political parties in order for them to be able to structure their election campaigns around voter expectations. The study focuses on unsupervised learning to assess the variation of polarity sentiment in tweets during the 2021 South African local government election campaign. The study uses a pre-trained twitter-roberta-base-sentiment-latest model from Hugging Face and unsupervised lexicon based pre-trained approaches, namely: VADER and TextBlob to determine the polarity sentiment in order to gain insight that could be applied towards informing political campaigns and to see if there are any distinct sentiment patterns or shifts during different phases of the 2021 local government elections campaigns. Furthermore, the study applies the use of suspicious patterns and K-Means methods to classify the users as either bots and human using to be able to identify the user behind the keyboard. The study also make use of OpenAI GPT model to label the dataset for fine-tuning and addresses the issue of class imbalance. VADER and TextBlob results show a significant difference from that of the twitter-roberta-base-sentiment-latest models when comparing the statistical distribution based on the sentiment results and the user classification results. Based on the results, there is a significant variation across all sentiment classes and they vary over time. Furthermore, the results revealed TRBSL and TRBSL** outperforms VADER and TextBlob based on the scores for weighted accuracy and F1-scores. It was discovered that most of the tweets were generated by humans, with only few being identified as bot-generated and having a negative sentiments. | en_US |
dc.description.availability | Unrestricted | en_US |
dc.description.degree | MIT (Big Data Science) | en_US |
dc.description.department | Computer Science | en_US |
dc.description.faculty | Faculty of Engineering, Built Environment and Information Technology | en_US |
dc.identifier.citation | * | en_US |
dc.identifier.other | A2024 | en_US |
dc.identifier.uri | http://hdl.handle.net/2263/98196 | |
dc.language.iso | en | en_US |
dc.publisher | University of Pretoria | |
dc.rights | © 2021 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. | |
dc.subject | UCTD | en_US |
dc.subject | Sentiment analysis | en_US |
dc.subject | OpenAI | en_US |
dc.subject | Fine-tuning | en_US |
dc.subject | Suspicious patterns | en_US |
dc.subject | User classification | en_US |
dc.subject | Local government election | en_US |
dc.title | Sentiment analysis using unsupervised learning for local government elections in South Africa | en_US |
dc.type | Mini Dissertation | en_US |