dc.contributor.advisor |
Mazarura, Jocelyn |
|
dc.contributor.coadvisor |
Fabris-Rotelli, Inger Nicolette |
|
dc.contributor.postgraduate |
Jafta, Papama Hlumela Gandhi |
|
dc.date.accessioned |
2024-02-13T09:41:21Z |
|
dc.date.available |
2024-02-13T09:41:21Z |
|
dc.date.created |
2024-04 |
|
dc.date.issued |
2023-12-07 |
|
dc.description |
Mini Dissertation (MSc (Advanced Data Analytics))--University of Pretoria, 2023. |
en_US |
dc.description.abstract |
In the era of social media, the analysis of Twitter data has become increasingly important for understanding the dynamics of online discourse. This research introduces a novel approach for tracking the spatial and temporal evolution of topics in Twitter data. Leveraging the spatial and temporal labels provided by Twitter for tweets, we propose the Clustered Biterm Topic Model. This model combines the Biterm Topic Model with K-medoid clustering to uncover the intricate topic development patterns over space and time. To enhance the accuracy and applicability of our model, we introduce an innovative element: a covariate-dependent matrix. This matrix incorporates essential covariate information and geographic proximity into the dissimilarity matrix used by K-Medoids clustering. By considering the inherent semantic relationships between topics and the contextual information provided by covariates and geographic proximity, our model captures the complex interplay of topics as they emerge and evolve across different regions and timeframes on Twitter. The proposed Clustered Biterm Topic Model offers a robust and versatile tool for researchers, policymakers, and businesses to gain deeper insights into the dynamic landscape of online conversations, which are inherently shaped by space and time. |
en_US |
dc.description.availability |
Restricted |
en_US |
dc.description.degree |
MSc (Advanced Data Analytics) |
en_US |
dc.description.department |
Statistics |
en_US |
dc.description.faculty |
Faculty of Natural and Agricultural Sciences |
en_US |
dc.description.sponsorship |
STATOMET
TUKS Cricket |
en_US |
dc.identifier.citation |
* |
en_US |
dc.identifier.doi |
10.25403/UPresearchdata.25208939 |
en_US |
dc.identifier.other |
A2024 |
en_US |
dc.identifier.uri |
http://hdl.handle.net/2263/94530 |
|
dc.language.iso |
en |
en_US |
dc.publisher |
University of Pretoria |
|
dc.rights |
© 2023 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. |
|
dc.subject |
UCTD |
en_US |
dc.subject |
Short-text topic modelling |
|
dc.subject |
COVID-19 |
|
dc.subject |
Covariate-dependent weighting matrix |
|
dc.subject |
Spatial-temporal |
|
dc.subject |
K-medoids |
|
dc.subject.other |
Sustainable Development Goals (SDGs) |
|
dc.subject.other |
SDG-09: Industry, innovation and infrastructure |
|
dc.subject.other |
Natural and agricultural sciences theses SDG-09 |
|
dc.subject.other |
SDG-16: Peace, justice and strong institutions |
|
dc.subject.other |
Natural and agricultural sciences theses SDG-16 |
|
dc.title |
Spatial-temporal topic modelling of COVID-19 tweets in South Africa |
en_US |
dc.type |
Mini Dissertation |
en_US |