Spatial-temporal topic modelling of COVID-19 tweets in South Africa

Show simple item record

dc.contributor.advisor Mazarura, Jocelyn
dc.contributor.coadvisor Fabris-Rotelli, Inger Nicolette
dc.contributor.postgraduate Jafta, Papama Hlumela Gandhi
dc.date.accessioned 2024-02-13T09:41:21Z
dc.date.available 2024-02-13T09:41:21Z
dc.date.created 2024-04
dc.date.issued 2023-12-07
dc.description Mini Dissertation (MSc (Advanced Data Analytics))--University of Pretoria, 2023. en_US
dc.description.abstract In the era of social media, the analysis of Twitter data has become increasingly important for understanding the dynamics of online discourse. This research introduces a novel approach for tracking the spatial and temporal evolution of topics in Twitter data. Leveraging the spatial and temporal labels provided by Twitter for tweets, we propose the Clustered Biterm Topic Model. This model combines the Biterm Topic Model with K-medoid clustering to uncover the intricate topic development patterns over space and time. To enhance the accuracy and applicability of our model, we introduce an innovative element: a covariate-dependent matrix. This matrix incorporates essential covariate information and geographic proximity into the dissimilarity matrix used by K-Medoids clustering. By considering the inherent semantic relationships between topics and the contextual information provided by covariates and geographic proximity, our model captures the complex interplay of topics as they emerge and evolve across different regions and timeframes on Twitter. The proposed Clustered Biterm Topic Model offers a robust and versatile tool for researchers, policymakers, and businesses to gain deeper insights into the dynamic landscape of online conversations, which are inherently shaped by space and time. en_US
dc.description.availability Restricted en_US
dc.description.degree MSc (Advanced Data Analytics) en_US
dc.description.department Statistics en_US
dc.description.faculty Faculty of Natural and Agricultural Sciences en_US
dc.description.sponsorship STATOMET TUKS Cricket en_US
dc.identifier.citation * en_US
dc.identifier.doi 10.25403/UPresearchdata.25208939 en_US
dc.identifier.other A2024 en_US
dc.identifier.uri http://hdl.handle.net/2263/94530
dc.language.iso en en_US
dc.publisher University of Pretoria
dc.rights © 2023 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject UCTD en_US
dc.subject Short-text topic modelling
dc.subject COVID-19
dc.subject Covariate-dependent weighting matrix
dc.subject Spatial-temporal
dc.subject K-medoids
dc.subject.other Sustainable Development Goals (SDGs)
dc.subject.other SDG-09: Industry, innovation and infrastructure
dc.subject.other Natural and agricultural sciences theses SDG-09
dc.subject.other SDG-16: Peace, justice and strong institutions
dc.subject.other Natural and agricultural sciences theses SDG-16
dc.title Spatial-temporal topic modelling of COVID-19 tweets in South Africa en_US
dc.type Mini Dissertation en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record