A Computational Intelligence Approach to Clustering of Temporal Data

dc.contributor.advisorEngelbrecht, Andries P.
dc.contributor.postgraduateGeorgieva, Kristina Slavomirova
dc.date.accessioned2015-02-23T12:16:29Z
dc.date.available2015-02-23T12:16:29Z
dc.date.created2015-04-21
dc.date.issued2015en_ZA
dc.descriptionDissertation (MSc)--University of Pretoria, 2015.en_ZA
dc.description.abstractTemporal data is common in real-world datasets. Analysis of such data, for example by means of clustering algorithms, can be difficult due to its dynamic behaviour. There are various types of changes that may occur to clusters in a dataset. Firstly, data patterns can migrate between clusters, shrinking or expanding the clusters. Additionally, entire clusters may move around the search space. Lastly, clusters can split and merge. Data clustering, which is the process of grouping similar objects, is one approach to determine relationships among data patterns, but data clustering approaches can face limitations when applied to temporal data, such as difficulty tracking the moving clusters. This research aims to analyse the ability of particle swarm optimisation (PSO) and differential evolution (DE) algorithms to cluster temporal data. These algorithms experience two weaknesses when applied to temporal data. The first weakness is the loss of diversity, which refers to the fact that the population of the algorithm converges, becoming less diverse and, therefore, limiting the algorithm’s exploration capabilities. The second weakness, outdated memory, is only experienced by the PSO and refers to the previous personal best solutions found by the particles becoming obsolete as the environment changes. A data clustering algorithm that addresses these two weaknesses is necessary to cluster temporal data. This research describes various adaptations of PSO and DE algorithms for the purpose of clustering temporal data. The algorithms proposed aim to address the loss of diversity and outdated memory problems experienced by PSO and DE algorithms. These problems are addressed by combining approaches previously used for the purpose of dealing with temporal or dynamic data, such as repulsion and anti-convergence, with PSO and DE approaches used to cluster data. Six PSO algorithms are introduced in this research, namely the data clustering particle swarm optimisation (DCPSO), reinitialising data clustering particle swarm optimisation (RDCPSO), cooperative data clustering particle swarm optimisation (CDCPSO), multi-swarm data clustering particle swarm optimisation (MDCPSO), cooperative multi-swarm data clustering particle swarm optimisation (CMDCPSO), and elitist cooperative multi-swarm data clustering particle swarm optimisation (eCMDCPSO). Additionally, four DE algorithms are introduced, namely the data clustering differential evolution (DCDE), re-initialising data clustering differential evolution (RDCDE), dynamic data clustering differential evolution (DCDynDE), and cooperative dynamic data clustering differential evolution (CDCDynDE). The PSO and DE algorithms introduced require prior knowledge of the total number of clusters in the dataset. The total number of clusters in a real-world dataset, however, is not always known. For this reason, the best performing PSO and best performing DE are compared. The CDCDynDE is selected as the winning algorithm, which is then adapted to determine the optimal number of clusters dynamically. The resulting algorithm is the k-independent cooperative data clustering differential evolution (KCDCDynDE) algorithm, which was compared against the local network neighbourhood artificial immune system (LNNAIS) algorithm, which is an artificial immune system (AIS) designed to cluster temporal data and determine the total number of clusters dynamically. It was determined that the KCDCDynDE performed the clustering task well for problems with frequently changing data, high-dimensions, and pattern and cluster data migration types.en_ZA
dc.description.availabilityUnrestricteden_ZA
dc.description.departmentComputer Scienceen_ZA
dc.identifier.citationGeorgieva, KS 2015. A Computational Intelligence Approach to Clustering of Temporal Data, MSc dissertation, University of Pretoria, Pretoria, viewed yymmdd, <http://hdl.handle.net/2263/43778>en_ZA
dc.identifier.otherA2015
dc.identifier.urihttp://hdl.handle.net/2263/43778
dc.language.isoenen_ZA
dc.publisherUniversity of Pretoriaen_ZA
dc.rights© 2015 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.en_ZA
dc.subjectComputational Intelligenceen_ZA
dc.subjectLocal network
dc.subjectClustering
dc.subjectTemporal data
dc.subjectParticle swarm optimization (PSO)
dc.subjectDifferential evolution
dc.subjectUCTD
dc.subject.otherEngineering, built environment and information technology theses SDG-09
dc.subject.otherSDG-09: Industry, innovation and infrastructure
dc.subject.otherEngineering, built environment and information technology theses SDG-11
dc.subject.otherSDG-11: Sustainable cities and communities
dc.titleA Computational Intelligence Approach to Clustering of Temporal Dataen_ZA
dc.typeDissertationen_ZA

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Georgieva_Computational_2015.pdf
Size:
7.35 MB
Format:
Adobe Portable Document Format
Description:
Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: