Data clustering : application and trends

dc.contributor.authorOyewole, Gbeminiyi John
dc.contributor.authorThopil, George Alex
dc.date.accessioned2023-02-27T11:31:46Z
dc.date.issued2023-07
dc.description.abstractClustering has primarily been used as an analytical technique to group unlabeled data for extracting meaningful information. The fact that no clustering algorithm can solve all clustering problems has resulted in the development of several clustering algorithms with diverse applications. We review data clustering, intending to underscore recent applications in selected industrial sectors and other notable concepts. In this paper, we begin by highlighting clustering components and discussing classification terminologies. Furthermore, specific, and general applications of clustering are discussed. Notable concepts on clustering algorithms, emerging variants, measures of similarities/dissimilarities, issues surrounding clustering optimization, validation and data types are outlined. Suggestions are made to emphasize the continued interest in clustering techniques both by scholars and Industry practitioners. Key findings in this review show the size of data as a classification criterion and as data sizes for clustering become larger and varied, the determination of the optimal number of clusters will require new feature extracting methods, validation indices and clustering techniques. In addition, clustering techniques have found growing use in key industry sectors linked to the sustainable development goals such as manufacturing, transportation and logistics, energy, and healthcare, where the use of clustering is more integrated with other analytical techniques than a stand-alone clustering technique.en_US
dc.description.departmentGraduate School of Technology Management (GSTM)en_US
dc.description.embargo2023-11-27
dc.description.librarianhj2023en_US
dc.description.urihttps://link.springer.com/journal/10462en_US
dc.identifier.citationOyewole, G.J., Thopil, G.A. Data clustering: application and trends. Artificial Intelligence Review 56, 6439–6475 (2023). https://doi.org/10.1007/s10462-022-10325-y.en_US
dc.identifier.issn0269-2821 (print)
dc.identifier.issn1573-7462 (online)
dc.identifier.other10.1007/s10462-022-10325-y
dc.identifier.urihttps://repository.up.ac.za/handle/2263/89856
dc.language.isoenen_US
dc.publisherSpringeren_US
dc.rights© The Author(s), under exclusive licence to Springer Nature B.V. 2022. The original publication is available at : http://link.springer.com/journal/10462.en_US
dc.subjectClusteringen_US
dc.subjectClustering classificationen_US
dc.subjectClustering componentsen_US
dc.subjectIndustry applicationsen_US
dc.subjectClustering algorithmsen_US
dc.subjectClustering trendsen_US
dc.titleData clustering : application and trendsen_US
dc.typePostprint Articleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Oyewole_Data_2023.pdf
Size:
857.33 KB
Format:
Adobe Portable Document Format
Description:
Postprint Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: