dc.contributor.author |
Stevens, Jesse
|
|
dc.contributor.author |
Wilke, Daniel Nicolas
|
|
dc.contributor.author |
Setshedi, I.I. (Isaac)
|
|
dc.date.accessioned |
2025-02-05T11:35:23Z |
|
dc.date.available |
2025-02-05T11:35:23Z |
|
dc.date.issued |
2024-09-25 |
|
dc.description |
DATA AVAILABILITY STATEMENT : The original data presented in the study are openly available in a
GitHub repository at https://github.com/Greeen16/SoftwareX-Paper (accessed on 31 May 2024).
The combined heartbeat dataset is available from Kaggle at https://www.kaggle.com/datasets/
shayanfazeli/heartbeat (accessed on 31 May 2024). |
en_US |
dc.description.abstract |
Linear latent variable models such as principal component analysis (PCA), independent
component analysis (ICA), canonical correlation analysis (CCA), and factor analysis (FA) identify
latent directions (or loadings) either ordered or unordered. These data are then projected onto the
latent directions to obtain their projected representations (or scores). For example, PCA solvers usually
rank principal directions by explaining the most variance to the least variance. In contrast, ICA solvers
usually return independent directions unordered and often with single sources spread across multiple
directions as multiple sub-sources, severely diminishing their usability and interpretability. This paper
proposes a general framework to enhance latent space representations to improve the interpretability
of linear latent spaces. Although the concepts in this paper are programming language agnostic, the
framework is written in Python. This framework simplifies the process of clustering and ranking of
latent vectors to enhance latent information per latent vector and the interpretation of latent vectors.
Several innovative enhancements are incorporated, including latent ranking (LR), latent scaling
(LS), latent clustering (LC), and latent condensing (LCON). LR ranks latent directions according to a
specified scalar metric. LS scales latent directions according to a specified metric. LC automatically
clusters latent directions into a specified number of clusters. Lastly, LCON automatically determines
the appropriate number of clusters to condense the latent directions for a given metric to enable
optimal latent discovery. Additional functionality of the framework includes single-channel and
multi-channel data sources and data pre-processing strategies such as Hankelisation to seamlessly
expand the applicability of linear latent variable models (LLVMs) to a wider variety of data. The
effectiveness of LR, LS, LC, and LCON is shown in two foundational problems crafted with two
applied latent variable models, namely, PCA and ICA. |
en_US |
dc.description.department |
Mechanical and Aeronautical Engineering |
en_US |
dc.description.librarian |
am2024 |
en_US |
dc.description.sdg |
SDG-09: Industry, innovation and infrastructure |
en_US |
dc.description.uri |
https://www.mdpi.com/journal/mca |
en_US |
dc.identifier.citation |
Stevens, J.;Wilke, D.N.; Setshedi, I. Latent Space Perspicacity
and Interpretation Enhancement
(LS-PIE) Framework. Mathematical and Computational Applications 2024, 29, 85. https://DOI.org/10.3390/mca29050085. |
en_US |
dc.identifier.issn |
1300-686X (print) |
|
dc.identifier.issn |
2297-8747 (online) |
|
dc.identifier.other |
10.3390/mca29050085 |
|
dc.identifier.uri |
http://hdl.handle.net/2263/100541 |
|
dc.language.iso |
en |
en_US |
dc.publisher |
MDPI |
en_US |
dc.rights |
© 2024 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and
conditions of the Creative Commons
Attribution (CC BY) license. |
en_US |
dc.subject |
Latent space |
en_US |
dc.subject |
Reconstruction |
en_US |
dc.subject |
Interpretation |
en_US |
dc.subject |
Scaling |
en_US |
dc.subject |
Ranking |
en_US |
dc.subject |
Clustering |
en_US |
dc.subject |
Condensing |
en_US |
dc.subject |
SDG-09: Industry, innovation and infrastructure |
en_US |
dc.subject |
Principal component analysis (PCA) |
en_US |
dc.subject |
Factor analysis (FA) |
en_US |
dc.subject |
Canonical correlation analysis (CCA) |
en_US |
dc.subject |
Independent component analysis (ICA) |
en_US |
dc.subject |
Linear latent variable model (LLVM) |
en_US |
dc.title |
Latent space perspicacity and interpretation enhancement (LS-PIE) framework |
en_US |
dc.type |
Article |
en_US |