Fitting non-parametric mixture of regressions : introducing an EM-type algorithm to address the label-switching problem

Show simple item record

dc.contributor.author Skhosana, Sphiwe Bonakele
dc.contributor.author Kanfer, Frans H.J.
dc.contributor.author Millard, Sollie M.
dc.date.accessioned 2023-03-06T06:39:29Z
dc.date.available 2023-03-06T06:39:29Z
dc.date.issued 2022-05-21
dc.description DATA AVAILABILITY STATEMENT : Publicly available datasets were analyzed in this study. This data can be found here: https://databank.worldbank.org/source/world-development-indicators/ accessed on 15 February 2022. en_US
dc.description.abstract The non-parametric Gaussian mixture of regressions (NPGMRs) model serves as a flexible approach for the determination of latent heterogeneous regression relationships. This model assumes that the component means, variances and mixing proportions are smooth unknown functions of the covariates where the error distribution of each component is assumed to be Gaussian and hence symmetric. These functions are estimated over a set of grid points using the Expectation- Maximization (EM) algorithm to maximise the local-likelihood functions. However, maximizing each local-likelihood function separately does not guarantee that the local responsibilities and corresponding labels, obtained at the E-step of the EM algorithm, align at each grid point leading to a label-switching problem. This results in non-smooth estimated component regression functions. In this paper, we propose an estimation procedure to account for label switching by tracking the roughness of the estimated component regression functions. We use the local responsibilities to obtain a global estimate of the responsibilities which are then used to maximize each local-likelihood function. The performance of the proposed procedure is demonstrated using a simulation study and through an application using real world data. In the case of well-separated mixture regression components, the procedure gives similar results to competitive methods. However, in the case of poorly separated mixture regression components, the procedure outperforms competitive methods. en_US
dc.description.department Statistics en_US
dc.description.librarian am2023 en_US
dc.description.sponsorship STATOMET, the Bureau for Statistical and Survey Methodology at the University of Pretoria. en_US
dc.description.uri https://www.mdpi.com/journal/symmetry en_US
dc.identifier.citation Skhosana, S.B.; Kanfer, F.H.J.; Millard, S.M. Fitting Non-Parametric Mixture of Regressions: Introducing an EM-Type Algorithm to Address the Label-Switching Problem. Symmetry 2022, 14, 1058. https://DOI.org/10.3390/sym14051058. en_US
dc.identifier.issn 2073-8994 (online)
dc.identifier.other 10.3390/sym14051058
dc.identifier.uri https://repository.up.ac.za/handle/2263/89964
dc.language.iso en en_US
dc.publisher MDPI en_US
dc.rights © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. en_US
dc.subject Mixture models en_US
dc.subject Non-parametric regressions en_US
dc.subject Local-likelihood estimation en_US
dc.subject Label switching en_US
dc.subject Non-parametric Gaussian mixture of regression (NPGMR) en_US
dc.subject Expectation-maximization algorithm en_US
dc.title Fitting non-parametric mixture of regressions : introducing an EM-type algorithm to address the label-switching problem en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record