Fitting non-parametric mixture of regressions : introducing an EM-type algorithm to address the label-switching problem

dc.contributor.authorSkhosana, Sphiwe Bonakele
dc.contributor.authorKanfer, Frans H.J.
dc.contributor.authorMillard, Sollie M.
dc.contributor.emailspiwe.skhosana@up.ac.zaen_US
dc.date.accessioned2023-03-06T06:39:29Z
dc.date.available2023-03-06T06:39:29Z
dc.date.issued2022-05-21
dc.descriptionDATA AVAILABILITY STATEMENT : Publicly available datasets were analyzed in this study. This data can be found here: https://databank.worldbank.org/source/world-development-indicators/ accessed on 15 February 2022.en_US
dc.description.abstractThe non-parametric Gaussian mixture of regressions (NPGMRs) model serves as a flexible approach for the determination of latent heterogeneous regression relationships. This model assumes that the component means, variances and mixing proportions are smooth unknown functions of the covariates where the error distribution of each component is assumed to be Gaussian and hence symmetric. These functions are estimated over a set of grid points using the Expectation- Maximization (EM) algorithm to maximise the local-likelihood functions. However, maximizing each local-likelihood function separately does not guarantee that the local responsibilities and corresponding labels, obtained at the E-step of the EM algorithm, align at each grid point leading to a label-switching problem. This results in non-smooth estimated component regression functions. In this paper, we propose an estimation procedure to account for label switching by tracking the roughness of the estimated component regression functions. We use the local responsibilities to obtain a global estimate of the responsibilities which are then used to maximize each local-likelihood function. The performance of the proposed procedure is demonstrated using a simulation study and through an application using real world data. In the case of well-separated mixture regression components, the procedure gives similar results to competitive methods. However, in the case of poorly separated mixture regression components, the procedure outperforms competitive methods.en_US
dc.description.departmentStatisticsen_US
dc.description.librarianam2023en_US
dc.description.sponsorshipSTATOMET, the Bureau for Statistical and Survey Methodology at the University of Pretoria.en_US
dc.description.urihttps://www.mdpi.com/journal/symmetryen_US
dc.identifier.citationSkhosana, S.B.; Kanfer, F.H.J.; Millard, S.M. Fitting Non-Parametric Mixture of Regressions: Introducing an EM-Type Algorithm to Address the Label-Switching Problem. Symmetry 2022, 14, 1058. https://DOI.org/10.3390/sym14051058.en_US
dc.identifier.issn2073-8994 (online)
dc.identifier.other10.3390/sym14051058
dc.identifier.urihttps://repository.up.ac.za/handle/2263/89964
dc.language.isoenen_US
dc.publisherMDPIen_US
dc.rights© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.en_US
dc.subjectMixture modelsen_US
dc.subjectNon-parametric regressionsen_US
dc.subjectLocal-likelihood estimationen_US
dc.subjectLabel switchingen_US
dc.subjectNon-parametric Gaussian mixture of regression (NPGMR)en_US
dc.subjectExpectation-maximization algorithmen_US
dc.titleFitting non-parametric mixture of regressions : introducing an EM-type algorithm to address the label-switching problemen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Skhosana_Fitting_2022.pdf
Size:
1.33 MB
Format:
Adobe Portable Document Format
Description:
Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: