dc.contributor.author |
Skhosana, Sphiwe Bonakele
|
|
dc.contributor.author |
Millard, Salomon M.
|
|
dc.contributor.author |
Kanfer, F.H.J. (Frans)
|
|
dc.date.accessioned |
2024-07-11T12:39:49Z |
|
dc.date.available |
2024-07-11T12:39:49Z |
|
dc.date.issued |
2024-05 |
|
dc.description |
AVAILABILITY OF DATA AND MATERIALS :
All the data and software used in this research can be accessed through the link: Data and Software |
en_US |
dc.description.abstract |
Semi-parametric Gaussian mixtures of non-parametric regressions (SPGMNRs) are a flexible extension of Gaussian mixtures of linear regressions (GMLRs). The model assumes that the component regression functions (CRFs) are non-parametric functions of the covariate(s) whereas the component mixing proportions and variances are constants. Unfortunately, the model cannot be reliably estimated using traditional methods. A local-likelihood approach for estimating the CRFs requires that we maximize a set of local-likelihood functions. Using the Expectation-Maximization (EM) algorithm to separately maximize each local-likelihood function may lead to label-switching. This is because the posterior probabilities calculated at the local E-step are not guaranteed to be aligned. The consequence of this label-switching is wiggly and non-smooth estimates of the CRFs. In this paper, we propose a unified approach to address label-switching and obtain sensible estimates. The proposed approach has two stages. In the first stage, we propose a model-based approach to address the label-switching problem. We first note that each local-likelihood function is a likelihood function of a Gaussian mixture model (GMM). Next, we reformulate the SPGMNRs model as a mixture of these GMMs. Lastly, using a modified version of the Expectation Conditional Maximization (ECM) algorithm, we estimate the mixture of GMMs. In addition, using the mixing weights of the local GMMs, we can automatically choose the local points where local-likelihood estimation takes place. In the second stage, we propose one-step backfitting estimates of the parametric and non-parametric terms. The effectiveness of the proposed approach is demonstrated on simulated data and real data analysis. |
en_US |
dc.description.department |
Statistics |
en_US |
dc.description.librarian |
hj2024 |
en_US |
dc.description.sdg |
None |
en_US |
dc.description.sponsorship |
Open access funding provided by University of Pretoria. Partial financial support was received from STATOMET at the University of Pretoria and the New Generation of Academics programme (nGAP), Department of Higher Education, South Africa. |
en_US |
dc.description.uri |
http://link.springer.com/journal/11222 |
en_US |
dc.identifier.citation |
Skhosana, S.B., Millard, S.M. & Kanfer, F.H.J. A modified EM-type algorithm to estimate semi-parametric mixtures of non-parametric regressions. Statistics and Computing 34, 125 (2024). https://doi.org/10.1007/s11222-024-10435-3. |
en_US |
dc.identifier.issn |
0960-3174 (print) |
|
dc.identifier.issn |
1573-1375 (online) |
|
dc.identifier.uri |
http://hdl.handle.net/2263/96943 |
|
dc.language.iso |
en |
en_US |
dc.publisher |
Springer |
en_US |
dc.rights |
© The Author(s) 2024. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License. |
en_US |
dc.subject |
Semi-parametric Gaussian mixtures of non-parametric regressions (SPGMNRs) |
en_US |
dc.subject |
Gaussian mixtures of linear regressions (GMLRs) |
en_US |
dc.subject |
Component regression functions (CRFs) |
en_US |
dc.subject |
EM algorithm |
en_US |
dc.subject |
Local-likelihood |
en_US |
dc.subject |
Mixture models |
en_US |
dc.subject |
Local-polynomial regression |
en_US |
dc.subject |
Expectation-maximization (EM) |
en_US |
dc.title |
A modified EM-type algorithm to estimate semi-parametric mixtures of non-parametric regressions |
en_US |
dc.type |
Article |
en_US |