Skhosana, Sphiwe BonakeleMillard, Salomon M.Kanfer, F.H.J. (Frans)2024-07-112024-07-112024-05Skhosana, S.B., Millard, S.M. & Kanfer, F.H.J. A modified EM-type algorithm to estimate semi-parametric mixtures of non-parametric regressions. Statistics and Computing 34, 125 (2024). https://doi.org/10.1007/s11222-024-10435-3.0960-3174 (print)1573-1375 (online)http://hdl.handle.net/2263/96943AVAILABILITY OF DATA AND MATERIALS : All the data and software used in this research can be accessed through the link: Data and SoftwareSemi-parametric Gaussian mixtures of non-parametric regressions (SPGMNRs) are a flexible extension of Gaussian mixtures of linear regressions (GMLRs). The model assumes that the component regression functions (CRFs) are non-parametric functions of the covariate(s) whereas the component mixing proportions and variances are constants. Unfortunately, the model cannot be reliably estimated using traditional methods. A local-likelihood approach for estimating the CRFs requires that we maximize a set of local-likelihood functions. Using the Expectation-Maximization (EM) algorithm to separately maximize each local-likelihood function may lead to label-switching. This is because the posterior probabilities calculated at the local E-step are not guaranteed to be aligned. The consequence of this label-switching is wiggly and non-smooth estimates of the CRFs. In this paper, we propose a unified approach to address label-switching and obtain sensible estimates. The proposed approach has two stages. In the first stage, we propose a model-based approach to address the label-switching problem. We first note that each local-likelihood function is a likelihood function of a Gaussian mixture model (GMM). Next, we reformulate the SPGMNRs model as a mixture of these GMMs. Lastly, using a modified version of the Expectation Conditional Maximization (ECM) algorithm, we estimate the mixture of GMMs. In addition, using the mixing weights of the local GMMs, we can automatically choose the local points where local-likelihood estimation takes place. In the second stage, we propose one-step backfitting estimates of the parametric and non-parametric terms. The effectiveness of the proposed approach is demonstrated on simulated data and real data analysis.en© The Author(s) 2024. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License.Semi-parametric Gaussian mixtures of non-parametric regressions (SPGMNRs)Gaussian mixtures of linear regressions (GMLRs)Component regression functions (CRFs)EM algorithmLocal-likelihoodMixture modelsLocal-polynomial regressionExpectation-maximization (EM)A modified EM-type algorithm to estimate semi-parametric mixtures of non-parametric regressionsArticle