A modified EM-type algorithm to estimate semi-parametric mixtures of non-parametric regressions

Loading...
Thumbnail Image

Authors

Skhosana, Sphiwe Bonakele
Millard, Salomon M.
Kanfer, F.H.J. (Frans)

Journal Title

Journal ISSN

Volume Title

Publisher

Springer

Abstract

Semi-parametric Gaussian mixtures of non-parametric regressions (SPGMNRs) are a flexible extension of Gaussian mixtures of linear regressions (GMLRs). The model assumes that the component regression functions (CRFs) are non-parametric functions of the covariate(s) whereas the component mixing proportions and variances are constants. Unfortunately, the model cannot be reliably estimated using traditional methods. A local-likelihood approach for estimating the CRFs requires that we maximize a set of local-likelihood functions. Using the Expectation-Maximization (EM) algorithm to separately maximize each local-likelihood function may lead to label-switching. This is because the posterior probabilities calculated at the local E-step are not guaranteed to be aligned. The consequence of this label-switching is wiggly and non-smooth estimates of the CRFs. In this paper, we propose a unified approach to address label-switching and obtain sensible estimates. The proposed approach has two stages. In the first stage, we propose a model-based approach to address the label-switching problem. We first note that each local-likelihood function is a likelihood function of a Gaussian mixture model (GMM). Next, we reformulate the SPGMNRs model as a mixture of these GMMs. Lastly, using a modified version of the Expectation Conditional Maximization (ECM) algorithm, we estimate the mixture of GMMs. In addition, using the mixing weights of the local GMMs, we can automatically choose the local points where local-likelihood estimation takes place. In the second stage, we propose one-step backfitting estimates of the parametric and non-parametric terms. The effectiveness of the proposed approach is demonstrated on simulated data and real data analysis.

Description

AVAILABILITY OF DATA AND MATERIALS : All the data and software used in this research can be accessed through the link: Data and Software

Keywords

Semi-parametric Gaussian mixtures of non-parametric regressions (SPGMNRs), Gaussian mixtures of linear regressions (GMLRs), Component regression functions (CRFs), EM algorithm, Local-likelihood, Mixture models, Local-polynomial regression, Expectation-maximization (EM)

Sustainable Development Goals

None

Citation

Skhosana, S.B., Millard, S.M. & Kanfer, F.H.J. A modified EM-type algorithm to estimate semi-parametric mixtures of non-parametric regressions. Statistics and Computing 34, 125 (2024). https://doi.org/10.1007/s11222-024-10435-3.