Fitting non-parametric mixture of regressions: introducing an EM-type algorithm to address the label-switching problem

Authors

Skhosana, Sphiwe Bonakele
Kanfer, Frans H.J.
Millard, Sollie M.

Publisher

MDPI

Abstract

The non-parametric Gaussian mixture of regressions (NPGMRs) model serves as a flexible approach for the determination of latent heterogeneous regression relationships. This model assumes that the component means, variances and mixing proportions are smooth unknown functions of the covariates, and that the error distribution of each component is Gaussian and hence symmetric. These functions are estimated over a set of grid points using the Expectation-Maximization (EM) algorithm to maximize the local-likelihood functions. However, maximizing each local-likelihood function separately does not guarantee that the local responsibilities and corresponding labels, obtained at the E-step of the EM algorithm, align at each grid point, which leads to a label-switching problem. This results in non-smooth estimated component regression functions. In this paper, we propose an estimation procedure to account for label switching by tracking the roughness of the estimated component regression functions. We use the local responsibilities to obtain a global estimate of the responsibilities, which are then used to maximize each local-likelihood function. The performance of the proposed procedure is demonstrated using a simulation study and through an application using real-world data. In the case of well-separated mixture regression components, the procedure gives similar results to competitive methods. However, in the case of poorly separated mixture regression components, the procedure outperforms competitive methods.
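
The label-switching problem described above can be illustrated with a small toy sketch. It is not the authors' procedure, which works through the responsibilities inside the EM algorithm. It only shows how component estimates obtained independently at each grid point can come back in inconsistent label order, and how a simple roughness criterion detects and repairs this. The function names `roughness` and `align_labels`, the two toy mean functions and the choice of swapped grid points are all assumptions made for illustration.

```python
from itertools import permutations

import numpy as np


def roughness(curves):
    """Sum of squared second differences, taken down each column.

    Each column of `curves` is one estimated component function
    evaluated on the grid; smoother curves give smaller values.
    """
    return float(np.sum(np.diff(curves, n=2, axis=0) ** 2))


def align_labels(local_means):
    """Greedily relabel components from one grid point to the next.

    `local_means` has shape (n_grid, K): row g holds the K component
    means estimated at grid point g, in arbitrary label order.
    At each grid point the permutation closest (in squared distance)
    to the already-aligned previous row is kept. This is an
    illustrative heuristic, not the method proposed in the paper.
    """
    local_means = np.asarray(local_means, dtype=float)
    n_grid, k = local_means.shape
    aligned = local_means.copy()
    for g in range(1, n_grid):
        best = min(
            permutations(range(k)),
            key=lambda p: np.sum((local_means[g, list(p)] - aligned[g - 1]) ** 2),
        )
        aligned[g] = local_means[g, list(best)]
    return aligned


# Toy example: two smooth, well-separated component mean functions.
grid = np.linspace(0.0, 1.0, 21)
true_means = np.column_stack([np.sin(2 * np.pi * grid), 3.0 + grid])

# Simulate label switching by swapping the labels at a few grid points.
switched = true_means.copy()
swap_at = [3, 4, 10, 15]
switched[swap_at] = switched[swap_at][:, ::-1]

aligned = align_labels(switched)

print("roughness with switched labels:", roughness(switched))
print("roughness after alignment:     ", roughness(aligned))
```

Trying every permutation costs K! comparisons per grid point, which is acceptable only for a small number of components. With poorly separated components, nearest-neighbour matching of this kind can fail, which is the setting in which the abstract reports an advantage for the proposed procedure.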

Description

DATA AVAILABILITY STATEMENT : Publicly available datasets were analyzed in this study. These data can be found at https://databank.worldbank.org/source/world-development-indicators/ (accessed on 15 February 2022).

Keywords

Mixture models, Non-parametric regressions, Local-likelihood estimation, Label switching, Non-parametric Gaussian mixture of regression (NPGMR), Expectation-maximization algorithm

Citation

Skhosana, S.B.; Kanfer, F.H.J.; Millard, S.M. Fitting Non-Parametric Mixture of Regressions: Introducing an EM-Type Algorithm to Address the Label-Switching Problem. Symmetry 2022, 14, 1058. https://DOI.org/10.3390/sym14051058.