Modelling of highly skewed longitudinal count data based on the discrete Weibull distribution

Show simple item record

dc.contributor.advisor Burger, Divan A.
dc.contributor.postgraduate Nel, Helene Mari
dc.date.accessioned 2021-01-21T08:13:59Z
dc.date.available 2021-01-21T08:13:59Z
dc.date.created 2021
dc.date.issued 2021
dc.description Mini Dissertation (MSc (Advanced Data Analytics))--University of Pretoria, 2021. en_ZA
dc.description.abstract Longitudinal data refer to multiple observations collected on the same subject (or unit) over time. Zero-inflated data (containing many zeros) frequently occur, resulting in overdispersion in count data. Regression models used to analyze count data are often based on the Poisson and negative binomial (NB) distribution. The Poisson distribution is restrictive when count data are overdispersed; the regression model can, therefore, give inappropriate fits when the variability in the data is larger or smaller than the theoretical variance. These two cases are, respectively, referred to as overdispersion and underdispersion. The NB distribution handles overdispersed data better compared to the Poisson distribution, but not underdispersed data. Another problem with the NB distribution is that it does not accommodate heavy-tailed or highly skewed data well. In this study, the discrete Weibull (DW) and the zero-inflated DW (ZIDW) distributions are explored in a mixed model context that models the median using a Bayesian approach. In contrast, the conventional NB and ZINB mixed-effects regression models model the mean counts over time. Results from the four mixed-effects regression models are compared. It is observed that the Bayesian DW and ZIDW mixed-effects regression models are computationally competitive with the Bayesian NB and ZINB mixed-effects regression models concerning flexibility, implementation, and convergence speed. The DW and ZIDW models are found to be excellent choices to model highly skewed longitudinal count data. en_ZA
dc.description.availability Unrestricted en_ZA
dc.description.degree MSc (Advanced Data Analytics) en_ZA
dc.description.department Statistics en_ZA
dc.description.sponsorship NRF en_ZA
dc.identifier.citation * en_ZA
dc.identifier.other A2021 en_ZA
dc.identifier.uri http://hdl.handle.net/2263/78073
dc.language.iso en en_ZA
dc.publisher University of Pretoria
dc.rights © 2019 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject UCTD en_ZA
dc.subject Mathematical statistics 895 (WST 895) en_ZA
dc.title Modelling of highly skewed longitudinal count data based on the discrete Weibull distribution en_ZA
dc.type Mini Dissertation en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record