Modelling of highly skewed longitudinal count data based on the discrete Weibull distribution

dc.contributor.advisorBurger, Divan A.
dc.contributor.emailnlientjie@gmail.comen_ZA
dc.contributor.postgraduateNel, Helene Mari
dc.date.accessioned2021-01-21T08:13:59Z
dc.date.available2021-01-21T08:13:59Z
dc.date.created2021
dc.date.issued2021
dc.descriptionMini Dissertation (MSc (Advanced Data Analytics))--University of Pretoria, 2021.en_ZA
dc.description.abstractLongitudinal data refer to multiple observations collected on the same subject (or unit) over time. Zero-inflated data (containing many zeros) frequently occur, resulting in overdispersion in count data. Regression models used to analyze count data are often based on the Poisson and negative binomial (NB) distribution. The Poisson distribution is restrictive when count data are overdispersed; the regression model can, therefore, give inappropriate fits when the variability in the data is larger or smaller than the theoretical variance. These two cases are, respectively, referred to as overdispersion and underdispersion. The NB distribution handles overdispersed data better compared to the Poisson distribution, but not underdispersed data. Another problem with the NB distribution is that it does not accommodate heavy-tailed or highly skewed data well. In this study, the discrete Weibull (DW) and the zero-inflated DW (ZIDW) distributions are explored in a mixed model context that models the median using a Bayesian approach. In contrast, the conventional NB and ZINB mixed-effects regression models model the mean counts over time. Results from the four mixed-effects regression models are compared. It is observed that the Bayesian DW and ZIDW mixed-effects regression models are computationally competitive with the Bayesian NB and ZINB mixed-effects regression models concerning flexibility, implementation, and convergence speed. The DW and ZIDW models are found to be excellent choices to model highly skewed longitudinal count data.en_ZA
dc.description.availabilityUnrestricteden_ZA
dc.description.degreeMSc (Advanced Data Analytics)en_ZA
dc.description.departmentStatisticsen_ZA
dc.description.sponsorshipNRFen_ZA
dc.identifier.citation*en_ZA
dc.identifier.otherA2021en_ZA
dc.identifier.urihttp://hdl.handle.net/2263/78073
dc.language.isoenen_ZA
dc.publisherUniversity of Pretoria
dc.rights© 2019 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_ZA
dc.subjectMathematical statistics 895 (WST 895)en_ZA
dc.titleModelling of highly skewed longitudinal count data based on the discrete Weibull distributionen_ZA
dc.typeMini Dissertationen_ZA

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Nel_Modelling_2021.pdf
Size:
2.89 MB
Format:
Adobe Portable Document Format
Description:
Mini Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: