Contaminated models of reparameterised versions of the Dirichlet-multinomial distribution

dc.contributor.advisorMakgai, Seite
dc.contributor.coadvisorBekker, Andriette
dc.contributor.emailu20439530@tuks.co.za
dc.contributor.postgraduateVan Heerden, Ockert Johannes
dc.date.accessioned2026-02-11T07:18:43Z
dc.date.available2026-02-11T07:18:43Z
dc.date.created2025
dc.date.issued2025
dc.descriptionMini Dissertation
dc.description.abstractThe Dirichlet-Multinomial (DM) distribution is often used for the modelling of multivariate count data, which has been applied in diverse areas such as microbiome studies, genetics, and ecological analysis. Despite its wide use, the distribution lacks easily interpretable parameters and the ability to account for outliers. In this study, we propose a novel reconstruction/perspective of the DM distribution: namely, reparameterisation of the DM distribution, which will be utilised to develop contaminated versions. Two reparameterisations are considered: the first in terms of the mode and a parameter referred to as the pseudo-variance and the second in terms of the mean and another pseudo-variance parameter. Such reparameterisations improve interpretability and allow the further construction of contaminated models that are robust to outliers. We consider properties such as the derived probability mass functions and moments for the proposed models. Simulation studies evaluate these models under varying scenarios, comparing estimation accuracy, bias, and computational performance. The relevance of the proposed models is illustrated via a microbiome data application. The developments from this study enhance the flexibility of the DM distribution and reinforce its usefulness for analyzing modern complex datasets in the biological and statistical sciences.
dc.description.availabilityRestricted
dc.description.degreeMSc in Advanced Data Analytics
dc.description.departmentStatistics
dc.description.facultyFaculty of Natural and Agricultural Sciences
dc.description.sdgNone
dc.description.sponsorshipNone
dc.identifier.citation*
dc.identifier.doi10.25403/UPresearchdata.31303864
dc.identifier.otherM2026
dc.identifier.urihttp://hdl.handle.net/2263/108074
dc.language.isoen
dc.publisherUniversity of Pretoria
dc.rights© 2024 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectContaminated Models
dc.subjectDirichlet-multinomial
dc.subjectOutliers
dc.subjectOverdispersion
dc.subjectReparameterisation
dc.titleContaminated models of reparameterised versions of the Dirichlet-multinomial distribution
dc.typeMini Dissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Van_Heerden_Contaminated_2025.pdf
Size:
2.32 MB
Format:
Adobe Portable Document Format
Description:
Mini Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: