Contaminated models of reparameterised versions of the Dirichlet-multinomial distribution
| dc.contributor.advisor | Makgai, Seite | |
| dc.contributor.coadvisor | Bekker, Andriette | |
| dc.contributor.email | u20439530@tuks.co.za | |
| dc.contributor.postgraduate | Van Heerden, Ockert Johannes | |
| dc.date.accessioned | 2026-02-11T07:18:43Z | |
| dc.date.available | 2026-02-11T07:18:43Z | |
| dc.date.created | 2025 | |
| dc.date.issued | 2025 | |
| dc.description | Mini Dissertation | |
| dc.description.abstract | The Dirichlet-Multinomial (DM) distribution is often used for the modelling of multivariate count data, which has been applied in diverse areas such as microbiome studies, genetics, and ecological analysis. Despite its wide use, the distribution lacks easily interpretable parameters and the ability to account for outliers. In this study, we propose a novel reconstruction/perspective of the DM distribution: namely, reparameterisation of the DM distribution, which will be utilised to develop contaminated versions. Two reparameterisations are considered: the first in terms of the mode and a parameter referred to as the pseudo-variance and the second in terms of the mean and another pseudo-variance parameter. Such reparameterisations improve interpretability and allow the further construction of contaminated models that are robust to outliers. We consider properties such as the derived probability mass functions and moments for the proposed models. Simulation studies evaluate these models under varying scenarios, comparing estimation accuracy, bias, and computational performance. The relevance of the proposed models is illustrated via a microbiome data application. The developments from this study enhance the flexibility of the DM distribution and reinforce its usefulness for analyzing modern complex datasets in the biological and statistical sciences. | |
| dc.description.availability | Restricted | |
| dc.description.degree | MSc in Advanced Data Analytics | |
| dc.description.department | Statistics | |
| dc.description.faculty | Faculty of Natural and Agricultural Sciences | |
| dc.description.sdg | None | |
| dc.description.sponsorship | None | |
| dc.identifier.citation | * | |
| dc.identifier.doi | 10.25403/UPresearchdata.31303864 | |
| dc.identifier.other | M2026 | |
| dc.identifier.uri | http://hdl.handle.net/2263/108074 | |
| dc.language.iso | en | |
| dc.publisher | University of Pretoria | |
| dc.rights | © 2024 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. | |
| dc.subject | Contaminated Models | |
| dc.subject | Dirichlet-multinomial | |
| dc.subject | Outliers | |
| dc.subject | Overdispersion | |
| dc.subject | Reparameterisation | |
| dc.title | Contaminated models of reparameterised versions of the Dirichlet-multinomial distribution | |
| dc.type | Mini Dissertation |
