Statistical analysis of grouped data

Crafford, Gretel

UPSpace Home
→
University of Pretoria: Research Output
→
Theses and Dissertations (University of Pretoria)
→
View Item

dc.contributor.advisor	Smit, Chris F.	en
dc.contributor.postgraduate	Crafford, Gretel	en
dc.date.accessioned	2013-09-07T01:44:26Z
dc.date.available	2008-07-10	en
dc.date.available	2013-09-07T01:44:26Z
dc.date.created	2007-04-13	en
dc.date.issued	2007	en
dc.date.submitted	2008-07-01	en
dc.description	Thesis (PhD (Mathematical Statistics))--University of Pretoria, 2007.	en
dc.description.abstract	The maximum likelihood (ML) estimation procedure of Matthews and Crowther (1995: A maximum likelihood estimation procedure when modelling in terms of constraints. South African Statistical Journal, 29, 29-51) is utilized to fit a continuous distribution to a grouped data set. This grouped data set may be a single frequency distribution or various frequency distributions that arise from a cross classification of several factors in a multifactor design. It will also be shown how to fit a bivariate normal distribution to a two-way contingency table where the two underlying continuous variables are jointly normally distributed. This thesis is organized in three different parts, each playing a vital role in the explanation of analysing grouped data with the ML estimation of Matthews and Crowther. In Part I the ML estimation procedure of Matthews and Crowther is formulated. This procedure plays an integral role and is implemented in all three parts of the thesis. In Part I the exponential distribution is fitted to a grouped data set to explain the technique. Two different formulations of the constraints are employed in the ML estimation procedure and provide identical results. The justification of the method is further motivated by a simulation study. Similar to the exponential distribution, the estimation of the normal distribution is also explained in detail. Part I is summarized in Chapter 5 where a general method is outlined to fit continuous distributions to a grouped data set. Distributions such as the Weibull, the log-logistic and the Pareto distributions can be fitted very effectively by formulating the vector of constraints in terms of a linear model. In Part II it is explained how to model a grouped response variable in a multifactor design. This multifactor design arise from a cross classification of the various factors or independent variables to be analysed. The cross classification of the factors results in a total of T cells, each containing a frequency distribution. Distribution fitting is done simultaneously to each of the T cells of the multifactor design. Distribution fitting is also done under the additional constraints that the parameters of the underlying continuous distributions satisfy a certain structure or design. The effect of the factors on the grouped response variable may be evaluated from this fitted design. Applications of a single-factor and a two-factor model are considered to demonstrate the versatility of the technique. A two-way contingency table where the two variables have an underlying bivariate normal distribution is considered in Part III. The estimation of the bivariate normal distribution reveals the complete underlying continuous structure between the two variables. The ML estimate of the correlation coefficient ρ is used to great effect to describe the relationship between the two variables. Apart from an application a simulation study is also provided to support the method proposed.	en
dc.description.availability	unrestricted	en
dc.description.department	Statistics	en
dc.identifier.citation	Crafford, G 2007, Statistical analysis of grouped data, PhD thesis, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/25968 >	en
dc.identifier.other	D377/gm	en
dc.identifier.upetdurl	http://upetd.up.ac.za/thesis/available/etd-07012008-072755/	en
dc.identifier.uri	http://hdl.handle.net/2263/25968
dc.language.iso		en
dc.publisher	University of Pretoria	en_ZA
dc.rights	©2007, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.	en
dc.subject	Grouped data set	en
dc.subject	UCTD	en_US
dc.title	Statistical analysis of grouped data	en
dc.type	Thesis	en