dc.contributor.author |
Du Plessis, Salomi
|
|
dc.contributor.author |
Arashi, Mohammad
|
|
dc.contributor.author |
Maribe, Gaonyalelwe
|
|
dc.contributor.author |
Millard, Salomon M.
|
|
dc.date.accessioned |
2024-05-30T10:07:44Z |
|
dc.date.available |
2024-05-30T10:07:44Z |
|
dc.date.issued |
2023-11 |
|
dc.description |
DATA AVAILABILITY STATEMENT: Data is available from the authors on request. |
en_US |
dc.description.abstract |
Shrinkage estimators are often used to mitigate the consequences of multicollinearity in
linear regression models. Despite the ease with which these techniques can be applied to small- or
moderate-size datasets, they encounter significant challenges in the big data domain. Some of these
challenges are that the volume of data often exceeds the storage capacity of a single computer and
that the time required to obtain results becomes infeasible due to the computational burden of a
high volume of data. We propose an algorithm for the efficient model estimation and validation of
various well-known shrinkage estimators to be used in scenarios where the volume of the data is
large. Our proposed algorithm utilises sufficient statistics that can be computed and updated at the
row level, thus minimizing access to the entire dataset. A simulation study, as well as an application
on a real-world dataset, illustrates the efficiency of the proposed approach. |
en_US |
dc.description.department |
Statistics |
en_US |
dc.description.sdg |
SDG-08:Decent work and economic growth |
en_US |
dc.description.sdg |
SDG-09: Industry, innovation and infrastructure |
en_US |
dc.description.sponsorship |
Iran National Science Foundation. |
en_US |
dc.description.uri |
https://www.mdpi.com/journal/mathematics |
en_US |
dc.identifier.citation |
du Plessis, S.; Arashi, M.; Maribe, G.; Millard, S.M. Efficient Estimation and Validation of Shrinkage Estimators in Big Data Analytics. Mathematics 2023, 11, 4632. https://doi.org/10.3390/math11224632. |
en_US |
dc.identifier.issn |
2227-7390 (online) |
|
dc.identifier.other |
10.3390/math11224632 |
|
dc.identifier.uri |
http://hdl.handle.net/2263/96300 |
|
dc.language.iso |
en |
en_US |
dc.publisher |
MDPI |
en_US |
dc.rights |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. |
en_US |
dc.subject |
Big data |
en_US |
dc.subject |
Efficient computation |
en_US |
dc.subject |
Liu estimator |
en_US |
dc.subject |
Matrix of sufficient statistics |
en_US |
dc.subject |
Multicollinearity |
en_US |
dc.subject |
Ridge estimator |
en_US |
dc.subject |
SDG-08: Decent work and economic growth |
en_US |
dc.title |
Efficient estimation and validation of shrinkage estimators in big data analytics |
en_US |
dc.type |
Article |
en_US |