Efficient estimation and validation of shrinkage estimators in big data analytics

Show simple item record

dc.contributor.author Du Plessis, Salomi
dc.contributor.author Arashi, Mohammad
dc.contributor.author Maribe, Gaonyalelwe
dc.contributor.author Millard, Salomon M.
dc.date.accessioned 2024-05-30T10:07:44Z
dc.date.available 2024-05-30T10:07:44Z
dc.date.issued 2023-11
dc.description DATA AVAILABILITY STATEMENT: Data is available from the authors on request. en_US
dc.description.abstract Shrinkage estimators are often used to mitigate the consequences of multicollinearity in linear regression models. Despite the ease with which these techniques can be applied to small- or moderate-size datasets, they encounter significant challenges in the big data domain. Some of these challenges are that the volume of data often exceeds the storage capacity of a single computer and that the time required to obtain results becomes infeasible due to the computational burden of a high volume of data. We propose an algorithm for the efficient model estimation and validation of various well-known shrinkage estimators to be used in scenarios where the volume of the data is large. Our proposed algorithm utilises sufficient statistics that can be computed and updated at the row level, thus minimizing access to the entire dataset. A simulation study, as well as an application on a real-world dataset, illustrates the efficiency of the proposed approach. en_US
dc.description.department Statistics en_US
dc.description.sdg SDG-08:Decent work and economic growth en_US
dc.description.sdg SDG-09: Industry, innovation and infrastructure en_US
dc.description.sponsorship Iran National Science Foundation. en_US
dc.description.uri https://www.mdpi.com/journal/mathematics en_US
dc.identifier.citation du Plessis, S.; Arashi, M.; Maribe, G.; Millard, S.M. Efficient Estimation and Validation of Shrinkage Estimators in Big Data Analytics. Mathematics 2023, 11, 4632. https://doi.org/10.3390/math11224632. en_US
dc.identifier.issn 2227-7390 (online)
dc.identifier.other 10.3390/math11224632
dc.identifier.uri http://hdl.handle.net/2263/96300
dc.language.iso en en_US
dc.publisher MDPI en_US
dc.rights © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. en_US
dc.subject Big data en_US
dc.subject Efficient computation en_US
dc.subject Liu estimator en_US
dc.subject Matrix of sufficient statistics en_US
dc.subject Multicollinearity en_US
dc.subject Ridge estimator en_US
dc.subject SDG-08: Decent work and economic growth en_US
dc.title Efficient estimation and validation of shrinkage estimators in big data analytics en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record