Assessment of explainable tree-based ensemble algorithms for the enhancement of Copernicus digital elevation model in agricultural lands

Show simple item record

dc.contributor.author Okolie, Chukwuma
dc.contributor.author Adeleke, Adedayo
dc.contributor.author Mills, Jon
dc.contributor.author Smit, Julian
dc.contributor.author Maduako, Ikechukwu
dc.contributor.author Bagheri, Hossein
dc.contributor.author Komar, Tom
dc.contributor.author Wang, Shidong
dc.date.accessioned 2025-03-13T05:44:53Z
dc.date.available 2025-03-13T05:44:53Z
dc.date.issued 2024-04-12
dc.description CODE AVAILABILITY STATEMENT : Code written in support of this publication is publicly available at https://github.com/mrjohnokolie/ dem-enhancement. en_US
dc.description DATA AVAILABILITY STATEMENT : On reasonable request, the corresponding author will provide data that support the findings of this study. en_US
dc.description.abstract There has been a rapid evolution of tree-based ensemble algorithms which have outperformed deep learning in several studies, thus emerging as a competitive solution for many applications. In this study, ten tree-based ensemble algorithms (random forest, bagging meta-estimator, adaptive boosting (AdaBoost), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), light gradient boosting (LightGBM), histogram-based GBM, categorical boosting (CatBoost), natural gradient boosting (NGBoost), and the regularised greedy forest (RGF)) were comparatively evaluated for the enhancement of Copernicus digital elevation model (DEM) in an agricultural landscape. The enhancement methodology combines elevation and terrain parameters alignment, with featurelevel fusion into a DEM enhancement workflow. The training dataset is comprised of eight DEM-derived predictor variables, and the target variable (elevation error). In terms of root mean square error (RMSE) reduction, the best enhancements were achieved by GBM, random forest and the regularised greedy forest at the first, second and third implementation sites respectively. The computational time for training LightGBM was nearly five-hundred times faster than NGBoost, and the speed of LightGBM was closely matched by the histogram-based GBM. Our results provide a knowledge base for other researchers to focus their optimisation strategies on the most promising algorithms. en_US
dc.description.department Geography, Geoinformatics and Meteorology en_US
dc.description.librarian am2024 en_US
dc.description.sdg SDG-09: Industry, innovation and infrastructure en_US
dc.description.sponsorship FUNDING : The research reported here was funded by the (i) Commonwealth Scholarship Commission and the Foreign, Commonwealth and Development Office in the UK (CSC ID: NGCN-2021-239) (ii) University of Cape Town. We are grateful for their support. All views expressed here are those of the author(s) not the funding bodies. en_US
dc.description.uri http://www.tandfonline.com/journals/tidf20 en_US
dc.identifier.citation Chukwuma Okolie, Adedayo Adeleke, Jon Mills, Julian Smit, Ikechukwu Maduako, Hossein Bagheri, Tom Komar & Shidong Wang (2024) Assessment of explainable tree-based ensemble algorithms for the enhancement of Copernicus digital elevation model in agricultural lands, International Journal of Image and Data Fusion, 15:4, 430-460, DOI:10.1080/19479832.2024.2329563. en_US
dc.identifier.issn 1947-9832 (print)
dc.identifier.issn 1947-9824 (online)
dc.identifier.other 10.1080/19479832.2024.2329563
dc.identifier.uri http://hdl.handle.net/2263/101459
dc.language.iso en en_US
dc.publisher Taylor and Francis en_US
dc.rights © 2024 The Author(s). This is an Open Access article distributed under the terms of the Creative Commons Attribution License. en_US
dc.subject Copernicus en_US
dc.subject Global digital elevation model en_US
dc.subject Machine learning en_US
dc.subject Tree-based ensembles en_US
dc.subject Bagging en_US
dc.subject Boosting en_US
dc.subject Gradient boosting en_US
dc.subject Explainability en_US
dc.subject Partial dependence en_US
dc.subject Light detection and ranging (LiDAR) en_US
dc.subject SDG-09: Industry, innovation and infrastructure en_US
dc.title Assessment of explainable tree-based ensemble algorithms for the enhancement of Copernicus digital elevation model in agricultural lands en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record