Okolie, ChukwumaAdeleke, AdedayoMills, JonSmit, JulianMaduako, IkechukwuBagheri, HosseinKomar, TomWang, Shidong2025-03-132025-03-132024-04-12Chukwuma Okolie, Adedayo Adeleke, Jon Mills, Julian Smit, Ikechukwu Maduako, Hossein Bagheri, Tom Komar & Shidong Wang (2024) Assessment of explainable tree-based ensemble algorithms for the enhancement of Copernicus digital elevation model in agricultural lands, International Journal of Image and Data Fusion, 15:4, 430-460, DOI:10.1080/19479832.2024.2329563.1947-9832 (print)1947-9824 (online)10.1080/19479832.2024.2329563http://hdl.handle.net/2263/101459CODE AVAILABILITY STATEMENT : Code written in support of this publication is publicly available at https://github.com/mrjohnokolie/ dem-enhancement.DATA AVAILABILITY STATEMENT : On reasonable request, the corresponding author will provide data that support the findings of this study.There has been a rapid evolution of tree-based ensemble algorithms which have outperformed deep learning in several studies, thus emerging as a competitive solution for many applications. In this study, ten tree-based ensemble algorithms (random forest, bagging meta-estimator, adaptive boosting (AdaBoost), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), light gradient boosting (LightGBM), histogram-based GBM, categorical boosting (CatBoost), natural gradient boosting (NGBoost), and the regularised greedy forest (RGF)) were comparatively evaluated for the enhancement of Copernicus digital elevation model (DEM) in an agricultural landscape. The enhancement methodology combines elevation and terrain parameters alignment, with featurelevel fusion into a DEM enhancement workflow. The training dataset is comprised of eight DEM-derived predictor variables, and the target variable (elevation error). In terms of root mean square error (RMSE) reduction, the best enhancements were achieved by GBM, random forest and the regularised greedy forest at the first, second and third implementation sites respectively. The computational time for training LightGBM was nearly five-hundred times faster than NGBoost, and the speed of LightGBM was closely matched by the histogram-based GBM. Our results provide a knowledge base for other researchers to focus their optimisation strategies on the most promising algorithms.en© 2024 The Author(s). This is an Open Access article distributed under the terms of the Creative Commons Attribution License.CopernicusGlobal digital elevation modelMachine learningTree-based ensemblesBaggingBoostingGradient boostingExplainabilityPartial dependenceLight detection and ranging (LiDAR)SDG-09: Industry, innovation and infrastructureAssessment of explainable tree-based ensemble algorithms for the enhancement of Copernicus digital elevation model in agricultural landsArticle