Automatic building footprint extraction using remote sensing data within the City of Cape Town

Show simple item record

dc.contributor.advisor Adeleke, Adedayo
dc.contributor.postgraduate Makungo, Khumeleni
dc.date.accessioned 2024-05-23T08:10:54Z
dc.date.available 2024-05-23T08:10:54Z
dc.date.created 2024-09-20
dc.date.issued 2023-11-30
dc.description Dissertation (MSc Geoinformatics)—University of Pretoria, 2023. en_US
dc.description.abstract In the City of Cape Town Metropolitan (CoCT), South Africa, GIS analysts currently delineate building footprints by digitizing aerial imagery and stereo-aerial images. This approach requires a lot of manual work. It takes a long time, is expensive, and inefficient. Recent studies have explored automatic and semi-automatic methods for extracting building footprints. Automatic extraction of building footprints from remotely sensed data is useful for urban planning, service delivery, and humanitarian efforts. However, there is currently no readily available method that can automatically extract footprints while considering the unique characteristics of the landscape, such as formal residential areas, industrial zones, and informal settlements. Therefore, the main goal of this research is to find a suitable and efficient spatial analysis method that accurately extracts building footprints of different sizes and shapes within the City of Cape Town, South Africa, using high-resolution aerial imagery and LiDAR-derived nDSM. To achieve this goal, a literature review is conducted to explore different building footprint extraction algorithms. The review identified Mask Regional Convolutional Neural Network (R-CNN) as an effective algorithm for instance segmentation and object extraction. Thus, an experiment is conducted to implement Mask R-CNN models that extract building footprints from aerial imagery and LiDAR-derived normalized Digital Surface Model (nDSM) for each of the three areas: formal residential, industrial, and informal settlements. The training focused on the Blaauwberg district, which includes formal residential areas, industrial zones, and informal settlements. Each trained model is separately tested on testing datasets for formal residential, industrial areas, and informal settlements. Evaluation metrics such as precision, recall, F1-score, and Average Precision (AP) score are calculated for each model to assess their performance in extracting building footprints from aerial imagery and LiDAR-derived nDSM in formal residential, industrial areas, and informal settlements. The Mask R-CNN algorithm proved to be very effective in extracting building footprints from high-resolution aerial imagery and LiDAR-derived nDSM in formal residential areas, achieving satisfactory precision, recall, F1-score, and AP score. In industrial areas, the Mask R-CNN algorithm is found to be highly effective in extracting footprints from LiDAR-derived nDSM. However, when extracting shacks in densely populated settlements, the Mask R-CNN algorithm performed inadequately, with an AP score of 0.28 and 0.31 from aerial imagery and LiDAR-derived nDSM, respectively. Nevertheless, the fusion of footprints extracted from LiDAR-derived nDSM and high-resolution aerial imagery improved the AP score to 0.52. Hence, this study concludes that the Mask R-CNN algorithm is highly effective in extracting building footprints in formal residential areas from both aerial imagery and LiDAR-derived nDSM, as well as industrial building footprints from LiDAR-derived nDSM. For optimal performance in informal settlements, the fusion of footprints extracted from aerial imagery and LiDAR-derived nDSM is necessary. Overall, these trained Mask R-CNN models demonstrated satisfactory performance. To enhance the existing 2D building footprint layer, these models can supplement by extracting building footprints. This updated layer will be more comprehensive and current. Various departments within the CoCT can utilize this layer for infrastructure planning, service delivery planning, land use planning, and change detection. For better performance, it is recommended to add more informal and industrial training datasets with sufficient roof variability. Fine-tuning the Mask R-CNN models will ensure accurate extraction of shacks and industrial building footprints by allowing the models to learn effectively. en_US
dc.description.availability Unrestricted en_US
dc.description.degree MSc (Geoinformatics) en_US
dc.description.department Geography, Geoinformatics and Meteorology en_US
dc.description.faculty Faculty of Natural and Agricultural Sciences en_US
dc.description.sdg SDG-09: Industry, innovation and infrastructure en_US
dc.description.sdg SDG-11: Sustainable cities and communities en_US
dc.identifier.citation * en_US
dc.identifier.doi 10.25403/UPresearchdata.25847125 en_US
dc.identifier.other S2024 en_US
dc.identifier.uri http://hdl.handle.net/2263/96189
dc.language.iso en_US en_US
dc.publisher University of Pretoria
dc.rights © 2023 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject UCTD en_US
dc.subject Sustainable Development Goals (SDGs) en_US
dc.subject Deep Learning en_US
dc.subject Mask R-CNN en_US
dc.subject Segmentation en_US
dc.subject High-Resolution aerial imagery en_US
dc.subject LiDAR en_US
dc.subject Normalized Digital Surface Model en_US
dc.subject Detection en_US
dc.subject Building Footprint en_US
dc.title Automatic building footprint extraction using remote sensing data within the City of Cape Town en_US
dc.type Dissertation en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record