Assessing classification performance for sampled remote sensing data

Show simple item record

dc.contributor.advisor Fabris-Rotelli, Inger Nicolette
dc.contributor.coadvisor Thiede, Renate
dc.contributor.postgraduate Rangongo, Tshepiso Selaelo
dc.date.accessioned 2023-02-13T13:05:17Z
dc.date.available 2023-02-13T13:05:17Z
dc.date.created 2023
dc.date.issued 2022
dc.description Mini Dissertation (MSc)--University of Pretoria 2022. en_US
dc.description.abstract The volume of big data increases daily. Big data poses challenges in storage, management, processing, analysis and visualisation. One technique of handling big data is the use of subset or sample that is good representation of the data. For storage alleviation purposes, a subset of the big data can be obtained from metadata. This paper obtains metadata of a remote sensing image dataset for crop classification. This research proposes a sampling algorithm which makes use of multivariate stratification with the aim of obtaining a sample that best represents the population while minimising the number of images sampled. The proposed sampling algorithm performs effectively on a big spatial image dataset of crop types. The results are assessed by measuring the number of images sampled and as well as matching the proportionality of the population crop percentages. The samples obtained from the proposed algorithm are then used for land cover classification, these will be referred to as the proposed samples. An ensemble method called random forest is trained on the different samples and the accuracy is assessed. Precision, recall and F1-scores per crop type are computed as well as the overall accuracy. The random forest classifier performed best on the proposed sample with the least number of images, followed by the proposed sample with the second least number of images. The classifier performed better on the proposed samples than it did on the random samples as the proposed samples contained the most informative data. This research encourages the use of metadata for classification purposes as well as an effective way of sampling big data for crop classification. en_US
dc.description.availability Unrestricted en_US
dc.description.degree MSc en_US
dc.description.department Statistics en_US
dc.description.sponsorship NEPTTP en_US
dc.identifier.citation * en_US
dc.identifier.other A2023
dc.identifier.uri https://repository.up.ac.za/handle/2263/89449
dc.language.iso en en_US
dc.publisher University of Pretoria
dc.rights © 2022 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject UCTD en_US
dc.subject Sampling en_US
dc.subject Remote sensing en_US
dc.subject Crop classification en_US
dc.title Assessing classification performance for sampled remote sensing data en_US
dc.type Mini Dissertation en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record