Assessing classification performance for sampled remote sensing data

Rangongo, Tshepiso Selaelo

UPSpace Home
→
University of Pretoria: Research Output
→
Theses and Dissertations (University of Pretoria)
→
View Item

We are excited to announce that the repository will soon undergo an upgrade, featuring a new look and feel along with several enhanced features to improve your experience. Please be on the lookout for further updates and announcements regarding the launch date. We appreciate your support and look forward to unveiling the improved platform soon.

Show simple item record

dc.contributor.advisor	Fabris-Rotelli, Inger Nicolette
dc.contributor.coadvisor	Thiede, Renate
dc.contributor.postgraduate	Rangongo, Tshepiso Selaelo
dc.date.accessioned	2023-02-13T13:05:17Z
dc.date.available	2023-02-13T13:05:17Z
dc.date.created	2023
dc.date.issued	2022
dc.description	Mini Dissertation (MSc)--University of Pretoria 2022.	en_US
dc.description.abstract	The volume of big data increases daily. Big data poses challenges in storage, management, processing, analysis and visualisation. One technique of handling big data is the use of subset or sample that is good representation of the data. For storage alleviation purposes, a subset of the big data can be obtained from metadata. This paper obtains metadata of a remote sensing image dataset for crop classification. This research proposes a sampling algorithm which makes use of multivariate stratification with the aim of obtaining a sample that best represents the population while minimising the number of images sampled. The proposed sampling algorithm performs effectively on a big spatial image dataset of crop types. The results are assessed by measuring the number of images sampled and as well as matching the proportionality of the population crop percentages. The samples obtained from the proposed algorithm are then used for land cover classification, these will be referred to as the proposed samples. An ensemble method called random forest is trained on the different samples and the accuracy is assessed. Precision, recall and F1-scores per crop type are computed as well as the overall accuracy. The random forest classifier performed best on the proposed sample with the least number of images, followed by the proposed sample with the second least number of images. The classifier performed better on the proposed samples than it did on the random samples as the proposed samples contained the most informative data. This research encourages the use of metadata for classification purposes as well as an effective way of sampling big data for crop classification.	en_US
dc.description.availability	Unrestricted	en_US
dc.description.degree	MSc	en_US
dc.description.department	Statistics	en_US
dc.description.sponsorship	NEPTTP	en_US
dc.identifier.citation	*	en_US
dc.identifier.other	A2023
dc.identifier.uri	https://repository.up.ac.za/handle/2263/89449
dc.language.iso	en	en_US
dc.publisher	University of Pretoria
dc.rights	© 2022 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject	UCTD	en_US
dc.subject	Sampling	en_US
dc.subject	Remote sensing	en_US
dc.subject	Crop classification	en_US
dc.title	Assessing classification performance for sampled remote sensing data	en_US
dc.type	Mini Dissertation	en_US