Assessing classification performance for sampled remote sensing data

dc.contributor.advisorFabris-Rotelli, Inger Nicolette
dc.contributor.coadvisorThiede, Renate
dc.contributor.emailu17052395@tuks.co.zaen_US
dc.contributor.postgraduateRangongo, Tshepiso Selaelo
dc.date.accessioned2023-02-13T13:05:17Z
dc.date.available2023-02-13T13:05:17Z
dc.date.created2023
dc.date.issued2022
dc.descriptionMini Dissertation (MSc)--University of Pretoria 2022.en_US
dc.description.abstractThe volume of big data increases daily. Big data poses challenges in storage, management, processing, analysis and visualisation. One technique of handling big data is the use of subset or sample that is good representation of the data. For storage alleviation purposes, a subset of the big data can be obtained from metadata. This paper obtains metadata of a remote sensing image dataset for crop classification. This research proposes a sampling algorithm which makes use of multivariate stratification with the aim of obtaining a sample that best represents the population while minimising the number of images sampled. The proposed sampling algorithm performs effectively on a big spatial image dataset of crop types. The results are assessed by measuring the number of images sampled and as well as matching the proportionality of the population crop percentages. The samples obtained from the proposed algorithm are then used for land cover classification, these will be referred to as the proposed samples. An ensemble method called random forest is trained on the different samples and the accuracy is assessed. Precision, recall and F1-scores per crop type are computed as well as the overall accuracy. The random forest classifier performed best on the proposed sample with the least number of images, followed by the proposed sample with the second least number of images. The classifier performed better on the proposed samples than it did on the random samples as the proposed samples contained the most informative data. This research encourages the use of metadata for classification purposes as well as an effective way of sampling big data for crop classification.en_US
dc.description.availabilityUnrestricteden_US
dc.description.degreeMScen_US
dc.description.departmentStatisticsen_US
dc.description.sponsorshipNEPTTPen_US
dc.identifier.citation*en_US
dc.identifier.otherA2023
dc.identifier.urihttps://repository.up.ac.za/handle/2263/89449
dc.language.isoenen_US
dc.publisherUniversity of Pretoria
dc.rights© 2022 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_US
dc.subjectSamplingen_US
dc.subjectRemote sensingen_US
dc.subjectCrop classificationen_US
dc.titleAssessing classification performance for sampled remote sensing dataen_US
dc.typeMini Dissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MSc mini-dissertation.pdf
Size:
5.73 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: