Identifying predictive markers in complex samples of biogenic volatile compounds using GC×GC-TOFMS and machine learning

dc.contributor.advisorNaudé, Yvette
dc.contributor.coadvisorRohwer, Egmont Richard
dc.contributor.emailu27364870@tuks.co.zaen_US
dc.contributor.postgraduatePretorius, Daniel T.
dc.date.accessioned2022-12-22T12:05:16Z
dc.date.available2022-12-22T12:05:16Z
dc.date.created2023
dc.date.issued2022
dc.descriptionDissertation (MSc (Chemistry))--University of Pretoria, 2022.en_US
dc.description.abstractSamples of biogenic VOCs are varied and complex, presenting a significant challenge to analytical scrutiny. This dual study investigates the applicability of comprehensive two-dimensional gas chromatography-time-of-flight mass spectrometry (GC×GC-TOFMS), in combination with machine learning, in identifying chemical markers — in the form of biogenic volatile organic compounds (VOCs) — as a tool of classification and prediction of discrete biological states. The first study (Identifying predictive volatile markers of genus for southern African Plectranthus and Coleus using GC×GC-TOFMS and machine learning) investigates foliar VOCs as markers of genus for southern African Plectranthus and Coleus species. The second study (Identifying predictive volatile markers of malaria infection from human skin using GC×GC-TOFMS and machine learning) investigates cutaneous VOCs from the human epidermis as markers of malaria-infection. GC×GC-TOFMS was used to analyse the relevant VOC analytes, and three machine learning algorithms (an elastic-net regression, a random forest and a support-vector machine) were used to construct models of the acquired data from a training set, and to make predictions — of genus, in the case of the first study, and on malaria-infection status, in the case of the second study — on samples from a testing set. For the first study (N=45 samples), a predictive accuracy as high as 90% was obtained (with a sensitivity of up to 100%), and a suite of sesquiterpenes (including α- and β-cubebene, β-ylangene, β-copaene, γ-cadinene and isogermacrene D) were identified as putative markers of genus Coleus. Though predictive models were not obtained in the case of the second study (N=52 samples), certain compounds were identified as being potential markers of a participant’s malaria-status. These include alcohols (such as (E)-2-octen-1-ol), sulphur species (such as isoamyl cyanide and isothiazole), and short- to long-chain aliphatic carboxylic acids (such as n-decanoic acid and 9-hexadecenoic acid).en_US
dc.description.availabilityUnrestricteden_US
dc.description.degreeMSc (Chemistry)en_US
dc.description.departmentChemistryen_US
dc.identifier.citation*en_US
dc.identifier.doihttps://doi.org/10.25403/UPresearchdata.21603606en_US
dc.identifier.otherA2023
dc.identifier.urihttps://repository.up.ac.za/handle/2263/88851
dc.identifier.uriDOI: https://doi.org/10.25403/UPresearchdata.21603606
dc.language.isoenen_US
dc.publisherUniversity of Pretoria
dc.rights© 2022 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_US
dc.subjectGC×GC-TOFMSen_US
dc.subjectVolatile organic compoundsen_US
dc.subjectMachine learningen_US
dc.subjectChemical markersen_US
dc.subjectChemical standardsen_US
dc.titleIdentifying predictive markers in complex samples of biogenic volatile compounds using GC×GC-TOFMS and machine learningen_US
dc.typeDissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Pretorius_Identifying_2022.pdf
Size:
13.67 MB
Format:
Adobe Portable Document Format
Description:
Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: