Identifying predictive markers in complex samples of biogenic volatile compounds using GC×GC-TOFMS and machine learning

Show simple item record

dc.contributor.advisor Naudé, Yvette
dc.contributor.coadvisor Rohwer, Egmont Richard
dc.contributor.postgraduate Pretorius, Daniel T.
dc.date.accessioned 2022-12-22T12:05:16Z
dc.date.available 2022-12-22T12:05:16Z
dc.date.created 2023
dc.date.issued 2022
dc.description Dissertation (MSc (Chemistry))--University of Pretoria, 2022. en_US
dc.description.abstract Samples of biogenic VOCs are varied and complex, presenting a significant challenge to analytical scrutiny. This dual study investigates the applicability of comprehensive two-dimensional gas chromatography-time-of-flight mass spectrometry (GC×GC-TOFMS), in combination with machine learning, in identifying chemical markers — in the form of biogenic volatile organic compounds (VOCs) — as a tool of classification and prediction of discrete biological states. The first study (Identifying predictive volatile markers of genus for southern African Plectranthus and Coleus using GC×GC-TOFMS and machine learning) investigates foliar VOCs as markers of genus for southern African Plectranthus and Coleus species. The second study (Identifying predictive volatile markers of malaria infection from human skin using GC×GC-TOFMS and machine learning) investigates cutaneous VOCs from the human epidermis as markers of malaria-infection. GC×GC-TOFMS was used to analyse the relevant VOC analytes, and three machine learning algorithms (an elastic-net regression, a random forest and a support-vector machine) were used to construct models of the acquired data from a training set, and to make predictions — of genus, in the case of the first study, and on malaria-infection status, in the case of the second study — on samples from a testing set. For the first study (N=45 samples), a predictive accuracy as high as 90% was obtained (with a sensitivity of up to 100%), and a suite of sesquiterpenes (including α- and β-cubebene, β-ylangene, β-copaene, γ-cadinene and isogermacrene D) were identified as putative markers of genus Coleus. Though predictive models were not obtained in the case of the second study (N=52 samples), certain compounds were identified as being potential markers of a participant’s malaria-status. These include alcohols (such as (E)-2-octen-1-ol), sulphur species (such as isoamyl cyanide and isothiazole), and short- to long-chain aliphatic carboxylic acids (such as n-decanoic acid and 9-hexadecenoic acid). en_US
dc.description.availability Unrestricted en_US
dc.description.degree MSc (Chemistry) en_US
dc.description.department Chemistry en_US
dc.identifier.citation * en_US
dc.identifier.doi https://doi.org/10.25403/UPresearchdata.21603606 en_US
dc.identifier.other A2023
dc.identifier.uri https://repository.up.ac.za/handle/2263/88851
dc.identifier.uri DOI: https://doi.org/10.25403/UPresearchdata.21603606
dc.language.iso en en_US
dc.publisher University of Pretoria
dc.rights © 2022 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject UCTD en_US
dc.subject GC×GC-TOFMS en_US
dc.subject Volatile organic compounds en_US
dc.subject Machine learning en_US
dc.subject Chemical markers en_US
dc.subject Chemical standards en_US
dc.title Identifying predictive markers in complex samples of biogenic volatile compounds using GC×GC-TOFMS and machine learning en_US
dc.type Dissertation en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record