Machines against malaria : artificial intelligence classification models to advance antimalarial drug discovery

dc.contributor.advisorBirkholtz, Lyn-Marie
dc.contributor.coadvisorPillay, Nelishia
dc.contributor.emailu14020590@tuks.co.zaen_US
dc.contributor.postgraduateVan Heerden, Ashleigh
dc.date.accessioned2024-02-13T09:13:12Z
dc.date.available2024-02-13T09:13:12Z
dc.date.created2024-05
dc.date.issued2024-02-12
dc.descriptionThesis (PhD (Biochemistry))--University of Pretoria, 2024.en_US
dc.description.abstractEfficacy data from diverse chemical libraries, screened against the various stages of the malaria parasite Plasmodium falciparum, including asexual blood stage (ABS) parasites and transmissible gametocytes, serve as a valuable reservoir of information on the chemical space of compounds that are either active (or not) against the parasite. We postulated that this data can be mined to define chemical features associated with the sole ABS activity and/or those that provide additional life cycle activity profiles like gametocytocidal activity. Additionally, this information could provide chemical features associated with inactive compounds, which could eliminate any future unnecessary screening of similar chemical analogs. Therefore, we aimed to use machine learning to identify the chemical space associated with stage-specific antimalarial activity. We collected data from various chemical libraries that were screened against the asexual (126 374 compounds) and sexual (gametocyte) stages of the parasite (93 941 compounds), calculated the compounds’ molecular fingerprints, and trained machine learning models to recognize stage-specific active and inactive compounds. We were able to build several models that predict compound activity against ABS and dual activity against ABS and gametocytes, with Support Vector Machines (SVM) showing superior abilities with high recall (90 and 66%) and low false-positive predictions (15 and 1%). This allowed the identification of chemical features enriched in active and inactive populations, an important outcome that could be mined for essential chemical features to streamline hit-to-lead optimization strategies of antimalarial candidates. The predictive capabilities of the models held true in diverse chemical spaces, indicating that the ML models are therefore robust and can serve as a prioritization tool to drive and guide phenotypic screening and medicinal chemistry programs.en_US
dc.description.availabilityUnrestricteden_US
dc.description.degreePhD (Biochemistry)en_US
dc.description.departmentBiochemistryen_US
dc.description.facultyFaculty of Natural and Agricultural Sciencesen_US
dc.description.sponsorshipThis work was supported by the South African Department of Science and Innovation and National Research Foundation South African Research Chairs Initiative Grant (LMB UID:84627).en_US
dc.identifier.citation*en_US
dc.identifier.doihttps://doi.org/10.25403/UPresearchdata.25205870en_US
dc.identifier.otherA2024en_US
dc.identifier.urihttp://hdl.handle.net/2263/94522
dc.language.isoenen_US
dc.publisherUniversity of Pretoria
dc.rights© 2023 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_US
dc.subjectMachine learningen_US
dc.subjectP. falciparumen_US
dc.subjectECFPen_US
dc.subjectTransfer learningen_US
dc.subjectCluster-based undersamplingen_US
dc.subjectDual-activityen_US
dc.subjectUMAP
dc.subject.otherSustainable Development Goals (SDGs)
dc.subject.otherSDG-01: No poverty
dc.subject.otherNatural and agricultural sciences theses SDG-01
dc.subject.otherSDG-03: Good health and well-being
dc.subject.otherNatural and agricultural sciences theses SDG-03
dc.titleMachines against malaria : artificial intelligence classification models to advance antimalarial drug discoveryen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
VanHeerden_Machines_2024.pdf
Size:
9.33 MB
Format:
Adobe Portable Document Format
Description:
Thesis

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: