A reproducible microbiome pipeline for the identification of horizontally acquired mobile genetic elements

dc.contributor.advisorMakhalanyane, Thulani P.
dc.contributor.coadvisorBezuidt, Oliver
dc.contributor.emailrudylu.monica@gmail.comen_US
dc.contributor.postgraduateMagabotha, Sade Monica
dc.date.accessioned2024-02-01T12:33:06Z
dc.date.available2024-02-01T12:33:06Z
dc.date.created2024-04
dc.date.issued2023
dc.descriptionDissertation (MSc (Bioinformatics))--University of Pretoria, 2023.en_US
dc.description.abstractMicrobial communities of bacteria, fungi, archaea, and viruses are the most numerically abundant biological entities and regulate ecosystem services. However, the intricate and complex community interactions complicate efforts to disentangle the precise mechanisms underlying their relationships. As a result, several fundamental insights regarding microbial communities remain unclear. For instance, we lack a clear understanding regarding the extent of intra and inter-community interactions among microbial communities. These interactions may have profound effects, due to the known importance of microbial communities as mediators of biogeochemical cycling. In addition to the biological complexity, there is a lack of computational tools providing functional insights regarding evolutionary insights including the extent of intra and inter-community interactions through processes such as horizontal gene transfer. Computational tools are important for understanding microbial interactions in complex ecosystems. To reduce this knowledge deficit, we developed BioGeochemical Horizontal Gene Transfer (BioG-HGT), an automated, standalone, and reproducible pipeline. This pipeline associates horizontally transferred biogeochemical genes with their mobile genetic elements. Using a metagenomic dataset, comprised of 33 million reads and a Linux Ubuntu 20.04 LTS system with 4 cores and 24Gb RAM, BioG-HGT analyzed the dataset within 9 hours. The resultant tab delimited file includes information on the metagenome assembled genome (MAG) bins produced from the input sequence dataset. The file also includes information regarding the position of the putative mobile genetic elements, and the corresponding biogeochemical cycling genes located in the same contig. Taken together, BioG-HGT provides an easy-to-use computational tool for understanding the adaptation of microbial communities to environmental disturbances. The pipeline expands the available resources for exploring environmental mobile genetic elements and their role in the dissemination of biogeochemical cycling genes. We anticipate that this tool will facilitate studies aimed at elucidating horizontal gene transfer events in environ- mental sequence datasets. BioG-HGT is available at https://github.com/Shaedeycool/BioG-HGT_wd.en_US
dc.description.availabilityUnrestricteden_US
dc.description.degreeMSc (Bioinformatics)en_US
dc.description.departmentBiochemistryen_US
dc.description.facultyFaculty of Natural and Agricultural Sciencesen_US
dc.description.sponsorshipNational Research Foundationen_US
dc.description.sponsorshipSouth African National Antarctic Programen_US
dc.description.sponsorshipEU Horizon 2020 project AtlantECOen_US
dc.description.sponsorshipCentre for High Performance Computing and the University of Pretoria Centre for Bioinformatics and Computational Biologyen_US
dc.identifier.citation*en_US
dc.identifier.doi10.25403/UPresearchdata.25053947en_US
dc.identifier.otherA2024en_US
dc.identifier.urihttp://hdl.handle.net/2263/94228
dc.language.isoenen_US
dc.publisherUniversity of Pretoria
dc.rights© 2023 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_US
dc.subjectMetagenomic analysisen_US
dc.subjectBiogeochemical cyclingen_US
dc.subjectMobile genetic elementsen_US
dc.subjectPipelineen_US
dc.subjectMicrobiomeen_US
dc.subjectNext generation sequencing
dc.subjectComputational biology
dc.subject.otherSustainable Development Goals (SDGs)
dc.subject.otherSDG-13: Climate action
dc.subject.otherNatural and agricultural sciences theses SDG-13
dc.subject.otherSDG-15: Life on land
dc.subject.otherNatural and agricultural sciences theses SDG-15
dc.titleA reproducible microbiome pipeline for the identification of horizontally acquired mobile genetic elementsen_US
dc.typeDissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Magabotha_Reproducible_2023.pdf
Size:
4.63 MB
Format:
Adobe Portable Document Format
Description:
Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: