Development of multi-locus barcodes for identification of bacterial strains and species in environmental samples using next generation sequencing technologies

Show simple item record

dc.contributor.advisor Reva, Oleg N.
dc.contributor.postgraduate Rotimi, Adeola Mujidat
dc.date.accessioned 2019-07-08T09:46:33Z
dc.date.available 2019-07-08T09:46:33Z
dc.date.created 2019/04/24
dc.date.issued 2018
dc.description Thesis (PhD)--University of Pretoria, 2018.
dc.description.abstract Metagenomic approaches have revealed the complexity of environmental microbiomes and the advancement in whole genome sequencing showed a significant level of genetic heterogeneity on species level. It has become clear that a superior pattern of bioactivity of bacteria applicable in biotechnology, as well as the enhanced virulence of pathogens, often requires distinguishing between closely related species or sub-species. Current methods for binning of metagenomic reads usually do not allow identification below the genus level and very often, stop at the level of families. In this work, an attempt was made to improve metagenome binning resolution by creating genome-specific barcodes, based on the core and accessory gene sequences. This protocol was implemented in novel software tools available for use and download from http://bargene.bi.up.ac.za/. The most abundant barcode genes from the core genomes were found to encode for ribosomal proteins, some other central metabolic genes and ABC transporters. The performance of the created metabarcode sequences was evaluated using artificially generated and publicly available metagenomic datasets. Furthermore, a program, Barcoding 2.0, was developed to align reads against barcode sequences and calculate various parameters for scoring the alignment results and individual barcodes. Taxonomic units were identified in metagenomic samples by comparison of the calculated barcode scores to set cut-off values. In the study, it was found that varying sample sizes, i.e. the number of reads in a metagenome and metabarcode lengths had no significant effect on the sensitivity and specificity of the algorithm. Receiver operating characteristics curves were calculated for different taxonomic groups based on the results of identification of the corresponding genomes in artificial metagenomic datasets and the reliability of distinguishing between species of the same genus or family by the program was close to 100%. The results showed that the novel online tool, BarcodeGenerator (http://bargene.bi.up.ac.za/), was an efficient approach to generating barcode sequences from a set of complete genomes provided by users. Another program, Barcoder 2.0, was made available from the same resource to enable efficient and practical use of metabarcodes for visualisation of distribution of organisms of interest in environmental and clinical samples.
dc.description.availability Unrestricted
dc.description.degree PhD
dc.description.department Biochemistry
dc.identifier.citation Rotimi, AM 2018, Development of multi-locus barcodes for identification of bacterial strains and species in environmental samples using next generation sequencing technologies, PhD Thesis, University of Pretoria, Pretoria, viewed yymmdd <http://hdl.handle.net/2263/70481>
dc.identifier.other A2019
dc.identifier.uri http://hdl.handle.net/2263/70481
dc.language.iso en
dc.publisher University of Pretoria
dc.rights © 2019 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject UCTD
dc.title Development of multi-locus barcodes for identification of bacterial strains and species in environmental samples using next generation sequencing technologies
dc.type Thesis


Files in this item

This item appears in the following Collection(s)

Show simple item record