Flexible finite automata-based algorithms for detecting microsatellites in DNA

Show simple item record

dc.contributor.advisor Kourie, Derrick G. en
dc.contributor.postgraduate De Ridder, Corne en
dc.date.accessioned 2013-09-07T11:11:38Z
dc.date.available 2010-09-13 en
dc.date.available 2013-09-07T11:11:38Z
dc.date.created 2010-05-09 en
dc.date.issued 2010-09-13 en
dc.date.submitted 2010-08-17 en
dc.description Dissertation (MSc)--University of Pretoria, 2010. en
dc.description.abstract Apart from contributing to Computer Science, this research also contributes to Bioinformatics, a subset of the subject discipline Computational Biology. The main focus of this dissertation is the development of a data-analytical and theoretical algorithm to contribute to the analysis of DNA, and in particular, to detect microsatellites. Microsatellites, considered in the context of this dissertation, refer to consecutive patterns contained by genomic sequences. A perfect tandem repeat is defined as a string of nucleotides which is repeated at least twice in a sequence. An approximate tandem repeat is a string of nucleotides repeated consecutively at least twice, with small differences between the instances. The research presented in this dissertation was inspired by molecular biologists who were discovered to be visually scanning genetic sequences in search of short approximate tandem repeats or so called microsatellites. The aim of this dissertation is to present three algorithms that search for short approximate tandem repeats. The algorithms comprise the implementation of finite automata. Thus the hypothesis posed is as follows: Finite automata can detect microsatellites effectively in DNA. "Effectively" includes the ability to fine-tune the detection process so that redundant data is avoided, and relevant data is not missed during search. In order to verify whether the hypothesis holds, three theoretical related algorithms have been proposed based on theorems from finite automaton theory. They are generically referred to as the FireìSat algorithms. These algorithms have been implemented, and the performance of FireìSat2 has been investigated and compared to other software packages. From the results obtained, it is clear that the performance of these algorithms differ in terms of attributes such as speed, memory consumption and extensibility. In respect of speed performance, FireìSat outperformed rival software packages. It will be seen that the FireìSat algorithms have several parameters that can be used to tune their search. It should be emphasized that these parameters have been devised in consultation with the intended user community, in order to enhance the usability of the software. It was found that the parameters of FireìSat can be set to detect more tandem repeats than rival software packages, but also tuned to limit the number of detected tandem repeats. Copyright en
dc.description.availability unrestricted en
dc.description.department Computer Science en
dc.identifier.citation De Ridder, C 2010, Flexible finite automata-based algorithms for detecting microsatellites in DNA, MSc dissertation, University of Pretoria, Pretoria, viewed yymmdd < http://hdl.handle.net/2263/27335 > en
dc.identifier.other C10/548/gm en
dc.identifier.upetdurl http://upetd.up.ac.za/thesis/available/etd-08172010-202532/ en
dc.identifier.uri http://hdl.handle.net/2263/27335
dc.language.iso en
dc.publisher University of Pretoria en_ZA
dc.rights © 2010, University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria. en
dc.subject Approximate tandem repeats en
dc.subject Regular expression en
dc.subject Finite automata en
dc.subject Microsatellites en
dc.subject UCTD en_US
dc.title Flexible finite automata-based algorithms for detecting microsatellites in DNA en
dc.type Dissertation en


Files in this item

This item appears in the following Collection(s)

Show simple item record