An automated detection process for Eukaryotic ribonucleic acid (RNA) Polymerase II Promoter is presented in this dissertation. We employ an artificial neural network (ANN) in conjunction with features that were selected using an information-theoretic approach. Firstly an introduction is given where the problem is described briefly. Some background is given about the biological and genetic principles involved in DNA, RNA and Promoter detection. The automation process is described with each step given in detail. This includes the data information gathering, feature generation, and the full ANN process. The ANN section of the project is split up in a generation process, a training section as well as a testing section. Lastly the final detection program was tested and compared to other promoter detection systems. An improvement of at least 10% in positive prediction value (PPV) in comparison with current state-of-the-art solutions was obtained. Note: A Companion CD should accompany this report that contains all the program code and some of the source data that was used in this project. All the references to “Companion CD”, reference number  are references to these programs.acquisition process, how the different samples were split into different sets and statistical.
Dissertation (MEng (Computer Engineering))--University of Pretoria, 2007.