The advances of stemming algorithms in text analysis from 2013 to 2018

Show simple item record

dc.contributor.advisor Van Deventer, J.P.
dc.contributor.coadvisor Kruger, Rendani Maarten
dc.contributor.postgraduate Liu, Yi Yu
dc.date.accessioned 2019-10-09T14:23:01Z
dc.date.available 2019-10-09T14:23:01Z
dc.date.created 19/09/03
dc.date.issued 2019
dc.description Dissertation (MCom)--University of Pretoria, 2019.
dc.description.abstract Stemming is an activity within the pre-processing step of Text Analysis. It plays a role in the Text Analysis results. It drives Data Mining in fields such as Business Information Systems. Eight percent of existing organisational data that contributes Big Data is in an unstructured format. One of the focus areas within the concept of “Big Data” is the complexity of processing the data and being able to represent the results in such a way that they are easily understood. This challenge has been taken up by researchers over time. To determine the advances in Stemming Algorithm research, a systematic review was performed on articles on Stemming Algorithms published in journals from 2013 to 2018. Data was collected from accessible scholarly databases. The articles were then filtered by year and topic. The remaining articles were processed through a set of methodological quality criteria. The final articles were put through a bi-gram Text Analysis process to answer the research questions. The results concluded that the research focus for Stemming Algorithms has started to decrease as it reaches the plateau of productivity. The results show an evident drop in the collected articles from 58 in 2017 to 19 in 2018. Results show that information retrieval is still a common field of application for Stemming Algorithms. A major unexpected set of themes revolves around artificial intelligence, based on an increase in interest in this topic. Results show that a focus on Stemming Algorithms has shifted away from its development and moved towards its application. There is also a high interest in social media as an application of Stemming Algorithms. Future research suggestions include designing a Stemming Algorithm that would automatically and responsively adapt to the historical and morphological changes of language text.
dc.description.availability Unrestricted
dc.description.degree MCom
dc.description.department Informatics
dc.description.librarian TM2019
dc.identifier.citation Liu, YY 2019, The advances of stemming algorithms in text analysis from 2013 to 2018, MCom Dissertation, University of Pretoria, Pretoria, viewed yymmdd <http://hdl.handle.net/2263/71712>
dc.identifier.other S2019
dc.identifier.uri http://hdl.handle.net/2263/71712
dc.language.iso en
dc.publisher University of Pretoria
dc.rights © 2019 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject UCTD
dc.title The advances of stemming algorithms in text analysis from 2013 to 2018
dc.type Dissertation


Files in this item

This item appears in the following Collection(s)

Show simple item record