The capability of search tools to retrieve words with specific properties from large text collections

dc.contributor.author Ball, L.H. (Liezl Hilde)
dc.contributor.author Bothma, T.J.D. (Theodorus Jan Daniel)
dc.date.accessioned 2021-05-21T05:33:23Z
dc.date.available 2021-05-21T05:33:23Z
dc.date.issued 2020-12
dc.description.abstract INTRODUCTION: With the increase in the availability of digital text collections for humanities researchers, tools that enable enhanced retrieval are required. If words with very specific properties could be retrieved from a text collection, more accurate linguistic and other analyses could be made. A range of properties and metadata could be specified for retrieval, from morphological data up to bibliographic data. Furthermore, the bibliographic data should not only be recorded at item level but also extended to the text level. For example, in an anthology each section could be encoded with the author of that section. Such extended metadata will enable fine-grained retrieval. METHOD: In this study, current tools were evaluated to determine to what extent they allow users to retrieve words with specific properties from a text collection. ANALYSIS: The analysis is limited to the following criteria: interface design, metadata, search options, filtering and search results. RESULTS: Currently, it is not possible for a user to retrieve words with specific properties from a text collection. CONCLUSION: An extended set of metadata should be used to encode text to enable retrieval of words on a fine-grained level. en_ZA
dc.description.department Information Science en_ZA
dc.description.librarian pm2021 en_ZA
dc.description.uri http://informationr.net/ir en_ZA
dc.identifier.citation Ball, L., & Bothma, T. (2020). The capability of search tools to retrieve words with specific properties from large text collections. In Proceedings of ISIC, the Information Behaviour Conference, Pretoria, South Africa, 28 September - 1 October, 2020. Information Research, 25(4), paper isic2030. Retrieved from http://InformationR.net/ir/25-4/isic2020/isic2030.html (Archived by the Internet Archive at https://bit.ly/3meU2cA) https://doi.org/10.47989/irisic2030 en_ZA
dc.identifier.issn 1368-1613 (online)
dc.identifier.other 10.47989/irisic2030
dc.identifier.uri http://hdl.handle.net/2263/79985
dc.language.iso en en_ZA
dc.publisher University of Borås en_ZA
dc.rights © The authors, 2020. This is an open access article licensed under a Creative Commons Attribution 4.0 International license. en_ZA
dc.subject Digital text en_ZA
dc.subject Research en_ZA
dc.subject Search tools en_ZA
dc.subject Information retrieval en_ZA
dc.subject.other Engineering, built environment and information technology articles SDG-09
dc.subject.other SDG-09: Industry, innovation and infrastructure
dc.title The capability of search tools to retrieve words with specific properties from large text collections en_ZA
dc.type Article en_ZA

