The capability of search tools to retrieve words with specific properties from large text collections
dc.contributor.author | Ball, L.H. (Liezl Hilde) | |
dc.contributor.author | Bothma, T.J.D. (Theodorus Jan Daniel) | |
dc.date.accessioned | 2021-05-21T05:33:23Z | |
dc.date.available | 2021-05-21T05:33:23Z | |
dc.date.issued | 2020-12 | |
dc.description.abstract | INTRODUCTION: With the increase in the availability of digital text collections for humanities researchers, tools to enable enhanced retrieval are required. If words with very specific properties could be retrieved from a text collection more accurate linguistic and other analyses can be made. There are a range of properties and metadata that could be specified for retrieval, from morphological data up to bibliographic data. Furthermore, the bibliographic data should not only be on item level but extended to the text-level. For example, in an anthology each section could be encoded with the author of that section. Such extended metadata will enable fine-grained retrieval. METHOD: In this study, current tools were evaluated to determine to what extent they allow users to retrieve words with specific properties from a text collection. ANALYSIS: The analysis is limited to the following criteria: interface design, metadata, search options, filtering and search results. RESULTS: Currently, it is not possible for a user to retrieve words with specific properties from a text collection. CONCLUSION: An extended set of metadata should be used to encode text to enable retrieval of words on a fine-grained level. | en_ZA |
dc.description.department | Information Science | en_ZA |
dc.description.librarian | pm2021 | en_ZA |
dc.description.uri | http://informationr.net/ir | en_ZA |
dc.identifier.citation | Ball, L., & Bothma, T. (2020). The capability of search tools to retrieve words with specific properties from large text collections. In Proceedings of ISIC, the Information Behaviour Conference, Pretoria, South Africa, 28 September - 1 October, 2020. Information Research, 25(4), paper isic2030. Retrieved from http://InformationR.net/ir/25-4/isic2020/isic2030.html (Archived by the Internet Archive at https://bit.ly/3meU2cA) https://doi.org/10.47989/irisic2030 | en_ZA |
dc.identifier.issn | 1368-1613 (online) | |
dc.identifier.other | 10.47989/irisic2030 | |
dc.identifier.uri | http://hdl.handle.net/2263/79985 | |
dc.language.iso | en | en_ZA |
dc.publisher | University of Borås | en_ZA |
dc.rights | © The authors, 2020. This is an open access article licensed under a Creative Commons Attribution 4.0 International license. | en_ZA |
dc.subject | Digital text | en_ZA |
dc.subject | Research | en_ZA |
dc.subject | Search tools | en_ZA |
dc.subject | Information retrieval | en_ZA |
dc.subject.other | Engineering, built environment and information technology articles SDG-09 | |
dc.subject.other | SDG-09: Industry, innovation and infrastructure | |
dc.title | The capability of search tools to retrieve words with specific properties from large text collections | en_ZA |
dc.type | Article | en_ZA |