The capability of search tools to retrieve words with specific properties from large text collections

dc.contributor.authorBall, L.H. (Liezl Hilde)
dc.contributor.authorBothma, T.J.D. (Theodorus Jan Daniel)
dc.date.accessioned2021-05-21T05:33:23Z
dc.date.available2021-05-21T05:33:23Z
dc.date.issued2020-12
dc.description.abstractINTRODUCTION: With the increase in the availability of digital text collections for humanities researchers, tools to enable enhanced retrieval are required. If words with very specific properties could be retrieved from a text collection more accurate linguistic and other analyses can be made. There are a range of properties and metadata that could be specified for retrieval, from morphological data up to bibliographic data. Furthermore, the bibliographic data should not only be on item level but extended to the text-level. For example, in an anthology each section could be encoded with the author of that section. Such extended metadata will enable fine-grained retrieval. METHOD: In this study, current tools were evaluated to determine to what extent they allow users to retrieve words with specific properties from a text collection. ANALYSIS: The analysis is limited to the following criteria: interface design, metadata, search options, filtering and search results. RESULTS: Currently, it is not possible for a user to retrieve words with specific properties from a text collection. CONCLUSION: An extended set of metadata should be used to encode text to enable retrieval of words on a fine-grained level.en_ZA
dc.description.departmentInformation Scienceen_ZA
dc.description.librarianpm2021en_ZA
dc.description.urihttp://informationr.net/iren_ZA
dc.identifier.citationBall, L., & Bothma, T. (2020). The capability of search tools to retrieve words with specific properties from large text collections. In Proceedings of ISIC, the Information Behaviour Conference, Pretoria, South Africa, 28 September - 1 October, 2020. Information Research, 25(4), paper isic2030. Retrieved from http://InformationR.net/ir/25-4/isic2020/isic2030.html (Archived by the Internet Archive at https://bit.ly/3meU2cA) https://doi.org/10.47989/irisic2030en_ZA
dc.identifier.issn1368-1613 (online)
dc.identifier.other10.47989/irisic2030
dc.identifier.urihttp://hdl.handle.net/2263/79985
dc.language.isoenen_ZA
dc.publisherUniversity of Boråsen_ZA
dc.rights© The authors, 2020. This is an open access article licensed under a Creative Commons Attribution 4.0 International license.en_ZA
dc.subjectDigital texten_ZA
dc.subjectResearchen_ZA
dc.subjectSearch toolsen_ZA
dc.subjectInformation retrievalen_ZA
dc.subject.otherEngineering, built environment and information technology articles SDG-09
dc.subject.otherSDG-09: Industry, innovation and infrastructure
dc.titleThe capability of search tools to retrieve words with specific properties from large text collectionsen_ZA
dc.typeArticleen_ZA

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ball_Capability_2020.pdf
Size:
590.67 KB
Format:
Adobe Portable Document Format
Description:
Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: