A few-shot learning approach for a multilingual agro-information question answering system

dc.contributor.advisorMarivate, Vukosi
dc.contributor.coadvisorNabende, Joyce
dc.contributor.emailthefiskanibanda@gmail.comen_US
dc.contributor.postgraduateBanda, Fiskani Ella
dc.date.accessioned2024-09-12T09:57:45Z
dc.date.available2024-09-12T09:57:45Z
dc.date.created2024-04
dc.date.issued2023-12
dc.descriptionMini Dissertation (MIT (Big Data Science))--University of Pretoria, 2023.en_US
dc.description.abstractAgriculture plays a crucial role in numerous households across Sub-Saharan Africa. Developing a question answering system that utilizes agricultural expertise and agro-information can effectively bridge the support gap for farmers in the local community. Most advances in question answering research involve large language models trained on extensive data. Nevertheless, the conventional approach of fine-tuning has demonstrated a significant decline in performance when models are fine-tuned on a small amount of data. This decline is primarily attributed to the disparities between the objectives of pretraining and fine-tuning. One proposed alternative is to utilize prompt-based fine-tuning, which permits the model to be fine-tuned with only a few examples. Extensive research has been done on the application of these methods to tasks such as text classification and not question answering. This research aims to study the feasibility of recent fewshot learning approaches, such as FewshotQA and Null prompting, for domain-specific agricultural data in 4 South African languages. We evaluated the overall performance of these approaches and investigated the effects of adapting these approaches for cross-lingual extractive question answering of domain-specific data. The results obtained in this study have shown valuable insight into the applicability of these methods to domain-specific data. These results have shown that these methods are capable of adequately capturing the textual information of domain-specific data from the initial subset of data points. Thus, there is potential for using these methods as a practical solution for limited data.en_US
dc.description.availabilityUnrestricteden_US
dc.description.degreeMIT (Big Data Science)en_US
dc.description.departmentComputer Scienceen_US
dc.description.facultyFaculty of Engineering, Built Environment and Information Technologyen_US
dc.identifier.citation*en_US
dc.identifier.otherA2024en_US
dc.identifier.urihttp://hdl.handle.net/2263/98153
dc.language.isoenen_US
dc.publisherUniversity of Pretoria
dc.rights© 2021 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_US
dc.subjectNatural language processing (NLP)en_US
dc.subjectLow resource languagesen_US
dc.subjectExtractive question answeringen_US
dc.subjectCross-lingualen_US
dc.titleA few-shot learning approach for a multilingual agro-information question answering systemen_US
dc.typeMini Dissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Banda_Few-shot_2023.pdf
Size:
6.13 MB
Format:
Adobe Portable Document Format
Description:
Mini Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: