Surface realization architecture for low-resourced African languages

dc.contributor.author: Mahlaza, Zola
dc.contributor.author: Keet, C. Maria
dc.contributor.email: z.mahlaza@up.ac.za
dc.date.accessioned: 2023-09-18T06:08:47Z
dc.date.available: 2023-09-18T06:08:47Z
dc.date.issued: 2023-08
dc.description.abstract: There has been growing interest in building surface realization systems to support the automatic generation of text in African languages. Such tools focus on converting abstract representations of meaning to text. Since African languages are low-resourced, economical use of resources and general maintainability are key considerations. However, there is no existing surface realizer architecture that possesses most of the maintainability characteristics (e.g., modularity, reusability, and analyzability) that will lead to maintainable software that can be used for the languages. Moreover, there is no consensus surface realization architecture created for other languages that can be adapted for the languages in question. In this work, we solve this by creating a novel surface realizer architecture suitable for low-resourced African languages that abide by the features of maintainable software. Its design comes after a granular analysis, classification, and comparison of the architectures used by 77 existing NLG systems. We compare our architecture to existing architectures and show that it supports the most features of a maintainable software product.
dc.description.department: Informatics
dc.description.sponsorship: Hasso Plattner Institute for Digital Engineering through the HPI Research School at UCT and the National Research Foundation (NRF) of South Africa
dc.description.uri: https://dl.acm.org/journal/tallip
dc.identifier.citation: Zola Mahlaza and C. Maria Keet. 2023. Surface Realization Architecture for Low-resourced African Languages. ACM Transactions on Asian and Low-Resource Language Information Processing 22, 3, Article 84 (March 2023), 26 pages. https://doi.org/10.1145/3567594.
dc.identifier.issn: 2375-4699 (print)
dc.identifier.issn: 2375-4702 (online)
dc.identifier.other: 10.1145/3567594
dc.identifier.uri: http://hdl.handle.net/2263/92301
dc.language.iso: en
dc.publisher: Association for Computing Machinery
dc.rights: © 2023 Copyright held by the owner/author(s). Publication rights licensed to ACM.
dc.subject: Computing methodologies
dc.subject: Natural language generation
dc.subject: Software and its engineering
dc.subject: Software architectures
dc.subject: African languages
dc.title: Surface realization architecture for low-resourced African languages
dc.type: Article

Files

Original bundle

Name:
Mahlaza_Surface_2023.pdf
Size:
935.77 KB
Format:
Adobe Portable Document Format
Description:
Main article

License bundle

Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: