Constructing an XML database of linguistics data

Loading...
Thumbnail Image

Authors

Kroeze, J.H. (Jan Hendrik)
Bothma, T.J.D. (Theodorus Jan Daniel)
Matthee, Machdel C.

Journal Title

Journal ISSN

Volume Title

Publisher

Vaal Triangle Faculty of Northwest University in South Africa

Abstract

A language-oriented, multi-dimensional database of the linguistic characteristics of the Hebrew text of the Old Testament can enable researchers to do ad hoc queries. XML is a suitable technology to transform free text into a database. A clause’s word order can be kept intact while other features such as syntactic and semantic functions can be marked as elements or attributes. The elements or attributes from the XML “database” can be accessed and processed by a 4th generation programming language, such as Visual Basic. XML is explored as an option to build an exploitable database of linguistic data by representing inherently multi-dimensional data, including syntactic and semantic analyses of free text.

Description

Keywords

XML, Database, Morphology, Morpho-syntax, Syntax, Semantics, Hebrew

Sustainable Development Goals

Citation

Kroeze, JH, Bothma, TJD & Matthee, MC 2010, 'Constructing an XML database of linguistics data', Journal for Transdisciplinary Research in Southern Africa, vol. 6, no. 1, pp. 139 – 174. [http://www.td-sa.net/]