Botha, Gerrit Reinier
(University of Pretoria, 2008-09-09)
We investigate the factors that determine the performance of text-based language identification, with a particular focus on the 11 official languages of South Africa. Our study uses n-gram statistics as features for ...