Author: Botha, GR; Barnard, E
Date: Nov 2007
The authors investigate the factors that determine the performance of text-based language identification, with a particular focus on the 11 official languages of South Africa, using n-gram statistics as features for classification. For a fixed ...