ResearchSpace

Statistical translation with scarce resources: a South African case study

Show simple item record

dc.contributor.author Ronald, K
dc.contributor.author Barnard, E
dc.date.accessioned 2007-07-03T10:18:56Z
dc.date.available 2007-07-03T10:18:56Z
dc.date.issued 2006-11
dc.identifier.citation Ronald, K and Barnard, E. 2006. Statistical translation with scarce resources: a South African case study. 17th Annual Symposium of the Pattern Recognition Association of South Africa, Parys, South Africa, 29 Nov - 1 Dec 2006, pp 5 en
dc.identifier.uri http://hdl.handle.net/10204/880
dc.description www.meraka.org.za/pubs/KRonald.pdf en
dc.description This paper is published in the SAIEE Africa Research Journal, Vol 98(4), pp 136-140
dc.description.abstract Statistical machine translation techniques offer great promise for the development of automatic translation systems. However, the realization of this potential requires the availability of significant amounts of parallel bilingual texts. This paper reports on an attempt to reduce the amount of text that is required to obtain an acceptable translation system, through the use of active and semi-supervised learning. Systems were built using resources collected from South African government websites and the results evaluated using a standard automatic evaluation metric (BLEU). The authors show that significant improvements in translation quality can be achieved with very limited parallel corpora, and that both active learning and semi-supervised learning are useful in this context. en
dc.language.iso en en
dc.subject Language resources en
dc.subject Phrase-based translation en
dc.title Statistical translation with scarce resources: a South African case study en
dc.type Conference Presentation en
dc.identifier.apacitation Ronald, K., & Barnard, E. (2006). Statistical translation with scarce resources: a South African case study. http://hdl.handle.net/10204/880 en_ZA
dc.identifier.chicagocitation Ronald, K, and E Barnard. "Statistical translation with scarce resources: a South African case study." (2006): http://hdl.handle.net/10204/880 en_ZA
dc.identifier.vancouvercitation Ronald K, Barnard E, Statistical translation with scarce resources: a South African case study; 2006. http://hdl.handle.net/10204/880 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Ronald, K AU - Barnard, E AB - Statistical machine translation techniques offer great promise for the development of automatic translation systems. However, the realization of this potential requires the availability of significant amounts of parallel bilingual texts. This paper reports on an attempt to reduce the amount of text that is required to obtain an acceptable translation system, through the use of active and semi-supervised learning. Systems were built using resources collected from South African government websites and the results evaluated using a standard automatic evaluation metric (BLEU). The authors show that significant improvements in translation quality can be achieved with very limited parallel corpora, and that both active learning and semi-supervised learning are useful in this context. DA - 2006-11 DB - ResearchSpace DP - CSIR KW - Language resources KW - Phrase-based translation LK - https://researchspace.csir.co.za PY - 2006 T1 - Statistical translation with scarce resources: a South African case study TI - Statistical translation with scarce resources: a South African case study UR - http://hdl.handle.net/10204/880 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record