ResearchSpace

Practical approach on implementation of Wordnets for South African languages

dc.contributor.author Sefara, Tshephisho J
dc.contributor.author Mokgonyane, TB
dc.date.accessioned 2021-03-07T17:34:29Z
dc.date.available 2021-03-07T17:34:29Z
dc.date.issued 2021-03
dc.identifier.citation Sefara, T.J. & Mokgonyane, T. 2021. Practical approach on implementation of Wordnets for South African languages. http://hdl.handle.net/10204/11826 . en_ZA
dc.identifier.isbn 978-9-464027-31-0
dc.identifier.uri http://hdl.handle.net/10204/11826
dc.description.abstract This paper proposes the implementation of WordNets for five South African languages, namely Sepedi, Setswana, Tshivenda, isiZulu and isiXhosa, to be added to the Open Multilingual WordNet (OMW) in the Natural Language Toolkit (NLTK). The African WordNets are converted from Princeton WordNet (PWN) 2.0 to 3.0 to match the synsets in PWN 3.0. After conversion, there were 7157, 11972, 1288, 6380, and 9460 lemmas for Sepedi, Setswana, Tshivenda, isiZulu and isiXhosa, respectively. Setswana, isiXhosa and Sepedi each contain more lemmas than 8 languages in OMW, and isiZulu contains more lemmas than 7 languages in OMW. A library has been published for continuous development of African WordNets in OMW using NLTK. en_US
dc.format Fulltext en_US
dc.language.iso en en_US
dc.relation.uri www.globalwordnet.org en_US
dc.relation.uri https://www.globalwordnet.co.za/wp-content/uploads/2021/01/pre-conference-proceedings.pdf en_US
dc.relation.uri /www.globalwordnet.co.za/programme/ en_US
dc.source Proceedings of the 11th Global Wordnet Conference, University of Pretoria, Pretoria, South Africa, 18 - 22 January 2021 en_US
dc.subject WordNet en_US
dc.subject South African languages en_US
dc.subject Natural language toolkit en_US
dc.subject NLTK en_US
dc.title Practical approach on implementation of Wordnets for South African languages en_US
dc.type Conference Presentation en_US
dc.description.pages 6pp en_US
dc.description.note Paper presented at the 11th Global Wordnet Conference, University of Pretoria, Pretoria, South Africa, 18 - 22 January 2021 en_US
dc.description.cluster Next Generation Enterprises & Institutions
dc.description.impactarea Data Science en_US
dc.identifier.apacitation Sefara, T. J., & Mokgonyane, T. (2021). Practical approach on implementation of Wordnets for South African languages. http://hdl.handle.net/10204/11826 en_ZA
dc.identifier.chicagocitation Sefara, Tshephisho J, and TB Mokgonyane. "Practical approach on implementation of Wordnets for South African languages." Proceedings of the 11th Global Wordnet Conference, University of Pretoria, Pretoria, South Africa, 18 - 22 January 2021 (2021): http://hdl.handle.net/10204/11826 en_ZA
dc.identifier.vancouvercitation Sefara TJ, Mokgonyane T, Practical approach on implementation of Wordnets for South African languages; 2021. http://hdl.handle.net/10204/11826 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Sefara, Tshephisho J AU - Mokgonyane, TB AB - This paper proposes the implementation of WordNets for five South African languages, namely Sepedi, Setswana, Tshivenda, isiZulu and isiXhosa, to be added to the Open Multilingual WordNet (OMW) in the Natural Language Toolkit (NLTK). The African WordNets are converted from Princeton WordNet (PWN) 2.0 to 3.0 to match the synsets in PWN 3.0. After conversion, there were 7157, 11972, 1288, 6380, and 9460 lemmas for Sepedi, Setswana, Tshivenda, isiZulu and isiXhosa, respectively. Setswana, isiXhosa and Sepedi each contain more lemmas than 8 languages in OMW, and isiZulu contains more lemmas than 7 languages in OMW. A library has been published for continuous development of African WordNets in OMW using NLTK. DA - 2021-03 DB - ResearchSpace DP - CSIR J1 - Proceedings of the 11th Global Wordnet Conference, University of Pretoria, Pretoria, South Africa, 18 - 22 January 2021 KW - WordNet KW - South African languages KW - Natural language toolkit KW - NLTK LK - https://researchspace.csir.co.za PY - 2021 SM - 978-9-464027-31-0 T1 - Practical approach on implementation of Wordnets for South African languages TI - Practical approach on implementation of Wordnets for South African languages UR - http://hdl.handle.net/10204/11826 ER - en_ZA
dc.identifier.worklist 24195 en_US


