dc.contributor.author |
Sefara, Tshephisho J
|
|
dc.contributor.author |
Mokgonyane, TB
|
|
dc.date.accessioned |
2021-03-07T17:34:29Z |
|
dc.date.available |
2021-03-07T17:34:29Z |
|
dc.date.issued |
2021-03 |
|
dc.identifier.citation |
Sefara, T.J. & Mokgonyane, T. 2021. Practical approach on implementation of Wordnets for South African languages. http://hdl.handle.net/10204/11826 . |
en_ZA |
dc.identifier.isbn |
978-9-464027-31-0 |
|
dc.identifier.uri |
http://hdl.handle.net/10204/11826
|
|
dc.description.abstract |
This paper proposes the implementation of WordNets for five South African languages, namely Sepedi, Setswana, Tshivenda, isiZulu and isiXhosa, to be added to the Open Multilingual WordNet (OMW) in the Natural Language Toolkit (NLTK). The African WordNets are converted from Princeton WordNet (PWN) 2.0 to 3.0 to match the synsets in PWN 3.0. After conversion, there were 7157, 11972, 1288, 6380, and 9460 lemmas for Sepedi, Setswana, Tshivenda, isiZulu and isiXhosa, respectively. Setswana, isiXhosa and Sepedi contain more lemmas than 8 languages in OMW, and isiZulu contains more lemmas than 7 languages in OMW. A library has been published for continuous development of African WordNets in OMW using NLTK. |
en_US |
dc.format |
Fulltext |
en_US |
dc.language.iso |
en |
en_US |
dc.relation.uri |
www.globalwordnet.org |
en_US |
dc.relation.uri |
https://www.globalwordnet.co.za/wp-content/uploads/2021/01/pre-conference-proceedings.pdf |
en_US |
dc.relation.uri |
www.globalwordnet.co.za/programme/ |
en_US |
dc.source |
Proceedings of the 11th Global Wordnet Conference, University of Pretoria, Pretoria, South Africa, 18 - 22 January 2021 |
en_US |
dc.subject |
WordNet |
en_US |
dc.subject |
South African languages |
en_US |
dc.subject |
Natural language toolkit |
en_US |
dc.subject |
NLTK |
en_US |
dc.title |
Practical approach on implementation of Wordnets for South African languages |
en_US |
dc.type |
Conference Presentation |
en_US |
dc.description.pages |
6pp |
en_US |
dc.description.note |
Paper presented at the 11th Global Wordnet Conference, University of Pretoria, Pretoria, South Africa, 18 - 22 January 2021 |
en_US |
dc.description.cluster |
Next Generation Enterprises & Institutions |
|
dc.description.impactarea |
Data Science |
en_US |
dc.identifier.apacitation |
Sefara, T. J., & Mokgonyane, T. (2021). Practical approach on implementation of Wordnets for South African languages. http://hdl.handle.net/10204/11826 |
en_ZA |
dc.identifier.chicagocitation |
Sefara, Tshephisho J, and TB Mokgonyane. "Practical approach on implementation of Wordnets for South African languages." Proceedings of the 11th Global Wordnet Conference, University of Pretoria, Pretoria, South Africa, 18 - 22 January 2021 (2021): http://hdl.handle.net/10204/11826 |
en_ZA |
dc.identifier.vancouvercitation |
Sefara TJ, Mokgonyane T. Practical approach on implementation of Wordnets for South African languages; 2021. http://hdl.handle.net/10204/11826 . |
en_ZA |
dc.identifier.ris |
TY - Conference Presentation
AU - Sefara, Tshephisho J
AU - Mokgonyane, TB
AB  - This paper proposes the implementation of WordNets for five South African languages, namely Sepedi, Setswana, Tshivenda, isiZulu and isiXhosa, to be added to the Open Multilingual WordNet (OMW) in the Natural Language Toolkit (NLTK). The African WordNets are converted from Princeton WordNet (PWN) 2.0 to 3.0 to match the synsets in PWN 3.0. After conversion, there were 7157, 11972, 1288, 6380, and 9460 lemmas for Sepedi, Setswana, Tshivenda, isiZulu and isiXhosa, respectively. Setswana, isiXhosa and Sepedi contain more lemmas than 8 languages in OMW, and isiZulu contains more lemmas than 7 languages in OMW. A library has been published for continuous development of African WordNets in OMW using NLTK.
DA - 2021-03
DB - ResearchSpace
DP - CSIR
J1 - Proceedings of the 11th Global Wordnet Conference, University of Pretoria, Pretoria, South Africa, 18 - 22 January 2021
KW - WordNet
KW - South African languages
KW - Natural language toolkit
KW - NLTK
LK - https://researchspace.csir.co.za
PY - 2021
SM - 978-9-464027-31-0
T1 - Practical approach on implementation of Wordnets for South African languages
TI - Practical approach on implementation of Wordnets for South African languages
UR - http://hdl.handle.net/10204/11826
ER - |
en_ZA |
dc.identifier.worklist |
24195 |
en_US |