ResearchSpace

The effects of data size on text-independent automatic speaker identification system

Show simple item record

dc.contributor.author Mokgonyane, TB
dc.contributor.author Sefara, Tshephisho J
dc.contributor.author Manamela, MJ
dc.contributor.author Modipa, TI
dc.date.accessioned 2019-12-18T10:35:45Z
dc.date.available 2019-12-18T10:35:45Z
dc.date.issued 2019-08
dc.identifier.citation Mokgonyane, T.B., Sefara, T.J., Manamela, M.J. & Modipa, T.I. 2019. The effects of data size on text-independent automatic speaker identification system. In: International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD), Winterton, South Africa, 5-6 August 2019 en_US
dc.identifier.isbn 978-1-5386-9236-3
dc.identifier.isbn 978-1-5386-9237-0
dc.identifier.issn DOI: 10.1109/ICABCD.2019.8851018
dc.identifier.uri https://ieeexplore.ieee.org/document/8851018
dc.identifier.uri http://hdl.handle.net/10204/11274
dc.description Presented in: International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD), Winterton, South Africa, 5-6 August 2019. This is the accepted version of the published item. en_US
dc.description.abstract Speaker recognition is a technique that automatically identifies a speaker from a recording of their speech utterance. Speaker recognition technologies are taking a new direction due to rapid progress in artificial intelligence. Research in the field of speaker recognition has shown fruitful results. There is, however, not much work done for African indigenous languages that have limited speech data resources. This paper presents how data size impacts the accuracy of an automatic speaker recognition system models, focusing on the Sepedi language as it is one of the South African under-resourced language. The speech data used is acquired from the South African Centre for Digital Language Resources. Four machine learning models, namely, Support Vector Machines (SVM), K-Nearest Neighbors (KNN), Multilayer Perceptrons (MLP) and Logistic Regression (LR) are trained under four data setting environment. LR performed better than other models with the highest accuracy of 91% while SVM obtained the highest increase of 4% in accuracy as data size increases. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.relation.ispartofseries Workflow;22849
dc.subject K-nearest neighbors en_US
dc.subject Logistic regression en_US
dc.subject Multilayer-perceptron en_US
dc.subject Speaker recognition en_US
dc.subject Support vector machine en_US
dc.subject Text-independent en_US
dc.title The effects of data size on text-independent automatic speaker identification system en_US
dc.type Conference Presentation en_US
dc.identifier.apacitation Mokgonyane, T., Sefara, T. J., Manamela, M., & Modipa, T. (2019). The effects of data size on text-independent automatic speaker identification system. IEEE. http://hdl.handle.net/10204/11274 en_ZA
dc.identifier.chicagocitation Mokgonyane, TB, Tshephisho J Sefara, MJ Manamela, and TI Modipa. "The effects of data size on text-independent automatic speaker identification system." (2019): http://hdl.handle.net/10204/11274 en_ZA
dc.identifier.vancouvercitation Mokgonyane T, Sefara TJ, Manamela M, Modipa T, The effects of data size on text-independent automatic speaker identification system; IEEE; 2019. http://hdl.handle.net/10204/11274 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Mokgonyane, TB AU - Sefara, Tshephisho J AU - Manamela, MJ AU - Modipa, TI AB - Speaker recognition is a technique that automatically identifies a speaker from a recording of their speech utterance. Speaker recognition technologies are taking a new direction due to rapid progress in artificial intelligence. Research in the field of speaker recognition has shown fruitful results. There is, however, not much work done for African indigenous languages that have limited speech data resources. This paper presents how data size impacts the accuracy of an automatic speaker recognition system models, focusing on the Sepedi language as it is one of the South African under-resourced language. The speech data used is acquired from the South African Centre for Digital Language Resources. Four machine learning models, namely, Support Vector Machines (SVM), K-Nearest Neighbors (KNN), Multilayer Perceptrons (MLP) and Logistic Regression (LR) are trained under four data setting environment. LR performed better than other models with the highest accuracy of 91% while SVM obtained the highest increase of 4% in accuracy as data size increases. DA - 2019-08 DB - ResearchSpace DP - CSIR KW - K-nearest neighbors KW - Logistic regression KW - Multilayer-perceptron KW - Speaker recognition KW - Support vector machine KW - Text-independent LK - https://researchspace.csir.co.za PY - 2019 SM - 978-1-5386-9236-3 SM - 978-1-5386-9237-0 SM - DOI: 10.1109/ICABCD.2019.8851018 T1 - The effects of data size on text-independent automatic speaker identification system TI - The effects of data size on text-independent automatic speaker identification system UR - http://hdl.handle.net/10204/11274 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record