dc.contributor.author |
Mokgonyane, TB
|
|
dc.contributor.author |
Sefara, Tshephisho J
|
|
dc.contributor.author |
Manamela, MJ
|
|
dc.contributor.author |
Modipa, TI
|
|
dc.date.accessioned |
2019-12-18T10:35:45Z |
|
dc.date.available |
2019-12-18T10:35:45Z |
|
dc.date.issued |
2019-08 |
|
dc.identifier.citation |
Mokgonyane, T.B., Sefara, T.J., Manamela, M.J. & Modipa, T.I. 2019. The effects of data size on text-independent automatic speaker identification system. In: International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD), Winterton, South Africa, 5-6 August 2019 |
en_US |
dc.identifier.isbn |
978-1-5386-9236-3 |
|
dc.identifier.isbn |
978-1-5386-9237-0 |
|
dc.identifier.issn |
DOI: 10.1109/ICABCD.2019.8851018 |
|
dc.identifier.uri |
https://ieeexplore.ieee.org/document/8851018
|
|
dc.identifier.uri |
http://hdl.handle.net/10204/11274
|
|
dc.description |
Presented in: International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD), Winterton, South Africa, 5-6 August 2019. This is the accepted version of the published item. |
en_US |
dc.description.abstract |
Speaker recognition is a technique that automatically identifies a speaker from a recording of their speech utterance. Speaker recognition technologies are taking a new direction due to rapid progress in artificial intelligence. Research in the field of speaker recognition has shown fruitful results. There is, however, not much work done for African indigenous languages that have limited speech data resources. This paper presents how data size impacts the accuracy of an automatic speaker recognition system models, focusing on the Sepedi language as it is one of the South African under-resourced language. The speech data used is acquired from the South African Centre for Digital Language Resources. Four machine learning models, namely, Support Vector Machines (SVM), K-Nearest Neighbors (KNN), Multilayer Perceptrons (MLP) and Logistic Regression (LR) are trained under four data setting environment. LR performed better than other models with the highest accuracy of 91% while SVM obtained the highest increase of 4% in accuracy as data size increases. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
IEEE |
en_US |
dc.relation.ispartofseries |
Workflow;22849 |
|
dc.subject |
K-nearest neighbors |
en_US |
dc.subject |
Logistic regression |
en_US |
dc.subject |
Multilayer-perceptron |
en_US |
dc.subject |
Speaker recognition |
en_US |
dc.subject |
Support vector machine |
en_US |
dc.subject |
Text-independent |
en_US |
dc.title |
The effects of data size on text-independent automatic speaker identification system |
en_US |
dc.type |
Conference Presentation |
en_US |
dc.identifier.apacitation |
Mokgonyane, T., Sefara, T. J., Manamela, M., & Modipa, T. (2019). The effects of data size on text-independent automatic speaker identification system. IEEE. http://hdl.handle.net/10204/11274 |
en_ZA |
dc.identifier.chicagocitation |
Mokgonyane, TB, Tshephisho J Sefara, MJ Manamela, and TI Modipa. "The effects of data size on text-independent automatic speaker identification system." (2019): http://hdl.handle.net/10204/11274 |
en_ZA |
dc.identifier.vancouvercitation |
Mokgonyane T, Sefara TJ, Manamela M, Modipa T, The effects of data size on text-independent automatic speaker identification system; IEEE; 2019. http://hdl.handle.net/10204/11274 . |
en_ZA |
dc.identifier.ris |
TY - Conference Presentation
AU - Mokgonyane, TB
AU - Sefara, Tshephisho J
AU - Manamela, MJ
AU - Modipa, TI
AB - Speaker recognition is a technique that automatically identifies a speaker from a recording of their speech utterance. Speaker recognition technologies are taking a new direction due to rapid progress in artificial intelligence. Research in the field of speaker recognition has shown fruitful results. There is, however, not much work done for African indigenous languages that have limited speech data resources. This paper presents how data size impacts the accuracy of an automatic speaker recognition system models, focusing on the Sepedi language as it is one of the South African under-resourced language. The speech data used is acquired from the South African Centre for Digital Language Resources. Four machine learning models, namely, Support Vector Machines (SVM), K-Nearest Neighbors (KNN), Multilayer Perceptrons (MLP) and Logistic Regression (LR) are trained under four data setting environment. LR performed better than other models with the highest accuracy of 91% while SVM obtained the highest increase of 4% in accuracy as data size increases.
DA - 2019-08
DB - ResearchSpace
DP - CSIR
KW - K-nearest neighbors
KW - Logistic regression
KW - Multilayer-perceptron
KW - Speaker recognition
KW - Support vector machine
KW - Text-independent
LK - https://researchspace.csir.co.za
PY - 2019
SM - 978-1-5386-9236-3
SM - 978-1-5386-9237-0
SM - DOI: 10.1109/ICABCD.2019.8851018
T1 - The effects of data size on text-independent automatic speaker identification system
TI - The effects of data size on text-independent automatic speaker identification system
UR - http://hdl.handle.net/10204/11274
ER -
|
en_ZA |