dc.contributor.author |
Kleynhans, N
|
|
dc.date.accessioned |
2015-03-12T09:39:33Z |
|
dc.date.available |
2015-03-12T09:39:33Z |
|
dc.date.issued |
2014-11 |
|
dc.identifier.citation |
Kleynhans, N. 2014. An investigation into spoken audio topic identification using the Fisher Corpus. Proceedings of the 2014 PRASA, RobMech and AfLat International Joint Symposium, Cape Town, South Africa, 27-28 November 2014, pp 1-5 |
en_US |
dc.identifier.uri |
http://hdl.handle.net/10204/7899
|
|
dc.description |
Proceedings of the 2014 PRASA, RobMech and AfLat International Joint Symposium, Cape Town, South Africa, 27-28 November 2014, pp 1-5 |
en_US |
dc.description.abstract |
There are many services that generate or archive vast amounts of spoken audio data, on a daily basis. It is infeasible for humans to create annotations, by listening to all this data, so automatic methods are needed to process the data and create appropriate transcriptions or assign relevant annotations. There are many important informational aspects associated with the audio data, but one common component is the topic under discussion. Knowing the topic can help process the data in a variety of ways – segment an audio stream by dynamic-topic tracking, cluster similar audio recordings according based on the topic or improve ASR recognition outputs by using appropriate language models. In this work, the best spoken audio topic identification system achieved an error rate of 17.6%, using an ASR system that produced an average word error rate of 57% and supervised latent Dirichlet allocation topic modelling technique. The proposed language model topic modelling technique produced the worst results, highlighting the sensitivity to high ASR word error rates. The support vector machine topic classifier, which made use of a simplified term-weighted feature vector, performed comparably to that of the term frequency inverse document frequency feature vectors. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
Pattern Recognition Association of South Africa |
en_US |
dc.relation.ispartofseries |
Workflow;14040 |
|
dc.subject |
Spoken audio data |
en_US |
dc.subject |
Latent semantic analysis |
en_US |
dc.subject |
LSA |
en_US |
dc.subject |
Fisher Corpus |
en_US |
dc.title |
An investigation into spoken audio topic identification using the Fisher Corpus |
en_US |
dc.type |
Conference Presentation |
en_US |
dc.identifier.apacitation |
Kleynhans, N. (2014). An investigation into spoken audio topic identification using the Fisher Corpus. Pattern Recognition Association of South Africa. http://hdl.handle.net/10204/7899 |
en_ZA |
dc.identifier.chicagocitation |
Kleynhans, N. "An investigation into spoken audio topic identification using the Fisher Corpus." (2014): http://hdl.handle.net/10204/7899 |
en_ZA |
dc.identifier.vancouvercitation |
Kleynhans N, An investigation into spoken audio topic identification using the Fisher Corpus; Pattern Recognition Association of South Africa; 2014. http://hdl.handle.net/10204/7899 . |
en_ZA |
dc.identifier.ris |
TY - Conference Presentation
AU - Kleynhans, N
AB - There are many services that generate or archive vast amounts of spoken audio data, on a daily basis. It is infeasible for humans to create annotations, by listening to all this data, so automatic methods are needed to process the data and create appropriate transcriptions or assign relevant annotations. There are many important informational aspects associated with the audio data, but one common component is the topic under discussion. Knowing the topic can help process the data in a variety of ways – segment an audio stream by dynamic-topic tracking, cluster similar audio recordings according based on the topic or improve ASR recognition outputs by using appropriate language models. In this work, the best spoken audio topic identification system achieved an error rate of 17.6%, using an ASR system that produced an average word error rate of 57% and supervised latent Dirichlet allocation topic modelling technique. The proposed language model topic modelling technique produced the worst results, highlighting the sensitivity to high ASR word error rates. The support vector machine topic classifier, which made use of a simplified term-weighted feature vector, performed comparably to that of the term frequency inverse document frequency feature vectors.
DA - 2014-11
DB - ResearchSpace
DP - CSIR
KW - Spoken audio data
KW - Latent semantic analysis
KW - LSA
KW - Fisher Corpus
LK - https://researchspace.csir.co.za
PY - 2014
T1 - An investigation into spoken audio topic identification using the Fisher Corpus
TI - An investigation into spoken audio topic identification using the Fisher Corpus
UR - http://hdl.handle.net/10204/7899
ER -
|
en_ZA |