ResearchSpace

An investigation into spoken audio topic identification using the Fisher Corpus

Show simple item record

dc.contributor.author Kleynhans, N
dc.date.accessioned 2015-03-12T09:39:33Z
dc.date.available 2015-03-12T09:39:33Z
dc.date.issued 2014-11
dc.identifier.citation Kleynhans, N. 2014. An investigation into spoken audio topic identification using the Fisher Corpus. Proceedings of the 2014 PRASA, RobMech and AfLat International Joint Symposium, Cape Town, South Africa, 27-28 November 2014, pp 1-5 en_US
dc.identifier.uri http://hdl.handle.net/10204/7899
dc.description Proceedings of the 2014 PRASA, RobMech and AfLat International Joint Symposium, Cape Town, South Africa, 27-28 November 2014, pp 1-5 en_US
dc.description.abstract There are many services that generate or archive vast amounts of spoken audio data, on a daily basis. It is infeasible for humans to create annotations, by listening to all this data, so automatic methods are needed to process the data and create appropriate transcriptions or assign relevant annotations. There are many important informational aspects associated with the audio data, but one common component is the topic under discussion. Knowing the topic can help process the data in a variety of ways – segment an audio stream by dynamic-topic tracking, cluster similar audio recordings according based on the topic or improve ASR recognition outputs by using appropriate language models. In this work, the best spoken audio topic identification system achieved an error rate of 17.6%, using an ASR system that produced an average word error rate of 57% and supervised latent Dirichlet allocation topic modelling technique. The proposed language model topic modelling technique produced the worst results, highlighting the sensitivity to high ASR word error rates. The support vector machine topic classifier, which made use of a simplified term-weighted feature vector, performed comparably to that of the term frequency inverse document frequency feature vectors. en_US
dc.language.iso en en_US
dc.publisher Pattern Recognition Association of South Africa en_US
dc.relation.ispartofseries Workflow;14040
dc.subject Spoken audio data en_US
dc.subject Latent semantic analysis en_US
dc.subject LSA en_US
dc.subject Fisher Corpus en_US
dc.title An investigation into spoken audio topic identification using the Fisher Corpus en_US
dc.type Conference Presentation en_US
dc.identifier.apacitation Kleynhans, N. (2014). An investigation into spoken audio topic identification using the Fisher Corpus. Pattern Recognition Association of South Africa. http://hdl.handle.net/10204/7899 en_ZA
dc.identifier.chicagocitation Kleynhans, N. "An investigation into spoken audio topic identification using the Fisher Corpus." (2014): http://hdl.handle.net/10204/7899 en_ZA
dc.identifier.vancouvercitation Kleynhans N, An investigation into spoken audio topic identification using the Fisher Corpus; Pattern Recognition Association of South Africa; 2014. http://hdl.handle.net/10204/7899 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Kleynhans, N AB - There are many services that generate or archive vast amounts of spoken audio data, on a daily basis. It is infeasible for humans to create annotations, by listening to all this data, so automatic methods are needed to process the data and create appropriate transcriptions or assign relevant annotations. There are many important informational aspects associated with the audio data, but one common component is the topic under discussion. Knowing the topic can help process the data in a variety of ways – segment an audio stream by dynamic-topic tracking, cluster similar audio recordings according based on the topic or improve ASR recognition outputs by using appropriate language models. In this work, the best spoken audio topic identification system achieved an error rate of 17.6%, using an ASR system that produced an average word error rate of 57% and supervised latent Dirichlet allocation topic modelling technique. The proposed language model topic modelling technique produced the worst results, highlighting the sensitivity to high ASR word error rates. The support vector machine topic classifier, which made use of a simplified term-weighted feature vector, performed comparably to that of the term frequency inverse document frequency feature vectors. DA - 2014-11 DB - ResearchSpace DP - CSIR KW - Spoken audio data KW - Latent semantic analysis KW - LSA KW - Fisher Corpus LK - https://researchspace.csir.co.za PY - 2014 T1 - An investigation into spoken audio topic identification using the Fisher Corpus TI - An investigation into spoken audio topic identification using the Fisher Corpus UR - http://hdl.handle.net/10204/7899 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record