dc.contributor.author | Odriozola Sustaeta, Igor | |
dc.contributor.author | Hernáez Rioja, Inmaculada | |
dc.contributor.author | Navas Cordón, Eva | |
dc.contributor.author | Serrano García, Luis | |
dc.contributor.author | Sánchez de la Fuente, Jon | |
dc.date.accessioned | 2019-05-15T15:14:03Z | |
dc.date.available | 2019-05-15T15:14:03Z | |
dc.date.issued | 2018-11-23 | |
dc.identifier.citation | IberSPEECH 2018 21-23 November 2018, Barcelona, Spain : 50-54 (2018) | es_ES |
dc.identifier.uri | http://hdl.handle.net/10810/32816 | |
dc.description.abstract | This paper shows a research on the behaviour of the observa-tion likelihoods generated by the central state of asilenceHMM(Hidden Markov Model) trained for Automatic Speech Recog-nition (ASR) using cepstral mean and variance normalization(CMVN). We have seen that observation likelihood shows astable behaviour under different recording conditions, and thischaracteristic can be used to discriminate betweenspeechandsilenceframes. We present several experiments which provethat the mere use of a decision threshold produces robust re-sults for very different recording channels and noise conditions.The results have also been compared with those obtained by twostandard VAD systems, showing promising prospects. All in all,observation likelihood scores could be useful as the basis for thedevelopment of future VAD systems, with further research andanalysis to refine the results. | es_ES |
dc.description.sponsorship | This work has been partially supported by the EU(FEDER) under grant TEC2015-67163-C2-1-R (RESTORE)(MINECO/FEDER, UE) and by the Basque Government undergrant KK-2017/00043 (BerbaOla) | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | International Speech Communication Association | es_ES |
dc.relation | info:eu-repo/grantAgreement/MINECO/TEC2015-67163-C2-1-R | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.subject | observation likelihood | es_ES |
dc.subject | cepstral normal-ization | es_ES |
dc.subject | VAD | es_ES |
dc.title | The observation likelihood of silence: analysis and prospects for VAD applications | es_ES |
dc.type | info:eu-repo/semantics/conferenceObject | es_ES |
dc.rights.holder | (c) 2018 ISCA | es_ES |
dc.relation.publisherversion | https://www.isca-speech.org/archive/IberSPEECH_2018/abstracts/IberS18_P1-7_Odriozola.html | es_ES |
dc.identifier.doi | 10.21437/IberSPEECH.2018-11 | |
dc.departamentoes | Ingeniería de comunicaciones | es_ES |
dc.departamentoeu | Komunikazioen ingeniaritza | es_ES |