Analysis of Deep Learning-Based Decision-Making in an Emotional Spontaneous Speech Task
dc.contributor.author | De Velasco Vázquez, Mikel | |
dc.contributor.author | Justo Blanco, Raquel | |
dc.contributor.author | López Zorrilla, Asier | |
dc.contributor.author | Torres Barañano, María Inés | |
dc.date.accessioned | 2023-01-23T16:01:36Z | |
dc.date.available | 2023-01-23T16:01:36Z | |
dc.date.issued | 2023-01-11 | |
dc.identifier.citation | Applied Sciences 13(2) : (2023) // Article ID 980 | es_ES |
dc.identifier.issn | 2076-3417 | |
dc.identifier.uri | http://hdl.handle.net/10810/59420 | |
dc.description.abstract | In this work, we present an approach to understand the computational methods and decision-making involved in the identification of emotions in spontaneous speech. The selected task consists of Spanish TV debates, which entail a high level of complexity as well as additional subjectivity in the human perception-based annotation procedure. A simple convolutional neural model is proposed, and its behaviour is analysed to explain its decision-making. The proposed model slightly outperforms commonly used CNN architectures such as VGG16, while being much lighter. Internal layer-by-layer transformations of the input spectrogram are visualised and analysed. Finally, a class model visualisation is proposed as a simple interpretation approach whose usefulness is assessed in the work. | es_ES |
dc.description.sponsorship | The research presented in this paper was conducted as part of the AMIC, AMIC-PdC, BEWORD and MENHIR projects, which received funding from the Spanish Minister of Science under grants TIN2017-85854-C4-3-R, PDC2021-120846-C43 and PID2021-126061OB-C42 and from the European Union’s H2020-MSCA-RISE-2018 Research and Innovation Staff Exchange, under Grant No. 823907. The first author also received a PhD scholarship from the University of the Basque Country UPV/EHU, PIF17/310. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | MDPI | es_ES |
dc.relation | info:eu-repo/grantAgreement/EC/H2020/823907 | es_ES |
dc.relation | info:eu-repo/grantAgreement/MINECO/TIN2017-85854-C4-3-R | es_ES |
dc.relation | info:eu-repo/grantAgreement/MICINN/PDC2021-120846-C43 | es_ES |
dc.relation | info:eu-repo/grantAgreement/MICINN/PID2021-126061OB-C42 | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | emotion detection | es_ES |
dc.subject | speech processing | es_ES |
dc.subject | explainable artificial intelligence | es_ES |
dc.subject | machine learning | es_ES |
dc.title | Analysis of Deep Learning-Based Decision-Making in an Emotional Spontaneous Speech Task | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.date.updated | 2023-01-20T14:23:13Z | |
dc.rights.holder | © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/ 4.0/). | es_ES |
dc.relation.publisherversion | https://www.mdpi.com/2076-3417/13/2/980 | es_ES |
dc.identifier.doi | 10.3390/app13020980 | |
dc.contributor.funder | European Commission | |
dc.departamentoes | Electricidad y electrónica | |
dc.departamentoes | Lenguajes y sistemas informáticos | |
dc.departamentoeu | Elektrizitatea eta elektronika | |
dc.departamentoeu | Lengoaia eta Sistema Informatikoak |
Files in this item
This item appears in the following Collection(s)
Except where otherwise noted, this item's license is described as © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/ 4.0/).