Show simple item record

dc.contributor.authorDe Velasco Vázquez, Mikel ORCID
dc.contributor.authorJusto Blanco, Raquel ORCID
dc.contributor.authorLópez Zorrilla, Asier ORCID
dc.contributor.authorTorres Barañano, María Inés ORCID
dc.date.accessioned2023-01-23T16:01:36Z
dc.date.available2023-01-23T16:01:36Z
dc.date.issued2023-01-11
dc.identifier.citationApplied Sciences 13(2) : (2023) // Article ID 980es_ES
dc.identifier.issn2076-3417
dc.identifier.urihttp://hdl.handle.net/10810/59420
dc.description.abstractIn this work, we present an approach to understand the computational methods and decision-making involved in the identification of emotions in spontaneous speech. The selected task consists of Spanish TV debates, which entail a high level of complexity as well as additional subjectivity in the human perception-based annotation procedure. A simple convolutional neural model is proposed, and its behaviour is analysed to explain its decision-making. The proposed model slightly outperforms commonly used CNN architectures such as VGG16, while being much lighter. Internal layer-by-layer transformations of the input spectrogram are visualised and analysed. Finally, a class model visualisation is proposed as a simple interpretation approach whose usefulness is assessed in the work.es_ES
dc.description.sponsorshipThe research presented in this paper was conducted as part of the AMIC, AMIC-PdC, BEWORD and MENHIR projects, which received funding from the Spanish Minister of Science under grants TIN2017-85854-C4-3-R, PDC2021-120846-C43 and PID2021-126061OB-C42 and from the European Union’s H2020-MSCA-RISE-2018 Research and Innovation Staff Exchange, under Grant No. 823907. The first author also received a PhD scholarship from the University of the Basque Country UPV/EHU, PIF17/310.es_ES
dc.language.isoenges_ES
dc.publisherMDPIes_ES
dc.relationinfo:eu-repo/grantAgreement/EC/H2020/823907es_ES
dc.relationinfo:eu-repo/grantAgreement/MINECO/TIN2017-85854-C4-3-Res_ES
dc.relationinfo:eu-repo/grantAgreement/MICINN/PDC2021-120846-C43es_ES
dc.relationinfo:eu-repo/grantAgreement/MICINN/PID2021-126061OB-C42es_ES
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectemotion detectiones_ES
dc.subjectspeech processinges_ES
dc.subjectexplainable artificial intelligencees_ES
dc.subjectmachine learninges_ES
dc.titleAnalysis of Deep Learning-Based Decision-Making in an Emotional Spontaneous Speech Taskes_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.date.updated2023-01-20T14:23:13Z
dc.rights.holder© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/ 4.0/).es_ES
dc.relation.publisherversionhttps://www.mdpi.com/2076-3417/13/2/980es_ES
dc.identifier.doi10.3390/app13020980
dc.contributor.funderEuropean Commission
dc.departamentoesElectricidad y electrónica
dc.departamentoesLenguajes y sistemas informáticos
dc.departamentoeuElektrizitatea eta elektronika
dc.departamentoeuLengoaia eta Sistema Informatikoak


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/ 4.0/).
Except where otherwise noted, this item's license is described as © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/ 4.0/).