An Overview of the IberSpeech-RTVE 2022 Challenges on Speech Technologies
dc.contributor.author | Lleida, Eduardo | |
dc.contributor.author | Rodríguez Fuentes, Luis Javier ![]() | |
dc.contributor.author | Tejedor, Javier | |
dc.contributor.author | Ortega, Alfonso | |
dc.contributor.author | Miguel, Antonio | |
dc.contributor.author | Bazán, Virginia | |
dc.contributor.author | Pérez, Carmen | |
dc.contributor.author | de Prada, Alberto | |
dc.contributor.author | Peñagarikano Badiola, Mikel ![]() | |
dc.contributor.author | Varona Fernández, Amparo | |
dc.contributor.author | Bordel García, German | |
dc.contributor.author | Torre-Toledano, Doroteo | |
dc.contributor.author | Álvarez, Aitor | |
dc.contributor.author | Arzelus, Haritz | |
dc.date.accessioned | 2023-09-26T13:49:33Z | |
dc.date.available | 2023-09-26T13:49:33Z | |
dc.date.issued | 2023-07-25 | |
dc.identifier.citation | Applied Sciences 13(15) : (2023) // Article ID 8577 | es_ES |
dc.identifier.issn | 2076-3417 | |
dc.identifier.uri | http://hdl.handle.net/10810/62679 | |
dc.description.abstract | Evaluation campaigns provide a common framework with which the progress of speech technologies can be effectively measured. The aim of this paper is to present a detailed overview of the IberSpeech-RTVE 2022 Challenges, which were organized as part of the IberSpeech 2022 conference under the ongoing series of Albayzin evaluation campaigns. In the 2022 edition, four challenges were launched: (1) speech-to-text transcription; (2) speaker diarization and identity assignment; (3) text and speech alignment; and (4) search on speech. Different databases that cover different domains (e.g., broadcast news, conference talks, parliament sessions) were released for those challenges. The submitted systems also cover a wide range of speech processing methods, which include hidden Markov model-based approaches, end-to-end neural network-based methods, hybrid approaches, etc. This paper describes the databases, the tasks and the performance metrics used in the four challenges. It also provides the most relevant features of the submitted systems and briefly presents and discusses the obtained results. Despite employing state-of-the-art technology, the relatively poor performance attained in some of the challenges reveals that there is still room for improvement. This encourages us to carry on with the Albayzin evaluation campaigns in the coming years. | es_ES |
dc.description.sponsorship | This work was partially supported by Radio Televisión Española through the RTVE Chair at the University of Zaragoza, and Red Temática en Tecnologías del Habla (RED2022-134270-T), funded by AEI (Ministerio de Ciencia e Innovación); It was also partially funded by the European Union’s Horizon 2020 research and innovation program under Marie Skłodowska-Curie Grant 101007666; in part by MCIN/AEI/10.13039/501100011033 and by the European Union “NextGenerationEU”/ PRTR under Grants PDC2021-120846C41 PID2021-126061OB-C44, and in part by the Government of Aragon (Grant Group T3623R); it was also partially funded by the Spanish Ministry of Science and Innovation (OPEN-SPEECH project, PID2019-106424RB-I00) and by the Basque Government under the general support program to research groups (IT-1704-22), and by projects RTI2018-098091-B-I00 and PID2021-125943OB-I00 (Spanish Ministry of Science and Innovation and ERDF) as well. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | MDPI | es_ES |
dc.relation | info:eu-repo/grantAgreement/MICINN/PID2019-106424RB-I00 | es_ES |
dc.relation | info:eu-repo/grantAgreement/EC/H2020/101007666 | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | IberSpeech Challenge | es_ES |
dc.subject | RTVE2022 database | es_ES |
dc.subject | Albayzin evaluations | es_ES |
dc.subject | speech-to-text transcription | es_ES |
dc.subject | speaker diarization and identity assignment | es_ES |
dc.subject | text and speech alignment | es_ES |
dc.subject | search on speech | es_ES |
dc.title | An Overview of the IberSpeech-RTVE 2022 Challenges on Speech Technologies | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.date.updated | 2023-08-11T14:33:35Z | |
dc.rights.holder | © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). | es_ES |
dc.relation.publisherversion | https://www.mdpi.com/2076-3417/13/15/8577 | es_ES |
dc.identifier.doi | 10.3390/app13158577 | |
dc.contributor.funder | European Commission | |
dc.departamentoes | Electricidad y electrónica | |
dc.departamentoeu | Elektrizitatea eta elektronika |
Files in this item
This item appears in the following Collection(s)
Except where otherwise noted, this item's license is described as © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).