Automatic Classification of Synthetic Voices for Voice Banking Using Objective Measures

Alonso, Agustin; García Romillo, Víctor; Hernáez Rioja, Inmaculada; Navas Cordón, Eva; Sánchez de la Fuente, Jon

dc.contributor.author	Alonso, Agustin
dc.contributor.author	García Romillo, Víctor
dc.contributor.author	Hernáez Rioja, Inmaculada
dc.contributor.author	Navas Cordón, Eva
dc.contributor.author	Sánchez de la Fuente, Jon
dc.date.accessioned	2022-03-14T09:11:34Z
dc.date.available	2022-03-14T09:11:34Z
dc.date.issued	2022-02-27
dc.identifier.citation	Applied Sciences 12(5) : (2022) // Article ID 2473	es_ES
dc.identifier.issn	2076-3417
dc.identifier.uri	http://hdl.handle.net/10810/55917
dc.description.abstract	Speech is the most common way of communication among humans. People who cannot communicate through speech due to partial or total loss of the voice can benefit from Alternative and Augmentative Communication devices and Text to Speech technology. One problem of using these technologies is that the included synthetic voices might be impersonal and badly adapted to the user in terms of age, accent or even gender. In this context, the use of synthetic voices from voice banking systems is an attractive alternative. New voices can be obtained applying adaptation techniques using recordings from people with healthy voice (donors) or from the user himself/herself before losing his/her own voice. In this way, the goal is to offer a wide voice catalog to potential users. However, as there is no control over the recording or the adaptation processes, some method to control the final quality of the voice is needed. We present the work developed to automatically select the best synthetic voices using a set of objective measures and a subjective Mean Opinion Score (MOS) evaluation. A prediction algorithm of the MOS has been built which correlates similarly to the most correlated individual measure.	es_ES
dc.description.sponsorship	This work has been funded by the Basque Government under the project ref. PIBA 2018-035 and IT-1355-19. This work is part of the project Grant PID 2019-108040RB-C21 funded by MCIN/AEI/10.13039/501100011033.	es_ES
dc.language.iso	eng	es_ES
dc.publisher	MDPI	es_ES
dc.relation	info:eu-repo/grantAgreement/MICINN/PID 2019-108040RB-C21	es_ES
dc.rights	info:eu-repo/semantics/openAccess	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by/3.0/es/
dc.subject	STOI	es_ES
dc.subject	ESTOI	es_ES
dc.subject	NISQA	es_ES
dc.subject	SIIB	es_ES
dc.subject	speech adaptation	es_ES
dc.subject	voice banking	es_ES
dc.title	Automatic Classification of Synthetic Voices for Voice Banking Using Objective Measures	es_ES
dc.type	info:eu-repo/semantics/article	es_ES
dc.date.updated	2022-03-10T14:18:36Z
dc.rights.holder	2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).	es_ES
dc.relation.publisherversion	https://www.mdpi.com/2076-3417/12/5/2473/htm	es_ES
dc.identifier.doi	10.3390/app12052473
dc.departamentoes	Ingeniería de comunicaciones
dc.departamentoeu	Komunikazioen ingeniaritza

Files in this item

Name: applsci-12-02473-v2.pdf
Size: 315.3Kb
Format: PDF
Description: Main article

This item appears in the following Collection(s)

Artículos
