dc.contributor.author | Saratxaga Couceiro, Ibon | |
dc.contributor.author | Sánchez de la Fuente, Jon | |
dc.contributor.author | Wu, Zhizheng | |
dc.contributor.author | Hernáez Rioja, Inmaculada | |
dc.contributor.author | Navas Cordón, Eva | |
dc.date.accessioned | 2017-11-21T15:11:43Z | |
dc.date.available | 2017-11-21T15:11:43Z | |
dc.date.issued | 2016-04-16 | |
dc.identifier.citation | Speech Communication 81 : 30–41 (2016) | es_ES |
dc.identifier.issn | 0167-6393 | |
dc.identifier.uri | http://hdl.handle.net/10810/23565 | |
dc.description.abstract | Taking advantage of the fact that most of the speech processing techniques neglect the phase information, we seek to detect phase perturbations in order to prevent synthetic impostors attacking Speaker Verification systems. Two Synthetic Speech Detection (SSD) systems that use spectral phase related information are reviewed and evaluated in this work: one based on the Modified Group Delay (MGD), and the other based on the Relative Phase Shift, (RPS). A classical module-based MFCC system is also used as baseline. Different training strategies are proposed and evaluated using both real spoofing samples and copy-synthesized signals from the natural ones, aiming to alleviate the issue of getting real data to train the systems. The recently published ASVSpoof2015 database is used for training and evaluation. Performance with completely unrelated data is also checked using synthetic speech from the Blizzard Challenge as evaluation material. The results prove that phase information can be successfully used for the SSD task even with unknown attacks. | es_ES |
dc.description.sponsorship | This work has been partially supported by the Basque Government (ElkarOla Project, KK-2015/00,098) and the Spanish Ministry of Economy and Competitiveness (Restore project, TEC2015-67,163-C2-1-R). | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Elsevier B.V. | es_ES |
dc.relation | info:eu-repo/grantAgreement/MINECO/TEC2015-67,163-C2-1-R | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.subject | Synthetic speech detection | es_ES |
dc.subject | phase | es_ES |
dc.subject | RPS | es_ES |
dc.subject | MGD | es_ES |
dc.title | Synthetic speech detection using phase information | es_ES |
dc.type | info:eu-repo/semantics/preprint | es_ES |
dc.rights.holder | (c) 2016 Elsevier | es_ES |
dc.relation.publisherversion | http://www.sciencedirect.com/science/article/pii/S0167639316300772 | es_ES |
dc.identifier.doi | 10.1016/j.specom.2016.04.001 | |
dc.departamentoes | Ingeniería de comunicaciones | es_ES |
dc.departamentoeu | Komunikazioen ingeniaritza | es_ES |