Synthetic speech detection using phase information

Saratxaga Couceiro, Ibon; Sánchez de la Fuente, Jon; Wu, Zhizheng; Hernáez Rioja, Inmaculada; Navas Cordón, Eva

dc.contributor.author	Saratxaga Couceiro, Ibon
dc.contributor.author	Sánchez de la Fuente, Jon
dc.contributor.author	Wu, Zhizheng
dc.contributor.author	Hernáez Rioja, Inmaculada
dc.contributor.author	Navas Cordón, Eva
dc.date.accessioned	2017-11-21T15:11:43Z
dc.date.available	2017-11-21T15:11:43Z
dc.date.issued	2016-04-16
dc.identifier.citation	Speech Communication 81: 30–41 (2016)	es_ES
dc.identifier.issn	0167-6393
dc.identifier.uri	http://hdl.handle.net/10810/23565
dc.description.abstract	Taking advantage of the fact that most speech processing techniques neglect phase information, we seek to detect phase perturbations in order to prevent synthetic impostors from attacking Speaker Verification systems. Two Synthetic Speech Detection (SSD) systems that use spectral phase related information are reviewed and evaluated in this work: one based on the Modified Group Delay (MGD), and the other based on the Relative Phase Shift (RPS). A classical magnitude-based MFCC system is also used as a baseline. Different training strategies are proposed and evaluated using both real spoofing samples and copy-synthesized signals derived from the natural ones, aiming to alleviate the difficulty of obtaining real data to train the systems. The recently published ASVSpoof2015 database is used for training and evaluation. Performance with completely unrelated data is also checked using synthetic speech from the Blizzard Challenge as evaluation material. The results show that phase information can be successfully used for the SSD task even with unknown attacks.	es_ES
dc.description.sponsorship	This work has been partially supported by the Basque Government (ElkarOla Project, KK-2015/00098) and the Spanish Ministry of Economy and Competitiveness (Restore project, TEC2015-67163-C2-1-R).	es_ES
dc.language.iso	eng	es_ES
dc.publisher	Elsevier B.V.	es_ES
dc.relation	info:eu-repo/grantAgreement/MINECO/TEC2015-67163-C2-1-R	es_ES
dc.rights	info:eu-repo/semantics/openAccess	es_ES
dc.subject	Synthetic speech detection	es_ES
dc.subject	phase	es_ES
dc.subject	RPS	es_ES
dc.subject	MGD	es_ES
dc.title	Synthetic speech detection using phase information	es_ES
dc.type	info:eu-repo/semantics/preprint	es_ES
dc.rights.holder	(c) 2016 Elsevier	es_ES
dc.relation.publisherversion	http://www.sciencedirect.com/science/article/pii/S0167639316300772	es_ES
dc.identifier.doi	10.1016/j.specom.2016.04.001
dc.departamentoes	Ingeniería de comunicaciones	es_ES
dc.departamentoeu	Komunikazioen ingeniaritza	es_ES

Files in this item

Name: Speech Communication SSD using ...
Size: 771.2Kb
Format: PDF
Description: Preprint

This item appears in the following Collection(s)

Comunicaciones
