
Adversarial Speaker Verification

2019-04-29
Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong

Abstract

The use of deep networks to extract embeddings for speaker recognition has proven successful. However, such embeddings are susceptible to performance degradation due to mismatches among the training, enrollment, and test conditions. In this work, we propose an adversarial speaker verification (ASV) scheme to learn condition-invariant deep embeddings via adversarial multi-task training. In ASV, a speaker classification network and a condition identification network are jointly optimized to minimize the speaker classification loss and simultaneously mini-maximize the condition loss. The target labels of the condition network can be categorical (environment types) or continuous (SNR values). We further propose multi-factorial ASV to simultaneously suppress multiple factors that constitute the condition variability. Evaluated on a Microsoft Cortana text-dependent speaker verification task, ASV achieves 8.8% and 14.5% relative improvements in equal error rate (EER) for known and unknown conditions, respectively.
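
The abstract does not spell out the update rule, so the following is only a minimal sketch of how a "minimize the speaker loss, mini-maximize the condition loss" step is commonly realized with a reversed gradient. It assumes the gradients have already been computed elsewhere. The function name `asv_update`, the parameter grouping (`theta_f` for the embedding extractor, `theta_y` for the speaker classifier, `theta_c` for the condition network), and the trade-off weight `lam` are all hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

Vector = List[float]


def asv_update(
    theta_f: Vector, theta_y: Vector, theta_c: Vector,
    grad_spk_f: Vector, grad_spk_y: Vector,
    grad_cond_f: Vector, grad_cond_c: Vector,
    lr: float = 0.1, lam: float = 0.5,
) -> Tuple[Vector, Vector, Vector]:
    """One step of adversarial multi-task training (illustrative sketch).

    theta_f: embedding extractor parameters (hypothetical name)
    theta_y: speaker classifier parameters (hypothetical name)
    theta_c: condition network parameters (hypothetical name)
    lam:     assumed weight on the adversarial condition term
    """
    # Embedding extractor: descend the speaker loss, but ascend the
    # condition loss (reversed gradient), so the embedding carries
    # less information about the condition.
    new_f = [w - lr * (gs - lam * gc)
             for w, gs, gc in zip(theta_f, grad_spk_f, grad_cond_f)]
    # Speaker classifier: ordinary gradient descent on the speaker loss.
    new_y = [w - lr * g for w, g in zip(theta_y, grad_spk_y)]
    # Condition network: ordinary gradient descent on the condition loss,
    # i.e. it keeps trying to identify the condition from the embedding.
    new_c = [w - lr * g for w, g in zip(theta_c, grad_cond_c)]
    return new_f, new_y, new_c


# Toy usage with one parameter per network and made-up gradients.
f, y, c = asv_update(
    theta_f=[1.0], theta_y=[1.0], theta_c=[0.0],
    grad_spk_f=[2.0], grad_spk_y=[1.0],
    grad_cond_f=[1.0], grad_cond_c=[-2.0],
)
```

The same structure would apply whether the condition loss is a cross-entropy over environment types or a regression loss over SNR values; only the computation of `grad_cond_f` and `grad_cond_c` would change. A multi-factorial variant could, under the same assumptions, subtract one weighted reversed gradient per condition factor.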

URL

http://arxiv.org/abs/1904.12406

PDF

http://arxiv.org/pdf/1904.12406

