TY - GEN
T1 - A lightweight end-to-end anti-spoofing voice model based on WavLM
AU - Fu, Jingchang
AU - Au Yeung, Siu Kei
AU - Hung, Kevin
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s).
PY - 2025/2/5
Y1 - 2025/2/5
N2 - In the rapidly evolving domain of voice-assisted technologies, ensuring the authenticity and security of voice interactions is paramount, especially in scenarios involving identity verification. This paper introduces a novel, lightweight anti-spoofing voice detection model that integrates the WavLM-Base for feature extraction with a straightforward, resource-efficient classification framework. This model effectively discriminates between genuine and spoofed audio inputs, emphasizing the utility of self-supervised learning models in enhancing security measures within Automatic Speaker Verification systems without necessitating extensive computational resources. The approach leverages innovative techniques to address the growing challenges in voice security, contributing significantly to the robustness and reliability of voice-based systems. We achieve an EER of 0.45% on the ASVspoof2019LA dataset, demonstrating the competitive performance of our method.
AB - In the rapidly evolving domain of voice-assisted technologies, ensuring the authenticity and security of voice interactions is paramount, especially in scenarios involving identity verification. This paper introduces a novel, lightweight anti-spoofing voice detection model that integrates the WavLM-Base for feature extraction with a straightforward, resource-efficient classification framework. This model effectively discriminates between genuine and spoofed audio inputs, emphasizing the utility of self-supervised learning models in enhancing security measures within Automatic Speaker Verification systems without necessitating extensive computational resources. The approach leverages innovative techniques to address the growing challenges in voice security, contributing significantly to the robustness and reliability of voice-based systems. We achieve an EER of 0.45% on the ASVspoof2019LA dataset, demonstrating the competitive performance of our method.
KW - Anti-spoofing voice
KW - End-to-end
KW - Self-supervised learning
KW - WavLM
UR - http://www.scopus.com/inward/record.url?scp=85219510259&partnerID=8YFLogxK
U2 - 10.1145/3708597.3708621
DO - 10.1145/3708597.3708621
M3 - Conference contribution
AN - SCOPUS:85219510259
T3 - ICACS 2024 - Proceedings of the 2024 8th International Conference on Algorithms, Computing and Systems
SP - 157
EP - 160
BT - ICACS 2024 - Proceedings of the 2024 8th International Conference on Algorithms, Computing and Systems
T2 - 8th International Conference on Algorithms, Computing and Systems, ICACS 2024
Y2 - 11 October 2024 through 13 October 2024
ER -