A lightweight end-to-end anti-spoofing voice model based on WavLM

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the rapidly evolving domain of voice-assisted technologies, ensuring the authenticity and security of voice interactions is paramount, especially in scenarios involving identity verification. This paper introduces a novel, lightweight anti-spoofing voice detection model that integrates the WavLM-Base for feature extraction with a straightforward, resource-efficient classification framework. This model effectively discriminates between genuine and spoofed audio inputs, emphasizing the utility of self-supervised learning models in enhancing security measures within Automatic Speaker Verification systems without necessitating extensive computational resources. The approach leverages innovative techniques to address the growing challenges in voice security, contributing significantly to the robustness and reliability of voice-based systems. We achieve an EER of 0.45% on the ASVspoof2019LA dataset, demonstrating the competitive performance of our method.

Original languageEnglish
Title of host publicationICACS 2024 - Proceedings of the 2024 8th International Conference on Algorithms, Computing and Systems
Pages157-160
Number of pages4
ISBN (Electronic)9798400718304
DOIs
Publication statusPublished - 5 Feb 2025
Event8th International Conference on Algorithms, Computing and Systems, ICACS 2024 - Hong Kong, Hong Kong
Duration: 11 Oct 202413 Oct 2024

Publication series

NameICACS 2024 - Proceedings of the 2024 8th International Conference on Algorithms, Computing and Systems

Conference

Conference8th International Conference on Algorithms, Computing and Systems, ICACS 2024
Country/TerritoryHong Kong
CityHong Kong
Period11/10/2413/10/24

Keywords

  • Anti-spoofing voice
  • End-to-end
  • Self-supervised learning
  • WavLM

Fingerprint

Dive into the research topics of 'A lightweight end-to-end anti-spoofing voice model based on WavLM'. Together they form a unique fingerprint.

Cite this