Combined generative adversarial network and fuzzy C-means clustering for multi-class voice disorder detection with an imbalanced dataset

Kwok Tai Chui, Miltiadis D. Lytras, Pandian Vasant

Research output: Contribution to journalArticlepeer-review

44 Citations (Scopus)

Abstract

The world has witnessed the success of artificial intelligence deployment for smart healthcare applications. Various studies have suggested that the prevalence of voice disorders in the general population is greater than 10%. An automatic diagnosis for voice disorders via machine learning algorithms is desired to reduce the cost and time needed for examination by doctors and speech-language pathologists. In this paper, a conditional generative adversarial network (CGAN) and improved fuzzy c-means clustering (IFCM) algorithm called CGAN-IFCM is proposed for the multi-class voice disorder detection of three common types of voice disorders. Existing benchmark datasets for voice disorders, the Saarbruecken Voice Database (SVD) and the Voice ICar fEDerico II Database (VOICED), use imbalanced classes. A generative adversarial network offers synthetic data to reduce bias in the detection model. Improved fuzzy c-means clustering considers the relationship between adjacent data points in the fuzzy membership function. To explain the necessity of CGAN and IFCM, a comparison is made between the algorithm with CGAN and that without CGAN. Moreover, the performance is compared between IFCM and traditional fuzzy c-means clustering. Lastly, the proposed CGAN-IFCM outperforms existing models in its true negative rate and true positive rate by 9.9-12.9% and 9.1-44.8%, respectively.

Original languageEnglish
Article number4571
JournalApplied Sciences (Switzerland)
Volume10
Issue number13
DOIs
Publication statusPublished - 1 Jul 2020

Keywords

  • Artificial intelligence
  • Fuzzy c-means clustering
  • Generative adversarial network
  • Imbalanced dataset
  • Machine learning
  • Multi-class detection
  • Smart healthcare
  • Synthetic data
  • Voice disorders

Fingerprint

Dive into the research topics of 'Combined generative adversarial network and fuzzy C-means clustering for multi-class voice disorder detection with an imbalanced dataset'. Together they form a unique fingerprint.

Cite this