A Semisupervised Approach for Industrial Anomaly Detection via Self-Adaptive Clustering

Xiaoxue Ma, Jacky Keung, Pinjia He, Yan Xiao, Xiao Yu, Yishu Li

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

With the rapid development of the Industrial Internet of Things, log-based anomaly detection has become vital for smart industrial construction that has prompted many researchers to contribute. To detect anomalies based on log data, semisupervised approaches stand out from supervised and unsupervised approaches because they only require a portion of labeled data and are relatively stable. However, the state-of-the-art semisupervised approaches still suffer from two main problems: manual parameter setting and unsatisfactory performance with high false positives. We propose AdaLog, an integrated semisupervised approach based on self-adaptive clustering, for industrial anomaly detection. In particular, the clustering step performs automatic label probability estimation by distinguishing 12 situations so that the label probability of each unlabeled data can be carefully calculated, leading to high accuracy. In addition, AdaLog employs a pretrained model to learn contextual information comprehensively and a transformer-based model to detect anomalies efficiently. To alleviate class imbalance, an undersampling method is incorporated. The results on three popular datasets demonstrate that AdaLog significantly outperforms three state-of-the-art semisupervised approaches by 17.8%-2489.8% on average in terms of F1-score, and is even superior to two supervised approaches in most cases with average improvements of 10.9%-23.8%.

Original languageEnglish
Pages (from-to)1687-1697
Number of pages11
JournalIEEE Transactions on Industrial Informatics
Volume20
Issue number2
DOIs
Publication statusPublished - 1 Feb 2024
Externally publishedYes

Keywords

  • Clustering
  • deep learning
  • intelligent anomaly detection
  • transformer

Fingerprint

Dive into the research topics of 'A Semisupervised Approach for Industrial Anomaly Detection via Self-Adaptive Clustering'. Together they form a unique fingerprint.

Cite this