TY - JOUR
T1 - A new semi-supervised fuzzy clustering method based on latent representation learning and information fusion
AU - Zhu, Hengdong
AU - Kan, Baoshuo
AU - Li, Yong
AU - Yan, Enliang
AU - Weng, Heng
AU - Wang, Fu Lee
AU - Hao, Tianyong
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
PY - 2025/2
Y1 - 2025/2
N2 - Fuzzy clustering is a simple but efficient clustering method, which aims to deal with ambiguous and overlapping data classification boundaries and provide detailed membership degree information. However, the complex structure of high-dimensional data in the real world easily causes the inefficiency of widely used distance metrics. In addition, traditional fuzzy clustering methods cannot utilize supervision information to guide membership degree matrix learning, which limits clustering performance. In this paper, we propose a new semi-supervised fuzzy clustering method based on latent representation learning and information fusion. Specifically, our method utilizes a deep autoencoder to learn the underlying low-dimensional structure of data, thereby mitigating the impact of redundant features. Moreover, a semi-supervised constraint term based on information fusion is designed to make full use of prior knowledge to supervise the clustering process. On the one hand, a new constraint distance is proposed by leveraging pairwise constraint information to re-evaluate the membership degree of data. On the other hand, the semi-supervised constraint term is constructed based on label information to guide the membership degree matrix learning. Comprehensive experiments on a variety of standard datasets show that our method achieves better performance compared with state-of-the-art baseline methods, demonstrating the effectiveness of the proposed method in fuzzy clustering.
AB - Fuzzy clustering is a simple but efficient clustering method, which aims to deal with ambiguous and overlapping data classification boundaries and provide detailed membership degree information. However, the complex structure of high-dimensional data in the real world easily causes the inefficiency of widely used distance metrics. In addition, traditional fuzzy clustering methods cannot utilize supervision information to guide membership degree matrix learning, which limits clustering performance. In this paper, we propose a new semi-supervised fuzzy clustering method based on latent representation learning and information fusion. Specifically, our method utilizes a deep autoencoder to learn the underlying low-dimensional structure of data, thereby mitigating the impact of redundant features. Moreover, a semi-supervised constraint term based on information fusion is designed to make full use of prior knowledge to supervise the clustering process. On the one hand, a new constraint distance is proposed by leveraging pairwise constraint information to re-evaluate the membership degree of data. On the other hand, the semi-supervised constraint term is constructed based on label information to guide the membership degree matrix learning. Comprehensive experiments on a variety of standard datasets show that our method achieves better performance compared with state-of-the-art baseline methods, demonstrating the effectiveness of the proposed method in fuzzy clustering.
KW - Fuzzy clustering
KW - Information fusion
KW - Latent representation
KW - Semi-supervised
UR - http://www.scopus.com/inward/record.url?scp=85214565553&partnerID=8YFLogxK
U2 - 10.1016/j.asoc.2025.112717
DO - 10.1016/j.asoc.2025.112717
M3 - Article
AN - SCOPUS:85214565553
SN - 1568-4946
VL - 170
JO - Applied Soft Computing Journal
JF - Applied Soft Computing Journal
M1 - 112717
ER -