TY - GEN
T1 - Beyond Log Parsers
T2 - 49th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2025
AU - Sun, Yicheng
AU - Keung, Jacky
AU - Yu, Hi Kuen
AU - Liu, Shuo
AU - Liao, Yihan
AU - Zhang, Jingyu
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Log anomaly detection is critical for ensuring software system reliability and security, yet challenges persist in log parser dependency, small-scale dataset applicability, and hyperparameter tuning efficiency. Existing methods over-rely on predefined log templates, leading to information loss and high computational overhead. Additionally, anomaly detection models often struggle with limited log data, and hyperparameter tuning remains computationally expensive in dynamic environments. In this paper, we empirically evaluate seven state-of-the-art anomaly detection models across varied software systems, assessing the necessity of log parsers and model performance on small-scale datasets. Furthermore, we propose SMAC-, an enhanced real-time hyperparameter optimization framework, integrating stochastic gradient descent (SGD) and adaptive learning to improve model adaptability and efficiency. Our experiments on six benchmark datasets demonstrate that SMAC- achieves an overall average F1-score improvement of 4.27%, a 27.55% reduction in hyperparameter tuning time compared to other models, and a 1.35% increase in F1-score when adapting to newly emerging logs, compared to its counterpart without SGD integration. These findings underscore the practical advantages of AI-driven log analysis, providing valuable insights into scalable, software-engineered anomaly detection.
AB - Log anomaly detection is critical for ensuring software system reliability and security, yet challenges persist in log parser dependency, small-scale dataset applicability, and hyperparameter tuning efficiency. Existing methods over-rely on predefined log templates, leading to information loss and high computational overhead. Additionally, anomaly detection models often struggle with limited log data, and hyperparameter tuning remains computationally expensive in dynamic environments. In this paper, we empirically evaluate seven state-of-the-art anomaly detection models across varied software systems, assessing the necessity of log parsers and model performance on small-scale datasets. Furthermore, we propose SMAC-, an enhanced real-time hyperparameter optimization framework, integrating stochastic gradient descent (SGD) and adaptive learning to improve model adaptability and efficiency. Our experiments on six benchmark datasets demonstrate that SMAC- achieves an overall average F1-score improvement of 4.27%, a 27.55% reduction in hyperparameter tuning time compared to other models, and a 1.35% increase in F1-score when adapting to newly emerging logs, compared to its counterpart without SGD integration. These findings underscore the practical advantages of AI-driven log analysis, providing valuable insights into scalable, software-engineered anomaly detection.
KW - AI-Driven Log Anomaly Detection
KW - Empirical Software Anomaly Analysis
KW - Log Analysis
KW - Real-Time Hyperparameter Optimization
KW - Scalable Log Parsing and Detection
UR - https://www.scopus.com/pages/publications/105016206544
U2 - 10.1109/COMPSAC65507.2025.00173
DO - 10.1109/COMPSAC65507.2025.00173
M3 - Conference contribution
AN - SCOPUS:105016206544
T3 - Proceedings - 2025 IEEE 49th Annual Computers, Software, and Applications Conference, COMPSAC 2025
SP - 1382
EP - 1387
BT - Proceedings - 2025 IEEE 49th Annual Computers, Software, and Applications Conference, COMPSAC 2025
A2 - Shahriar, Hossain
A2 - Alam, Kazi Shafiul
A2 - Ohsaki, Hiroyuki
A2 - Cimato, Stelvio
A2 - Capretz, Miriam
A2 - Ahmed, Shamem
A2 - Ahamed, Sheikh Iqbal
A2 - Majumder, AKM Jahangir Alam
A2 - Haque, Munirul
A2 - Yoshihisa, Tomoki
A2 - Cuzzocrea, Alfredo
A2 - Takemoto, Michiharu
A2 - Sakib, Nazmus
A2 - Elsayed, Marwa
Y2 - 8 July 2025 through 11 July 2025
ER -