TY - GEN
T1 - Identifying the Cause of Performance Issues of Pretrained Language Model for Educational Technology
AU - Chan, Pak Yuen Patrick
AU - Keung, Jacky
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Online self-learning platforms, such as questionanswering (Q&A) websites, are popular technology for learning, and utilising Pretrained Language Models (PLMs) to maintain their content qualities is a common practice. However, it is challenging to identify the cause that affects the performance of PLMs. In this study, we propose using machine common sense to identify the cause affecting the performance of PLMs. We conducted an empirical experiment with three PLMs using a publicly available dataset. We select 45000 data points as the training data and 1000 as the testing data. We first train the PLMs with training data, then run machine common sense tests to examine their reasoning abilities. We define the causal relationship between content quality and reasoning ability, and use the cause to derive Metamorphic Relations (MR) of Metamorphic Testing (MT) for creating follow-up testing datasets. We analyse the changes between source and follow-up outputs to see whether the identified cause affects the performance. Results show that the reasonableness of the content is the cause that affects the performance of PLM, which has reasoning abilities. In addition, the proposed approach in this study is effective for identifying and validating the cause that affects the performance of PLMs, even on devices with limited computer resources. Future research can apply our approach and seek different machine common sense tests and counterfactual analysing techniques to identify different causes of performance issues of different PLMs.
AB - Online self-learning platforms, such as questionanswering (Q&A) websites, are popular technology for learning, and utilising Pretrained Language Models (PLMs) to maintain their content qualities is a common practice. However, it is challenging to identify the cause that affects the performance of PLMs. In this study, we propose using machine common sense to identify the cause affecting the performance of PLMs. We conducted an empirical experiment with three PLMs using a publicly available dataset. We select 45000 data points as the training data and 1000 as the testing data. We first train the PLMs with training data, then run machine common sense tests to examine their reasoning abilities. We define the causal relationship between content quality and reasoning ability, and use the cause to derive Metamorphic Relations (MR) of Metamorphic Testing (MT) for creating follow-up testing datasets. We analyse the changes between source and follow-up outputs to see whether the identified cause affects the performance. Results show that the reasonableness of the content is the cause that affects the performance of PLM, which has reasoning abilities. In addition, the proposed approach in this study is effective for identifying and validating the cause that affects the performance of PLMs, even on devices with limited computer resources. Future research can apply our approach and seek different machine common sense tests and counterfactual analysing techniques to identify different causes of performance issues of different PLMs.
KW - causality
KW - Content quality prediction
KW - machine common sense
KW - metamorphic testing
KW - pre-trained language model
KW - self-learning platform
UR - https://www.scopus.com/pages/publications/105015826291
U2 - 10.1109/ISET65607.2025.00044
DO - 10.1109/ISET65607.2025.00044
M3 - Conference contribution
AN - SCOPUS:105015826291
T3 - Proceedings - 2025 International Symposium on Educational Technology, ISET 2025
SP - 179
EP - 183
BT - Proceedings - 2025 International Symposium on Educational Technology, ISET 2025
A2 - Chui, Kwok Tai
A2 - Jaikaeo, Chaiporn
A2 - Niramitranon, Jitti
A2 - Kaewmanee, Wattana
A2 - Ng, Kwan-Keung
A2 - Ongkunaruk, Pornthipa
T2 - 11th International Symposium on Educational Technology, ISET 2025
Y2 - 22 July 2025 through 25 July 2025
ER -