TY - GEN
T1 - Heterogeneous Features Integration in Deep Knowledge Tracing
AU - Cheung, Lap Pong
AU - Yang, Haiqin
N1 - Publisher Copyright:
© 2017, Springer International Publishing AG.
PY - 2017
Y1 - 2017
N2 - Knowledge tracing is a significant research topic in educational data mining. The goal is to automatically trace students’ knowledge states by analyzing their exercise performance. Recently proposed Deep Knowledge Tracing (DKT) model has shown a significant improvement to solve this task by applying deep recurrent neural networks to learn interaction between knowledge components and exercises. The input of the model is only the one-hot encoding to represent the exercise tags and it excludes all other heterogeneous features, which may degrade the performance. To further improve the model performance, researchers have analyzed the heterogeneous features and provided manual ways to select the features and discretize them appropriately. However, the feature engineering efforts are not feasible for data with a huge number of features. To tackle with them, we propose an automatic and intelligent approach to integrate the heterogeneous features into the DKT model. More specifically, we encode the predicted response and the true response into binary bits and combine them with the original one-hot encoding feature as the input to a Long Short Term Memory (LSTM) model, where the predicted response is learned via Classification And Regression Trees (CART) on the heterogeneous features. The predicted response plays the role of determining whether a student will answer the exercise correctly, which can relieve the effect of exceptional samples. Our empirical evaluation on two educational datasets verifies the effectiveness of our proposal.
AB - Knowledge tracing is a significant research topic in educational data mining. The goal is to automatically trace students’ knowledge states by analyzing their exercise performance. Recently proposed Deep Knowledge Tracing (DKT) model has shown a significant improvement to solve this task by applying deep recurrent neural networks to learn interaction between knowledge components and exercises. The input of the model is only the one-hot encoding to represent the exercise tags and it excludes all other heterogeneous features, which may degrade the performance. To further improve the model performance, researchers have analyzed the heterogeneous features and provided manual ways to select the features and discretize them appropriately. However, the feature engineering efforts are not feasible for data with a huge number of features. To tackle with them, we propose an automatic and intelligent approach to integrate the heterogeneous features into the DKT model. More specifically, we encode the predicted response and the true response into binary bits and combine them with the original one-hot encoding feature as the input to a Long Short Term Memory (LSTM) model, where the predicted response is learned via Classification And Regression Trees (CART) on the heterogeneous features. The predicted response plays the role of determining whether a student will answer the exercise correctly, which can relieve the effect of exceptional samples. Our empirical evaluation on two educational datasets verifies the effectiveness of our proposal.
KW - Decision tree
KW - Knowledge tracing
KW - Recurrent neural networks
UR - http://www.scopus.com/inward/record.url?scp=85035148239&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-70096-0_67
DO - 10.1007/978-3-319-70096-0_67
M3 - Conference contribution
AN - SCOPUS:85035148239
SN - 9783319700953
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 653
EP - 662
BT - Neural Information Processing - 24th International Conference, ICONIP 2017, Proceedings
A2 - Zhao, Dongbin
A2 - El-Alfy, El-Sayed M.
A2 - Liu, Derong
A2 - Xie, Shengli
A2 - Li, Yuanqing
T2 - 24th International Conference on Neural Information Processing, ICONIP 2017
Y2 - 14 November 2017 through 18 November 2017
ER -