TY - GEN
T1 - Beyond Scores
T2 - 10th IEEE International Conference on Behavioural and Social Computing, BESC 2023
AU - Wei, Han
AU - Li, Zongxi
AU - Xie, Haoran
AU - Hung, Kevin
AU - Wang, Minhong
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Predicting student performance based on continuous assessment scores is an important task in educational data mining (EDM). The task aims to provide personalized feedback and intervention for students at risk of failing the course or dropping out. In this paper, we examine the limitations of using raw scores as input features for predicting student outcomes, as raw scores can be biased and sensitive to the variation of assignment difficulty. We propose a novel approach that uses rank instead of score as the input feature for predicting student performance. Rank is a relative measure that reflects the performance of a student compared to other students in the same course or cohort, which can reduce bias caused by different score distributions and assignment difficulties, and capture the relative position and improvement of each student. In this paper, we use the idea of positional embedding to encode rank into a dense representation for prediction models. Positional encoding assigns a vector representation to each rank based on its ordinal position in a sequence. We apply positional encoding to various recurrent neural network (RNN) models and other machine learning models, such as logistic regression and support vector machines. We compare their performance on public datasets of student assessment scores. Our results show that using rank with positional encoding can significantly improve the prediction accuracy and reliability of RNN models and their variants.
AB - Predicting student performance based on continuous assessment scores is an important task in educational data mining (EDM). The task aims to provide personalized feedback and intervention for students at risk of failing the course or dropping out. In this paper, we examine the limitations of using raw scores as input features for predicting student outcomes, as raw scores can be biased and sensitive to the variation of assignment difficulty. We propose a novel approach that uses rank instead of score as the input feature for predicting student performance. Rank is a relative measure that reflects the performance of a student compared to other students in the same course or cohort, which can reduce bias caused by different score distributions and assignment difficulties, and capture the relative position and improvement of each student. In this paper, we use the idea of positional embedding to encode rank into a dense representation for prediction models. Positional encoding assigns a vector representation to each rank based on its ordinal position in a sequence. We apply positional encoding to various recurrent neural network (RNN) models and other machine learning models, such as logistic regression and support vector machines. We compare their performance on public datasets of student assessment scores. Our results show that using rank with positional encoding can significantly improve the prediction accuracy and reliability of RNN models and their variants.
KW - educational data mining
KW - machine learning
KW - positional encoding
KW - rank
KW - recurrent neural network
KW - student performance prediction
UR - http://www.scopus.com/inward/record.url?scp=85184657064&partnerID=8YFLogxK
U2 - 10.1109/BESC59560.2023.10386879
DO - 10.1109/BESC59560.2023.10386879
M3 - Conference contribution
AN - SCOPUS:85184657064
T3 - Proceedings of the 2023 IEEE International Conference on Behavioural and Social Computing, BESC 2023
BT - Proceedings of the 2023 IEEE International Conference on Behavioural and Social Computing, BESC 2023
Y2 - 30 October 2023 through 1 November 2023
ER -