European Journal of Education, cilt.60, sa.4, 2025 (SSCI, Scopus)
Assessing achievement is a complex task due to its dependence on multiple factors and the hierarchical structure of educational data, yet surveys like TIMSS offer valuable insights into its determining factors like students' mathematics anxiety. However, disregarding the nested structure of data and ignoring the assumptions of models causes poor performance such as inaccurate predictions and biased estimates. Our research utilises linear mixed models (LMMs) and machine learning (ML) techniques (e.g., REEM-tree and GP boosting) especially chosen for their abilities to model nested data and capture non-linear relationships. This study is a pioneer in the literature as these ML algorithms are implemented for the first time in TIMSS. Accordingly, mathematical tendency and emotional factors are the two primary predictors of mathematics achievement across all methods, acknowledging the possibility of potential bias due to reliance on self-report responses. However, there are variations in the effect size of the students' origins among the methods. This indicates different algorithms yield distinct results according to their inner processes and priorities, such as revealing statistical significance of predictors or contributing to predictive performance. Moreover, gender has a negligible impact across all models in our analysis, caused by cultural differences in the sample. Overall, while LMMs are widely accepted, ML methods remain competitive alternatives in prediction and flexibility. All three methods yield similar benchmarks, yet ML methods offer slightly better performance in RMSE, MAE, and MAPE while exhibiting high predictive power and capturing nonlinearity and interaction. Although they take more computation time, parallel processing mitigates this in larger datasets. Consequently, ML methods and LMMs concurrently provide broader and more precise insights in terms of predictive and inferential gains.