Forecasting Performance of Machine Learning, Time Series and Hybrid Methods for Low and High Frequency Time Series

Özdemir, Ozancan; Yozgatlıgil, Ceylan

doi:10.1111/stan.12326

Özdemir O., Yozgatlıgil C.

STATISTICA NEERLANDICA, vol.00390402, no.78 (2), pp.441-474, 2024 (SCI-Expanded)

Publication Type: Article / Full Article
Volume: 00390402 Issue: 78 (2)
Publication Date: 2024
DOI: 10.1111/stan.12326
Journal Name: STATISTICA NEERLANDICA
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, ABI/INFORM, Business Source Elite, Business Source Premier, EconLit, zbMATH
Pages: pp.441-474
Affiliated with Middle East Technical University: Yes

Abstract

Forecasting is one of the main objectives of time series analysis, and both machine learning and statistical methods have been proposed for it in the literature. In this study, we compare the forecasting performance of some of these approaches. In addition to traditional forecasting methods, namely the Naive and Seasonal Naive methods, S/ARIMA, Exponential Smoothing, TBATS, Bayesian Exponential Smoothing Models with Trend Modifications, and STL Decomposition, forecasts are also obtained using seven machine learning methods (Random Forest, Support Vector Regression, XGBoosting, BNN, RNN, LSTM, and FFNN) and hybrids of statistical time series and machine learning methods. The data set is selected proportionally from the various time domains in the M4 Competition data set. We thereby aim to create a forecasting guide that considers different preprocessing approaches, methods, and data sets with various time domains. After the experiment, the performance and impact of all methods are discussed. Most of the best models are selected from the machine learning methods. Moreover, the forecasting performance of a model is affected by both the time frequency and the forecast horizon. Lastly, the study suggests that the hybrid approach is not always the best model for forecasting. Hence, this study provides guidelines for understanding which method will perform better at different time series frequencies.
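As a rough illustration of the two simplest baselines named in the abstract, the Naive and Seasonal Naive methods can be sketched as follows. This is a minimal sketch and not the authors' implementation: the function names, the `season_length` parameter, and the toy quarterly series are assumptions made for the example.

```python
from typing import List, Sequence


def naive_forecast(history: Sequence[float], horizon: int) -> List[float]:
    """Naive method: repeat the last observed value for every future step."""
    if len(history) == 0:
        raise ValueError("history must contain at least one observation")
    return [history[-1]] * horizon


def seasonal_naive_forecast(
    history: Sequence[float], season_length: int, horizon: int
) -> List[float]:
    """Seasonal Naive method: repeat the last full observed season."""
    if season_length < 1:
        raise ValueError("season_length must be a positive integer")
    if len(history) < season_length:
        raise ValueError("history must cover at least one full season")
    last_season = list(history[-season_length:])
    # Step h of the forecast reuses the value from the same position
    # in the most recent observed season.
    return [last_season[h % season_length] for h in range(horizon)]


# Hypothetical quarterly series (two years of made-up values).
quarterly = [10.0, 20.0, 30.0, 40.0, 12.0, 22.0, 32.0, 42.0]

naive_out = naive_forecast(quarterly, horizon=3)
seasonal_out = seasonal_naive_forecast(quarterly, season_length=4, horizon=6)
```

The season length would depend on the frequency of the series (for instance 4 for quarterly or 12 for monthly data); how the study configured it is not stated in the abstract.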