Comparative analysis of the forecasting quality of the classical statistical model and the machine learning model on the data of the Russian stock market
https://doi.org/10.24182/2073-6258-2021-20-3-52-63
Abstract
The main objective of this work is to compare the predictive ability of the classical machine learning model — ARIMA, as the most common and well-studied baseline model, and the ML model based on a sequential neural network — in this case, LSTM. The goal is to maximize accuracy and minimize error — selecting the most appropriate model for predicting time series with the highest accuracy. A description is given for these mathematical models. An algorithm is also proposed for forecasting time series using these models, based on the «Rolling window» approach. Practical implementation is implemented using the Python programming environment with the Pandas, Numpy, pmdarima, Keras, Statsmodels libraries. To train the models, we used stock data at the closing price per share of the leading Russian companies: Yandex, VTB, KamAZ, Kiwi, Gazprom, NLMK, Rosneft, Alrosa for the period. The studies carried out demonstrate the predictive superiority of the approach based on neural networks, while the RMSE is 71% less than the same indicator for the ARIMA model, which allows us to conclude that the use of the LSTM model is preferable for this class of problems.
About the Authors
A. V. ShcherbininaRussian Federation
Specialist
Moscow
A. V. Alzheev
Russian Federation
Masterstudent
Moscow
References
1. Magnus Ya.R., Katyshev P.K., Peresetskii A.A. Ekonometrika. Nachal’nyi kurs. – M.: Delo; 2007. 504 s.
2. Schmidhuber J. Habilitation thesis: System modeling and optimization. – Munich: Technical University of Munich; 1993. 209 p. (In Germ.).
3. Hochreiter S., Schmidhuber J. Long short-term memory. Neural Computation. 1997; 9(8): 1735–1780. DOI:10.1162/neco.1997.9.8.1735. PMID 9377276.
4. Brownlee J. Time Series Prediction with LSTM Recurrent Neural Networks in Python with Keras. Machine Learning Mastery. 2016; https://machinelearningmastery.com/time-series-prediction-lstm-recurrent-neural-networks-python-keras.
5. Gers F.A., Schmidhuber J., Cummins F. Learning to Forget: Continual Prediction with LSTM. Neural Computation. 2000; 12(10): 2451–2471.
6. Kratovich P.V. Neironnye seti i modeli ARIMA dlya prognozirovaniya kotirovok. Programmnye produkty i sistemy. 2011; (1): 95–98.
7. Garcia1 F., Guijarro F., Moya I., Oliver J. Estimating returns and conditional volatility: a comparison between the ARMA-GARCH-M Models and the Backpropagation Neural Network. Int. J. Complex Systems in Science. 2012;1(2):21-26. ISSN 2174-6036.
8. Krauss C., Do X.A., Huck N. Deep neural networks, gradientboosted trees, random forests: Statistical arbitrage on the S&P 500. European Journal of Operational Research. 2016; 259(2). DOI:10.1016/j.ejor.2016.10.031.
9. Chung J., Lee D., Seo Y., Yoo C.D. Deep attribute networks. NIPS Workshop on Deep Learning and Unsupervised Feature Learning. 2012; 3(2012). http://www.eng.uwaterloo.ca/~jbergstr/files/nips_dl_2012/Paper%2011.pdf.
10. Fischera T., Krauss C. Deep Learning with Long Short-term Memory Networks for Financial Market Predictions. Eur. J. Oper. Res. 2018; 270(2): 654–669.
11. Ekonometrika. Pod red. Eliseevoi I.I. – M.: Finansy i statistika, 2006. 576 s. ISBN 5-279-02786-3.
12. Armstrong J.S., Collopy F. Error Measures For Generalizing About Forecasting Methods: Empirical Comparisons. International Journal of Forecasting. 1992; 8(1): 69–80. DOI:10.1016/0169-2070(92)90008-w.
13. Hyndman R.J., Athanasopoulos G. Forecasting: Principles and Practice. 2 edition. Melbourne, Australia: OTexts; 2018. 382 p. ISBN-13: 978-0987507112.
Review
For citations:
Shcherbinina A.V., Alzheev A.V. Comparative analysis of the forecasting quality of the classical statistical model and the machine learning model on the data of the Russian stock market. Scientific notes of the Russian academy of entrepreneurship. 2021;20(3):52-63. (In Russ.) https://doi.org/10.24182/2073-6258-2021-20-3-52-63