Developing a Probabilistic Modeling Algorithm for Movement Patterns in Financial Markets Using Data Mining and Optimal Data Size Determination (A Comparative Study with ARIMA-(G)ARCH econometric models)
الملخص
In the volatile realm of financial markets, this research outlines a roadmap to enhance the accuracy of financial market trend predictions by proposing an algorithm based on the conditional probability distribution of sequential movement patterns and data mining techniques to determine the optimal size of probabilistic modeling data. The study demonstrates the algorithm’s efficiency and generalizability through its application to multiple global and Arab financial markets (14 financial indices from global and Arab stock markets).
The key findings of this research highlight the superiority of the proposed algorithm—which incorporates optimal data size determination—over both the algorithm without optimal data size determination and ARIMA-(G)ARCH econometric models. This conclusion is supported by the application of the Predictive Model Value-Added criterion, which provides an intrinsic evaluation of the utility of any applied predictive model. Additionally, a statistically significant difference analysis was conducted between: (1) the average prediction accuracy of the proposed algorithm and econometric models, and (2) the proposed algorithm with and without optimal data size determination. All applied comparisons and criteria conclusively demonstrated the superiority of the proposed algorithm with optimal data size determination, confirming its stability, consistency, and generalizability. These innovations open new avenues for refining investment strategies and mitigating risks, establishing the algorithm as a valuable tool for investors and financial analysts.
References
- Asare, C., Asante, D., & Essel, J. F.(2023). Probabilistic LSTM Modeling for Stock Price Prediction with Monte Carlo dropout Long Short-Term Memory Network. International Journal of Innovative Science and Research Technology. Volume 8, Issue 7, July – 2023. ISSN No:-2456-2165.
- Babichev, D., & Bach, F. (2018). Constant step size stochastic gradient descent for probabilistic modeling. arXiv preprint arXiv:1804.05567.
- Bertsekas, D. P., & Tsitsiklis, J. N. (2008). Introduction to Probability. Athena Scientific.P:45.
- Bou-Hamad, I., & Jamali, I. (2020). Forecasting financial time-series using data mining models: A simulation study. Research in International Business and Finance, 51, 101072
- Bousbaa, Z., Sanchez-Medina, J., & Bencharef, O. (2023). Financial time series forecasting: a data stream mining-based system. Electronics,MDPI, 12(9), 2039.
- Dar, G. F., Padi, T. R., & Rekha, S. (2022). Stock price prediction using a Markov chain Model: a study for TCS Share Values. Advances and Applications in Statistics, 80, 83-101.
- Evans, M. J., & Rosenthal, J. S. (2004). Probability and Statistics: The Science of Uncertainty. Springer.P12.
- Han, J., Kamber, M., & Pei, J. (2012). Data Mining: Concepts and. Techniques, Waltham: Morgan Kaufmann Publishers.(PP:3-12).
- Luo, J. (2022, January). Modeling of data mining technology in financial data recognition mining and forecasting. In 2022 4th International Conference on Smart Systems and Inventive Technology (ICSSIT) (pp. 1168-1171). IEEE.
- Monika, P., Ruchjana, B., & Abdulla, A. (2022). The implementation of the ARIMA-ARCH model using data mining for forecasting rainfall in Bandung city. International Journal of Data and Network Science, 6(4), 1309-1318.
- Nugraha, E. S., & Alvina, C. (2024). THE APPLICATION OF STANDARD GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSCEDASTICITY (SGARCH) MODEL IN FORECASTING THE STOCK PRICE OF BARITO PACIFIC. BAREKENG Journal of Mathematics and Its Applications, 18(2), 0849-0862.
- Provost, F. (2013). Data Science for Business: What you need to know about data mining and data-analytic thinking (Vol. 355). O'Reilly Media, Inc.
- Rojas, F., Leiva, V., Huerta, M., & Martin-Barreiro, C. (2021). Lot-size models with uncertain demand considering its skewness/kurtosis and stochastic programming applied to hospital pharmacy with sensor-related COVID-19 data. Sensors, 21(15), 5198.
- Ross, S. M. (2010). Introduction to Probability Models (10th ed.). Academic Press.PP:3-5.
- Yahaya, A., Etuk, E., & Emeka, A. (2021). Comparative performance of arima and garch model in forecasting crude oil price data. Asian Journal of Probability and Statistics, 15(4), 251-275.