Prediction of Amazon’s Stock Price Based on ARIMA, XGBoost, and LSTM Models






Finding the best model to predict the trend of stock prices is an issue that has always garnered attention, and it is also closely related to investors’ investment dynamics. Even the commonly used autoregressive integrated moving average (ARIMA), extreme gradient boosting (XGBoost), and long short-term memory (LSTM) have their own advantages and disadvantages. We use mean squared error (MSE) to judge the most suitable model for predicting Amazon’s stock price from many aspects and find that LSTM is the model with the best fitting effect and the closest to the real curve. However, the LSTM model still needs to improve in terms of performance so as to reduce the bias. We anticipate the discovery of more models that are apt for predicting stocks in the future.


Asness CS, Frazzini A, Pedersen LH, 2012, Leverage Aversion and Risk Parity. Financial Analysts Journal, 68(1): 47–59.

Qin J, Tao Z, Huang S, et al., 2021, Stock Price Forecast Based on ARIMA Model and BP Neural Network Model. 2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE), IEEE, 426–430.

Ho SL, Xie M, 1998, The Use of ARIMA Models for Reliability Forecasting and Analysis. Computers & Industrial Engineering, 35(1–2): 213–216.

Benvenuto D, Giovanetti M, Vassallo L, et al., 2020, Application of the ARIMA Model on the COVID-2019 Epidemic Dataset. Data in Brief, 29: 105340.

Zhang GP, 2003, Time Series Forecasting Using a Hybrid ARIMA and Neural Network Model. Neurocomputing, 50: 159–175.

Contreras J, Espinola R, Nogales FJ, et al., 2003, ARIMA Models to Predict Next-Day Electricity Prices. IEEE Transactions on Power Systems, 18(3): 1014–1020.

Ariyo AA, Adewumi AO, Ayo CK, 2014, Stock Price Prediction Using the ARIMA Model. 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, IEEE, 106–112.

Smagulova K, James AP, 2019, A Survey on LSTM Memristive Neural Network Architectures and Applications. The European Physical Journal Special Topics, 228(10): 2313–2324.

Gers FA, Schraudolph NN, Schmidhuber J, 2002, Learning Precise Timing with LSTM Recurrent Networks. Journal of Machine Learning Research, 3: 115–143.

Graves A, Fernandez S, Schmidhuber J, 2005, Bidirectional LSTM Networks for Improved Phoneme Classification and Recognition. International Conference on Artificial Neural Networks, Springer, 799–804.

Chen K, Zhou Y, Dai F, 2015, A LSTM-Based Method for Stock Returns Prediction: A Case Study of China Stock Market. 2015 IEEE International Conference on Big Data (Big Data), IEEE, 2823–2824.

Fu R, Zhang Z, Li L, 2016, Using LSTM and GRU Neural Network Methods for Traffic Flow Prediction. 2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC), IEEE, 324–328.

Liao J, Zhang R, 2018, Dynamic Weighting Multi Factor Stock Selection Strategy Based on XGBoost Machine Learning Algorithm. 2018 IEEE International Conference of Safety Produce Informatization (IICSPI), IEEE, 868–872.

Basak S, Kar S, Saha S, et al., 2019, Predicting the Direction of Stock Market Prices Using Tree-Based Classifiers. The North American Journal of Economics and Finance, 47: 552–567.

Kumar DS, Thiruvarangan BC, Vishnu A, et al., 2022, Analysis and Prediction of Stock Price Using Hybridization of SARIMA and XGBoost. 2022 International Conference on Communication, Computing and Internet of Things (IC3IoT), IEEE, 1–4.

Gumelar AB, Setyorini H, Adi DP, et al., 2020, Boosting the Accuracy of Stock Market Prediction using XGBoost and Long Short-Tergumem Memory. 2020 International Seminar on Application for Technology of Information and Communication (iSemantic), IEEE, 609–613.

Cao J, Li Z, Li J, 2019, Financial Time Series Forecasting Model Based on CEEMDAN and LSTM. Physica A: Statistical Mechanics and Its Applications, 519: 127–139.

Jiang H, He Z, Ye G, et al., 2020, Network Intrusion Detection Based on PSO-XGBoost Model. IEEE Access, 8: 58392–58401.

Ma X, Sha J, Wang D, et al., 2018, Study on a Prediction of P2P Network Loan Default Based on the Machine Learning LightGBM and XGBoost Algorithms According to Different High Dimensional Data Cleaning. Electron Commerce Res Appl, 31: 24–39.