Predicting Flight Delays Using LSTM and BILSTM Models with Shap Interpretation
Downloads
Flight delays represent a critical challenge in air transportation, affecting passenger satisfaction, operational efficiency, and financial outcomes. This study develops predictive models for flight delay duration at Juanda International Airport, Surabaya, using Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM) networks integrated with a Shapley Additive Explanations (SHAP) interpretability method. The research utilized 242,638 flight observations spanning January 2023 to October 2025, incorporating flight operational and meteorological variables. The dataset was partitioned into training (62.35%), validation (9.01%), and testing (28.64%) subsets. After Min-Max normalization and preprocessing, models were designed with varying hyperparameters through grid search optimization. Performance evaluation employed Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE). Results demonstrated that LSTM with three hidden layers and eight neurons achieved superior performance, with an MAE of 16.888 minutes and an RMSE of 22.401 minutes on test data. BiLSTM yielded an MAE of 24.399 minutes and an RMSE of 34.596 minutes, establishing LSTM as the optimal model. SHAP interpretation revealed that operational factors—including flight type, airline, and airport routes—represent the dominant predictors of delays, followed by meteorological factors such as wind speed and temporal factors including scheduling time. Although R² values remained relatively low, this research provides valuable interpretability insights into delay determinants, enabling data-driven decision-making for airport management and airlines to enhance punctuality and operational efficiency.
Ain, N. (2024). Navigating passenger compensation: Implications for airlines and consumers. Journal of Air Law & Commerce, 89(3), 507.
Aini, H., School, N., Tekhnologi, T., & Yogyakarta, K. (2023). Analysis of delay management due to weather related to operational technicalities at Citilink Airlines at Komodo Labuan Bajo Airport. Journal of General Studies and Research, 1(4), 71–83.
Anupkumar, A. (2023). Investigating the costs and economic impact of flight delays in the aviation industry and the potential strategies for reduction.
Ballakur, A. A., & Arya, A. (2020). Empirical evaluation of gated recurrent neural network architectures in aviation delay prediction. IEEE Transactions on Aerospace and Electronic Systems, 56(3), 1890–1902.
Britto, R., Dresner, M., & Voltes, A. (2012). The impact of flight delays on passenger demand and societal welfare. Transportation Research Part E: Logistics and Transportation Review, 48(2), 460–469.
Chai, T., & Draxler, R. R. (2014). Root mean square error (RMSE) or mean absolute error (MAE)? Arguments against avoiding RMSE in the literature. Geoscientific Model Development, 7(3), 1247–1250.
Chen, H., Jiang, L., Yang, H., Lu, Z., Fu, Y., Li, L., & Yu, Z. (2020). An efficient hardware architecture with adjustable precision and extensible range to implement sigmoid and tanh functions. Electronics, 9(10), 1739.
Choi, C. (2018). Time series prediction with recurrent neural networks in presence of missing data. Journal of Machine Learning Research, 19(1), 1–25.
Cui, Z., Ke, R., Pu, Z., & Wang, Y. (2020). Stacked bidirectional and unidirectional LSTM recurrent neural network for predicting network-wide traffic state with missing values. Transportation Research Part C: Emerging Technologies, 118, 102674.
Depuru, S., Sirisala, S., Akuthota, K., Suresh Reddy, B. V., Amala, K., & Sivanantham, S. (2024). Enhancing flight delay prediction and classification using a hybrid Bi-LSTM. Communications on Applied Nonlinear Analysis, 31(6S), 15–28.
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press.
Han, J., Kamber, M., & Pei, J. (2011). Data mining: Concepts and techniques (3rd ed.). Elsevier.
Hanke, J. E., & Wichern, D. W. (2014). Business forecasting (9th ed.). Pearson Education Limited.
International Air Transport Association. (2021). Economic performance of the airline industry. International Air Transport Association.
Jatavallabha, A., Gerlach, J., & Naresh, A. (2024). Deciphering air travel disruptions: A machine learning approach. arXiv. https://arxiv.org/abs/2408.02802
Li, J., Ji, R., Li, C., Yang, X., Li, J., Li, Y., Xiong, X., Fang, Y., Ding, S., & Cui, T. (2023). Prediction of flight arrival delay time using U.S. Bureau of Transportation Statistics. In 2023 IEEE Symposium Series on Computational Intelligence (pp. 603–608). IEEE.
Wu, C.-L. (2016). Airline operations and delay management: Insights from airline economics, networks and strategic schedule planning. Routledge.
Zámková, M., Rojík, S., Prokop, M., & Stolín, R. (2022). Factors affecting international flight delays and their impact on airline operation and management and passenger compensation fees in air transport industry: Case study of selected airlines in Europe. Sustainability, 14(22), 14763.
Copyright (c) 2026 Ifayanti Rohmatul Hidayah, Irhamah Irhamah, Santi Puteri Rahayu

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

