Designing a Hybrid Machine Learning Model for Weather Forecasting in Batam City
DOI:
https://doi.org/10.55537/jistr.v5i1.1504Keywords:
weather prediction, hybrid model, time series analysis, multi source data, tropical climate, Machine Learning, AIAbstract
Accurate weather forecasting in tropical regions such as Batam City is challenging due to high climate variability and frequent data gaps caused by unstable atmospheric conditions. This study aims to develop a reliable daily average temperature forecasting system using a hybrid approach that combines the Seasonal Autoregressive Integrated Moving Average (SARIMA) model and the Long Short-Term Memory (LSTM) neural network. The main novelty of this research lies in the residual hybridization method, where SARIMA is used to capture linear seasonal patterns and LSTM is applied to model the non-linear residual components, as well as the use of a multi-source data integration strategy to fill missing data. Historical temperature data from BMKG and other publicly available meteorological sources were merged to produce a continuous dataset covering the period from 2015 to 2021. The study evaluated several model architectures, including standalone statistical models, standalone machine learning models, and hybrid models, to identify the most effective approach. The experimental results show that the SARIMA–LSTM hybrid model outperformed the other models, achieving a high prediction accuracy with an R² value of 0.92 and a Root Mean Square Error (RMSE) of 1.73°C. These findings demonstrate that integrating linear and non-linear models can significantly improve temperature forecasting performance and provide a practical solution for weather monitoring in tropical environments
Downloads
References
[1] A. S. Ramadhan, A. Yudono, and A. W. Hasyim, "Dampak Pertumbuhan Kota dan Perubahan Tutupan Lahan Terhadap Temperature Humidity Index di Pulau Batam," Planning for Urban Region and Environment Journal (PURE), vol. 12, no. 1, 2023. Available: https://purejournal.ub.ac.id/index.php/pure/article/view/965
[2] S. Ardabili, A. Mosavi, M. Dehghani, and A. R. Várkonyi-Kóczy, "Deep learning and machine learning in hydrological processes, climate change and Earth systems: a systematic review," in Lecture Notes in Networks and Systems, vol. 151, 2020, pp. 52–62. DOI: https://doi.org/10.1007/978-3-030-36841-8_5
[3] O. Fathi, "Time series forecasting using a hybrid ARIMA and LSTM model," Velvet Consulting, 202X. [Online].
[4] Q. Zhang, Z. Li, S. Snowling, A. Siam, and W. El-Dakhakhni, "Predictive models for wastewater flow forecasting based on time series analysis and Artificial Neural Network," Water Science and Technology, vol. 80, no. 2, pp. 243–253, Jul. 2019. DOI: https://doi.org/10.2166/wst.2019.263
[5] P. Do, C. W. Chow, R. Rameezdeen, and N. Gorjian, "Wastewater inflow time series forecasting at low temporal resolution using SARIMA model: A case study in South Australia," Environmental Science and Pollution Research, vol. 29, no. 47, pp. 70984–70999, May 2022. DOI: https://doi.org/10.1007/s11356-022-20777-y
[6] R. Fawzy, A. S. Eltrass, and H. M. Elkamchouchi, "A new deep learning hybrid model for accurate web traffic time series forecasting," in 2024 6th Novel Intelligent and Leading Emerging Sciences Conference (NILES), Oct. 2024, pp. 403–406. DOI: https://doi.org/10.1109/niles63360.2024.10753194
[7] S. Kumari and P. Muthulakshmi, "SARIMA model: An efficient machine learning technique for weather forecasting," Procedia Computer Science, vol. 235, pp. 656–670, 2024. DOI: https://doi.org/10.1016/j.procs.2024.04.064
[8] M. L. Hossain, S. M. Shams, and S. M. Ullah, "Time-series and Deep Learning Approaches for Renewable Energy Forecasting in Dhaka: A comparative study of ARIMA, SARIMA, and LSTM models," Discover Sustainability, vol. 6, no. 1, Aug. 2025. DOI: https://doi.org/10.1007/s43621-025-01733-5
[9] W. Fransiska et al., "Penerapan Rantai Markov Dalam Peramalan Cuaca (Studi Kasus: Cuaca Harian di Kota Padang)," Buana Matematika: Jurnal Ilmiah Matematika dan Pendidikan Matematika, vol. 12, no. 2, pp. 117–126, 2022. DOI: https://doi.org/10.36456/buanamatematika.v12i2.6374
[10] Badan Pusat Statistik Kota Batam, "Hasil Sensus Penduduk Batam 2020," Berita Resmi Statistik, pp. 1–11, 2021. [Online].
[11] Y. Jiang, Z. Pan, X. Zhang, S. Garg, A. Schneider, Y. Nevmyvaka, and D. Song, "Empowering Time Series Analysis with Large Language Models: A Survey," arXiv preprint, 2024. DOI: https://doi.org/10.48550/arXiv.2402.03182
[12] R. R. Guerra, A. Vizziello, P. Savazzi, E. Goldoni, and P. Gamba, "Forecasting LoRaWAN RSSI using weather parameters: A comparative study of ARIMA, Artificial Intelligence and hybrid approaches," Computer Networks, vol. 243, p. 110258, Apr. 2024. DOI: https://doi.org/10.1016/j.comnet.2024.110258
[13] Fachrurrazi, S. Husin, Tripoli, and Mubarak, "Neural network for the standard unit price of the building area," Procedia Engineering, vol. 171, pp. 282–293, 2017. DOI: https://doi.org/10.1016/j.proeng.2017.01.336
[14] "Implementasi Long Short-Term Memory pada Prediksi Harga Saham PT Aneka Tambang Tbk," Jurnal Ilmiah Komputasi, vol. 21, no. 1, Mar. 2022. DOI: https://doi.org/10.32409/jikstik.21.1.2815
[15] Q. Liu, W. Shi, and Z. Chen, "Fatigue life prediction for vibration isolation rubber based on parameter-optimized support vector machine model," Fatigue & Fracture of Engineering Materials & Structures, vol. 42, no. 3, pp. 710–718, 2019. DOI: https://doi.org/10.1111/ffe.12945
[16] C. Schröer, F. Kruse, and J. M. Gómez, "A systematic literature review on applying CRISP-DM process model," Procedia Computer Science, vol. 181, pp. 526–534, 2021. DOI: https://doi.org/10.1016/j.procs.2021.01.199
[17] F. Martinez-Plumed et al., "CRISP-DM Twenty Years Later: From Data Mining Processes to Data Science Trajectories," IEEE Transactions on Knowledge and Data Engineering, vol. 33, no. 8, pp. 3048–3061, Aug. 2021. DOI: https://doi.org/10.1109/tkde.2019.2962680
[18] H. V. Minh et al., "Modelling and predicting annual rainfall over the Vietnamese Mekong Delta (VMD) using SARIMA," Discover Geoscience, vol. 2, no. 1, Jun. 2024. DOI: https://doi.org/10.1007/s44288-024-00018-0
[19] J. Xiao, X. Zhu, C. Huang, X. Yang, F. Wen, and M. Zhong, "A new approach for stock price analysis and prediction based on SSA and SVM," International Journal of Information Technology & Decision Making, vol. 18, no. 01, pp. 287–310, Jan. 2019. DOI: https://doi.org/10.1142/s021962201841002x
[20] P. Kabbilawsh, D. S. Kumar, and N. R. Chithra, "Forecasting long-term monthly precipitation using SARIMA models," Journal of Earth System Science, vol. 131, no. 3, Aug. 2022. DOI: https://doi.org/10.1007/s12040-022-01927-9
[21] L. Ilias, E. Sarmas, V. Marinakis, D. Askounis, and H. Doukas, "Unsupervised domain adaptation methods for photovoltaic power forecasting," Applied Soft Computing, vol. 149, p. 110979, Dec. 2023. DOI: https://doi.org/10.1016/j.asoc.2023.110979
[22] J. A. Segovia, J. F. Toaquiza, J. R. Llanos, and D. R. Rivas, "Meteorological variables forecasting system using machine learning and open-source software," Electronics, vol. 12, no. 4, p. 1007, Feb. 2023. DOI: https://doi.org/10.3390/electronics12041007
[23] F. Fallucchi and M. Gozzi, "Puzzle pattern, a systematic approach to multiple behavioral inheritance implementation in object-oriented programming," Applied Sciences, vol. 14, no. 12, p. 5083, Jun. 2024. DOI: https://doi.org/10.3390/app14125083
[24] M. Nilashi, O. Keng Boon, G. Tan, B. Lin, and R. Abumalloh, "Critical data challenges in measuring the performance of Sustainable Development Goals: Solutions and the role of big-Data Analytics," Harvard Data Science Review, vol. 5, no. 3, Jul. 2023. DOI: https://doi.org/10.1162/99608f92.545db2cf
[25] C. Zhang et al., "Large Language Model-Brained GUI Agents: A Survey," arXiv preprint, May 2025. DOI: https://doi.org/10.48550/arXiv.2411.18279
[26] Y. Zhao, W. Zhang, and X. Liu, "Grid search with a weighted error function: Hyper-parameter optimization for financial time series forecasting," Applied Soft Computing, vol. 154, p. 111362, Mar. 2024. DOI: https://doi.org/10.1016/j.asoc.2024.111362
[27] Windary W, Hasugian AH. Data mining of rural digital technology adoption factors using Apriori algorithm. Journal of Information Systems and Technology Research. 2025 Sept 30;4(3):224–33. doi:10.55537/jistr.v4i3.1324
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Yefta Christian, Jupiter Agustio Liu Siaw Ping

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



