Prediction of Jakarta's Air Quality Using a Stacking Framework of CLSTM, CatBoost, SVR, and XGBoost

Authors

  • Usman Syapotro Islamic University of Kalimantan Muhammad Arsyad Al-Banjari, Indonesia
  • Silvia Ratna Islamic University of Kalimantan Muhammad Arsyad Al-Banjari, Indonesia
  • M. Muflih Islamic University of Kalimantan Muhammad Arsyad Al-Banjari, Indonesia
  • Haldi Budiman Islamic University of Kalimantan Muhammad Arsyad Al-Banjari, Indonesia
  • M. Rezqy Noor Ridha Islamic University of Kalimantan Muhammad Arsyad Al-Banjari, Indonesia
  • Muhammad Hamdani Islamic University of Kalimantan Muhammad Arsyad Al-Banjari, Indonesia

Keywords:

Prediction of Jakarta Air Quality, PM 10, Stacking, Extreme Gradient Boosting (XGBoost)

Abstract

Air quality prediction, particularly in estimating PM10 particle concentration, is a
significant challenge in major cities like Jakarta, which experience high levels of air pollution.
This study aims to develop an air quality prediction model using an innovative stacking
framework that combines several machine learning algorithms, namely ConvLSTM, CatBoost,
SVR, and XGBoost. The methodology employed in this research is an experimental approach,
where each model is trained and tested individually before being integrated into the stacking
framework. The dataset used was sourced from the Kaggle platform, containing historical air
quality data in Jakarta. Performance evaluation was conducted by measuring the Root Mean
Squared Error (RMSE) for each model. The results of the study showed that the ConvLSTM
model produced an RMSE of 13.5168, CatBoost with an RMSE of 13.4113, and SVR with an
RMSE of 14.2725. To improve prediction accuracy, the researchers employed a stacking
approach of the four models (ConvLSTM, CatBoost, SVR, and XGBoost), which yielded a
much lower RMSE of 0.8093. Thus, this stacking framework has proven to significantly
enhance air quality prediction performance, particularly in predicting PM10 concentrations in
Jakarta.

Published

2024-11-26

How to Cite

Syapotro, U., Ratna, S., Muflih, M., Budiman, H., Noor Ridha, M. R., & Hamdani, M. (2024). Prediction of Jakarta’s Air Quality Using a Stacking Framework of CLSTM, CatBoost, SVR, and XGBoost. Journal of Data Science, 2024. Retrieved from https://iuojs.intimal.edu.my/index.php/jods/article/view/586