Assessment of Air Pollutants of Dhanbad Using Machine Learning Techniques

Authors

  • Uday Kumar Sinha, K. Bandyopadhayay, S. C. Dutta

Keywords:

Machine learning models, air quality, air pollutants, CPCB, data preprocessing, performance criteria

Abstract

A comprehensive assessment of air pollution in Dhanbad for the months of April 2019 through March 2023 was conducted using a support vector machines. random forest, xgboost, and decision tree methods in relation to seven main pollutants into the air (PM10, NO, NO2, NH3, SO2, CO, and O3). A randomly selected 30-day period was used to create line and bar graphs using accuracy & error matrices for 7 pollutants in various models. Our investigation found that the Random Forest model estimates Dhanbad air contaminants with the lowest MAE.Among the aforementioned models, the Random Forest one stands head and shoulders above the others.

Downloads

Download data is not yet available.

References

O. Bouakline et al., “Prediction of daily PM 10 concentration using machine learning,” in 2020 IEEE 2nd International Conference on Electronics, Control, Optimization and Computer Science (ICECOCS), IEEE, 2020, pp. 1–5.

K. Tripathi and P. Pathak, “Deep learning techniques for air pollution,” in 2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS), IEEE, 2021, pp. 1013–1020.

A. Al Yammahi and Z. Aung, “Forecasting the concentration of NO2 using statistical and machine learning methods: A case study in the UAE,” Heliyon, vol. 9, no. 2, 2023.

S. Peng, J. Zhu, Z. Liu, B. Hu, M. Wang, and S. Pu, “Prediction of Ammonia Concentration in a Pig House Based on Machine Learning Models and Environmental Parameters,” Animals, vol. 13, no. 1, p. 165, 2022.

P. Bhalgat, S. Bhoite, and S. Pitare, “Air quality prediction using machine learning algorithms,” Int. J. Comput. Appl. Technol. Res., vol. 8, no. 9, pp. 367–370, 2019.

P. Kadam and S. Vijayumar, “Prediction model: CO 2 emission using machine learning,” in 2018 3rd International Conference for Convergence in Technology (I2CT), IEEE, 2018, pp. 1–3.

O. A. Ghoneim and B. R. Manjunatha, “Forecasting of ozone concentration in smart city using deep learning,” in 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), IEEE, 2017, pp. 1320–1326.

U. K. Sinha, K. Bandyopadhyay and S. C. Dutta, ‘Air Quality of Dhanbad During Pre-Lockdown, Lockdown And Post-Lockdown Periods’ Journal of Emerging Technologies and Innovative Research (ISSN: 2349-5162), Volume 8, Issue 12, 2021.

S. B. Sonu and A. Suyampulingam, “Linear regression based air quality data analysis and prediction using python,” in 2021 IEEE Madras Section Conference (MASCON), IEEE, 2021, pp. 1–7.

[10] C. Cortes and V. Vapnik, “Support-vector networks,” Mach. Learn., vol. 20, pp. 273–297, 1995.

U. K. Sinha, K. Bandyopadhayay, and S. C. Dutta, “Prediction of PM10 AND CO Using Support Vector Regression To Analyses The Air Quality of Dhanbad,” Emerg. TRENDS MULTI Discip. Res. Innov., p. 1, 2022.

S. Singh, “Prediction of Air Pollution Using Random Forest,” Ann. Romanian Soc. Cell Biol., pp. 19314–19322, 2021.

B. Pan, “Application of XGBoost algorithm in hourly PM2. 5 concentration prediction,” in IOP conference series: earth and environmental science, IOP publishing, 2018, p. 012127.

M. Hussain, S. Afrin, A. Irin, and S. K. Park, “Applying Decision Tree Algorithm for Air Quality Prediction in Bangladesh,” in 2021 5th International Conference on Electrical Information and Communication Technology (EICT), IEEE, 2021, pp. 1–6.

S. B. Kotsiantis, D. Kanellopoulos, and P. E. Pintelas, “Data preprocessing for supervised leaning,” Int. J. Comput. Sci., vol. 1, no. 2, pp. 111–117, 2006.

D. Parbat and M. Chakraborty, “A python based support vector regression model for prediction of COVID19 cases in India,” Chaos Solitons Fractals, vol. 138, p. 109942, 2020.

J. D. Hunter, “Matplotlib: A 2D graphics environment,” Comput. Sci. Eng., vol. 9, no. 03, pp. 90–95, 2007.

M. Waskom, “seaborn: statistical data visualization,” J. Open Source Softw., vol. 6, no. 60, p. 3021, Apr. 2021, doi: 10.21105/joss.03021.

F. Pedregosa et al., “Scikit-learn: Machine Learning in Python,” Mach. Learn. PYTHON.

D. Chicco, M. J. Warrens, and G. Jurman, “The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation,” PeerJ Comput. Sci., vol. 7, p. e623, 2021.

Downloads

Published

12.06.2024

How to Cite

Uday Kumar Sinha. (2024). Assessment of Air Pollutants of Dhanbad Using Machine Learning Techniques. International Journal of Intelligent Systems and Applications in Engineering, 12(4), 2649 –. Retrieved from https://www.ijisae.org/index.php/IJISAE/article/view/6698

Issue

Section

Research Article