Assessment of Air Pollutants of Dhanbad Using Machine Learning Techniques
Keywords:
Machine learning models, air quality, air pollutants, CPCB, data preprocessing, performance criteria
Abstract
A comprehensive assessment of air pollution in Dhanbad from April 2019 through March 2023 was conducted using support vector machine, random forest, XGBoost, and decision tree methods for seven main air pollutants (PM10, NO, NO2, NH3, SO2, CO, and O3). A randomly selected 30-day period was used to create line and bar graphs of accuracy and error metrics for the seven pollutants across the models. Our investigation found that the Random Forest model estimates Dhanbad air contaminants with the lowest MAE, making it the best performer among the models compared.
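The abstract ranks the models by mean absolute error (MAE). As a minimal sketch of how such a comparison could be computed, the Python below evaluates MAE, RMSE, and R-squared for several sets of predictions and picks the one with the lowest MAE. The observed and predicted values are invented purely to exercise the code; they are not data or results from the study, and the function and variable names are hypothetical.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error: average of |observed - predicted|."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(
        sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    )


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Made-up pollutant concentrations (assumption: illustrative only,
# NOT measurements or model outputs from the paper).
observed = [100.0, 120.0, 90.0, 110.0]
predictions = {
    "svr": [110.0, 110.0, 100.0, 100.0],
    "random_forest": [102.0, 118.0, 92.0, 108.0],
    "xgboost": [104.0, 116.0, 94.0, 106.0],
    "decision_tree": [95.0, 125.0, 85.0, 115.0],
}

# Score every model with the three criteria.
scores = {
    name: {
        "mae": mae(observed, pred),
        "rmse": rmse(observed, pred),
        "r2": r2(observed, pred),
    }
    for name, pred in predictions.items()
}

# Select the model with the lowest MAE.
best_model = min(scores, key=lambda name: scores[name]["mae"])

for name, s in scores.items():
    print(f"{name}: MAE={s['mae']:.2f} RMSE={s['rmse']:.2f} R2={s['r2']:.3f}")
print("lowest MAE:", best_model)
```

In practice the same scoring would be repeated per pollutant, since a model that has the lowest MAE for one pollutant need not have it for another.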
License
Copyright (c) 2024 Uday Kumar Sinha, K. Bandyopadhayay, S. C. Dutta

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.