1 Department of Mathematics and Statistics, Austin Peay State University, Tennessee, USA.
2 Department of Data Science, Wentworth Institute of Technology, Massachusetts, USA.
3 Department of Cybersecurity Management, University of Illinois at Springfield, Illinois, USA.
4 Department of Public Health, University of Illinois at Springfield, Illinois, USA.
International Journal of Science and Research Archive, 2025, 16(03), 1039-1055
Article DOI: 10.30574/ijsra.2025.16.3.2665
Received on 16 August 2025; revised on 22 September 2025; accepted on 24 September 2025
Enforcement of the Safe Drinking Water Act helps in protecting public health, yet agencies tasked with enforcement are besieged with numerous violations. In this research, a supervised machine learning approach was employed to predict the final regulatory action ('Resolved' or 'Archived') of 7,268 past drinking water violations within the U.S. Environmental Protection Agency's SDWIS database. Logistic Regression, Random Forest, and Gradient Boosting models were tuned using hyperparameter tuning and then trained on the data. The human-tuned Gradient Boosting model performed the best in prediction, with 82.3% accuracy and an F1-Score of 0.87 on the test set. Feature importance analysis revealed that the violation of the specific regulatory rule was the key predictor, followed by the quantitative extent of the violation (exceedance ratio) and the contaminant type. Findings reveal that the fate of drinking water violations is highly predictable based on their original characteristics. The research offers a proof-of-concept for predictive analytics as an environmental governance tool through which regulatory agencies may filter and prioritize high-priority violations to be addressed on a targeted basis.
Drinking Water Quality; Machine Learning; Regulatory Compliance; Predictive Modeling; Gradient Boosting; Safe Drinking Water Act (Sewa); Environmental Governance
Preview Article PDF
Abdul-Waliyyu Bello, Awele Okolie, Jacinta Izundu and Anastesia Izundu. Predicting regulatory violations in public drinking water systems: A data-driven approach. International Journal of Science and Research Archive, 2025, 16(03), 1039-1055. Article DOI: https://doi.org/10.30574/ijsra.2025.16.3.2665.
Copyright © 2025 Author(s) retain the copyright of this article. This article is published under the terms of the Creative Commons Attribution Liscense 4.0







