Explainable Ensemble Machine Learning Framework for Multi-Class Ecotoxicity and Drinkability Prediction of Groundwater Using Hydro-Chemical Indicators

Explainable Ensemble Machine Learning Framework for Multi-Class Ecotoxicity and Drinkability Prediction of Groundwater Using Hydro-Chemical Indicators

Research Square – News/Updates
Research Square – News/UpdatesApr 29, 2026

Why It Matters

HydroNet‑X delivers near‑perfect predictive performance while offering transparent insights, enabling water utilities and regulators to make faster, data‑driven decisions that protect public health and ecosystems.

Key Takeaways

  • HydroNet‑X reached 99.2% classification accuracy on 1,345 samples.
  • SHAP identified EC, TDS, and TH as top predictive indicators.
  • Ensemble combines XGBoost, AdaBoost, and polynomial SVM for robustness.
  • Model outperformed traditional XGBoost, SVM, and logistic regression.
  • Framework supports real‑time decision making for water safety management.

Pulse Analysis

Groundwater quality monitoring has traditionally relied on labor‑intensive laboratory tests and basic statistical models, which can delay critical interventions. As urbanization and industrial activity accelerate, the need for rapid, accurate assessments has become paramount for both public health agencies and environmental regulators. Machine‑learning approaches promise speed, but many act as black boxes, limiting trust among stakeholders who require clear justification for water‑safety decisions.

HydroNet‑X addresses this gap by fusing three powerful algorithms—Extreme Gradient Boosting, Adaptive Boosting, and a polynomial Support Vector Machine—into a single ensemble that leverages the strengths of each method. Trained on 13 WHO‑compliant hydro‑chemical indicators, the model achieved a 99.2% overall accuracy and an F1‑score of 0.994 across four drinkability categories. Crucially, SHAP (Shapley Additive Explanations) analysis demystifies the model, revealing Electrical Conductivity, Total Dissolved Solids, and Total Hardness as the primary drivers of ecotoxicity and potability predictions. This transparency not only validates the scientific relevance of the selected parameters but also equips decision‑makers with actionable insights.

The implications extend beyond academic performance metrics. Water utilities can integrate HydroNet‑X into real‑time monitoring dashboards, reducing reliance on costly lab analyses and accelerating response to contamination events. Policymakers gain a robust decision‑support tool that aligns with sustainability goals, enabling proactive resource management and compliance with stringent water‑quality standards. As the framework scales to additional regions and incorporates more diverse datasets, it could set a new benchmark for explainable AI in environmental risk assessment, fostering broader adoption across the water sector.

Explainable Ensemble Machine Learning Framework for Multi-Class Ecotoxicity and Drinkability Prediction of Groundwater Using Hydro-Chemical Indicators

Comments

Want to join the conversation?

Loading comments...