Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/34567
Appears in Collections:Biological and Environmental Sciences Journal Articles
Peer Review Status: Refereed
Title: Decoding river pollution trends and their landscape determinants in an ecologically fragile karst basin using a machine learning model
Author(s): Xu, Guoyu
Fan, Hongxiang
Oliver, David M
Dai, Yibin
Li, Hengpeng
Shi, Yuejie
Long, Haifei
Xiong, Kangning
Zhao, Zhongming
Contact Email: david.oliver@stir.ac.uk
Keywords: Ecologically fragile karst basin
Water quality assessment
XGBoost regression
Shapley additive explanations
Determinant analysis
Issue Date: Nov-2022
Date Deposited: 22-Aug-2022
Citation: Xu G, Fan H, Oliver DM, Dai Y, Li H, Shi Y, Long H, Xiong K & Zhao Z (2022) Decoding river pollution trends and their landscape determinants in an ecologically fragile karst basin using a machine learning model. Environmental Research, 214 (Part 4), Art. No.: 113843. https://doi.org/10.1016/j.envres.2022.113843
Abstract: Karst watersheds accommodate high landscape complexity and are influenced by both human-induced and natural activity, which affects the formation and process of runoff, sediment connectivity and contaminant transport and alters natural hydrological and nutrient cycling. However, physical monitoring stations are costly and labor-intensive, which has confined the assessment of water quality impairments on spatial scale. The geographical characteristics of catchments are potential influencing factors of water quality, often overlooked in previous studies of highly heterogeneous karst landscape. To solve this problem, we developed a machining learning method and applied Extreme Gradient Boosting (XGBoost) to predict the spatial distribution of water quality in the world's most ecologically fragile karst watershed. We used the Shapley Addition interpretation (SHAP) to explain the potential determinants. Before this process, we first used the water quality damage index (WQI-DET) to evaluate the water quality impairment status and determined that CODMn, TN and TP were causing river water quality impairments in the WRB. Second, we selected 46 watershed features based on the three key processes (sources-mobilization-transport) which affect the temporal and spatial variation of river pollutants to predict water quality in unmonitored reaches and decipher the potential determinants of river impairments. The predicting range of CODMn spanned from 1.39 mg/L to 17.40 mg/L. The predictions of TP and TN ranged from 0.02 to 1.31 mg/L and 0.25–5.72 mg/L, respectively. In general, the XGBoost model performs well in predicting the concentration of water quality in the WRB. SHAP explained that pollutant levels may be driven by three factors: anthropogenic sources (agricultural pollution inputs), fragile soils (low organic carbon content and high soil permeability to water flow), and pollutant transport mechanisms (TWI, carbonate rocks). Our study provides key data to support decision-making for water quality restoration projects in the WRB and information to help bridge the science:policy gap.
DOI Link: 10.1016/j.envres.2022.113843
Rights: This item has been embargoed for a period. During the embargo please use the Request a Copy feature at the foot of the Repository record to request a copy directly from the author. You can only request a copy if you wish to use this work for your own research or private study. Accepted refereed manuscript of: Xu G, Fan H, Oliver DM, Dai Y, Li H, Shi Y, Long H, Xiong K & Zhao Z (2022) Decoding river pollution trends and their landscape determinants in an ecologically fragile karst basin using a machine learning model. Environmental Research, 214, Art. No.: 113843. https://doi.org/10.1016/j.envres.2022.113843 © 2022, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/
Licence URL(s): http://creativecommons.org/licenses/by-nc-nd/4.0/

Files in This Item:
File Description SizeFormat 
ER-22-971_R1.pdfFulltext - Accepted Version2.57 MBAdobe PDFView/Open



This item is protected by original copyright



A file in this item is licensed under a Creative Commons License Creative Commons

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.