1
|
Masache A, Mdlongwa P, Maposa D, Sigauke C. Short-term forecasting of solar irradiance using decision tree-based models and non-parametric quantile regression. PLoS One 2024; 19:e0312814. [PMID: 39661613 PMCID: PMC11634012 DOI: 10.1371/journal.pone.0312814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Accepted: 10/15/2024] [Indexed: 12/13/2024] Open
Abstract
The renewable energy industry requires accurate forecasts of intermittent solar irradiance (SI) to effectively manage solar power generation and supply. Introducing the random forests (RFs) model and its hybridisation with quantile regression modelling, the quantile regression random forest (QRRF), can help improve the forecasts' accuracy. This paper assesses the RFs and QRRF models against the quantile generalised additive model (QGAM) by evaluating their forecast performances. A simulation study of multivariate data-generating processes was carried out to compare the forecasting accuracy of the models when predicting global horizontal solar irradiance. The QRRF and QGAM are completely new forecasting frameworks for SI studies, to the best of our knowledge. Simulation results suggested that the introduced QRRF compared well with the QGAM when predicting the forecast distribution. However, the evaluations of the pinball loss scores and mean absolute scaled errors demonstrated a clear superiority of the QGAM. Similar results were obtained in an application to real-life data. Therefore, we recommend that the QGAM be preferred ahead of decision tree-based models when predicting solar irradiance. However, the QRRF model can be used alternatively to predict the forecast distribution. Both the QGAM and QRRF modelling frameworks went beyond representing forecast uncertainty of SI as probability distributions around a prediction interval to give complete information through the estimation of quantiles. Most SI studies conducted are residual and/or non-parametric modelling that are limited to represent information about the conditional mean distribution. Extensions of the QRRF and QGAM frameworks can be made to model other renewable sources of energy that have meteorological characteristics similar to solar irradiance.
Collapse
Affiliation(s)
- Amon Masache
- Department of Statistics and Operations Research, National University of Science and Technology, Bulawayo, Zimbabwe
| | - Precious Mdlongwa
- Department of Statistics and Operations Research, National University of Science and Technology, Bulawayo, Zimbabwe
| | - Daniel Maposa
- Department of Statistics and Operations Research, University of Limpopo, Sovenga, South Africa
| | - Caston Sigauke
- Department of Mathematical and Computational Sciences, University of Venda, Thohoyandou, South Africa
| |
Collapse
|
2
|
Rota F, Scherrer D, Bergamini A, Price B, Walthert L, Baltensweiler A. Unravelling the impact of soil data quality on species distribution models of temperate forest woody plants. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 944:173719. [PMID: 38839003 DOI: 10.1016/j.scitotenv.2024.173719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 04/12/2024] [Accepted: 05/31/2024] [Indexed: 06/07/2024]
Abstract
Soil properties influence plant physiology and growth, playing a fundamental role in shaping species niches in temperate forest ecosystems. Here, we investigated the impact of soil data quality on the performance of species distribution models (SDMs) of 41 woody plant species in Swiss forests. We compared models based on measured soil properties with those based on digitally mapped soil properties on regional (Swiss Forest Soil Maps) and global scales (SoilGrids). We first calibrated topo-climatic SDMs with measured soil data and plant species presences and absences from mature temperate forest stand plots. We developed further models using the same soil predictors, but with values extracted from digital soil maps at the nearest neighbouring plots of the Swiss National Forestry Inventory. The predictive power of SDMs without soil information compared to those with soil information, as well as measured soil information vs digitally mapped, was evaluated with metrics of model performance and variable contribution. On average, models with measured and digitally mapped soil properties performed significantly better than those without soil information. SDMs based on measured and Swiss Forest Soil Maps showed higher performance, especially for species with an 'extreme' niche position (e.g., preference for high or low pH), compared to those using SoilGrids. Nevertheless, if no regional soil maps are available, SoilGrids should be tested for their potential to improve SDMs. Moreover, among the tested soil predictors, pH, and clay content of the topsoil layers most improved the predictive power of SDMs for forest woody plants. In conclusion, we demonstrate the value of regional soil maps for predicting the distribution of woody species across strong environmental gradients in temperate forests. The improved accuracy of SDMs and insights into drivers of distribution may support forest managers in strategies supporting e.g. biodiversity conservation, or climate adaptation planning.
Collapse
Affiliation(s)
- Francesco Rota
- Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Zürcherstrasse 111, CH-8903 Birmensdorf, Switzerland.
| | - Daniel Scherrer
- Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Zürcherstrasse 111, CH-8903 Birmensdorf, Switzerland
| | - Ariel Bergamini
- Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Zürcherstrasse 111, CH-8903 Birmensdorf, Switzerland
| | - Bronwyn Price
- Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Zürcherstrasse 111, CH-8903 Birmensdorf, Switzerland
| | - Lorenz Walthert
- Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Zürcherstrasse 111, CH-8903 Birmensdorf, Switzerland
| | - Andri Baltensweiler
- Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Zürcherstrasse 111, CH-8903 Birmensdorf, Switzerland
| |
Collapse
|
3
|
Wang YY, Yang WX, Du QJ, Liu ZH, Lu MH, You CG. Construction and evaluation of a liver cancer risk prediction model based on machine learning. World J Gastrointest Oncol 2024; 16:3839-3850. [PMID: 39350987 PMCID: PMC11438789 DOI: 10.4251/wjgo.v16.i9.3839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Revised: 07/31/2024] [Accepted: 08/07/2024] [Indexed: 09/09/2024] Open
Abstract
BACKGROUND Liver cancer is one of the most prevalent malignant tumors worldwide, and its early detection and treatment are crucial for enhancing patient survival rates and quality of life. However, the early symptoms of liver cancer are often not obvious, resulting in a late-stage diagnosis in many patients, which significantly reduces the effectiveness of treatment. Developing a highly targeted, widely applicable, and practical risk prediction model for liver cancer is crucial for enhancing the early diagnosis and long-term survival rates among affected individuals. AIM To develop a liver cancer risk prediction model by employing machine learning techniques, and subsequently assess its performance. METHODS In this study, a total of 550 patients were enrolled, with 190 hepatocellular carcinoma (HCC) and 195 cirrhosis patients serving as the training cohort, and 83 HCC and 82 cirrhosis patients forming the validation cohort. Logistic regression (LR), support vector machine (SVM), random forest (RF), and least absolute shrinkage and selection operator (LASSO) regression models were developed in the training cohort. Model performance was assessed in the validation cohort. Additionally, this study conducted a comparative evaluation of the diagnostic efficacy between the ASAP model and the model developed in this study using receiver operating characteristic curve, calibration curve, and decision curve analysis (DCA) to determine the optimal predictive model for assessing liver cancer risk. RESULTS Six variables including age, white blood cell, red blood cell, platelet counts, alpha-fetoprotein and protein induced by vitamin K absence or antagonist II levels were used to develop LR, SVM, RF, and LASSO regression models. The RF model exhibited superior discrimination, and the area under curve of the training and validation sets was 0.969 and 0.858, respectively. These values significantly surpassed those of the LR (0.850 and 0.827), SVM (0.860 and 0.803), LASSO regression (0.845 and 0.831), and ASAP (0.866 and 0.813) models. Furthermore, calibration and DCA indicated that the RF model exhibited robust calibration and clinical validity. CONCLUSION The RF model demonstrated excellent prediction capabilities for HCC and can facilitate early diagnosis of HCC in clinical practice.
Collapse
Affiliation(s)
- Ying-Ying Wang
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| | - Wan-Xia Yang
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| | - Qia-Jun Du
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| | - Zhen-Hua Liu
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| | - Ming-Hua Lu
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| | - Chong-Ge You
- Laboratory Medicine Center, The Second Hospital & Clinical Medical School, Lanzhou University, Lanzhou 730030, Gansu Province, China
| |
Collapse
|
4
|
Hameed MM, Mohd Razali SF, Wan Mohtar WHM, Yaseen ZM. Examining optimized machine learning models for accurate multi-month drought forecasting: A representative case study in the USA. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:52060-52085. [PMID: 39134798 DOI: 10.1007/s11356-024-34500-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Accepted: 07/23/2024] [Indexed: 09/06/2024]
Abstract
The Colorado River has experienced a significant streamflow reduction in recent decades due to climate change, resulting in pronounced hydrological droughts that pose challenges to the environment and human activities. However, current models struggle to accurately capture complex drought patterns, and their accuracy decreases as the lead time increases. Thus, determining the reliability of drought forecasting for specific months ahead presents a challenging task. This study introduces a robust approach that utilizes the Beluga Whale Optimization (BWO) algorithm to train and optimize the parameters of the Regularized Extreme Learning Machine (RELM) and Random Forest (RF) models. The applied models are validated against a KNN benchmark model for forecasting drought from one- to six-month ahead across four hydrological stations distributed over the Colorado River. The achieved results demonstrate that RELM-BWO outperforms RF-BWO and KNN models, achieving the lowest root-mean square error (0.2795), uncertainty (U95 = 0.1077), mean absolute error (0.2104), and highest correlation coefficient (0.9135). Also, the current study uses Global Multi-Criteria Decision Analysis (GMCDA) as an evaluation metric to assess the reliability of the forecasting. The GMCDA results indicate that RELM-BWO provides reliable forecasts up to four months ahead. Overall, the research methodology is valuable for drought assessment and forecasting, enabling advanced early warning systems and effective drought countermeasures.
Collapse
Affiliation(s)
- Mohammed Majeed Hameed
- Department of Civil Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, 43600, Bangi, Selangor, Malaysia.
- Department of Civil Engineering, Al-Maarif University, 31001, Ramadi City, Iraq.
| | - Siti Fatin Mohd Razali
- Department of Civil Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, 43600, Bangi, Selangor, Malaysia
- Smart and Sustainable Township Research Centre (SUTRA), Universiti Kebangsaan Malaysia, 43600, Bangi, Selangor, Malaysia
| | - Wan Hanna Melini Wan Mohtar
- Department of Civil Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, 43600, Bangi, Selangor, Malaysia
- Smart and Sustainable Township Research Centre (SUTRA), Universiti Kebangsaan Malaysia, 43600, Bangi, Selangor, Malaysia
| | - Zaher Mundher Yaseen
- Civil and Environmental Engineering Department, King Fahd University of Petroleum & Minerals, Dhahran, 31261, Saudi Arabia
| |
Collapse
|
5
|
Xu Y, Du H, Mao F, Li X, Zhou G, Huang Z, Guo K, Zhang M, Luo X, Chen C, Zhao Y. Effects of chlorophyll fluorescence on environment and gross primary productivity of moso bamboo during the leaf-expansion stage. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 360:121185. [PMID: 38788407 DOI: 10.1016/j.jenvman.2024.121185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Revised: 05/06/2024] [Accepted: 05/13/2024] [Indexed: 05/26/2024]
Abstract
Chlorophyll fluorescence is the long-wave light released by the residual energy absorbed by vegetation after photosynthesis and dissipation, which can directly and non-destructively reflect the photosynthetic state of plants from the perspective of the mechanism of photosynthetic process. Moso bamboo has a substantial carbon sequestration ability, and leaf-expansion stage is an important phenological period for carbon sequestration. Gross primary production (GPP) is a key parameter reflecting vegetation carbon sequestration process. However, the ability of chlorophyll fluorescence in moso bamboo to explain GPP changes is unclear. The research area of this study is located in the bamboo forest near the flux station of Anji County, Zhejiang Province, where an observation tower is built to monitor the carbon flux and meteorological change of bamboo forest. The chlorophyll fluorescence physiological parameters (Fp) and fluorescence yield (Fy) indices were measured and calculated for the leaves of newborn moso bamboo (I Du bamboo) and the old leaves of 4- to 5-year-old moso bamboo (Ⅲ Du bamboo) during the leaf-expansion stage. The chlorophyll fluorescence in response to the environment and its effect on carbon flux were analyzed. The results showed that: Fv/Fm, Y(II) and α of Ⅰ Du bamboo gradually increased, while Ⅲ Du bamboo gradually decreased, and FYint and FY687/FY738 of Ⅰ Du bamboo were higher than those of Ⅲ Du bamboo; moso bamboo was sensitive to changes in air temperature(Ta), relative humidity(RH), water vapor pressure(E), soil temperature(ST) and soil water content (SWC), the Fy indices of the upper, middle and lower layers were significantly correlated with Ta, E and ST; single or multiple vegetation indices were able to estimate the fluorescence yield indices well (all with R2 greater than 0.77); chlorophyll fluorescence (Fp and Fy indices) of Ⅰ Du bamboo and Ⅲ Du bamboo could explain 74.4% and 72.7% of the GPP variation, respectively; chlorophyll fluorescence and normalized differential vegetation index of the canopy (NDVIc) could estimate GPP well using random forest (Ⅰ Du bamboo: r = 0.929, RMSE = 0.069 g C·m-2; Ⅲ Du bamboo: r = 0.899, RMSE = 0.134 g C·m-2). The results of this study show that chlorophyll fluorescence can provide a basis for judging the response of moso bamboo to environmental changes and can well explain GPP. This study has important scientific significance for evaluating the potential mechanisms of growth, stress feedback and photosynthetic carbon sequestration of bamboo.
Collapse
Affiliation(s)
- Yanxin Xu
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China
| | - Huaqiang Du
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China.
| | - Fangjie Mao
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China
| | - Xuejian Li
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China
| | - Guomo Zhou
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China
| | - Zihao Huang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China
| | - Keruo Guo
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China
| | - Meng Zhang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China
| | - Xin Luo
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China
| | - Chao Chen
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China
| | - Yinyin Zhao
- State Key Laboratory of Subtropical Silviculture, Zhejiang A & F University, Hangzhou, 311300, China; Key Laboratory of Carbon Cycling in Forest Ecosystems and Carbon Sequestration of Zhejiang Province, Zhejiang A & F University, Hangzhou, 311300, China; School of Environmental and Resources Science, Zhejiang A & F University, Hangzhou, 311300, China
| |
Collapse
|
6
|
Tatar M, Farokhi S, Araz OM, Deshpande A, Wilson FA. Association of social vulnerability and influenza vaccination rates for Annual Medicare Enrollees at the county-level in the United States. Prev Med 2023; 177:107782. [PMID: 37980957 DOI: 10.1016/j.ypmed.2023.107782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 11/08/2023] [Accepted: 11/11/2023] [Indexed: 11/21/2023]
Abstract
INTRODUCTION Influenza is a preventable acute respiratory illness with a high potential to cause serious complications and is associated with high mortality and morbidity in the US. We aimed to determine the specific community-level vulnerabilities for different race/ethnic communities that are most predictive of influenza vaccination rates. METHODS We conducted a machine learning analysis (XGBoost) to identify community-level social vulnerability features that are predictive of influenza vaccination rates among Medicare enrollees across counties in the US and by race/ethnicity. RESULTS Population density per square mile in a county is the most important feature in predicting influenza vaccination in a county, followed by unemployment rates and the percentage of mobile homes. The gain relative importance of these features are 11.6%, 9.2%, and 9%, respectively. Among whites, population density (17% gain relative importance) was followed by the percentage of mobile homes (9%) and per capita income (8.7%). For Black/African Americans, the most important features were population density (12.8%), percentage of minorities in the county (8.0%), per capita income (6.9%), and percent of over-occupied housing units (6.8%). Finally, for Hispanics, the top features were per capita income (8.4%), percentage of mobile homes (8.0%), percentage of non-institutionalized persons with a disability (7.9%), and population density (7.6%). CONCLUSIONS Our study may have implications for the success of large vaccination programs in counties with high social vulnerabilities. Further, our findings suggest that policies and interventions seeking to increase rates of vaccination in race/ethnic minority communities may need to be tailored to address their specific socioeconomic vulnerabilities.
Collapse
Affiliation(s)
- Moosa Tatar
- Department of Pharmaceutical Health Outcomes and Policy, University of Houston, Houston, TX, United States of America.
| | - Soheila Farokhi
- Department of Computer Science, Utah State University, Logan, UT, United States of America.
| | - Ozgur M Araz
- College of Business, University of Nebraska- Lincoln, Lincoln, NE.
| | - Abhishek Deshpande
- Center for Value-Based Care Research, Cleveland Clinic, Cleveland, OH, United States of America.
| | - Fernando A Wilson
- Matheson Center for Health Care Studies, University of Utah, Salt Lake City, UT, United States of America; Department of Population Health Sciences, University of Utah, Salt Lake City, UT, United States of America; Department of Economics, University of Utah, Salt Lake City, UT, United States of America.
| |
Collapse
|
7
|
Tatar M, Faraji MR, Keyes K, Wilson FA, Jalali MS. Social vulnerability predictors of drug poisoning mortality: A machine learning analysis in the United States. Am J Addict 2023; 32:539-546. [PMID: 37344967 DOI: 10.1111/ajad.13445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 04/16/2023] [Accepted: 06/05/2023] [Indexed: 06/23/2023] Open
Abstract
BACKGROUND AND OBJECTIVES Drug poisoning is a leading cause of unintentional deaths in the United States. Despite the growing literature, there are a few recent analyses of a wide range of community-level social vulnerability features contributing to drug poisoning mortality. Current studies on this topic face three limitations: often studying a limited subset of vulnerability features, focusing on small sample sizes, or solely including local data. To address this gap, we conducted a national-level analysis to study the impacts of several social vulnerability features in predicting drug mortality rates in the United States. METHODS We used machine learning to investigate the role of 16 social vulnerability features in predicting drug mortality rates for US counties in 2014, 2016, and 2018-the most recent available data. We estimated each vulnerability feature's gain relative contribution in predicting drug poisoning mortality. RESULTS Among all social vulnerability features, the percentage of noninstitutionalized persons with a disability is the most influential predictor, with a gain relative contribution of 18.6%, followed by population density and the percentage of minority residents (13.3% and 13%, respectively). Percentages of households with no available vehicles, mobile homes, and persons without a high school diploma are the following features with gain relative contributions of 6.3%, 5.8%, and 5.1%, respectively. CONCLUSION AND SCIENTIFIC SIGNIFICANCE We identified social vulnerability features that are most predictive of drug poisoning mortality. Public health interventions and policies targeting vulnerable communities may increase the resilience of these communities and mitigate the overdose death and drug misuse crisis.
Collapse
Affiliation(s)
- Moosa Tatar
- Center for Value-Based Care Research, Cleveland Clinic, Cleveland, Ohio, USA
| | - Mohammad R Faraji
- Department of Computer Science and Information Technology, Institute for Advanced Studies in Basic Sciences, Zanjan, Iran
| | - Katherine Keyes
- Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, New York, USA
| | - Fernando A Wilson
- Matheson Center for Health Care Studies, University of Utah, Salt Lake City, Utah, USA
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah, USA
| | - Mohammad S Jalali
- MGH Institute for Technology Assessment, Harvard Medical School, Boston, Massachusetts, USA
- Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| |
Collapse
|
8
|
Sheikholeslami R, Hall JW. Global patterns and key drivers of stream nitrogen concentration: A machine learning approach. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 868:161623. [PMID: 36657680 PMCID: PMC10933795 DOI: 10.1016/j.scitotenv.2023.161623] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 12/22/2022] [Accepted: 01/11/2023] [Indexed: 06/17/2023]
Abstract
Anthropogenic loading of nitrogen to river systems can pose serious health hazards and create critical environmental threats. Quantification of the magnitude and impact of freshwater nitrogen requires identifying key controls of nitrogen dynamics and analyzing both the past and present patterns of nitrogen flows. To tackle this challenge, we adopted a machine learning (ML) approach and built an ML-driven representation that captures spatiotemporal variability in nitrogen concentrations at global scale. Our model uses random forests to regress a large sample of monthly measured stream nitrogen concentrations onto a set of 17 predictors with a spatial resolution of 0.5-degree over the 1990-2013, including observations within the pixel and upstream drivers. The model was validated with data from rivers outside the training dataset and was used to predict nitrogen concentrations in 520 major river basins of the world, including many with scarce or no observations. We predicted that the regions with highest median nitrogen concentrations in their rivers (in 2013) were: United States (Mississippi), Pakistan, Bangladesh, India (Indus, Ganges), China (Yellow, Yangtze, Yongding, Huai), and most of Europe (Rhine, Danube, Vistula, Thames, Trent, Severn). Other major hotspots were the river basins of the Sebou (Morroco), Nakdong (South Korea), Kitakami (Japan), and Egypt's Nile Delta. Our analysis showed that the rate of increase in nitrogen concentration between 1990s and 2000s was greatest in rivers located in eastern China, eastern and central parts of Canada, Baltic states, Pakistan, mainland southeast Asia, and south-eastern Australia. Using a new grouped variable importance measure, we also found that temporality (month of the year and cumulative month count) is the most influential predictor, followed by factors representing hydroclimatic conditions, diffuse nutrient emissions from agriculture, and topographic features. Our model can be further applied to assess strategies designed to reduce nitrogen pollution in freshwater bodies at large spatial scales.
Collapse
Affiliation(s)
- Razi Sheikholeslami
- School of Geography and the Environment, University of Oxford, Oxford, UK; Environmental Change Institute, University of Oxford, Oxford, UK; Department of Civil Engineering, Sharif University of Technology, Tehran, Iran.
| | - Jim W Hall
- School of Geography and the Environment, University of Oxford, Oxford, UK; Environmental Change Institute, University of Oxford, Oxford, UK
| |
Collapse
|
9
|
Huynh TMT, Ni CF, Su YS, Nguyen VCN, Lee IH, Lin CP, Nguyen HH. Predicting Heavy Metal Concentrations in Shallow Aquifer Systems Based on Low-Cost Physiochemical Parameters Using Machine Learning Techniques. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph191912180. [PMID: 36231480 PMCID: PMC9566676 DOI: 10.3390/ijerph191912180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Revised: 09/20/2022] [Accepted: 09/20/2022] [Indexed: 05/07/2023]
Abstract
Monitoring ex-situ water parameters, namely heavy metals, needs time and laboratory work for water sampling and analytical processes, which can retard the response to ongoing pollution events. Previous studies have successfully applied fast modeling techniques such as artificial intelligence algorithms to predict heavy metals. However, neither low-cost feature predictability nor explainability assessments have been considered in the modeling process. This study proposes a reliable and explainable framework to find an effective model and feature set to predict heavy metals in groundwater. The integrated assessment framework has four steps: model selection uncertainty, feature selection uncertainty, predictive uncertainty, and model interpretability. The results show that Random Forest is the most suitable model, and quick-measure parameters can be used as predictors for arsenic (As), iron (Fe), and manganese (Mn). Although the model performance is auspicious, it likely produces significant uncertainties. The findings also demonstrate that arsenic is related to nutrients and spatial distribution, while Fe and Mn are affected by spatial distribution and salinity. Some limitations and suggestions are also discussed to improve the prediction accuracy and interpretability.
Collapse
Affiliation(s)
- Thi-Minh-Trang Huynh
- Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
| | - Chuen-Fa Ni
- Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
- Center for Environmental Studies, National Central University, Taoyuan 32001, Taiwan
- Correspondence: (C.-F.N.); (Y.-S.S.)
| | - Yu-Sheng Su
- Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung 202301, Taiwan
- Correspondence: (C.-F.N.); (Y.-S.S.)
| | - Vo-Chau-Ngan Nguyen
- College of Environment and Natural Resources, Can Tho University, Can Tho 94000, Vietnam
| | - I-Hsien Lee
- Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
- Center for Environmental Studies, National Central University, Taoyuan 32001, Taiwan
| | - Chi-Ping Lin
- Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
- Center for Environmental Studies, National Central University, Taoyuan 32001, Taiwan
| | - Hoang-Hiep Nguyen
- Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
| |
Collapse
|
10
|
Novel MLR-RF-Based Geospatial Techniques: A Comparison with OK. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2022. [DOI: 10.3390/ijgi11070371] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Geostatistical estimation methods rely on experimental variograms that are mostly erratic, leading to subjective model fitting and assuming normal distribution during conditional simulations. In contrast, Machine Learning Algorithms (MLA) are (1) free of such limitations, (2) can incorporate information from multiple sources and therefore emerge with increasing interest in real-time resource estimation and automation. However, MLAs need to be explored for robust learning of phenomena, better accuracy, and computational efficiency. This paper compares MLAs, i.e., Multiple Linear Regression (MLR) and Random Forest (RF), with Ordinary Kriging (OK). The techniques were applied to the publicly available Walkerlake dataset, while the exhaustive Walker Lake dataset was validated. The results of MLR were significant (p < 10 × 10−5), with correlation coefficients of 0.81 (R-square = 0.65) compared to 0.79 (R-square = 0.62) from the RF and OK methods. Additionally, MLR was automated (free from an intermediary step of variogram modelling as in OK), produced unbiased estimates, identified key samples representing different zones, and had higher computational efficiency.
Collapse
|
11
|
Corden E, Siddiqui SH, Sharma Y, Raghib MF, Adorno W, Zulqarnain F, Ehsan L, Shrivastava A, Ahmed S, Umrani F, Rahman N, Ali R, Iqbal NT, Moore SR, Ali SA, Syed S. Distance from Healthcare Facilities Is Associated with Increased Morbidity of Acute Infection in Pediatric Patients in Matiari, Pakistan. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:11691. [PMID: 34770204 PMCID: PMC8583418 DOI: 10.3390/ijerph182111691] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 10/29/2021] [Accepted: 11/04/2021] [Indexed: 12/21/2022]
Abstract
The relationship between environmental factors and child health is not well understood in rural Pakistan. This study characterized the environmental factors related to the morbidity of acute respiratory infections (ARIs), diarrhea, and growth using geographical information systems (GIS) technology. Anthropometric, address and disease prevalence data were collected through the SEEM (Study of Environmental Enteropathy and Malnutrition) study in Matiari, Pakistan. Publicly available map data were used to compile coordinates of healthcare facilities. A Pearson correlation coefficient (r) was used to calculate the correlation between distance from healthcare facilities and participant growth and morbidity. Other continuous variables influencing these outcomes were analyzed using a random forest regression model. In this study of 416 children, we found that participants living closer to secondary hospitals had a lower prevalence of ARI (r = 0.154, p < 0.010) and diarrhea (r = 0.228, p < 0.001) as well as participants living closer to Maternal Health Centers (MHCs): ARI (r = 0.185, p < 0.002) and diarrhea (r = 0.223, p < 0.001) compared to those living near primary facilities. Our random forest model showed that distance has high variable importance in the context of disease prevalence. Our results indicated that participants closer to more basic healthcare facilities reported a higher prevalence of both diarrhea and ARI than those near more urban facilities, highlighting potential public policy gaps in ameliorating rural health.
Collapse
Affiliation(s)
- Elise Corden
- Department of Pediatrics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA; (E.C.); (Y.S.); (M.F.R.); (F.Z.); (L.E.); (A.S.); (S.R.M.)
| | - Saman Hasan Siddiqui
- Department of Paediatrics and Child Health, Aga Khan University, Karachi 74800, Pakistan; (S.H.S.); (S.A.); (F.U.); (N.R.); (R.A.); (N.T.I.)
| | - Yash Sharma
- Department of Pediatrics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA; (E.C.); (Y.S.); (M.F.R.); (F.Z.); (L.E.); (A.S.); (S.R.M.)
| | - Muhammad Faraz Raghib
- Department of Pediatrics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA; (E.C.); (Y.S.); (M.F.R.); (F.Z.); (L.E.); (A.S.); (S.R.M.)
| | - William Adorno
- Department of Engineering Systems and Environment, University of Virginia, Charlottesville, VA 22903, USA;
| | - Fatima Zulqarnain
- Department of Pediatrics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA; (E.C.); (Y.S.); (M.F.R.); (F.Z.); (L.E.); (A.S.); (S.R.M.)
| | - Lubaina Ehsan
- Department of Pediatrics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA; (E.C.); (Y.S.); (M.F.R.); (F.Z.); (L.E.); (A.S.); (S.R.M.)
| | - Aman Shrivastava
- Department of Pediatrics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA; (E.C.); (Y.S.); (M.F.R.); (F.Z.); (L.E.); (A.S.); (S.R.M.)
| | - Sheraz Ahmed
- Department of Paediatrics and Child Health, Aga Khan University, Karachi 74800, Pakistan; (S.H.S.); (S.A.); (F.U.); (N.R.); (R.A.); (N.T.I.)
| | - Fayaz Umrani
- Department of Paediatrics and Child Health, Aga Khan University, Karachi 74800, Pakistan; (S.H.S.); (S.A.); (F.U.); (N.R.); (R.A.); (N.T.I.)
| | - Najeeb Rahman
- Department of Paediatrics and Child Health, Aga Khan University, Karachi 74800, Pakistan; (S.H.S.); (S.A.); (F.U.); (N.R.); (R.A.); (N.T.I.)
| | - Rafey Ali
- Department of Paediatrics and Child Health, Aga Khan University, Karachi 74800, Pakistan; (S.H.S.); (S.A.); (F.U.); (N.R.); (R.A.); (N.T.I.)
| | - Najeeha T. Iqbal
- Department of Paediatrics and Child Health, Aga Khan University, Karachi 74800, Pakistan; (S.H.S.); (S.A.); (F.U.); (N.R.); (R.A.); (N.T.I.)
| | - Sean R. Moore
- Department of Pediatrics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA; (E.C.); (Y.S.); (M.F.R.); (F.Z.); (L.E.); (A.S.); (S.R.M.)
| | - Syed Asad Ali
- Department of Paediatrics and Child Health, Aga Khan University, Karachi 74800, Pakistan; (S.H.S.); (S.A.); (F.U.); (N.R.); (R.A.); (N.T.I.)
| | - Sana Syed
- Department of Pediatrics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA; (E.C.); (Y.S.); (M.F.R.); (F.Z.); (L.E.); (A.S.); (S.R.M.)
- Department of Paediatrics and Child Health, Aga Khan University, Karachi 74800, Pakistan; (S.H.S.); (S.A.); (F.U.); (N.R.); (R.A.); (N.T.I.)
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22903, USA
- School of Data Science, University of Virginia, Charlottesville, VA 22903, USA
| |
Collapse
|
12
|
Changes in Meadow Phenology in Response to Grazing Management at Multiple Scales of Measurement. REMOTE SENSING 2021. [DOI: 10.3390/rs13204028] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Riparian and ground-water dependent ecosystems found in the Great Basin of North America are heavily utilized by livestock and wildlife throughout the year. Due to this constant pressure, grazing can be a major influence on many groundwater dependent resources. It is important for land managers to understand how intensity and timing of grazing affect the temporal availability of these commodities (i.e., biodiversity, water filtration, forage, habitat). Shifts in forage or water availability could potentially be harmful for fauna that rely on them at specific times of the year. Seven meadow communities, each consisting of three distinct vegetative communities, were grazed at three intensities to determine the relationship between grazing management and phenological timing of vegetation. The agreement of on-the-ground measurements, near-surface digital cameras (phenocams), and satellite-based indices of greenness was examined for a two-year period (2019–2020) over these grazing and vegetative community gradients. Field determined phenology, phenocam Green Chromatic Coordinate (GCC), and Landsat Normalized Difference Vegetation Index (NDVI) were all highly correlated and the relationship did not change across the treatments. Timing of growth varied in these ecosystems depending on yearly precipitation and vegetative type. Communities dominated by mesic sedges had growing seasons which stopped earlier in the year. Heavier grazing regimes, however, did not equate to significant changes in growing season. Ultimately, shifts in phenology occurred and were successfully monitored at various spatial and temporal scales.
Collapse
|
13
|
Reich MS, Flockhart DTT, Norris DR, Hu L, Bataille CP. Continuous‐surface geographic assignment of migratory animals using strontium isotopes: A case study with monarch butterflies. Methods Ecol Evol 2021. [DOI: 10.1111/2041-210x.13707] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Affiliation(s)
- Megan S. Reich
- Department of Biology University of Ottawa Ottawa ON Canada
| | - D. T. Tyler Flockhart
- Appalachian Laboratory University of Maryland Center for Environmental Science Frostburg MD USA
| | - D. Ryan Norris
- Department of Integrative Biology University of Guelph Guelph ON Canada
- Nature Conservancy of Canada Toronto ON Canada
| | - Lihai Hu
- Department of Earth and Environmental Sciences University of Ottawa Ottawa ON Canada
| | - Clément P. Bataille
- Department of Biology University of Ottawa Ottawa ON Canada
- Department of Earth and Environmental Sciences University of Ottawa Ottawa ON Canada
| |
Collapse
|
14
|
Affiliation(s)
- Arkajyoti Saha
- Department of Biostatistics, Johns Hopkins University, Baltimore, MD
| | - Sumanta Basu
- Department of Statistics and Data Science, Cornell University, Ithaca, NY
| | - Abhirup Datta
- Department of Biostatistics, Johns Hopkins University, Baltimore, MD
| |
Collapse
|
15
|
Reducing the Uncertainty of Radiata Pine Site Index Maps Using an Spatial Ensemble of Machine Learning Models. FORESTS 2021. [DOI: 10.3390/f12010077] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Site Index has been widely used as an age normalised metric in order to account for variation in forest height at a range of spatial scales. Although previous research has used a range of modelling methods to describe the regional variation in Site Index, little research has examined gains that can be achieved through the use of regression kriging or spatial ensemble methods. In this study, an extensive set of environmental surfaces were used as covariates to predict Site Index measurements covering the environmental range of Pinus radiata D. Don plantations in Chile. Using this dataset, the objectives of this research were to (i) compare predictive precision of a range of geostatistical, parametric, and non-parametric models, (ii) determine whether significant gains in precision can be attained through use of regression kriging, (iii) evaluate the precision of a spatial ensemble model that utilises predictions from the five most precise models, through using the model prediction with lowest error for a given pixel, and (iv) produce a map of Site Index across the study area. The five most precise models were all geostatistical and they included ordinary kriging and four regression kriging models that were based on partial least squares or random forests. A spatial ensemble model that was constructed from these five models was the most precise of those developed (RMSE = 1.851 m, RMSE% = 6.38%) and it had relatively little bias. Climatic and edaphic variables were the strongest determinants of Site Index and, in particular, variables that are related to soil water balance were well represented within the most precise predictive models. These results highlight the utility of predicting Site Index using a range of approaches, as these can be used to construct a spatial ensemble that may be more precise than predictions from the constituent models.
Collapse
|