جستجوی مقالات مرتبط با کلیدواژه

data pre processing algorithms

در نشریات گروه کشاورزی

تکرار جستجوی کلیدواژه data pre processing algorithms در مقالات مجلات علمی

انتخاب همه

ارزیابی مدلهای تلفیقی هوش مصنوعی در تخمین غلظت مواد جامد محلول در بالادست شهر ساری

علیرضا قائمی، مهدی اژدری مقدم*، سارینا کیخا

مجله مهندسی و مدیریت آبخیز، سال شانزدهم شماره 1 (بهار 1403)، صص 50 -63

مقدمه

رودخانه ها، به عنوان منابع حیاتی طبیعت و پایه های اصلی توسعه پایدار شناخته می شوند. لذا، کمیت و کیفیت آب رودخانه ها جز پارامترهای ارزشمند محسوب می شوند. افزایش فعالیت های کشاورزی و صنعتی، باعث کاهش کیفیت منابع آب در بسیاری از نقاط شده است. تخلیه فاضلاب، زباله و کودهای شیمیایی در روستاهای حاشیه رودخانه ها از مهم ترین منابع آلودگی کیفیت آب به شمار می رود. حجم ورود پساب های شهری و کشاورزی به این رودخانه، باعث افزایش مقادیر آلاینده ها شده، به طوری که در بازه سال های 1372 تا انتهای سال 1396، میانگین آلاینده های موجود در ایستگاه ورند برای سه آلاینده مواد جامد محلول، کلر و سدیم به ترتیب برابر 507.49، 2.16 و 2.47 بوده است. بنابراین، برآورد دقیق پارامترهای کیفیت آب، یک نیاز اساسی برای مدیریت کیفیت آب، سلامت انسان، مصرف عمومی و مصارف خانگی است.

مواد و روش ها

حوضه تجن، با مساحتی در حدود 4147.22 کیلومتر مربع دارای میانگین دبی رودخانه و بارندگی سالانه به ترتیب 20 متر مکعب بر ثانیه و 539 میلی متر است. بیشترین و کمترین ارتفاع حوضه، به ترتیب 3728 و 26 متر گزارش شده است. فعالیت های مختلفی نظیر کشاورزی و سدسازی در این رودخانه صورت می گیرد. در این پژوهش، از تلفیق دو مدل برنامه نویسی بیان ژن (GEP) و شبکه عصبی مصنوعی (ANN)، با یکی از الگوریتم پیش پردازش داده به نام EMD برای تخمین یکی از پارامتر مهم کیفیت آب به نام مواد جامد محلول (TDS) استفاده شد. بدین منظور، در این پژوهش، برای مدلسازی و برآورد مقادیر مواد جامد محلول از چندین داده کیفی شامل سدیم، کلسیم، منیزیم، سولفات بی کربنات، سولفوریک اسید و کلر که در بازه سال های 1372 تا انتهای سال 1396 در ایستگاه ورند اندازه گیری شده، استفاده شد.

نتایج و بحث:

در ابتدا، نتایج حاصل از داده های مشاهداتی در طول مدت نمونه برداری نشان دادند که مقادیر TDS در حدود 80 درصد از نمونه ها در بازه 300 تا 600 میلی گرم بر لیتر بوده که بیانگر کیفیت خوب آب این رودخانه بوده است. برای مقایسه عملکرد مدل های مستقل و ترکیبی برای تخمین پارامترهای کیفی رودخانه در مراحل آموزش و آزمون، شاخص های آماری ضریب همبستگی (R)، ریشه میانگین مربعات خطا (RMSE)، میانگین انحراف خطا (MBE)، ضریب نش (NSE)، تابع هدف (OBJ) و نسبت RSD، استفاده شدند. نتایج حاصل از این مطالعه نشان داد که مدل ترکیبی حاصل از برنامه نویسی بیان ژن و تجزیه مود تجربی (EMD-GEP) توانست با کمترین خطا (0.23=RSD و 24.41=RMSE) عملکرد بالاتری را در مقایسه با سایر مدل ها مانند مدل GEP (0.44 =RSD و RMSE=47.27) در تخمین این پارامتر کیفی آب داشته باشد. علاوه بر این، مدل ترکیبی حاصل از شبکه عصبی مصنوعی و تجزیه مود تجربی (EMD-ANN) با 36.64 =RMSE و 0.95=R بعد از EMD-GEP، در رتبه دوم قرار گرفت. مطابق نتایج حاصل از تابع هدف برای هریک از مدل ها، مدل EMD-GEP دارای مقادیر تابع هدف کمتری (15.92=OBJ) نسبت به سایر مدل ها در مدلسازی مواد جامد محلول است. درحالی که، بیشترین مقدار تابع هدف (29.34=OBJ) متعلق به مدلGEP است.

نتیجه گیری

به منظور، تخمین مواد جامد محلول در رودخانه تجن، از روش های ANN و GEP در این پژوهش استفاده شد. سپس، برای تقویت برآوردها از الگوریتم های تجزیه مود تجربی برای پیش پردازش داده های سری زمانی استفاده شد. نتایج به دست آمده از مدل های هوشمند تلفیقی با استفاده از شاخص های آماری خطا، مانند ضریب همبستگی، ریشه میانگین مربعات خطا، مورد ارزیابی قرار گرفت. نتایج نشان داد که روش EMD، باعث تقویت مدل های ANN و GEP، در راستای تخمین این پارامتر کیفی آب در ایستگاه ورند شده اند. به طوری که مدل های ترکیبی حاصل از GEP و ANN توانست 48.35 درصد و 14.02 درصد خطای RMSE را به ترتیب در مقایسه با دو مدل مستقل GEP و ANN کاهش دهد.

کلید واژگان: الگوریتم های پیش پردازش داده ها، برنامه نویسی بیان ژن، شبکه عصبی مصنوعی، رودخانه تجن

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Evaluation of integrated artificial intelligence models in estimating total dissolved solid concentrations in the upstream of Sari city

Alireza Ghaemi, Mahdi Azhdary Moghaddam *, Sarina Keikha

Journal of Watershed Engineering and Management, Volume:16 Issue: 1, 2024, PP 50 -63

Introduction

Rivers are known as the vital resources of nature and the main foundations of sustainable development. Therefore, the quantity and quality of river water are considered valuable parameters. The increase in agricultural and industrial activities has reduced the quality of water resources in many places. The discharge of sewage, garbage and chemical fertilizers in the villages along the rivers is one of the most important sources of water quality pollution. The amount of urban and agricultural wastewater entering this surface has caused an increase in the amount of pollutants, so that in the period of 1993 to the end of 2017, the average amounts the three pollutants of total dissolved solid, chlorine and sodium in Varand Station are respectively 507.49, 2.16 and 2.47. Therefore, accurate estimation of water quality parameters is a basic requirement for water quality management, human health, public consumption and domestic use.

Materials and methods

Tajan River basin with an area of about 4147.22 square kilometers has an average river discharge and annual rainfall of 20 cubic meters per second and 539 mm respectively. The highest and lowest elevations of this River basin have been reported as 3728 and 26 meters, respectively. Various human activities such as agriculture and dam construction are carried out in this river. Therefore, evaluationg the water quality of this river basin is required. In this research, the combination of two Gene Expression Programming Models (GEP) and Artificial Neural Network (ANN) with a data preprocessing algorithm called Empirical Mode Decomposition (EMD) was used to estimate one of the important parameters of water quality called Total Dissolved Solids (TDS). For this purpose, in this research, some of qualitative parameters including sodium, calcium, magnesium, sulfate bicarbonate, sulfuric acid and chlorine, which were measured in the period of 1993 to the end of 2017 at Varand station, were used to estimate the concentration of total dissolved solids.

Results and discussion

At first, the results of the observation data during the sampling period indicated that the TDS values in about 80% of the samples were in the range of 300 to 600 mg.liter-1, which reprsented the good quality of the water of this river. In order to compare the performance of independent and integrated approaches in estimating the quality parameters of the Tajan River in the training and testing stages, the evaluation benchmarks including Correlation Coefficient (R), Root Mean Square Error (RMSE), Mean Deviation of Error (MBE), Nash Coefficient (NSE), Objective Function (OBJ) and RSD ratio were applied. The results of this study demonestrated that the integrated model of Gene Expression Programming and Empirical Mode Decomposition (EMD-GEP) with the lowest error (RSD=0.23 and RMSE=24.41) was the most accurate model in TDS estimating compared to other models such as GEP (RSD=0.44 and RMSE=47.27). In addition, the integrated model of Artificial Neural Network and Empirical Mode Decomposition (EMD-ANN) with RMSE=36.64 and R=0.95 was stood at the second rank. Additionally, the outcomes of the Objective Function (OBJ) represented that EMD-GEP model could achieved the lowest OBJ value (15.92) than other techniques in the TDS modeling. While, the highest value of the OBJ=29.34 belonged to the GEP model.

Conclusion

ANN and GEP methods were applied in this research to estimate TDS concentarion in the Tajan River. After that, to increase the accuracy of the models, EMD technique was recruited to decompose the time series dataset. The results obtained from the integrated models were evaluated using some error statistical benchmarks such as correlation coefficient, root mean square error. The results showed that the EMD method could play an essential role in increasing the ANN and GEP performance so as to estimate this water quality parameter in Varand station. So that EMD-GEP and EMD-ANN could reduce the RMSE error by 48.35% and 14.02%, respectively, compared to the two independent models of GEP and ANN.

Keywords: Artificial Neural Network, Data pre-processing algorithms, Gene Expression Programming, Tajan River

Abstract View Paper Research/Original Article Original: Persian
مدلسازی مواد جامد محلول با استفاده از رویکرد مبتنی بر هوش مصنوعی بر پایه الگوریتم های پیش پردازش کننده

سعید پی پل زاده، رضا مستوری*، نازنین شاه کرمی

مجله پژوهش آب ایران، پیاپی 44 (بهار 1401)، صص 25 -37

کیفیت آب یکی از مهم ترین عوامل موثر در زندگی سالم و حیات بشر است. از این رو، شناسایی مواد جامد محلول (TDS) یکی از مهمترین عوامل آن می باشد؛ که بسیاری از برنامه های توسعه منابع آب در شناسایی این عوامل اجرا خواهند شد. پیش بینی دقیق پارامترهای کیفیت آب یک نیاز اساسی برای مدیریت کیفیت آب، سلامت انسان، مصرف عمومی و مصارف خانگی است. در این مطالعه، با استفاده از یک الگوریتم پیش پردازش داده جدید، EEMD، برای تخمین یکی از پارامترهای مهم کیفیت آب، بنام TDS استفاده شده است. پذیرش و قابلیت اطمینان مدل های پیشنهادی (به عنوان مثال ، شبکه های عصبی مصنوعی (ANN) ، EEMD-ANN ، ماشین بردار پشتیبان(SVM) و EEMD-SVM) با استفاده از پنج معیار عملکرد و نمودارهای تصویری ارزیابی شد. مقایسه نتایج بین مدل های مستقل و ترکیبی نشان داد که الگوریتم های پیش-پردازنده می توانند عملکرد مدل SVM ترکیبی را برای برآورد پارامتر کیفی TDS افزایش دهد. به عنوان مثال، مدل EEMD-SVM باRMSE برابر 23/20برای مرحله آموزش و RMSE برابر 29/27 برای مرحله آزمایش در ایستگاه ورند و RMSE = 45.26 برای مرحله آموزش و RMSE =40.06 برای مرحله آزمایش در ایستگاه گرمرود، از سایر مدل های ترکیبی و مستقل بهتر عمل کرده است. از این رو، می توان روش ترکیبی ماشین های بردار پشتیبان بر پایه الگوریتم پیش پردازش کننده EEMD را به عنوان یک مدل برتر به تصمیم گیران جهت برنامه ریزی و مدیریت در زمینه تشخصی کیفیت آب رودخانه ها پیشنهاد کرد.

کلید واژگان: پارامترهای کیفیت آب، جامدات جامد محلول، الگوریتم های پیش پردازش داده ها، شبکه های عصبی مصنوعی، ماشین بردار پشتیبان

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Modeling total suspended solids using artificial intelligence methods based on pre processing algorithms

Saeed Pipelzadeh, Reza Mastouri *, Nazanin Shahkarami

Iranian Water Research Journal, Volume:16 Issue: 44, 2022, PP 25 -37

Introduction

Water quality is one of the most important factors in healthy living and human life. In this sense, some of the water quality parameters should be controlled in maintaining the human health and welfare. In today’s industrial world, most of the global natural water sources, including those in Iran, contain impurities such as the TDS. Numerous factors that are includes include cations such as sodium ion (Na+), potassium ion (K+), calcium ion (Ca+2), and magnesium ion (Mg+2) and anions such as chloride ion (Cl-) and bicarbonate ion (HCO3- ) with sulphate ion (SO42-) affect the concentration of these parameters in natural water systems. The total dissolved solids (TDS) is one of its most important factors; Many water resources development programs will be implemented to identify these factors. Accurate prediction of water quality parameters is a basic need for water quality management, human health, public consumption and household consumption. In the last decades, artificial intelligence (AI) techniques have become viable and popular due to their advantages, and have been widely developed in solving a variety of environmental engineering and water quality engineering problems.

Methods

For the estimation of water quality parameters (WQPs), Singh et al. (2011) utilized the clustering method, or support vector clustering (SVC), to optimize surface water quality monitoring in the city of Lucknow, India. The overall view of the water quality index of their study area revealed that most of the study area come under highly to very highly polluted zones. Tan et al. (2012) predicted phosphorus values in China with the least square support vector regression (LSSVR) method. They compared the efficiency of the LSSVR method with neural networks of the radial basis function (RBF) and back-propagation (BP). Experimental results showed that the small sample case with noise, LSSVM method was better than multi-layer BP and RBF neural network and is able to better meet the requirements of water quality prediction. Liu et al. (2013) addressed WQPs prediction in aquaculture employing the GP and real-value genetic algorithm-SVM (RGA-SVR). They used the GA to modify the coefficients of the SVR method. The results showed the superiority of the RGA-SVR algorithm over other methods based on the root mean square error (RMSE) and mean absolute percentage error (MAPE). Ghavidel and Montaseri (2014) employed ANN, GEP, and ANFIS with grid partition as well as ANFIS with subtractive clustering (ANFIS-SC) to predict TDS values of the Zarinehroud basin, Iran. A comparison was made between the above AI approaches, and the results demonstrated the superiority of GEP over the other intelligent models. Abyane (2014) compared artificial neural network (ANN) with multivariate linear regression (MLR) for prediction of BOD and COD in the wastewater treatment plant. In their study, ANN could predict BOD and COD parameters with higher precision than MLR.

Results

Due to complex characteristics of time series WQPs, a standalone model can hardly satisfy the estimation accuracy requirements. Therefore, the hybrid models combined with different single models will be an effective way to improve the WQPs estimation accuracy. This study proposes a new and accurate hybrid model for predicting WQP (i.e., TDS) using ions at Varand and Garmrood, two hydrometric stations of Tajan basin, Iran. The proposed WQP estimating framework was developed based on the combination of a data pre-processing algorithms (i.e., EEMD) with two AI-based models that was not addressed by the literature related to the WQPs modelling. Acceptance and reliability of proposed hybridized and standalone models (e.g., artificial neural networks (ANN), EEMD-ANN, support vector machine (SVM) and EEMD-SVM) using five performance criteria and visual diagrams were evaluated. Comparison of results between independent and hybrid models showed that EEMD data pre-processing algorithm can increase the performance of the hybrid SVM model for estimating the TDS quality parameter in both training and testing stages at both considered hydrometric stations. For example, the EEMD-SVM model with RMSE = 20.23 for the training phase and RMSE = 27.29 for the test phase at Varand station and RMSE = 45.26 for the training phase and RMSE = 40.06 for the test phase at Garmrood station has performed better than other hybrid and standalone models. In general, the proposed hybridized model of support vector machines based on EEMD data pre-processing algorithm can be proposed as a superior model to decision makers for planning and management in the field of river water quality detection and determination.

Keywords: Water quality parameters, total dissolved solids, data pre-processing algorithms, Artificial neural networks, Support Vector Machine

Abstract View Paper Research/Original Article Original: Persian

نکته

نتایج بر اساس تاریخ انتشار مرتب شده‌اند.
کلیدواژه مورد نظر شما تنها در فیلد کلیدواژگان مقالات جستجو شده‌است. به منظور حذف نتایج غیر مرتبط، جستجو تنها در مقالات مجلاتی انجام شده که با مجله ماخذ هم موضوع هستند.
در صورتی که می‌خواهید جستجو را در همه موضوعات و با شرایط دیگر تکرار کنید به صفحه جستجوی پیشرفته مجلات مراجعه کنید.

به جمع مشترکان مگیران بپیوندید!

data pre processing algorithms