Magiran | جستجوی کلیدواژه "support vector regression"

شبیه سازی جریان ماهانه رودخانه با استفاده از مدل رگرسیون ماشین بردار پشتیبان بهبود یافته با استفاده از الگوریتم بهینه سازی گرگ خاکستری

صفورا پیروزمهر، سعید شعبانلو*، فریبرز یوسفوند، بهروز یعقوبی، احمد رجبی، محمدعلی ایزدبخش

مجله تحقیقات آب و خاک ایران، سال پنجاه و ششم شماره 1 (پیاپی 109، فروردین 1404)، صص 151 -170

اندازه گیری جریان رودخانه ها یکی از مهم ترین مسائل در مدیریت رودخانه است به همین دلیل همواره تلاش می شود از روش های دقیقی برای اندازه گیری آن استفاده شود. هدف این مطالعه بهبود کارایی مدل رگرسیون بردار پشتیبان (SVR) با استفاده از الگوریتم بهینه سازی گرگ خاکستری (GWO) برای مدل سازی جریان ماهانه رودخانه است. برای این کار از داده های ماهانه جریان ماهانه رودخانه، بارندگی و دما طی 15 سال (از سال 1400 تا 1385) استفاده شد. برای انتخاب بهترین متغیرهای ورودی به مدل SVR و GWO-SVR از روش سعی و خطا استفاده شد. بر اساس نتایج حاصل از این روش Q(t-1) R(t-1),T(t-1) بهترین متغیرهای مستقل برای شبیه؜سازی متغیر Q_t هستند. از 80درصد همه داده؜ها برای آموزش و 20 درصد داده؜ها برای صحت سنجی مدل؜های SVR و GWO-SVR استفاده شد، از شاخص؜های R^2، RMS وNSE برای ارزیابی کارایی مدل؜ها استفاده شد. برای توسعه مدل؜ها از توابع فعال ساز خطی (LKF)، چندجمله ایی (PKF)،تابع پایه شعاعی (RBF)، سیگموید (SKF) استفاده شد. ابتدا از روش سعی و خطا برای تعیین پارامترهای توابع فعال ساز استفاده شد.بر اساس نتایجحاصل از این مطاله مدل SVR با تابع فعال ساز چندجمله ایی بهترین عملکرد را در مرحله آموزش و صحت سنجی دارد و با تابع فعال ساز خطی بدترین عملکرد را در مرحله اموزش و صحت سنجی دارد. سپس از الگوریتم GWO برای تعیین پارامترهای توابع فعال ساز استفاده شد. بر اساس نتایج حاصل مدل SVR با الگورییتم GWO عملکرد بهتری دارد. بنابرین برای شبیه؜سازی جریان ماهانه اب رودخانه با استفاده از این مدل بهتر است به جای روش سعی و خطا از الگوریتم GWO استفاده شود

کلید واژگان: جریان ماهانه رودخانه، رگرسیون بردار پشتیبان، الگوریتم بهینه سازی گرگ خاکستری، مدیریت منابع آب

Simulation of monthly river flow using improved support vector machine regression model using gray wolf optimization algorithm

Safoura Pirouzmehr, Saeid Shabanlou *, Fariborz Yosefvand, Behrouz Yaghoubi, Ahmad Rajabi, Mohammadali Izadbakhsh

Iranian Journal of Soil and Water Research, Volume:56 Issue: 1, 2025, PP 151 -170

Measuring the flow of rivers is one of the most important issues in river management, that's why it is always tried to use accurate methods for its measurement. The aim of this study is to enhance the performance of Support Vector Regression (SVR) model using the Gray Wolf Optimization (GWO) algorithm for monthly river flow modeling. For this purpose, the monthly data of river flow, precipitation and temperature during 15 years (from 1400 to 2015) are used. The trial and error procedure is used to select the best input variables to the SVR and GWO-SVR models. Based on the results of this method, Q(t-1), R(t-1), T(t-1) are the best independent variables for simulating the variable Q_t. 80% of all data are used for training and 20% for validating the SVR and GWO-SVR models. Also, R^2, RMS and NSE indices are utilized to evaluate the efficiency of the models, linear (LKF), polynomial (PKF), radial basis function (RBF), and sigmoid (SKF) activation functions are used to develop the models. First, the trial and error procedure is used to determine the parameters of the activation functions. Based on the results of this study, the SVR model with the polynomial activation function has the best performance in the training and validation stage, and the worst performance with the linear activation function in the training and verification stages. Then, the GWO algorithm is used to determine the parameters of the activation functions. Based on the results, the SVR model performs better with the GWO algorithm. Therefore, to simulate the monthly flow of river using this model, it is better to use the GWO algorithm instead of the trial and error procedure.

Keywords: Monthly River Flow, Support Vector Regression, Gray Wolf Optimization Algorithm, Water Resources Management

کاربرد روش ماشین بردار پشتیبان در شبیهسازی عملکرد جو با استفاده از شاخصهای خشکسالی

هادی رمضانی اعتدالی*، مژگان احمدی

مجله آبیاری و زهکشی ایران، سال هجدهم شماره 5 (آذر و دی 1403)، صص 735 -744

تخمین به موقع و دقیق عملکرد محصول برای تنظیم مدیریت زراعی و تضمین پایداری کشاورزی مهم است. در این پژوهش به بررسی کاربرد روش رگرسیون بردار پشتیبان در شبیه سازی عملکرد جو در ده مزرعه انتخابی در دشت قزوین طی دوره 2020-2019 با استفاده از شاخص های خشکسالی NDVI، MSAVI و EVI پرداخته شده است. برای شاخص های خشکسالی از ماهواره سنتینل2 استفاده شد. نتایج با استفاده از آماره های R2، RMSEو ME ارزیابی شد. به منظور بررسی کاربرد روش رگرسیون بردار پشتیبان در شبیه سازی عملکرد جو با استفاده از شاخص های خشکسالی مذکور، هفت سناریو تعریف شد. نتایج ارزیابی ارتباط بین شاخص های NDVI، MSAVI و EVI با عملکرد جو نشان داد که روش رگرسیون بردار پشتیبان در همه سناریوها به جز سناریو اول در مرحله آزمون با احتمال معنی داری 95 % (P-value= 0.0) و ضریب تبیین بیش از 70/0 و مقدار کم شاخص RMSE تخمین مناسبی از عملکرد محصول جو داشته است. نتایج این پژوهش، کاربرد مفیدی در مدیریت و برنامه ریزی توسعه کشت جو بر اساس شاخص های خشکسالی در آینده خواهد داشت.

کلید واژگان: شاخص های خشکسالی، سنتینل2، رگرسیون بردار پشتیبان، عملکرد جو

Application of support vector machine method in simulating barley wield using drought indices

Hadi Ramezani Etedali *, Mojgan Ahmadi

Iranian Journal of Irrigation & Drainage, Volume:18 Issue: 5, 2024, PP 735 -744

Timely and accurate crop yield estimation is important to regulate crop management and ensure agricultural sustainability. This research investigates the application of the support vector regression method in simulating the yield of barley in ten selected farms in the Qazvin plain from 2019 to 2020 using the drought indices NDVI, MSAVI, and EVI. Sentinel 2 satellite was used for drought indices. The results were evaluated using R2, RMSE, and ME statistics. To investigate the application of the support vector regression method in simulating the yield of barley using the mentioned drought indicators, seven scenarios were defined. The results of evaluating the relationship between NDVI, MSAVI, and EVI indices with yield barley showed that the support vector regression method in all scenarios except the first scenario in the test phase with a significant probability of 95% (P-value=0.0) and an explanatory coefficient of more than 0.7 and the low value of the RMSE index has a good estimate of the yield of the barley product. The results of this research will be useful in managing and planning the development of barley cultivation based on drought indices in the future.

Keywords: Drought Indices, Sentinel 2, Support Vector Regression, Yield Barley

ارزیابی سختی آب زیرزمینی دشت خرم آباد با استفاده از مدلهای هیبریدی مبتنی بر الگوریتم های فراابتکاری

حمیدرضا باباعلی، ابراهیم نوحانی، رضا دهقانی*

مجله آبخوان و قنات، سال پنجم شماره 1 (پیاپی 8، بهار و تابستان 1403)، صص 125 -138

در این پژوهش از مدل هیریدی رگرسیون بردار پشتیبان با الگوریتم های بهینه سازی موجک، کرم شب تاب و گرگ خاکستری به منظور برآورد میزان سختی آب زیرزمینی چاه ناصروند واقع در دشت خرم آباد استان لرستان استفاده شد. جهت مدل سازی از داده های کیفی چاه موجود شامل پارامتر هیدروژن کربنات (HCO3)، کلرید (Cl)، سولفات (So4)، منیزیم (mg)، کلسیم (ca) و سختی آب (TH) همگی بر حسب ppm در مقیاس زمانی ماهانه در طی سال آبی (1402-1382) بعنوان ورودی و میزان سختی آب زیرزمینی به عنوان خروجی مدل انتخاب گردید. به منظور ارزیابی عملکرد مدلها از معیارهای ارزیابی ضریب همبستگی، ریشه میانگین مربعات خطا، میانگین قدر مطلق خطا و ضریب نش ساتکلیف استفاده شد. همچنین جهت تحلیل نتایج مدلها از نمودار سری زمانی و باکس پلات و تیلور استفاده شد. نتایج نشان داد سناریو های ترکیبی در مدلهای مورد بررسی باعث بهبود عملکرد مدل می شود. مقایسه نتایج نشان داد مدل رگرسیون بردار پشتیبان - موجک عملکرد بهتری نسبت به مدل رگرسیون بردار پشتیبان-گرگ خاکستری در مدل سازی دارد، بگونه ای که مدل رگرسیون بردار پشتیبان - موجک با ضریب همبستگی 0/917، کمترین ریشه میانگین مربعات (ppm) 0/190، کمترین میانگین قدر مطلق خطا (ppm) 0/115 و بیشترین ضریب نش ساتکلیف 0/920 در مرحله صحت سنجی در اولویت قرار گرفت. درمجموع نتایج نشان داد استفاده از مدلهای هوشمند مبتنی بر رویکرد رگرسیون بردار پشتیبان می تواند رویکردی موثر در مدیریت کیفی آبهای زیرزمینی باشد.

کلید واژگان: آب زیرزمینی، خرم اباد، رگرسیون بردار پشتیبان، فراابتکاری

Assessment of Groundwater Hardness in Khorramabad Plain Using Hybrid Models Based on Metaheuristic Algorithms

Hamidreza Babaali, Ebrahim Nohani, Reza Dehghani *

Journal of Auifer and Qanat, Volume:5 Issue: 1, 2024, PP 125 -138

In this study, a hybrid Support Vector Regression (SVR) model, combined with optimization algorithms including Wavelet, Firefly, and Grey Wolf, was employed to estimate groundwater hardness at the Naservand well in the Khorramabad Plain of Lorestan Province. For modeling, monthly quality data of the well during the water year (1382-1402) were used. This included parameters such as bicarbonate (HCO3), chloride (Cl), sulfate (SO4), magnesium (Mg), calcium (Ca), and total hardness (TH), all measured in ppm, which were used as inputs. Groundwater hardness was selected as the model output. To evaluate the performance of the models, assessment metrics such as the correlation coefficient, root mean square error (RMSE), mean absolute error (MAE), and Nash-Sutcliffe efficiency coefficient (NSE) were used. Time series, box plots, and Taylor diagrams were also used for analyzing the model results. The results indicated that hybrid scenarios in the studied models improve the model’s performance. Comparing results showed that the Support Vector Regression-Wavelet model performed better than the Support Vector Regression-Grey Wolf model, with the Support Vector Regression-Wavelet model achieving a correlation coefficient of 0.917, the lowest RMSE of 0.190 ppm, the lowest MAE of 0.115 ppm, and the highest NSE of 0.920 in the validation stage. Overall, the results indicate that using intelligent models based on the Support Vector Regression approach can be an effective approach for managing groundwater quality.

Keywords: Groundwater, Khorramabad, Support Vector Regression, Metaheuristics

برآورد دمای خاک در اقلیم های مختلف با استفاده از روش های داده محور

آیناز وفائی، عرفان عبدی، سعید صمدیان فرد*

نشریه دانش آب و خاک، سال سی و چهارم شماره 3 (پاییز 1403)، صص 143 -160

دمای خاک یکی از جنبه های مهم کشاورزی و هیدرولوژی است و اندازه گیری دقیق آن برای اطمینان از رشد و نمو مطلوب گیاه بسیار مهم است. دمای خاک عاملی است که بر بسیاری از فرآیندها مانند جوانه زنی، میزان رطوبت خاک، هوادهی، سرعت نیتریفیکاسیون تبدیل آمونیاک به نیترات و در دسترس بودن مواد مغذی گیاه تاثیر می گذارد. با توجه به این که داده های دمای خاک در بعضی از ایستگاه های سینوپتیک اندازه گیری می شود، اغلب داده ها دارای محدودیت و یا نواقصی هستند. با این حال انتخاب بهترین روش جهت پیش بینی و تخمین دمای خاک با سایر داده های هواشناسی موجود، رویکردی موثر و کار آمد در بسیاری از زمینه ها می باشد؛ لذا در مطالعه حاضر، توانایی مدل های داده محور رگرسیون فرایند گاوسی (GPR)، رگرسیون ماشین بردار پشتیبان (SVR)، الگوریتم M5P، رگرسیون خطی (LR) و شبکه عصبی پرسپترون چندلایه (MLP) در برآورد دمای خاک سه ایستگاه اراک، رامسر و شیراز طی دوره آماری 32 ساله با استفاده از پنج معیار اعتبارسنجی مورد ارزیابی قرار گرفت. نتایج بدست آمده نشان داد که سناریو هشتم M5P و LR با داشتن جذر میانگین مربعات خطای کمتر به ترتیب «899/0و 889/0» برای ایستگاه رامسر، «958/0 و949/0» برای ایستگاه اراک و «966/0 و953/0» برای ایستگاه شیراز، عملکرد بهتری نسبت به سایر مدل ها داشته است. همچنین پارامتر های رطوبت نسبی و دمای هوا از موثر ترین پارامتر های هواشناسی مورد نیاز در برآورد دمای خاک شناخته شد، بطوری که افزودن این پارامتر ها باعث افزایش دقت مدل می شود.

کلید واژگان: پیش بینی، داده های هواشناسی، رگرسیون مدل گاوسی، رگرسیون ماشین بردار پشتیبان، شبکه عصبی چندلایه

Estimating soil temperature at various climates using data-driven methods

Aynaz Vafaei, Erfan Abdi, Saeed Samadianfard *

Journal of Soil and water knowledge, Volume:34 Issue: 3, 2024, PP 143 -160

Background and Objectives

Soil temperature is one of the important factors in agriculture and hydrology, and its accurate measurement is very important to ensure the optimal growth and development of the plants. Soil temperature is a factor that affects many processes such as seed germination, soil moisture level, aeration, nitrification and availability of plant nutrients. Because the soil temperature data is measured in some synoptic stations, most of the data have limitations or are incomplete. However, choosing the best method to estimate soil temperature with other available meteorological data is an efficient approach in many fields. Soil temperature depends on several factors including color, slope, vegetation, density, humidity and amount of sunlight. Currently, some physical models are available that are intrinsically related to the state of soil heat flow and energy balance in underlying soils to estimate soil temperature. The importance of soil temperature in agricultural sciences and hydrology, on the one hand, and the existence of many difficulties in recording this vital parameter, have led researchers to seek a relationship between soil temperature and other parameters in order to be able to estimate soil temperature with optimal accuracy.

Methodology

In this research, daily soil temperature values were collected during the time period of 1990-2022 in Ramsar, Arak and Shiraz stations. On the other hand, the parameters of minimum temperature (Tmin), maximum temperature (Tmax), average temperature (Tm), maximum relative humidity (Umax), minimum relative humidity (Umin), average relative humidity (Um), average wind speed (FFM) and Sunshine hours (SSHN) was considered as the input parameters and soil temperature (T-soil) as the target parameter. It is worth mentioning that the way of choosing different input compounds to estimate the value of soil temperature in the studied models is based on having a higher correlation with soil temperature based on the thermal map. Moreover, the ability of data-driven models of Gaussian process regression (GPR), support vector regression (SVR), M5P algorithm, linear regression (LR), and multilayer perceptron (MLP) neural network in estimating soil temperature was evaluated using different statistical parameters of correlation coefficient (R), root mean square error (RMSE), Nash Sutcliffe coefficient (NS), average absolute value of percentage error (MAPE) and Wilmot agreement index (WI).

Findings

The evaluation of five GPR, SVR, M5P, LR and MLP models for three stations of Arak, Ramsar and Shiraz shows that the 8th M5P scenario and the 8th LR scenario with lower root mean square error respectively (0.899 and 0.889) for Ramsar station, (0.958 and 0.949) for Arak station and (0.966 and 0.953) for Shiraz station have better performance than other studied models. Also, the evaluation of the impact of the input parameters in creating the scenario for the models shows that the parameters of relative humidity and air temperature had more important role than other input parameters. So that by adding parameters of relative humidity and air temperature, the accuracy of the model has increased. Therefore, these parameters are among the most key and important parameters of soil temperature.

Conclusion

The analysis and evaluation of soil characteristics has an important impact in the fields of hydrology, agriculture and climate. On the other hand, soil temperature has a direct relationship with the amount of moisture available to the plant, so that an increase in soil temperature can increase the transpiration rate of plants, and as a result, soil moisture decreases. Soil temperature is also an essential factor in agriculture because it determines whether plants can grow, and controls soil chemistry and biology and atmosphere-land gas exchange. Therefore, predicting soil temperature is very important for successful crop management and yield optimization. So, In this research, five data-driven methods of GPR, SVR, M5P, LR and MLP were used to predict soil temperature in Arak, Ramsar and Shiraz stations during the time period of 1990-2022. The obtained results were compared using statistical parameters and it was concluded that the 8th M5P scenario and the 8th LR scenario have shown the best performance in three stations with the lowest error compared to all scenarios. Therefore, the application of the mentioned models to predict the soil temperature has proper accuracy and is recommended for management and evaluation in terms of environmental and civil aspects.

Keywords: Estimation, Gaussian Model Regression, Meteroogical Parameters, Multilayer Neural Network, Support Vector Regression

مطالعه مقایسه ای با استفاده از روش داده محور در مقابل رویکرد ترکیبی در جهت برآورد تبخیر-تعرق مرجع روزانه در اهواز

میلاد شرفی*، سینا بشارت، کامران زینال زاده

فصلنامه علوم و مهندسی آبیاری، سال چهل و هفتم شماره 2 (تابستان 1403)، صص 53 -68

پیش بینی تبخیر-تعرق مرجع روزانه یک ابزار تعیین کننده و مفید در کشاورزی پایدار و مسائل هیدرولوژیک، به ویژه در طراحی و مدیریت سیستم های منابع آب می باشد. استفاده از مدل های هیبریدی با کمک عوامل اقلیمی روشی موثر در فرآیند پیش بینی تبخیر-تعرق مرجع روزانه است. بنابراین، در این مطالعه توانایی مدل رگرسیون بردار پشتیبان (SVR) و مدل ترکیبی رگرسیون بردار پشتیبان با الگوریتم مگس میوه (SVR-FOA) در برآورد تبخیر-تعرق مرجع روزانه در ایستگاه اهواز، طی دوره 2022-2000 با استفاده از چهار معیار آماری مورد ارزیابی قرار گرفت. ورودی های مورد استفاده شامل پارامترهای میانگین دما، حداقل دما، حداکثر دما، متوسط رطوبت نسبی، حداقل رطوبت نسبی، حداکثر رطوبت نسبی، سرعت باد و ساعات آفتابی بود. آنالیز حساسیت پارامترهای ورودی با استفاده از ضریب همبستگی پیرسون نیز نشان داد که در میان پارامترهای ورودی، پارامتر ساعات آفتابی و رطوبت نسبی از مولفه های موثر بر پیش بینی تبخیر بودند به طوری که تاثیر مستقیمی روی مقدار تبخیر روزانه داشته و باعث کاهش خطا در تمام مدل ها گردیدند. نتایج به دست آمده نشان داد که سناریو ششم مدل SVR-FOA بهترین عملکرد را با کمترین خطا (mm/day 24/1) نسبت به تمامی مدل ها ارائه داد. در بین سناریوهای مدل SVR نیز سناریو سوم مدل SVR کمترین خطا را (mm/day 45/1)، نسبت به سایر ترکیبات SVR از خود نشان داد. نتایج حاصل از این پژوهش نشان داد که سناریو ششم مدل SVR-FOA بهترین عملکرد را داشته و نیز الگوریتم هیبریدی مگس میوه باعث بهبود عملکرد رگرسیون بردار پشتیبان در برآورد تبخیر-تعرق مرجع روزانه گردید.

کلید واژگان: پیش بینی، بهینه سازی، الگوریتم مگس میوه، رگرسیون بردار پشتیبان، اهواز

A comparative study using a data-driven method versus a hybrid approach to estimate daily reference evapotranspiration in Ahvaz

Milad Sharafi *, Sina Besharat, Kamran Zeinalzadeh

Irrigation Sciences and Engineering, Volume:47 Issue: 2, 2024, PP 53 -68

Daily reference evapotranspiration prediction is a decisive and useful tool in sustainable agriculture and hydrological issues, especially in the design and management of water resources systems. The use of hybrid models with the help of climatic factors is an effective method in the daily reference evapotranspiration forecasting process. Therefore, in this study, the ability of the support vector regression model (SVR) and the combined model of support vector regression with the fruit fly algorithm (SVR-FOA) in estimating daily reference evapotranspiration in Ahvaz station during the period of 2000-2022 using four statistical criteria was evaluated. The inputs used included parameters of average temperature, minimum temperature, maximum temperature, average relative humidity, minimum relative humidity, maximum relative humidity, wind speed, and sunshine hours. The sensitivity analysis of the input parameters using Pearson's correlation coefficient also showed that among the input parameters, the parameters of sunshine hours and relative humidity were effective components in the prediction of evapotranspiration, thus reducing the error in all models. The obtained results showed that the sixth scenario of the SVR-FOA model provided the best performance with the lowest error (1.24 mm/day) compared to all models. Among the scenarios of the SVR model, the third scenario of the SVR model showed the lowest error (1.45 mm/day) compared to other SVR combinations. The results of this research showed that the sixth scenario of the SVR-FOA model had the best performance, and the fruit fly hybrid algorithm improved the performance of the support vector regression in estimating daily reference evapotranspiration.

Keywords: Prediction, Optimization, Fruit Fly Algorithm, Support Vector Regression, Ahvaz

پیش بینی تبخیر-تعرق مرجع روزانه با روش های مدل درختی، رگرسیون بردار پشتیبان و رگرسیون فرآیند گاوسی

سعید صمدیان فرد*، زهرا روستا، میلاد شرفی

نشریه دانش آب و خاک، سال سی و چهارم شماره 2 (تابستان 1403)، صص 156 -176

تبخیر-تعرق یکی از مهم ترین عوامل محدود کننده توسعه کشاورزی در مناطق خشک و نیمه خشک می باشد. به دلیل محدودیت های اقتصادی و سایر محدودیت ها همواره جمع آوری داده های تبخیر-تعرق چالش های فراوانی را برای محققان در پی داشته است. لذا هدف از مطالعه حاضر پیش بینی تبخیر-تعرق مرجع روزانه در دو ایستگاه آستارا و اصفهان با استفاده از مدل های رگرسیون فرآیند گاوسی، رگرسیون بردار پشتیبان، مدل درختی M5P و رگرسیون خطی M5Rules است. برای این منظور داده های هواشناسی روزانه ایستگاه ها شامل دما، رطوبت نسبی، سرعت باد و ساعات آفتابی طی دوره 2021-1990 به عنوان ورودی مدل ها به کار برده شد. بررسی پارامترهای ورودی نشان داد که رطوبت نسبی بیش ترین تاثیر را بر دقت پیش بینی مدل ها داشته است. همچنین جهت ارزیابی کارایی مدل ها از معیارهای ارزیابی مختلفی استفاده شد. ارزیابی مدل های به کار رفته در ایستگاه آستارا نشان داد که سناریو پنجم با کاربرد پارامتر های حداکثر دما، حداقل دما، میانگین دما، رطوبت نسبی حداکثر و رطوبت نسبی میانگین مدل های M5P و M5Rules با داشتن مقدار خطای (mm day-1) 42/1، بالاترین دقت را نسبت به سایر مدل ها داشته-اند. در ایستگاه اصفهان نیز سناریو هشتم مدل M5P و M5Rules با کاربرد پارامتر های حداکثر دما، حداقل دما، میانگین دما، رطوبت نسبی حداکثر و رطوبت نسبی میانگین ، رطوبت نسبی حداقل، ساعات آفتابی و سرعت باد با داشتن مقدار خطای (mm day-1) 86/1، بهترین عملکرد را نسبت به سایر مدل ها داشتند. لذا مدل های M5P و M5Rules با موفقیت تبخیر-تعرق مرجع را پیش بینی کرده و روابط ریاضی ساده مستخرج از آنها برای استفاده در تعیین نیاز آبی گیاهان توصیه میگردد.

کلید واژگان: پیش بینی، تبخیر-تعرق، رگرسیون بردار پشتیبان، آستارا، منابع آب

Prediction of daily reference evapotranspiration with M5P, Gaussian process regression and Support vector regression methods

Saeed Samadianfard *, Zahra Rousta, Milad Sharafi

Journal of Soil and water knowledge, Volume:34 Issue: 2, 2024, PP 156 -176

Background and Objectives

Indiscriminate use of water resources and the occurrence of drought in recent years have caused many problems in the country's water resources. The increasing shortage of water resources and high irrigation costs require developing new irrigation methods for optimal water consumption, which can minimize the amount of water used to produce yields. Evapotranspiration is one of the most important parameters needed to estimate the water balance in any ecosystem. Evapotranspiration is an essential parameter in the hydrological cycle process in natural ecosystems, which links the water and energy balance of the earth's surface with the atmosphere. Reference evapotranspiration (ET0) plays an important role in the availability of water resources and stimulating the hydrological effect of climate change. Accurate estimation of ET0 is necessary for forecasting climate changes, predicting and monitoring droughts, assessing the lack of availability of water resources, assessing crop water needs, and planning irrigation. FAO's Penman-Monteith method is known as a standard reference method for estimating ET0. However, this model and, in general, water balance-based assessment methods require accurate and long-term meteorological data, which are not always and everywhere available. Therefore, alternative methods for predicting ET0 at different temporal and spatial scales should be developed, which are easily applied and require fewer input data without compromising the estimation accuracy. Also, due to the high rate of evapotranspiration in the coastal and central stations of the country, so far, few studies have predicted the ET0 parameter. Therefore, this study was carried out to predict daily reference evapotranspiration in Isfahan and Astara stations.

Methodology

The current study is forecasting daily reference evapotranspiration in two stations of Astara and Isfahan using Gaussian Process Regression (GPR), Support Vector Regression (SVR), M5P tree model, and M5Rules linear regression. For this purpose, the daily meteorological data of the stations including average temperature, minimum temperature, maximum temperature, average relative humidity, minimum relative humidity, maximum relative humidity, wind speed, and sunshine hours during the period of 1990-2021 as inputs to the models was used. Also, to evaluate the effectiveness of the models, the evaluation criteria of determination coefficient (R2), root mean square error (RMSE), Nash-Sutcliffe coefficient (NS), and Wilmott's index of agreement (WI) were used.

Findings

The evaluation of the results of different scenarios of the GPR model in Astara station showed that the fifth scenario was recognized as the best scenario of this model due to having a lower error value (RMSE=1.52 mm day-1). For the M5Rules model, the fifth scenario has performed better than the other scenarios of the M5Rules model due to having fewer inputs and similar errors compared to the sixth to eighth scenarios (RMSE=1.42 mm day-1). In the M5P model, the fifth scenario has a higher accuracy than the other scenarios due to having a lower error value (RMSE=1.42 mm day-1). For the SVR model, the sixth scenario with the least error (RMSE=1.58 mm day-1) was selected as the best scenario compared to other scenarios of the SVR model. For the Isfahan station, for the GPR model, the fifth scenario has performed better than the other scenarios due to having fewer inputs. The comparison of M5Rules model scenarios also showed that the eighth scenario with RMSE=1.85 (mm day-1), had higher accuracy than other scenarios. The seventh scenario of the M5P model has performed better than other scenarios due to its RMSE=1.86 (mm day-1). Finally, the evaluation of SVR model scenarios showed that the eighth scenario with RMSE=1.88 (mm day-1) had a better performance than other scenarios.

Conclusion

The comparison of the models used to predict daily reference evapotranspiration in Astara station showed that the fifth scenario of M5P and M5Rules models having evaluation criteria of R2=0.76, RMSE=1.42 (mm day-1), NS=0.7 and WI=0.89 had the highest accuracy compared to other models and showed the best performance. Also, the evaluation of the results of the models in Isfahan station showed that the eighth scenario of the M5Rules model, having the evaluation criteria of R2=0.8, RMSE=1.85 (mm day-1), NS=0.8 and WI=0.94 had the best performance compared to other models and the M5Rules model was selected as the best model. Also, the seventh scenario of the M5P model had almost the same performance as the eighth scenario of the M5Rules model and showed a good performance. Therefore, M5P and M5Rules models successfully predicted reference evapotranspiration. One of the limitations of the present study is the lack of access to dew point temperature and solar radiation data. Therefore, the use of these parameters is suggested for further studies.

Keywords: Forecasting, Evapotranspiration, Support Vector Regression, Astara, Water Resources

برآورد تبخیر روزانه از تشت کلاس A با استفاده از پنج روش داده کاوی (مطالعه موردی: ایستگاه هواشناسی تبریز)

خدیجه سیف زاده، داود زارع حقی*، سعید صمدیان فرد، محمدرضا نیشابوری، فاطمه میکائیلی

نشریه دانش آب و خاک، سال سی و سوم شماره 4 (زمستان 1402)، صص 167 -184

تبخیر یکی از عوامل اثرگذار در چرخه هیدرولوژیکی است که تخمین صحیح آن نقش مهمی در توسعه پایدار و مدیریت بهینه منابع آب در کشورهای مواجه با بحران آب ایفا می کند. هدف از این پژوهش، ارزیابی عملکرد روش های داده کاوی جهت برآورد تبخیر روزانه از تشت کلاس A در ایستگاه تبریز می باشد. در این پژوهش از داده های هواشناسی روزانه ایستگاه تبریز در طی دوره 16 ساله (2018- 2003) استفاده گردید. برآورد میزان تبخیر از تشت کلاس Aبا استفاده از روش های رگرسیون بردار پشتیبان (SVR)، رگرسیون فرآیند گاوسی (GPR)، مدل درختی M5، جنگل تصادفی (RF) و رگرسیون خطی (LR) انجام گرفت. 10 سناریو ترکیبی بر اساس همبستگی بین متغیرهای هواشناسی و تبخیر برای واسنجی و صحتسنجی روش های مورد مطالعه مدنظر قرار گرفت. نتایج بررسی های آماری نشان داد که در ایستگاه تبریز، مقادیر تخمینی تبخیر روش GPR با جذر میانگین مربعات خطای برابر با 9/1 میلی متر بر روز و ضریب نش- ساتکلیف برابر با 81/0 و در روش SVR با جذر میانگین مربعات خطای برابر با 92/1 میلی متر بر روز و ضریب نش- ساتکلیف 80/0، از عملکرد مناسبی در شبیه سازی مقدار تبخیر روزانه از تشت کلاس Aبرخوردار بوده اند. در نهایت برای ایستگاه هواشناسی تبریز، مدل های GPR و SVR برای سناریو شماره 10 با همه متغیرها و دارا بودن بهترین عملکرد، به عنوان مدل هایی با دقت مناسب پیشنهاد گردید. همچنین متغیرهای سرعت باد و تابش خورشیدی به عنوان موثرترین متغیرها در برآورد میزان تبخیر از تشت کلاس A معرفی شدند.

کلید واژگان: تبخیر، جنگل تصادفی، رگرسیون بردار پشتیبان، رگرسیون خطی، رگرسیون فرآیند گاوسی

Estimation of Daily Evaporation from Class A Pan using Five Data Mining Methods (Case Study: Tabriz Synoptic Station)

Khadigeh Seifzadeh, Davoud ZAREHAGHI *, Saeed Samadianfard, Mohammad Reza Neyshabouri, Fatemeh Mikaeili

Journal of Soil and water knowledge, Volume:33 Issue: 4, 2023, PP 167 -184

Background and Objectives

Evaporation is one of the main components of hydrological cycle and one of the effective climatic variables in arid areas such as Iran. Accurate estimate of evaporation rate plays an important role in sustainable development and optimal management of water resources. Evaporation is one of the essential processes, because it depends on meteorological variables such as solar radiation, air temperature, wind speed, relative humidity and atmospheric pressure, which are related to the topography and the climate of the region. Class A pan-evaporation is one of the standard and direct tools for measuring evaporation, which is used all over the world due to its ease of application in determining evaporation. However, in most stations accurate evaporation recording is not practical due to instrument limitations and maintenance problems. On the other hand, the temporal and spatial distribution of evaporation stations compared to meteorological stations is limited, so according to the problems mentioned, the use of meteorological variables in estimating the rate of evaporation from the pan will be useful. In different regions, the impact of different climatic factors on changes evaporation from the pan has not be fully understood, so the relatively accurate estimation and prediction of this phenomenon is an effective step in the relevant fields. In recent years, for estimating the amount of evaporation from the pan, a variety of intelligent systems and software calculations such as data mining methods have been developed.

Methodology

In this study, meteorological data of Tabriz station in the period of 2003 to 2018 have been used to estimate the evaporation values from the class A pan. For this purpose, a simple correlation between meteorological variables and evaporation from class A pan was created and based on the result of this correlation, in the studied station the minimum temperature and relative humidity were inversely and the maximum and average temperature were directly affected by evaporation. Thus, ten combined scenarios were defined and modeling was performed using Support vector regression (SVR), Gaussian process regression (GPR), M5tree, Random forest (RF) and Linear regression (LR) methods. It should be noted that in this study, 70% of the data were selected for training and 30% for testing. Finally, the performance of each method in estimating evaporation values was evaluated using root mean squared error (RMSE), mean absolute error (MAE), Nash- Sutcliffe coefficient (NS) and Akaike information criterion (AIC).

Findings

The results showed that GPR10 method with RMSE = 1.90 mm/day, MAE = 1.48, NS = 0.81 and SVR10 method with RMSE = 1.92 mm/day, MAE = 1.51, NS = 0.8 had reasonable performance in estimating the values of daily evaporation from class A pan. The GPR method showed its higher capability to estimate daily evaporation values in all definition scenarios with the least error and the most accuracy. The SVR model with appropriate results was in the second place. The results of statistical parameters for random forest model were even weaker than the results of linear regression. In general, scenario number 10 with all meteorological variables and scenario number 1 with only the input minimum temperature variable had the best and weakest results among all defined scenarios, respectively. Scenarios 6 to 10 have more accuracy and less error and modeling structures with the least number of variables has the least accuracy. Also, wind speed and solar radiation variables were introduced as the most effective factors in estimating the evaporation rate from class A pan.

Conclusion

Evaporation is one of the important processes that cause the losses of half of precipitation in arid and semi- arid regions. Accordingly, knowledge of the amount of evaporation and its modeling as one of the most important hydrological variables in agricultural research and factors related to water and soil of great importance. So, accurate estimation of this phenomenon is essential. In this study, meteorological data from Tabriz station were utilized to assessment capability of machine learning methods. Evaporation values were estimated using five data mining methods including SVR, GPR, M5, RF and LR. Conclusively, the results of evaluation criteria indicated that GPR and SVR models using all variable of meteorological data performed more accurate than others. Finally, both of them are recommended to estimate the amount of evaporation from class A pan.

Keywords: Evaporation, Gaussian process regression, Linear Regression, Random forest, Support Vector Regression

برآورد سرعت آستانه فرسایش بادی با استفاده از تونل باد همراه با الگوریتم های یادگیری ماشین

منیره مینا، عبدالمجید ثامنی، سید علی اکبر موسوی*، یعقوب قنبری

مجله تحقیقات آب و خاک ایران، سال پنجاه و چهارم شماره 6 (پیاپی 90، شهریور 1402)، صص 933 -947

فرسایش بادی یک عامل تخریب زمین در سراسر جهان به ویژه در مناطق خشک و نیمه خشک ایران است. این پدیده تحت تاثیر ویژگی های خاکی زیادی قرار دارد. هدف اصلی مطالعه حاضر برآورد سرعت آستانه فرسایش بادی با استفاده از ویژگی های قابل اندازه گیری زودیافت خاک همراه با روش های داده کاوی بود. برای این منظور، سرعت آستانه فرسایش بادی در 100 منطقه در استان فارس با استفاده از تونل باد قابل حمل اندازه گیری شد. سرعت آستانه فرسایش بادی توسط الگوریتم رگرسیون بردار پشتیبان با استفاده از ویژگی های قابل اندازه گیری زودیافت پیش بینی شد. در همین راستا، به منظور دستیابی به مجموعه ویژگی های موثر در برآورد سرعت آستانه فرسایش بادی، از الگوریتم ژنتیک استفاده شد. نتایج نشان داد که ویژگی های رطوبت خاک (77/0 =r)، توزیع اندازه ذرات خاک از جمله میانگین وزنی قطر خاکدانه ها (87/0 = r) و جزء فرسایش پذیر خاک (81/0- =r)، مقاومت فروروی (75/0 = r) و ماده آلی (33/0 =r) همبستگی زیاد و معنی داری با سرعت آستانه فرسایش بادی داشتند و همچنین در تعیین سرعت آستانه فرسایش بادی در منطقه نقش کلیدی دارند. با توجه به معیارهای ارزیابی، مدل تلفیقی رگرسیون بردار پشتیبان به همراه الگوریتم ژنتیک بهترین عملکرد و دقیق ترین برآورد را برای سرعت آستانه فرسایش بادی داشته است (53/0 = RMSE و 92/0 = R2) و می تواند یک روش امیدوار کننده برای برآورد سرعت آستانه فرسایش بادی باشد.

کلید واژگان: الگوریتم ژنتیک، توزیع اندازه ذرات، رگرسیون بردار پشتیبان، فرسایندگی خاک، مقاومت فروروی

Prediction of Wind Erosion Threshold Velocity Using Portable Wind Tunnel Combined with Machine Learning Algorithms

Monireh Mina, Abdolmajid Sameni, Ali Akbar Moosavi *, Yaghoub Ghanbari

Iranian Journal of Soil and Water Research, Volume:54 Issue: 6, 2023, PP 933 -947

Wind erosion is a key process in land degradation worldwide, especially in arid and semi-arid regions of Iran. This phenomenon is affected by many soil characteristics. The main objective of this study was to estimate the wind erosion threshold velocity using easily measurable soil characteristics along with data mining methods. For this purpose, wind erosion threshold velocity was measured in 100 areas in Fars province using a portable wind tunnel. Wind erosion threshold velocity was predicted by a support vector regression algorithm using easily measurable soil properties. In this regard, a genetic algorithm was used in order to obtain a set of parameters effective in estimating wind erosion threshold velocity. The results showed that the characteristics of soil moisture (r = 0.77), the size distribution of soil particles including the mean weight diameter of aggregate (r = 0.87) and the wind-erodible fraction of soils (r = -0.81), penetration resistance (r = 0.75), and organic matter (r = 0.33) have a high and significant correlation with wind erosion threshold velocity and play a key role in determining the threshold velocity of wind erosion in the region. According to the evaluation criteria, the combined support vector regression model with the genetic algorithm had the best performance and the most accurate estimate for wind erosion threshold velocity (RMSE = 0.53 and R2 = 0.92) and can be a promising method for estimation of wind erosion threshold velocity.

Keywords: Genetic Algorithm, Particle size distribution, penetration resistance, Soil Erodibility, Support Vector Regression

بررسی قابلیت های رویکرد یادگیری ماشین در پیش بینی جریان سطحی روزانه با استفاده از برخی داده های هواشناسی و شاخص تفاضلی نرمال شده برف (مطالعه موردی: حوضه آبخیز لتیان و ناورود)

محبوبه فلاح، حسینعلی بهرامی*، حسین اسدی

مجله تحقیقات آب و خاک ایران، سال پنجاه و سوم شماره 5 (پیاپی 77، امرداد 1401)، صص 1127 -1144

پیش بینی دقیق جریان سطحی برای مدیریت منابع آب به ویژه پیش بینی سیل و فرسایش خاک بسیار مهم است. در مطالعه حاضر، قابلیت سه روش یادگیری ماشین (ML) شامل رگرسیون بردار پشتیبان (SVR)، شبکه عصبی مصنوعی با پس انتشار خطا (ANN-BP) و رگرسیون تقویت گرادیان (GBR) با استفاده از داده های هواشناسی و پوشش برف سنجنده MODIS برای پیش بینی جریان سطحی روزانه در دو حوضه مختلف لتیان و ناورود بررسی شد. برای توسعه مدل، چهار متغیر اصلی شامل باران روزانه (P)، دمای حداکثر (Tmax) ، دمای حداقل (Tmin) و شاخص تفاضلی نرمال شده برف (NDSI) از سنجنده MODIS در طول سال های 1379-1397 استفاده شد. کارایی این مدل ها با استفاده از شاخص های آماری مورد ارزیابی قرار گرفت. نتایج شبیه سازی نشان داد که همه مدل ها نتایج رضایت بخشی را در شبیه سازی جریان سطحی روزانه با استفاده از متغیرهای هواشناسی به عنوان پارامترهای ورودی مدل ها ارایه کردند. همچنین، کارایی همه مدل های ML مورد مطالعه، زمانی که شاخص NDSI به عنوان متغیر تخمین گر در شبیه سازی اعمال شد، بهبود یافت. بهترین کارایی را در بین تمام مدل های مورد مطالعه در هر دو حوضه، مدل GBR نشان داد. مدل SVR پایین ترین کارایی را در پیش بینی جریان سطحی روزانه برای هر دو مرحله آموزش و اعتبارسنجی در اکثر موارد نشان داد. به طور کلی، نتایج شبیه سازی در حوضه لتیان نسبت به حوضه ناورود در هر دو مرحله آموزش و اعتبارسنجی بهتر بود و نسبت به دو مدل دیگر، بهترین کارایی در مدل GBR با ضریب همبستگی (85/0=R)، ضریب کارایی نش-ساتکلیف 72/0=NS و جذر میانگین مربعات خطا (m3/s43/3(RMSE= با استفاده از شاخص NDSI در حوضه لتیان مشاهده شده است که نشان دهنده تاثیر زیاد ذوب برف در ایجاد جریان سطحی در مناطق برف خیز است.

کلید واژگان: شبکه های عصبی مصنوعی، مدل یادگیری ماشین، شاخص تفاضلی نرمال شده برف، رگرسیون بردار پشتیبان

Investigating Capabilities of Machine Learning Techniques in Forecasting Daily Streamflow Using Some Meteorological Data and Normalized Difference Snow Index (Case Study: Latian and Navroud Basins)

Mahboobeh Fallah, Hosseinali Bahrami *, Hossein Asadi

Iranian Journal of Soil and Water Research, Volume:53 Issue: 5, 2022, PP 1127 -1144

Accurate prediction of streamflow is crucial for water resources management, especially for the prediction of floods and soil erosion. In the current study, the capability of three machine learning (ML) methods, including Support Vector Regression (SVR), Artificial Neural Network with Backpropagation (ANN-BP), and Gradient Boosting Regression (GBR) was investigated using meteorological observations and MODIS snow cover data to forecast daily streamflow in two different basins, namely Latian and Navroud. For model development, four major predictors, including daily rainfall (P), maximum temperature (Tmax), minimum temperature (Tmin), and the Normalized Difference Snow Index (NDSI) from the MODIS satellite were used from 2000 to 2018. The performance of these models was evaluated using statistical indices. Simulation results revealed that all models presented satisfactory results in simulating daily streamflow using meteorological predictors, and the efficiency of all applied models was improved when the NDSI index was applied as an additional predictor. The best performance was observed in GBR among all studied models in both basins, whereas SVR revealed the lowest performance in forecasting streamflow for both validation and calibration steps in most cases. In general, the simulation results demonstrated higher accuracy in Latian basin than Navroud basin in both calibration and validation steps. The best performance among all models was observed using GBR with R = 0.85, NS=0.72, and RMSE = 3.43 m3/s using the NDSI index in Latian basin indicating the significant effect of snowmelt on streamflow generation in snowmelt-dominated regions.

Keywords: Artificial Neural Networks, Machine Learning Model, NDSI, Support Vector Regression

بررسی رطوبت سطح خاک شهرستان اردبیل با استفاده داده های ماهواره ای لندست 8 و سنتیل 1

صیاد اصغری سراسکانرود*، فریبا اسنفدیاری درآباد، الهام ملانوری، شیوا صفری

فصلنامه حفاظت منابع آب و خاک، سال یازدهم شماره 4 (پیاپی 44، تابستان 1401)، صص 89 -103

زمینه و هدف

رطوبت سطحی خاک، متغیری مهم در چرخه آبی طبیعت بوده و می تواند تحت تاثیر عوامل مختلفی از جمله دما و مشخصات خاک قرار گیرد. استفاده از سنسوهای زمین برای اندازه گیری رطوبت خاک منجر به صرف زمان و توزیع نامناسب نمونه ها در مقیاس های بزرگ شود بنابراین سنجش ازدوری می تواند ابزار مهمی در برآورد رطوبت خاک باشد. هدف پژوهش حاضر استفاده از مدل TOTRAM با استفاده از تصاویر لندست 8 و روش SVR با استفاده از تصاویر سنتیل1 برای برآورد رطوبت خاک می باشد.

روش پژوهش

شهرستان اردبیل به عنوان مرکز استان اردبیل در شمال غرب کشور واقع است. در مطالعه حاضر برای استخراج رطوبت خاک از دو روش TOTRAM بر مبنای توزیع پیکسل در فضای LST-VI و روش SVR با استفاده از تکنیک SAR و داده سنتینل 1 استفاده شده است. جهت پیاده سازی روش TOTRAM تصاویر لندست 8 مرتبط با تاریخ های 29/4/1398 و 30/05/1398 دانلود و پس از استخراج نقشه های NDVI و LST، اقدام به بررسی همبستگی بین متغیر وابسته رطوبت و متغیرهای مستقل دما و پوشش گیاهی با استفاده از رگرسیون وزن دار جغرافیایی (GWR) شده است. برای اجرای روش SVR پس از دستیابی به تصاویر سنتینل 1 مربوط به تاریخ های 31/05/1398 و 27/04/1398، داده های رطوبت خاک محصول FLDAS و محصول 500 متری سالانه ماهواره مودیس (MCD12Q1) جهت طبقه بندی پوشش اراضی در سامانه Google Earth engine فراخوانی شدند و نقشه های مرتبط با رطوبت خاک استخراج شد. پس از استخراج نقشه های رطوبت نحوه ی توزیع رطوبت با استفاده از شاخص محلی موران بررسی شده است. بر طبق تعریف این شاخص مقادیر مثبت یک برای این شاخص نشان دهنده ی خوشه ای بودن توزیع خواهد بود.

یافته ها

بررسی نقشه رطوبت حاصل از روش SVR تمرکز رطوبت در مناطقی با حضور پوشش گیاهی و آب را نشان داد و تغییر وضعیت رطوبت از تیر به مرداد قابل مشاهده بوده است. الگوی رطوبت انعکاس الگوی بارشی را نشان داده است به طوری که حداکثر بارش و رطوبت در فروردین بوده و در تابستان هر دو مولفه ی بارش و رطوبت کاهش داشته اند. بررسی روش TOTRAM و اعمال روش GWR همبستگی کامل NDVI-LST و رطوبت را نشان داد. البته همبستگی بین LST و رطوبت با مقادیر (بتا) B و خطای استاندارد (SE) 995/0 و صفر متناسب با مرداد و 981/0 و صفر متناسب با تیرماه بیشترین همبستگی را نسبت به متغیر پوشش گیاهی با پارامتر وابسته ی رطوبت نشان داده است که این همبستگی در مرداد ماه با افزایش مقدار ضریب تعیین R2 به 997/0 و کاهش معنی داری NDVI به مقدار 415/0 در تیرماه به مراتب بیشتر شده است. اعمال شاخص محلی موران با مقادیر کمتر از 0.05 برایp-value و مقادیر مثبت z و عدد نزدیک مثبت یک برای شاخص موران خوشه ای بودن توزیع متغیر رطوبت را نشان داده است.

نتایج

بررسی نتایج روش های TOTRAM و SVR وابستگی وضعیت رطوبت خاک به شرایط و خوشه ای بودن توزیع رطوبت را نشان داد. با توجه به ضرایب همبستگی حاصل از رگرسیون وزن دار جغرافیایی همبستگی بیشتری بین متغیر دما و رطوبت به ویژه در مرداد ماه به دلیل کاهش تراکم پوشش گیاهی مشاهده شده است. بررسی نقشه های الگوریتم SVR نشان داد در مناطقی با حضور پوشش گیاهی و بخصوص تراکم آن شاهد افزایش و با افزایش دما شاهد کاهش رطوبت هستیم. همچنین هماهنگی الگوی های رطوبت الگوریتم SVR و بارش رابطه مستقیم بین رطوبت و بارش را نشان داد. با توجه به اینکه روش SVR از تصاویر سنتینل 1 و پارامترهایی نظیر شدت پراکنش رادار و طبقه بندی پوشش اراضی استفاده می کند می توان انتظار نتایج دقیق تری از این الگوریتم داشت.

کلید واژگان: دمای سطح زمین، رگرسیون بردارپشتیبان، شاخص تفاضلی نرمال شده پوشش گیاهی، TOTRAM

Investigation of Soil Surface Moisture in Ardabil City Using Landsat 8 and Sentile 1 satellite Data

Sayyad Asghari Saraskanrod *, Fariba Esfandayari Darabad, Elham Mollanouri, Shiva Safary

Journal of Water and Soil Resources Conservation, Volume:11 Issue: 4, 2022, PP 89 -103

Background and Aim

Surface soil moisture is an important variable in nature's water cycle and can be affected by various factors, including temperature and soil characteristics. The use of ground sensors for measuring moisture can lead to spending time and expense and inappropriate distribution of samples on large scales. Therefore, Remote sensing observations can be an important tool in estimating soil moisture. The present study aims to use the TOTRAM model using Landsat 8 images and the SVR method using Sentile 1 images to estimate soil moisture.

Methods

In the present study, two TOTRAM methods based on pixel distribution in LST- VI space and the SVR method were used to extract soil moisture using the SAR technique and Sentinel 1 data. To implement the TOTRAM method, Landsat 8 images related to 4/29/1398 and 5/30/1398 are downloaded and after extracting NDVI and LST maps, The correlation between the dependent variable of moisture and independent temperature variables and vegetation variables has been investigated using Geographically weighted regression (GWR). To implement the SVR method after acquiring Sentinel 1 images related to 31/05/1398 and 27/04/1398, Soil Moisture Data Product FLDAS and 500 meters product of Modis Satellite (MCD12q1) were called to classify land cover in the Google Earth Engine system, and maps related to soil moisture were extracted. After extracting the moisture maps the distribution of moisture using the local Moran index has been investigated. By defining this index, positive values for this index represent the cluster of distribution.

Results

Examination of the soil moisture map obtained by the SVR method showed the concentration of moisture in areas with vegetation and water and the change in moisture status from July to August was visible. The humidity pattern has shown the reflection of the precipitation pattern so that maximum precipitation and humidity were observed in April and in summer both precipitation and humidity components decreased. Examination of the TOTRAM method and application of the GWR method has shown a complete correlation between NDVI LST and moisture. However, the correlation between LST and humidity with B (values) and standard error (SE) of 0.995 and zero corresponding to July and 0.981 and zero corresponding to August showed the highest correlation with vegetation variable with moisture dependence parameter, which this correlation In August, with increasing the coefficient of determination of R2 to 0.997 and a significant decrease of NDVI to the value of 0.415 in July, it has increased much more. Application of Moran local index with values less than 0.05 for p-value and positive values for z and near positive number 1 for Moran index showed the cluster distribution of moisture variable.

Conclusion

The results of TOTRAM and SVR methods showed the dependence of soil moisture status on conditions and cluster moisture distribution. According to the correlation coefficients of geographical regression, there is a greater correlation between temperature and humidity variables, especially in August, due to the decrease in vegetation density. The results of the SVR algorithm maps showed that in areas with the presence of vegetation, especially dense vegetation, we see an increase and with increasing temperature, we see a decrease in humidity. Also, the coordination of moisture patterns of the SVR algorithm and precipitation showed a direct relationship between moisture and precipitation. Considering that the SVR method uses parameters such as radar scattering intensity and land cover classification, as well as the use of Sentinel 1 radar images by this algorithm, more accurate results can be expected from this algorithm.

Keywords: LST, NDVI, support vector regression, TOTRAM

ترکیب مدل محاسبات نرم مبتنی بر الگوریتم یادگیری ماشین و تحلیل مولفه-های اصلی جهت پیش بینی بارندگی

لاله پرویز

مجله مهندسی آبیاری و آب ایران، پیاپی 47 (بهار 1401)، صص 283 -301

تاثیر تغییرات بارندگی بر منابع آب، تولیدات کشاورزی نیاز به روش کارآمدی جهت پیش بینی بارندگی را آشکار می سازد. در این تحقیق یکی از روش های محاسبات نرم در راستای پیش بینی بارندگی با رویکرد کاهش داده توسعه داده شد. داده های ورودی مدل متوسط دمای هوا، دمای نقطه شبنم، متوسط فشار سطح دریا، متوسط فشار ایستگاه، میانگین رطوبت نسبی و میانگین سرعت باد در ایستگاه های تبریز، اهر و جلفا بودند. روش مورد استفاده در این تحقیق شامل رگرسیون بردار پشتیبان، Epsilon و Nu ،می باشد. در تمام ایستگاه های مورد مطالعه استفاده از رگرسیون بردار پشتیبان Nu نسبت به Epsilon منجر به کاهش خطا شد به طوری که مقادیر UII با رگرسیون بردار پشتیبان Nu در ایستگاه های تبریز، اهر و جلفا به ترتیب 19/19، 88/5 و78/15 درصد کاهش داشت. نتایج بیانگر محدویت استفاده از رویکرد کاهش داده برای داده هایی با فاکتور KMO پایین تر از 5/0 است که شامل ایستگاه های تبریز و اهر بودند. تحلیل مولفه های اصلی در هر دو نوع رگرسیون بردار پشتیبان عملکرد مدل را افزایش داد به طوری که در ایستگاه جلفا با بکارگیری تحلیل مولفه های اصلی مقادیر dدر رگرسیون بردار پشتیبان Epsilon و Nu 6/16 و 5/17 درصد افزایش یافت. اجرای چرخش وریماکس در پیش پردازش داده های ورودی به رگرسیون نسبت به تحلیل مولفه های اصلی قوی تر عمل کرد. در این راستا مقادیرRRMSE وRMSE در ایستگاه جلفا با استفاده از رگرسیون بردار پشتیبان Epsilon و با اجرای چرخش به ترتیب 66/6 و 45/6 درصد کاهش داشت. بنابراین تحلیل مولفه های اصلی ابزار مناسبی جهت ارتقاء عملکرد روش های محاسبات نرم با رعایت قیود می باشد.

کلید واژگان: بارندگی، تحلیل مولفه های اصلی، رگرسیون بردار پشتیبان، فاکتور KMO

Combining soft Computing Model Based on Machine Learning Algorithm and Principal Component Analysis for Precipitation Forecasting

Laleh Parviz

Irrigation & Water Engineering, Volume:12 Issue: 47, 2022, PP 283 -301

The effect of precipitation changes on water resources, agricultural production reveals the need for accurate methods for precipitation forecasting. In this research, one of the soft computing methods was developed in order to forecast precipitation with the data reduction approach. Input data of model was mean air temperature, dew point temperature, mean sea level pressure, mean station pressure, mean relative humidity and mean wind speed at Tabriz, Ahar and Jolfa stations. The method used in this study includes Epsilon and Nu support vector regression. In all studied stations, the use of Nu support vector regression compared to Epsilon reduced the error so that UII values with Nu support vector regression in Tabriz, Ahar and Jolfa stations were decreased 19.19, 5.88 and 15.78%, respectively. The results indicate the limitation of using the data reduction approach for data with KMO factor lower than 0.5, which included Tabriz and Ahar stations. Principal component analysis in both types of support vector regression increased the performance of the model so that in Jolfa station by using principal component analysis d values of Epsilon and Nu support vector regression increased by 16.6 and 17.5%. Execution of Verimax rotation in preprocessing of input data to regression was stronger than principal component analysis. In this regard, RRMSE and RMSE values in Jolfa station using Epsilon support vector regression were decreased 6.66 and 6.45%. Therefore, principal component analysis is a suitable tool to improve the performance of soft computing methods by regarding the relevant constraints.

Keywords: Precipitation, Support vector regression, KMO factor, Principal component analysis

استفاده از توابع انتقالی و طیفی خاک در برآورد ظرفیت تبادل کاتیونی خاک های آهکی استان فارس

منیره مینا، مهروز رضایی*، عبدالمجید ثامنی، سید علی اکبر موسوی، سید رشید فلاح شمسی

مجله تحقیقات آب و خاک ایران، سال پنجاه و دوم شماره 11 (پیاپی 71، بهمن 1400)، صص 2911 -2922

ظرفیت تبادل کاتیونی (CEC) از مهم ترین ویژگی های شیمیایی خاک است که در حاصلخیزی خاک نقش بسزایی دارد. با این حال، روش های استاندارد آزمایشگاهی برای اندازه گیری CEC دشوار، هزینه بر و زمان بر است. این پژوهش با هدف برآورد CEC خاک با به کارگیری روش های نوین مانند 1) توابع انتقالی خاک (PTF) بر اساس ویژگی های زودیافت خاک با استفاده از رگرسیون خطی چندگانه (MLR)، 2) طیف سنجی خاک (,Vis - NIR2500 تا 400 نانومتر) با استفاده از رگرسیون حداقل مربعات جزیی (PLSR) و رگرسیون بردار پشتیبان (SVR) انجام شده است. همچنین با استفاده از نتایج تجزیه و تحلیل ضریب رگرسیون، طول موج های کلیدی برای برآورد CEC معرفی شدند. برای این منظور، CEC در آزمایشگاه با استفاده از روش سدیم استات برای 72 نمونه خاک جمع آوری شده از خاک های آهکی استان فارس اندازه گیری شد و بازتاب طیفی نمونه های خاک با دستگاه طیف سنج اندازه گیری شد. این روش ها از مجموعه واسنجی (70% داده ها) ساخته شده و با مجموعه اعتبارسنجی (30% داده ها) ارزیابی شدند. نتایج نشان داد دقت نتایج روش طیف سنجی از دقت نتایج PTF بیشتر بود. در این پژوهش، طول موج های 566، 854، 1354، 1418، 1906، 2071، 2203، 2319 و 2341 نانومتر به عنوان طول موج های کلیدی برای برآورد CEC خاک به دست آمد. مدل SVR در مقایسه با PLSR عملکرد بهتری داشت. به طور کلی این پژوهش نشان داد که روش طیف سنجی (Vis- NIR) یک روش امیدوارکننده برای برآورد CEC خاک می باشد.

کلید واژگان: رگرسیون بردار پشتیبان، رگرسیون حداقل مربعات جزئی، رگرسیون خطی چندگانه، طول مو ج های کلیدی

Using Soil Pedotransfer and Spectrotransfer Functions to Estimate Cation Exchange Capacity in Calcareous Soils, Fars Province

Monireh Mina, Mahrooz Rezaei *, Abdolmajid Sameni, AliAkbar Moosavi, RASHID FALLAH SHAMSI

Iranian Journal of Soil and Water Research, Volume:52 Issue: 11, 2022, PP 2911 -2922

Cation exchange capacity (CEC) is one of the most important soil chemical properties that plays an important role in soil fertility. However, standard laboratory methods for measuring CEC are difficult, costly, and time-consuming. The aim of this study was to use new methods such as 1) Pedotransfer Functions (PTF) based on the basic soil properties using Multiple Linear Regression (MLR), 2) soil spectroscopy (Vis–NIR, 400 – 2500 nm) using Partial Least Squares Regression (PLSR), and Support Vector Regression (SVR), for estimating soil CEC. Also, from the regression coefficient analysis, key wavelengths were introduced. For this purpose, CEC was measured using the sodium acetate method for 72 soil samples collected, and spectral reflection of soil samples was determined using spectroscopy. These methods are made from a calibration set (70% of data) and evaluated with a validation set (30% of data). The results showed that Vis - NIR method performed better than PTF. In this study, wavelength ranges around 566, 854, 1354, 1418, 1906, 2071, 2203, 2319, and 2341 nm were investigated as the key wavelengths for estimation of CEC. Furthermore, the results of prediction models showed that SVR has a better performance than PLSR. This study proved that Vis-NIR is a promising method for soil CEC estimation.

Keywords: Support Vector Regression, Partial least squares regression, Multiple Linear Regression, Key Wavelength

اثرات تغییر اقلیم مدل CMIP5 در شبیه سازی مخزن سد جامیشان تحت سناریو تغییر الگوی کشت

بنفشه رحیمی، مریم حافظ پرست*

نشریه فناوری های پیشرفته در بهره وری آب، سال یکم شماره 1 (زمستان 1400)، صص 1 -23

امروزه اثرات تغییر اقلیم و گرمایش جهانی به دلیل افزایش گاز های گلخانه ای در جهان به اثبات رسیده است. در این تحقیق مقادیر ماهیانه دما، بارش و دبی سد جامیشان در سال های 2017-1988 به عنوان دوره پایه در نظر گرفته شده است. شبیه سازی منابع و مصارف حوضه آبریز سدجامیشان در مدل WEAP با الگوهای کشت موجود در منطقه مورد بررسی قرار گرفت. از خروجی های سناریو RCP8.5 مدل HADGEM2_ES از سری مدلهای گزارش پنجم CMIP5 استفاده شد و خروجی این مدل برای منطقه مورد نظر ریزمقیاس شد. با استفاده از روش عامل تغییر داده های مدل اقلیمی ریزمقیاس شد و پارامترهای ماهانه دما و بارش سد جامیشان برای دوره ی 2050-2021 تولید گردید.تغییرات رواناب منطقه مورد نظر درمدل های SVM، GEP و IHACRES پیش بینی گردید. نتایج مدل اقلیمی به طور میانگین افزایش دمای 5/1 درجه سلسیوس و افزایش بارش 5 درصد را نشان می دهد. نتایج حاصل از پیش بینی دبی در هر سه مدل حاکی از کاهش رواناب است که بیشترین کاهش رواناب مربوط به SVM با 6/21 درصد و کمترین کاهش رواناب مربوط به IHACRES با 4 درصد می باشد مدل های IHACRES و GEP نسبت به روش SVR از دقت مطلوب تری برخوردار هستند. شبیه سازی منابع و مصارف درالگوهای کشت مختلف در WEAP نشان می دهد بیش ترین تامین برابر با 9/74 درصد در مدل GEP و کمترین تامین برابر با 70 درصد در مدل SVR که به ترتیب مربوط به الگوی کشت یک و الگوی کشت سه می باشد.

کلید واژگان: بارش- رواناب، ماشین بردار پشتیبان، برنامه ریزی بیان ژن، ویپ، الگوی کشت

Impact of cmip5 climate change model in Jamishan dam reservoir simulation under cropping pattern change scenario

Banafshe Rahimi, Maryam Hafezparast *

Journal of Advanced Technologies in Water Efficiency, Volume:1 Issue: 1, 2022, PP 1 -23

In this research, monthly temperature, precipitation and discharge values of Jamishan Dam in 1988-2017 have been considered as the base period. Simulation of resources and uses of Dam-d-Gomishan basin in WEAP model with existing cropping patterns in the region was investigated. To evaluate the effect of climate change on precipitation and temperature parameters in this region, the outputs of the RCP8.5 scenario of the HADGEM2_ES models were used from the cmip5 fifth report series and the output of these models was downscaled for the desired area. In this study, using the method of change factor downscaling the climatic model and monthly temperature and precipitation parameters of Jamishan Dam for the period 2021-2050 were generated. In order to investigate the runoff of the area due to climate change, SVM, GEP and IHACRES models were investigated and compared. The results of the climatic model show an average temperature increase of 1.5 °C and an increase in precipitation of 5%. In general, results of discharge forecasting in all three models indicate a decrease in runoff that the highest runoff reduction was related to SVM with 21.6% and the lowest runoff reduction was related to IHACRES with 4%. In this study, IHACRES and GEP models have more favorable accuracy than SVR method. Simulation of different cropping patterns in WEAP shows that the highest supply is equal to 74.9% in GEP model and the lowest supply is equal to 70% in SVR model which is related to cropping pattern 1 and cropping pattern 3, respectively.

Keywords: Rainfall-runoff, Support vector regression, Genetic programming, WEAP, Crop pattern

اثر پیش پردازش و کاهش ابعاد ورودی مدل پیش بینی دبی بر عملکرد مدل رگرسیون بردار پشتیبان بهینه شده الگوریتم ژنتیک

سید احسان فاطمی*، صدیقه دارابی چغابلکی، مریم حافظ پرست

نشریه فناوری های پیشرفته در بهره وری آب، سال یکم شماره 1 (زمستان 1400)، صص 24 -45

پیش بینی دقیق و صحیح جریان آب های سطحی در برنامه ریزی اصولی و مدیریت صحیح منابع آب نقش بسزایی دارد، برای دست یافتن به این مهم مدل های پیش بینی مختلف که با استفاده از روابط ریاضی بر پایه اطلاعات هیدرولوژی بنا شده اند، می توانند داده های مورد نیاز را با دقت کافی پیش بینی کنند. در این مطالعه از داده های دبی جریان ماهانه ایستگاه هیدرومتری پل چهر در یک دوره آماری 48 ساله (1397شهریور-1350مهر) استفاده شد. دو سناریوی اصلی با و بدون اعمال پیش پردازش (استانداردسازی) با دو رویکرد سری زمانی یا غیرسری زمانی بررسی شد. همچنین از الگوریتم جنگل تصادفی برای بررسی کاهش ابعاد ورودی مدل استفاده شد. در هر سناریو معیارهای ارزیابی مدل تغییرات واریانس، ضریب تبیین و مجذور مربعات خطا محاسبه گردید. در همه حالت ها به ترتیب 80 و 20 درصد داده ها برای آموزش و تست مدل در نظر گرفته شده است. مدل نوشته شده به زبان برنامه نویسی پایتون است. از الگوریتم ژنتیک برای بهینه سازی پارامترهای روش رگرسیون بردار پشتیبان استفاده شد. نتایج نشان داد که ابتدا استاندارد سازی سپس درنظر نگرفتن توالی سری زمانی داده ها، کاهش ابعاد ورودی مدل نیز استفاده از الگوریتم ژنتیک برای بهینه سازی پارامترهای مدل رگرسیون بردار پشتیبان به ترتیب بیش ترین اثر بر دقت پیش بینی را دارد به طوریکه بالاترین ضریب تبیین برای داده های آموزش برابر 85/0 و برای تست معادل6/0 است. چنانچه عمل استاندارسازی داده ها صورت نگیرد منظور نمودن رویکرد سری زمانی و کاهش ابعاد ورودی مدل منجر به نتایج بهتری در پیش بینی مدل SVR خواهد شد و استفاده از بهینه ساز الگوریتم ژنتیک نسبت به مدل ساده آن تاثیر معنی داری بر بهبود نتایج خواهد داشت.

کلید واژگان: پیش بینی دبی، ماشین بردار پشتیبان، الگوریتم ژنتیک

The effect of preprocessing and reducing the input dimensions of the flow prediction model on optimized support vector regression by genetic algorithm

Seyed Ehsan Fatemi *, Sedigheh Darabi Cheghabaleki, Maryam Hafezparast

Journal of Advanced Technologies in Water Efficiency, Volume:1 Issue: 1, 2022, PP 24 -45

Accurate and correct prediction of surface water flow plays an important role in the principled planning and proper management of water resources. To achieve this, various prediction models using mathematical relationships based on hydrological information can be used. In this study, monthly discharge of Polechehr hydrometric station for a 48-year has been used (Sep. 2018-October 1971). Two main scenarios with and without pre-processing (standardization), two time series or non-time series approaches were considered. Also, two cases with and without feature selection have been considered by a random forest algorithm. In all cases, 80% and 20% of the data are intended for model training and testing, respectively. The entire coding process is done in the Python programming platform. Genetic algorithm was used to optimize the parameters of the support vector regression method. The results showed that standardization, non-time series approach, reducing the dimensionality of the model input to select and also using genetic algorithm to optimize the parameters of the support vector regression model have the greatest effect on prediction accuracy, respectively. So that the highest coefficient of explanation for training data is 0.85 and for testing is equal to 0.6.If standardization is not applying on the data, adopting a time series approach and feature selection will lead to better results in predicting the SVR model, and also the use of genetic algorithm optimizer compared to the simple model will have a significant effect on improving results.

Keywords: Discharge Forecasting, Genetic Algorithm, Support vector regression

برآورد ضریب آبدهی سرریزهای کلید پیانویی انحنادار با استفاده از ترکیب رگرسیون بردار پشتیبان و الگوریتم های ملخ و کرم شب تاب

حجت کرمی*، علیرضا رضایی آهوانویی

مجله مهندسی آبیاری و آب ایران، پیاپی 46 (زمستان 1400)، صص 186 -202

سرریزهای کلیدپیانویی نوع جدیدی از سرریزها هستند که در جهت افزایش ظرفیت تخلیه سدها و کانال ها طراحی می شوند. در صورتی که کلیدهای تشکیل دهنده این مدل سرریز بر روی کمانی از یک دایره قرار بگیرند، آن را سرریز کلیدپیانویی انحنادار می نامند. در این پژوهش عملکرد سه مدل هوشمند رگرسیون بردار پشتیبان (SVR)، رگرسیون بردار پشتیبان- کرم شب تاب (SVR-FA) و رگرسیون بردار پشتیبان- ملخ (SVR-GOA) برای پیش بینی میزان آبدهی سرریزهای کلیدپیانویی انحنادار مورد ارزیابی قرار گرفته است. ضریب تعیین (R2)، میانگین مربعات خطا (MAE) ، جذر میانگین مربعات خطا (RMSE) و شاخص پراکندگی (SI) چهار شاخص آماری می باشند که برای تعیین دقت مدل های هوشمند به کار گرفته شده است. نتیجه این معیارهای ارزیابی در دوره آزمون نشان می دهد که مدل SVR-GOA با مقادیر 99275/0، 01202/0، 00026/0 و 00046/0 نسبت به مدل SVR-FA با مقادیر 95666/0، 03844/0، 00200/0 و 00342/0 و SVR با مقادیر 94249/0، 04013/0، 06027/0 و 00410/0 به ترتیب برای شاخص های R2،MAE ،RMSE و SI از دقت بیشتری در پیش بینی آبدهی سرریز کلیدپیانویی انحنادار برخوردار است.

کلید واژگان: : رگرسیون بردار پشتیبان، الگوریتم ملخ، الگوریتم کرم شب تاب، سرریز کلیدپیانویی انحنادار، ضریب آبدهی

Estimating Discharge Coefficient of Curved Piano Key Overflows Using Combination of Support Vector Regression and Grasshopper and Firefly Algorithms

Hojat Karami *, Alireza Rezaei Ahvanooei

Irrigation & Water Engineering, Volume:12 Issue: 46, 2022, PP 186 -202

Piano-Key weirs are a new type of overflow that are designed to increase the drainage capacity of dams and canals. If the keys forming this overflow model are placed on an arc of a circle, it is called curved piano-key weir. In this research, the performance of three models of Intelligent Support Vector Regression (SVR), Support Vector Regression- Firefly (SVR-FA) and Support Vector Regression- Grasshopper (SVR-GOA) to predict curved piano-key weir flow rate were evaluated. Determination Coefficient (R2), Mean Squared Error (MAE), Root Mean Squared Error (RMSE), and Scattering Index (SI) are four statistical indicators that are used to determine the accuracy of intelligent models. The result of these evaluation criteria during the test period is that the SVR-GOA model with values of 0.99275, 0.01220, 0.00026 and 0.00046 compared to the SVR-FA model with values of 0.95666, 0.03844, 0.00200 and 0.00342 and SVR with values of 0.94249, 0.04013, 0.06027 and 0.00410 for R2, MAE, RMSE and SI indicators, are more accurate in predicting curved piano-key weir flow rate

Keywords: Support vector regression, Grasshopper Algorithm, Firefly Algorithm, Curved Piano-Key Weir, Discharge coefficient

ارزیابی روش های رگرسیونی نوین جهت تخمین بار رسوبی معلق در رودخانه سیستان

سارا محمدی، فرزاد حسن پور*، سلمان شریف آذری، فرید فروغی

مجله مهندسی آبیاری و آب ایران، پیاپی 46 (زمستان 1400)، صص 1 -15

برآورد صحیح حجم رسوب حمل شده توسط رودخانه از اهمیت زیادی در پروژه هایی آبی برخوردار است. این مطالعه بر روی 1682 زوج داده برداشت شده که در بازه زمانی 1374-1390 صورت گرفت از روابط تجربی انتقال رسوب، مدل های رگرسیونی بردار پشتیبان و نزدیکترین همسایه توسعه داده شده به منظور پیش بینی بار معلق رسوب رودخانه سیستان استفاده گردید. در این راستا علاوه بر متغیرهای دمای حداقل، دمای حداکثر، دبی و بار معلق برداشت شده، دبی کلاسه نیز به عنوان یک متغیر ورودی تاثیرگذار در مدل سازی رسوب معین مورد بررسی قرار گرفت. برای ترکیب های متفاوت از ورودی ها در هر کدام از روش های رگرسیون بردار پشتیبان و رگرسیون ناپارامتری بهترین ساختار مدل رگرسیونی توجه به معیارهای ارزیابی تعیین گردید. نتایج این تحقیق نشان داد که در بین روابط تجربی روش توفالتی با مقدار جذر میانگین مربعات 66558 تن در روز و ضریب تبیین 705/0 بهتر از سایر روش ها عمل کرده است. همچنین مدل رگرسیون بردار پشتیبان با ترکیب ورودی های دبی، دمای حداقل، دمای حداکثر و دبی کلاسه با مقدار جذر میانگین مربعات خطا 3/2809 تن در روز و ضریب تبیین 96/0 بهترین نتیجه را در برآورد بار معلق رسوب رودخانه دارا بود. بر اساس نتایج این تحقیق روش های رگرسیونی با تفاوت قابل ملاحظه ای بهتر از روابط تجربی، بار معلق رسوب رودخانه سیستان را برآورد کردند.

کلید واژگان: انتقال رسوب، رگرسیون بردار پشتیبان، رگرسیون نزدیک ترین همسایه، روش توفالتی، مدل های تجربی

Assessment of Modern Regression Methods to Suspended Sediment Load Estimation in the Sistan River

Sara Mohamadi, Farzad Hassanpourir *, Salman Sharifazari, Farid Frooghi

Irrigation & Water Engineering, Volume:12 Issue: 46, 2022, PP 1 -15

Correct estimation of the river sediment volume is important for many water resources projects.In this study, the empirical equations of sediment transport, support vector regression (SVR), and developed k-nearest neighbor regression (KNN) were used in order to estimate suspended sediment load in the Sistan River. In this regard, in addition to the maximum temperature, minimum temperature, and dischargeand Suspended Loud In the period 1374 to 1390, it was paired on 1682, discharge, the classified discharge was used as effecting variable to suspended sediment load modeling. For each of the input combinations in support vector regression and development k-nearest neighbor regression the best structure of the regression model determined based on the performance criteria. The Result showed that the Toffaleti method is the best method between empirical equations with R2 equal to 0.705 and RMSE equal to 66558 Ton/day. Also, the support vector regression model with discharge, minimum temperature, maximum temperature and classified discharge as the set of input data is the best model in estimation of suspended sediment load with R2 equal to 0.96 and RMSE equal to 2809.3 Ton/day. The results indicate that the regression method estimate suspended sediment load much better than empirical equations in the Sistan River.

Keywords: Empirical Models, K-Nearest Neighbor Regression, Sediment transport, Support vector regression, Toffalti Method

کاربرد مدل هیبریدARIMA و رگرسیون بردار پشتیبان جهت بهبود پیش بینی سری زمانی

لاله پرویز*، بهاره سعید آبادی

فصلنامه حفاظت منابع آب و خاک، سال دهم شماره 2 (پیاپی 38، زمستان 1399)، صص 43 -54

بررسی دقیق ساختار اصلی سری زمانی نقش مهمی در افزایش دقت پیش بینی مدل ARIMA دارد. هدف این تحقیق بررسی تاثیر جداسازی مدلسازی بخش خطی و غیر خطی سری زمانی در نتایج مدل ARIMA است. تفکیک مدلسازی سری های عملکرد محصول گندم و ذرت دانه ای (استان های کرمانشاه و اصفهان) در بخش خطی مربوط به مدل ARIMA بود و در بخش غیرخطی با رگرسیون بردار پشتیبان انجام گرفت (مدل هیبرید). نتایج مدلسازی می تواند تحت تاثیر نوع ترکیب مورد استفاده بخش غیر خطی در مدل هیبرید تغییر یابد، به عنوان نمونه در سری زمانی ذرت دانه ای در استان کرمانشاه مقدار RMSE در ترکیبی فقط با باقی مانده ها 52/1 و در ترکیبی با سری زمانی 03/15 برآورد شد. در سری زمانی گندم در استان اصفهان با مدل هیبرید میزان کاهش آماره های RMSE،MAE و UII به ترتیب برابر با 94/45، 29/52 و 46 درصد بود که بیانگر بهبود نتایج با مدل هیبرید و تفکیک مدلسازی بخش خطی و غیر خطی سری زمانی است. مقادیر GMER در هر چهار سری زمانی بزرگتر از یک بودند که حاکی از بیش برآورد مقادیر پیش بینی شده مدل هیبرید می باشد. مقایسه متوسط مقادیر آماره ها در دو استان حاکی از تاثیر نوع اقلیم در مبحث مدلسازی است چرا که متوسط مقادیر هر آماره در هر دو مدل (ARIMA و هیبرید) و در هر دو محصول در استان اصفهان نسبت به کرمانشاه کاهش داشت (میزان کاهش RMSE و UII به ترتیب 72/24 و 24/12 درصد). بنابراین تفکیک مدلسازی بخش خطی و غیر خطی می تواند دقت نتایج مدل ARIMA را افزایش دهد.

کلید واژگان: ARIMA، بخش غیر خطی، رگرسیون بردار پشتیبان، هیبرید

Application of hybrid ARIMA and support vector regression model for improvement of time series forecasting

Laleh Parviz *, Bahareh Saeedabdi

Journal of Water and Soil Resources Conservation, Volume:10 Issue: 2, 2021, PP 43 -54

Accurate investigation related to the structure of time series plays an important role in increasing the accuracy of ARIMA forecasting. The aim of this research is to investigate the effect of modeling decomposition of linear and non linear parts of time series on ARIMA model results. The decomposition of wheat and maize yield time series (in Kermanshah and Esfahan provinces) in the linear part was related to ARIMA and in the non linear part was conducted with support vector regression (hybrid model). The kind of configuration of non linear part of hybrid model is more important for example in the maize time series of Kermanshah, the values of RMSE for configuration with residual was 1.52 and for time series configuration was 15.03. The decreasing of RMSE, MAE and UII for wheat time series of Esfahan with hybrid model was 45.94%, 52.29% and 46%, respectively which is indicative of hybrid model improvement. The value of GMER in all four time series was greater than one which indicates the overestimation of hybrid model. Comparison the average of each criteria with two models and crops in each province indicated the effect of climate on modeling process because the average of criteria in Esfahan province decreased rather to Kermanshah (RMSE decreasing= 24.72%, UII decreasing=12.24%). Therefore, decomposition of time series to linear and non linear parts of time series can increase the accuracy of ARIMA model results.

Keywords: ARIMA, Nonlinear, support vector regression, Hybrid

بهره گیری از روش های رگرسیون بردار پشتیبان و شبکه عصبی مصنوعی در مدلسازی بارش رواناب (مطالعه موردی: حوضه آبریز سد صفارود)

نازنین نجیب زاده، کوروش قادری*، محمد مهدی احمدی

مجله آبیاری و زهکشی ایران، سال سیزدهم شماره 6 (بهمن و اسفند 1398)، صص 1709 -1720

مدلسازی بارش رواناب در اکثر پروژه های مدیریت و برنامه ریزی منابع آب و کنترل سیلاب امری ضروری و پیچیده می باشد. در این تحقیق مدل هایی برای شبیه سازی فرآیند بارش رواناب در زیر حوضه سد صفا رود واقع در حوضه آبریز هلیل رود با استفاده از روش های شبکه عصبی مصنوعی پرسپترون چند لایه (MLP)، شبکه عصبی تابع شعاع مدار (RBF)، رگرسیون ماشین بردار پشتیبان با توابع کرنل خطی (SVR Linear) و پایه شعاعی (SVR RBF) در محیط نرم افزاری IBM SPSS Modeler توسعه داده شده است. بدین منظور از داده های هیدرومتری ایستگاه هنجان و داده های بارش ایستگاه های هنجان، رابر، چشمه عروس و میدان واقع در منطقه مورد مطالعه طی دوره آماری 1394-1385 در مقیاس روزانه استفاده شد. 70 درصد داده ها به عنوان داده های آموزش و 30 درصد آنها به عنوان داده های آزمون مورد استفاده قرار گرفت. پس از محاسبه ضرایب همبستگی جزئی متغیرهای بارش و دبی، شش الگوی مختلف جهت مدلسازی رواناب روزانه ایستگاه هنجان تعیین شد. مقادیر شاخص های آماری متوسط قدر مطلق خطا (MAE)، مجذور میانگین مربعات خطا (RMSE) و ضریب همبستگی (R) در بهترین الگو مرحله آزمون به ترتیب برابر 032/0، 229/0، 967/0 برای مدل SVR Linear 5 ایستگاه هنجان بدست آمدند. نتایج حاکی از عملکرد مناسب روش های MLP و SVR Linear در مدلسازی بارش رواناب در محدوده مورد مطالعه می باشد.

کلید واژگان: بارش رواناب، رگرسیون بردار پشتیبان، سد صفارود، شبکه عصبی مصنوعی

Rainfall-Runoff modelling using Support Vector Regression and Artificial Neural Network Models ( Case study: SafaRoud Dam Watershed)

Nazanin Najibzade, Koroush Qaderi *, Mohmmad Mehdi Ahmadi

Iranian Journal of Irrigation & Drainage, Volume:13 Issue: 6, 2020, PP 1709 -1720

Rainfall-runoff modeling is an important and complex aspect in most water resource management and planning projects. In this study, Perespetron multi-layered artificial neural network (MLP), Radial basis function Neural Network (RBF), and support vector machine regression with linear kernel functions (SVR linear) were used to develop some models in SPSS to simulate Rainfall-runoff process in subarea of Safaroud dam, located in Halil Rood watershed. To do so, hydrometric data of Hanjan station and rainfall data of Hanjan, Rabor, Cheshme Aroos, and Meidan stations, located in the studied area, were used. 70% of the data were used as training data and 30% were used as test data. After calculating the partial correlation coefficients of the rainfall and discharge, six different patterns were used to model the daily rainfall of Hanjan station. In the best pattern of the test level, for SVR Linear 5 model, Mean Absolute error (MAE), Root Mean Squared Error (RMSE) and correlation coefficient (r) were equal to 0.032, 0.229, and 0.967, respectively. The results proved the efficient performance of MLP and SVR Linear in Rainfall-runoff modeling in the studied area.

Keywords: Rainfall-Runoff, Support vector regression, Safaroud Dam, Artificial Neural Network

بررسی عملکرد روش های داده مبنا در تخمین نقاط مهم رطوبتی در منطقه شاهرود

امید نوروزی انگنایی*، محمدجواد خلفی، محبوبه کریمی سورند

فصلنامه علوم و مهندسی آبیاری، سال چهل و دوم شماره 4 (زمستان 1398)، صص 29 -44

آگاهی از نقاط مهم رطوبتی، برای مطالعه های آبیاری در مزرعه بسیار ضروری می باشد اما اندازه گیری این اطلاعات به روش مستقیم بسیار پرهزینه و وقت گیر است. روش های داده مبنا می توانند روش مناسبی برای تخمین این پارامترها باشد. تحقیق حاضر به برآورد نقاط مهم رطوبتی شامل ظرفیت زراعی و نقطه پژمردگی دایم به وسیله پارامترهای زودیافت با سه روش شبکه عصبی، رگرسیون خطی چندمتغیره و رگرسیون بردار پشتیبان در منطقه شاهرود پرداخته است. پس از نرمال سازی داده های مورد نظر جدول ضریب همبستگی متغیرهای ورودی احتمالی با خروجی های مورد نظر تشکیل شد و معنی داری همبستگی متغیرهای ورودی و خروجی از نظر آماری بررسی گردید. سپس، مدل سازی با روش های مذکور انجام و نتایج مورد ارزیابی قرار گرفت. نتایج نشان داد که روش رگرسیون بردار پشتیبان کارایی بهتری نسبت به دو روش دیگر دارد. مقادیر ضریب تعیین، انحراف جذر میانگین مربعات خطا و ریشه میانگین مربعات خطا نرمال شده در بهترین مدل رگرسیون بردار پشتیبان ، به ترتیب برابر 85/0 ، 12/3 و 89/12 برای ظرفیت زراعی و 83/0 ، 58/1و 84/14 برای نقطه پژمردگی دایم و برای شبکه های عصبی مقادیر 72/0 ، 48/3 و 36/14 برای ظرفیت زراعی و 75/0 ، 90/1 و 91/17 برای نقطه پژمردگی به دست آمد. با توجه به بررسی های صورت گرفته در این تحقیق، می توان بیان نمود که مدل های رگرسیون بردار پشتیبان با تابع کرنل خطی پایه شعاعی قادر خواهند بود با خطای پایین و ضریب تعیین بالا نقاط مهم رطوبتی خاک را پیش بینی کنند و همچنین می توانند جایگزین بسیار خوبی برای روش های سنتی هم چون شبکه های عصبی و رگرسیون خطی باشند.

کلید واژگان: رگرسیون خطی، شبکه های عصبی مصنوعی، رگرسیون بردار پشتیبان، ظرفیت زراعی، نقطه پژمردگی دایم

Investigating the Performance of Data-based Methods in Estimating Important Moisture Points in Shahrood Area

Omid Norouzi Engnaee *, Mohammad Javad Khalafi, Mahboobeh Karimi Soorvand

Irrigation Sciences and Engineering, Volume:42 Issue: 4, 2019, PP 29 -44

Awareness of the important moisture points is crucial for irrigation studies on the farm, but measuring this information in a direct way is very costly and time consuming. Therefore, several models and relationships have been developed as Pedotransfer functions which indirectly predict the hydrological properties of the soil using readily available soil data with the aid of a series of proper mathematical relationships (Nguyen et al., 2015). Since the measurement of important moisture points is a time consuming, costly and difficult work, many attempts have been made in order to use simpler soil properties such as texture, the amount of organic matter, and bulk density. Pedotransfer functions are indeed predictive functions which establish relationship between the soil’s readily available and latency data (e.g., the percentage of sand, silt and clay, bulk density and organic matter) including the parameters of the moisture curve (field capacity and permanent wilting point) (Botulla et al., 2013). Moreover, the functions that can be successfully implemented in an area may not have suitable adaptations in another area with real values. There are several methods for obtaining Pedotransfer functions, among them are linear regression (LR), artificial neural networks, fuzzy adaptive-neural inference, and support regression vector. Various researchers have studied the development of Pedotransfer functions and evaluated the predictive models in the water and soil sciences. As a sample, Shop and Lajj (1998) estimated the soil moisture curve using the neural network. They found that the artificial neural network was better than some of the regression Pedotransfer functions provided by other researchers, and if more readily available properties were used as inputs, the prediction accuracy increased. However, there was always a significant difference between the predicted and measured moisture values. Zhang et al. (2007) estimated the soil moisture curve for 110 non-calcareous soil samples with different tissue classes through the artificial neural networks and regression models. They showed that the neural network predicts the moisture curve better than the regression method with higher correlation coefficient in most tissue classes. Lin et al. (2009) argued that the SVM method was much faster trained than the artificial neural network. SVM was also found to have a more accurate prediction than the artificial neural network. Chen et al. (2010) used support vector machines to model daily rainfall and compared the results with that of the multivariate analysis method. It was found that the results of predictions from SVM were more accurate. In turn, Kaihua et al. (2014) used support vector machines to predict cationic capacity on different horizons of the soil in Qingdao, China. They performed their studies at 208 points on two horizons of the soil, and concluded that the SVM model improved predictions. Considering the significance of knowing the important points of soil moisture in Shahrood area for agricultural projects and irrigation schedules, developing appropriate Pedotransfer functions and evaluating models is necessary so as to obtain moisture of the field capacity and permanent wilting point. This research, thus, evaluates the performance of three models of support vector regression, artificial neural networks, and linear regression in the development of soil Pedotransfer functions and the effect of number and type of input variables on the performance of the models.

Keywords: Linear Regression, Artificial Neural Networks, Support Vector Regression, Field Capacity, Permanent Wilting Point

ترمیم داده های مفقود هواشناسی با روش های تکاملی و یادگیری ماشین مطالعه موردی: بارش و دمای ماهانه درازمدت مشهد

محبوبه فرزندی، سید حسین ثنایی نژاد*، بیژن قهرمان، مجید سرمد

نشریه آب و خاک، سال سی و سوم شماره 2 (پیاپی 64، خرداد و تیر 1398)، صص 361 -377

بارش و دما از مهم ترین متغیرهای هوا و اقلیم شناسی هستند. طول دوره آماری اهمیت بسزایی در دقت تحلیل این دو متغیر دارد. حجم نمونه کمتر از 100 سال نمی تواند نوسانات دراز مدت را به خوبی منعکس کند. طولانی ترین آمار مربوط به دما و بارش ماهانه مشهد نزدیک به 125 سال (از حدود 1893 الی 2017) است. متاسفانه این آمار مفقودی دارد. ترمیم داده های مفقود و افزایش دقت برآورد آن ها هدف این پژوهش است. ایستگاه هایی از کشورهای مجاور به عنوان ایستگاه های مبنا انتخاب شدند. ابتدا داده های مفقود با برازش ده الگوی رگرسیونی چندگانه برای بارش ماهانه (با ضرایب تعیین 63/0 تا 81/0) و شش الگو برای دمای ماهانه (986/0تا 993/0) ترمیم شدند. سپس برای کاهش خطاها، پارامترهای الگوهای رگرسیونی با روش های GA و ACO بهینه شدند. افزون بر این دو روش ANN و SVR نیز به منظور الگوسازی این داده ها نیز به کار گرفته شدند. نتایج نشان داد GA و ACO دقت برآورد داده های مفقود بارش را نسبت به روش های رگرسیونی فوق به طور چشمگیری افزایش می دهد. کمترین RMSE بین تمام الگوهای رگرسیونی بارش 79/9 میلی متر است. این معیار با روش GA به 560/2 میلی متر و با ACO به 559/2 کاهش می بابد. کمترین RMSE بین الگوهای رگرسیونی دما 986/0 میلی متر است. این معیار با روش ANN به 726/0 میلی متر و با SVR نیز به 551/0 کاهش می بابد. مقایسه ترمیم دما و بارش نشان می دهد که روش های تکاملی برای بارش و روش های یادگیری ماشین برای دما عملکرد بهتری دارند.

کلید واژگان: الگوریتم ژنتیک، داده مفقود، رگرسیون بردار پشتیبان، شبکه عصبی مصنوعی، کلونی مورچگان

Imputation of Missing Meteorological Data with Evolutionary and Machine Learning Methods Case Study: Long-term Monthly Precipitation and Temperature of Mashhad

M. Farzandi, H. Sanaeinejad *, B. Ghahraman, M. Sarmad

Journal of water and soil, Volume:33 Issue: 2, 2019, PP 361 -377

Introduction

Temperature and precipitation are two of the main variables in meteorology and climatology. These are basic inputs in water resource management. The length of the statistical period plays a pivotal role in the accurate analysis of these variables. Observation data at Iran's first synoptic station from 1330 (1951) is available at the Iranian Meteorological Organization website The historical monthly precipitation and temperature of five stations in Iran is available since 1880 with missing data. These data measured by the Embassy of the United States and Britain from the Qajar period and recorded in World Weather records books. These synoptic stations include Mashhad, Isfahan, Tehran, Bushehr, and Jask. The monthly missing data were predominantly recorded during World War II (1941-1949). Unfortunately, these data have missing. Therefore, the accuracy of simulating these variables is very important. The current research aimed to predict the missing values of monthly temperature and precipitation in Mashhad station. The stations in the neighboring countries were selected due to the distance to Mashhad, relationship, and completeness of data since 1880, as the predictive variables. Monthly precipitation of Ashgabat from Tajikistan and Sarakhs, Kooshkah, Bayram Ali, Kerki and Repetek from Turkmenistan were selected as an independent variable in the making of Missing Rainfall in Mashhad. Also, the temperature of Ashgabat, Bayram Ali, Gudan, Sarakhs, and Tajan were selected to restore the monthly temperature of the Mashhad station. This research has fitted ten multiple regression models to monthly rainfall of Mashhad station and has fitted 6 multiple regression to the monthly temperature of Mashhad. then the parameters of these patterns are optimized by genetic and Ant Colony algorithm. Also, the Artificial Neural Network (MLP) model and Support vector regression have been selected and implemented in order to simulate monthly precipitation and temperature data of Mashhad.

Materials and Methods

In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships among variables. It includes many techniques for modeling and analyzing several variables when the focus is on the relationship between a dependent variable and one or more independent variables (or 'predictors'). Genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to generate high-quality solutions to optimization and search problems by relying on bio-inspired operators such as mutation, crossover, and selection. Ant colony optimization algorithm (ACO) is a probabilistic technique for solving computational problems which can be reduced to finding good paths through graphs. This algorithm is a member of the ant colony algorithms family, in swarm intelligence methods, and it constitutes some metaheuristic optimizations. Artificial neural networks are one of the main tools used in machine learning. As the “neural” part of their name suggests, they are brain-inspired systems which are intended to replicate the way that we humans learn. Neural networks consist of input and output layers, as well as (in most cases) a hidden layer consisting of units that transform the input into something that the output layer can use. They are excellent tools for finding patterns which are far too complex or numerous for a human programmer to extract and teach the machine to recognize. In machine learning, support vector machines (SVMs, also support vector networks) are supervised learning models with associated learning algorithms that analyze data used for classification and regression analysis. Given a set of training examples, each marked as belonging to one or the other of two categories, an SVM training algorithm builds a model that assigns new examples to one category or the other, making it a non-probabilistic binary linear classifier (although methods such as Platt scaling exist to use SVM in a probabilistic classification setting).

Results and Discussion

At the first stage, several multiple regressions were fitted to monthly precipitation (with coefficients ranging from 0.63 to 0.81) and six patterns for monthly temperature (0.986-0.993). Afterward, GA and ACO were applied to improve the accuracy of the selected regression models by optimizing their parameters. At the next stage, ANN and SVR were used to estimate the monthly missing values separately. Finally, the results of the previous stages were compared using the root mean square error (RMSE), and the optimal models were applied to determine the missing values of monthly temperature and precipitation of Mashhad. The results showed that the Genetic Algorithm and Ant Colony increase the accuracy of the estimation of missing rainfall data significantly more than the previous methods. The lowest error criterion (RMSE) between regression patterns is 9.8 millimeters. By genetic algorithm, this criterion is reduced to 2.56 mm, and by ant colony algorithm to 2.559.

Conclusion

Comparison of the above methods in restoration temperature and precipitation shows that evolutionary methods (GA and ACO) are the best for estimating the missing monthly precipitation and machine learning methods (ANN and SVR) are the best to imputation missing data of monthly temperature.

Keywords: Ant colony, Artificial neural network, Genetic algorithm, Missing data, Support vector regression

به جمع مشترکان مگیران بپیوندید!

support vector regression