جستجوی مقالات مرتبط با کلیدواژه

k-means algorithm

در نشریات گروه فنی و مهندسی

انتخاب همه

Improve Color Image Clustering with Using Combination of GLS, Giza Pyramid Construction and K-Means Algorithm

Masood Shahrian, Madjid Khalilian*

Journal of Computer and Robotics, Volume:18 Issue: 2, Summer and Autumn 2025, PP 45 -51

Color image clustering is recognized as a complex challenge in the field of image processing. To improve the results of image clustering, meta-heuristic optimization algorithms can be employed. These algorithms are typically straightforward and can efficiently tackle problems in a short time frame, which offers distinct advantages. However, as the complexity of the problem increases, the solutions derived from these algorithms often fail to represent the optimal solution, resulting in limitations for their practical use. Thus, improving the performance and accuracy of existing algorithms is essential for broadening their applicability. Many meta-heuristic algorithms struggle to maintain an appropriate balance between exploration and exploitation during their update processes, and this issue has not been sufficiently addressed. In this research, we present a novel approach to image clustering. Our method integrates an enhanced Giza Pyramid Construction (GPC) with the Guided Learning Strategy (GLS) and k-means clustering. The GLS strategy assesses the standard deviation of historical positions of individuals across recent generations to evaluate population dispersion and deduce the type of guidance the algorithm requires at any given time. When the algorithm leans towards exploration, this strategy steers it towards exploitation, and vice versa. By identifying and addressing the algorithm’s current needs, this strategy can significantly improve the performance of various optimization algorithms. Furthermore, the Giza Pyramid Construction, inspired by the historical practices of ancient Egypt, mathematically models the behavior of worker groups engaged in constructing large pyramids. We assess the effectiveness of our proposed algorithm in the context of color image clustering and compare the results against several established evaluators that can analyze internal cluster evaluations and inter-cluster distances. Our findings demonstrate that the proposed method achieves superior results compared to other state-of-the-art techniques, based on both objective and subjective evaluation metrics.

Keywords: Image Processing, Color Image Clustering, Giza Pyramid Construction, GLS, K-Means Algorithm

Abstract View Paper Research/Original Article Original: English
مدیریت اعتماد مبتنی بر بازخورد کاربران در محیط ابری با استفاده از الگوریتم بهینه فاخته

مقتدا سلیمانی مبارکه، گلناز آقایی قزوینی *، بابک نیکمرد

نشریه فناوری اطلاعات و ارتباطات ایران، سال شانزدهم شماره 61 (پاییز و زمستان 1403)، صص 147 -164

مدیریت اعتماد مبتنی بر بازخورد کاربران در محیط ابری از اهمیت زیادی برخوردار است. در محیط ابری انتخاب تامین کننده برای کاربران ابر، چالش برانگیز است. این موضوع که آیا انتخاب تامین کننده بر مبنای اولویت های کاربر و پارامترهای ثبت شده تا چه میزان دقیق است به عوامل زیادی بستگی دارد. در مطالعات پیشین چارچوب های زیادی در خصوص نحوه محاسبه اعتمادهای عینی و درونی ارائه شده است. در روش های موجود با استفاده از الگوریتم های جستجوی فاخته، ژنتیک و مگس میوه بهره برده شده است. قالب تحقیقات انجام شده به رتبه بندی، محاسبه پارامترها و یا سرعت محاسبه و میزان دقت در ارزیابی پارامترها پرداخته شده است که معمولا یا در بهینه محلی گیر کرده و یا زمان پاسخ بسیار کند بوده است.در این روش کاهش زمان ارزیابی اعتماد نسبت به الگوریتم های قبلی همچون ژنتیک به دلیل این که پارامتر های کمتری برای تنظیم دارد مشهود است، با تغییر در شعاع تخمگذاری و افزایش بررسی در فضای بیشتری از مسئله، الگوریتم بهینه فاخته نسبت به الگوریتم مرجع، سرعت همگرایی بیشتر، حداقل به میزان 5.9 درصد را دارد. در خصوص میزان دقت دسترسی کاربر به قابل اعتمادترین ارائه دهنده نیز با تغییر در جمعیت و پارامترهایی همچون تعداد فراهم کنندگان، کاربران و تکرار الگوریتم همچنان نتیجه بهتری حاصل شده است. در مجموع، نتایج حاصل شده نشان می دهد،مسئله با استفاده از الگوریتم COA در زمان بسیار کمتر نسبت به سایر الگوریتم ها به نقطه بهینه همگرا می شود و نتیجه ای دقیق تر بدست می آید

کلید واژگان: رایانش ابری، مدیریت اعتماد، بازخورد کاربران، الگوریتم بهینه فاخته، الگوریتم K-Means

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Trust Management Based on User Feedback in Cloud Computing Environment by Using Cuckoo Optimization Algorithm

Muqtada Soleimani Mobarake, Golnaz Aghaee Ghazvini*, Babak Nikmard

Journal of Information and Communication Technology, Volume:16 Issue: 61, 2025, PP 147 -164

Cloud computing provides computational services such as servers, memory, storage space, databases, networks, software, analytics, and information as virtualized resources through the internet to offer faster innovation, flexible resources, and cost savings at scale. Although cloud computing service providers are innovatively expanding their services, trust is one of the major obstacles to the progress of this matter. Trust is the biggest issue in cloud computing since trust is an effective guarantor during interactions between the users and the providers. Trust is one of the most fundamental methods for increasing confidence in resources provided in the cloud environment and is important in cloud business environments. With the increasing number of cloud services providers in the cloud computing environment and the number of users, the selection of provider has become a major challenge. The Coa algorithm has a higher convergence speed, at least by 5.9%, compared to the studied algorithms. In this research, an optimization approach based on a metaheuristic process using the COA algorithm combined with the K-means clustering algorithm is proposed to solve the optimization problem of selecting the best provider in the trust management third-party component layer based on parameters. In this method, while reducing trust evaluation time, the accuracy of user access to the most trusted provider based on user priorities has increased compared to previous methods. This can increase user confidence and improve the quality of service providers.

Keywords: Cloud Computing, Trust Management, User Feedback, Cuckoo Optimization Algorithm, K-Means Algorithm

Abstract View Paper Research/Original Article Original: Persian
خوشه‏ بندی و هموارسازی با استفاده از داده های واقعی جمع آوری شده از اتوبوس شهری برای استخراج ‏چرخه رانندگی در شهر کرمانشاه

فرشاد محمدصادقی، محمد آزادی*، مسعود مسیح طهرانی، علی ملکان، علی کشاورزی، حامد سعیدی گوگرچین

نشریه تحقیقات موتور، پیاپی 75 (تابستان 1403)، صص 1 -17

هر چرخه رانندگی به منظور بررسی مصرف سوخت، آلاینده های خودرویی، الگوی رانندگی ویژه هر منطقه برای برنامه ریزی بهتر و بهبود استانداردهای مربوطه در منطقه مدنظر تهیه می شود. در این پژوهش از روش چرخه رانندگی واقعی (RDE) که می توان گفت یکی از دقیق ترین روش ها برای استخراج چرخه رانندگی به حساب می آید استفاده شده است. برای جمع آوری داده های موردنظر از GPS گوشی های همراه و به صورت سوار بر اتوبوس های شهری در خیابان های شهر کرمانشاه استفاده شده است. این داده ها در طول مسیر حرکت اتوبوس از مبدا تا مقصد مقدار سرعت، شتاب گیری، مدت زمان سرعت ثابت، مقدار کاهش شتاب، ارتفاع از سطح دریا، تعداد فرایند ترمزگیری و مدت زمان توقف خودرو را در مسیر ثبت می کنند. در این پژوهش با استفاده از روش خوشه بندی داده های واقعی گرفته شده در سطح شهر و روش میانگین کی (K-means) به استخراج چرخه رانندگی اتوبوس های شهری کرمانشاه پرداخته شده است. با تحلیل داده های گرفته شده سوار بر اتوبوس دیده می شود که رانندگان اتوبوس در شهر کرمانشاه تمایلی به شتاب گیری و حرکت با سرعت ثابت تند را ندارند که این امر مصرف سوخت را در چرخه اتوبوس شهری کاهش می دهد، که می توان گفت این کار به صورت آگاهانه و به همین منظور توسط راننده ها صورت می گیرد.

کلید واژگان: چرخه رانندگی، خوشه بندی، روش میانگین کی، اتوبوس شهری، شهر کرمانشاه

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Clustering and smoothing using real data collected from city buses to extract the driving ‎cycle in Kermanshah city

Farshad Mohammadsadeghi, Mohammad Azadi *, Masoud Masih Tehrani, Ali Malekan, Ali Keshavarzi, Hamed Saeidi Googarchin

Journal of Engine Research, Volume:20 Issue: 75, 2024, PP 1 -17

Each driving cycle designed to monitor fuel consumption, vehicle emissions, and the specific driving patterns of a region is created to facilitate better planning and enhancement of local standards. In this study, the Real Driving Emissions (RDE) method, regarded as one of the most precise techniques for determining driving cycles, was employed. To gather the necessary data, GPS from mobile phones was utilized while traveling on city buses through the streets of Kermanshah. This data collection included recording speed, acceleration, duration of constant speed, deceleration, altitude, number of braking events, and the duration of stops along the route from origin to destination. The research employed the clustering of real city data and the K-means method to derive the driving cycle for Kermanshah’s city buses. Analyzing the collected data reveals that bus drivers in Kermanshah tend to avoid rapid acceleration and high constant speeds, leading to reduced fuel consumption in the city bus cycle, indicating that this behavior is intentional and aimed at this goal.

Keywords: Driving Cycle, Clustering, K-Means Algorithm, City Bus, Kermanshah City

Abstract View Paper Research/Original Article Original: Persian
Presenting a technique for identifying and diagnosing epileptic seizures using nonlinear feature extraction based on DT-CWT coefficients of brain EEG signals with a deep

Abdullah Jafari Chashmi *

Journal of Future Generation of Communication and Internet of Things, Volume:2 Issue: 4, Oct 2023, PP 28 -35

Epilepsy is a type of brain disease that can be diagnosed by observing EEG signals. The disease often occurs in children. However, some cases are also seen in adults. Diagnosing this disease in the early stages is a challenging task for doctors. In this work, the authors have classified epileptic and normal EEG signal by adopting deep learning approach. To achieve the efficient features, the dual tree complex wavelet (DTCWT) is considered. Then, the decomposed wavelet coefficients are applied to nonlinear feature extraction. These features are used as input to the Radial Hybrid Basis Function (RBF) class. Using the proposed method, about 99% classification accuracy is observed. This requires significant improvement of the proposed algorithm compared to other previously presented algorithms. It is the first time that nonlinear feature extraction on DT-CWT coefficients of an EEG signal is used to diagnose epilepsy.

Keywords: epilepsy, k-Means Algorithm, Nonlinear features, radial basis function networks, brain EEG classification, Feature reduction

Abstract View Paper Research/Original Article Original: English
بهبود کارآیی خوشه بندی داده ها با الگوریتم های تکاملی آشوب گونه

سجاد منطقی*، سارا خسروانی پور

نشریه پژوهش های نوین در شهر هوشمند، سال یکم شماره 4 (تابستان 1402)، صص 6 -25

امروزه، خوشه بندی نقش مهمی را در اغلب زمینه های تحقیقاتی مانند مهندسی، پزشکی، زیست شناسی، داده کاوی و... ایفا می نماید. در واقع خوشه بندی به معنای تقسیم بندی بدون نظارت می باشد. داده ها با استفاده از آن به دسته هایی که از نظر پارامترهای موردعلاقه، شباهت بیشتری به یکدیگر دارند، تقسیم می گردند. یکی از روش های معروف در این زمینه k-means می باشد. در این روش علی رغم وابستگی به شرایط اولیه و همگرایی به نقاط بهینه محلی، تعداد N داده به k خوشه با سرعت بالا، دسته بندی می شوند. در این مقاله جهت رفع مشکلات موجود از روش ترکیبی مبتنی بر الگوریتم های تکاملی و تئوری آشوب و k-means بهره گرفته خواهد شد؛ که علاوه بر رفع مشکلات ذکرشده، مستقل از تعداد متغیرها نیز خواهد بود. در این مقاله به منظور اعتبارسنجی، روش های پیشنهادی بر روی 13 مجموعه متفاوت مشهور پیاده سازی می گردد و نتایج با روش های الگوریتم ژنتیک، اجتماع ذرات، کلونی زنبور عسل، تبرید شبیه سازی شده، تکاملی تفاضلی، جستجوی هارمونی و k-means مقایسه خواهند گردید. توانایی بالا و مقاوم بودن این روش ها بر اساس نتایج مشهود خواهد بود.

کلید واژگان: خوشه بندی، الگوریتم K-Means، الگوریتم های تکاملی، آشوب، الگوریتم تکاملی آشوب گونه

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Improving the efficiency of data clustering with chaotic evolutionary algorithms

Sajad Manteghi*, Sara Khosravani Pour

Journal of New Researches in the Smart City, Volume:1 Issue: 4, 2023, PP 6 -25

Nowadays, clustering plays an important role in most research fields such as engineering, medicine, biology, data mining, etc. In fact, clustering means unsupervised division. By using it, the data are divided into categories that are more similar to each other in terms of the parameters of interest. One of the famous methods in this field is k-means. In this method, despite the dependence on initial conditions and convergence to local optimal points, N numbers of data are grouped into k clusters with high speed. In this article, to solve the existing problems, the combined method is used based on evolutionary algorithms, chaos theory and k-means; that is in addition to solving the mentioned problems, it will also be independent of the number of variables. In this article, for the purpose of validation, the proposed methods are implemented on 13 different famous collections, and the results are compared with genetic algorithm, particle community, bee colony, simulated refrigeration, differential evolution, harmony search, and k-means methods. The high ability and robustness of these methods will be evident based on the results.

Keywords: Clustering, K-Means Algorithm, Evolutionary Algorithms, Chaos, Chaoticevolutionary Algorithm

Abstract View Paper Research/Original Article Original: Persian
طراحی مدل داده کاوی تلفیقی خوشه بندی-وابستگی جهت بررسی رفتار مصرف انرژی واحدهای صنعتی

فرزاد رحیمی موگویی، رضا کامران راد*، عظیم الله زارعی

نشریه انرژی ایران، سال بیست و پنجم شماره 3 (پیاپی 96، پاییز 1401)، صص 65 -78

با توجه به سهم بالای مصرف برق در صنایع کشور، طی چند سال اخیر طرح های مختلفی از جمله خاموشی های سراسری در اوقات پیک مصرف اجرا شده است. امروزه داده کاوی به عنوان فرآیند کشف الگوهای مفید از پایگاه داده و یکی از روش های موثر برای تجزیه و تحلیل، مدل سازی و پیش بینی مصرف انرژی کاربرد فراوانی پیدا کرده است. در این مطالعه، مدلی تلفیقی جهت بررسی رفتار مصرف برق با استفاده از تکنیک های خوشه بندی کا میانگین و قوانین وابستگی جهت کشف و استخراج الگو از مجموعه داده های مربوط به مصرف برق واحدهای صنعتی مستقر در یکی از شهرک های صنعتی استان تهران طراحی شده است. مشاهدات نشان می دهد که طی ماه های گرم سال، میانگین مصرف واحدهای خوشه پرمصرف که حدود 34 درصد واحدهای صنعتی مورد مطالعه را شامل می شود، حدود 4.2 برابر مصرف خوشه کم مصرف و حدود 1.7 برابر مصرف خوشه متوسط است. با بکارگیری مدل پیشنهادی در این پژوهش ضمن شناسایی واحدهای پرمصرف و اعمال سیاست های هوشمندانه و عادلانه در خاموشی اجباری، می توان علاوه بر تشویق واحدهای صنعتی به بهینه سازی مصرف انرژی، از ایجاد خسارت ناشی از توقف های اجباری تولید ممانعت کرد. رویکرد نوآورانه این مدل قادر به کنترل حجم زیادی از داده ها برای برنامه ریزی مناطق مختلف با هدف بهینه سازی در مصرف انرژی آن می باشد.

کلید واژگان: داده کاوی، خوشه بندی، تجزیه و تحلیل سلسله مراتبی، الگوریتم کا میانگین، قوانین وابستگی، بهینه سازی مصرف انرژی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Design of integrated clustering-association data mining model to study the electricity consumption behavior of industrial units

Farzad Rahimi, Reza Kamranrad*, Azimolah Zarei

Iranina journal of Energy, Volume:25 Issue: 3, 2023, PP 65 -78

Due to the high share of electricity consumption in the country's industries, in recent years, various projects have been implemented, including reducing the amount of load by implementing nationwide blackouts during peak consumption times. Today, data mining is widely used as a process of discovering useful patterns from the database and one of the effective methods for analyzing, modeling and predicting energy consumption. In this study, an integrated clustering-association data mining model has been designed to investigate power consumption behavior to discover and extract the pattern from the power consumption data set of industrial units located in one of the industrial towns of Tehran province. Observations show that during the warm months of the year, the average consumption of high-consumption cluster units, which includes about 34% of the studied industrial units, is about 4.2 times the consumption of low-consumption clusters and about 1.7 times the consumption of medium clusters. By using the proposed model in this study, in addition to identifying high-consumption units and implementing smart and fair policies in forced shutdowns, it is possible to prevent damage caused by forced shutdowns and industrial units can be encouraged to optimize energy consumption. The innovative approach of this model is able to control large volumes of data for planning different areas with the aim of optimizing its energy consumption.

Keywords: Data mining, clustering, hierarchical analysis, K-means algorithm, Dependency laws, optimization of energy consumption

Abstract View Paper Research/Original Article Original: Persian
VMRCD :الگوریتمی برای مسیریابی اتکاپذیر با استفاده از خوشه بندی مجدد در شبکه های خودرویی با ارتباط V2V

آرش قربان نیادلاور*، زهرا جورمند، لیلا ترکاشوند رحمتی

فصلنامه مدل سازی در مهندسی، پیاپی 71 (زمستان 1401)، صص 87 -99

کنترل ازدحام ترافیک وسایل نقلیه نقش مهمی در مدیریت زمان دارد و بامطالعه موردی در شبکه های VANET مبتنی بر خوشه بندی ،الگوریتمی برای مسیریابی اتکاپذیر با استفاده از خوشه بندی مجدد در شبکه های خودرویی با ارتباط V2V ارایه داده ایم.تجمع وسایل نقلیه نقش مستقیمی در کنترل ازدحام ترافیک دارد ، در الگوریتم VMRCD با در نظر گرفتن پارامترهای شاخص ، تابع هدفی ارایه داده ایم که با ادغام نمودن پارامترهایی همچون وزن و فاصله ، طول مسیر را نسبت به روش های پیشین بهینه کرده ایم و با خوشه بندی مجدد تعداد برخورد وسایل نقلیه را با استفاده از اصل همسایگی کاهش داده ایم. در این حالت با در نظر گرفتن فاصله اقلیدسی می توان از تجمع وسایل نقلیه با استفاده از مینیمم فاصله جلوگیری کرده تا مسیری بهینه ای ایجاد نماییم. با کمک تابع هدف به یک مسیر اتکاپذیر دست یافته ایم تا تحویل بسته و تاخیر END-TO-END را بهینه کنیم.درنهایت با استفاده از الگوریتم پیشنهادی با دسته بندی ورودی ها و زمان اجرای واقعی نسبت به الگوریتم SCRS مسیریابی اتکاپذیر را افزایش داده ایم و همچنین مصرف سوخت را کاهش داده ایم.

کلید واژگان: اتکاپذیری، الگوریتم k-means، خوشه بندی مجدد، طول عمر سرخوشه، وزن دهی، VMRCD

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

VMRCD: an algorithm for dependable routing using the re-clustering for vehicular networks with v2v communication

Arash Ghorbannia Delavar *, Zahra Jormand, Leila Torkashvand Rahmati

Journal of Modeling in Engineering, Volume:20 Issue: 71, 2022, PP 87 -99

Vehicle traffic congestion control has an essential role in time management and with a case study on clustering-based VANET networks, an algorithm for dependable routing using the re-clustering for vehicular networks with v2v communication has been proposed.The aggregation of vehicles has a direct role in controlling traffic congestion, By observation the index parameters, we obtain the objective function, that merging parameter such as weight and distance, it optimizes the route length compared to previous methods, And by re-clustering, we decreased the number of vehicle collisions using the neighborhood principle. In this case, by considering the Euclidean distance, we can prevent the aggregation of vehicles using the minimum distance to create an optimal route. With the help of the objective function, we will achieve a dependable route to optimize packet delivery and the delay END-TO-END.Ultimately, using the proposed an algorithm, By categorization the inputs and real execution time, we increase the dependable routing compared to the SCRS algorithm, and we also decreased fuel consumption.

Keywords: Dependability, K-Means Algorithm, Lifetime Cluster Head, Re-clustering, VMRCD, weighting

Abstract View Paper Research/Original Article Original: Persian
Adaptive Z2 Protection Scheme Based on Setting Groups for Distance Relaying of Wind Farms Integrated Power Systems

Y. Fattahyan, N. Ramezani*, I. Ahmadi

Iranian Journal of Electrical and Electronic Engineering, Volume:18 Issue: 3, Sep 2022, P 2407

Using doubly-fed induction generator (DFIG) based onshore wind farms in power systems may lead to mal-operation of the second zone (Z2) of distance protection due to the uncertain number of available wind turbines on the one hand and the function of DFIGs control system to maintain the bus voltage on the other hand. In such cases, variable injected current by the wind farm causes distance relay fall in trouble to distinguish whether the fault point is in the Z2 operating area or not. In the current study, an adaptive settings scheme is proposed to determine the Z2 setting value of distance relays for such cases. The proposed method is based on the adaptive approach and the settings group facility of the commercial relays. The proposed method applies the k-means clustering approach to decrease the number of setting values calculated by the adaptive approach to the number of applicable settings group in the distance relay and uses the Particle Swarm Optimization (PSO) algorithms to achieve the optimum setting values. The high accuracy of the proposed method in comparison with other methods, suggested in the literatures, is shown by applying them to the IEEE 14-bus grid.

Keywords: DFIG, Distance Relay, K-Means Algorithm, PSO, Setting Groups

Abstract View Paper Research/Original Article Original: English
شناسایی و پیش بینی رفتار مصرف با استفاده از درخت تصمیم و هرم ارزش مصرف مشترکین

عظیمه مظفری*، علیرضا علی احمدی، اعظم مظفری

مجله آب و فاضلاب، پیاپی 139 (خرداد و تیر 1401)، صص 89 -106

آنچه در شرایط کنونی اهمیت دارد، توجه به الگوی رفتار مصرف آب و شناخت مشترکین و مصرف کنندگان با جایگاه بالاتر در هرم ارزش در حوزه سیاست گذاری مدیریت مصرف آب است. هرم ارزش، ابزاری است که مشترکین باارزش از نظر مصرف را شناسایی می کند، بنابراین کاربرد و اهمیت زیادی در حوزه مدیریت مصرف آب به منظور شناسایی مشترکین پرمصرف و کم مصرف دارد. ازاین رو در این پژوهش به منظور شناسایی الگوی رفتاری مصرف مشترکین شرکت آب و فاضلاب شیراز بر مبنای هرم ارزش مصرف آنها و پیش بینی مشترکین با جایگاه بالاتر در هرم ارزش، از تکنیک های داده کاوی استفاده شد. در چارچوب روش پیشنهادی ابتدا داده های مربوط به مصرف مشترکین آب شامل مشترکین مسکونی، تجاری و صنعتی، عمومی و اداری، اماکن مذهبی و آموزشی، نظامی و غیردولتی برای 2 سال متوالی از پایگاه داده شرکت آب و فاضلاب شیراز استخراج شد و پس از تعیین تعداد بهینه خوشه با استفاده از شبکه عصبی خودسازمان ده و شاخص دیویس بولدین، عملیات خوشه بندی با استفاده از الگوریتم Kمیانگین انجام شد. لازم به ذکر است که شاخص ها و معیارهای خوشه بندی مشترکین شامل نوع کاربری، محل سکونت، میزان مصرف، سابقه انشعاب غیرمجاز، تعداد اخطار قطع و زمان پرداخت قبوض هستند که با استفاده از نظر متخصصین مشخص شده اند. در ادامه ضمن محاسبه ارزش مصرف مشترکین هر خوشه و ترسیم هرم ارزش مصرف مشترکین، با استفاده از الگوریتم درخت تصمیم به پیش بینی و کشف الگوهای رفتاری مشترکین پرداخته شد. نتایج نشان داد که مشترکین شرکت آب و فاضلاب شیراز از نظر الگوی رفتار مصرف در 6 خوشه قرار می گیرند که ضمن ترسیم هرم ارزش مصرف، این 6 خوشه در 3 کلاس مشترکین با مصرف زیاد، مشترکین با مصرف متوسط و مشترکین کم مصرف دسته بندی شدند. پس از پیاده سازی درخت تصمیم، صحت درخت برابر با 92/78 بود که بر اساس آن مشترکین این 3 کلاس، 11 الگوی رفتاری داشتند که پیش‎ بینی کننده نوع مصرف بود. به این ترتیب طبق 11 الگوی رفتاری مصرف مشترکین شرکت آب و فاضلاب شیراز می توان میزان مصرف مشترکین جدید را پیش بینی و جایگاه آن در هرم ارزش را تعیین کرد.

کلید واژگان: الگوریتم Kمیانگین، داده کاوی، شبکه عصبی خودسازمان ده، شرکت آب و فاضلاب شیراز، مدیریت مصرف آب، مشترکین آب

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Identification and Prediction of Consumption Behavior Using Decision Tree and Consumer Value Pyramid

Azime Mozafari *, Alireza Ali Ahmadi, Azam Mozafari

Journal of Water & Wastewater, Volume:33 Issue: 139, 2022, PP 89 -106

What is important in the current situation it is important to pay attention to patterns of the water consumption behavior of subscribers and to identify subscribers and consumers with a higher position in the value pyramid in the field of water consumption management policy. Value pyramid is a tool that identifies valuable customers in terms of consumption, and so, has great application and importance in the field of water consumption management to identify high-consumption and low-consumption customers. Therefore, in the present study, in order to identify the patterns of consumption behavior of Shiraz Water and Wastewater Company subscribers based on their consumption value pyramid and to predict the subscribers with a higher position in the value pyramid, data mining techniques have been used. In the framework of the proposed method, first, the data of water subscribers' consumption including residential, commercial and industrial subscribers, public and administrative, religious and educational sites, military and non-governmental, for two consecutive years, were extracted from the database of Shiraz Water and Wastewater Company. After determining the optimal number of clusters using self-organizing map and Davis Bouldin index, clustering operation is performed using K-Means algorithm. It should be noted that the indicators and criteria for subscriber clustering include the type of use, location, consumption, history of unauthorized branching, number of disconnection notices and time of payment of bills which have been identified using the opinion of experts. Then, while calculating the consumption value of subscribers in each cluster and plotting the subscriber consumption value pyramid, the decision tree algorithm is used to predict and discover the behavioral patterns of subscribers. The results show that Shiraz Water and Wastewater Company subscribers are divided into six clusters in terms of consumption behavior patterns. While drawing the consumption value pyramid, these six clusters are classified into three classes: high consumption subscribers, medium consumption subscribers and low consumption subscribers. After implementing the decision tree, the accuracy of the tree was 78.92 that, according to the results of the decision tree, the subscribers of these three classes have 11 patterns of behavior that predict the type of consumption. Thus, according to the 11 behavioral patterns of the subscribers of Shiraz Water and Wastewater Company, the consumption of new subscribers can be predicted and its position in the value pyramid can be determined.

Keywords: K-means Algorithm, Data mining, Self-Organized neural network, Shiraz Water, Wastewater Company, Water consumption management, Water Subscribers

Abstract View Paper Research/Original Article Original: Persian
Performance measurement in the health care sector: A leader-follower network DEA model based on double frontier analysis

E. Vaezi *

Scientia Iranica, Volume:29 Issue: 3, May & Jun 2022, PP 1662 -1684

Measuring the performance of laboratories as one of the most significant areas of healthcare plays a key role in the quality of laboratories management. In this paper, we consider a three-stage network comprised of a leader and two followers in respect to the additional desirable and undesirable inputs and outputs. The suggested model simulates the internal structure of a diagnostic lab (the pre-test, the test and the post-test). The criteria for evaluation are achieved by using the Fuzzy Delphi technique. Due to the social, economic and environmental impacts of health care systems, the significance of sustainability criteria is obvious in the case study indicators. We utilize the non-cooperative approach multiplicative model to measure the efficiency of the overall system and the performances of decision-making units (DMUs) from both the optimistic and pessimistic views. The non-cooperative models from these view cannot be converted into linear models. Therefore, a heuristic method is suggested to convert the nonlinear models into linear models. Finally, after obtaining the efficiencies based on the double-frontier view, the DMUs are ranked and classified into three clusters by the k-means algorithm.

Keywords: Network DEA, Medical Diagnostic Laboratories, Sustainability, Non-Cooperative Game, Double-frontier, Additional Inputs, undesirable outputs, K-means algorithm

Abstract View Paper Research/Original Article Original: English
یک الگوریتم مبتنی بر افرازبندی گراف برای خوشه بندی سامانه های نرم افزاری با ابعاد بزرگ

بابک پوراصغر*، حبیب ایزدخواه، شهریار لطفی، خیام صالحی

فصلنامه پردازش علائم و داده ها، سال هجدهم شماره 4 (پیاپی 50، زمستان 1400)، صص 37 -47

از روش های خوشه بندی برای بازیابی ساختار نرم افزار جهت فهم درست آن و همچنین بازسازی نرم افزار استفاده می شود. در ادبیات موضوع، بیشتر الگوریتم های ارایه شده برای خوشه بندی سامانه های نرم افزاری به دو دسته الگوریتم های مبتنی بر جستجو و الگوریتم های سلسله مراتبی طبقه بندی می شوند و الگوریتمی از رده مبتنی بر افراز برای خوشه بندی یک سامانه نرم افزاری ارایه نشده است. این روش ها سعی دارند که گراف وابستگی موجودیت به دست آمده از کد منبع سامانه نرم افزاری را به چند مجموعه راسی افراز کنند. در سامانه های نرم افزاری، موجودیت می تواند رده، تابع و یا یک فایل باشد. با توجه به چندجمله ای غیر قطعی، سخت بودن مساله خوشه بندی، در سال های اخیر از روش های تکاملی و مبتنی بر جستجو مانند الگوریتم ژنتیک برای این حل این مساله، زیاد استفاده شده است. هر چند این الگوریتم ها در برخی موارد می توانند ساختار مناسبی از نرم افزار را به دست آورند، اما برای نرم افزار های با ابعاد بزرگ، با توجه به زمان اجرا و حافظه مصرفی زیاد، قابل اجرا نیستند؛ همچنین، این روش ها از اطلاعات و دانش گرافی موجود در گراف وابستگی موجودیت استفاده ی چندانی نمی کنند. در این مقاله یک الگوریتم مبتنی بر افراز ارایه شده است که بتوان از آن در خوشه بندی نرم افزار نیز استفاده کرد. همچنین، یک نوع فاصله جدید برای قیاس تشابه و عدم تشابه ارایه شده است. انتظار می رود روش پیشنهادی بتواند در قیاس با سایر روش های موجود، خوشه بندی هایی با کیفیت بالاتر و نزدیک به خوشه بندی فرد خبره، تولید کند. برای بررسی صحت اجرای الگوریتم، آن را بر روی نرم افزار موزیلا فایرفاکس اجرا کرده و نتایج را با الگوریتم های مطرح این حوزه، مقایسه کرده ایم.

کلید واژگان: مهندسی نرم افزار، مهندسی معکوس، خوشه بندی نرم افزار، الگوریتم K-means

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

A partition-based algorithm for clustering large-scale software systems

Babak Pourasghar*, Habib Izadkhah, Shahriar Lotfi, Khayyam Salehi

Signal and Data Processing, Volume:18 Issue: 4, 2022, PP 37 -47

Clustering techniques are used to extract the structure of software for understanding, maintaining, and refactoring. In the literature, most of the proposed approaches for software clustering are divided into hierarchical algorithms and search-based techniques. In the former, clustering is a process of merging (splitting) similar (non-similar) clusters. These techniques suffered from the drawbacks such as finiteness criterion and arbitrary decisions occurred in the process. Because of the NP-hardness of clustering software systems, evolutionary and search-based algorithms are more commonly used algorithm than hierarchical ones. In evolutionary algorithms, the clustering of software systems is considered as a problem of searching over some possible clustering candidates. Although these algorithms are often able to achieve an appropriate structure of the software, they are not applicable in clustering large-scale software. Furthermore, these algorithms are unable to consider the knowledge in the artifact dependency graph, which extracted from the source code of the software. In software systems, an artifact can be everything like a class, a function, or a file. In this paper, a new partition-based clustering algorithm is presented. This algorithm attempts to partition the artifact dependency graph considering the knowledge therein. Moreover, a new distance criterion is presented to measure the similarity and dissimilarity of the artifacts. The proposed algorithm starts with the artifact dependency graph and creates the similarity matrices of the artifacts. So, it attempts to refine the partition candidate until a fixed point is reached. We expect that the proposed method compared with other methods could lead to achieve the clustering with high quality and similar to the expert's clustering based on MoJo-FM measure. To demonstrate the applicability and validity of the proposed algorithm, a large-scale case study, Mozilla Firefox, is employed. The results demonstrate that the proposed algorithm outperforms the commonly used evolutionary methods in the literature.

Keywords: Software Engineering, Reverse Engineering, Software Clustering, K-means algorithm

Abstract View Paper Research/Original Article Original: Persian
Braking intensity recognition with optimal K-means clustering algorithm

Ali Mirmohammad Sadeghi, Abdollah Amirkhani *, Behrooz Mashadi

Journal of Computational and Applied Research in Mechanical Engineering, Volume:11 Issue: 2, Winter-Spring 2022, PP 409 -423

Recognizing a driver’s braking intensity plays a pivotal role in developing modern driver assistance and energy management systems. Therefore, it is especially important to autonomous and electric vehicles. This paper aims at developing a strategy for recognizing a driver’s braking intensity based on the pressure produced in the brake master cylinder. In this regard, a model-based, synthetic data generation concept is used to generate the training dataset. This technique involves two closed-loop controlled models: an upper-level longitudinal vehicle dynamics model and a lower-level brake hydraulic dynamic model. The adaptive particularly tunable fuzzy particle swarm optimization algorithm is recruited to solve the optimal K-means clustering. By doing so, the best number of clusters and positions of the centroids can be determined. The obtained results reveal that the brake pressure data for a vehicle traveling the new European driving cycle can be best partitioned into two clusters. A driver’s braking intensity may, therefore, be clustered as moderate or intensive. With the ability to automatically recognize a driver’s pedal feel, the system developed in this research could be implemented in intelligent driver assistance systems as well as in electric vehicles equipped with intelligent, electromechanical brake boosters.

Keywords: Vehicle safety systems, Clustering, K-means Algorithm, Hydraulic brake system

Abstract View Paper Research/Original Article Original: English
مقایسه عملکرد الگوریتم های ماشین بردار پشتیبان و میانگین کی در استخراج چرخه واقعی رانندگی ترکیبی تهران- آمل

تابان مهر قرائتی، علی مومنی موحد، محمد آزادی*، سید اشکان موسویان

مجله مهندسی مکانیک امیرکبیر، سال پنجاه و سوم شماره 9 (آذر 1400)، صص 5017 -5036

چرخه های رانندگی داده هایی بر حسب سرعت و زمان هستند که از آن ها در طراحی خودروها، مدیریت حمل و نقل و سوخت، تبیین و بهسازی شاخص های استاندارد استفاده می شود. در این پژوهش، چهار چرخه رانندگی ترکیبی بر اساس داده های واقعی استخراج گردید. برای رسیدن به این هدف، داده برداری با استفاده از یک خودروی سواری با موتور بنزین سوز به روش تعقیب خودرو ، در مسیری از تهران به آمل، تحت شرایط رانندگی واقعی انجام شد. پس از آن در نرم افزار متلب، با استفاده از الگوریتم های ماشین بردار پشتیبان و میانگین کی و با در نظر گرفتن مقادیر میان برد و میانگین به عنوان مراکز دسته ها، یک کد برای تولید چرخه های مورد نظر، و محاسبه پارامترهای مشخصه ی آن ها، نظیر سرعت متوسط، درصد زمان پیمایش خودرو در حالت های درجا، بدون شتاب، شتابگیری مثبت و منفی ایجاد شد. سپس این چرخه ها بر اساس میانگین خطای نسبی، خطای ریشه میانگین مربع و آزمون مربع چی با یکدیگر مورد مقایسه قرار گرفتند. نتایج نشان داد اگرچه، چرخه های استخراج شده توسط ماشین بردار پشتیبان به بازه زمانی مجاز (کمتر از 1800 ثانیه) نزدیک ترند، اما چرخه استخراج شده توسط الگوریتم میانگین کی و میانگین به عنوان مراکز دسته های ایجاد شده، کمترین خطاها را ثبت نموده است. این چرخه علاوه بر آن که بیشتر زمان خود را در حال حرکت شتابدار بوده است، دامنه نوسانات شتابی وسرعتی بیشتری را نسبت به دیگر چرخه های مورد مقایسه گزارش نمود.

کلید واژگان: چرخه رانندگی، ماشین بردار پشتیبان، الگوریتم میانگین کی، چرخه ترکیبی، دسته بندی و طبقه بندی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Comparison of Support Vector Machine and K-Means Algorithms Performance in Extracting the Real Driving Cycle of Combined Tehran-Amol

Tabanmehr Qaraati, Ali Momeni Movahed, Mohammad Azadi *, Seyed Ashkan Moosavian

Amirkabir Journal Mechanical Engineering, Volume:53 Issue: 9, 2021, PP 5017 -5036

Driving cycles represent the vehicle speed as a function of time and are used in vehicle design, fuel management, and the improvement of standard indicators. In this study, four combined driving cycles were extracted using real data. The data was collected from a passenger car with a gasoline engine under real driving conditions while driven from Tehran to Amol based on the car chasing method. A code was generated in MATLAB software to create the desired cycle using support vector machine and K-means algorithms considering mid-range and mean values as group centers. The characteristic parameters of the cycles such as the average speed and the percentage of the car travel time at idle, cruise, accelerating, and decelerating conditions were also calculated. These cycles were compared based on the mean relative error, the root-mean-square error, and the Chi-square test. The results showed that the cycles extracted by the support vector machine were closer to the allowable time interval (less than 1800 seconds); however, the cycle extracted by the K-means algorithm with the mean value as the centers of the generated categories, recorded the least errors. This cycle, in addition to spending most of its time in accelerated motion, represented a greater amplitude of acceleration and velocity fluctuations than other cycles.

Keywords: Driving Cycle, Support Vector Machine, K-Means Algorithm, Combined Cycle, Classification, And Clustering

Abstract View Paper Research/Original Article Original: Persian
A novel selective clustering framework for appropriate labeling of clusters based on K-means algorithm

Fateme Moslehi, Abdorrahman Haeri *, MohammadReza Gholamian

Scientia Iranica, Volume:27 Issue: 5, Sep Oct 2020, PP 2621 -2634

Clustering is one of the main methods of data mining. K-means algorithm is one of the most common clustering algorithms due to its efficiency and ease of use. One of the challenges of clustering is to identify the appropriate label for each cluster. The selection of a label is done in such a way as to provide a proper description of the cluster records. In some cases, choosing the appropriate label is not easy due to the results and structure of each cluster. The aim of this study is to present an algorithm based on the K-means clustering in order to facilitate the allocation of labels to each cluster. Moreover, in many data mining issues, the data set contains a large number of fields and therefore, the identification of the fields and the extraction of subsets from the required fields is an important issue. With the help of the proposed algorithm, the important and influential variables of the data set would be identified and the subset of the required fields would be selected.

Keywords: Clustering, K-means algorithm, Feature selection, labeling of the clusters

Abstract View Paper Research/Original Article Original: English
مدل سازی مزرعه بادی مجهز به ژنراتور القایی دوسوتغذیه با معادل سازی بخش مکانیکی با استفاده از خوشه بندی بر اساس ضریب جذب توان

سامان احمدی، ایمان پورفر*، امین ساکی

فصلنامه مدل سازی در مهندسی، پیاپی 60 (بهار 1399)، صص 169 -182

با توجه به افزایش چشم گیر استفاده از توان تولیدی توربین های بادی و توسعه روزافزون مزارع بادی، دیگر تاثیر آن ها بر روی سیستم های قدرت قابل چشم پوشی نبوده و مدل سازی مناسب مزارع بادی از اهمیت خاصی برخوردار شده است. در بسیاری از مطالعات، هنگام مطالعه اثر یک مزرعه بادی روی سیستم قدرت، به جای مدل سازی دقیق همه توربین های موجود در مزرعه، عملکرد مزرعه بادی در نقطه کوپل مشترک با شبکه به صورت عملکرد مزرعه معادلی شامل یک ژنراتور و یک توربین معادل، که از معادل سازی توربین های بادی مزرعه واقعی حاصل شده است، در نظر گرفته می شود. این موضوع علاوه بر اینکه موجب سادگی تحلیل سیستم می شود، کاهش چشمگیر بار محاسباتی مطالعات را نیز به دنبال خواهد داشت. برای بهتر شدن دقت می توان بخش مکانیکی را با استفاده از روش های دسته بندی بجای یک توربین با چند توربین معادل سازی کرد. تاکنون روش های مختلفی برای انجام این نوع معادل سازی در مزارع بادی معرفی شده اند که همه آنها از ویژگی سرعت باد به عنوان شاخص دسته بندی توربین ها استفاده می کنند. در این مقاله برای دسته-بندی توربین های بادی بجای سرعت باد استفاده از ضریب جذب توان توربین بادی به عنوان ویژگی دسته بندی پیشنهاد شده است. سپس در ادامه با معادل سازی بخش مکانیکی توربین ها در هریک از خوشه ها نتایج با حالتی که دسته بندی بر اساس سرعت باد صورت گرفته باشد و همینطور با مدل دقیق مورد مقایسه قرار گرفته است. برای شبیه سازی از نرم افزار MATLAB استفاده شده است. نتایج حاکی از آنست که روش مورد استفاده، موجب بهبود دقت نتایج حاصل از معادل سازی خواهد شد.

کلید واژگان: مزرعه بادی، ژنراتور القایی دوسو تغذیه، کاهش مدل، الگوریتم k-means

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Modeling Doubly Fed Induction Generator Based Wind Farms by Aggregating the Mechanical Parts Using Clustering by Power Coefficient

Saman Ahmadi, Iman Pourfar *, Amin Saki

Journal of Modeling in Engineering, Volume:18 Issue: 60, 2020, PP 169 -182

Due to the increase in wind power production and the increasing development of wind farms its impact on power systems cannot be neglected and modeling of wind farms has been converted to an important topic in power engineering. When studying the effects of a wind farm on the power system, Instead of modeling the wind farm in detail, the performance of the wind farm at the point of the common coupling can be considered as an equivalent model includes an equivalent generator and an equivalent turbine. This approach in addition to simplify the system analysis would considerably reduce the simulations burden. In order to improve the accuracy of this equivalent model the mechanical parts of the wind turbines can be replaced with more than one equivalent turbine using clustering algorithms. Traditionally these algorithms consider the wind speed as the feature for clustering. In this paper it is proposed to classify the wind turbines by power coefficient of each turbine instead of the wind speed. The simulation results obtained using this approach is verified against the conventional approach and also the detailed model. Simulations are done in MATLAB. The results indicate that the accuracy of the equivalent model obtained using proposed approach, is considerably improved.

Keywords: Wind Farm, Doubly Fed Induction Generator (DFIG), Equivalent Model, K-Means Algorithm

Abstract View Paper Research/Original Article Original: Persian
Extracting Tehran Refuse Collection Truck Driving Cycle and Estimating the Braking Energy

Sohrab Pakdel Bonab*, Afshin Kazerooni, Gholamhassan Payganeh, Mohsen Esfahanian

Automotive Science and Engineering, Volume:10 Issue: 1, Winter 2020, PP 3149 -3157

Driving cycle is used to assess fuel consumption, pollutant emissions and performance of the vehicle. The aim of this paper is to extract the driving cycle for refuse collection truck and estimate its braking energy. For this purpose, after selecting the target truck and geographic area, the equipment needed to measure the required variables were prepared and mounted on the truck. Then, the actual data were collected from the performance of the target Truck while performing its mission. Since the amount of braking energy depends on the speed, truck mass and road grade, the speed of the vehicle is measured simultaneously with the truck mass and road grade. The collected data are then processed and subdivided into micro-trips. The micro-trips are clustered according to the number of state spaces using the K-Means algorithm. Next, the representative micro trips are selected from within the clusters and the final driving cycle is generated. The representative driving cycle shows that the truck speed is zero at 47% of the working time. Finally, the amount of braking power and accumulative braking energy in the driving cycle is calculated.

Keywords: Driving cycle, Driving characteristics, Refuse collection truck, Braking energy, K-Means algorithm, Micro trips

Abstract View Paper Research/Original Article Original: English
بازشناسی کور کدهای بلوکی در حضور نویز بالا با استفاده از روش های آماری

شیدا جهاندار، علی آقاگل زاده، سید جواد کاظمی تبار*

مجله علوم و فناوری های پدافند نوین، سال دهم شماره 4 (پیاپی 38، زمستان 1398)، صص 373 -381

بازشناسی کور پارامترهای کدهای تصحیح خطای مستقیم از روی رشته بیت دریافتی در سمت گیرنده، در کاربردهای نظامی و تجاری بسیار مورد توجه قرار گرفته است. در واقع شناسایی طرح کدینگ استفاده شده در فرستنده بدون هیچ گونه اطلاعات قبلی، یک عمل چالشی است که توسط دشمن انجام می گیرد. ازآنجایی که برای شنود نیاز به کد برداری بیت های کدشده در مبدا است، لازم است ابتدا مشخصات کد استفاده شده توسط دشمن شناسایی گردد. یکی از این مشخصات طول کد و نیز طول بیت های توازن مورد استفاده است. روش های مختلفی برای بازشناسی کور کدهای تصحیح خطای کانال ارائه گردیده است. در این مقاله سعی بر این است تا روشی آماری برای بازشناسی طول کلمه کد و طول بلوک اطلاعات ارائه شود که ضمن مقاومت نسبت به افزایش خطا، عملکرد آن با افزایش طول کد نیز کاهش نیابد. بدین منظور با استفاده از برخی الگوریتم های خوشه بندی ازجمله الگوریتم K-Means و الگوریتم Jenks Natural Breaks و درنهایت با ارائه یک الگوریتم ابتکاری به بررسی این موضوع برای کدهای بلوکی خطی باینری سامانمند پرداخته می شود. نتایج شبیه سازی در نرم افزار متلب نشان می دهند که روش های پیشنهادی در این مقاله علاوه بر پیچیدگی محاسباتی پایین و سرعت اجرای بالا، نتایج مطلوبی در شناسایی پارامترهای کدهای بلوکی سامانمند با طول های بلند و درصد بالایی از خطا را دارند.

کلید واژگان: بازشناسی کور، کدهای بلوکی سیستماتیک، واریانس، الگوریتم K-means، الگوریتم Jenks natural breaks

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Blind Recognition of Block Code Parameters in the Presence of High SNR Using Statistical Techniques

Sheida Jahandar, Ali Aghagolzadeh, Javad Kazemitabar *

Journal of Passive Defence Science and Technology, Volume:10 Issue: 4, 2020, PP 373 -381

Blind recognition of error correction codes parameters from intercepted bit-stream at the receiver side, is highly considered in military and commercial applications. In fact, identification of the encoding scheme used in the transmitter without any prior information, is a challenging task to the adversary. Several methods have been presented for blind code recognition. In this paper, a statistical method for recognition of the length of the code word and the length of the block of information is presented. This scheme not only is resistant to error, but also its performance sustains in long codes. In this work, the method has been tested using some clustering algorithms such as K-Means and Jenks Natural Breaks. Then, a novel method to extract features of systematic binary linear block codes has been presented. Simulation results in MATLAB show that the proposed method, in addition to having low computational complexity and high performance rate, have an acceptable result in identifying systematic block codes with long lengths and even at high error levels.

Keywords: Blind Recognition, Systematic Block Codes, Variance, K-Means Algorithm, Jenks Natural Breaks Algorithm

Abstract View Paper Research/Original Article Original: Persian
Decision Support Approach on Occupational Safety Using Data Mining

Naghmeh Khosrowabadi, Rouzbeh Ghousi*, Ahmad Makui

International Journal of Industrial Engineering and Productional Research, Volume:30 Issue: 2, Jun 2019, PP 149 -164

With regard to the industry's development, occupational safety is a key factor in protecting the worker's health, achieving organizational goals and increasing productivity. Therefore, research is needed to investigate the factors affecting occupational safety. This research, based on the information gathered from the paint halts of one of the industrial units of Tehran, uses data mining technique to identify the important factors.Initially with Literature review to 2018, an insight into existing approaches and new ideas earned. Then, with a significant 5600 units of data, the results of the charts, association rules and K-means algorithm were used to extract the latent knowledge with the least error without human intervention from the six-step methodology of Crisp for data mining.The results of charts, association rules, and K-means algorithm for clustering are in a line and have been successful in determining effective factors such as important age groups and education, identifying important events, identifying the halls and finally, the root causes of major events that were the research questions.The results reveal the importance of very young and young age with often diploma education and low experience, in major accidents involving bruising, injury, and torsion, often due to self-unsafe act and unsafe conditions as slipping or collision with things. In addition, the important body members, hands and feet in the color retouching and surface color cabins are more at risk. These results help improve safety strategies. Finally, suggestions for future research were presented.

Keywords: Occupational Safety, Data mining, CRISP, Association rules, K-means algorithm

Abstract View Paper Research/Original Article Original: English
تعیین مناطق همگن هیدرولوژیکی در غرب حوضه هامون-جازموریان.

افشین جهان شاهی، کاکا شاهدی*، کریم سلیمانی، علیرضا مقدم نیا

مجله تحقیقات منابع آب ایران، سال پانزدهم شماره 1 (پیاپی 48، بهار 1398)، صص 223 -235

خصوصیات گسترده هیدرولوژیکی در دسترس (داده های بارش، دما، جریان و مشخصات فیزیوگرافی حوضه ها) می توانند برای استخراج حوضه های مشابه هیدرولوژیکی مورد استفاده قرار گیرند. در این تحقیق از یک الگوریتم خوشه بندی در روش تحلیل خوشه ایبه عنوان یک روش جدید و کارامد، برای گروه بندی حوضه های آبخیز به چندین گروه یا خوشه استفاده شد. به منظور درک تشابه هیدرولوژیکی از 28 ویژگی (توصیف گر) موقعیت جغرافیایی، فیزیوگرافی، اقلیمی و کاربری اراضی مربوط به 15 حوضه با خصوصیات ناهمگن واقع در بخش غربی حوضه هامون-جازموریان استفاده شد. در محیط نرم افزار RStudio با استفاده از الگوریتم PCA مولفه ها و ویژگی های اصلی تعیین، سپس تعداد خوشه های بهینه با معیار دیویس-بولدین مشخص و با الگوریتم k-means حوضه ها به کلاس های همگن خوشه بندی گردیدند. نتایج نشان داد که ویژگی های عرض جغرافیایی مرکزثقل، مساحت، طول رودخانه اصلی، ارتفاع ایستگاه آبسنجی، شیب و درصد مساحت مراتع فقیر به عنوان ویژگی های اصلی از بین 28 ویژگی می باشند، همچنین معیار دیویس-بولدین برای تعداد خوشه های برابر با 3، مقدار 2/46 بدست آمد که مبین تعداد خوشه ها در الگوریتم k-means می باشد. پس از خوشه بندی حوضه ها مشخص گردید که اکثر حوضه های موجود در خوشه های یکسان از نظر مکانی در مجاورت یکدیگر قرار دارند. نتایج حاصل از این تحقیق ما را قادر به تفسیر رفتار هیدرولوژیکی منطقه مطالعاتی برای اهدافی نظیر تعمیم دهی جریان در حوضه های فاقد آمار این منطقه و تحلیل فراوانی منطقه ای سیلاب می سازد.

کلید واژگان: خوشه بندی، الگوریتم K-means، الگوریتم PCA، معیار دیویس-بولدین

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Determination of Hydrological Homogenous Regions in the West of Hamoun-Jazmourian River Basin

Afshin Jahanshahi, Kaka Shahedi *, Karim Solaimani, Alireza Moghaddam Nia

Iran Water Resources Research, Volume:15 Issue: 1, 2019, PP 223 -235

Extensive available hydrological characteristics (precipitation, temperature, streamflow data and physiographic attributes of catchments) can be used to extract hydrological similar catchments. In this research cluster analysis as a new and effective method, was used for grouping catchments into several groups or clusters. In order to understanding the hydrologic similarity, 28 characteristics (descriptors) of location, physiographic, climatic and land use of 15 catchments with heterogeneous characteristics located in the western part of the Hamoun-Jazmourian river basin were used. Selecting of characteristics were done based on the hydrological response specification which provided insight into the hydrologic performance of the catchments. In RStudio software, using PCA algorithm, the components and main characteristics were extracted, then the number of optimum clusters with the Davies-Bouldin criterion was determined and the clustering of the catchments into homogenous classes was performed using k-means algorithm. The results showed that the latitude of gravity center, area, length of main river, height of hydrometric gauge, slope and percentage of poor rangelands are as the main attributes from 28 attributes, also, the Davies-Bouldin criterion was 2.46 for the number of clusters equal to 3, which indicates the number of clusters in the k-means algorithm. After clustering the catchments, it was determined that most of the catchments in the same clusters are located in the vicinity of each other. The results of this study enable us to interpret the hydrologic behavior in the study area for purposes such as streamflow regionalization in ungauged catchments of this region and regional flood frequency analysis.

Keywords: Clustering, K-means Algorithm, PCA Algorithm, Davies-Bouldin criterion

Abstract View Paper Research/Original Article Original: Persian
تجزیه و تحلیل مشتریان مخابرات خراسان جنوبی با گسترش مدل RFM به LRFM

وحید باباییان، سیده عطیه سرفرازی

مجله هوش مصنوعی و داده کاوی، سال هفتم شماره 2 (Spring 2019)، صص 331 -340

مشاهده متن مقاله پژوهشی/اصیل زبان: انگلیسی

Analyzing Customers of South Khorasan Telecommunication Company with Expansion of RFM to LRFM Model

V. Babaiyan *, Seyyede A. Sarfarazi

Journal of Artificial Intelligence and Data Mining, Volume:7 Issue: 2, Spring 2019, PP 331 -340

Telecommunication Companies use data mining techniques to maintain good relationships with their existing customers and attract new customers and identifying profitable/unprofitable customers. Clustering leads to better understanding of customer and its results can be used to definition and decision-making for promotional schemes. In this study, we used the 999-customer purchase records in South Khorasan Telecommunication Company which has been collected during a year. The purpose of this study is to classify customers into several clusters. Since the clusters and the number of their members are determined, high-consumption users will be logged out of the system and high-value customers who are missed will be identified. In this research we divided our customers into five categories: loyal, potential, new, missed and high-consumption by using the Clementine software, developing the RFM model to the LRFM model and TwoStep and k_Means algorithms. Thus, this category will be a good benchmark for company's future decisions and we can make better decisions for each group of customers in the future.

Keywords: LRFM Model, TwoStep algorithm, k-Means algorithm

Abstract View Paper Research/Original Article Original: English

نکته

نتایج بر اساس تاریخ انتشار مرتب شده‌اند.
کلیدواژه مورد نظر شما تنها در فیلد کلیدواژگان مقالات جستجو شده‌است. به منظور حذف نتایج غیر مرتبط، جستجو تنها در مقالات مجلاتی انجام شده که با مجله ماخذ هم موضوع هستند.
در صورتی که می‌خواهید جستجو را در همه موضوعات و با شرایط دیگر تکرار کنید به صفحه جستجوی پیشرفته مجلات مراجعه کنید.

به جمع مشترکان مگیران بپیوندید!

k-means algorithm