Clustering of Iranian synoptic stations based on meteorological and geographical parameters
Author(s):
Article Type:
Research/Original Article (دارای رتبه معتبر)
Abstract:
Clustering is an instrument that divides existing data into different groups. Generally, the number of clusters is determined based on the least changes within the group and the most changes outside the group. The study area is country of Iran. Coordinates of longitude, latitude, altitude, average temperature, relative humidity and total monthly rainfall of 420 synoptic stations from its establishment until 2018 have been used in this study. After reviewing, screening and repairing the data, only 375 stations remained to continue the research. Due to the length of the statistical period is an important factor influencing clustering, the stations are statistically divided into three periods: less than 5 years with 42 stations; 1-6 years with 33 stations and more than 10 years with 300 stations, were classified. Seven methods of hierarchical clustering (3 subsets), separation (2 subsets) and ward (2 subsets) have been used in this study. Cophenetic correlation coefficient, Silhouette width test are two indicators of clustering and selection. The coding was performed in R statistical software. Based on the Cophenetic and Silhouette coefficient indices, the best number and method of clustering for 1-5-year data are 4 clusters with the middle axis separation method, for the data of 6-10 years are 5 clusters with the mean-centered hierarchical method and for stations with a statistical period of more than 10 years are 4 clusters with the separation average axis method. The zoning of the clusters is plotted on the geographical map of Iran using ARCGIS software for all three categories. Keywords: Clustering, Geographical coordinates, Synoptic, Iran.Clustering is an instrument that divides existing data into different groups. Generally, the number of clusters is determined based on the least changes within the group and the most changes outside the group. The study area is country of Iran. Coordinates of longitude, latitude, altitude, average temperature, relative humidity and total monthly rainfall of 420 synoptic stations from its establishment until 2018 have been used in this study. After reviewing, screening and repairing the data, only 375 stations remained to continue the research. Due to the length of the statistical period is an important factor influencing clustering, the stations are statistically divided into three periods: less than 5 years with 42 stations; 1-6 years with 33 stations and more than 10 years with 300 stations, were classified. Seven methods of hierarchical clustering (3 subsets), separation (2 subsets) and ward (2 subsets) have been used in this study. Cophenetic correlation coefficient, Silhouette width test are two indicators of clustering and selection. The coding was performed in R statistical software. Based on the Cophenetic and Silhouette coefficient indices, the best number and method of clustering for 1-5-year data are 4 clusters with the middle axis separation method, for the data of 6-10 years are 5 clusters with the mean-centered hierarchical method and for stations with a statistical period of more than 10 years are 4 clusters with the separation average axis method. The zoning of the clusters is plotted on the geographical map of Iran using ARCGIS software for all three categories. Keywords: Clustering, Geographical coordinates, Synoptic, Iran.Clustering is an instrument that divides existing data into different groups. Generally, the number of clusters is determined based on the least changes within the group and the most changes outside the group. The study area is country of Iran. Coordinates of longitude, latitude, altitude, average temperature, relative humidity and total monthly rainfall of 420 synoptic stations from its establishment until 2018 have been used in this study. After reviewing, screening and repairing the data, only 375 stations remained to continue the research. Due to the length of the statistical period is an important factor influencing clustering, the stations are statistically divided into three periods: less than 5 years with 42 stations; 1-6 years with 33 stations and more than 10 years with 300 stations, were classified. Seven methods of hierarchical clustering (3 subsets), separation (2 subsets) and ward (2 subsets) have been used in this study. Cophenetic correlation coefficient, Silhouette width test are two indicators of clustering and selection. The coding was performed in R statistical software. Based on the Cophenetic and Silhouette coefficient indices, the best number and method of clustering for 1-5-year data are 4 clusters with the middle axis separation method, for the data of 6-10 years are 5 clusters with the mean-centered hierarchical method and for stations with a statistical period of more than 10 years are 4 clusters with the separation average axis method. The zoning of the clusters is plotted on the geographical map of Iran using ARCGIS software for all three categories. Keywords: Clustering, Geographical coordinates, Synoptic, Iran.
Keywords:
Language:
Persian
Published:
Journal of Climate Research, Volume:14 Issue: 55, 2024
Pages:
15 to 28
https://www.magiran.com/p2721051
سامانه نویسندگان
مقالات دیگری از این نویسنده (گان)
-
Development of the framework of an irrigation management optimization model considering crop rotation
Mohammadali Boush, *, Seyed Mohammadreza Naghedifar, Hussin Banjad, Sedigheh Sadeghi
Iranian Journal of Soil and Water Research, -
Scheduling and optimal delivery of water in irrigation networks by combining the AquaCrop model and genetic algorithm
Parisa Kahkhamoghadam, Ali Naghi Ziaei *, , Amin Kanooni, Sedigheh Sadeghi
Journal of Water and Soil Management and Modeling,