A Density-Based Fuzzy Clustering Algorithm Using Multi-Representatives Points Per Cluster Based on a New Distance Measure Using KNN Algorithm
Author(s):
Article Type:
Research/Original Article (دارای رتبه معتبر)
Abstract:
In analyzing phenomena around us, clustering is among the most commonly used techniques in machine learning for comparing, and categorizing them into different groups based on intrinsic features. One of the main challenges facing clustering algorithms is selecting a suitable representative for each cluster. Existing algorithms often choose a single representative, which can lead to suboptimal performance on many datasets (especially asymmetric datasets). This process is completely dependent on the type of internal distribution of the clusters, and that single point may not be a suitable representative for that cluster. The proposed algorithm for dealing with datasets, inspired by the fuzzy ALM method and avoiding complex formulas, and calculations, initially breaks the system down into simpler (two-dimensional) systems. After spreading ink drops, by finding the vertical Narrow path and the horizontal narrow path, it selects a set of points as the representation of each cluster. The proposed algorithm, unlike many conventional algorithms, provides a representative set for each cluster and also enhances the algorithm's performance in dealing with datasets that have an asymmetric structure by introducing a new distance measure based on the KNN method and utilizing the set of prime numbers. The Accuracy, F1-Score, and AMI achieved when working with many low-dimensional, and high-dimensional datasets has been higher compared to algorithms such as FUALM, HiDUALM, K-Means, DBSCAN, DENCLUE and IRFLLRR and in some cases, the achieved accuracy has been equal to 100 percent.
Keywords:
Language:
English
Published:
International Journal of Engineering, Volume:38 Issue: 12, Dec 2025
Pages:
2865 to 2876
https://www.magiran.com/p2867242