Fuzzy clustering analysis of compositional data and comparing it with exploratory compositional data dendrogram, case study: Anar region stream sediments geochemistry

Abstract:
Summary: One of the most important methods in unsupervised datamining is clustering that when applied on variables leads to dimension reduction. Among all of them, fuzzy clustering methods are preferred because of special features and better flexibility in partitioning groups. In this study, FANNY algorithm proposed by Kauffmann and Rousseuw has been applied in variable clustering of the geochemical stream sediments that have a compositional nature. Referring to the extensive recent researches and novel methods presented in opening compositional data, another definition of distance is needed for them to be transformed isometrically to the euclidean space to be interpretable with classical operations. In this case study after preparation of geochemical stream sediments data of Anar region in Kerman, first the exploratory dendrogram of the simplex space was plotted and 4 clusters were obtained. Then using fanny algorithm, clr-transformed variables were clustered. It showed an acceptable conformity with the dendrogram results. In case of determining the balances of SBP manually instead of default and with a prior knowledge, the results of exploratory dendrogram would be more precise.
Introduction
Geochemical exploration based on stream sediment analysis, is one of the most important methods in assessing mineral potentials in prospecting brownfield areas. Different statistical methods have been developed to identify the pattern of groups of associated geochemical elements in the last decades. In this research, stream sediments data clustering of Anar exploratory region have been analyzed with a particular perspective of the closed nature of geochemical datasets using two known methods, fuzzy clustering and exploratory dendrogram.
Methodology and Approaches: First, using R software compositions-package, exploratory dendrogram of compositional data was calculated and plotted based on ward criterion and default sequential binary partition balances in simplex space. Due to applying this method, 4 clusters were detected. Then by applying fanny algorithm (cluster package) –one of the most flexible ones in fuzzy clusterings –on clr- transformed data, 4 clusters with the best silhouette were determined. The fuzzification degree was selected in a way that would be near to crisper methods like dendrogram in order to compare the results.
Results and
Conclusions
Although different methods applied on transformed compositional data, their similar results showed very good conformity with lithology and geological structures. It presented a good separation in simplex space. If the balances in SBP are to be defined manually, the reduced dimensions of the variables would be more informative.
Language:
Persian
Published:
Journal of Aalytical and Numerical Methods in Mining Engineering, Volume:6 Issue: 12, 2017
Pages:
11 to 19
magiran.com/p1668048  
دانلود و مطالعه متن این مقاله با یکی از روشهای زیر امکان پذیر است:
اشتراک شخصی
با عضویت و پرداخت آنلاین حق اشتراک یک‌ساله به مبلغ 1,390,000ريال می‌توانید 70 عنوان مطلب دانلود کنید!
اشتراک سازمانی
به کتابخانه دانشگاه یا محل کار خود پیشنهاد کنید تا اشتراک سازمانی این پایگاه را برای دسترسی نامحدود همه کاربران به متن مطالب تهیه نمایند!
توجه!
  • حق عضویت دریافتی صرف حمایت از نشریات عضو و نگهداری، تکمیل و توسعه مگیران می‌شود.
  • پرداخت حق اشتراک و دانلود مقالات اجازه بازنشر آن در سایر رسانه‌های چاپی و دیجیتال را به کاربر نمی‌دهد.
In order to view content subscription is required

Personal subscription
Subscribe magiran.com for 70 € euros via PayPal and download 70 articles during a year.
Organization subscription
Please contact us to subscribe your university or library for unlimited access!