Text mining: Concepts and methods

Message:
Article Type:
Research/Original Article (دارای رتبه معتبر)
Abstract:

nowadays, a huge amount of available information on the web is text documents and articles. Text mining is a way to extract unstructured and semi-structured information from this available information on the Internet and Also, mining process of the text of knowledge and unknown, incomprehensible and potential patterns among the multitude of datasets. This research is a type of library studies. Although text mining methods are mostly based on Latin sources, but by searching Persian databases, we have found over the past decade, the subject of text mining has become doubly important for Iranian researchers, especially students of computer science and information technology; So that a significant part of the conference papers related to computer science and technology are articles related to this field. Research findings show that text mining is an application of data mining and the main difference between them is : the extraction of patterns from text with natural language in text mining, while data mining operates on structured databases. Text mining processes have two main phases: document preprocessing and knowledge extraction. So far, eight techniques have been introduced for text mining which are: Information extraction, information retrieval, text summarization, classification, clustering, visualization, natural language processing and belief mining. In recent years, much attention has been paid to text mining in the international and national spheres. The dramatic increase in textual data has prompted researchers to look for ways to explore this data. Naturally, Iranian researchers have been no exception. Text mining, with all its methods and techniques, is an effort to assist researchers in extracting useful and valuable knowledge and information from the mass of unstructured texts scattered throughout the Internet.

Language:
Persian
Published:
Journal of the Popularization of Science, Volume:12 Issue: 21, 2022
Pages:
156 to 171
magiran.com/p2455688  
دانلود و مطالعه متن این مقاله با یکی از روشهای زیر امکان پذیر است:
اشتراک شخصی
با عضویت و پرداخت آنلاین حق اشتراک یک‌ساله به مبلغ 1,390,000ريال می‌توانید 70 عنوان مطلب دانلود کنید!
اشتراک سازمانی
به کتابخانه دانشگاه یا محل کار خود پیشنهاد کنید تا اشتراک سازمانی این پایگاه را برای دسترسی نامحدود همه کاربران به متن مطالب تهیه نمایند!
توجه!
  • حق عضویت دریافتی صرف حمایت از نشریات عضو و نگهداری، تکمیل و توسعه مگیران می‌شود.
  • پرداخت حق اشتراک و دانلود مقالات اجازه بازنشر آن در سایر رسانه‌های چاپی و دیجیتال را به کاربر نمی‌دهد.
In order to view content subscription is required

Personal subscription
Subscribe magiran.com for 70 € euros via PayPal and download 70 articles during a year.
Organization subscription
Please contact us to subscribe your university or library for unlimited access!