HesNegar: Persian Sentiment WordNet
Author(s):
Article Type:
Research/Original Article (دارای رتبه معتبر)
Abstract:
Awareness of other's opinions plays a crucial role in the decision making process performed by simple customers to top-level executives of manufacturing companies and various organizations. Today, with the advent of Web 2.0 and the expansion of social networks, a vast number of texts related to people's opinions have been created. However, exploring the enormous amount of documents, various opinion sources and opposing opinions about an entity have made the process of extracting and analyzing opinions very difficult. Hence, there is a need for methods to explore and summarize the existing opinions. Accordingly, there has recently been a new trend in natural language processing science called "opinion mining". The main purpose of opinion mining is to extract and detect peoples positive or negative sentiments (sense of satisfaction) from text reviews. The absence of a comprehensive Persian sentiment lexicon is one of the main challenges of opinion mining in Persian.
In this paper, a new methodology for developing Persian Sentiment WordNet (HesNegar) is presented using various Persian and English resources. A corpus of Persian reviews developed for opinion mining studies are introduced. To develop HesNegar, a comprehensive Persian WordNet (FerdowsNet), with high recall and proper precision (based on Princeton WordNet), was first created. Then, the polarity of each synset in English SentiWordNet is mapped to the corresponding words in HesNegar. In the conducted tests, it was found that HesNegar has a precision score of 0.86 a recall score of 0.75 and it can be used as a comprehensive Persian SentiWordNet. The findings and developments made in this study could prove useful in the advancement of opinion mining research in Persian and other similar languages, such as Urdu and Arabic.
In this paper, a new methodology for developing Persian Sentiment WordNet (HesNegar) is presented using various Persian and English resources. A corpus of Persian reviews developed for opinion mining studies are introduced. To develop HesNegar, a comprehensive Persian WordNet (FerdowsNet), with high recall and proper precision (based on Princeton WordNet), was first created. Then, the polarity of each synset in English SentiWordNet is mapped to the corresponding words in HesNegar. In the conducted tests, it was found that HesNegar has a precision score of 0.86 a recall score of 0.75 and it can be used as a comprehensive Persian SentiWordNet. The findings and developments made in this study could prove useful in the advancement of opinion mining research in Persian and other similar languages, such as Urdu and Arabic.
Keywords:
Language:
Persian
Published:
Signal and Data Processing, Volume:15 Issue: 1, 2018
Pages:
71 to 86
magiran.com/p1837197
دانلود و مطالعه متن این مقاله با یکی از روشهای زیر امکان پذیر است:
اشتراک شخصی
با عضویت و پرداخت آنلاین حق اشتراک یکساله به مبلغ 1,390,000ريال میتوانید 70 عنوان مطلب دانلود کنید!
اشتراک سازمانی
به کتابخانه دانشگاه یا محل کار خود پیشنهاد کنید تا اشتراک سازمانی این پایگاه را برای دسترسی نامحدود همه کاربران به متن مطالب تهیه نمایند!
توجه!
- حق عضویت دریافتی صرف حمایت از نشریات عضو و نگهداری، تکمیل و توسعه مگیران میشود.
- پرداخت حق اشتراک و دانلود مقالات اجازه بازنشر آن در سایر رسانههای چاپی و دیجیتال را به کاربر نمیدهد.
In order to view content subscription is required
Personal subscription
Subscribe magiran.com for 70 € euros via PayPal and download 70 articles during a year.
Organization subscription
Please contact us to subscribe your university or library for unlimited access!