An Algorithm for Fuzzification of WordNets and its Application in Sentiment Analysis
Author(s):
Article Type:
Research/Original Article (دارای رتبه معتبر)
Abstract:
WordNet-like Lexical Databases (WLDs) group English words into sets of synonyms called “synsets.” Synsets are utilized for several applications in the field of text mining. However, they were also open to criticism because although, in theory, not all the members (i.e. word senses) of a synset represent the meaning of that synset with the same degree, in practice, in WLDs they are considered as members of the synset identically. Correspondingly, the fuzzy version of synonym sets, called fuzzy-synsets were proposed. But, to the best or our knowledge. In this study, we present an algorithm for constructing fuzzy version of WLDs of any language, given a corpus of documents and a word-sense-disambiguation system of that language. A theoretical proof is also proposed for the validity of results of the proposed algorithm. Then, inputting the open-American-online-corpus (OANC) and UKB word-sense-disambiguation to the algorithm, we construct and publish online the fuzzified version English WordNet (FWN), and apply them in a Sentiment Analysis problem.
Keywords:
Language:
Persian
Published:
Engineering Management and Soft Computing, Volume:9 Issue: 2, 2024
Pages:
119 to 131
https://www.magiran.com/p2716049
سامانه نویسندگان
مقالات دیگری از این نویسنده (گان)
-
Noor-Vajeh: A Benchmark Dataset for Keyword Extraction from Persian Papers
Mohammadamin Taheri*, Mohammadebrahim Shenassa, Behrouz Minaei-Bidgoli, Sayyed Ali Hossayni
Signal and Data Processing, -
A Benchmark for Analyzing Knowledge Graph Embedding for Link Prediction Problem in Low-Resource Languages
Najmeh Torabian, Behrooz Minaei-Bidgoli *, Mohsen Jahanshahi
Journal of Soft Computing and Information Technology,