Persian Slang Text Conversion to Formal and Deep Learning of Persian Short Texts on Social Media for Sentiment Classification
Author(s):
Article Type:
Research/Original Article (accredited ranking)
Abstract:
Background and Objectives
The lack of suitable tools for analyzing conversational Persian text has made many analyses of such text, including sentiment analysis, difficult. This research tries to make these texts easier for a machine to understand by providing PSC (Persian Slang Convertor), a tool that converts conversational text into formal text. It also aims to improve machine learning of sentiment in short Persian texts by combining PSC with current deep learning methods.
Methods
More than 10 million unlabeled texts from various social networks and movie subtitles (as conversational texts) and about 10 million news texts (as formal texts) were used to train the unsupervised models and to implement the formalization tool. 60,000 comments from Instagram users, labeled positive, negative, or neutral, served as supervised data for training the sentiment classification model for short texts. Methods such as LSTM, CNN, BERT, and ELMo were used, along with deep learning techniques such as learning rate decay, regularization, and dropout. The best accuracy was achieved with LSTM.
Results
Using the formalization tool, 57% of the words in the conversational corpus were converted. With the formalizer, a FastText model, and a deep LSTM network, an accuracy of 81.91% was obtained on the test data.
Conclusion
In this research, models were pre-trained on unlabeled data, and in some cases existing pre-trained models such as ParsBERT were used. A model was then implemented to classify the sentiment of short Persian texts using labeled data.
Keywords:
Language:
English
Published:
Journal of Electrical and Computer Engineering Innovations, Volume:13 Issue: 1, Winter-Spring 2025
Pages:
27 to 42
https://www.magiran.com/p2817456
Other articles by the author(s)
-
An Effective Model for Ontology Relations Efficacy on Stock prices: A Case Study of the Persian Stock Market
Mohammadhossein Samani, *
Journal of Information Technology Management, Summer 2024
-
A DSR Approach to predict liquidity risk using CNN and Sentiment Analysis
Hamed Mirashk, *, Mehrdad Kargari, Mohammadali Rastegar Sorkhe, Mohammad Talebi
Management Research in Iran,