Split and rephrase: Simple Syntactic Sentences for NLP applications

Message:
Article Type:
Research/Original Article (بدون رتبه معتبر)
Abstract:

In today's world, simplifying compound and complex sentences into simple sentences is crucial for enhancing machine understanding in various natural language processing (NLP) tasks, such as inference, machine translation, and information extraction. This simplification process improves accuracy. Consequently, our research is inspired by a text simplification method called "split and rephrase." We introduce a new sequence-to-sequence text generation model that transforms complex sentences into simple ones based on the conjunction "and" in Persian. By utilizing linguistic models with millions or even billions of parameters, our approach facilitates a better understanding of text complexities and more accurate identification of breaking points. Our results show an output accuracy of 0.47 in the BLEU score for the generated simple sentences, which are both grammatically correct and fluent. By utilizing linguistic models with millions or even billions of parameters, our approach facilitates a better understanding of text complexities and more accurate identification of breaking points. Our results show an output accuracy of 0.47 in the BLEU score for the generated simple sentences, which are both grammatically correct and fluent.

Language:
English
Published:
Journal of Innovations in Computer Science and Engineering, Volume:2 Issue: 1, Winter and Spring 2024
Pages:
63 to 69
https://www.magiran.com/p2861719  
سامانه نویسندگان
  • Author (3)
    Ghasem Darzi
    (1393) دکتری الهیات، دانشگاه تهران
    Darzi، Ghasem
اطلاعات نویسنده(گان) توسط ایشان ثبت و تکمیل شده‌است. برای مشاهده مشخصات و فهرست همه مطالب، صفحه رزومه را ببینید.
مقالات دیگری از این نویسنده (گان)