The Architecture of Farsi Knowledge Graph System

Author(s):

MohamadBagher Sajadi , Behrouz Minaei Bidgoli*

Message:

Article Type:

Research/Original Article (دارای رتبه معتبر)

Abstract:

The knowledge graph plays an important role in the Semantic Web and Natural Language Processing (NLP) tools. There are many knowledge bases in different languages, however lack of Farsi-specific knowledge base appears some defects in research and industrial applications. In this study, the most comprehensive knowledge base in Farsi language is presented, which consists of more than 500K of entities and 7 million relations, which is accessible open source. Data is supplied 3 sources: Farsi Wikipedia and its structured data such as infobox, Web tables, relation extraction module. According to the semantic web, RDF data model and OWL2 ontology employed to implement the Farsi Knowledge Graph (FKG). Resources and their relations are stored in triple format, therefor access to the knowledge graph is provided by a SPARQL endpoint. An ontology, retrieved from DBpedia ontology, was developed and improved Based on resources of Farsi Wikipedia. Also, more than 8000 templates and properties of Wikipedia were mapped to the ontology automatically and manually. Furthermore, a part of the ontology was mapped to the FarsNet, the Persian WordNet, for research purposes. In the graph, there are a large amount of information on a variety of topics including famous people, important places, organizations and companies, literary and art works, physiology, biology, events, species, astronomy, etc. According to the Linked data, most of entities in the FKG have been connected to DBpedia and Wikidata resources by owl:sameAs. In order to achieve high performance and flexible data model, a two-level architecture for storing data was designed to separate data from metadata. This design plays a key role in update operation and managing versions. For evaluation purposes, a small part of triples were randomly collected to build a test dataset for manually inspection. Experimental results demonstrate that more than 94% of triples were obtained correctly through the process of extraction, conversion, mapping, transformation and store. Future of internet according to the semantic web will be a complex and huge global knowledge base, therefor the FKG can play a significant role in defining and developing this emerging technology.

Keywords:

Knowledge Base , RDF , Semantic Web , Farsi Language , Linked Data

Language:

Persian

Published:

Journal of Information Processing and Management, Volume:35 Issue: 2, 2020

Pages:

425 to 461

magiran.com/p2103540

دانلود و مطالعه متن این مقاله با یکی از روشهای زیر امکان پذیر است:

اشتراک شخصی

با عضویت و پرداخت آنلاین حق اشتراک یک‌ساله به مبلغ 1,390,000ريال می‌توانید 70 عنوان مطلب دانلود کنید!

اشتراک سازمانی

به کتابخانه دانشگاه یا محل کار خود پیشنهاد کنید تا اشتراک سازمانی این پایگاه را برای دسترسی نامحدود همه کاربران به متن مطالب تهیه نمایند!

اطلاعات بیشتر

توجه!

حق عضویت دریافتی صرف حمایت از نشریات عضو و نگهداری، تکمیل و توسعه مگیران می‌شود.
پرداخت حق اشتراک و دانلود مقالات اجازه بازنشر آن در سایر رسانه‌های چاپی و دیجیتال را به کاربر نمی‌دهد.

In order to view content subscription is required

Personal subscription

Subscribe magiran.com for 70 € euros via PayPal and download 70 articles during a year.

Organization subscription

Please contact us to subscribe your university or library for unlimited access!

More information

علمی مصوب

پژوهشنامه پردازش و مدیریت اطلاعات

Journal of Information Processing and Management

فصلنامه علوم انسانی

آخرین شماره | آرشیو

ISSN: 2251-8223 eISSN: 2251-8231

تا پاییز 1384 با نام «علوم اطلاع رسانی» منتشر شده است.

صاحب امتیاز:

پژوهشگاه علوم و فناوری اطلاعات ایران

مدیر مسئول:

دکتر محمد حسن زاده

سردبیر:

دکتر سید رحمت الله فتاحی

تلفن نشریه: ۰۲۱-۶۶۴۹۴۹۸۰

اطلاعات بیشتر نشریه

درباره نشریه پیام به نشریه سایت اختصاصی نشریه پذیرش الکترونیکی مقاله

به جمع مشترکان مگیران بپیوندید!

The Architecture of Farsi Knowledge Graph System

MohamadBagher Sajadi , Behrouz Minaei Bidgoli*

Knowledge Base , RDF , Semantic Web , Farsi Language , Linked Data

پژوهشنامه پردازش و مدیریت اطلاعات

Journal of Information Processing and Management