جستجوی مقالات مرتبط با کلیدواژه

web document retrieval

در نشریات گروه کتابداری و مدیریت اطلاعات

تکرار جستجوی کلیدواژه web document retrieval در نشریات گروه علوم انسانی

تکرار جستجوی کلیدواژه web document retrieval در مقالات مجلات علمی

انتخاب همه

استخراج ویژگیهای متنی و معنایی در یادگیری رتبهبندی جهت بازیابی اسناد وب

محدثه محجوب*، فائزه انسان، ساناز کشوری، پرستو جعفرزاده، محمدامین کیوان زاد

پژوهشنامه پردازش و مدیریت اطلاعات، سال سی و ششم شماره 4 (پیاپی 106، تابستان 1400)، صص 1081 -1112

با ظهور وب معنایی، تعریف و استفاده از ویژگی های معنایی در الگوریتم های یادگیری رتبه بندی هم مطرح شده است. یک چالش مهم در این زمینه عدم استفاده از ویژگی های جامع و همچنین، عدم ترکیب کامل از ویژگی های متنی و معنایی است. در این مقاله، با تعریف ویژگی های معنایی جدید در چهار دسته ویژگی های مبتنی بر گراف و پایگاه دانش، ویژگی های مبتنی بر تکرار موجودیت، ویژگی های مبتنی بر فیلدهای متنی، و ویژگی های مبتنی بر نمایش برداری کلمات و متون به این چالش پاسخ داده شده است. جهت ارزیابی از مجموعه داده MQ-2007 متعلق به LETOR4، که حاوی ویژگی های متنی آماده است، و شش الگوریتم یادگیری رتبه بندی استاندارد استفاده شده است. نتایج تجربی نشان می دهد که ویژگی های معنایی و نیز ترکیب آن ها با ویژگی های متنی باعث بهبود 50 درصدی نسبت به استفاده از تنها ویژگی های متنی می شوند. در انتها، از یک الگوریتم انتخاب ویژگی برای انتخاب بهترین ویژگی های معنایی استفاده شده که منجر به بهبود 7 درصدی نسبت به الگوریتم های رتبه بندی بدون انتخاب ویژگی شده است.

کلید واژگان: یادگیری رتبه بندی، ویژگی های معنایی، بازیابی اسناد وب، داده های پیوندی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Extraction of Effective Textual and Semantic Features in Learning to Rank for Web Document Retrieval

Mohaddeseh Mahjoob*, Faezeh Ensan, Sanaz Keshvari, Parastoo Jafarzadeh, Mohammadamin Keyvanzad

Journal of Information Processing and Management, Volume:36 Issue: 4, 2021, PP 1081 -1112

Ranking algorithms, as the core of web search systems, are responsible for finding and ranking the most relevant documents to user information needs from the crawled and indexed corpus. With the ever-increasing amount of available training data, ranking technologies are moving towards using Machine Learning methods, described as Learning to Rank algorithms. The basic Learning to Rank systems mainly have used textual features while ignoring semantic features. With the advent of Semantic Web, there is an emerging interest in developing and using semantic features for Learning to Rank systems. An important challenge is that there is currently no comprehensive study on the combined usage of textual and semantic features for Learning to Rank systems. In this paper, first, we define and implement four new sets of semantic features based on Knowledge Graph, Entity Repetition, Textual Fields and Vector Representation of Words and Texts. For experimental analysis, we used the MQ-2007 dataset from LETOR 4, which includes a set of textual features. The results of running six standard Learning to Rank Algorithms show that by using semantic features, either in isolation or in combination with textual features, significantly increases the performance. The increase in performance is even more significant when we limit the tests to hard queries. We also implemented an existing Feature Selection algorithm to test whether it can improve the results even further. The results showed improvements for some Learning to Rank algorithms, yet failed to improve on others.

Keywords: Learning to Rank, Semantic Features, Web Document Retrieval, Linked Data, Feature Selection

Abstract View Paper Research/Original Article Original: Persian

نکته

نتایج بر اساس تاریخ انتشار مرتب شده‌اند.
کلیدواژه مورد نظر شما تنها در فیلد کلیدواژگان مقالات جستجو شده‌است. به منظور حذف نتایج غیر مرتبط، جستجو تنها در مقالات مجلاتی انجام شده که با مجله ماخذ هم موضوع هستند.
در صورتی که می‌خواهید جستجو را در همه موضوعات و با شرایط دیگر تکرار کنید به صفحه جستجوی پیشرفته مجلات مراجعه کنید.

به جمع مشترکان مگیران بپیوندید!

web document retrieval