Magiran | نشریه ماشین بینایی و پردازش تصویر، سال هشتم شماره 3 (پاییز 1400)

انتخاب همه

ارائه یک توصیفگر عمومی مبتنی بر الگوی دودویی محلی وزندار به منظور بازیابی تصاویرمادون قرمز

محمدحسین شکور* صفحات 1-12

یکی از توصیفگرهای مهم بافتی تحت عنوان الگوی دودویی محلی می باشد. الگوی دودویی محلی در استخراج ویژگیهای بافتی کاربرد زیادی دارد. در این مقاله نوعی از الگوی دودویی محلی ارایه شده است که از آن نه به عنوان توصیفگر بافت، بلکه به عنوان یک توصیفگر عمومی استفاده شده است. یعنی همانند توصیفگر تبدیل مستقل از مقیاس ویژگی (SIFT) عمل می کند. روش پیشنهادی بر اساس میزان اختلاف نقطه مرکزی با نقاط همسایگی وزن های لازم را تولید می کند و این مقادیر وزن در مقادیر الگوی دودویی ضرب شده و استفاده می شود. ضمن اینکه بجای الگوی دودویی محلی برای کاهش تعداد ویژگیها از الگوی دودویی محلی متقارن مرکزی CS-LBP استفاده شده است. مزیت روش پیشنهادی نسبت به SIFT سرعت بالای آن است. ضمن اینکه بجای تصاویر عادی از تصاویر دوربین های مادون قرمز استفاده شده است. این تصاویر نسبت به تصاویر معمول دارای لبه هایی تارتر هستند و بازیابی آنها با توصیفگرهای عمومی عملا نتایج خوبی تولید نمی کند. نتایج پیاده سازی نشان می دهد که روش پیشنهادی هم از لحاظ سرعت و هم از لحاظ دقت نسبت به توصیفگرهای عمومی مانندSIFT بهتر عمل می کند ضمن اینکه از روش الگوی دودویی محلی متقارن مرکزی هم دقت بالاتری دارد.

کلیدواژگان: الگوی دودویی محلی، تبدیل مستقل از مقیاس ویژگی، بازیابی تصاویر، تصاویرمادون قرمز

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی
ارائه یک توصیفگر جدید به منظور استفاده در سیستم های شناسایی هویت زیستی مبتنی بر ویژگی های بهینه شده چروکیدگی انگشتان دست و ناخن

حدیث حیدری، عبدالله چاله چاله* صفحات 13-25

در این مقاله یک توصیفگر بافتی جدید به منظور استفاده در سیستم های شناسایی هویت زیستی مبتنی بر مشخصه ی بافت پشت دست ارایه می شود. ابتدا تصاویر دست افراد مختلف توسط یک وسیله اخذ تصاویر جمع آوری، سپس با نگاشت تصاویر از فضای رنگی RGB به YUV و آستانه گیری مولفه U، پیکسل های پوست تشخیص داده شده و پنج انگشت از تصویر مجزا می گردند. سپس با استفاده از مرکز ثقل، طول و جهت انگشتان، نواحی چروکیدگی انگشتان و ناخن استخراج می شوند. ویژگی های بافتی چروکیدگی انگشتان با استفاده از توصیفگر پیشنهادی الگوهای چهارجهتی مبتنی بر آستانه گیری آماری و ویژگی های ناخن نیز با استفاده از تبدیل موجک گسسته تا سه سطح استخراج می شوند. به علاوه، با تعیین یک تابع هزینه در الگوریتم تکاملی بهینه سازی دسته ذرات، مجموعه ای از ضرایب اهمیت به ویژگی های بافتی دست اعمال شده و درنتیجه این ویژگی ها بهینه می گردند. در نهایت با بهره گیری از معیار اندازه گیری فاصله d1میزان مشابهت تصاویر محاسبه و هویت اشخاص با دقت 13/92% شناسایی شده است. نتایج آزمایش ها روی مجموعه تصاویر تهیه شده توسط نگارندگان با نام FKP_Nail، نشان می دهد که روش ارایه شده در مقایسه با روش های شناخته شده موجود دقت و سرعت بهتری داشته و از محاسبات ساده تری نیز استفاده می کند.

کلیدواژگان: الگوهای چهارجهتی، توصیفگر بافتی، چروکیدگی انگشتان دست، ناخن، بهینه سازی دسته ذرات

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی
یک روش کارا برای غربالگری اولیه بیماری گلوکوم بر اساس محاسبه نسبت کاپ به دیسک نوری با استفاده از شبکه های عصبی کانولوشنی

فاطمه معادی، ندا فرجی*، محمدرضا حسن نژاد بی بالان صفحات 27-43

تشخیص و درمان به موقع بیماری گلوکوم می تواند از پیشروی این بیماری و از دست دادن ناگهانی بینایی جلوگیری کند. گلوکوم دیسک و کاپ نوری را که درون دیسک قرار دارد تحت تاثیر قرار می دهد. در این مقاله ابتدا به محلی سازی دیسک نوری پرداخته شده و سپس قطعه بندی دیسک و کاپ نوری به منظور تشخیص براساس نسبت کاپ به دیسک نوری (CDR) انجام می شود. برای مرحله محلی سازی دیسک از شبکه عصبی کانولوشنی مبتنی بر ناحیه سریعتر (Faster-RCNN) با شبکه پیش آموزش دیده ResNet50 استفاده می شود. مرحله قطعه بندی توسط معماری U-Net اصلاح شده با استفاده از شبکه SE-ResNet50 در قسمت کدگذار آن اجرا شده و در نهایت CDR مورد ارزیابی قرار می گیرد. پایگاه های داده Drishti-GS1 و RIM-ONE v3 برای آموزش و تست روش پیشنهادی و دسته داده MESSIDOR صرفا در مرحله تست بکار رفته است. همچنین به منظور قطعه بندی کاپ و دیسک نوری دو رویکرد برای درنظرگرفتن حاشیه نویسی کاپ و دیسک نوری در حقیقت مبنای دسته داده Drishti-GS1 پیشنهاد می شود. در رویکرد پیشنهادی دوم و طبق معیار F1-score، نتیجه قطعه بندی کاپ و دیسک نوری برای دسته داده Drishti-GS1 به ترتیب 0.93 و 0.97، برای دسته داده RIM-ONE v3 به ترتیب 0.79 و 0.95 و برای دسته داده MESSIDOR به ترتیب 0.84 و 0.93 بدست آمده که قابل رقابت با سایر کارها است.
کلیدواژگان: یادگیری عمیق، شبکه های عصبی کانولوشنی، گلوکوم، نسبت کاپ به دیسک نوری، قطعه بندی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی
تشخیص مکان و زاویه گل‏ های زعفران در تصویر با استفاده از شبکه‏ های همگشتی عمیق

محمدرضا محمدی*، مهرداد کرمی صفحات 45-55

تشخیص خودکار مکان و زاویه برش گل ‏های زعفران یک گام اساسی برای فرآوری خودکار گل زعفران است. با توجه به تنوع موجود میان گل‏ های زعفران، الگوریتم‏ های سنتی نمی‏ توانند دقت قابل قبول را نتیجه دهند. در این مقاله، یک الگوریتم مبتنی بر شبکه‏ های همگشتی عمیق برای حل این مسئله پیشنهاد شده است. در معماری پیشنهادی، تصویر به تعدادی ناحیه مربعی کوچک تقسیم می‏ شود و در هر ناحیه وجود محل برش گل زعفران تخمین زده می‏ شود. همچنین، محل نسبی مناسب برای برش گل زعفران نسبت به مرکز ناحیه تخمین زده می‏ شود. برای تخمین زاویه، ابتدا محدوده 360 درجه به تعدادی قسمت تقسیم می‏ شود و زاویه گل به یکی از مقادیر اختصاص می‏ یابد. سپس، زاویه نسبی نسبت به مرکز ناحیه تخمین زده می‏ شود. به منظور ارزیابی عملکرد الگوریتم پیشنهادی، یک پایگاه داده شامل 163 تصویر و 3035 گل جمع‏ آوری شده است و پارامترهای هر گل توسط یک خبره مشخص شده است. ارزیابی الگوریتم پیشنهادی نشان می‏ دهد که پارامترهای بیش از 95% گل‏ها به درستی تخمین زده شده‏ اند.

کلیدواژگان: گل زعفران، یادگیری عمیق، شبکه‏های همگشتی عمیق، تشخیص هدف

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی
یادگیری غیرنظارتی ویژگی برای ارزیابی کیفیت کور تصاویر فراتفکیک پذیر

مریم کریمی*، منصور نجاتی صفحات 57-67

افزایش تفکیک پذیری تصاویر، یک مسئله کلاسیک در زمینه پردازش تصویر است که هدف آن ایجاد تصاویر با تفکیک پذیری بالا از تصویر با تفکیک پذیری پایین است. الگوریتم های بسیاری در این زمینه تاکنون ارایه شده اند. با این حال، ارزیابی موثر کیفیت این نوع تصاویر همچنان بعنوان یک زمینه تحقیقاتی چالش برانگیز باقی مانده است. روش های معمول ارزیابی کیفیت تصویر، همخوانی کافی با معیارهای ادراکی ندارند. لذا ارایه روش های ارزیابی کیفیت خاص تصاویر فراتفکیک پذیر، اهمیت بالایی دارد. در این مقاله ما یک روش ارزیابی کیفیت بدون مرجع برای تصاویر فراتفکیک پذیر ارایه می کنیم که با یادگیری بدون برچسب یک کتاب اتم ها روی تصاویر مرجع با تفکیک پذیری بالا و بازنمایی بلوک های تصاویر فراتفکیک پذیر با این کتاب، ویژگی های محلی تولید می نماید که قادرند تخریب های ناشی از افزایش تفکیک پذیری را به خوبی توصیف نمایند. این ویژگی ها با یک رویکرد مناسب هرمی، تلفیق شده و بردار ویژگی سراسری از تصویر تولید می کنند. این بردارها و امتیازات کیفیت چشمی در نهایت، جهت آموزش یک مدل رگرسیون مورد استفاده قرار می گیرند. نتایج تجربی نشان می دهند که این روش در عین سادگی، سرعت و عدم نیاز به حجم بالای داده آموزشی، کارایی بهتری نسبت به روش های موجود دارد.
کلیدواژگان: ارزیابی کیفیت تصاویر فراتفکیک پذیر، افزایش تفکیک پذیری، ارزیابی کیفیت کور، بازنمایی تنک، یادگیری کتاب اتم ها، یادگیری غیرنظارتی ویژگی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی
استفاده از میدان های عصبی پویا در یک الگوریتم بینایی ماشین به منظور ردگیری همزمان اهداف

شیوا کامکار*، حمید ابریشمی مقدم، رضا لشگری صفحات 69-81

ردگیری همزمان چند شیء یکی از موضوعات مهم در زمینه بینایی ماشین و پایه کاربردهای بسیاری مانند انواع سامانه های نظارتی انسانی و حیوانی است. از اصلی ترین چالش های الگوریتم های این حوزه، مرتبط سازی داده ها به ویژه هنگام بروز انسداد می باشد. از آنجا که انسان ها عملکرد مناسبی در مواجهه با این چالش دارند، انتظار می رود استفاده از الگوریتم های ملهم از مغز در ردگیری همزمان اهداف بتواند به بهبود کارایی این گونه از سامانه ها منجر شود. میدان های عصبی پویا سازگاری بالایی با عملکرد نورونی و شناختی مغز انسان دارند. ما در این مقاله از این میدان ها بهره بردیم تا مانند حافظه کاری انسان در نگهداری و پردازش اطلاعاتی مانند مکان اشیاء، جهت و سرعت آن ها ایفای نقش کنند. استخراج این اطلاعات با استفاده از شگردهای بینایی ماشین صورت می گیرد. ارزیابی این روش با مقایسه عملکرد آن با روش های اخیرا توسعه داده شده روی مجموعه داده حاوی ویدیوهای ضبط شده از حرکات آزادانه لاروهای قزل آلا که در دسترس همگان قرار دارد انجام شده است. نتایج نشان داد که روش پیشنهادی نه تنها از روش های رقیب عملکرد بهتری دارد، بلکه تقریبا در همه موارد به ویژه بعد از انسداد قادر است مرتبط سازی داده ها را به درستی انجام دهد.
کلیدواژگان: ردگیری همزمان اشیاء، میدان های عصبی پویا، مرتبط سازی داده ها، انسداد، ویدیو های میکروسکوپی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

انتخاب همه

A general descriptor based-on weighted local binary pattern forinfrared images retrieval

MohammadHossein Shakoor * Pages 1-12

Local Binary Pattern (LBP) is one of popular texture images descriptors. It is used to extraction features of texture images for classification. However, in this paper a new version of LBP is proposed that is not used for this purpose. The proposed descriptor is used for image retrieval. In other words it can be compare to Scale Invariant Features Transform (SIFT). In term of speed, the proposed method outperforms the SIFT. In addition, in this paper it is used for Infrared images retrieval. These images are low quality and low contrast and if they are used without any pre processing and enhancement, SIFT descriptor versions cannot extract good features of them. The proposed method can provide better result with higher speed. The proposed method have been compared to center symmetric LBP (CS-LBP) and SIFT by using both infrared images and standard image data-sets.

Keywords: Local binary pattern, infrared images, Image Retrieval

Abstract View Paper Research/Original Article Original: Persian
A new descriptor for biometric user authentication systems based on optimized features of finger knuckle print and fingernail

Hadis Heidari, Abdolah Chalechale * Pages 13-25

In this paper a new texture descriptor is presented for biometric authentication systems based on hand dorsal texture characteristics. First, different hand images are collected using an acquisition device, then the images are mapped from RGB color space to YUV and the five fingers are isolated from images with thresholding of the U component for skin detection. Utilizing the center of gravity, length and direction of the fingers, the finger knuckle print and nail regions are determined. Then, the texture features of finger knuckle are described using four directional patterns based on statistical thresholding. Moreover, the features of nails are described using discrete wavelet transform. In addition, utilizing a cost function in the evolutionary particle swarm optimization algorithm, a set of significance coefficients is applied to the textural features of the hand, and as a result, these features are optimized. Finally, using the d1 distance measurement criterion, the similarity of the images has been calculated and the identity of individuals has been identified with the recognition rate of 92.13%. The experimental results on a set of images collected by the authors, called FKP_Nail, show that the proposed method is more accurate and faster than known existing methods while it uses simpler calculations.

Keywords: Four directional patterns, texture descriptor, finger knuckle print, Nail, Particle Swarm Optimization (PSO)

Abstract View Paper Research/Original Article Original: Persian
An Efficient Method for Early Glaucoma Screening based on Calculating Optic Cup to Disc Ratio Using Convolutional Neural Networks

Fatemeh Maadi, Neda Faraji *, Mohammadreza Hassannejad Bibalan Pages 27-43

Early diagnosis and treatment of glaucoma can prevent the progression of this disease and the sudden loss of vision. Glaucoma affects the optic disc and optic cup located inside the optic disc. In this paper, first, the optic disc is localized and then segmentation of optic disc and cup is performed to diagnose based on the optic cup to disc ratio (CDR). A Faster Region-based Convolutional Neural Network (Faster-RCNN) with the pre-trained ResNet50 network is used for the optic disc localization step. The segmentation step is performed by the modified U-Net architecture using the SE-ResNet50 network as its encoding layers, and finally CDR is evaluated. The Drishti-GS1 and RIM-ONE v3 databases are used to train and test the proposed method and the MESSIDOR database is only used in the test phase. In addition, for segmentation of optic disc and cup, two approaches are proposed to consider the optic disc and cup annotations in the Drishti-GS1 data set’s ground truth. In the second proposed approach and according to the F1-score criteria, the result of optic cup and disc segmentation for Drishti-GS1 data set is 0.93 and 0.97, respectively, for RIM-ONE v3 data set is 0.79 and 0.95, respectively, and for MESSIDOR data set is 0.84 and 0.93, respectively, which is competitive with other works.
Keywords: Deep Learning, Convolutional Neural Networks, Glaucoma, optic cup to disc ratio, Segmentation

Abstract View Paper Research/Original Article Original: Persian
Detection of the Location and Angle of Saffron Flowers using Deep Convolutional Networks

MohammadReza Mohammadi *, Mehrdad Karami Pages 45-55

Automatic detection of the angle and position of saffron flowers is an essential step for automatic processing of saffron flowers. Due to the diversity of saffron flowers, traditional algorithms can not yield acceptable accuracy. In this paper, an algorithm based on the deep convolutional networks is proposed. In the proposed architecture, the image is divided into a number of small square areas, and in each region, the existence of the flower is estimated. Also, the relative location suitable for cutting the saffron flower is estimated relative to the center of the region. To estimate the angle, the 360-degree range is first divided into several parts and the angle of the flower is classified to one of the values. Then, the relative angle with the center of the region is estimated. In order to evaluate the performance of the proposed algorithm, a dataset of 163 images and 3035 flowers has been collected, and the parameters for each flower are annotated by an expert. The evaluation of the proposed algorithm shows that more than 95% of the flowers are correctly estimated.

Abstract View Paper Research/Original Article Original: Persian
Unsupervised Feature Learning for Blind Quality Assessment of Super-Resolved Images

Maryam Karimi *, Mansour Nejati Pages 57-67

Image super-resolution is a classic image processing issue that aims to create high-resolution images from low-resolution images. Although many algorithms in this field have been proposed so far, effective quality evaluationof such images remains a challenging research area. Conventional image quality assessment measures are not sufficiently consistent with subjective judgments on these images. Therefore, it is very important to provide specific quality assessment methods for image super-resolution. In this paper, we propose a no-reference quality evaluation method for super-resolved images that, by learning a dictionary on high-resolution images and representing super-resolved blocks, produce local features that can describe super-resolution degradations well. These features are pooled togetherby a suitable pyramidal approach and produce a global feature vector of the image. These vectors and subjective quality scores are ultimately used to train a regression model. Experimental results show that this method is not only simple and high speed, but also does not require large volumes of training data and is more efficient than existing methods.
Keywords: image quality assessment (IQA), Super-resolution image quality assessment (SRIQA), Blind image quality assessment, Sparse representation, Dictionary Learning, Unsupervised feature learning, Human visual system (HVS)

Abstract View Paper Research/Original Article Original: Persian
A Computer Vision Algorithm based on Dynamic Neural Fields for Multiple-Object Tracking

Shiva Kamkar *, Hamid Abrishami Moghaddam, Reza Lashgari Pages 69-81

Tracking multiple objects (MOT) is an important topic in almost all computer vision-related areas. One of the most vital challenges in front of MOT’s algorithms is data association, particularly when partial or complete occlusions occur. On the other hand, the human can handle this challenge in everyday scenarios for example while driving a car on a highway or riding a bicycle. Accordingly, we used a brain-inspired method to propose an MOT algorithm that can overcome the above challenge. The proposed method is based on dynamic neural field as a brain-inspired algorithm that can mimic both neural and cognitive functions of the brain. Besides, we benefited from computer vision techniques to find targets and extract features such as their locations, directions, and velocities. We applied our method on an online dataset containing videos recorded from natural movements of zebrafish larvae. Evaluation results using two metrics MOTA and MOTP showed that the proposed method has a promising performance compared to the state-of-the-art algorithms. It can associate all information correctly both in the presence and absence of occlusion events.
Keywords: multiple object tracking, dynamic neural fields, data association, occlusion, microscopic videos

Abstract View Paper Research/Original Article Original: Persian

به جمع مشترکان مگیران بپیوندید!

فهرست مطالب

نشریه ماشین بینایی و پردازش تصویر
سال هشتم شماره 3 (پاییز 1400)

نشریه ماشین بینایی و پردازش تصویر

Machine Vision and Image Processing

به جمع مشترکان مگیران بپیوندید!

فهرست مطالب

نشریه ماشین بینایی و پردازش تصویر سال هشتم شماره 3 (پاییز 1400)

نشریه ماشین بینایی و پردازش تصویر

Machine Vision and Image Processing

نشریه ماشین بینایی و پردازش تصویر
سال هشتم شماره 3 (پاییز 1400)