جستجوی مقالات مرتبط با کلیدواژه

sparse coding

در نشریات گروه فنی و مهندسی

انتخاب همه

پردازش تصویر با استفاده از کدگذاری تنک و طبقه بندی انطباقی

فریماه شرافتی، جعفر طهمورث نژاد*

نشریه مهندسی برق، سال پنجاهم شماره 4 (پیاپی 94، زمستان 1399)، صص 1683 -1696

به دلیل افزایش حجم تصاویر تولیدشده توسط دوربین ها و دستگاه های مختلف، پردازش تصویر در بسیاری از کاربردها ازجمله پزشکی، امنیتی و رانندگی اهمیت و جایگاه بالایی یافته است. بااین حال بیشتر مدل های ایجادشده در حوزه پردازش تصویر کارایی چندانی نداشته و میزان خطای آن ها در برخی کاربردها تاثیرگذار است. علت اصلی ناکامی بیشتر مدل های ساخته شده، اختلاف توزیع بین داده های آموزشی (دامنه منبع) و داده های تست (دامنه هدف) می باشد. درواقع، مدل ساخته شده، قابلیت تعمیم دهی به داده هایی با خصوصیات و توزیع های متفاوت از داده های آموزشی را ندارد، به همین دلیل در مواجهه با داده های جدید دچار افت شدیدی می شود. در این مقاله ما یک روش جدید با نام کدگذاری تنک و طبقه بندی انطباقی (SADA) پیشنهاد می دهیم که یک مدل پردازش تصویری ایجاد می کند که در مقابل تغییرات داده ای مقاوم می باشد. مدل پیشنهادی با ایجاد یک زیر فضای مشترک بین دامنه های منبع و هدف اختلاف توزیع آن ها را به حداقل رسانده و موجب بهبود کارایی می شود. همچنین SADA با انتخاب نمونه هایی از دامنه منبع که با دامنه هدف مرتبط می باشند اختلاف توزیع بین دامنه ها را کاهش می دهد. علاوه بر آن، SADA با تطبیق پارامترهای مدل ایجادشده، یک مدل تطبیق پذیر برای مواجهه با شیفت داده ها ایجاد می کند. نتایج به دست آمده از آزمایش های متنوع، نشان می دهد که روش پیشنهادی ما، برتری قابل ملاحظه ای نسبت به تمام روش های تطبیق دامنه جدید دارد.

کلید واژگان: پردازش تصویر، تطبیق دامنه های بصری، کدگذاری تنک، وزن دهی مجدد نمونه، طبقه بندی انطباقی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Image Processing via Sparse Coding and Adaptive Classification

F. Sherafati, J. Tahmoresnezhad *

Journal of Electrical Engineering, Volume:50 Issue: 4, 2021, PP 1683 -1696

Due to the growing increase of generated images via cameras and various instruments, image processing has found an important role in most of practical usages including medical, security and driving. However, most of the available models has no considerable performance and in some usages the amount of error is very effective. The main cause of this failure in most of available models is the distribution mismatch across the source and target domains. In fact, the made model has no generalization to test data with different properties and distribution compared to the source data, and its performance degrades dramatically to face with new data. In this paper, we propose a novel approach entitled Sparse coding and ADAptive classification (SADA) which is robust against data drift across domains. The proposed model reduces the distribution difference across domains via generating a common subspace between the source and target domains and increases the performance of model. Also, SADA reduces the distribution mismatch across domains via the selection of the source samples which are related to target samples. Moreover, SADA adapts the model parameters to build an adaptive model to encounter with data drift. Our variety of experiments demonstrate that the proposed approach outperforms all stat-of-the-art domain adaptation methods.

Keywords: Image processing, visual domains adaptation, sparse coding, sample reweighting, adaptive classification

Abstract View Paper Research/Original Article Original: Persian
ارائه یک روش جدید فشرده ‏سازی تصاویر چهره با استفاده از نمایش تنک سیگنال‏ و الگوریتم یادگیری دیکشنری RLS-DLA

امیرمسعود طاهری، همایون مهدوی نسب*

نشریه مهندسی مخابرات جنوب، پیاپی 37 (پاییز 1399)، صص 27 -42

رشد فناوری و افزایش تصاعدی اطلاعات نیاز به حجم ذخیره سازی بیشتر داده های اطلاعاتی گوناگون را افزایش داده است. در این راستا فشرده سازی تصویر به عنوان ابزاری کارآمد جهت کاهش افزونگی و صرفه جویی در حجم ذخیره سازی و کاهش پهنای باند انتقالی داده های تصویری به کار می آید. هنگامی که فشرده سازی یک دسته یا خانواده از تصاویر، مانند پایگاه داده تصاویر چهره یک سازمان یا موسسه یا پایگاه داده MRI یک بیمارستان بزرگ یا پایگاه داده اثر انگشت مدنظر باشد افزونگی اطلاعات افزایش یافته و فشرده سازی اهمیت و الزام بیشتری پیدا می کند. در این میان تصاویر چهره با توجه به کاربرد وسیعی که به عنوان رایج ترین تصاویر پایگاه داده سازمان های و نهادهای مختلف مانند اداره های پلیس، نهادهای نظامی، دانشگاه ها و شرکت های بزرگ دارند مورد توجه بیشتری قرار دارند. به همین خاطر ارایه الگوریتمی که بتواند این دسته از تصاویر را با کیفیت بیشتر و نرخ بالاتری فشرده کند اهمیت بسیاری دارد. در این مقاله با استفاده از حوزه جدیدی از پردازش سیگنال به نام نمایش تنک و روش یادگیری دیکشنری RLS-DLA الگوریتم جدیدی برای فشرده سازی تصویر ارایه شده است که می تواند برای فشرده سازی پایگاه داده تصاویر به کار رود. در این الگوریتم تصاویر با به کارگیری چند دیکشنری به نحو وفقی بر اساس کیفیت بازسازی مورد نیاز آن ها فشرده می شوند. نتایج به دست آمده از الگوریتم پیشنهادی نشان دهنده عملکرد موثر و برتری معنی دار آن نسبت به روش های پیشرفته و مطرحی همچون JPEG2000 است به طوری که به افزایش کیفیتی در حدود 0.5 dB تا 1.2 dB در نرخ بیت یکسان دست می یابد.

کلید واژگان: نمایش تنک، کدگذاری تنک، فشرده‏ سازی تصویر، یادگیری دیکشنری، RLS-DLA، JPEG2000

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

A novel face images compression scheme using sparse signal representation and RLS_DLA dictionary learning algorithm

AmirMasoud Taheri, Homayoun Mahdavi-Nasab *

Journal of Southern Communication Engineering, Volume:10 Issue: 37, 2020, PP 27 -42

Due to the rapid growth of information technology and exponential increasing of information the need for more and more storage capacity and efficiency has increased. Image compression is an important tool to reduce the redundancy of images data in order to be able to store or transmit them in an efficient manner. When images are limited to a specific and limited family of images like MRI databases of a hospital or facial image database of a university or an organization or fingerprint image databases, this limitation increases the total spatial redundancy. Thus, efficient storage of such images is beneficial, and their compression becomes an appealing application, and this urges algorithms specially tailored for the task of content base image compression to surpass general purpose compression algorithms. The facial images, due to their wide application as the most common images in the organizations and companies are more considerable for image compression. In this paper a new image compression scheme using sparse coding and RLS-DLA redundant dictionary learning is proposed that can be used for compressing of face image databases. In the proposed method, several dictionaries are exploited adaptively based on the required image quality to enhance the overall rate-distortion. The simulation results show that this scheme outperforms the state-of-art algorithms like JPEG2000 by about 0.5 to 1.2 dB for reconstructed images PSNR.

Keywords: sparse representation, sparse coding, Image compression, dictionary learning, JPEG2000

Abstract View Paper Research/Original Article Original: Persian
حداقل سازی خطای پیش بینی مدل های طبقه بندی تصویر با استفاده از کدگذاری تنک و تطبیق دامنه

فریماه شرافتی، جعفر طهمورث نژاد*

نشریه مهندسی برق، سال پنجاهم شماره 1 (پیاپی 91، بهار 1399)، صص 231 -242

تطبیق دامنه می تواند دانش را از یک مجموعه آموزشی (دامنه منبع) به یک مجموعه آزمایشی (دامنه هدف) انتقال دهد تا بازدهی مدل یادگرفته شده از داده های آموزشی افزایش یابد. علاوه بر این، استفاده از کدگذاری تنک، مدل یادگرفته شده را بسیار مختصر نموده و کنترل آن را ساده می نماید. با این حال، اختلاف توزیع بین دامنه های منبع و هدف بازدهی مدل را کاهش می دهد. در این مقاله، ما یک مدل تطبیق دامنه بدون نظارت پیشنهاد می دهیم تا خطای پیش بینی مدل های طبقه بندی تصاویر را کاهش دهیم. از وزن دهی مجدد نمونه ها برای مدیریت داده های اضافه و اطلاعات بلااستفاده داده های منبع در نمایش جدید استفاده می شود. همچنین، اختلاف توزیع شرطی بین دامنه های منبع و هدف با استفاده از روی هم گذاری زیرفضاها کاهش داده می شود. روش پیشنهادی ما یک طبقه بند مستقل از دامنه تنک در زیرفضای به دست آمده می باشد که ساختار داده های ورودی را حفظ می کند. آزمایشات گسترده نشان می دهد که روش پیشنهادی ما بر روی پایگاه داده های واقعی در مقایسه با روش های به روز در حوزه یادگیری ماشین و تطبیق دامنه، 49/4% بهبود در صحت طبقه بندی دارد.

کلید واژگان: پردازش تصویر، تطبیق دامنه، کدگذاری تنک، اختلاف توزیع شرطی، تطبیق زیرفضا

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Prediction Error Minimization of Image Classification Models via Sparse Coding and Domain Adaptation

F. Sherafati, J. Tahmoresnezhad*

Journal of Electrical Engineering, Volume:50 Issue: 1, 2020, PP 231 -242

Domain adaptation can transfer knowledge from a training set (source domain) to a test set (target domain), promoting the performance of the model learned from the training set. In addition, sparse coding makes the learned model more succinct and easy to manipulate. However, the existence of the distribution mismatch across the source and target domains reduce the performance of model. In this paper, we propose an unsupervised domain adaptation model to minimize the prediction error of image classification. Sample reweighting is utilized to handle redundant and useless information of source data in the new representation. Moreover, the difference of the conditional distributions across the source and target domains is reduced along with the subspace alignment. Our proposed approach learns a sparse domain-invariant classifier in a latent subspace with preserving the structure of the input data. Extensive experiments demonstrate that our proposed approach shows 4.49% improvement in classification accuracy on real-world datasets compared to state-of-the-art machine learning and domain adaptation methods.

Keywords: Image processing, domain adaptation, sparse coding, conditional distribution difference, subspace alignment

Abstract View Paper Research/Original Article Original: Persian
تخمین حالت سه بعدی بدن انسان از یک تصویر بوسیله شبکه عصبی کانولوشن و کدگذاری و بازنمایی تنک با رویکرد مبتنی بر مدل

حسن علی کرمی، فرزین یغمایی*، محمدجواد فدایی اسلام

نشریه ماشین بینایی و پردازش تصویر، سال ششم شماره 2 (پاییز و زمستان 1398)، صص 27 -41

در زمینه تخمین و ساخت اسکلت حالت سه بعدی بدن انسان از طریق بندهای بدن (body joints) بوسیله یک تصویر دوبعدی، چالش های عمق و خودانسدادی وجود دارد که مانع از تخمین دقیق می گردد. در این مقاله به تخمین حالت سه بعدی بدن انسان با دو رویکرد مختلف پرداخته شده است. بدین منظور، رویکرد اول پیشنهادی با تمرکز بر عمق حالت دوبعدی حقیقت اصلی بوسیله کدگذاری و بازنمایی تنک و تصحیح گر مبتنی بر مدل، حالت سه بعدی بدن انسان استخراج می شود. در رویکرد دوم پیشنهادی به کمک روش مبتنی بر یادگیری شبکه های عصبی کانولوشن، تخمین حالت دوبعدی بدن انسان بدست می آید، سپس بوسیله کدگذاری و بازنمایی تنک و تصحیح گر مبتنی بر مدل، تخمین عمق حالت استخراج می شود. نتایج حاصل از این روش، برتری تخمین حالت و عمق سه بعدی بدن انسان را نسبت به رویکردهای پیشین نشان می دهد. تخمین حالت های سه بعدی انجام شده در روش پیشنهادی نشان می دهد میانگین خطای بازسازی نسبت به کارهای مشابه کاهش قابل توجهی داشته است

کلید واژگان: شبکه های عصبی کانولوشنی، کدگذاری و بازنمایی تنک، اسکلت حالت سه بعدی بدن انسان، تخمین حالت سه بعدی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

3D Human Pose Estimation on a 2D Image using Convolutional Neural Networks and Sparse Coding

Hassan Alikarami, Farzin Yaghmaee *, Mohammad Javad Fadaiee Eslam

Machine Vision and Image Processing, Volume:6 Issue: 2, 2020, PP 27 -41

There are challenges such as depth perception and self-occlusion, in the field of 3D human pose estimation and reconstruction which obstructs precise estimation of body joints. In this paper, we first extract human pose by focusing on 2D ground-truth using sparse coding and. In the second approach, we use a learning-based Convolutional Neural Networks using sparse coding and a model based rectifier to extract the estimated pose. Pose estimation by proposedmethod has reduced the mean error of the reconstruction in comparison with the state of the artworks.

Keywords: Convolutional Neural Networks, Sparse Coding, Representation, 3D Pose Skeleton, 3D Pose Estimation

Abstract View Paper Research/Original Article Original: Persian
روشی نوین برای توصیف و دسته بندی تصاویر بافتی رنگی با استفاده از کدگذاری تنک ویژگی های چهارگانی

هادی هادی زاده*

نشریه ماشین بینایی و پردازش تصویر، سال ششم شماره 2 (پاییز و زمستان 1398)، صص 147 -158

رنگ و بافت دو مولفه بسیار مهم در تشخیص و تمایز بین اشیاء مختلف در دنیای واقعی می باشند. اخیرا، نمایش چهارگانی (کواترنیونی) تصاویر تبدیل به یک شیوه کارآمد برای توصیف تصاویر رنگی شده است. با استفاده از نمایش چهارگانی تصاویر رنگی، امکان پردازش و در نظر گرفتن اطلاعات متقابل بین کانال های رنگی تصاویر به صورت توامان فراهم می شود. تاکنون عملگرهای چهارگانی ساده ای همچون عملگرهای چرخش، انعکاس و انتقال کلیفورد برای تصاویر رنگی توسعه یافته اند. با اعمال این عملگرهای ساده بر تصاویر رنگی، می توان اطلاعات کم عمقی در خصوص رابطه بین کانال های رنگی یک تصویر به دست آورد. در این مقاله، ابتدا عملگرهای چهارگانی جدیدی موسوم به عملگرهای چهارگانی ترکیبی را پیشنهاد می دهیم که از ترکیب متوالی و دلخواه عملگرهای چهارگانی ساده تشکیل می شوند. با استفاده از عملگرهای پیشنهادی می توان اطلاعات عمیق تری از تصاویر رنگی را استخراج نمود. سپس، روشی مبتنی بر مفهوم کدگذاری تنک را برای دسته بندی تصاویر بافتی رنگی توصیف شده توسط عملگرهای چهارگانی ترکیبی پیشنهاد می دهیم. نتایج تجربی نشان می دهند که روش پیشنهادی برای دسته بندی تصاویر بافتی رنگی از دقت بالاتری در مقایسه با سایر روش های موجود بر روی سه پایگاه مشهور از تصاویر بافتی رنگی برخوردار است

کلید واژگان: بافت، کدگذاری تنک، چهارگان، اتم، عملگرهای چهارگانی، رنگ

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

A Novel Color Texture Classification using Sparse Coding based on Quaternionic Representation

Hadi Hadizadeh *

Machine Vision and Image Processing, Volume:6 Issue: 2, 2020, PP 147 -158

Texture and color are two important attributes for object recognition. Recently, quaternionic representation of color images have been used as an effective method for color image processing. Using such a representation, it is possible to consider the mutual interaction between different color channels. In the last decade, several quaternion operations like rotation, reflection, and Clifford translation have been developed. Such operators are able to extract shallow information from the color images. In this paper, we first propose a set of new quaternion operators called hybrid quaternionic operators, which can be produced by a cascade of several simple quaternionic operators. Such operators can extract deeper information from the color images. We then use such operators, and present a novel color texture classification method using the concept of sparse coding. Experimental results indicate that the proposed method outperforms several existing and popular methods.

Keywords: Texture, Quaternion, Sparse Coding

Abstract View Paper Research/Original Article Original: Persian
A Novel Singing Voice Separation Method Based on Sparse Non-Negative Matrix Factorization and Low-Rank Modeling

S. Mavaddati*

Iranian Journal of Electrical and Electronic Engineering, Volume:15 Issue: 2, Jun 2019, PP 161 -171

A new single channel singing voice separation algorithm is presented in this paper. This field of signal processing provides important capability in various areas dealing with singer identification, voice recognition, data retrieval. This separation procedure is done using a decomposition model based on the spectrogram of singing voice signals. The novelty of the proposed separation algorithm is related to different issues listed in the following: 1) The decomposition scheme employs the vocal and music models learned using sparse non-negative matrix factorization algorithm. The vocal signal and music accompaniment can be considered as sparse and low-rank components of a singing voice segment, respectively. 2) An alternating factorization algorithm is used to decompose input data based on the modeled structures of the vocal and musical components. 3) A voice activity detection algorithm is introduced based on the energy of coding coefficients matrix in the training step to learn the basis vectors that are related to instrumental parts. 4) In the separation phase, these non-vocal atoms are updated to the new test conditions using the domain transfer approach to result in a proper separation procedure with low reconstruction error. The performance evaluation of the proposed algorithm is done using different measures and leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures. The average improvement values of the proposed separation algorithm for PESQ, fwSegSNR, SDI, and GNSDR measures in comparison with previous separation methods in two defined test scenario and three mentioned SMR levels are 0.53, 0.84, 0.39, and 2.19, respectively.

Keywords: Singing Voice Separation, Dictionary Learning, Incoherence, Sparse Coding, Voice Activity Detector.

Abstract View Paper Research/Original Article Original: English
تشخیص نواحی مزاحم بصری در تصاویر به وسیله روش نمایش تنک دومرحله ای و وزن دار نمونه آزمون

فردین صبوری، فرزین یغمایی *

نشریه مهندسی برق و مهندسی کامپیوتر ایران، سال شانزدهم شماره 3 (پیاپی 48، پاییز 1397)، صص 213 -222

مخاطب یک تصویر مایل است که در کوتاه ترین زمان، پیام اصلی تصویر را دریافت کند. از این رو سیستم بینایی انسان توجه بصری را ناخودآگاه به سمت نواحی برجسته، با فرض وجود اطلاعات مفید در آنها هدایت می کند. عملا این فرض همواره صادق نبوده و در مواردی، نواحی برجسته صرفا موجب مزاحمت بصری می گردند. از این رو در کاربردهای مختلف نیاز به ساز و کاری جهت تشخیص این نواحی می باشد تا با حذف این نواحی، حواس مخاطب از سوژه اصلی تصویر پرت نشود. همچنین نادیده گرفتن این نواحی، کمک شایانی است به روش هایی که بر پایه تشخیص نواحی برجسته و مهم عمل می کنند. بدین منظور در این مقاله، بر اساس روش های منطبق بر چالش عدم توازن دسته ها، هر قطعه از تصاویر آموزشی با توجه به ماسک آنها به 9 دسته افراز می شود که شماره هر دسته متناسب با شدت مزاحمت است. سپس ویژگی های مبتنی بر قطعه استخراج و دسته هر قطعه بر اساس روش نمایش تنک دومرحله ای و وزن دار نمونه آزمون که بر مبنای سیستم کدگذاری و بازنمایی تنک است، تعیین می شود. به منظور ارزیابی دقیق روش پیشنهادی و مقایسه آن با سایر روش ها، 4 معیار ارزیابی با رویکردهای مختلف معرفی و پیشنهاد می شود. با ارزیابی و سنجش نتایج نشان داده می شود که روش پیشنهادی علی رغم زمان بر بودن، نسبت به کارهای پیشین دارای دقت بیشتری است.

کلید واژگان: سیستم بینایی انسان، توجه بصری، نواحی برجسته، مزاحمت بصری، سیستم کدگذاری و بازنمایی تنک

چکیده مشاهده متن زبان: فارسی

Visual Distractors Detecting in Images Using Weighted Two Phase Test Sample Sparse Representation Method

F. Sabouri, F. yaghmaee *

Iranian Journal of Electrical and Computer Engineering, Volume:16 Issue: 3, 2019, PP 213 -222

The image observer usually wants to receive the message and the main subject of the image in the shortest time. Hence, assuming there is useful information in the salient regions, the human vision system unconsciously guides visual attention towards them. This assumption is not always correct in practice, and in some cases, salient regions merely cause visual distractions. Therefore, in different applications, a mechanism is needed to identify these regions. To prevent from distracting observer’s attention from the main subject, these regions are eliminated. Furthermore, neglecting these regions could be of considerable assistance to the methods that function base on salient regions recognition. So, in this paper, Based on the methods of the class imbalance challenge each segment of training images in the dataset is a partition to 9 classes according to the relevant mask in the dataset, that the number of each class is proportional to its disturbance intensity. Then, segment-based features are extracted and determining the class of each segment is determined according to WTPTSSR method, which is based on the Sparse Coding and Representation system.Finally, in order to precisely analyzing the proposed method and comparing it to other approaches, four analysis criteria with different performances are presented. According to results, despite being time-consuming, the proposed method has a higher accuracy than the previous ones.

Keywords: Human vision system, visual attention, salient regions, visual distractor, sparse coding, representation

Abstract View Paper Regular Article Original: Persian
آشکارسازی ناهنجاری ها در تصاویر ابرطیفی بر پایه کدگذاری تنک و پنجره های محلی متحرک

سید رضا صوف باف، محمودرضا صاحبی*، برات مجردی

نشریه علوم و فنون نقشه برداری، سال هشتم شماره 2 (پیاپی 30، پاییز 1397)، صص 115 -132

در این تحقیق یک روش نوین جهت آشکارسازی ناهنجاری ها در تصاویر ابرطیفی بر پایه کدگذاری تنک و با استفاده از پنجره های متحرک محلی پیشنهاد شده است. مهمترین نقطه قوت این روش فراهم نمودن شرایط و امکان قضاوت بهتر در خصوص احتمال وقوع ناهنجاری در داده های ابرطیفی با بکارگیری روشی با قابلیت تجمیع و هم افزایی اطلاعات هر پیکسل تصویری طی عبور پنجره متحرک از آن می باشد. در این روش با عبور یک پنجره ی متحرک، هر پیکسل موقعیت های مکانی مختلفی را نسبت به همسایگان مکانی خود تجربه نموده و در هر یک از موقعیت های مذکور یک دیکشنری محلی که مبین داده های پس زمینه می باشد، بصورت بهینه با استفاده از الگوریتمK-SVD تشکیل شده و بازسازی داده های موجود به روش تخمین تنک در پنجره متحرک با بگارگیری الگوریتم SOMP صورت می پذیرد. بنابراین در هر موقعیت پنجره برای هر پیکسل خطای بازسازی با روش کدگذاری تنک مورد محاسبه قرار می گیرد. با توجه به استفاده از دیکشنری پس زمینه در بازسازی کلیه داده ها، هر گاه احتمال وقوع ناهنجاری در پیکسل مورد بررسی بیشتر باشد، مقدار خطای بازسازی آن نیز بزرگتر خواهد بود. لذا با بررسی این خطا در موقعیت های مختلف هر پیکسل نسبت به پنجره متحرک، می توان به مجموعه ای از خطاهای بازسازی برای آن پیکسل دست یافت که در نهایت واریانس آنها به عنوان معیار آشکارسازی ناهنجاری در نظر گرفته می شود. مقایسه نتایج آشکارسازی با روش پیشنهادی در این تحقیق با الگوریتم هایی مانند GRX, LRX, CRD وBJSR با بکارگیری چهار نوع داده ابرطیفی اعم از واقعی و شبیه سازی شده، حاکی از کارایی بهتر آشکارساز پیشنهادی به میزان متوسط حدود 9 درصد نسبت به آنها می باشد.

کلید واژگان: تصاویر ابرطیفی، آشکارسازی ناهنجاری، کدگذاری تنک، پنجره متحرک، KSVD

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Hyperspectral Anomaly Detection Based on Sparse Coding Using Sliding Windows

S. R. Soofbaf, M. R. Sahebi *, B. Mojaradi

Journal of Geomatics Science and Technology, Volume:8 Issue: 2, 2018, PP 115 -132

Since the anomalies are unknown targets with low probabilities of occurrence which are significantly different from their neighbors, anomaly detection could be considered as one of the most important information extraction approaches from hyperspectral data. Various types of parametric and non-parametric algorithms have been developed in this area from the 1990's decade. Recently, sparse representation methods have been introduced and successfully accepted as a useful tool for anomaly detection based on the recovery of the majority of high-dimensional signals via a low-dimensional subspace through a dictionary of normalized signals called atoms. In other words, having a dictionary composed of bases denoting the background subspace enables the accurate recovery of background signals. Moreover, the presence of anomaly signals, assuming their deviation from the background subspace, will not have a precise estimation by the background dictionary. Hence the main idea of these anomaly detection methods is focused on evaluating recovery errors of signals by a dictionary that describes the background subspace. In such procedure, removing the atoms that describe the anomaly in the background dictionary can be considered as one of the essential actions. To this aim making diversity in the definition of spatial neighborhoods of spectral signals, as well as voting-based judgment in different situations of the spatial distribution could be proposed. In other words, by designing an optimized local dictionary, based on a local sliding window, the votes of each signal in terms of anomaly presence in each spatial neighborhood could be calculated with the aim of achieving better judgment. In this paper, a new anomaly detector for hyperspectral images is proposed based on simultaneous sparse representation using a new structured sliding window. The main contribution of this research is to improve the judgments about the anomaly presence probability using information collected during transition of the mentioned sliding window for each pixel under test. In this algorithm, each pixel experiences various spatial positions with respect to the neighbors through the transition of the sliding window. In each position, an optimized local background dictionary is molded using a well-known K-SVD method as an iterative process and the recovery error of sparse coding for each pixel under test is calculated using a simultaneous orthogonal matching pursuit algorithm (SOMP). So, the votes of each pixel in terms of the anomaly presence in each neighborhood are calculated and finally the variance of these estimated errors is considered as the anomaly detection criterion. The experimental results of the proposed method using four datasets (synthetic and real datasets) proved its higher performance compared to the GRX, LRX, CRD and BJSR detectors with an average efficiency improvement of about 9%. In addition automatic tuning of the proposed algorithm parameters (level of sparsity and the size of sliding window) and developing parallel processing techniques to improve the running time of this algorithm are the focus of our future research. It is notable that the implementation of this idea and its success showed that development of voting algorithms and the combination of the results could be considered as an efficient approach could also be utilized in other hyperspectral image processing algorithms.

Keywords: Keywords: Hyperspectral Images, Animaly Detection, Sparse Coding, Sliding Window, KSVD

Abstract View Paper Research/Original Article Original: Persian
بهبود کیفیت تصاویر در ناوبری پهپاد با استفاده از روش فراتفکیک پذیری مبتنی بر شبکه عصبی کانولوشنی با نگاشت چندلایه

سید محمدرضا موسوی، مجید آقابابایی *، پیمان خزایی پول، محمد خویشه

فصلنامه دریا فنون، سال چهارم شماره 1 (پیاپی 7، بهار 1396)، صص 1 -11

ناوبری هوشمندانه و خودکار پهپاد براساس انطباق تصاویر دریافتی از پهپاد با تصاویر ماهواره ای یکی از جدیدترین گونه های ناوبری می باشد که بسیار مورد توجه محققان و صنعتگران این حوزه قرار گرفته است. این روش هم از نظر جنگ الکترونیک و هم از نظر کارآیی ، زمانی موثر است که تصاویری با کیفیت بالا موجود باشد تا ویژگی های تصاویر را بتوان استخراج نمود. اما یکی از عواملی که سبب کاهش بهره استخراج ویژگی های تصاویر و انطباق آن با تصاویر ماهواره ای می شود، ماتی است. مات زدایی از تصاویر تبدیل به یک موضوع به روز و چالش برانگیز برای محققان شده است. در این مقاله روشی نوین برای بهسازی کیفیت تصاویر با استفاده از روش فراتفکیک پذیری مبتنی بر شبکه عصبی کانولوشنی CNN با چند لایه نگاشت غیرخطی ارائه گردید که نقش بسزایی در مات زدایی و نویززدایی از تصاویر پهپاد دارد. نتایج نشان می دهد که این روش نسبت به روش های مطرح امروزی، کارآیی بهتری دارد. به طوری که روش پیشنهادی میزان کیفیت تصویر را نسبت به روش های مطرح، براساس معیار نسبت پیک سیگنال به نویز (PSNR) تقریبا به اندازه 5% افزایش می دهد.

کلید واژگان: ناوبری، سیستم موقعیت یاب جهانی، فراتفکیک پذیری، پراکندگی، شبکه عصبی کانولوشنی

چکیده مشاهده متن زبان: انگلیسی

Improving Quality of Images in UAVs Navigation using Super-Resolution Techniques based on Convolutional Neural Network with Multi-Layer Mapping

S. M. Mousavi

Marine Technology, Volume:4 Issue: 1, 2017, PP 1 -11

Intelligent and autonomous UAVs navigation, which is based on the compliance of received images from UAVs with those from satellites, is one of the newest types of navigation that has received much attentions from researchers and industrialists of this area. This approach is effective, in terms of electronic warfare and efficiency, when high-quality images are available so that the effective features of images can be extracted. However, blurring is one of the main destructive factors leading to decrease the extraction rate and then weak satellites images adaptation. So, Image de-blurring has become a new challenging issue for researchers. In this paper, a new method is proposed for improving image quality, using super-resolution techniques based on Convolutional Neural Network (CNN) with non-linear multi-layer mapping, which plays an important role in de-blurring and removing noises from UAV images. The simulation results show that the proposed method has much better performance compared to the other benchmark techniques in term of peak signal to noise ratio (PSNR) so that the proposed method increases the aforementioned criteria about 5%.

Keywords: Navigation, GPS, Super-Resolution, Sparse Coding, CNN

Abstract View Paper Original: English
Face Recognition using an Affine Sparse Coding approach

M. Nikpour, R. Karami *, R. Ghaderi

Journal of Artificial Intelligence and Data Mining, Volume:5 Issue: 2, Summer-Autumn 2017, PP 223 -234

Sparse coding is an unsupervised method which learns a set of over-complete bases to represent data such as image and video. Sparse coding has increasing attraction for image classification applications in recent years. But in the cases where we have some similar images from different classes, such as face recognition applications, different images may be classified into the same class, and hence the classification performance may be decreased. In this paper, we propose an Affine Graph Regularized Sparse Coding approach for face recognition problem. Experiments on several well-known face datasets show that the proposed method can significantly improve the face classification accuracy. In addition, some experiments have been done to illustrate the robustness of the proposed method to noise. The results show the superiority of the proposed method in comparison to some other methods in face classification.

Keywords: Sparse coding, Manifold Learning, Face recognition, Graph Regularization

Abstract View Paper Original: English
A new Sparse Coding Approach for Human Face and Action Recognition

Mohsen Nikpour*, Mohammad Reza Karami Molaei, Reza Ghaderi

Journal of Information Systems and Telecommunication, Volume:5 Issue: 1, Jan-Mar 2017, P 16

Sparse coding is an unsupervised method which learns a set of over-complete bases to represent data such as image, video and etc. In the cases where we have some similar images from the different classes, using the sparse coding method the images may be classified into the same class and devalue classification performance. In this paper, we propose an Affine Graph Regularized Sparse Coding approach for resolving this problem. We apply the sparse coding and graph regularized sparse coding approaches by adding the affinity constraint to the objective function to improve the recognition rate. Several experiments has been done on well-known face datasets such as ORL and YALE. The first experiment has been done on ORL dataset for face recognition and the second one has been done on YALE dataset for face expression detection. Both experiments have been compared with the basic approaches for evaluating the proposed method. The simulation results show that the proposed method can significantly outperform previous methods in face classification. In addition, the proposed method is applied to KTH action dataset and the results show that the proposed sparse coding approach could be applied for action recognition applications too.

Keywords: Sparse Coding, Manifold Learning, Graph Regularization, Affinity, Image Representation, Image Classification

Abstract View Paper Original: English
خلاصه سازی مقیاس پذیر ویدئویی با استفاده از انتخاب واژه نامه تنک

حسن فرسی *، مهندس پوریا اعتضادی فر

مجله علوم و فناوری های پدافند نوین، سال هفتم شماره 4 (پیاپی 26، زمستان 1395)، صص 315 -326

یکی از حوزه های مهم در پدافند غیرعامل، شناسایی تهدیدات و اعلام هشدار است. یکی از روش های پرکاربرد در حوزه شناسایی بررسی داده های ویدئویی به منظور شناسایی اهداف ناشناس و اعلام هشدار است. به منظور بررسی سریع، با دقت بالا روش های خلاصه سازی ویدیو ارائه شده است. همچنین در طول سال های گذشته، ایجاد ویدیو دیجیتالی منجر به رشد نمایی محتوای ویدئویی شده است. به منظور افزایش قابلیت استفاده از این حجم بالای ویدیو، تحقیقات بسیاری به انجام رسیده و خلاصه سازی ویدیو به جهت مرور سریع این مجموعه ویدئویی بزرگ و برای کمک به فهم سریع محتوای داده های ویدئویی پیشنهاد شده است. در خلاصه سازی ویدیو، تصاویری به عنوان نماینده از هر صحنه انتخاب می شود تا مروری تصویری از تمام فیلم به دست آید. اخیرا روش هایی با استفاده از فرمول بندی تنک برای خلاصه سازی ویدیو، داده های ویدئویی را به میزان زیادی نسبت به دیگر روش ها خلاصه نموده اند. در این مقاله به خلاصه سازی ویدیو، به عنوان یک مسئله انتخاب واژه نامه تنک پرداخته می شود. بدین منظور، با استفاده از روشی جدید بر پایه کدینگ تنک، می توان به میزان زیادی خلاصه سازی داده های ویدئویی را نسبت به دیگر روش های خلاصه سازی ویدیو که با روش تنک و یا روش های دیگر پیشنهادشده اند بهبود بخشید. اساس این روش بر پایه حل معادله بهینه سازی با استفاده از آستانه گذاری نرم است که پیچیدگی کمتری نسبت به روش های پیشنهادشده اخیر است. این امر را می توان با بررسی میزان پیچیدگی روش پیشنهادی با روش های متداول اخیر متوجه شد. در انتها نتایج آزمایش برای مجموعه داده های معیاری زمین حقیقت و با روش های State of the art، ادعای ما در بهبود میزان خلاصه سازی روش پیشنهادی را نشان می دهد.

کلید واژگان: خلاصه سازی ویدیو، انتخاب واژه نامه، فریم کلیدی، آنالیز ویدیو، کدینگ تنک

چکیده مشاهده متن زبان: فارسی

Scalable Videos Summarization Using Sparse Dictionary Selection

Dr. Hassan Farsi *, Pouryia Etezadifar

Journal of Passive Defence Science and Technology, Volume:7 Issue: 4, 2017, PP 315 -326

One of the important topics of passive defense is threats detection and waning alarm. One of the most widely used methods in detection field is video data investigation in order to identify unknown targets and warning alarm. In order to evaluate a fast and high-precision technique, video summarization is presented. Also, during the past years, creation of digital videos has caused exponential growth of video content. To increase the high volume of video usability, a lot of researches have been done and video summarization has been proposed to quick view of large video collection and quick understanding of the content of video data. In the video summarization, pictures are selected as a representative of each scene to obtain a visual overview of whole video. Recently, new methods using sparse formulation are suggested for video summarization being more effective in video data summarization than other methods. In this paper, video summarization is presented as a sparse dictionary selection problem. For this purpose, using a new method based on sparse coding, have been able to improve video data summarization compared to other video summarization methods based on sparse or other coding. Finally, the results for the ground truth data collection and State of the art methods, shows improvement our claim in the video summary on proposed method.

Keywords: Video Summarization, Dictionary Selection, Key Frame, Video Analysis, Sparse Coding

Abstract View Paper Original: Persian

نکته

نتایج بر اساس تاریخ انتشار مرتب شده‌اند.
کلیدواژه مورد نظر شما تنها در فیلد کلیدواژگان مقالات جستجو شده‌است. به منظور حذف نتایج غیر مرتبط، جستجو تنها در مقالات مجلاتی انجام شده که با مجله ماخذ هم موضوع هستند.
در صورتی که می‌خواهید جستجو را در همه موضوعات و با شرایط دیگر تکرار کنید به صفحه جستجوی پیشرفته مجلات مراجعه کنید.

به جمع مشترکان مگیران بپیوندید!

sparse coding