جستجوی مقالات مرتبط با کلیدواژه

convolutional neural network

در نشریات گروه فناوری اطلاعات

تکرار جستجوی کلیدواژه convolutional neural network در نشریات گروه فنی و مهندسی

انتخاب همه

Exploring a Novel Multi-Channel Structure to Improve Facial Expression Recognition on Occluded Samples Using Deep Convolutional Neural Network

Mohammadhossein Zolfagharnasab, Mohammad Bahrani *, Masood Hamed Saghayan, Fatemeh Sadat Masoumi

Journal of Artificial Intelligence, Applications, and Innovations, Volume:1 Issue: 2, Spring 2024, PP 26 -41

The development of Artificial Intelligence (AI) models with an accurate prediction of human facial expression has become a significant challenge for the cases in which masks and sunglasses cover critical facial areas. Given that a substantial portion of human interactions involves non-verbal communication, accurately detecting human emotions such as anger, fear, disgust, happiness, sadness, and surprise would benefit a wide range of applications, from security assessments to psychological treatments. As a workaround, the current study explores the performance of a novel multi-channel arrangement comprised of a Haar-wavelet, Histogram of Oriented Gradients (HOG), and grayscale filters to improve the predictions of deep Convolutional Neural Network (CNN) on occluded results. This study uses the FER-2013 dataset and produces occluded samples by applying a virtual mask that covers almost 55% of facial areas comprising the mouth, lips, and jaw locations. Further investigations, including the impact of each filter, utilizing pre-trained models on occluded samples (transfer learning), and comparison to prior models are also carried out. The proposed approach yields an accuracy rate of 71% for non-occluded and 66% for the occluded samples, which are 6% to 11% higher than the base model. Further transfer learning technique increases the accuracy metrics by 18%, indicating that non-occluded pre-trained models can reveal a broader range of features and their relation, which to some extent compensates for the removed features due to the occlusion. These results suggest the potential capabilities of the proposed technique for similar imaging applications.

Keywords: Sample Occlusion, Facial Expression Recognition, Convolutional Neural Network, Transfer Learning, Histogram Of Oriented Gradients, Haar Wavelet

Abstract View Paper Research/Original Article Original: English
تشخیص و شناسایی خطا در سیستم های فتوولتائیک با استفاده از شبکه عصبی عمیق VGG16

سمانه عظیمی، محمد منثوری*، مهدی اخباری

نشریه فناوری اطلاعات و ارتباطات ایران، سال شانزدهم شماره 61 (پاییز و زمستان 1403)، صص 247 -260

تشخیص خطا در آرایه های فتوولتائیک (PV) جهت افزایش توان خروجی و همچنین طول عمر مفید یک سیستم PV ضروری است. وجود شرایطی مانند سایه جزئی، خطاهای امپدانس بالا و وجود سامانه ردیاب نقطه حداکثر توان (MPPT)، تشخیص خطا را در شرایط محیطی به چالش می کشد. بیشتر تحقیقات انجام شده در این زمینه فقط در چند سناریو از عیوب به شناسایی و طبقه بندی پرداخته اند. این پژوهش با استفاده از شبکه ی عصبی کانولوشنی عمیق از پیش آموزش داده شده VGG16)) و با بهره گیری از ویژگی ها استخراج شده بوسیله اسکالوگرام های دوبعدی تولیدشده از داده های سیستم PV، به شناسایی و طبقه بندی خطا در سیستم PV با استفاده از یک شبکه عصبی کاملا متصل می پردازد. برخلاف روش های قبلی پیشنهادشده در ادبیات موضوع تشخیص و طبقه بندی عیوب، موارد مختلف معیوب همراه با ترکیب MPPT در مطالعه ما در نظر گرفته شده است. در این تحقیق نشان داده شده است که روش پیشنهادی شاملCNN از پیش آموزش دیده تنظیم شده، از روش های موجود بهتر عمل می کند و به دقت تشخیص خطای 375/83 درصد دست پیدا می کند.

کلید واژگان: آرایه فتوولتائیک، ردیاب نقطه حداکثر توان، طبقه بندی خطا، شبکه عصبی کانولوشنی VGG16، اسکالوگرام

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Fault Diagnosis and Detection in Photovoltaic Systems Using Neural Network VGG16

Samaneh Azimi, Mohammad Manthouri*, Mahdi Akhbari

Journal of Information and Communication Technology, Volume:16 Issue: 61, 2025, PP 247 -260

Fault detection in photovoltaic (PV) arrays is necessary to increase the output power and also the useful life of a PV system. The presence of conditions such as partial shade, high impedance faults, and the maximum power point detector (MPPT) system make the fault detection of PV in environmental conditions more challenging. The literature identified and classified defects just in few scenarios. In this study two-dimensional scalograms are generated from PV system data. The VGG16 as a pretrained convolutional neural network is used for feature extraction. Finally, to identify and classify faults in the PV system a fully connected neural network is trained. Unlike the previous methods proposed in the literature on the subject of defect detection and classification, various defective cases with MPPT combination are considered in this research. It has been shown that the proposed method including pre-trained CNN performs better than the existing methods and achieves an error detection accuracy of 83.375%.

Keywords: Photovoltaic Array, Maximum Power Point Tracking, Fault Classification, Convolutional Neural Network, Scalograms

Abstract View Paper Research/Original Article Original: Persian
Genetic algorithm-based hyperparameter optimization of convolutional neural network models for white blood cells classification

Ahmad Nasrollahpour *, Mohammad Khanabadi, Toktam Khatibi

Journal of Innovations in Computer Science and Engineering, Volume:2 Issue: 1, Winter and Spring 2024, PP 1 -8

Detecting white blood cells (WBC) in microscopic images is essential in medical diagnosis. Manual analysis of these images is time-consuming and has a high error rate. Using object detection for WBCs detection with deep convolutional neural networks (CNN) can be considered a practical and effective solution. In this study, a CNN model is proposed to classify these images. In order to achieve optimal training performance, CNNs have many hyperparameters, such as dropout rate, number of hidden units in each hidden layer, activation function, loss function and optimizer, which need to be optimized. Therefore, a hyperparameter optimization approach based on a genetic algorithm is suggested, which can then be used to select the best combination parameters to improve accuracy and efficiency in detecting white blood cells in microscopic images. This new approach is significant and flexible for medical technicians to use in clinical practice for examining blood cell microscopy. In this research, the images were classified into five classes and the mean accuracy of the model for the five classes was 87%, which is considered a good accuracy for classification into five classes.

Keywords: Image Processing, Deep Learning, Metaheuristic Algorithm, Convolutional Neural Network, Histopathology Image Processing

Abstract View Paper Research/Original Article Original: English
Presenting Multiple Biometric Identity Verification Model Based on Business Intelligence in Electronic Prescription System

Ahrar Hosseini, Reza Radfar *, Ataollah Abtahi

Journal of Computer and Robotics, Volume:17 Issue: 2, Summer and Autumn 2024, PP 77 -92

This research highlights the significance of biometric authentication in ensuring secure and reliable identification across various domains. The proposed approach combines fingerprint and eye biometric features, leveraging the "or" relationship to authenticate individuals using either feature when necessary. By incorporating multiple biometric features, the model enhances accuracy and reliability, providing a comprehensive solution for identity verification. The use of a convolutional neural network (CNN) enables high accuracy and effective processing of diverse image types. The results demonstrate remarkable performance, with accuracy rates exceeding 99% in both fingerprint and eye biometric authentication. The model's integration of multiple features and advanced machine learning techniques enhances authentication processes, enabling better management of access control and reducing unauthorized access risks. Future research should explore expanding the range of biometric features and further improving the model's architecture and algorithms to adapt to emerging technologies and security requirements. Overall, this research emphasizes the importance of biometric authentication, presents an effective approach, and paves the way for advancements in the field, enabling organizations to establish robust and secure identification systems.

Keywords: Multi-Factor Authentication, Convolutional Neural Network, Image Processing, Fingerprint Authentication, Eye Biometrics Authentication

Abstract View Paper Research/Original Article Original: English
Improving intrusion detection system in the internet of things using a combination of convolutional neural network and cuckoo algorithm

Ali Shahriari, Mohammadhosein Davarpour, Mohammad Ahmadinia*

Journal of Future Generation of Communication and Internet of Things, Volume:3 Issue: 1, Jan 2024, PP 1 -9

The Internet of Things (IoT) refers to the connection of various devices to each other via the internet. Conceptually, the IoT can be defined as a dynamic, self-configuring network infrastructure based on standards and participatory communication protocols. The main goal of the IoT is to lead towards a better and safer community. However, one of the fundamental challenges in developing the IoT is the issue of security, and intrusion detection systems are one of the main methods to create security in the IoT. On the other hand, Convolutional Neural Network (CNN), with its specific features, is one of the best methods for analyzing network data. This network is a type of deep neural network composed of multiple layers that can ultimately reduce the dimensions of features. Additionally, the cuckoo algorithm has parameters required for configuration in the initial search, which are very few and can naturally and efficiently cope with multi-state problems. In this paper, a new method for intrusion detection in the IoT using CNN and feature selection by the cuckoo algorithm is presented. Simulation results indicate the satisfactory performance of the proposed method.

Keywords: Internet Of Things, Intrusion Detection, Convolutional Neural Network, Cuckoo Algorithm, Dimensionality Reduction

Abstract View Paper Research/Original Article Original: English
Improving intrusion detection system in the internet of things using a combination of convolutional neural network and cuckoo algorithm

Ali Shahriari, Mohammad Davarpour, Mohammad Ahmadinia*

Journal of Future Generation of Communication and Internet of Things, Volume:2 Issue: 4, Oct 2023, PP 1 -9

The Internet of Things (IoT) refers to the connection of various devices to each other via the internet. Conceptually, the IoT can be defined as a dynamic, self-configuring network infrastructure based on standards and participatory communication protocols. The main goal of the IoT is to lead towards a better and safer community. However, one of the fundamental challenges in developing the IoT is the issue of security, and intrusion detection systems are one of the main methods to create security in the IoT. On the other hand, Convolutional Neural Network (CNN), with its specific features, is one of the best methods for analyzing network data. This network is a type of deep neural network composed of multiple layers that can ultimately reduce the dimensions of features. Additionally, the cuckoo algorithm has parameters required for configuration in the initial search, which are very few and can naturally and efficiently cope with multi-state problems. In this paper, a new method for intrusion detection in the IoT using CNN and feature selection by the cuckoo algorithm is presented. Simulation results indicate the satisfactory performance of the proposed method.

Keywords: Internet Of Things, Intrusion Detection, Convolutional Neural Network, Cuckoo Algorithm, Dimensionality Reduction

Abstract View Paper Research/Original Article Original: English
بهبود شبکه های Unet برای قطعه بندی تصاویر پزشکی با اضافه کردن لایه های مکانیزم توجه

محمدرجب قانع*، عباس بحرالعلوم، مهدی افتخاری

نشریه ماشین بینایی و پردازش تصویر، سال دهم شماره 4 (زمستان 1402)، صص 49 -59

قطعه بندی تصاویر پزشکی یکی از مهم ترین گام ها در تحلیل تصاویر پزشکی، جهت بهبود تشخیص و یافته ها است. یکی از متداول ترین روش های قطعه بندی در یادگیری عمیق، استفاده از شبکه های Unet است. وجود لایه های متراکم در قسمت رمزگشای Unet، اجازه استخراج اطلاعات از لایه های عمیق تر را نمی دهد؛ همچنین به علت محدودیت میدان دریافتی هسته های کانولوشن، اطلاعات و وابستگی های دوربرد به خوبی در نظر گرفته نمی شوند. در این مقاله، هدف طراحی یک ساختار در اتصالات پرش به منظور کاهش شکاف معنایی بین ناحیه رمزگذار و رمزگشا است. استخراج بهتر و تمرکز بیشتر برروی ویژگی های محلی و سراسری در مجموعه داده های مختلف، از ویژگی های این ساختار است.همچنین یک ساختار توجه به منظور کاهش پارامترهای شبکه و بهبود نتایج، در گلوگاه شبکه طراحی شده است. این روش برروی 6 مجموعه داده پزشکی ارزیابی شده است که نتایج به دست آمده در دو معیار ارزیابی Diceو Iou نشان می دهد مدل پیشنهاد شده نتایج بهتری نسبت به Unet و روش های مبتنی بر آن دارد.

کلید واژگان: قطعه بندی تصاویر پزشکی، شبکه های عصبی پیچشی، مکانیزم توجه، یادگیری عمیق، Unet

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Improving Unet Networks for Medical Image Segmentation by adding Attention Mechanism Layers

Mohammad Rajabghane *, Abbas Bahrololoum, Mahdi Eftekhari

Machine Vision and Image Processing, Volume:10 Issue: 4, 2024, PP 49 -59

Medical image segmentation is one of the most important steps in medical image analysis to improve diagnosis and findings. One of the most common segmentation methods in deep learning is the use of Unet networks. The presence of overlapping layers in the Unet decoder does not allow extracting information from deeper layers. Also, due to the limited range of the received field of convolution cores, long-range information and dependencies are not considered well. In this article, our goal is to place a structure in the area between encoder and decoder in the Unet model in order to fill the semantic gap between the encoder and decoder area and better extract features by paying attention to local and global features. This model makes the target region more prominent in different medical datasets. We have conducted our experiment on 6 medical data sets, and the results obtained in two evaluation criteria, Dice and Iou, show that our proposed model has better results than Unet and based methods.

Keywords: Medical Image Segmentation, convolutional neural network, Attention Mechanisms, Deep Learning, Unet

Abstract View Paper Research/Original Article Original: Persian
بررسی جامع کارایی روش نهانکاوی مبتنی بر یادگیری عمیق در کشف روشهای حوزه مکان

وجیهه ثابتی*، مهدیه سمیعی ولوجردی

نشریه منادی امنیت فضای تولید و تبادل اطلاعات (افتا)، سال دوازدهم شماره 2 (پیاپی 24، پاییز و زمستان 1402)، صص 42 -50

نهاننگاری، هنر مکاتبات پنهانی است که در آن یک پیام به صورت مخفیانه منتقل میشود و نهانکاوی، هنر کشف حضور اطلاعات پنهان است. شبکه های عصبی پیچشی برخلاف روش های نهانکاوی سنتی، با استخراج خودکار ویژگیها، وجود داده را تشخیص میدهند. در مقالات مختلف، عملکرد مدلهای موجود بر روی تعداد محدودی از روش های نهاننگاری حوزه مکان گزارش شده است. هدف اصلی این مقاله، ارایه یک شبکه عصبی پیچشی و بررسی جامع عملکرد آن در کشف روش های حوزه مکان مختلف است. مدل پیشنهادی از سه قسمت پیش پردازش، ماژول پیچشی و طبقهبند تشکیل شده است. در لایه ماژول پیچشیلایه تمام متصل تشکیلشده است. از روش های جاسازی در بیت کم ارزش، 3 بلوک و طبقه بند از 5 از جاسازی در مقدار اختلاف پیکسلها و جاسازی مبتنی بر ایده تطبیقی برای تست استفاده شده است. روش وجود داده های با طولهای حتی بسیار کم در روش های دو گروه 97٪ پیشنهادی میتواند با دقت بالاتر از اول را شناسایی کند. عملکرد روش پیشنهادی در کشف درصد جاسازی های بسیار کم روش تطبیقی با دقت بسیار مناسب است و این ویژگی نقطه تمایز مدل پیشنهادی نسبت به روش های سنتی است. زیرا 70٪ بالای موفقیت روش های استخراج ویژگی دستی به دلیل کم بودن تغییرات ویژگیهای آماری در سطوح جاسازی پایین، بسیار کمتر است.

کلید واژگان: نهاننگاری، نهانکاوی، نهاننگاری مبتنی بر حوزه مکان، یادگیری عمیق، شبکه عصبی پیچشی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

A comprehensive evaluation of deep learning based steganalysis performance in detecting spatial methods

Vajiheh Sabeti*, Mahdiyeh Samiei

Journal Monadi for Cyberspace Security (AFTA), Volume:12 Issue: 2, 2024, PP 42 -50

Steganalysis is the art of detecting the existence of hidden data. Recent research has revealed that convolutional neural networks (CNNs) can detect data through automatic feature extraction. Several studies investigated the performance of existing models using a limited number of spatial steganography methods. This study aims to propose a CNN and comprehensively investigate its efficiency in detecting different spatial methods. The proposed model comprises three modules: preprocessing, convolutional (five blocks), and classifier (three fully connected layers). The test results for the least-significant-bit (LSB) and pixel-value differencing (PVD) based methods indicate that the proposed method can detect data of even concise length with high accuracy and a low error. The proposed method also detects complexity-based LSB-M (CBL) as an adaptive approach. Lower embedding rates make this success even more impressive. Manual feature extraction has much lower success rates due to low variations of statistical features at low embedding rates than the proposed model.

Keywords: Steganalysis, Spatial-based steganography, Deep learning, Convolutional neural network

Abstract View Paper Research/Original Article Original: Persian
Comparing the Semantic Segmentation of High-Resolution Images Using Deep Convolutional Networks: SegNet, HRNet, CSE-HRNet and RCA-FCN

Nafiseh Sadeghi *, Homayoun Mahdavi-Nasab, Mansoor Zeinali, Hossein Pourghasem

Journal of Information Systems and Telecommunication, Volume:11 Issue: 4, Oct-Dec 2023, PP 359 -367

Semantic segmentation is a branch of computer vision, used extensively in image search engines, automated driving, intelligent agriculture, and disaster management, and other machine-human interactions. Semantic segmentation aims to predict a label for each pixel from a given label set, according to semantic information. Among the proposed methods and architectures, researchers have focused on deep learning algorithms due to their good feature learning results. Thus, many studies have explored the structure of deep neural networks, especially convolutional neural networks. Most of the modern semantic segmentation models are based on fully convolutional networks (FCN), which first replaces the fully connected layers in common classification networks by convolutional layers, getting pixel-level prediction results. After that, a lot of methods are proposed to improve the basic FCN methods results. With the increasing complexity and variety of existing data structures, more powerful neural networks and the development of existing networks are needed. This study aims to segment a high-resolution (HR) image dataset into six separate classes. This paper semantic segmentation methods based on deep learning. Here, an overview of some important deep learning architectures will be presented with a focus on methods producing remarkable scores in segmentation metrics such as accuracy and F1-score. Finally, their segmentation results will be discussed.

Keywords: Semantic segmentation, Convolutional neural network, Deep neural network, High-resolution image processing

Abstract View Paper Research/Original Article Original: English
بررسی روشهای خودکار تفسیر تصاویر پزشکی بر پایه یادگیری عمیق

حسنیه ذوالفقاری، مریم رستگارپور*، محمد تشنه لب، عباس کوچاری، علیرضا احسانبخش

نشریه ماشین بینایی و پردازش تصویر، سال دهم شماره 3 (پاییز 1402)، صص 1 -29

تفسیرخودکار تصاویر، زمینه جدیدی از هوش مصنوعی است که دو شاخه پردازش زبان طبیعی و یادگیری ماشین را به خدمت می گیرد. تحقیقاتی که در سالهای اخیر بر روی این مقوله انجام شده و نتایج قابل قبولی که در این زمینه حاصل شده است از یک طرف و نیاز جامعه پزشکی به تفسیرخودکار تصاویر پزشکی از طرف دیگر، محققان را بر آن داشته تا این رویکرد را در این زمینه نیز به کار گیرند. تفسیر خودکار تصاویر پزشکی نسبت به مساله توصیف خودکار تصاویر طبیعی، چالش برانگیزتر می باشد.کمیت و کیفیت مجموعه داده های موجود در این مقوله نسبت به مجموعه داده های تفسیر تصاویر طبیعی کمتر است، تفسیرها غیرساختار یافته اند و تفسیر تصاویر طبیعی، شامل توصیف اشیاء و روابط بین آنها با یک یا چند جمله است درحالی که شرح تصاویر پزشکی شامل درک یافته های بالینی و ارایه یک گزارش دقیق از پاراگراف های مختلف است؛ تا فقط آنچه از نظر بالینی مهم است به جای آنچه در تصویر از نظر اشیاء وجود دارد برجسته گردد. در راستای رسیدن به نتایج مطلوب روش های متعددی پیشنهاد شده است که در این بین روش های مبتنی بر یادگیری عمیق، به نتایج بهتری دست یافته است. این مقاله به معرفی مجموعه داده ها، معیارهای ارزیابی و روش های توسعه یافته بر پایه یادگیری عمیق در زمینه تفسیرخودکار تصاویر پزشکی می پردازد تا کمکی در راستای درک ادبیات موجود و برجسته نمودن مسیرهای آینده در این زمینه باشد.

کلید واژگان: تفسیر تصاویر پزشکی، تفسیر تصاویر، شبکه عصبی کانولوشن، شبکه عصبی بازگشتی، مکانیزم توجه

چکیده مشاهده متن مقاله مروری زبان: فارسی

The current state of automatic medical image captioning using deep learning

Hosniyeh Zolfaghari, Maryam Rastgarpour *, Mohammad Teshnehlab, Abbas Koochari, Alireza Ehsanbakhsh

Machine Vision and Image Processing, Volume:10 Issue: 3, 2023, PP 1 -29

Image captioning is a new field of artificial intelligence which uses both natural language processing and machine learning methods. Recently, several researches conducted on Image captioning have led to the acceptable results in this field. Moreover, the need of the medical community for automatic interpretation of medical images, prompted researchers to apply this approach in the field of automatic interpretation of medical images. The medical images captioning is more challenging than the problem of image description because medical images are mostly gray and accompanied by noise, the existing data sets have significantly few samples; On the other hand, the provided interpretations are not structured and have variable lengths. In order to achieve the desired results, many algorithms have been proposed, among which methods based on deep learning have achieved proper results. This article introduces datasets, evaluation criteria and methods developed based on deep learning in the field of automatic interpretation of medical images in order to help understand the existing literature and highlight future directions in this field.

Keywords: Medical Image Captioning, image captioning, convolutional neural network, Recurrent Neural Network, attention mechanism

Abstract View Paper Review Article Original: Persian
بازیابی تصاویر پزشکی ریه با استفاده از درهم سازی با ناظر، انتخاب ویژگی mRMR و شبکه های عصبی کانولوشنی عمیق

فاطمه محمودی، فرساد زمانی بروجنی*

مجله رایانش نرم و فناوری اطلاعات، سال یازدهم شماره 4 (زمستان 1401)، صص 1 -18

توسعه پزشکی نوین از یک طرف امکان ذخیره سازی تصاویر پزشکی را فراهم کرده است و از طرف دیگر بدلیل افزایش روزانه ذخیره سازی این قبیل داده، مدیریت و بازیابی آن ها را نیز با مشکل مواجه ساخته است. با توجه به آنکه تصاویر پزشکی به عنوان ابزاری قدرتمند در تشخیص زودرس اغلب بیماری ها مورد استفاده هستند، ارایه سیستمی توانمند که بتواند از حجم رو به رشد تصاویر پزشکی، تصاویری با محتوای مشابه را بازیابی نماید، در کنترل و درمان بسیار موثر است. در این مقاله یک سیستم بازیابی تصاویر پزشکی مبتنی بر شبکه عصبی سیامی متشکل از دو زیر شبکه کانولوشن با 13 لایه ارایه شده است. برای رسیدن به زیر مجموعه بهینه از ویژگی های عمیق استخراج شده توسط سیامی، از تکنیک حداقل افزونگی- حداکثر همبستگی (mRMR) استفاده شده است و پس از درهم سازی باینری ویژگی ها، بازیابی تصاویر مشابه با استفاده از فاصله Hamming انجام می شود. اگر چه مدل مطرح قابلیت بازیابی انواع تصاویر پزشکی سطح خاکستری را دارد، اما برای ارزیابی آن، از دو نوع تصاویر ریه، شامل تصاویر سی تی اسکن بیماران کووید-19 در پایگاه داده CT-COV و تصاویر اشعه X بیماران ذات الریه در پایگاه Pneumonia استفاده شده است. نتایج حاکی از آن است که روش پیشنهاد شده در پایگاه کووید به ترتیب در 5 و 10 تصویر بازیابی توانسته است به میانگین دقت 93.83 % و 92.73 % و در پایگاه داده ذات الریه به میانگین دقت 100 % دست یابد که در مقایسه با روش های پیشین توانسته است بازیابی تصاویر ریه را بهبود ببخشد.

کلید واژگان: درهم سازی، شبکه عصبی سیامی، انتخاب ویژگی با حداقل افزونگی- حداکثر همبستگی، بازیابی تصاویر ریه

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Lung Image Retrieval Based on Supervised Hashing, MRMR Feature Selection and Deep Convolutional Neural Network

Fatemeh Mahmoodi, Farsad Zamani Boroujeni *

Journal of Soft Computing and Information Technology, Volume:11 Issue: 4, 2023, PP 1 -18

On the one hand, the development of modern medicine has made it possible to store medical images, and on the other hand, due to the daily increase in the storage of such data, it has also made their management and recovery difficult. Considering that medical images are used as a powerful tool in the early diagnosis of most diseases, providing a powerful system that can retrieve images with similar content from the growing volume of medical images is very effective in control and treatment. In this article, a medical image retrieval system based on Siamese neural network consisting of two convolutional sub-networks with 13 layers is presented. To reach the optimal subset of deep features extracted by Siamese, the Minimum Redundancy-Maximum Relevant (mRMR) technique has been used, and after binary hashing of the features, similar images are retrieved using Hamming distance. Although the proposed model is capable of retrieving a variety of gray scale medical images, two types of lung images have been used to evaluate it, including CT scan images of Covid-19 patients in the CT-COV database and X-ray images of pneumonia patients in the Pneumonia database. The results indicate that the proposed method in the Covid database has been able to achieve an average precision of 93.83% and 92.73% in 5 and 10 retrieved images respectively, and an average precision of 100% in the pneumonia database, which is compared to previous methods have been able to improve the retrieval of lung images.

Keywords: Supervised hashing, convolutional neural network, Deep Learning, lung image retrieval, Minimal-Redundancy-Maximal-Relevance (MRMR)

Abstract View Paper Research/Original Article Original: Persian
Detection of COVID-19 Using a Pre-trained CNN Model Over Chest X-ray Images

Mohammadreza Behnia, Touba Torabipour, Safieh Siadat *

International Journal of Web Research, Volume:5 Issue: 2, Autumn-Winter 2022, PP 103 -112

Lung infection is the most dangerous sign of Covid 19. X-ray images are the most effective means of diagnosing this virus. In order to detect this disease, deep learning algorithms and machine vision are widely used by computer scientists. Convolutional neural networks (CNN), DenseNet121, Resnet50, and VGG16 were used in this study for the detection of Covid-19 in X-ray images. In the current study, 1341 chest radiographs from the COVID-19 dataset were used to detect COVID-19 including infected and Healthy classes using a modified pre-trained CNN (train and test accuracy of 99.75% and 99.63%, respectively). The DENSENET121 model has a training accuracy of 43.89% and a test accuracy of 57.89%, respectively. The train and test accuracy of ResNet-50 are, respectively, 89.43% and 90%. Additionally, the CNN model has test and train accuracy of 98.13% and 96.73%, respectively. The suggested model has COVID-19 detection accuracy that is at least 1% higher than all other models.

Keywords: convolutional neural network, Deep Learning, Chest X-Ray, COVID-19

Abstract View Paper Research/Original Article Original: English
توصیف محتوای تصاویر به صورت خودکار با بکارگیری شبکه عصبی کپسولی و روش تعبیه سازی ELMo

شیما جوانمردی، علی محمد لطیف*، محمدتقی صادقی

نشریه ماشین بینایی و پردازش تصویر، سال دهم شماره 1 (بهار 1402)، صص 75 -91

توصیف محتوای تصویر به صورت خودکار توسط ماشین یک مشکل چالش برانگیز در بینایی کامپیوتر است و هدف آن تولید توضیحات قابل درک توسط کامپیوتر برای تصاویر می باشد. استفاده از شبکه های عصبی پیچشی (CNN) نقش مهمی در زمینه توصیف تصاویر ایفا کرده است. با این حال، در طول فرآیند تولید توصیف های مرتبط با تصویر دو چالش عمده برای CNN وجود دارد که عبارتند از: عدم توجه آنها به روابط و ساختارهای سلسله مراتبی مکانی بین اشیای درون تصویر، و عدم ثبات در مقابل تغییرات چرخشی تصاویر. به منظور رفع این چالش ها این مقاله با بهره گیری از یک شبکه کپسولی بهبود یافته، به توصیف محتوای تصویر با استفاده از پردازش زبان طبیعی می پردازد. شبکه کپسولی با درنظر گرفتن موقعیت مکانی اشیای درون تصویر نسبت به یکدیگر، اطلاعات مربوط به محتوای تصویر را ارایه می دهد. یک کپسول شامل مجموعه ای از نورون ها است که پارامترهای وضعیت اشیا درون تصویر مانند سایز، جهت، مقیاس و روابط اشیا نسبت به هم را در نظر می گیرند. این کپسول ها تمرکز ویژه ای بر استخراج ویژگی های معنادار برای استفاده در فرآیند تولید توضیحات مرتبط برای مجموعه ای معین از تصاویر دارند. آزمایش های کیفی روی مجموعه داده های MS-COCO با بهره گیری از شبکه کپسولی و روش تعبیه سازی ELMo، منجر به بهبود 2 تا 5 درصدی معیارهای ارزیابی شده، در مقایسه با مدل های زیرنویس تصویر موجود شده است.

کلید واژگان: توصیف تصاویر، یادگیری عمیق، شبکه کپسولی، شبکه عصبی پیچشی، پردازش زبان طبیعی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Automatic image captioning using capsule neural network and ELMo embedding technique

Shima Javanmardi, Ali Mohammad Latif *, MohammadTaghi Sadeghi

Machine Vision and Image Processing, Volume:10 Issue: 1, 2022, PP 75 -91

Automatic image captioning is a challenging task in computer vision and aims to generate computer-understandable descriptions for images. Employing convolutional neural networks (CNN) has a key role in image caption generation. However, during the process of generating descriptions for an image, there are two major challenges for CNN, such as: they do not consider the relationships and spatial hierarchical structures between the objects in the image, and the lack of resistance against rotational changes of the images. In order to solve these challenges, this paper presents an improved capsule network to describe image content using natural language processing by considering the relations between the objects . A capsule contains a set of neurons that consider the parameters of the state of objects in the image, such as size, direction, scale, and relationships of objects to each other. These capsules have a special focus on extracting meaningful features for use in the process of generating relevant descriptions for a given set of images. Qualitative tests on the MS-COCO dataset using the capsule network and ELMo embedding technique have resulted in 2-5% improvement in the evaluated metrics compared to existing image captioning models.

Keywords: image captioning, Deep Learning, convolutional neural network, Natural Language Processing

Abstract View Paper Research/Original Article Original: Persian
استفاده از تحلیل احساسات و ترکیب روش های یادگیری ماشین برای تشخیص هرزنامه در توییتر

مهدی سالخورده حقیقی، امین الله کرمانی

نشریه فناوری اطلاعات و ارتباطات ایران، سال چهاردهم شماره 51 (بهار و تابستان 1401)، صص 129 -153

محبوبیت شبکه های اجتماعی بخصوص توییتر چالش جدیدی را روبروی محققان قرار داده است و آن چیزی نیست جز هرزنامه . روش های گوناگون زیادی برای مقابله با آنها ارایه شده است. بعضی از این روش ها اگرچه در ابتدا کارآمد بودند اما به مرور توسط تولید کنندگان هرزنامه دور زده شدند. در این تحقیق تلاش داریم با استفاده از یکی از جدیدترین روش های تشخیص هرزنامه و ترکیب آن با تحلیل احساسات دقت تشخیص هرزنامه را افزایش دهیم. ما با استفاده از روش تعبیه سازی، کلمات متن توییت را به عنوان ورودی به یک معماری شبکه عصبی پیچشی داده و خروجی تشخیص دهنده متن هرزنامه یا متن عادی خواهد بود. هم زمان با استخراج ویژگی های مناسب در شبکه توییتر و اعمال روش های یادگیری ماشین بر روی آنها تشخیص هرزنامه بودن توییت را بصورت مجزا محاسبه می کنیم. در نهایت خروجی هر دو روش را به یک شبکه پیچشی تلفیقی وارد می کنیم تا خروجی آن تشخیص نهایی هرزنامه یا نرمال بودن متن توییت را تعیین کند. ما در این تحقیق از دو مجموعه داده متعادل و نامتعادل استفاده می کنیم تا تاثیر مدل پیشنهادی را بر روی دو نوع داده بررسی کنیم. نتایج پژوهش نشان دهنده بهبود کارایی روش پیشنهادی در هر دو مجموعه داده می باشد.

کلید واژگان: توییتر، هرزنامه، تعبیه لغات، شبکه های عصبی پیچشی، تحلیل احساسات، CNN

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Using Sentiment Analysis and Combining Classifiers for Spam Detection in Twitter

mehdi salkhordeh haghighi, Aminolah Kermani

Journal of Information and Communication Technology, Volume:14 Issue: 51, 2022, PP 129 -153

The welcoming of social networks, especially Twitter, has posed a new challenge to researchers, and it is nothing but spam. Numerous different approaches to deal with spam are presented. In this study, we attempt to enhance the accuracy of spam detection by applying one of the latest spam detection techniques and its combination with sentiment analysis. Using the word embedding technique, we give the tweet text as input to a convolutional neural network (CNN) architecture, and the output will detect spam text or normal text. Simultaneously, by extracting the suitable features in the Twitter network and applying machine learning methods to them, we separately calculate the Tweeter spam detection. Eventually, we enter the output of both approaches into a Meta Classifier so that its output specifies the final spam detection or the normality of the tweet text. In this study, we employ both balanced and unbalanced datasets to examine the impact of the proposed model on two types of data. The results indicate an increase in the accuracy of the proposed method in both datasets.

Keywords: Spam Detection, Twitter, Word Embedding, Convolutional neural network, Deep learning, sentiment analysis, Ensemble Learning

Abstract View Paper Research/Original Article Original: Persian
تشخیص بیماری کووید-19 با ترکیب روش های دستی و یادگیری عمیق از روی تصاویر اولتراسوند

ویدا اسماعیلی، محمود محصل فقهی*

نشریه ماشین بینایی و پردازش تصویر، سال نهم شماره 4 (زمستان 1401)، صص 31 -41

بیماری کروناویروس 2019 یا کووید-19، یک بیماری حاد تنفسی با قدرت انتقال ویروس بالا است که موجب نرخ بالای مرگ و میر در سراسر جهان شده است. اگرچه تشخیص سریع این بیماری می تواند نقش حیاتی در بهبود بیمار داشته باشد، اما انجام آزمایش های رادیوگرافی توسط کادر درمان فرآیند زمان بری است. بنابراین، استفاده از تصاویر اولتراسوند و روش های یادگیری عمیق توصیه می شود. تکنیک اولتراسوند بدون اشعه است و می تواند در بخش های اطفال و مراقبت های ویژه برای بیماران خاص مورد استفاده قرار گیرد. با این حال، تصاویر آن دارای نویز است که عملکرد روش های یادگیری عمیق را تحت تاثیر قرار می دهد. به همین منظور، در این مقاله، ما روش الگوی باینری محلی یکنواخت را که در مقابل نویز مقاوم است با روش یادگیری عمیق ترکیب می کنیم. ابتدا الگوی باینری محلی یکنواخت بر روی دو صفحه ی زمانی محاسبه می شود تا ویژگی های مربوط به تظاهرات کووید-19 در تصاویر متوالی اولتراسوند استخراج شود و سپس، ماتریس به دست آمده به عنوان ورودی شبکه ی کانولوشنی داده می شود. با توجه به آزمایش های انجام شده، روش پیشنهادی عملکرد بهتری در مقایسه با سایر روش های پیشرفته دارد. نتایج نشان می دهد که دقت شناسایی کووید-19 از روی داده های اولتراسوند با استفاده از روش پیشنهادی 98.5 درصد است.

کلید واژگان: تشخیص کووید-19، تصاویر اولتراسوند، یادگیری عمیق

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Diagnosis of Covid-19 Disease by Combining Hand-crafted and Deep-learning Methods on Ultrasound Data

Vida Esmaeili, Mahmood Mohassel Feghhi *

Machine Vision and Image Processing, Volume:9 Issue: 4, 2022, PP 31 -41

Coronavirus 2019, or Covid-19, is an acute respiratory disease with high virus transmission capacity that has led to high mortality rates worldwide. Although rapid diagnosis can play an essential role in the patient's recovery, radiography by the treatment staff is a time-consuming process.Therefore, the use of ultrasound data and deep learning techniques are recommended. The ultrasound technique is radiation-free and can be used in pediatric wards and intensive care units for specific patients. However, its data have noise that affects the performance of deep learning methods. To this end, in this paper, we combine the uniform local noise-resistant binary pattern method with the deep learning method. First, a uniform local binary pattern is calculated on two temporal planes to extract the features of the Covid-19 manifestations in consecutive ultrasound data, and then, the resulting matrix is given as the input of the convolutional network. According to the performed experiments, the proposed method has better performance compared to that of the other state-of-art methods. The results show that the diagnosis accuracy of the Covid-19 from the ultrasound data using the proposed method is 98.5%.

Keywords: Covid-19 diagnosis, ultrasound data, convolutional neural network, Deep Learning

Abstract View Paper Research/Original Article Original: Persian
A Survey on Review Spam Detection Methods using Deep Learning Approach

Mahmoud Aliarab, Kazim Fouladi *

International Journal of Web Research, Volume:5 Issue: 1, Spring-Summer 2022, PP 19 -24

Review spam is an opinion written to promote or demote a product or brand on websites and other internet services by some users. Since it is not easy for humans to recognize these types of opinions, a model can be provided to detect them. In recent years, much research has been done to detect these types of reviews, and with the expansion of deep neural networks and the efficiency of these networks in various issues, in recent years, multiple types of deep neural networks have been used to identify spam reviews. This paper reviews the proposed deep learning methods for the problem of review spam detection. Challenges, evaluation criteria, and datasets in this area are also examined.

Keywords: Review Spam Detection, Opinion Spam, Deep Learning, convolutional neural network, Long Short-Term Memory (LSTM), Literature Survey

Abstract View Paper Research/Original Article Original: English
Persian SMS Spam Detection using Machine Learning and Deep Learning Techniques

Roya Khorashadizade, Somayyeh Jafarali Jassbi *, Alireza Yari

International Journal of Web Research, Volume:5 Issue: 1, Spring-Summer 2022, PP 56 -65

Spams are well-known examples of unsolicited text or messages which are sent by unknown individuals and cause issues for smartphone users. The inconvenience imposed on users, the loss of network traffic, the rise in the calculated cost, occupying more physical space on the mobile phone, and abusing and defrauding recipients are but a few of their downsides. Consequently, the automated identification of suspicious and spam messages is undoubtedly vitally important. Additionally, text messages which are smartly composed might be difficult to recognize. However, the present methodologies in this subject are hindered by the absence of adequate Persian datasets. A huge body of research and experiments has revealed that techniques based on deep and combined learning are superior at identifying unpleasant text messages. This work sought to develop an effective strategy for identifying SMS spam through utilizing combining machine learning classification algorithms together with deep learning models. After applying preprocessing on our gathered dataset, the suggested technique applies two convolutional neural network layers, the first of which being an LSTM layer, and the second one which is a fully connected layer to extract the data characteristics, thereby implementing the suggested deep learning approach. As part of the Machine Learning methodologies, the vector support machine makes use of the data and features at hand to determine the ultimate classification. Results indicate that the suggested model is implemented more effectively than the existing techniques, and an accuracy of 97.7% was achieved as a result.

Keywords: SMS Spam, spam detection, Support Vector Machine, convolutional neural network, LSTM

Abstract View Paper Research/Original Article Original: English
افزایش دقت بازشناسی صحنه های طبیعی پویا با استفاده از همبستگی بین نقشه های ویژگی در شبکه های عصبی پیچشی

صفورا حیدری، عباس ابراهیمی مقدم*، مرتضی خادمی، هادی هادی زاده

نشریه ماشین بینایی و پردازش تصویر، سال هشتم شماره 2 (تابستان 1400)، صص 43 -55

بازشناسی صحنه های پویا یکی از زمینه های تحقیقاتی اساسی در حوزه بینایی ماشین بشمار می رود. در این مقاله با استفاده از شبکه های عصبی پیچشی (CNN)، روشی موثر جهت بازشناسی صحنه های پویا ارایه می شود. در روش پیشنهادی، همبستگی بین نقشه های ویژگی حاصل از لایه های مختلف یک شبکه عصبی به عنوان بردار های ویژگی حاوی اطلاعات ویدیو، مورد استفاده قرار گرفته است. در این روش، ابتدا N فریم از ویدیو انتخاب شده و به کمک یک شبکه عصبی پیچشی، نقشه های ویژگی مربوط به فریم های منتخب، استخراج شده و برای هر فریم، یک ماتریس گرام محاسبه می شود که بیانگر ویژگی های مکانی فریم های ویدیو است. سپس با قطعه بندی زمانی فریم های منتخب و میانگین گیری بر روی ماتریس های گرام این فریم ها، اطلاعات زمانی نیز لحاظ می شود. با انجام عملیات کدینگ ویژگی ها و سپس pooling، برای هر ویدیو یک بردار ویژگی به منظور طبقه بندی ویدیو حاصل می شود. نتایج شبیه سازی ها بر روی سه مجموعه داده مطرح در این زمینه نشان می دهد که روش پیشنهادی از دقت بازشناسی بهتری در مقایسه با سایر روش های مطرح در این زمینه تحقیقاتی برخوردار بوده و دقت بازشناسی را تا 9% برای مجموعه داده Maryland و 3% برای مجموعه داده YUP++ بهبود بخشیده است.

کلید واژگان: بازشناسی صحنه های پویا، شبکه عصبی پیچشی، همبستگی نقشه های ویژگی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

Improving the Accuracy of Natural Dynamic Scenes Recognition using Correlation of Feature Maps in CNNs

Safoora Heidari, Abbas Ebrahimi Moghadam *, Morteza Khademi Doroh, Hadi Hadizadeh

Machine Vision and Image Processing, Volume:8 Issue: 2, 2021, PP 43 -55

Dynamic scene recognition is one of the fundamental research fields in machine vision. In this paper, an effective dynamic scene recognition method using convolutional neural networks is proposed. In the proposed method the correlation of feature maps of different layers in a neural network is exploited as a feature vector containing video information. Firstly, N frames of video are selected and fed into a network to exploit the feature maps, then a Gram matrix indicating the spatial information of the frames of video is calculated. Subsequently, using temporal slicing over selected frames and averaging over the Gram matrices of these frames, temporal information is considered. Encoding features followed by pooling operation, a feature vector is obtained for classification. Experimental evaluations on benchmark dynamic scene datasets demonstrate the effectiveness of the proposed method in comparison with the state-of-the-art methods in this research field and has improved the recognition accuracy about 9% for Maryland dataset and about 3% for YUP++ dataset.

Keywords: Dynamic Scene Recognition, convolutional neural network, Correlation of Feature Maps

Abstract View Paper Research/Original Article Original: Persian
A Deep Learning Approach for Diagnosis Chest Diseases

Touba Torabipour *, Yousef Jahangirigolshavari, Safieh Siadat

International Journal of Web Research, Volume:4 Issue: 1, Spring-Summer 2021, PP 10 -17

The human chest contains vital organs such as the heart, lungs, and other organs. Chest radiology is one of the best and least costly methods to diagnose chest diseases. In this study, proposed a new method to diagnose 14 main diseases of the chest such as (cardiomegaly, emphysema, effusion, hernia, nodule, pneumothorax, atelectasis, pleural - thickening, mass, edema, integration, penetration, fibrosis, pneumonia) using the neural network and deep learning to increase accuracy, sensitivity, and specificity. The proposed method is implemented in the form of a web application and is available as a decision-making system for physicians to diagnose chest diseases.The results of the simulation on the sample dataset showed that the diagnosis of chest diseases was 98.93%, indicating the high efficiency of the new method. Finally, the proposed method was compared with other deep learning architectures such as densenet121, vgg16, exception architecture on the same dataset, which showed a 5% higher accuracy than them.

Keywords: Deep Learning, convolutional neural network, Chest X-Ray

Abstract View Paper Research/Original Article Original: English
ارایه یک مدل هوشمند به منظور تشخیص چندوجهی شخصیت کاربران با استفاده از روش های یادگیری ژرف

حسین صدر*، فاطمه محدث دیلمی، مرتضی ترخان

نشریه فناوری اطلاعات و ارتباطات ایران، سال سیزدهم شماره 47 (بهار و تابستان 1400)، صص 72 -89

با توجه به رشد قابل‌توجه اطلاعات و داده‌های متنی که توسط انسان‌ها در شبکه‌های ‌مجازی تولید می‌شوند، نیاز به سیستم‌هایی است که بتوان به کمک آن‌ها به‌صورت خودکار به تحلیل داده‌ها پرداخت و اطلاعات مختلفی را از آن‌ها استخراج کرد. یکی از مهم‌ترین داده‌های متنی موجود در سطح وب دید‌گاه‌های افراد نسبت به یک موضوع مشخص است. متن‌های منتشرشده توسط کاربران در فضای مجازی می‌تواند معرف شخصیت آن‌ها باشد. الگوریتم‌های یادگیری ماشین می‌تواند انتخاب مناسبی برای تجزیه‌و‌تحلیل این‌گونه مسایل باشند، اما به‌منظور غلبه بر پیچیدگی و پراکندگی محتوایی و نحوی داده‌ها نیاز به الگوریتم‌های یادگیری ژرف بیش از پیش در این حوزه احساس می‌شود. در این راستا، هدف این مقاله به‌کارگیری الگوریتم‌های یادگیری ژرف به‌منظور دسته‌بندی متون برای پیش‌بینی شخصیت می‌باشد. برای رسیدن به این هدف، شبکه عصبی کانولوشنی با مدل آدابوست به‌منظور دسته‌بندی داده‌ها ترکیب گردید تا بتوان به کمک آن داده‌های آزمایشی که با خطا دسته‌بندی ‌شده‌اند را در مرحله دوم دسته‌بندی با اختصاص ضریب آلفا، با دقت بالاتری دسته‌بندی کرد. مدل پیشنهادی این مقاله روی دو مجموعه داده ایزیس و یوتیوب آزمایش شد و بر اساس نتایج بدست آمده مدل پیشنهادی از دقت بالاتری نسبت به سایر روش‌های موجود روی هر دو مجموعه داده برخودار است.

کلید واژگان: یادگیری ژرف، شبکه عصبی کانولوشنی، مدل ترکیب آدابوست، تشخیص شخصیت، داده های متنی

چکیده مشاهده متن مقاله پژوهشی/اصیل زبان: فارسی

An Intelligent Model for Multidimensional Personality Recognition of Users using Deep Learning Methods

Hossein Sadr *

Journal of Information and Communication Technology, Volume:13 Issue: 47, 2021, PP 72 -89

Due to the significant growth of textual information and data generated by humans on social networks, there is a need for systems that can automatically analyze the data and extract valuable information from them. One of the most important textual data is people's opinions about a particular topic that are expressed in the form of text. Text published by users on social networks can represent their personality. Although machine learning based methods can be considered as a good choice for analyzing these data, there is also a remarkable need for deep learning based methods to overcome the complexity and dispersion of content and syntax of textual data during the training process. In this regard, the purpose of this paper is to employ deep learning based methods for personality recognition. Accordingly, the convolutional neural network is combined with the Adaboost algorithm to consider the possibility of using the contribution of various filter lengths and gasp their potential in the final classification via combining various classifiers with respective filter sizes using AdaBoost. The proposed model was conducted on Essays and YouTube datasets. Based on the empirical results, the proposed model presented superior performance compared to other existing models on both datasets.

Keywords: Deep Learning, Convolutional Neural Network, Adaboost Combinational Model, Personality Recognition, Textual Data

Abstract View Paper Research/Original Article Original: Persian

نکته

نتایج بر اساس تاریخ انتشار مرتب شده‌اند.
کلیدواژه مورد نظر شما تنها در فیلد کلیدواژگان مقالات جستجو شده‌است. به منظور حذف نتایج غیر مرتبط، جستجو تنها در مقالات مجلاتی انجام شده که با مجله ماخذ هم موضوع هستند.
در صورتی که می‌خواهید جستجو را در همه موضوعات و با شرایط دیگر تکرار کنید به صفحه جستجوی پیشرفته مجلات مراجعه کنید.

به جمع مشترکان مگیران بپیوندید!

convolutional neural network