Design of an intelligent system based on a computational cognitive model using the attention network task
Speech is one of the most effective ways to exchange information. Beyond the words and grammatical content, a speaker's voice carries additional information such as age, gender, and emotional state. Many studies have approached the emotional content of speech from various angles, and they show that emotion in speech is dynamic in nature; this dynamic character makes it difficult to extract the emotion hidden in an utterance. This study evaluates the implicit emotion in a message through emotional speech processing, using Mel-Frequency Cepstral Coefficient (MFCC) and Short-Time Fourier Transform (STFT) features.
The input data come from the Berlin Emotional Speech Database, which covers seven emotional states: anger, boredom, disgust, anxiety/fear, happiness, sadness, and a neutral state. MATLAB is used to read the audio files of the database, after which the MFCC and STFT features are extracted. The feature vector for each method is built from seven statistical values: minimum, maximum, mean, standard deviation, median, skewness, and kurtosis. These vectors are then used as input to an Artificial Neural Network, and the emotional states are recognized by training the network with functions based on different algorithms.
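The feature-construction step described above (seven summary statistics computed over a time-frequency representation) can be sketched as follows. This is a minimal Python illustration using SciPy, not the authors' MATLAB code; the sampling rate, window length, and the synthetic test tone standing in for a Berlin-database utterance are all assumptions, and the statistics are pooled over all STFT magnitude values.

```python
import numpy as np
from scipy.signal import stft
from scipy.stats import skew, kurtosis

def stft_feature_vector(signal, fs=16000, nperseg=512):
    """Seven summary statistics (min, max, mean, std, median,
    skewness, kurtosis) over the STFT magnitude spectrogram."""
    _, _, Z = stft(signal, fs=fs, nperseg=nperseg)  # Z: freq x time complex STFT
    mag = np.abs(Z).ravel()                         # pool all magnitude values
    return np.array([mag.min(), mag.max(), mag.mean(), mag.std(),
                     np.median(mag), skew(mag), kurtosis(mag)])

# Hypothetical input: a 1-second 440 Hz tone instead of a real utterance.
t = np.linspace(0, 1, 16000, endpoint=False)
sig = np.sin(2 * np.pi * 440 * t)
fv = stft_feature_vector(sig)
print(fv.shape)  # a 7-element feature vector, one value per statistic
```

A vector built this way (one per recording, per feature type) would then serve as the input pattern for the neural-network classifier; an analogous vector can be computed from MFCC frames instead of STFT magnitudes.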
The results show that the average recognition accuracy obtained with STFT features is higher and more robust than that obtained with MFCC features. Among the emotions, anger and sadness are recognized at the highest rates.
In conclusion, STFT features proved better suited than MFCC features for extracting the implicit emotion in speech.