Robust Speech Recognition using Long Short Term Memory Networks and Bottleneck Features

Author(s):

Amin Moaven Joula , Ahmad Akbari* , Babak Naser Sharif

Message:

Article Type:

Research/Original Article (دارای رتبه معتبر)

Abstract:

Deep neural networks have been widely used in speech recognition systems in recent years. However, the robustness of these models in the presence of environmental noise has been less discussed. In this paper, we propose two approaches for the robustness of deep neural networks models against environmental additive noise. In the first approach, we propose to increase the robustness of long short-term memory (LSTM) networks in the presence of noise based on their abilities in learning long-term noise behavior. For this purpose, we propose to use noisy speech for training models. In this way, LSTMs are trained in a noise-aware manner. The results on the noisy TIMIT dataset show that if the models are trained with noisy speech rather than clean speech, recognition accuracy will be improved up to 18%. In the second approach, we propose to reduce noise effects on the extracted features using a denoised autoencoder network and to use the bottleneck features to compress the feature vector and represent the higher level of input features. This method increases the accuracy of the proposed recognition system in the first approach by 4% in the presence of noise.

Keywords:

Speech recognition , Noise robustness , Multicondition data , Autoencoder network , Long short term memory network

Language:

Persian

Published:

Journal of Electrical Engineering, Volume:49 Issue: 3, 2020

Pages:

1333 to 1343

magiran.com/p2071667

دانلود و مطالعه متن این مقاله با یکی از روشهای زیر امکان پذیر است:

اشتراک شخصی

با عضویت و پرداخت آنلاین حق اشتراک یک‌ساله به مبلغ 1,390,000ريال می‌توانید 70 عنوان مطلب دانلود کنید!

اشتراک سازمانی

به کتابخانه دانشگاه یا محل کار خود پیشنهاد کنید تا اشتراک سازمانی این پایگاه را برای دسترسی نامحدود همه کاربران به متن مطالب تهیه نمایند!

اطلاعات بیشتر

توجه!

حق عضویت دریافتی صرف حمایت از نشریات عضو و نگهداری، تکمیل و توسعه مگیران می‌شود.
پرداخت حق اشتراک و دانلود مقالات اجازه بازنشر آن در سایر رسانه‌های چاپی و دیجیتال را به کاربر نمی‌دهد.

In order to view content subscription is required

Personal subscription

Subscribe magiran.com for 70 € euros via PayPal and download 70 articles during a year.

Organization subscription

Please contact us to subscribe your university or library for unlimited access!

More information

علمی مصوب

نشریه مهندسی برق

Journal of Electrical Engineering

فصلنامه فنی مهندسی به زبان فارسی، انگلیسی

آخرین شماره | آرشیو

ISSN: 2008-7799 eISSN: 2538-3051

برای مشاهد آرشیو به نشریه «دانشکده فنی تبریز» مراجعه نمایید.

صاحب امتیاز:

رئیس دانشکده مهندسی برق و کامپیوتر

مدیر مسئول:

دکتر میرهادی سیدعربی

سردبیر:

دکتر ابراهیم بابایی

تلفن نشریه: ۰۴۱-۳۳۳۰۰۸۳۲

اطلاعات بیشتر نشریه

درباره نشریه پیام به نشریه سایت اختصاصی نشریه پذیرش الکترونیکی مقاله

به جمع مشترکان مگیران بپیوندید!

Robust Speech Recognition using Long Short Term Memory Networks and Bottleneck Features

Amin Moaven Joula , Ahmad Akbari* , Babak Naser Sharif

Speech recognition , Noise robustness , Multicondition data , Autoencoder network , Long short term memory network

نشریه مهندسی برق

Journal of Electrical Engineering