Developing a Reinforcement Learning Algorithm to Model Pavlovian Approach Bias on Bidirectional Planning

Author(s):

Reza Kakooee , MohammadTaghi Hamidi Beheshti* , Mehdi Keramati

Article Type:

Research/Original Article (accredited ranking)

Abstract:

Introduction

Decision-making in the human brain is controlled by two mechanisms: the Pavlovian and instrumental learning systems. The Pavlovian system learns stimulus-outcome associations independently of action, a process that manifests as a tendency to approach reward-associated stimuli. The instrumental controller, on the other hand, learns action-outcome associations. Instrumental learning is not limited to the outcome of the current action; it may also evaluate a sequence of future actions through forward planning. Forward planning may not be the only planning process available to instrumental learning, however: humans may also use backward planning to evaluate action sequences, although backward planning has received less attention so far. Previous research has shown that, although the Pavlovian and instrumental systems learn independently, they interact: the Pavlovian approach tendency biases forward planning, leading to decisions that may not be optimal from the instrumental learning perspective. The effect of Pavlovian learning on backward planning has not yet been studied.

Materials and Methods

We designed a navigation experiment that allows forward, backward, and bidirectional planning to be investigated. We also embedded Pavlovian approach cues in the maps to examine how they bias each of the three forms of planning.
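The abstract does not describe the planning procedure itself. As a generic illustration of what bidirectional planning means on a navigation map, the following minimal sketch runs a breadth-first search from the start (forward) and from the goal (backward) and stops when the two searches meet. The grid encoding, the function names `neighbors` and `bidirectional_plan`, and the use of breadth-first search are all assumptions made for illustration; they are not taken from the paper.

```python
from collections import deque


def neighbors(cell, grid):
    """Yield the walkable 4-connected neighbours of `cell` ('#' marks a wall)."""
    r, c = cell
    for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        nr, nc = r + dr, c + dc
        if 0 <= nr < len(grid) and 0 <= nc < len(grid[0]) and grid[nr][nc] != "#":
            yield (nr, nc)


def bidirectional_plan(grid, start, goal):
    """Return the shortest number of steps from start to goal, or None."""
    if start == goal:
        return 0
    # Distances discovered so far by the forward and backward searches.
    dist = {"fwd": {start: 0}, "bwd": {goal: 0}}
    frontier = {"fwd": deque([start]), "bwd": deque([goal])}
    while frontier["fwd"] and frontier["bwd"]:
        # Expand one full level of whichever search has the smaller frontier.
        side = "fwd" if len(frontier["fwd"]) <= len(frontier["bwd"]) else "bwd"
        other = "bwd" if side == "fwd" else "fwd"
        for _ in range(len(frontier[side])):
            cell = frontier[side].popleft()
            for nxt in neighbors(cell, grid):
                if nxt in dist[side]:
                    continue
                dist[side][nxt] = dist[side][cell] + 1
                if nxt in dist[other]:
                    # The two searches met: join the two partial plans.
                    return dist[side][nxt] + dist[other][nxt]
                frontier[side].append(nxt)
    return None


# Hypothetical 3x3 map: start in the top-left corner, goal in the bottom-right.
demo_grid = ["S..",
             ".#.",
             "..G"]
steps = bidirectional_plan(demo_grid, (0, 0), (2, 2))
```

Restricting the loop to the `"fwd"` side alone gives plain forward planning, and restricting it to `"bwd"` gives backward planning, which is how a single map can probe all three forms.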

Results

Statistical analysis of the collected data indicates the existence of backward planning and shows that the Pavlovian approach cues bias planning. This bias is stronger in forward planning than in backward planning, and stronger still in bidirectional planning. Within a reinforcement learning framework, we developed a bidirectional planning algorithm that operates under the Pavlovian approach tendency.

Conclusion

The simulation results are consistent with the experimental results and indicate that the effect of Pavlovian bias can be modeled as pruning of decision trees.
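One possible reading of "pruning of decision trees" can be sketched as follows: when a cue-marked branch is available at a decision point, the planner drops the unmarked branches before evaluating them. This is a hypothetical, deterministic toy and not the authors' algorithm; the tree encoding, the `cue` flag, and the function `plan_value` are assumptions introduced only for this example.

```python
def plan_value(node, prune_non_approach=False):
    """Best cumulative reward reachable from `node` by exhaustive tree search.

    With `prune_non_approach=True`, any decision point that offers at least
    one cue-marked child keeps only the cue-marked children (assumed model
    of the Pavlovian approach bias).
    """
    children = node.get("children", {})
    if not children:
        return node["reward"]
    if prune_non_approach:
        approach = {a: ch for a, ch in children.items() if ch.get("cue", False)}
        if approach:
            children = approach  # unmarked branches are never evaluated
    return node["reward"] + max(
        plan_value(child, prune_non_approach) for child in children.values()
    )


# Hypothetical two-step task: "left" leads to a cue-marked state with a small
# reward, while "right" leads to an unmarked state followed by a larger reward.
tree = {
    "reward": 0,
    "children": {
        "left": {"reward": 1, "cue": True},
        "right": {"reward": 0, "children": {"on": {"reward": 5}}},
    },
}

unbiased_value = plan_value(tree)
biased_value = plan_value(tree, prune_non_approach=True)
```

In this toy tree the unpruned planner finds the larger delayed reward, while the pruned planner settles for the cue-marked branch. That matches the kind of instrumentally suboptimal choice the introduction describes.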

Keywords:

Decision Making; Strategic Planning; Conditioning, Operant; Computer Simulation

Language:

Persian

Published:

The Neuroscience Journal of Shefaye Khatam, Volume 9, Issue 4, 2022

Pages:

51 to 59

https://www.magiran.com/p2399705

Approved scientific journal

The Neuroscience Journal of Shefaye Khatam

Medical quarterly published in Persian and English

ISSN: 1887-2322

License holder:

Shefa Neuroscience Research Center, Khatam Alanbia Hospital, Tehran

Managing director:

Dr. Hadi Kazemi

Editor-in-chief:

Dr. Ali Gorji

Journal phone: 021-83554911
