Machine Learning Models for Housing Prices Forecasting using Registration Data

Author(s):

Mehdi Farahzadi, Rahman Farnoosh*, MohammadHassan Behzadi

Article Type:

Research/Original Article (holds a valid ranking)

Abstract:

This article aims to identify the housing price forecasting model with the highest accuracy and lowest error among several machine learning methods. Five widely used machine learning algorithms are applied to predict housing prices: K-Nearest Neighbor Regression (KNNR), Support Vector Regression (SVR), Random Forest Regression (RFR), Extreme Gradient Boosting Regression (XGBR), and the Long Short-Term Memory neural network (LSTM). The study uses data from the Statistics Center of Iran on purchases and sales of residential units in Tehran from 2014 to 2020, comprising 998,299 transactions and 11 features. Preprocessing steps, including handling of missing data, conversion of categorical data, and normalization, are applied to the housing data set to obtain a clean final data set. K-fold cross-validation is used to split the data into training and test sets because it is simple, effective, and widely accepted. The models are compared on several evaluation criteria (MSE, RMSE, MAE, ME, and R²) to identify the best one. Across all evaluation criteria and all K-fold subsets, the Extreme Gradient Boosting Regression model is the most stable and the best performing.
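
The evaluation protocol described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the data below are synthetic stand-ins for the Tehran registration data, only a simple nearest neighbor regressor is scored (the other four models are omitted), the function names are hypothetical, and ME is assumed here to mean the maximum absolute error, which the abstract does not define.

```python
import math
import random


def k_fold_splits(n_samples, k, seed=0):
    """Yield (train_idx, test_idx) pairs; each sample is tested exactly once."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test_idx = folds[i]
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx


def regression_metrics(y_true, y_pred):
    """Return MSE, RMSE, MAE, ME and R2 for paired lists of numbers."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mse = sum(e * e for e in errors) / n
    mae = sum(abs(e) for e in errors) / n
    mean_y = sum(y_true) / n
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return {
        "MSE": mse,
        "RMSE": math.sqrt(mse),
        "MAE": mae,
        # Assumption: ME is read as the maximum absolute error.
        "ME": max(abs(e) for e in errors),
        "R2": 1.0 - (mse * n) / ss_tot,
    }


def knn_predict(train_X, train_y, x, k=3):
    """Nearest neighbor regression: average the targets of the k closest rows."""
    nearest = sorted(range(len(train_X)), key=lambda i: math.dist(train_X[i], x))[:k]
    return sum(train_y[i] for i in nearest) / len(nearest)


def cross_validate(X, y, n_folds=5):
    """Score the KNN regressor on every fold and return the per-fold metrics."""
    results = []
    for train_idx, test_idx in k_fold_splits(len(X), n_folds):
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        preds = [knn_predict(train_X, train_y, X[i]) for i in test_idx]
        results.append(regression_metrics([y[i] for i in test_idx], preds))
    return results


# Synthetic stand-in data (hypothetical features: floor area and building age).
rng = random.Random(42)
X = [(rng.uniform(40, 200), rng.uniform(0, 30)) for _ in range(200)]
y = [3.0 * area - 1.5 * age + rng.gauss(0, 5) for area, age in X]

fold_metrics = cross_validate(X, y)
for fold, m in enumerate(fold_metrics, start=1):
    print(fold, {name: round(value, 3) for name, value in m.items()})
```

Reporting the metrics per fold, rather than only their average, is what allows a claim about stability: a model whose scores vary little from fold to fold is less sensitive to the particular train/test split.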

Keywords:

Housing price forecasting, nearest neighbor regression, random forest regression, support vector regression, long short-term memory neural network, extreme gradient boosting regression

Language:

English

Published:

Journal of Statistical Research of Iran, Volume 17, Issue 1, Winter and Spring 2020

Pages:

191 to 214

https://www.magiran.com/p2542444

Journal of Statistical Research of Iran

Biannual basic sciences journal published in English

ISSN: 1735-1294 eISSN: 2538-5763

Publication of this journal has ceased.

License holder:

Statistical Research and Training Center

Managing director:

Dr. Hamidreza Navvabpour

Editor-in-chief:

Dr. Mojtaba Ganjali

Journal phone: 021-88630440
