Travel time prediction with machine learning: competition of linear regression, multivariate regression, random forest and deep neural network

Message:
Article Type:
Research/Original Article (دارای رتبه معتبر)
Abstract:

Accurate travel time prediction is one of the important issues in the field of traffic and transportation that can significantly affect the daily life of people and organizations. In this research, four different machine learning methods including linear regression, multivariate regression, random forest and deep artificial neural network were trained to predict travel time. The purpose of this research is to predict travel time for use in intelligent traffic systems and to use and compare several new methods, including deep neural network and random forest regression, as well as considering new parameters in the computations such as weather conditions, traffic flow, travel time, and accidents and the traffic locking points compared to other studies are the innovation and comprehensiveness of this study compared to other studies. In the design and implementation of this research, real traffic data taken from Google map was used and analyzed. This data includes information such as traffic conditions, season, time of day, weather conditions, and route characteristics. The results of this research show that the deep neural network (DNN) model with R2 equal to 0.833 has a very good performance among the investigated models. This model explains 0.833% of the variance of the data and the distribution of the residuals in it is relatively central with a mean of zero and a distribution close to normal. The linear regression model with R2 equal to 0.615 has a poorer performance than DNN and explains 0.615% of the data variance. But the random regression model with R2 equal to 0.955 has one of the best performances in competition with DNN and explains 0.955% of the data variance. MSE and RMSE parameters were also used to evaluate the performance of the models, and as a result, a multidimensional comparison was made between the models, and the random forest model resulted in the lowest error values. Since in the collected traffic data, traffic accidents and consequently traffic locking points are also used in the models, and considering that the random forest model is more effectively adapted to the data despite the presence of noise and anomaly, the R2 value of this model is higher than R2 of Deep neural networks, due to the overfitting nature of Deep Learning methods.

Language:
Persian
Published:
Journal of Geomatics Science and Technology, Volume:14 Issue: 2, 2024
Pages:
1 to 18
https://www.magiran.com/p2824783  
سامانه نویسندگان
  • Aghamohammadi، Hossein
    Corresponding Author (2)
    Aghamohammadi, Hossein
    Assistant Professor remote sensing and GIS, Science And Research Branch, Islamic Azad University, تهران, Iran
  • Azizi، Zahra
    Author (4)
    Azizi, Zahra
    Assistant Professor Remote Sensing, Science And Research Branch, Islamic Azad University, تهران, Iran
اطلاعات نویسنده(گان) توسط ایشان ثبت و تکمیل شده‌است. برای مشاهده مشخصات و فهرست همه مطالب، صفحه رزومه را ببینید.
مقالات دیگری از این نویسنده (گان)