Travel time prediction with machine learning: competition of linear regression, multivariate regression, random forest and deep neural network
Accurate travel time prediction is one of the important issues in the field of traffic and transportation that can significantly affect the daily life of people and organizations. In this research, four different machine learning methods including linear regression, multivariate regression, random forest and deep artificial neural network were trained to predict travel time. The purpose of this research is to predict travel time for use in intelligent traffic systems and to use and compare several new methods, including deep neural network and random forest regression, as well as considering new parameters in the computations such as weather conditions, traffic flow, travel time, and accidents and the traffic locking points compared to other studies are the innovation and comprehensiveness of this study compared to other studies. In the design and implementation of this research, real traffic data taken from Google map was used and analyzed. This data includes information such as traffic conditions, season, time of day, weather conditions, and route characteristics. The results of this research show that the deep neural network (DNN) model with R2 equal to 0.833 has a very good performance among the investigated models. This model explains 0.833% of the variance of the data and the distribution of the residuals in it is relatively central with a mean of zero and a distribution close to normal. The linear regression model with R2 equal to 0.615 has a poorer performance than DNN and explains 0.615% of the data variance. But the random regression model with R2 equal to 0.955 has one of the best performances in competition with DNN and explains 0.955% of the data variance. MSE and RMSE parameters were also used to evaluate the performance of the models, and as a result, a multidimensional comparison was made between the models, and the random forest model resulted in the lowest error values. Since in the collected traffic data, traffic accidents and consequently traffic locking points are also used in the models, and considering that the random forest model is more effectively adapted to the data despite the presence of noise and anomaly, the R2 value of this model is higher than R2 of Deep neural networks, due to the overfitting nature of Deep Learning methods.
-
Comparison of Linear and Nonlinear Regression Models based on Machine Learning for Estimating Chlorophyll-a values in Coastal Waters of Qeshm and Hormoz Island
Mitra Naimi, *, Mohammadseddiq Mortazavi, Seyedeh Laili Mohebbi-Nozar, Mojtaba Ezam
Hydrophysics Journal, -
Tracking Air Pollution Changes Using Sentinel-5P Data in Khuzestan.
Behnaz Ghaderi, Payam Alemi Safaval, Mohammadreza Tabesh, *
Desert, Summer -Autumn 2024 -
Worn Tissue Renovation Planning Using Volunteered Geographic Information (VGI)
Seyed Mohamad Ebrahim Mousavi, Mohammad Hasan Vahidnia *
Amayesh Journal, -
Use of UAVs and Lidar to identify the spatiotemporal elevation changes of AlamKooh Glacier
Sara Sheshangosht, Hossein Agamohammadi *, Nematollah Karimi, , Mohammadhassan Vahidnia
Journal of of Geographical Data (SEPEHR), -
Investigation of Spatio-Temporal Patterns of Suburban Traffic Accidents in Isfahan Province in GIS Environment
M. Rahmati, H. Aghamohammadi *, S. Behzadi, A.A. Alesheikh
Journal of Remote Sensing and Geoinformation Research, -
Evaluation of failure risk in the sewerage system using Bayesian network and spatial multi-criteria decision making
Seyed Morteza Ghoreishi, MohammadHassan Vahidnia, Aminreza Neshat
Journal of GIS & RS Application in Planing,