31/03/2017
The prediction of the destination location at the time of pickup is an important problem with potential for substantial impact on the efficiency of a GPS enabled taxi service. While this problem has been explored earlier in the batch data set-up, we propose in this paper new solutions in the streaming data set-up. We examine four incremental learning methods using a Damped window model namely, Multivariate multiple regression, spherical-spherical regression, Randomized spherical K-NN regression and an Ensemble of these methods for their effectiveness in solving the destination prediction problem. The performance of these methods on several large datasets are evaluated using suitably chosen metrics and they were also compared with some other existing methods. The Multivariate multiple regression method and the Ensemble of the three methods are found to be the two best performers. The next pickup location problem is also considered and the aforementioned methods are examined for their suitability using real world datasets. As in the case of destination prediction problem, here
also we find that the Multivariate multiple regression method and the
Ensemble of the three methods gives better performance than the rest.