Abstract

Predicting the future location of vehicles is essential for safety-critical applications such as advanced driver assistance systems (ADAS) and autonomous driving. This paper introduces a novel approach to simultaneously predict both the location and scale of target vehicles in the first-person (egocentric) view of an ego-vehicle. We present a multi-stream recurrent neural network (RNN) encoder-decoder model that separately captures both object location and scale and pixel-level observations for future vehicle localization. We show that incorporating dense optical flow improves prediction results significantly since it captures information about motion as well as appearance change. We also find that explicitly modeling future motion of the ego-vehicle improves the prediction accuracy, which could be especially beneficial in intelligent and automated vehicles that have motion planning capability. To evaluate the performance of our approach, we present a new dataset of first-person videos collected from a variety of scenarios at road intersections, which are particularly challenging moments for prediction because vehicle trajectories are diverse and dynamic.

Comment: To appear on ICRA 2019


Original document

The different versions of the original document can be found in:

http://dx.doi.org/10.1109/icra.2019.8794474
https://ui.adsabs.harvard.edu/abs/2018arXiv180907408Y/abstract,
https://academic.microsoft.com/#/detail/2968393007
Back to Top

Document information

Published on 01/01/2018

Volume 2018, 2018
DOI: 10.1109/icra.2019.8794474
Licence: CC BY-NC-SA license

Document Score

0

Views 3
Recommendations 0

Share this document

claim authorship

Are you one of the authors of this document?