A Generalised Method for Adaptive Longitudinal Control Using Reinforcement Learning

Pathak, Shashank; Bag, Suvam; Nadkarni, Vijay

doi:10.1007/978-3-030-01370-7_37

Conference paper
First Online: 31 December 2018

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 867)

Abstract

Adaptive cruise control (ACC) calls for intelligent and adaptive methods for the longitudinal control of cars. For more than a decade, high-end cars have been equipped with ACC, typically through carefully designed model-based controllers. Unlike traditional ACC, we propose a reinforcement learning based approach, RL-ACC. We present RL-ACC and its experimental results from automotive-grade car simulators. The resulting controller requires minimal domain knowledge, is intuitive in its design, can accommodate uncertainties, can mimic human-like behaviour and may foster human trust in the automated system. All these aspects are crucial for a fully autonomous car, and we believe reinforcement learning based ACC is a step in that direction.

Notes

1.
The vehicle is equipped with both long- and short-range radars, with range limits of [80, 240] m and [0.2, 100] m respectively. Unlike problems such as parking, ACC does not require very close-range detection. Here, sensor fusion is used to homogenise the readings from the two radars.
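
The note does not say how the two radar readings are fused. Purely as an illustration, the sketch below shows one simple way such readings could be combined into a single distance estimate. The range limits come from the note; the function names and the averaging rule are assumptions made for this example, not the authors' method.

```python
from typing import Optional, Tuple

# Range limits in metres, taken from the note above.
SHORT_RANGE = (0.2, 100.0)
LONG_RANGE = (80.0, 240.0)


def in_range(reading: Optional[float], limits: Tuple[float, float]) -> bool:
    """Return True if a reading exists and lies within the sensor limits."""
    return reading is not None and limits[0] <= reading <= limits[1]


def fuse_distance(short_m: Optional[float], long_m: Optional[float]) -> Optional[float]:
    """Combine two radar distance readings into one estimate.

    Hypothetical rule (an assumption, not from the paper): readings outside
    a sensor's limits are discarded; if both remain valid they are averaged;
    if only one remains it is used; otherwise no estimate is returned.
    """
    short_ok = in_range(short_m, SHORT_RANGE)
    long_ok = in_range(long_m, LONG_RANGE)
    if short_ok and long_ok:
        return 0.5 * (short_m + long_m)
    if short_ok:
        return short_m
    if long_ok:
        return long_m
    return None
```

With this rule, a lead vehicle at 50 m is reported by the short-range radar alone, one at 150 m by the long-range radar alone, and one in the 80 to 100 m overlap by the mean of both. A production system would more likely weight each reading by its noise characteristics.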

Author information

Authors and Affiliations

Visteon Electronics GmbH, An der RaumFabrik 33b, 76227, Karlsruhe, Germany
Shashank Pathak
Visteon Corporation, 2901 Tasman Drive, Santa Clara, CA, 95054, USA
Suvam Bag & Vijay Nadkarni

Corresponding author

Correspondence to Shashank Pathak.

Editor information

Editors and Affiliations

Baden-Wuerttemberg Cooperative State University, Karlsruhe, Germany
Marcus Strand
Humanoids and Intelligence Systems Lab, KIT - Karlsruher Institut für Technologie, Karlsruhe, Germany
Rüdiger Dillmann
University of Padua, Padua, Italy
Emanuele Menegatti
University of Padua, Padua, Italy
Stefano Ghidoni

Cite this paper

Pathak, S., Bag, S., Nadkarni, V. (2019). A Generalised Method for Adaptive Longitudinal Control Using Reinforcement Learning. In: Strand, M., Dillmann, R., Menegatti, E., Ghidoni, S. (eds) Intelligent Autonomous Systems 15. IAS 2018. Advances in Intelligent Systems and Computing, vol 867. Springer, Cham. https://doi.org/10.1007/978-3-030-01370-7_37

DOI: https://doi.org/10.1007/978-3-030-01370-7_37
Published: 31 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01369-1
Online ISBN: 978-3-030-01370-7
eBook Packages: Intelligent Technologies and Robotics (R0)
