Autonomous driving has became one of the most hot trends in artificial intelligence area in recent years thanks to the machine learning algorithms. However, most of the autonomous driving studies are still limited to discrete action space. In this study, we propose to implement Deep Deterministic Policy Gradient algorithm for learning driving behavior over the continuous actions. For this purpose, a driving simulator is employed which interfaces with IPG CarMker software where the virtual environment and dynamical vehicle model can be built.”Human-in-the-loop” is performed in order to gather the data and a neural network which is implemented in Behavior Layer is trained to recognize two different scenarios-forward driving and stop. Based on the scenario the agent is dealing with, the actions are learnt and suggested from the DDPG algorithm. The experimental results show that DDPG algorithm is able to learn the optimal policy with continuous actions reliably for both scenarios.

Autonomous vehicle driving via deep deterministic policy gradient

Braghin F.;Arrigoni S.
2019

Abstract

Autonomous driving has became one of the most hot trends in artificial intelligence area in recent years thanks to the machine learning algorithms. However, most of the autonomous driving studies are still limited to discrete action space. In this study, we propose to implement Deep Deterministic Policy Gradient algorithm for learning driving behavior over the continuous actions. For this purpose, a driving simulator is employed which interfaces with IPG CarMker software where the virtual environment and dynamical vehicle model can be built.”Human-in-the-loop” is performed in order to gather the data and a neural network which is implemented in Behavior Layer is trained to recognize two different scenarios-forward driving and stop. Based on the scenario the agent is dealing with, the actions are learnt and suggested from the DDPG algorithm. The experimental results show that DDPG algorithm is able to learn the optimal policy with continuous actions reliably for both scenarios.
Proceedings of the ASME Design Engineering Technical Conference
978-0-7918-5921-6
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11311/1128128
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 0
social impact