RE.PUBLIC@POLIMI Research Publications of Politecnico di Milano

Autonomous vehicle driving via deep deterministic policy gradient

Huang W.; Braghin F.; Arrigoni S.

2019-01-01

Abstract

Autonomous driving has become one of the hottest trends in artificial intelligence in recent years, thanks to machine learning algorithms. However, most autonomous driving studies are still limited to discrete action spaces. In this study, we propose to implement the Deep Deterministic Policy Gradient (DDPG) algorithm for learning driving behavior over continuous actions. For this purpose, a driving simulator is employed that interfaces with the IPG CarMaker software, in which the virtual environment and the dynamical vehicle model can be built. A "human-in-the-loop" procedure is performed to gather data, and a neural network implemented in the Behavior Layer is trained to recognize two different scenarios: forward driving and stop. Based on the scenario the agent is dealing with, the actions are learnt and suggested by the DDPG algorithm. The experimental results show that the DDPG algorithm is able to learn the optimal policy with continuous actions reliably for both scenarios.

Year of publication: 2019
Book title: Proceedings of the ASME Design Engineering Technical Conference
ISBN (International Standard Book Number): 978-0-7918-5921-6
Appears in types: 04.1 Conference proceedings contribution

Files in this item:

There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11311/1128128
