This work proposes a novel control methodology to achieve gait symmetry in trans-femoral amputated patients with prostheses. The proposed approach allows to overcome the limits of classical model-based control strategies by introducing a Deep Reinforcement Learning (DRL) method trained ad hoc for generating the velocity control signals fed into the active lower-limb robotic prosthesis. More specifically, the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm is used to concurrently learn a Q-function and the best policy. The proposal has the advantages of being model-free and capable of adapting to different walking velocities, just requiring few measurements and without the need to online re-tune the control parameters when the human motions change. The proposed model-free approach has been tested in a realistic scenario simulated in the CoppeliaSim environment relying on gait patterns retrieved experimentally by means of markers placed on a human subject.

Deep reinforcement learning of robotic prosthesis for gait symmetry in trans-femoral amputated patients

Incremona, Gian Paolo;
2021-01-01

Abstract

This work proposes a novel control methodology to achieve gait symmetry in trans-femoral amputated patients with prostheses. The proposed approach allows to overcome the limits of classical model-based control strategies by introducing a Deep Reinforcement Learning (DRL) method trained ad hoc for generating the velocity control signals fed into the active lower-limb robotic prosthesis. More specifically, the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm is used to concurrently learn a Q-function and the best policy. The proposal has the advantages of being model-free and capable of adapting to different walking velocities, just requiring few measurements and without the need to online re-tune the control parameters when the human motions change. The proposed model-free approach has been tested in a realistic scenario simulated in the CoppeliaSim environment relying on gait patterns retrieved experimentally by means of markers placed on a human subject.
2021
29th Mediterranean Conference on Control and Automation (MED) 2021
Reinforcement learning
Legged locomotion
File in questo prodotto:
File Dimensione Formato  
drl_prosthesis_MED21_original.pdf

Accesso riservato

Descrizione: Articolo principale
: Publisher’s version
Dimensione 6.01 MB
Formato Adobe PDF
6.01 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1182329
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 5
social impact