skip to main content
Primo Search
Search in: Busca Geral

End-to-End Visual Obstacle Avoidance for a Robotic Manipulator using Deep Reinforcement Learning

Sanches, Felipe Padula

Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Ciências Matemáticas e de Computação 2021-06-28

Acesso online

  • Título:
    End-to-End Visual Obstacle Avoidance for a Robotic Manipulator using Deep Reinforcement Learning
  • Autor: Sanches, Felipe Padula
  • Orientador: Romero, Roseli Aparecida Francelin
  • Assuntos: Aprendizado Por Reforço Profundo; Visão Robótica; Manipuladores Robóticos; Desvio De Obstáculos; Controle De Movimento; Deep Reinforcement Learning; Obstacle Avoidance; Robot Manipulators; Robot Vision; Motion Control
  • Notas: Dissertação (Mestrado)
  • Descrição: Recent changes in industrial paradigms enforce that robots must be intelligent and capable of decision-making. Robotic manipulators need to satisfy many requirements for operating properly. Perhaps the most fundamental one is the capability of operating in its environment without collisions. In this work, we perform visual obstacle avoidance on goal-reaching tasks of a robotic manipulator using an end-to-end Deep Reinforcement Learning model. The motion control policy is responsible for reaching a target position while at the same time avoiding an obstacle positioned randomly in the scene. This policy uses vision and proprioceptive sensor data to operate. We train the reinforcement learning agent using Twin-Delayed DDPG (TD3) algorithm in a simulated environment, utilizing the Unity game engine and the ML-Agents toolkit. Experiments demonstrate that the agent can successfully learn a meaningful policy to avoid obstacles using images.
  • DOI: 10.11606/D.55.2021.tde-30082021-100712
  • Editor: Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Ciências Matemáticas e de Computação
  • Data de criação/publicação: 2021-06-28
  • Formato: Adobe PDF
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.