UBC Theses and Dissertations
Reinforcement learning for legged robot locomotion
Xie, Zhaoming
Abstract
Deep reinforcement learning (DRL) offers a promising approach for synthesizing control policies for legged robot locomotion. However, it remains challenging to learn policies that are robust enough to real-world uncertainty to deploy on physical robots, or that can handle complicated environments. In this thesis, we take several significant steps towards efficiently learning legged locomotion skills with DRL. First, we present a framework to learn feedback policies for the bipedal robot Cassie, utilizing rough motion sketches. An iterative design process is then proposed to refine, compress, and combine policies for effective sim-to-real transfer. Second, we explore the role of dynamics randomization on the quadrupedal robot Laikago. We demonstrate that with appropriate design choices, dynamics randomization is often not necessary for sim-to-real transfer, and we further analyze the situations in which randomization does become necessary. Third, we propose and analyze multiple curriculum learning approaches to solve challenging stepping-stone tasks for bipedal locomotion. We demonstrate that gradually increasing task difficulty can reliably train policies that traverse challenging stepping-stone sequences. Finally, we investigate the combination of reinforcement learning and model-based control by training quadrupedal policies using a centroidal model. [An errata to this thesis/dissertation was made available on 2022-02-09.]
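The curriculum idea summarized above (gradually increasing task difficulty for stepping-stone locomotion) can be illustrated with a minimal sketch. This is not the thesis's actual code: it assumes a hypothetical stepping-stone environment whose stone gaps and height offsets scale with a scalar difficulty, and the names `SteppingStoneCurriculum`, `train_one_iteration`, and `env` are illustrative placeholders.

```python
# Minimal sketch of a success-rate-gated curriculum for stepping-stone tasks.
# Assumes episode difficulty can be set via stone gap / height-offset ranges.
import numpy as np


class SteppingStoneCurriculum:
    def __init__(self, min_difficulty=0.0, max_difficulty=1.0, step=0.05,
                 promote_threshold=0.8):
        self.difficulty = min_difficulty
        self.max_difficulty = max_difficulty
        self.step = step
        self.promote_threshold = promote_threshold  # success rate needed to advance

    def sample_task(self, rng):
        # Larger difficulty -> wider gaps and larger height offsets between stones.
        gap = 0.3 + 0.5 * self.difficulty * rng.uniform(0.5, 1.0)   # metres (illustrative)
        height = 0.2 * self.difficulty * rng.uniform(-1.0, 1.0)     # metres (illustrative)
        return {"gap": gap, "height_offset": height}

    def update(self, success_rate):
        # Advance only once the policy is reliable at the current difficulty;
        # otherwise keep training at the same level.
        if success_rate >= self.promote_threshold:
            self.difficulty = min(self.difficulty + self.step, self.max_difficulty)


# Hypothetical usage inside a training loop:
# curriculum = SteppingStoneCurriculum()
# rng = np.random.default_rng(0)
# for _ in range(num_iterations):
#     task = curriculum.sample_task(rng)
#     success_rate = train_one_iteration(policy, env, task)  # placeholder training step
#     curriculum.update(success_rate)
```

Gating promotion on a success-rate threshold is one common way to keep the policy training at a level it can still partially solve, which is the general intuition behind the difficulty-ramping curricula described in the abstract.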
Item Metadata
Title | Reinforcement learning for legged robot locomotion
Creator | Xie, Zhaoming
Supervisor |
Publisher | University of British Columbia
Date Issued | 2021
Description | (same as Abstract above)
Genre |
Type |
Language | eng
Date Available | 2021-12-07
Provider | Vancouver : University of British Columbia Library
Rights | Attribution-NonCommercial-NoDerivatives 4.0 International
DOI | 10.14288/1.0404507
URI |
Degree |
Program |
Affiliation |
Degree Grantor | University of British Columbia
Graduation Date | 2022-05
Campus |
Scholarly Level | Graduate
Rights URI |
Aggregated Source Repository | DSpace