HALGAN

Introduction

This repository contains code for the NeurIPS 2019 paper "Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs". Link to the paper: https://arxiv.org/pdf/1901.11529.pdf

A Hallucinatory GAN (HALGAN) inserts a visual goal into failed trajectories during off-policy reinforcement learning. A corresponding reward is also hallucinated during batched sampling whenever the sampled transition completes the task with respect to the hallucinated goal. The combination of visual hallucinations and more frequent encounters with real and hallucinated rewards enables agents to quickly start learning sparse-reward tasks. This approach extends the Hindsight Experience Replay (HER) algorithm to visual settings.
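The relabeling idea described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the code in this repository: hallucinate_goal is a hypothetical stand-in for the trained generator, the pose fields and reach_radius are invented for the example, and the goal is assumed to be placed where the failed trajectory ended, in the spirit of HER's final-state relabeling. For simplicity the sketch relabels a whole trajectory at once rather than at batch-sampling time.

```python
import math


def hallucinate_goal(image, relative_goal):
    """Hypothetical stand-in for a trained HALGAN generator.

    A real generator would paint the goal into the image at the given
    relative position; here the image is only tagged with that position.
    """
    return {"pixels": image, "goal_at": relative_goal}


def relabel_failed_trajectory(trajectory, reach_radius=0.5):
    """Relabel a failed trajectory so that it ends at a hallucinated goal.

    Each transition is a dict with keys: image, pose, action, next_image,
    next_pose. Poses are (x, y) tuples. All field names are assumptions.
    """
    # Assumption: pretend the goal was located where the agent ended up.
    goal = trajectory[-1]["next_pose"]
    relabeled = []
    for step in trajectory:
        rel = (goal[0] - step["pose"][0], goal[1] - step["pose"][1])
        next_rel = (goal[0] - step["next_pose"][0],
                    goal[1] - step["next_pose"][1])
        # The reward is hallucinated only if this transition reaches the goal.
        reached = math.hypot(*next_rel) <= reach_radius
        relabeled.append({
            "image": hallucinate_goal(step["image"], rel),
            "action": step["action"],
            "reward": 1.0 if reached else 0.0,
            "next_image": hallucinate_goal(step["next_image"], next_rel),
            "done": reached,
        })
        if reached:
            break  # the hallucinated episode ends once the goal is reached
    return relabeled


# A three-step failed trajectory moving along the x axis.
failed = [
    {"image": "frame{}".format(i), "pose": (float(i), 0.0), "action": "forward",
     "next_image": "frame{}".format(i + 1), "next_pose": (float(i + 1), 0.0)}
    for i in range(3)
]
relabeled = relabel_failed_trajectory(failed)
rewards = [t["reward"] for t in relabeled]
```

Only the last transition receives a hallucinated reward here, because it is the only one whose next pose falls within reach_radius of the hallucinated goal.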

The left image shows various outputs of HALGAN in one of the environments, where the goal is to navigate to a red box. Its top-left panel shows a frame from a failed trajectory in which the goal is not in sight. In the remaining panels, HALGAN is asked to hallucinate the goal on top of that failed frame at specific locations, with distance increasing from top to bottom and yaw increasing from left to right. The right image shows the benefit of training with HALGAN compared to various baselines. For a full explanation of the baselines and the training process, please refer to the paper.

Dependencies

This code was developed using Python 3.5.2 on Ubuntu 16.04. In addition, the following dependencies should be installed:

Keras (version 2.2.0)
TensorFlow (version 1.8.0)
Offworld fork of keras-rl. Switch to branch offworld-halgan.
Offworld fork of gym-miniworld. Switch to branch offworld-halgan.

Usage

This repository contains:

Code to train a HALGAN from scratch
Script for setting up training of a HALGAN-equipped DDPG agent from the offworld-halgan branch of the Offworld keras-rl fork. Please refer to that keras-rl repository for the code of a HALGAN DQN agent as well.
Data and trained models for the MiniWorld environment presented in the paper.

Simply run

cd src/
python train_halgan.py

to start training HALGAN on the MiniWorld task. Trained models are saved to experiments/halgan-[ENV]/[datestamp], along with some example images. Training data for this environment is located in data/MiniWorld-SimToReal1Cont-v0/training-data.
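Since every training run writes to its own [datestamp] directory, a small helper can be handy for locating the most recent one. The following is a sketch only: latest_run_dir is a hypothetical helper that is not part of this repository, and because the exact datestamp format is not documented here, it compares directory modification times instead of parsing the names.

```python
from pathlib import Path


def latest_run_dir(experiments_root, env_name):
    """Return the newest run directory under experiments/halgan-<env_name>.

    Returns None if the environment directory is missing or has no runs.
    Hypothetical helper; the directory layout follows the README text.
    """
    env_dir = Path(experiments_root) / "halgan-{}".format(env_name)
    if not env_dir.is_dir():
        return None
    runs = [p for p in env_dir.iterdir() if p.is_dir()]
    # Modification time is used because the datestamp format is unknown.
    return max(runs, key=lambda p: p.stat().st_mtime, default=None)
```

For example, latest_run_dir("experiments", "MiniWorld-SimToReal1Cont-v0") would return the newest run for the MiniWorld task, assuming [ENV] expands to that environment name.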

data/MiniWorld-SimToReal1Cont-v0/ also contains a trained HALGAN model for this environment. Run

cd src/
python train_ddpg.py

to start training a HALGAN DDPG agent that uses this model. Run

cd src/
python train_ddpg.py --mode vanilla

for a vanilla DDPG comparison. Check the --mode argument of train_ddpg.py for the other available baseline comparisons.
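To run the HALGAN agent and a baseline back to back, the two commands above can be wrapped in a short Python driver. This is a sketch under assumptions: ddpg_command and run_comparison are hypothetical helpers, and only the default invocation and --mode vanilla are taken from this README; any other mode name would need to be checked against the script itself.

```python
import subprocess
import sys


def ddpg_command(mode=None, script="train_ddpg.py"):
    """Build the command line for one training run.

    mode=None reproduces the default (HALGAN) invocation; a string is
    passed through as the value of --mode.
    """
    cmd = [sys.executable, script]
    if mode is not None:
        cmd += ["--mode", mode]
    return cmd


def run_comparison(modes, src_dir="src"):
    """Run the training script once per mode, one after another."""
    for mode in modes:
        # check=True stops the comparison if a run exits with an error.
        subprocess.run(ddpg_command(mode), cwd=src_dir, check=True)
```

Calling run_comparison([None, "vanilla"]) from the repository root would train the HALGAN DDPG agent first and the vanilla DDPG baseline second.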

Citation

Please use the following BibTeX entry to cite this work in your publications:

@inproceedings{sahni2019addressing,
title = {Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs},
author = {Sahni, Himanshu and Buckley, Toby and Abbeel, Pieter and Kuzovkin, Ilya},
booktitle = {Advances in Neural Information Processing Systems 32},
year = {2019},
url = {https://arxiv.org/pdf/1901.11529.pdf}
}
