August 2016 – Hado van Hasselt

Atari videos

(This contents of this NIPS spotlight video is similar to the post below, although the post is a bit more detailed.) Reinforcement learning agents can learn to play video games (for instance Atari games) by themselves. The original DQN algorithm and many of its successors clip the rewards they receive while learning. This helps stabilize the deep learning,…

Learning values across many orders of magnitude

Our paper about adaptive target normalization in deep learning was accepted at NIPS 2016. A preprint can be found on arXiv.org. The abstract and a more informal summary can be found below. Update: There are now videos of the effect of the new approach on Atari. Abstract Most learning algorithms are not invariant to the scale of the function that…