A platform for public participation in and discussion of the human perspective on machine-made moral decisions. It allows you to browse and judge scenarios where self-driving cars must make difficult moral decisions.

The paper shows that GANs can be viewed as actor-critic methods in an environment where the actor cannot affect the reward and presents strategies for stabilizing training for each class of models. By David Pfau, Oriol Vinyals (Google).

Video Pixel Network (VPN) estimates the discrete joint distribution of the raw pixel values in a video. VPN achieves the best possible performance on the Moving MNIST benchmark, a leap over the previous state of the art, and the generated videos show only minor deviations from the ground truth. (DeepMind)

It is possible for multiple robots to share their experience with one another, and thereby, learn a policy collectively. This work explores distributed and asynchronous policy learning as a means to achieve generalization and improved training times. (Google)

A neural machine translation (NMT) framework for simultaneous translation in which an agent learns to make decisions on when to translate from the interaction with a pre-trained NMT environment. By Jiatao Gu, Graham Neubig, Kyunghyun Cho, Victor O.K. Li (NYU).