Pong using policy gradients