The actor-critic algorithm