Online Model Learning Algorithms for Actor-Critic Control