Fleet Planning Under Demand and Fuel Price Uncertainty Using Actor-Critic Reinforcement Learning