Multi Agent Deep Recurrent Q-Learning for Different Traffic Demands