Combining MPC and reinforcement learning in a model-reference framework for urban traffic signal control