DMQL: Deep Maximum Q-Learning