REAL Reinforcement Learning