Memory-based Modeling and Prioritized Sweeping in Reinforcement Learning