A fast hybrid reinforcement learning framework with human corrective feedback