Flexible Heuristic Dynamic Programming for Reinforcement Learning in Quadrotors