Continuous state and action Q-learning framework applied to quadrotor UAV control