Safe Online Robust Exploration for Reinforcement Learning Control of Unmanned Aerial Vehicles