Preference-driven demonstrations ranking for inverse reinforcement learning