The Condition-Based Maintenance Scheduling Challenge: A Reinforcement Learning Interpretation