Evaluation of physical damage associated with action selection strategies in reinforcement learning