Reinforcement Learning-Based Design of Side-Channel Countermeasures