Assessment of Reinforcement Learning for CubeSat concept generation