Quantum gradient estimation and its application to quantum reinforcement learning