Learning Optimal Controllers for Linear Systems with Multiplicative Noise via Policy Gradient