Policy gradient methods for discrete time linear quadratic regulator with random parameters Deyue Li ESAIM: COCV, 30 (2024) 26 Published online: 09 April 2024 DOI: 10.1051/cocv/2024014