Policy gradient methods for discrete time linear quadratic regulator with random parameters
Deyue Li
ESAIM: COCV, 30 (2024) 26
DOI: https://doi.org/10.1051/cocv/2024014