Author of the publication

Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.

, , , , , and . IEEE Trans. Neural Networks Learn. Syst., 33 (11): 6584-6598 (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic., , , and . CoRR, (2020)On the Optimization Landscape of Dynamic Output Feedback Linear Quadratic Control., , , and . IEEE Trans. Autom. Control., 69 (2): 920-935 (February 2024)Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization*., , , , and . ITSC, page 1-7. IEEE, (2020)Fixed-Dimensional and Permutation Invariant State Representation of Autonomous Driving., , , , , , and . CoRR, (2021)Model-Based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian., , , , , , , , and . IEEE Trans. Neural Networks Learn. Syst., 35 (1): 466-478 (January 2024)Belief state separated reinforcement learning for autonomous vehicle decision making under uncertainty., , , , , , and . ITSC, page 586-592. IEEE, (2021)RL-Driven MPPI: Accelerating Online Control Laws Calculation With Offline Policy., , , , , , , and . IEEE Trans. Intell. Veh., 9 (2): 3605-3616 (February 2024)Direct and indirect reinforcement learning., , , , , , and . Int. J. Intell. Syst., 36 (8): 4439-4467 (2021)Optimization Landscape of Policy Gradient Methods for Discrete-Time Static Output Feedback., , , , , and . IEEE Trans. Cybern., 54 (6): 3588-3601 (June 2024)Smoothing Policy Iteration for Zero-sum Markov Games., , , , , and . CoRR, (2022)