Author of the publication

Multi-Objective Distributional Reinforcement Learning for Large-Scale Order Dispatching.

, , , , , , and . ICDM, page 1541-1546. IEEE, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Multi-Objective Distributional Reinforcement Learning for Large-Scale Order Dispatching., , , , , , and . ICDM, page 1541-1546. IEEE, (2021)Hierarchical Adaptive Contextual Bandits for Resource Constraint based Recommendation., , , and . WWW, page 292-302. ACM / IW3C2, (2020)Deep Reinforcement Learning for Ride-sharing Dispatching and Repositioning., , , , , and . IJCAI, page 6566-6568. ijcai.org, (2019)Deep Reinforcement Learning with Knowledge Transfer for Online Rides Order Dispatching., , , , and . ICDM, page 617-626. IEEE Computer Society, (2018)Origin-destination Flow Prediction with Vehicle Trajectory Data and Semi-supervised Recurrent Neural Network., , , , , , and . IEEE BigData, page 1450-1459. IEEE, (2019)Offline Model-based Adaptable Policy Learning., , , , , , and . NeurIPS, page 8432-8443. (2021)Reinforcement Learning for Ridesharing: A Survey., , and . ITSC, page 2447-2454. IEEE, (2021)A Deep Value-network Based Approach for Multi-Driver Order Dispatching., , , , , , , and . CoRR, (2021)Robust Low-Rank Tensor Recovery: Models and Algorithms., and . SIAM J. Matrix Anal. Appl., 35 (1): 225-253 (2014)InBEDE: Integrating Contextual Bandit with TD Learning for Joint Pricing and Dispatch of Ride-Hailing Platforms., , , , , , , and . ICDM, page 61-70. IEEE, (2019)