Author of the publication

A Contextual Bandit Approach to Personalized Online Recommendation via Sparse Interactions.

, , , and . PAKDD (2), volume 11440 of Lecture Notes in Computer Science, page 394-406. Springer, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Convergence Analysis of Graphical Game-Based Nash Q-Learning using the Interaction Detection Signal of N-Step Return., , , and . ICASSP, page 1-5. IEEE, (2023)Leveraging transition exploratory bonus for efficient exploration in Hard-Transiting reinforcement learning problems., , , and . Future Gener. Comput. Syst., (August 2023)Online attentive kernel-based temporal difference learning., , , , , and . Knowl. Based Syst., (October 2023)GUARD: Multigranularity-based Unsupervised Anomaly Detection Algorithm for Multivariate Time Series., , , , and . CCIS, page 25-30. IEEE, (2022)Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient., , , , and . AAAI, page 11542-11550. AAAI Press, (2023)A Contextual Bandit Approach to Personalized Online Recommendation via Sparse Interactions., , , and . PAKDD (2), volume 11440 of Lecture Notes in Computer Science, page 394-406. Springer, (2019)New Galois hulls of generalized Reed-Solomon codes., , and . Finite Fields Their Appl., (2022)An Optimal Algorithm for the Stochastic Bandits While Knowing the Near-Optimal Mean Reward., and . IEEE Trans. Neural Networks Learn. Syst., 32 (5): 2285-2291 (2021)Learning Credit Assignment for Cooperative Reinforcement Learning., , , and . CoRR, (2022)Modified Retrace for Off-Policy Temporal Difference Learning., , , , , and . UAI, volume 216 of Proceedings of Machine Learning Research, page 303-312. PMLR, (2023)