Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A Contextual Bandit Approach to Personalized Online Recommendation via Sparse Interactions.

C. Zhang, H. Wang, S. Yang, and Y. Gao. PAKDD (2), volume 11440 of Lecture Notes in Computer Science, page 394-406. Springer, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Yang Yang

Other publications of authors with the same name

Convergence Analysis of Graphical Game-Based Nash Q-Learning using the Interaction Detection Signal of N-Step Return.Y. Zhuang, S. Yang, W. Li, and Y. Gao. ICASSP, page 1-5. IEEE, (2023)Leveraging transition exploratory bonus for efficient exploration in Hard-Transiting reinforcement learning problems.S. Yang, H. Wang, S. Dong, and X. Chen. Future Gener. Comput. Syst., (August 2023)Online attentive kernel-based temporal difference learning.X. Chen, G. Yang, S. Yang, H. Wang, S. Dong, and Y. Gao. Knowl. Based Syst., (October 2023)GUARD: Multigranularity-based Unsupervised Anomaly Detection Algorithm for Multivariate Time Series.F. Meng, Q. Yang, Z. He, S. Yang, and W. Tang. CCIS, page 25-30. IEEE, (2022)Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient.W. Chen, W. Li, X. Liu, S. Yang, and Y. Gao. AAAI, page 11542-11550. AAAI Press, (2023)A Contextual Bandit Approach to Personalized Online Recommendation via Sparse Interactions.C. Zhang, H. Wang, S. Yang, and Y. Gao. PAKDD (2), volume 11440 of Lecture Notes in Computer Science, page 394-406. Springer, (2019)New Galois hulls of generalized Reed-Solomon codes.Y. Wu, C. Li, and S. Yang. Finite Fields Their Appl., (2022)An Optimal Algorithm for the Stochastic Bandits While Knowing the Near-Optimal Mean Reward.S. Yang, and Y. Gao. IEEE Trans. Neural Networks Learn. Syst., 32 (5): 2285-2291 (2021)Learning Credit Assignment for Cooperative Reinforcement Learning.W. Chen, W. Li, X. Liu, and S. Yang. CoRR, (2022)Modified Retrace for Off-Policy Temporal Difference Learning.X. Chen, X. Ma, Y. Li, G. Yang, S. Yang, and Y. Gao. UAI, volume 216 of Proceedings of Machine Learning Research, page 303-312. PMLR, (2023)

BibSonomy

Disambiguation of "Yang, Shangdong"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A Contextual Bandit Approach to Personalized Online Recommendation via Sparse Interactions.

Please choose a person to relate this publication to

Yang Yang

Yang Yang

Yang Yang

Yang Yang

Yang Yang

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Yang, Shangdong"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML A Contextual Bandit Approach to Personalized Online Recommendation via Sparse Interactions.

Please choose a person to relate this publication to

Yang Yang

Yang Yang

Yang Yang

Yang Yang

Yang Yang

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A Contextual Bandit Approach to Personalized Online Recommendation via Sparse Interactions.