From post

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications of persons with the same name

Trust-PCL: An Off-Policy Trust Region Method for Continuous Control. CoRR, (2017)
Data Perturbation for Escaping Local Maxima in Learning. AAAI/IAAI, pp. 132-139. AAAI Press / The MIT Press, (2002)
Rank/Norm Regularization with Closed-Form Solutions: Application to Subspace Clustering. CoRR, (2012)
Stochastic Neural Networks with Monotonic Activation Functions. (2015) cite arxiv:1601.00034v2.pdf. Comment: AISTATS 2016.
An experimental methodology for response surface optimization methods. J. Glob. Optim., 53 (4): 699-736 (2012)
AlgaeDICE: Policy Gradient from Arbitrary Experience. CoRR, (2019)
Learning Gene Regulatory Networks via Globally Regularized Risk Minimization. RECOMB-CG, volume 4751 of Lecture Notes in Computer Science, pp. 83-95. Springer, (2007)
Improving Policy Gradient by Exploring Under-appreciated Rewards. ICLR (Poster), OpenReview.net, (2017)
Practical PAC Learning. IJCAI, pp. 1169-1177. Morgan Kaufmann, (1995)
Planning and Learning with Stochastic Action Sets. IJCAI, pp. 4674-4682. ijcai.org, (2018)