Off-policy learning based on weighted importance sampling with linear computational complexity.

UAI, pages 552-561. AUAI Press, (2015)
