
A Policy Iteration Algorithm for Learning from Preference-Based Feedback.

IDA, volume 8207 of Lecture Notes in Computer Science, pages 427-437. Springer, (2013)
