Article,

On environment difficulty and discriminating power

J. Hernández-Orallo.
Autonomous Agents and Multi-Agent Systems, (2014)
DOI: 10.1007/s10458-014-9257-1

Abstract

This paper presents a way to estimate the difficulty and discriminating power of any task instance. We focus on a very general setting for tasks: interactive (possibly multi-agent) environments where an agent acts upon observations and rewards. Instead of analysing the complexity of the environment, the state space or the actions that are performed by the agent, we analyse the performance of a population of agent policies against the task, leading to a distribution that is examined in terms of policy complexity. This distribution is then sliced by the algorithmic complexity of the policy and analysed through several diagrams and indicators. The notion of environment response curve is also introduced, by inverting the performance results into an ability scale. We apply all these concepts, diagrams and indicators to two illustrative problems: a class of agent-populated elementary cellular automata, showing how the difficulty and discriminating power may vary for several environments, and a multi-agent system, where agents can become predators or preys, and may need to coordinate. Finally, we discuss how these tools can be applied to characterise (interactive) tasks and (multi-agent) environments. These characterisations can then be used to get more insight about agent performance and to facilitate the development of adaptive tests for the evaluation of agent abilities.

BibTeX key: hernandez-orallo-environment-difficulty-discriminating-2014
entry type: article
year: 2014
journal: Autonomous Agents and Multi-Agent Systems
pages: 1--53
publisher: Springer US
issn: 1387-2532
language: English
DOI: 10.1007/s10458-014-9257-1
url: http://dx.doi.org/10.1007/s10458-014-9257-1

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

%0 Journal Article %1 hernandez-orallo-environment-difficulty-discriminating-2014 %A Hernández-Orallo, José %D 2014 %I Springer US %J Autonomous Agents and Multi-Agent Systems %K Kolmogorov_complexity SAT_problem alife cellular_automata environment_difficulty item_response_theory reinforcement_learning %P 1--53 %R 10.1007/s10458-014-9257-1 %T On environment difficulty and discriminating power %U http://dx.doi.org/10.1007/s10458-014-9257-1 %X This paper presents a way to estimate the difficulty and discriminating power of any task instance. We focus on a very general setting for tasks: interactive (possibly multi-agent) environments where an agent acts upon observations and rewards. Instead of analysing the complexity of the environment, the state space or the actions that are performed by the agent, we analyse the performance of a population of agent policies against the task, leading to a distribution that is examined in terms of policy complexity. This distribution is then sliced by the algorithmic complexity of the policy and analysed through several diagrams and indicators. The notion of environment response curve is also introduced, by inverting the performance results into an ability scale. We apply all these concepts, diagrams and indicators to two illustrative problems: a class of agent-populated elementary cellular automata, showing how the difficulty and discriminating power may vary for several environments, and a multi-agent system, where agents can become predators or preys, and may need to coordinate. Finally, we discuss how these tools can be applied to characterise (interactive) tasks and (multi-agent) environments. These characterisations can then be used to get more insight about agent performance and to facilitate the development of adaptive tests for the evaluation of agent abilities.

@article{hernandez-orallo-environment-difficulty-discriminating-2014, abstract = {This paper presents a way to estimate the difficulty and discriminating power of any task instance. We focus on a very general setting for tasks: interactive (possibly multi-agent) environments where an agent acts upon observations and rewards. Instead of analysing the complexity of the environment, the state space or the actions that are performed by the agent, we analyse the performance of a population of agent policies against the task, leading to a distribution that is examined in terms of policy complexity. This distribution is then sliced by the algorithmic complexity of the policy and analysed through several diagrams and indicators. The notion of environment response curve is also introduced, by inverting the performance results into an ability scale. We apply all these concepts, diagrams and indicators to two illustrative problems: a class of agent-populated elementary cellular automata, showing how the difficulty and discriminating power may vary for several environments, and a multi-agent system, where agents can become predators or preys, and may need to coordinate. Finally, we discuss how these tools can be applied to characterise (interactive) tasks and (multi-agent) environments. These characterisations can then be used to get more insight about agent performance and to facilitate the development of adaptive tests for the evaluation of agent abilities.}, added-at = {2014-04-30T13:29:26.000+0200}, author = {Hernández-Orallo, José}, biburl = {https://www.bibsonomy.org/bibtex/2740640716ccff15e7a77ef5fd90f0303/mhwombat}, doi = {10.1007/s10458-014-9257-1}, interhash = {bd8f7bd777865897d5dd7b3370147df9}, intrahash = {740640716ccff15e7a77ef5fd90f0303}, issn = {1387-2532}, journal = {Autonomous Agents and Multi-Agent Systems}, keywords = {Kolmogorov_complexity SAT_problem alife cellular_automata environment_difficulty item_response_theory reinforcement_learning}, language = {English}, pages = {1--53}, publisher = {Springer US}, timestamp = {2016-07-12T19:25:30.000+0200}, title = {On environment difficulty and discriminating power}, url = {http://dx.doi.org/10.1007/s10458-014-9257-1}, year = 2014 }

BibSonomy

On environment difficulty and discriminating power

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on