copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

AGENT: A Benchmark for Core Psychological Reasoning

T. Shu, A. Bhandwaldar, C. Gan, K. Smith, S. Liu, D. Gutfreund, E. Spelke, J. Tenenbaum, and T. Ullman. (2021)cite arxiv:2102.12321Comment: ICML 2021, 12 pages, 7 figures.

Abstract

For machine agents to successfully interact with humans in real-world settings, they will need to develop an understanding of human mental life. Intuitive psychology, the ability to reason about hidden mental variables that drive observable actions, comes naturally to people: even pre-verbal infants can tell agents from objects, expecting agents to act efficiently to achieve goals given constraints. Despite recent interest in machine agents that reason about other agents, it is not clear if such agents learn or hold the core psychology principles that drive human reasoning. Inspired by cognitive development studies on intuitive psychology, we present a benchmark consisting of a large dataset of procedurally generated 3D animations, AGENT (Action, Goal, Efficiency, coNstraint, uTility), structured around four scenarios (goal preferences, action efficiency, unobserved constraints, and cost-reward trade-offs) that probe key concepts of core intuitive psychology. We validate AGENT with human-ratings, propose an evaluation protocol emphasizing generalization, and compare two strong baselines built on Bayesian inverse planning and a Theory of Mind neural network. Our results suggest that to pass the designed tests of core intuitive psychology at human levels, a model must acquire or have built-in representations of how agents plan, combining utility computations and core knowledge of objects and physics.

Description

AGENT: A Benchmark for Core Psychological Reasoning

Links and resources

BibTeX key: shu2021agent
entry type: misc
year: 2021
url: http://arxiv.org/abs/2102.12321
note: cite arxiv:2102.12321Comment: ICML 2021, 12 pages, 7 figures

@charleslbryant's tags highlighted

Cite this publication

@misc{shu2021agent, abstract = {For machine agents to successfully interact with humans in real-world settings, they will need to develop an understanding of human mental life. Intuitive psychology, the ability to reason about hidden mental variables that drive observable actions, comes naturally to people: even pre-verbal infants can tell agents from objects, expecting agents to act efficiently to achieve goals given constraints. Despite recent interest in machine agents that reason about other agents, it is not clear if such agents learn or hold the core psychology principles that drive human reasoning. Inspired by cognitive development studies on intuitive psychology, we present a benchmark consisting of a large dataset of procedurally generated 3D animations, AGENT (Action, Goal, Efficiency, coNstraint, uTility), structured around four scenarios (goal preferences, action efficiency, unobserved constraints, and cost-reward trade-offs) that probe key concepts of core intuitive psychology. We validate AGENT with human-ratings, propose an evaluation protocol emphasizing generalization, and compare two strong baselines built on Bayesian inverse planning and a Theory of Mind neural network. Our results suggest that to pass the designed tests of core intuitive psychology at human levels, a model must acquire or have built-in representations of how agents plan, combining utility computations and core knowledge of objects and physics.}, added-at = {2023-07-29T22:14:24.000+0200}, author = {Shu, Tianmin and Bhandwaldar, Abhishek and Gan, Chuang and Smith, Kevin A. and Liu, Shari and Gutfreund, Dan and Spelke, Elizabeth and Tenenbaum, Joshua B. and Ullman, Tomer D.}, biburl = {https://www.bibsonomy.org/bibtex/2ca6eaf28c1a8d07914bde77b6106d5b4/charleslbryant}, description = {AGENT: A Benchmark for Core Psychological Reasoning}, interhash = {99e3648f01dcaa37401ead9859fe24fb}, intrahash = {ca6eaf28c1a8d07914bde77b6106d5b4}, keywords = {agents machine}, note = {cite arxiv:2102.12321Comment: ICML 2021, 12 pages, 7 figures}, timestamp = {2023-07-29T22:14:24.000+0200}, title = {AGENT: A Benchmark for Core Psychological Reasoning}, url = {http://arxiv.org/abs/2102.12321}, year = 2021 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

AGENT: A Benchmark for Core Psychological Reasoning

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML AGENT: A Benchmark for Core Psychological Reasoning

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

AGENT: A Benchmark for Core Psychological Reasoning

Comments and Reviews
(0)