Author of the publication

To what extent do human explanations of model behavior align with actual model behavior?

BlackboxNLP@EMNLP, pages 1-14. Association for Computational Linguistics, (2021)


Other publications of authors with the same name

Delete, Retrieve, Generate: a Simple Approach to Sentiment and Style Transfer. NAACL-HLT, pages 1865-1874. Association for Computational Linguistics, (2018)
Dynabench: Rethinking Benchmarking in NLP. NAACL-HLT, pages 4110-4124. Association for Computational Linguistics, (2021)
Swords: A Benchmark for Lexical Substitution with Improved Data Coverage and Quality. NAACL-HLT, pages 4362-4379. Association for Computational Linguistics, (2021)
Does VLN Pretraining Work with Nonsensical or Irrelevant Instructions? CoRR, (2023)
CoNAL: Anticipating Outliers with Large Language Models. CoRR, (2022)
Can Small and Synthetic Benchmarks Drive Modeling Innovation? A Retrospective Study of Question Answering Modeling Approaches. CoRR, (2021)
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants. CoRR, (2021)
Human Evaluation of Spoken vs. Visual Explanations for Open-Domain QA. CoRR, (2020)
Benchmarking Long-tail Generalization with Likelihood Splits. EACL (Findings), pages 933-953. Association for Computational Linguistics, (2023)
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples. EMNLP, pages 7832-7848. Association for Computational Linguistics, (2023)