Abstract

Unit tests play a key role in ensuring the correctness of software. However, manually creating unit tests is a laborious task, motivating the need for automation. This paper presents TestPilot, an adaptive test generation technique that leverages Large Language Models (LLMs). TestPilot uses Codex, an off-the-shelf LLM, to automatically generate unit tests for a given program without requiring additional training or few-shot learning on examples of existing tests. In our approach, Codex is provided with prompts that include the signature and implementation of a function under test, along with usage examples extracted from documentation. If a generated test fails, TestPilot's adaptive component attempts to generate a new test that fixes the problem by re-prompting the model with the failing test and error message. We created an implementation of TestPilot for JavaScript and evaluated it on 25 npm packages with a total of 1,684 API functions to generate tests for. Our results show that the generated tests achieve up to 93.1% statement coverage (median 68.2%). Moreover, on average, 58.5% of the generated tests contain at least one assertion that exercises functionality from the package under test. Our experiments with excluding parts of the information included in the prompts show that all components contribute towards the generation of effective test suites. Finally, we find that TestPilot does not generate memorized tests: 92.7% of our generated tests have ≤ 50% similarity with existing tests (as measured by normalized edit distance), with none of them being exact copies.
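
The following is a minimal sketch of the adaptive re-prompting loop the abstract describes, not code from the paper. The names (buildPrompt, generateTestAdaptively) and the injected llm and runTest callbacks are illustrative stand-ins: TestPilot itself queries Codex and executes the generated JavaScript tests, and those details are abstracted away here.

// Sketch (TypeScript) of adaptive test generation, assuming hypothetical
// stand-ins for the LLM completion call and the test runner.

interface TestOutcome {
  passed: boolean;
  errorMessage?: string;
}

// Build the initial prompt from the function's signature, its implementation,
// and usage examples extracted from documentation.
function buildPrompt(signature: string, body: string, docExamples: string[]): string {
  return [
    "// Function under test:",
    signature,
    body,
    "// Usage examples from the documentation:",
    ...docExamples.map((ex) => `// ${ex}`),
    "// Unit test for the function above:",
  ].join("\n");
}

// Adaptive component: if a generated test fails, re-prompt the model with the
// failing test and its error message and ask for a fixed version.
async function generateTestAdaptively(
  llm: (prompt: string) => Promise<string>,         // hypothetical LLM call
  runTest: (test: string) => Promise<TestOutcome>,  // hypothetical test runner
  signature: string,
  body: string,
  docExamples: string[],
  maxAttempts = 3
): Promise<string | undefined> {
  let prompt = buildPrompt(signature, body, docExamples);
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const candidate = await llm(prompt);
    const outcome = await runTest(candidate);
    if (outcome.passed) {
      return candidate; // keep the first passing test
    }
    // Re-prompt with the failing test and the error message appended.
    prompt = [
      buildPrompt(signature, body, docExamples),
      "// The test below failed:",
      candidate,
      `// Error: ${outcome.errorMessage ?? "unknown failure"}`,
      "// Fixed unit test:",
    ].join("\n");
  }
  return undefined; // no passing test within the attempt budget
}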

Description

Adaptive Test Generation Using a Large Language Model
