Inproceedings

My AI Wants to Know if This Will Be on the Exam: Testing OpenAI’s Codex on CS2 Programming Exercises

, , , , , and .
Proceedings of the 25th Australasian Computing Education Conference, pages 97-104. ACM, January 2023.
DOI: 10.1145/3576123.3576134

Abstract

The introduction of OpenAI Codex sparked a surge of interest in the impact of generative AI models on computing education practices. Codex is also the underlying model for GitHub Copilot, a plugin which makes AI-generated code accessible to students through auto-completion in popular code editors. Research in this area, particularly on the educational implications, is nascent and has focused almost exclusively on introductory programming (or CS1) questions. Very recent work has shown that Codex performs considerably better on typical CS1 exam questions than most students. It is not clear, however, what Codex’s limits are with regard to more complex programming assignments and exams. In this paper, we present results detailing how Codex performs on more advanced CS2 (data structures and algorithms) exam questions taken from past exams. We compare these results to those of students who took the same exams under normal conditions, demonstrating that Codex outscores most students. We consider the implications of such tools for the future of undergraduate computing education.
