GitHub - Natasha-R/ChatGPT-Study: A study analysing AI-generated code, focusing on software engineering use-cases.

This data is associated with the following research paper: What an AI-Embracing Software Engineering Curriculum Should Look Like: An Empirical Study.

Abstract: It is not possible to reliably prevent the use of artificial intelligence (AI) tools, nor would that be desirable as AI offers many benefits for students. We recommend that appropriate AI usage be taught within software engineering courses and AI tools integrated into examinations. In order to most effectively support today’s students, software engineering curricula must embrace AI.

The purpose of this study is to analyse AI-generated code, with a focus on software engineering use-cases. The seven core areas of this research are described below.

This dataset can also be accessed on the IEEE dataport: https://dx.doi.org/10.21227/4rxb-zv06

Analysis of AI and Student-written Code

AI chatbots (ChatGPT-4, ChatGPT-3.5, Bing Chat and Bard) were used to generate code solutions to Java programming tasks (milestone assignments) taken from the 2021 presentation of the Software Engineering 2 Bachelor's course at TH Köln. Student solutions to the same assignments were stored anonymously. The differences between the AI and human written code solutions to "milestone 0" are analysed, and a simple classification model was trained to distinguish between AI and human written code. For the analysis, two approaches are compared: representing the code using manually defined features, and by OpenAI's text embedding vectors.

AI or Student-written Code Predictions

In order to determine how effectively AI-written code can be detected "by eye", a mixed set of AI- and human-written code solutions to the "Software Engineering 2" 2021 milestone 0 assignment was anonymised and given to two faculty members, who made predictions as to whether each solution was written by either a student or AI chatbot.

Analysis of AI and Human-written Python Code

ChatGPT was used to generate Python code solutions to the HumanEval problem set. The differences between the AI and human-written canonical code solutions are analysed, and a model is trained to distinguish between the two classes. The feature and embedding representation approaches are compared.

ChatGPT Capabilities Experiment

An evaluation of the capabilities of ChatGPT at completing software engineering university course assignments. By prompting ChatGPT-4 with only the original task description and provided document comments, purely AI-written code was generated as the solution to the "Software Engineering 2" 2023 assignment, comprising of the creation of a complex eCommerce system.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
AI Tools Experiences and Guidelines		AI Tools Experiences and Guidelines
AI or Student-written Code Predictions		AI or Student-written Code Predictions
Analysis of AI and Human-written Python Code		Analysis of AI and Human-written Python Code
Analysis of AI and Student-written Code		Analysis of AI and Student-written Code
ChatGPT Capabilities Experiment		ChatGPT Capabilities Experiment
Student Experiences with ChatGPT		Student Experiences with ChatGPT
Student Survey		Student Survey
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Analysis of AI and Student-written Code

AI or Student-written Code Predictions

Analysis of AI and Human-written Python Code

ChatGPT Capabilities Experiment

Student Experiences with ChatGPT

AI Tools Experiences and Guidelines

Student Survey

About

Languages

License

Natasha-R/ChatGPT-Study

Folders and files

Latest commit

History

Repository files navigation

Analysis of AI and Student-written Code

AI or Student-written Code Predictions

Analysis of AI and Human-written Python Code

ChatGPT Capabilities Experiment

Student Experiences with ChatGPT

AI Tools Experiences and Guidelines

Student Survey

About

Topics

Resources

License

Stars

Watchers

Forks

Languages