Sat 21 Feb 2026 13:40 - 14:00 at Meeting Room 102 - Detecting AI Generated Code Chair(s): Ryan Dougherty
With the rapid surge of generative AI, many tools have been introduced, such as Google’s Gemini and OpenAI’s GPT-4, with the well-intentioned goal of supporting programmers [5, 8]. These tools can be used by professional programmers to help write code efficiently as well as support debugging and testing; however, we recently began to notice an increase in the number of novice programmers who become highly dependent on Large Language Models (LLMs) to code for them rather than using LLMs as a learning tool [2, 10]. In our CS1 course, approximately 10-15% of the students (out of~350) were cited for academic misconduct due to direct plagiarism from LLMs, many of which performed poorly due to an over-reliance on generative AI.
To address this, we developed a machine learning-based tool to detect AI-generated code. The tool utilized datasets consisting of thousands of student submissions (in C++) from introductory programming courses and we created an equal number of AI-generated solutions using carefully curated prompts based on the same programming prompts. We trained traditional ML models (Random Forest, XGBoost, etc.) on a labeled dataset, and our best-performing model achieved high precision and recall. Notably, the models remained robust even when trained with noisy data that included AI-generated samples. Our goal is to provide the community with a model that can be customized to any course program to encourage early detection and intervention of plagiarized code generated by LLMs.
Sat 21 FebDisplayed time zone: Central Time (US & Canada) change
Sat 21 Feb
Displayed time zone: Central Time (US & Canada) change
13:40 - 15:00 | Detecting AI Generated Code Papers at Meeting Room 102 Chair(s): Ryan Dougherty United States Military Academy | ||
13:40 20mTalk | Detecting AI-Generated Code in Introductory Programming Courses Papers Aryan Ramachandra pc, Suhani Chaudhary University of California, Riverside, Justin Tran University of California, Riverside, Riti Desai University of California, Riverside, Ashley Pang UC Riverside, Mariam Salloum BCOE/Computer Science | ||
14:00 20mTalk | LLM-Based Explainable Detection of LLM-Generated Code in Python Programming CoursesGlobal Papers Jeonghun Baek The University of Tokyo, Tetsuro Yamazaki University of Tokyo, Akimasa Morihata University of Tokyo, Junichiro Mori The University of Tokyo, Yoko Yamakata The University of Tokyo, Kenjiro Taura The University of Tokyo, Shigeru Chiba The University of Tokyo | ||
14:40 20mTalk | AI in the Eyes of Middle Schoolers: Perceptions, Attitudes, and LiteracyGlobalK12 Papers Maria Kasinidou Open University of Cyprus, Styliani Kleanthous Open University of Cyprus, Jahna Otterbacher Open University of Cyprus | ||