Sat 21 Feb 2026 13:40 - 14:00 at Meeting Room 102 - Detecting AI Generated Code Chair(s): Ryan Dougherty

With the rapid surge of generative AI, many tools have been introduced, such as Google’s Gemini and OpenAI’s GPT-4, with the well-intentioned goal of supporting programmers [5, 8]. These tools can be used by professional programmers to help write code efficiently as well as support debugging and testing; however, we recently began to notice an increase in the number of novice programmers who become highly dependent on Large Language Models (LLMs) to code for them rather than using LLMs as a learning tool [2, 10]. In our CS1 course, approximately 10-15% of the students (out of~350) were cited for academic misconduct due to direct plagiarism from LLMs, many of which performed poorly due to an over-reliance on generative AI.

To address this, we developed a machine learning-based tool to detect AI-generated code. The tool utilized datasets consisting of thousands of student submissions (in C++) from introductory programming courses and we created an equal number of AI-generated solutions using carefully curated prompts based on the same programming prompts. We trained traditional ML models (Random Forest, XGBoost, etc.) on a labeled dataset, and our best-performing model achieved high precision and recall. Notably, the models remained robust even when trained with noisy data that included AI-generated samples. Our goal is to provide the community with a model that can be customized to any course program to encourage early detection and intervention of plagiarized code generated by LLMs.

Sat 21 Feb

Displayed time zone: Central Time (US & Canada) change

13:40 - 15:00
Detecting AI Generated Code Papers at Meeting Room 102
Chair(s): Ryan Dougherty United States Military Academy
13:40
20m
Talk
Detecting AI-Generated Code in Introductory Programming Courses
Papers
Aryan Ramachandra pc, Suhani Chaudhary University of California, Riverside, Justin Tran University of California, Riverside, Riti Desai University of California, Riverside, Ashley Pang UC Riverside, Mariam Salloum BCOE/Computer Science
14:00
20m
Talk
LLM-Based Explainable Detection of LLM-Generated Code in Python Programming CoursesGlobal
Papers
Jeonghun Baek The University of Tokyo, Tetsuro Yamazaki University of Tokyo, Akimasa Morihata University of Tokyo, Junichiro Mori The University of Tokyo, Yoko Yamakata The University of Tokyo, Kenjiro Taura The University of Tokyo, Shigeru Chiba The University of Tokyo
14:40
20m
Talk
AI in the Eyes of Middle Schoolers: Perceptions, Attitudes, and LiteracyGlobalK12
Papers
Maria Kasinidou Open University of Cyprus, Styliani Kleanthous Open University of Cyprus, Jahna Otterbacher Open University of Cyprus