How AI Helps Students Learn in Programming Classes: Tracing ‘Prompt-Level Trajectories’
Tracing Prompt-Level Trajectories to Understand Student Learning with AI in Programming Education
Key Points, Summarized by HCI Today
- This article reports on a study analyzing how students in a Python course use AI tools such as ChatGPT to solve problems.
- The research team examined conversation logs and code submissions from 146 of 163 participants, comparing question-asking styles with the resulting code.
- Students most often delegated the entire solution to the AI, though some mixed approaches, revising step by step or combining strategies.
- In 84.4% of submissions, AI-generated code was used with little modification, and the submitted code tended to converge on similar forms.
- Relying on AI can help students finish assignments quickly, but it may reduce opportunities to practice thinking and revising on their own, making instructional design crucial.
This summary was generated by an AI editor based on HCI expert perspectives.
Why Read This from an HCI Perspective
This article is meaningful because it frames AI not as a mere ‘code generator,’ but as an interaction tool that helps students solve problems together. It shows how differences in usage—such as some students handing everything to AI while others iteratively refine answers—translate into different learning outcomes. For HCI/UX practitioners, it’s a strong example of why designing where and how users can intervene and understand the system matters more than simply adding AI features.
CIT's Commentary
The core of this study is not how much students used LLMs, but how they worked together with them. There are workflows where students receive answers in one shot and copy them, as well as workflows where they gradually adjust results to get them right. These differences aren't just performance gaps; they can be seen as differences in learning paths shaped by the interface. Especially in systems where safety is critical, even if AI appears to 'just handle it,' users must be able to read the state and intervene along the way. Educational AI is similar: rather than only providing answers, the design should clearly show what the AI did and what remains.

Classifying student interaction types using LLMs is interesting, but such measurement tools also need rigorous checks for alignment with human judgments and for the possibility of misclassification. It's important to use AI to assist research while ensuring that the research standards do not become looser.
Questions to Consider While Reading
- Q. What kind of feedback would be most effective for clearly distinguishing, at the interface level, which parts of a task students delegate to AI versus which parts they solve themselves?
- Q. What design would allow assignment grading to capture not only the 'final answer,' but also the 'revision process' and 'verification traces'?
- Q. When automatically classifying students' interaction types using LLMs, how can we consistently validate reliability against human coding?
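On the last question, a standard first check is chance-corrected agreement between human coders and the LLM classifier. Below is a minimal sketch of Cohen's kappa in plain Python; the label names (`delegate`, `iterate`, `mixed`) and the sample data are hypothetical, not taken from the study.

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa: agreement between two raters, corrected for chance."""
    assert len(rater_a) == len(rater_b) and len(rater_a) > 0
    n = len(rater_a)
    # Observed agreement: fraction of items both raters labeled identically.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Expected agreement under independence, from each rater's label frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    labels = set(rater_a) | set(rater_b)
    expected = sum((counts_a[c] / n) * (counts_b[c] / n) for c in labels)
    return (observed - expected) / (1 - expected)

# Hypothetical labels: human coding vs. LLM classification of the same students.
human = ["delegate", "iterate", "delegate", "mixed", "iterate", "delegate"]
llm   = ["delegate", "iterate", "iterate",  "mixed", "iterate", "delegate"]
print(round(cohens_kappa(human, llm), 3))  # → 0.739
```

Values above roughly 0.8 are commonly read as strong agreement; a lower kappa would suggest the LLM classifier should not replace human coding without further calibration or error analysis.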
This commentary was generated by an AI editor based on HCI expert perspectives.
Please refer to the original for accurate details.