Analyzing CS1 Student Code Using Code Embeddings

We present a machine learning model to obtain vector representations of student code submissions for a CS1 programming problem. These vector embeddings can be used to compare code, cluster code submissions, and identify errors. We hope to use these embeddings to identify conceptual misunderstandings in student code.
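As a rough illustration of what comparing submissions through their embeddings could look like, the sketch below scores pairs of vectors by cosine similarity. It is not the authors' model: the four-dimensional vectors, the submission names, and the helper functions `cosine_similarity` and `most_similar` are all hypothetical, and cosine similarity is just one common choice of comparison.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def most_similar(name, embeddings):
    """Return the name of the other submission closest to `name`."""
    target = embeddings[name]
    others = (key for key in embeddings if key != name)
    return max(others, key=lambda key: cosine_similarity(target, embeddings[key]))


# Hypothetical embeddings: made-up numbers, not output from any real model.
embeddings = {
    "submission_a": [0.9, 0.1, 0.0, 0.2],
    "submission_b": [0.8, 0.2, 0.1, 0.3],
    "submission_c": [0.0, 0.9, 0.7, 0.1],
}

if __name__ == "__main__":
    for name in embeddings:
        print(name, "is closest to", most_similar(name, embeddings))
```

Under these assumptions, submissions whose vectors point in similar directions are treated as similar code, and the same pairwise scores could feed a clustering step.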

Lisa Zhang | Robert Bazzocchi | Micah Flemming

[1] Sumit Gulwani, et al. Automated clustering and program repair for introductory programming assignments, 2016, PLDI.

[2] Uri Alon, et al. code2vec: learning distributed representations of code, 2018, Proc. ACM Program. Lang.

[3] Torsten Hoefler, et al. Neural Code Comprehension: A Learnable Representation of Code Semantics, 2018, NeurIPS.

[4] I-Han Hsiao, et al. user2code2vec: Embeddings for Profiling Students Based on Distributional Representations of Source Code, 2019, LAK.