Skip to content

Commit a65946a

Browse files
iteratermallamanis
authored andcommitted
Create kovalchuk2022human.markdown
1 parent 73d570a commit a65946a

File tree

1 file changed

+11
-0
lines changed

1 file changed

+11
-0
lines changed
Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
---
2+
layout: publication
3+
title: "Human perceiving behavior modeling in evaluation of code generation models"
4+
authors: S. Kovalchuk, V. Lomshakov, A. Aliev
5+
conference: GEM
6+
year: 2022
7+
additional_links:
8+
- {name: "ACLAnthology", url: "https://aclanthology.org/2022.gem-1.24/"}
9+
tags: ["code generation", "evaluation", "human evaluation", ]
10+
---
11+
Within this study, we evaluated a series of code generation models based on CodeGen and GPTNeo to compare the metric-based performance and human evaluation. For a deeper analysis of human perceiving within the evaluation procedure we’ve implemented a 5-level Likert scale assessment of the model output using a perceiving model based on the Theory of Planned Behavior (TPB). Through such analysis, we showed an extension of model assessment as well as a deeper understanding of the quality and applicability of generated code for practical question answering. The approach was evaluated with several model settings in order to assess diversity in quality and style of answer. With the TPB-based model, we showed a different level of perceiving the model result, namely personal understanding, agreement level, and readiness to use the particular code. With such analysis, we investigate a series of issues in code generation as natural language generation (NLG) problems observed in a practical context of programming question-answering with code.

0 commit comments

Comments
 (0)