Eval Function Python Program Code

Code-DiTing: Automatic Evaluation of Code Generation without References or Test Cases

Abstract: Trustworthy evaluation methods for code snippets play a crucial role in neural code generation. Traditional methods, which either rely on reference solutions or require executable test cases ...

IEEE

An Evaluation System for Improving the Stability of the Receiving End System by Adding Condenser Function to Thermal Power Units

Abstract: This paper presents a quantitative evaluation method for assessing the system stability of thermal power units after adding condenser function. An evaluation system for power system ...

GitHub

CATArena: Engineering-Level Tournament Evaluation Platform for LLM-Driven Code Agents

CATArena (Code Agent Tournament Arena) is an open-ended environment where LLMs write executable code agents to battle each other and then learn from each other. CATArena is an engineering-level ...
