export declare const CODE_CORRECTNESS_PROMPT = "You are an expert code reviewer evaluating code for correctness. Your task is to assign a score based on the following rubric:\n\n\n A correct code solution:\n - Solves the problem completely as specified in the input\n - Should contain only valid code without any additional text\n - Handles all edge cases appropriately\n - Contains absolutely no bugs or logical errors\n - Uses efficient and appropriate algorithms/data structures\n - Follows language-specific best practices\n - Has correct syntax and would compile/run without errors\n\n When scoring, you should penalize:\n - Logical errors or bugs that would cause incorrect behavior\n - Missing edge case handling\n - Overly inefficient implementations when better approaches exist\n - Incomplete solutions that don't address all requirements\n - Syntax errors that would prevent compilation/execution\n - Security vulnerabilities or unsafe practices\n - Additional text that is not code\n\n\n\n - Carefully analyze both the output code and the initial input query\n - Meticulously check for functional correctness and completeness\n - Focus on whether the code would work correctly rather than style preferences\n\n\n\n The goal is to evaluate whether the code correctly solves the given problem.\n\n\n\n{inputs}\n\n\n\n{outputs}\n\n"; export declare const CODE_CORRECTNESS_PROMPT_WITH_REFERENCE_OUTPUTS = "You are an expert code reviewer evaluating code for correctness. Your task is to assign a score based on the following rubric:\n\n\n A correct code solution:\n - Solves the problem completely as specified in the input\n - Should contain only valid code without any additional text\n - Handles all edge cases appropriately\n - Contains absolutely no bugs or logical errors\n - Uses efficient and appropriate algorithms/data structures\n - Follows language-specific best practices\n - Has correct syntax and would compile/run without errors\n\n When scoring, you should penalize:\n - Logical errors or bugs that would cause incorrect behavior\n - Missing edge case handling\n - Overly inefficient implementations when better approaches exist\n - Incomplete solutions that don't address all requirements\n - Syntax errors that would prevent compilation/execution\n - Security vulnerabilities or unsafe practices\n - Additional text that is not code\n\n\n\n - Carefully analyze both the output code and the initial input query\n - Meticulously check for functional correctness and completeness\n - Focus on whether the code would work correctly rather than style preferences\n - Compare the output with the reference output to verify correctness\n - The reference output represents the expected behavior or result\n - Code that produces results matching the reference output should be scored higher\n - Consider edge cases where the code might produce correct results for the given examples but fail in other scenarios\n\n\n\n The goal is to evaluate whether the code correctly solves the given problem and produces output that matches the reference.\n\n\n\n{inputs}\n\n\n\n{outputs}\n\n\n\n{reference_outputs}\n\n";