---
name: confidence-evaluator
description: Evaluate requirement clarity and completeness using ISO/IEC/IEEE 29148:2018 criteria. Use when the user asks to implement features, fix bugs, or make changes. Automatically invoked when confidence_policy is enabled in ai-settings.json.
---
## When & Why to Use This Skill
The Confidence Evaluator is a professional-grade requirement analysis tool that assesses the clarity, completeness, and feasibility of software tasks using the ISO/IEC/IEEE 29148:2018 standard. It functions as a sophisticated guardrail for AI agents, calculating a structured confidence score to determine if a prompt provides sufficient detail for successful execution. By identifying ambiguities and missing constraints before coding begins, it significantly reduces errors and improves the reliability of AI-driven development workflows.
## Use Cases
- Feature Implementation Gatekeeping: Automatically evaluate new feature requests to ensure all inputs, outputs, and success criteria are clearly defined before the agent starts writing code.
- Bug Fix Clarity Assessment: Analyze bug reports to confirm they are unambiguous and provide enough context for a verifiable fix, preventing wasted effort on poorly defined issues.
- Architectural Change Validation: Use ISO-standard criteria to check if proposed system modifications are consistent with the existing project structure and technically feasible.
- Autonomous Workflow Optimization: Enable the 'confidence_policy' in AI settings to force the agent to ask clarifying questions whenever a user's instruction falls below a specific quality threshold.
## Confidence Evaluator Skill
You are evaluating the clarity and completeness of a user requirement against ISO/IEC/IEEE 29148:2018 standards.
## When to Use This Skill
- User requests implementation of a feature
- User asks for bug fixes
- User proposes architectural changes
- Any task requiring code modifications
## Configuration

Read the threshold from `.ai/ai-settings.json`:

```json
{
  "framework": {
    "confidence_policy": true,
    "confidence_threshold": 85
  }
}
```

If `confidence_policy` is `false`, skip evaluation entirely.
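Reading and applying this configuration can be sketched as follows; the helper name and default values are illustrative, not part of the skill specification:

```python
import json
from pathlib import Path

def load_confidence_settings(path=".ai/ai-settings.json"):
    """Return (policy_enabled, threshold) from the framework settings.

    Hypothetical helper; the fallback defaults are assumptions.
    """
    framework = json.loads(Path(path).read_text()).get("framework", {})
    enabled = bool(framework.get("confidence_policy", False))
    threshold = int(framework.get("confidence_threshold", 85))
    return enabled, threshold
```

If `enabled` is `False`, the caller skips evaluation entirely and proceeds with the task.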
## Evaluation Process

### Step 1: Calculate Intuitive Estimate
Give your subjective confidence (0-100) based on:
- How well you understand the requirement
- Whether you can identify all necessary changes
- Clarity of success criteria
### Step 2: Calculate Structured Score (maximum 100 points)
Evaluate the requirement against these criteria:
#### Requirements Category (75 points)
| Criterion | Weight | Evaluation Questions |
|---|---|---|
| Unambiguous formulation | 20 | Is there only one way to interpret this? |
| Completeness (input/output/constraints) | 20 | Are all inputs, outputs, and constraints defined? |
| Verifiable result | 15 | Can completion be objectively measured? |
| Consistency with project | 10 | Does it conflict with existing requirements? |
| Rationale (source) | 5 | Is the reason for this requirement stated? |
| Technical feasibility | 5 | Is it achievable within constraints? |
#### Formatting Category (25 points)
| Criterion | Weight | Evaluation Questions |
|---|---|---|
| Structured prompt | 10 | Is it logically organized? |
| Explicit tasks | 7 | Is imperative language ("must"/"shall") used? |
| Result examples | 4 | Are concrete examples provided? |
| Decomposable | 4 | Can it be broken into subtasks? |
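The two scoring tables above can be represented as weight maps, with the structured score computed as a capped sum. A minimal sketch (the criterion keys and function name are illustrative):

```python
# Criterion weights taken from the two tables above; they sum to 100.
REQUIREMENTS_WEIGHTS = {
    "unambiguous": 20,
    "completeness": 20,
    "verifiable": 15,
    "consistency": 10,
    "rationale": 5,
    "feasibility": 5,
}
FORMATTING_WEIGHTS = {
    "structured_prompt": 10,
    "explicit_tasks": 7,
    "result_examples": 4,
    "decomposable": 4,
}

def structured_score(scores):
    """Sum awarded points, capping each criterion at its table weight."""
    weights = {**REQUIREMENTS_WEIGHTS, **FORMATTING_WEIGHTS}
    return sum(min(scores.get(name, 0), cap) for name, cap in weights.items())
```

A fully satisfied requirement scores 100; unlisted criteria default to 0 points.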
### Step 3: Calculate Final Confidence

```
confidence = (intuitive_estimate + structured_score) / 2
```
### Step 4: Compare with Threshold

If `confidence < threshold`:

- DO NOT proceed with implementation
- Return 1-3 clarifying questions
- Provide an example of a well-formed requirement

If `confidence >= threshold`:

- Proceed with the task
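Steps 3 and 4 amount to a simple average followed by a threshold check; a sketch with illustrative function names:

```python
def final_confidence(intuitive_estimate, structured_score):
    """Step 3: average the subjective and structured scores."""
    return (intuitive_estimate + structured_score) / 2

def should_proceed(intuitive_estimate, structured_score, threshold=85):
    """Step 4: implement only when confidence meets the threshold."""
    return final_confidence(intuitive_estimate, structured_score) >= threshold
```

For example, an intuitive estimate of 70 and a structured score of 74 yield a confidence of 72, which fails an 85 threshold and triggers clarifying questions instead of implementation.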
## Output Format

### When Confidence is Sufficient
Confidence Assessment: 87/100 (threshold: 85)

- Intuitive estimate: 85/100
- Structured score: 89/100
  - Requirements: 67/75
  - Formatting: 22/25

Proceeding with implementation...
### When Confidence is Insufficient
Confidence Assessment: 72/100 (threshold: 85)

- Intuitive estimate: 70/100
- Structured score: 74/100
  - Requirements: 55/75 (completeness criteria missing)
  - Formatting: 19/25 (no examples provided)
## Clarifying Questions
1. What are the expected input formats for this feature?
2. Should this handle edge cases like X, Y, Z?
## Improved Requirement Example
[Provide a rewritten version of the requirement with sufficient detail]
## References

See `iso_criteria.md` for detailed ISO/IEC/IEEE 29148:2018 criteria explanations, and `templates/clarifying_questions.md` for question templates by requirement type.