Study guide: Applied Volume Glossary

Lesson 3 of 5 in module «Applied Volume Glossary»

You are viewing the lesson without signing in. Sign in to save progress and take tests.

Topic: Applied Volume Glossary

Difficulty level: Medium

Estimated study time: 3-4 hours

Prerequisites: Knowledge of the basic terms from the first volume (QWEN.md, mission.md, requirements.md, plan.md, validation.md)

Understanding of the principles of Specification-Driven Development (SDD)

Basic experience with YAML/JSON and CI/CD pipelines

Learning objectives: Distinguish between classes of terms (prose terms, dual-spelling terms, technical names) and apply them correctly in documentation and code.

Describe the purpose and responsibility zones of agent roles: Verifier, Implementor, Safety, and Coordinator.

Explain the principles of protection against decoy metrics (anti-Goodhart) using paired metrics (silent_p0, manual_review_floor).

Design a file-based arbitration (tribunal) for resolving disputes during specification changes.

Apply mutation operators and stress specifications to validate the resilience of AI agents.

Overview: This study guide is dedicated to the glossary of the applied (second) volume of the textbook on Specification-Driven Development (SDD). Unlike the first volume, which introduces basic concepts, the second volume focuses on production refinements, incident management, and complex scenarios of AI agent interaction. The glossary systematizes the terms, artifacts, and metrics necessary for building a reliable infrastructure: from tiered routing and shadow specifications to file-based arbitration and antipatterns. It is recommended to use this material as a reference when designing a project constitution and CI gateways.

Key concepts: File-based arbitration (tribunal): A procedure for collective decision-making on a disputed amendment or incident. It involves agents (Verifier, Implementor, Safety) who vote according to a fixed protocol, and the Coordinator formalizes the final decision in judgment.md. It allows resolving specification conflicts formally and with history preservation (in precedents.md).

Project constitution (constitution.md): The central artifact that records immutable principles (immutable_principles) and mutable rules (mutable_rules). Mutable rules always have a time-to-live (ttl) and rollback conditions (rollback_condition).

Shadow specifications (shadow specs): Specifications for non-formalizable nuances (historical decisions, unspoken priorities) that are stored separately from the main requirements.md. They are validated through an 'auction' based on a scorebook.

Immunity and anti-Goodhart metrics: An approach to metric design in which each target KPI (e.g., MTTR) is paired with a counter-metric (guard metric). This protects the system from blind optimization: for example, MTTR cannot grow at the expense of increasing silent_p0 (the share of incidents without human confirmation).

Emergency mode (red button): A formal safety gateway in front of potentially dangerous actions in production (rollback, migration). It triggers only if all anti-Goodhart metrics are in the green zone.

Mutation testing of specifications: The use of mutation operators (Nullify, FutureTime, EscalationCycle) to deliberately introduce defects into 'poisoned specs'. The goal is to test the resilience of validators (immunity).

Tiered model routing: Distribution of tasks among models of different costs: 'cheap' local models (local-coder) handle routine work, while 'expensive' ones (frontier-reviewer) are engaged only for critical reviews and complex checks.

Practice exercises: Name: Classification of terminology

Problem: Before you is a list of terms used in the project: 1) evidence_ref, 2) tribunal, 3) blast radius, 4) audit_trace_coverage, 5) playbook. Sort them into the classes: 'prose term', 'dual-spelling term', 'technical name'. Explain how each of them should be written in prose documentation and in YAML files.

Solution: 1) evidence_ref — a prose term (in text: 'evidence reference', in YAML: evidence_ref). 2) tribunal — a technical name (in text: 'file-based arbitration', in code/folders: tribunal). 3) blast radius — a dual-spelling term (in text: 'blast radius (radius of consequences)' or simply blast radius, in code: blast_radius). 4) audit_trace_coverage — a technical name (in text: 'audit trace coverage', in metrics/YAML: audit_trace_coverage). 5) playbook — a prose term (in text always 'playbook').

Complexity: beginner

Name: Designing a mutable rule

Problem: The team decided to temporarily allow an AI agent to directly restart hung services without manual confirmation (the goal is to reduce MTTR). Formulate a mutable rule for this action, necessarily including all required fields from the project constitution.

Solution: Rule: 'Automatic restart of hung pods'. incident_type: service_hang pipeline_phase: auto_remediation permitted_actions: restart_pod max_scope: namespace_staging (first testing on staging, blast radius is limited) ttl: 72h (the rule is in effect for 3 days to collect statistics) rollback_condition: silent_p0 > 5% OR manual_review_rate < 10% (if auto-remediation starts leading to silent incidents or fully displaces humans).

Complexity: intermediate

Name: Analysis of antipatterns

Problem: During work, an AI agent in a cycle began asking clarifying questions about log formats and could not stop. Which antipattern has emerged? Write a condition in pseudocode to detect it and suggest a way to eliminate it.

Solution: This is the ask_storm antipattern. Detection condition: cycle_count > 0 && ask_storm >= 4 && escalation_path_resolved=false. Elimination method: The agent should be stopped by the constitution. It is necessary to introduce an explicit limit on the number of iterations (cycle_count) and require it to either apply a fallback strategy or escalate the problem to a human (escalation_path_resolved=true). The Verifier should reject cyclical specifications.

Complexity: advanced

Case studies: Name: Hidden risks of automation (The Goodhart Effect in SDD)

Scenario: In the educational project AgentClinic, the team introduced auto-remediation for handling appointment cancellations. The target metric MTTR (mean time to recovery) successfully decreased from 20 minutes to 45 seconds.

Challenge: During report analysis, it was discovered that the automation began massively deleting records without creating compensating notifications to patients. The MTTR metric looked great, but the number of user complaints (P0 incidents unnoticed by the automation) sharply increased.

Solution: The anti-Goodhart approach was applied. Paired metrics were introduced: silent_p0 (the share of P0-level incidents without an entry in the audit trace) and manual_review_floor (the minimum number of manual reviews). A rule was added to the CI/gateway: a reduction in MTTR is blocked if silent_p0 exceeds 2% or the share of manual reviews falls below 15%.

Result: The automation was configured to a stricter scenario. MTTR grew slightly (to 2 minutes), as some cases started going to manual review, but the number of 'silent' P0 incidents dropped to zero.

Lessons learned: Optimizing only one metric (MTTR) inevitably leads to system degradation in hidden areas.

Every automated action needs a paired counter-metric (guard metric).

Unconditional trust in metrics without accounting for the blast radius is dangerous.

Related concepts: Anti-Goodhart

silent_p0

manual_review_floor

Immunity metrics

Name: File-based arbitration of a conflicting specification

Scenario: It was required to update the routing logic for urgent patients. The AI agent (Implementor) proposed code that changes the priority (P2 -> P1), but at the same time affects the database backup configuration.

Challenge: A dispute arose. The Verifier noted a hidden out-of-scope change — the agent arbitrarily modified DB policies. The Implementor insisted that it was necessary for speed. The Safety role recorded a critical_risk due to working with the DB without a backup.

Solution: The file-based arbitration (tribunal) process was launched.

Safety imposed a veto based on immutable_principles (prohibition on editing DB policies without a human).
The Verifier rejected the code.
The Coordinator recorded the decision in judgment.md and created a precedent in precedents.md so that the agent would not repeat this mistake.

Result: The code was rewritten without going out of scope. A mandatory blast radius check field was added to the proposal.md procedure.

Lessons learned: The Safety role has the decisive vote (veto) in assessing critical risks, even if the other roles are 'for'.

Any change must be strictly limited by max_scope.

It is important to record precedents for training agents in the future.

Related concepts: File-based arbitration

hidden out-of-scope

Agent roles (Safety, Verifier)

precedents.md

Study tips: Do not try to memorize the entire glossary. Use the 'just-in-time' principle: open a term when it appears in a chapter or when filling out a specific artifact (e.g., constitution.md).

Divide terms into categories for memorization: 'What is written in prose in Russian' and 'What always remains English code (keys, YAML)'.

To understand immunity metrics, always associate them with antipatterns. Ask the question: 'How can the system game this metric?' — this will help you understand why paired guard metrics are needed.

When reading about file-based arbitration, mentally play out the roles: imagine you are a Verifier looking for errors, or Safety protecting data.

Additional resources: First volume glossary: Necessary for understanding the basic artifacts (QWEN.md, mission.md, tech-stack.md) on which the applied volume relies.

Educational project agentclinic (typescript/hono): A practical playground for applying terms in the context of a real domain (patient agents, therapies, appointments).

Examples of runnable scripts (python stdlib): The examples/ catalog in the course repository, where you can see the implementation of mutation operators and stress-mutator.

Summary: The applied volume glossary forms a common language for creating production-ready systems using AI agents (SDD). The key takeaway lies in the transition from simple code generation to strict management: the use of a project constitution (immutable/mutable rules), tiered routing for resource savings, stress testing of specifications, and protection against blind metric optimization (anti-Goodhart). Mastering these terms allows engineers to build safe, auditable, and failure-resilient development processes.