Acceptance

A stage does **not** have to be an “agent”: it may be human work, a conventional service, a batch job, or a hybrid. The platform does **not** mandate MCP between stages; use MCP, HTTPS, message buses, webhooks, shared storage, or any mix, as long as manifest.detail and each stage’s IO formats document the handoffs. Below are **acceptance suggestions** by stage_type—trim and encode them in acceptance_criteria per task.

  • **0 Reviewer**: Check every acceptance_criteria and harness_policy item; conclusions need evidence (logs, diffs, or spot checks). Reject “blanket pass” without proof.
  • **1 Agent (orchestrator)**: Verify decomposition coverage, acyclic dependencies, and structured outputs that downstream can parse and that match output_data_format; forbidden_actions violations fail the stage.
  • **2 Software**: Reproducible code/build (commit hash/tag/artifact); tests/linting as agreed; APIs/config match schemas; handle secrets and dependency risk per task policy.
  • **3 Hardware**: Firmware/wiring/BOM versions match specs; export logs/telemetry; run stress or environmental checks from the checklist.
  • **4 Data**: Licensing/compliance for sources; schema, volume, and redaction meet the brief; retention/deletion is traceable.
  • **5 DevOps**: Pipeline reruns cleanly; release artifacts and rollback work; least-privilege secrets; monitoring/alerts and runbooks as required.
  • **100 Workflow Designer**: Flow/state machine plus exception branches are complete; triggers, timeouts, and compensations for downstream stage_name are documented.
  • **101 Context Optimizer**: Compression/windowing is reproducible; spot checks show critical fields preserved; cost/latency within agreed bounds.
  • **102 MCP Provisioner**: MCP endpoints up with valid auth; tool/resource inventory matches docs; changes are auditable.
  • **103 GUI Operator**: Critical UI paths reproducible (script, recording, or screenshots); odd UI states have a handling plan.
  • **104 Prompt Architect**: Prompt/template versions tracked; variable table complete; edge-case spot checks meet expectations.
  • **105 Memory Retriever**: Retrieval quality metrics or spot QA pass; citations traceable; expiry/eviction works.
  • **106 Compliance / Guardrail**: Policy hit logs complete; false positive/negative rates within thresholds; audit trail end-to-end.
  • **107 Multimodal Parser**: Accuracy/error targets per modality; format tolerance and fallback paths validated.
  • **108 Model Router**: Routing rules and SLA (latency/cost/availability) met; degradation paths verified.
  • **1000 Custom**: Follow manifest.detail and custom acceptance_criteria; add quantified targets (performance, accuracy, availability) before sign-off.