Acceptance
A stage does **not** have to be an “agent”: it may be human work, a conventional service, a batch job, or a hybrid. The platform does **not** mandate MCP between stages; use MCP, HTTPS, message buses, webhooks, shared storage, or any mix, as long as manifest.detail and each stage’s IO formats document the handoffs. Below are **acceptance suggestions** by stage_type—trim and encode them in acceptance_criteria per task.
- **0 Reviewer**: Check every acceptance_criteria and harness_policy item; conclusions need evidence (logs, diffs, or spot checks). Reject “blanket pass” without proof.
- **1 Agent (orchestrator)**: Verify decomposition coverage, acyclic dependencies, and structured outputs that downstream can parse and that match output_data_format; forbidden_actions violations fail the stage.
- **2 Software**: Reproducible code/build (commit hash/tag/artifact); tests/linting as agreed; APIs/config match schemas; handle secrets and dependency risk per task policy.
- **3 Hardware**: Firmware/wiring/BOM versions match specs; export logs/telemetry; run stress or environmental checks from the checklist.
- **4 Data**: Licensing/compliance for sources; schema, volume, and redaction meet the brief; retention/deletion is traceable.
- **5 DevOps**: Pipeline reruns cleanly; release artifacts and rollback work; least-privilege secrets; monitoring/alerts and runbooks as required.
- **100 Workflow Designer**: Flow/state machine plus exception branches are complete; triggers, timeouts, and compensations for downstream stage_name are documented.
- **101 Context Optimizer**: Compression/windowing is reproducible; spot checks show critical fields preserved; cost/latency within agreed bounds.
- **102 MCP Provisioner**: MCP endpoints up with valid auth; tool/resource inventory matches docs; changes are auditable.
- **103 GUI Operator**: Critical UI paths reproducible (script, recording, or screenshots); odd UI states have a handling plan.
- **104 Prompt Architect**: Prompt/template versions tracked; variable table complete; edge-case spot checks meet expectations.
- **105 Memory Retriever**: Retrieval quality metrics or spot QA pass; citations traceable; expiry/eviction works.
- **106 Compliance / Guardrail**: Policy hit logs complete; false positive/negative rates within thresholds; audit trail end-to-end.
- **107 Multimodal Parser**: Accuracy/error targets per modality; format tolerance and fallback paths validated.
- **108 Model Router**: Routing rules and SLA (latency/cost/availability) met; degradation paths verified.
- **1000 Custom**: Follow manifest.detail and custom acceptance_criteria; add quantified targets (performance, accuracy, availability) before sign-off.