OMNISKILL v3.0 — 6-Layer Architecture

Overview

OMNISKILL is built as a 6-layer stack in which each layer has a single responsibility and strict boundaries. Control flows in layer order (bootstrap → agents → skills), and data flows back out toward the user (artifacts → pipelines → users). Layer 5, added in v3.0, wraps the entire runtime with enforced contracts: sessions, a policy engine, telemetry, and MCP trust routing.

┌─────────────────────────────────────────────────┐
│  Layer 5: RUNTIME CONTRACTS (v3.0)              │
│  Sessions, policy, telemetry, replay, MCP       │
├─────────────────────────────────────────────────┤
│  Layer 4: ARTIFACT LAYER                        │
│  Pipeline outputs, validated JSON, audit trails  │
├─────────────────────────────────────────────────┤
│  Layer 3: PIPELINE LAYER                        │
│  sdd │ ux │ debug │ skill-factory │ full-product │
├─────────────────────────────────────────────────┤
│  Layer 2: SKILL LAYER                           │
│  83 skills, each with manifest.yaml              │
├─────────────────────────────────────────────────┤
│  Layer 1: AGENT LAYER                           │
│  10 agents with guardrails & manifests           │
├─────────────────────────────────────────────────┤
│  Layer 0: BOOTSTRAP & DISCIPLINE                │
│  Hooks, synapses, anti-rationalization          │
└─────────────────────────────────────────────────┘

Layer 0: Bootstrap & Discipline

Directory: hooks/

Layer 0 fires before any agent starts. It establishes the cognitive discipline that keeps agents from skipping steps, rationalizing incomplete work, or drifting from the spec.

Hook System

Hook	File	Fires When
Session Start	`hooks/session_start.py`	Agent session begins
Pre-Step	`hooks/pre_step.py`	Before each pipeline step
Post-Step	`hooks/post_step.py`	After each pipeline step
On Failure	`hooks/on_failure.py`	Any step fails
On Deviation	`hooks/on_deviation.py`	Agent deviates from spec

Bootstrap Sequence

When any OMNISKILL session starts, the following sequence executes:

session_start hook
    │
    ├─► Load core synapses from synapses/
    │     ├── anti-rationalization.md
    │     ├── sequential-thinking.md
    │     └── metacognition.md
    │
    ├─► Inject anti-rationalization rules
    │     └── 10 Iron Laws activated
    │
    ├─► Inject sequential thinking protocol
    │     └── [THINKING] blocks required for complex tasks
    │
    └─► Inject metacognition synapse
          └── Complexity scaling activated

The bootstrap is non-negotiable — no agent can bypass it. The session_start.py hook validates that all required synapses exist and are syntactically valid before allowing the session to proceed.
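
The validation described above can be sketched roughly as follows. This is an illustration only, not the contents of hooks/session_start.py: the names `validate_synapses` and `BootstrapError` are hypothetical, and "syntactically valid" is assumed here to mean "readable, non-empty UTF-8 text", since the real check is not specified.

```python
from pathlib import Path

# File names taken from the bootstrap sequence above.
REQUIRED_SYNAPSES = (
    "anti-rationalization.md",
    "sequential-thinking.md",
    "metacognition.md",
)


class BootstrapError(RuntimeError):
    """Hypothetical error raised when the session must not proceed."""


def validate_synapses(synapse_dir: Path) -> list[Path]:
    """Return the required synapse paths, or raise if any is missing or empty."""
    problems = []
    found = []
    for name in REQUIRED_SYNAPSES:
        path = synapse_dir / name
        if not path.is_file():
            problems.append(f"missing: {name}")
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except UnicodeDecodeError:
            problems.append(f"not valid UTF-8: {name}")
            continue
        if not text.strip():
            problems.append(f"empty: {name}")
            continue
        found.append(path)
    if problems:
        # Report every problem at once instead of stopping at the first.
        raise BootstrapError("; ".join(problems))
    return found
```

Collecting all problems before raising is a design choice of this sketch: it lets a single failed start report every missing synapse.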

Why Layer 0 Exists

Without discipline enforcement, AI agents will:

- Skip steps they consider "unnecessary"
- Rationalize incomplete work as complete
- Deviate from specifications mid-task

Layer 0 prevents all three by making discipline a system property, not a suggestion.

Layer 1: Agent Layer

Directory: agents/

Ten specialized agents, each with a single role and an agent-manifest.yaml that defines its guardrails.

Agent	Role	Primary Skill Dependencies
`spec-writer`	Specification Architect	spec-writer skill
`implementer`	Implementation Engineer	implementer skill
`reviewer`	Compliance Reviewer	reviewer skill
`debugger`	Root-Cause Investigator	systematic-debugging
`ux-research`	UX Researcher	ux-research
`ui-design`	Visual Designer	ui-visual-design, frontend-design
`qa-master`	QA Engineer	qa-test-planner, e2e-testing-patterns
`context-curator`	Pipeline Handoff Curator	context-curator
`dissector`	Codebase Analyst	dissector skill

Agent Manifest Structure

Each agent's agent-manifest.yaml contains:

name: implementer
role: Implementation Engineer
guardrails:
  must-do:
    - Follow spec section by section
    - Write tests before implementation
    - Verify each section compiles before proceeding
  must-not:
    - Skip sections marked as required
    - Add features not in the spec
    - Modify existing tests without justification
  on-violation: halt-and-report
skills:
  - implementer
  - systematic-debugging
triggers:
  - "implement the spec"
  - "build from spec"

Agent ↔ Skill Relationship

Agents invoke skills; skills are never invoked directly by users. This separation ensures that skills always run within the guardrail context of an agent.

User Request
    │
    ▼
Agent (guardrails active)
    │
    ├─► Skill A (focused capability)
    ├─► Skill B (focused capability)
    └─► Skill C (focused capability)
    │
    ▼
Validated Output

Layer 2: Skill Layer

Directory: skills/

83 skills, each a self-contained capability with a manifest.yaml. Skills contain the actual domain knowledge — how to write React components, how to debug GDScript, how to design wireframes.

Skill Manifest Format

name: react-best-practices
description: React development guidelines with hooks, patterns, and optimization
version: 1.0.0
triggers:
  - React component
  - React hooks
  - React performance
platforms:
  - copilot-cli
  - cursor
  - claude-code
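
To illustrate how the triggers and platforms fields might be used, here is a minimal sketch. The `SkillManifest` dataclass mirrors the fields in the example above, but `matching_skills` and its case-insensitive substring matching are assumptions; the document does not describe how OMNISKILL actually matches triggers.

```python
from dataclasses import dataclass, field


@dataclass
class SkillManifest:
    # Fields mirror the manifest.yaml example above.
    name: str
    description: str
    version: str
    triggers: list[str] = field(default_factory=list)
    platforms: list[str] = field(default_factory=list)


def matching_skills(request: str, manifests: list[SkillManifest], platform: str) -> list[str]:
    """Names of skills available on `platform` with a trigger phrase in the request.

    Assumption: a trigger matches when it appears in the request, ignoring case.
    """
    text = request.lower()
    return [
        m.name
        for m in manifests
        if platform in m.platforms and any(t.lower() in text for t in m.triggers)
    ]
```

For example, a request such as "Refactor this react component" would select react-best-practices on cursor, but nothing on a platform the manifest does not list.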

Skill Bundles

Skills are grouped into installable bundles for common workflows:

godot-kit      → 5 Godot skills
web-dev-kit    → 5 web development skills
ux-design-kit  → 7 UX/UI skills
django-kit     → 4 Django skills
sdd-kit        → 6 spec-driven development skills
testing-kit    → 4 testing skills
mobile-kit     → 2 mobile skills
meta-kit       → 5 meta/tooling skills

Install a bundle: python scripts/install.py --bundle web-dev-kit

Layer 3: Pipeline Layer

Directory: pipelines/

Orchestrated workflows that chain agents together, with context curation between steps. The five pipelines documented here are listed under Available Pipelines.

Pipeline State Machine

A pipeline run moves from pending to validating (validate) and then to executing (execute). While executing, it loops over its steps; it can be paused, either on request or on a deviation, and later resumed. A run ends as completed (all steps done), failed (unrecoverable error), or cancelled (user cancel).

          ┌──────────┐
          │ pending   │
          └────┬──────┘
               │ validate
          ┌────▼──────┐
          │ validating │
          └────┬──────┘
               │ execute
          ┌────▼──────┐     ┌──────────┐
          │ executing  ├────►│  paused  │
          └────┬──────┘     └────┬─────┘
               │                 │ resume
          ┌────▼─────────────────▼─────┐
          │        step loop           │
          └────┬──────┬──────┬─────────┘
               │      │      │
          ┌────▼──┐ ┌─▼────┐ ┌▼─────────┐
          │completed│ │failed│ │cancelled │
          └────────┘ └──────┘ └──────────┘

Pipeline Execution

The PipelineExecutor in src/omniskill/core/pipeline_engine.py drives all pipelines. Each step:

1. Runs the pre_step.py hook (validates prerequisites)
2. Invokes the designated agent
3. Runs the post_step.py hook (validates outputs)
4. Curates context for the next step via context-curator
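
The four-part step cycle above can be sketched as a simple loop. This is not the real PipelineExecutor: `run_pipeline`, `PipelineRun`, `StepResult`, and the callback signatures are all hypothetical, chosen only to make the ordering of hooks, agent invocation, and curation explicit.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class StepResult:
    agent: str
    output: str


@dataclass
class PipelineRun:
    context: dict = field(default_factory=dict)
    results: list[StepResult] = field(default_factory=list)


def run_pipeline(
    agents: list[str],
    invoke_agent: Callable[[str, dict], str],
    pre_step: Callable[[str, dict], None],
    post_step: Callable[[str, str], None],
    curate: Callable[[dict, str], dict],
) -> PipelineRun:
    """Run each step in order: pre-hook, agent, post-hook, then context curation."""
    run = PipelineRun()
    for agent in agents:
        pre_step(agent, run.context)               # validate prerequisites
        output = invoke_agent(agent, run.context)  # the designated agent does the work
        post_step(agent, output)                   # validate outputs
        run.results.append(StepResult(agent, output))
        run.context = curate(run.context, output)  # keep only what the next step needs
    return run
```

In this sketch a hook signals failure by raising an exception, which stops the loop before the next agent runs; how the real hooks report failure is not stated in this document.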

Available Pipelines

Pipeline	Steps
sdd	spec-writer → context-curator → implementer → context-curator → reviewer
ux	research → context-curator → wireframe → context-curator → visual → review → handoff
debug	debugger → context-curator → implementer → tester → reviewer
skill-factory	prompt → spec → context-curator → implement → validate → review
full-product	ux-pipeline → context-curator → sdd-pipeline → testing

Layer 4: Artifact Layer

Pipeline outputs are validated and persisted as structured artifacts.

Artifact Validation

The ArtifactValidator checks every pipeline output against its schema:

- expected_artifacts: which files must exist
- required_sections: which headings must appear
- min_word_count: minimum content length
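
A minimal sketch of these three checks follows. The field names come from the list above, but `ArtifactSchema`, `validate_artifacts`, and the assumption that sections are Markdown-style `#` headings are illustrative; the real ArtifactValidator may work differently.

```python
import re
from dataclasses import dataclass, field
from pathlib import Path


@dataclass
class ArtifactSchema:
    expected_artifacts: list[str]
    required_sections: list[str] = field(default_factory=list)
    min_word_count: int = 0


def validate_artifacts(output_dir: Path, schema: ArtifactSchema) -> list[str]:
    """Return a list of human-readable problems; an empty list means valid."""
    problems = []
    for name in schema.expected_artifacts:
        path = output_dir / name
        if not path.is_file():
            problems.append(f"{name}: file missing")
            continue
        text = path.read_text(encoding="utf-8")
        # Assumption: sections are Markdown headings such as "## Requirements".
        headings = {
            h.strip().lower()
            for h in re.findall(r"^#+\s*(.+)$", text, re.MULTILINE)
        }
        for section in schema.required_sections:
            if section.lower() not in headings:
                problems.append(f"{name}: missing section '{section}'")
        words = len(text.split())
        if words < schema.min_word_count:
            problems.append(f"{name}: {words} words, need {schema.min_word_count}")
    return problems
```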

Persistence

Pipeline state is persisted at:

~/.copilot/.omniskill/pipeline-states/
    ├── sdd-pipeline-<id>.json
    ├── ux-pipeline-<id>.json
    └── ...

Each state file is human-readable JSON containing:

- Current step and status
- Accumulated decisions, constraints, and tech stack
- Artifact paths and validation results
- Thinking traces for audit
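
Saving and reloading such a state file might look like the sketch below. The `<pipeline>-<id>.json` naming follows the directory listing above; the function names, the key names used in any state dict, and the write-then-rename step are assumptions of this example rather than documented behavior.

```python
import json
import os
import tempfile
from pathlib import Path


def save_state(state_dir: Path, pipeline: str, run_id: str, state: dict) -> Path:
    """Write <pipeline>-<run_id>.json and return its path."""
    state_dir.mkdir(parents=True, exist_ok=True)
    target = state_dir / f"{pipeline}-{run_id}.json"
    # Write to a temp file first so a crash never leaves a half-written state file.
    fd, tmp_name = tempfile.mkstemp(dir=state_dir, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as handle:
        json.dump(state, handle, indent=2, sort_keys=True)
    os.replace(tmp_name, target)
    return target


def load_state(path: Path) -> dict:
    """Read a state file back so a pipeline can resume from its checkpoint."""
    return json.loads(path.read_text(encoding="utf-8"))
```

Indented output with sorted keys keeps the files easy to read and produces stable diffs under version control.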

Why JSON?

Artifacts are JSON (not binary) so that:

1. Humans can inspect pipeline state at any time
2. Agents can resume from any checkpoint
3. Version control can track changes
4. Debugging is straightforward: cat the state file

Cross-Layer Data Flow

User: "build feature X from scratch"
    │
    ▼
Layer 0: Bootstrap fires, synapses loaded
    │
    ▼
Layer 3: Pipeline engine selects sdd-pipeline
    │
    ├─► Layer 1: spec-writer agent activated
    │       └─► Layer 2: spec-writer skill invoked
    │              └─► Layer 4: spec artifact produced
    │
    ├─► Layer 1: context-curator curates handoff
    │
    ├─► Layer 1: implementer agent activated
    │       └─► Layer 2: implementer skill invoked
    │              └─► Layer 4: code artifact produced
    │
    ├─► Layer 1: context-curator curates handoff
    │
    └─► Layer 1: reviewer agent activated
            └─► Layer 2: reviewer skill invoked
                   └─► Layer 4: review report produced

In this flow, every transition between agents passes through the context-curator, so only relevant context propagates forward. This prevents context bloat and token waste.

Layer 5: Runtime Contracts (v3.0)

Directory: src/omniskill/core/ (v3 modules)

Layer 5 wraps the entire runtime with enforced contracts. No tool executes without a policy decision, no session transitions without state machine validation, and no completion claim without evidence.

Session Lifecycle

The SessionManager enforces an 8-state lifecycle with strict transition rules. Invalid transitions raise InvalidTransitionError.

stateDiagram-v2
    [*] --> created
    created --> active : activate
    active --> waiting_tool : wait_for_tool
    active --> waiting_permission : wait_for_permission
    active --> idle : idle
    active --> error : fail
    active --> archived : complete
    waiting_tool --> active : resume
    waiting_permission --> active : resume
    idle --> active : resume
    error --> recovering : recover
    recovering --> active : resume

Every event is logged with a correlation ID that links sessions to pipeline traces.
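
The transition rules in the state diagram can be encoded as a lookup table. The sketch below is an assumption about shape, not the real SessionManager: the state names and InvalidTransitionError come from this section, while the `Session` class, the `ALLOWED` table, and the `transition` method are hypothetical.

```python
from enum import Enum


class SessionState(Enum):
    CREATED = "created"
    ACTIVE = "active"
    WAITING_TOOL = "waiting_tool"
    WAITING_PERMISSION = "waiting_permission"
    IDLE = "idle"
    ERROR = "error"
    RECOVERING = "recovering"
    ARCHIVED = "archived"


class InvalidTransitionError(Exception):
    """Raised when a transition is not in the allowed table."""


S = SessionState
# One entry per state; each maps to the states reachable from it in the diagram.
ALLOWED = {
    S.CREATED: {S.ACTIVE},
    S.ACTIVE: {S.WAITING_TOOL, S.WAITING_PERMISSION, S.IDLE, S.ERROR, S.ARCHIVED},
    S.WAITING_TOOL: {S.ACTIVE},
    S.WAITING_PERMISSION: {S.ACTIVE},
    S.IDLE: {S.ACTIVE},
    S.ERROR: {S.RECOVERING},
    S.RECOVERING: {S.ACTIVE},
    S.ARCHIVED: set(),  # terminal: no outgoing transitions
}


class Session:
    def __init__(self) -> None:
        self.state = S.CREATED
        self.history = [S.CREATED]

    def transition(self, target: SessionState) -> None:
        """Move to `target`, or raise if the diagram has no such edge."""
        if target not in ALLOWED[self.state]:
            raise InvalidTransitionError(f"{self.state.value} -> {target.value}")
        self.state = target
        self.history.append(target)
```

Note that a session in error cannot return to active directly; it has to pass through recovering first.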

Central Policy Engine

The PolicyEngine gates every tool invocation through 4 checks:

1. Schema validation: tool arguments are checked against registered schemas
2. Permission rules: evaluated in order, first match wins
3. Trust tier precedence: builtin > verified > community > untrusted
4. Decision artifact: a machine-readable PolicyDecision with rationale

Default action is deny — tools must have an explicit allow rule. Denied decisions are queryable from the audit log.
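
The "first match wins, default deny" behavior can be sketched as below. Only the tier names and the PolicyDecision name come from this section. The `Rule` shape, glob-style tool patterns, the example tool names, and the reading of trust tiers as a minimum required tier per rule are assumptions, and schema validation is left out for brevity.

```python
from dataclasses import dataclass
from fnmatch import fnmatch

# Trust tiers from lowest to highest, per the precedence listed above.
TRUST_ORDER = ["untrusted", "community", "verified", "builtin"]


@dataclass(frozen=True)
class Rule:
    tool_pattern: str  # glob such as "fs.*" (hypothetical naming scheme)
    min_trust: str     # lowest tier this rule applies to (assumed semantics)
    action: str        # "allow" or "deny"


@dataclass(frozen=True)
class PolicyDecision:
    tool: str
    action: str
    rationale: str


def decide(tool: str, trust: str, rules: list[Rule]) -> PolicyDecision:
    """Evaluate rules in order; the first match wins; no match means deny."""
    for index, rule in enumerate(rules):
        pattern_ok = fnmatch(tool, rule.tool_pattern)
        trust_ok = TRUST_ORDER.index(trust) >= TRUST_ORDER.index(rule.min_trust)
        if pattern_ok and trust_ok:
            return PolicyDecision(tool, rule.action, f"matched rule {index}: {rule.tool_pattern}")
    return PolicyDecision(tool, "deny", "no matching rule (default deny)")
```

Because evaluation stops at the first match, a specific deny rule placed before a broad allow rule takes effect even for the highest trust tier.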

Telemetry & Replay

The TelemetryCollector normalizes all events to versioned envelopes:

TelemetryEnvelope:
  envelope_id: tel-xxxxxxxxxxxx
  schema_version: 3.0.0
  event_type: policy_decision | session_start | ...
  correlation_id: corr-xxxxxxxxxxxx
  source: {component, session_id, pipeline_name, step_name}
  payload: {...}
  retention_class: standard | audit

The ReplayHarness captures session snapshots and compares checksums for determinism — timestamps are excluded so structure-only comparison works across environments.
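
One way to get a timestamp-independent checksum is to strip volatile fields and hash a canonical JSON encoding. The sketch below assumes timestamps live under a key named `timestamp` and uses SHA-256; neither detail is stated for the real ReplayHarness, and the function names are hypothetical.

```python
import hashlib
import json

# Assumption: timestamps are stored under this key at any nesting depth.
VOLATILE_KEYS = {"timestamp"}


def strip_volatile(value):
    """Recursively drop volatile keys from nested dicts and lists."""
    if isinstance(value, dict):
        return {k: strip_volatile(v) for k, v in value.items() if k not in VOLATILE_KEYS}
    if isinstance(value, list):
        return [strip_volatile(v) for v in value]
    return value


def snapshot_checksum(events: list[dict]) -> str:
    """SHA-256 over a canonical JSON encoding of the events, minus volatile keys."""
    canonical = json.dumps(strip_volatile(events), sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()
```

Sorting keys makes the checksum independent of dict insertion order, so two runs that differ only in timestamps or key order compare equal.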

MCP Trust Routing

The MCPConnectorManager routes to MCP servers by capability and trust tier:

- Connectors register with a trust tier and a set of capabilities
- Routing selects the highest-trust healthy connector for a capability
- Unhealthy connectors are excluded automatically
- Routing is deterministic (same inputs → same output)
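
These routing rules can be sketched in a few lines. The `Connector` dataclass and `route` function are hypothetical, the tier names are assumed to be the same four used by the policy engine, and breaking ties by connector name is this example's way of guaranteeing determinism; the real MCPConnectorManager may use a different tie-breaker.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: MCP routing uses the same four tiers as the policy engine.
TRUST_RANK = {"builtin": 3, "verified": 2, "community": 1, "untrusted": 0}


@dataclass(frozen=True)
class Connector:
    name: str
    trust: str
    capabilities: frozenset
    healthy: bool = True


def route(capability: str, connectors: list[Connector]) -> Optional[Connector]:
    """Pick the highest-trust healthy connector; ties break on name."""
    candidates = [c for c in connectors if c.healthy and capability in c.capabilities]
    if not candidates:
        return None
    # Sort key: higher trust first, then alphabetical name, so registration
    # order never changes the result.
    return min(candidates, key=lambda c: (-TRUST_RANK[c.trust], c.name))
```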

v3 Contract Schemas

Six new schemas define the wire format for all v3 contracts:

Schema	Purpose
`session.schema.yaml`	Session lifecycle states and transitions
`tool-invocation.schema.yaml`	Tool call with required policy decision
`permission.schema.yaml`	Permission rules with trust tiers
`hook-event.schema.yaml`	Normalized hook bus events
`telemetry-envelope.schema.yaml`	Versioned telemetry format
`context-handoff.schema.yaml`	Phase handoff with pinned constraints and evidence

Release Gates

The ReleaseGateValidator validates 6 hard gates before any release:

Gate	What It Checks
SchemaAndContracts	All v3 schemas present and version 3.x
PolicyAndSecurity	Policy engine and permission schema present
ReplayDeterminism	Telemetry module and replay tests present
ContextIntegrity	Handoff schema enforces pinned_constraints and evidence_links
PromptQuality	Prompt files present, schema validator functional
MigrationReadiness	Migration dry-run passes with zero blockers

All six gates must pass, and the weighted score must be at least 90, for a GO recommendation.
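
The GO rule can be expressed as a small function. The gate names and the threshold of 90 come from this section; everything else is assumed for illustration, including the per-gate `score` and `weight` fields, the use of a weighted mean, and the "NO-GO" label. The real ReleaseGateValidator's weights are not documented here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GateResult:
    name: str
    passed: bool
    score: float   # 0-100, hypothetical per-gate score
    weight: float  # hypothetical weight


def recommend(results: list[GateResult], threshold: float = 90.0) -> str:
    """GO only if every gate passes and the weighted mean score meets the threshold."""
    total_weight = sum(r.weight for r in results)
    if not results or total_weight <= 0:
        return "NO-GO"
    weighted = sum(r.score * r.weight for r in results) / total_weight
    all_passed = all(r.passed for r in results)
    return "GO" if all_passed and weighted >= threshold else "NO-GO"
```

Both conditions are required: a high weighted score does not compensate for a single failed gate.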

v3 is fully backward compatible — all v2 skills, agents, pipelines, and schemas continue to work without modification. See the Migration v3 Guide for upgrade details.