MACP Determinism and Replay Integrity

Status: Non-normative (explanatory). In case of conflict, the referenced RFC is authoritative. Reference: RFC-MACP-0003

Overview

MACP guarantees structural replay integrity: replaying identical accepted Envelope sequences under identical conditions MUST produce identical state transitions. This property is essential for auditability, debugging, and provenance verification.

Core Guarantee

MACP provides this structural guarantee:

Replaying identical accepted Envelope sequences under identical:

macp_version

Mode identifier and Mode version

Configuration version(s)

MUST produce identical state transitions (OPEN → RESOLVED or OPEN → EXPIRED).

What is Deterministic in MACP Core

Structural Determinism

MACP Core guarantees deterministic structural behavior:

Session State Transitions: Same message sequence = same final state (RESOLVED vs EXPIRED)
Message Acceptance: Same envelope sequence = same acceptance/rejection decisions
Session Lifecycle: Same SessionStart + same TTL + same messages = same lifecycle progression
Idempotency: Same message_id within a session always produces idempotent acceptance

What MACP Core Does NOT Guarantee

MACP Core does NOT define determinism for:

Semantic outcomes: What the "correct" decision is (Mode-defined)
External side effects: Tool invocations, API calls (Mode/application responsibility)
Timestamp values: timestamp_unix_ms is informational only
Ordering across sessions: Cross-session message ordering is undefined

Replay Scenarios

1. Audit and Compliance Replay

Use Case: Verify that a past coordination session followed correct procedures.

Requirements:

Preserve session log (all accepted Envelopes in order)
Record Mode version and configuration version
Replay messages in identical order
Verify final state matches historical record

Example:

Original session:
  SessionStart → Proposal → Vote → Vote → Commitment → RESOLVED

Replay:
  SessionStart → Proposal → Vote → Vote → Commitment → RESOLVED ✓

2. Debugging Replay

Use Case: Reproduce a bug or unexpected outcome.

Requirements:

Capture full session log
Capture Mode version, configuration version, MACP version
Replay in test environment
Compare state transitions and final outcomes

3. Forensic Replay

Use Case: Investigate security incident or disputed outcome.

Requirements:

Immutable audit log
Timestamp correlation with external logs
Verify message authenticity (signatures if used)
Check for unauthorized messages or tampering

Versioning for Determinism

macp_version

All Envelopes in a session MUST use the same macp_version.

Major version changes: May alter core state transition logic
Minor version changes: SHOULD be backward compatible
Replay requirement: Use same MACP runtime version for replay

Mode Version

Modes MUST include explicit version identifiers:

Via mode field (e.g., macp.mode.decision.v1)
Via mode_version in SessionStartPayload
In Commitment payloads (for attribution)

Replay requirement: Use same Mode version for replay

Configuration Version

Sessions MAY reference configuration versions:

configuration_version in SessionStartPayload
policy_version in CommitmentPayload
External configuration (e.g., approval thresholds, quorum sizes)

Replay requirement: Use identical configuration for replay

Example Version Attribution in Commitment

{
  "commitment_id": "d4f9c5c0-3c7b-4b70-9f0c-2d57f6a3b1a2",
  "action": "vendor.select",
  "reason": "Vendor B meets security + budget constraints",
  "mode_version": "1.0.0",
  "policy_version": "org-procurement-2026.02",
  "configuration_version": "runtime-2026.03.01"
}

This allows replay verification: "Under these exact versions, this outcome is deterministic."

Mode-Level Determinism

Modes MAY claim stronger determinism guarantees beyond MACP Core.

Deterministic Modes

A Mode is deterministic if:

Same accepted messages → same semantic outcome
No dependence on non-deterministic sources (time, randomness, external I/O)
All decision logic is pure and versioned

Example: A voting mode where:

3-of-5 quorum required
Outcome is "approved" if ≥3 votes are "yes"
Deterministic: Same votes always produce same outcome

Non-Deterministic Modes

A Mode is non-deterministic if:

Outcome depends on wall-clock time beyond TTL
Uses randomness (e.g., random selection)
Depends on external I/O (API calls, database state)

Example: A mode that:

Calls an external risk scoring API
API returns different scores over time
Non-deterministic: Replay may produce different outcome

Determinism Claims in Mode Descriptors

Mode descriptors SHOULD declare determinism claims:

{
  "mode": "macp.mode.decision.v1",
  "determinism_class": "semantic-deterministic"
}

Sources of Non-Determinism to Avoid

Time-Based Logic (Beyond TTL)

Problem: Different wall-clock times during replay.

Anti-pattern:

if (currentTime > deadline) {
  return "expired";
}

Better: Use session TTL, which is part of SessionStart

External I/O

Problem: External state may change between original run and replay.

Anti-pattern:

let userProfile = await fetchUserProfile(userId);
if (userProfile.riskScore > 50) {
  return "reject";
}

Better: Bind necessary context at SessionStart — reference it via context_id (an opaque identifier the runtime preserves but never interprets) or carry protocol-specific bytes in extensions

Randomness

Problem: Different random values on replay.

Anti-pattern:

let selectedAgent = agents[Math.floor(Math.random() * agents.length)];

Better: Use deterministic selection (e.g., hash-based, first in list, etc.)

Iteration Order of Unordered Collections

Problem: Hash maps, sets may have different iteration order.

Anti-pattern:

for (let agent in agentMap) {  // Order undefined
  processVote(agentMap[agent]);
}

Better: Sort before iteration

let sortedAgents = Object.keys(agentMap).sort();
for (let agentId of sortedAgents) {
  processVote(agentMap[agentId]);
}

Handling External Side Effects

Tool Invocations

If a Mode coordinates tool execution:

Option 1 (Deterministic): Don't invoke tools during coordination; emit Commitment with tool execution plan
Option 2 (Idempotent): Include transaction IDs in tool calls; make tools idempotent
Option 3 (Logged): Log tool results as Mode messages; replay uses logged results

External Transactions

For financial transactions, database writes, etc.:

Use saga patterns with compensating transactions
Include external transaction IDs in Commitment
Ensure external systems support idempotency keys

Replay Integrity Verification

Verification Steps

Load session log: All accepted Envelopes in order
Verify versions: macp_version, mode, mode_version, configuration_version match
Replay messages: Feed messages to MACP runtime in same order
Compare state transitions: OPEN → RESOLVED/EXPIRED must match
Compare terminal message: the terminal Commitment (or mode-defined terminal message) must match (if deterministic Mode)

Cryptographic Verification (Optional)

For high-assurance scenarios:

Sign Envelopes: Agents sign messages (include signature in metadata)
Hash chain: Each message includes hash of previous message
Session hash: Final session state includes hash of all messages

Example structure:

{
  "session_id": "...",
  "final_state": "RESOLVED",
  "message_count": 5,
  "session_hash": "sha256:abcd1234...",
  "commitment_hash": "sha256:ef567890..."
}

Testing Determinism

Unit Tests for Modes

Test that same inputs produce same outputs:

test('deterministic voting', () => {
  let session1 = runSession([vote1, vote2, vote3]);
  let session2 = runSession([vote1, vote2, vote3]);
  expect(session1.outcome).toEqual(session2.outcome);
});

Replay Tests

Test that replaying a session produces identical state:

test('session replay', () => {
  let originalSession = runSession(messages);
  let replayedSession = replaySession(messages);
  expect(replayedSession.state).toEqual(originalSession.state);
  expect(replayedSession.finalMessage).toEqual(originalSession.finalMessage);
});

Fuzzing

Fuzz test message sequences to find non-deterministic edge cases:

for (let i = 0; i < 1000; i++) {
  let messages = generateRandomMessageSequence();
  let run1 = runSession(messages);
  let run2 = runSession(messages);
  expect(run1.state).toEqual(run2.state);
}

Limitations and Trade-offs

Determinism vs. Real-Time Responsiveness

Fully deterministic systems may sacrifice real-time responsiveness:

Cannot use wall-clock deadlines
Cannot react to external events dynamically

Trade-off: Decide based on use case. Financial coordination may require full determinism; incident response may prioritize responsiveness.

Determinism vs. External Context

Including all external context in SessionStart can lead to large payloads.

Trade-off: Balance between self-contained sessions and reliance on external lookups.

Best Practices

Version everything: Mode, policy, configuration
Include context in SessionStart: Minimize external dependencies
Make tools idempotent: Use transaction IDs, support replay
Log external results: Capture API responses in session messages
Test replay regularly: Ensure determinism holds as Modes evolve
Document determinism claims: Be explicit about what is/isn't deterministic
Use immutable logs: Prevent tampering with historical sessions