Skip to content

Chain Rehydration

Overview

When a Sub-Chain node restarts or detects a divergent state (local hash ≠ DB hash), it rehydrates its in-memory chain from the persistent storage backend. This ensures consistency after crashes, restarts, or network partitions.

The DB is always the authoritative source of truth. If local state diverges, it is discarded and rebuilt from DB.

Flow Diagram

sequenceDiagram
    autonumber
    participant SC as 📦 SubChain
    participant OS as ⚙️ OrderingService
    participant DB as 💾 Storage Backend

    Note over SC: Startup OR divergence detected

    rect rgb(0, 0, 0, 0)
        Note over SC: Phase 1 — Detect Divergence
        SC->>SC: sync_chain()
        SC->>OS: get_latest_block()
        OS->>DB: Query latest persisted block
        DB-->>OS: latest_block (index, hash)
        OS-->>SC: latest_block_os
    end

    rect rgb(0, 0, 0, 0)
        Note over SC: Phase 2 — Comparison & Rehydration
        SC->>SC: Compare: local_latest.index vs db_latest.index

        alt Local == DB (same index + same hash)
            SC->>SC: Already up-to-date. No-op.
        else Local < DB (node missed blocks during downtime)
            SC->>DB: get_blocks_from_db(start_index=0)
            DB-->>SC: All blocks []
            SC->>SC: Acquire write lock
            SC->>SC: Clear local chain, reset all counters
            loop For each block from DB
                SC->>SC: chain.append(block)
                SC->>SC: _update_event_statistics(block)
            end
            SC->>SC: Release write lock
            SC->>OS: Reset block_history & blocks_created
        else Local > DB OR hash mismatch
            SC->>SC: Log WARNING: divergent state detected
            SC->>DB: Force full rehydration from DB
        end
    end

    SC->>SC: _reset_ordering_service_state()
    Note over SC: Chain now consistent with storage

Divergence Scenarios

Scenario	Detection	Action
Cold start	`local.index == 0`	Load all blocks from DB
Crash recovery	`local.index < db.index`	Append missing blocks only
Hash mismatch	`local.hash != db.hash` (same index)	Full rehydration from DB
Already in sync	`local.index == db.index AND hash matches`	No-op
Local ahead of DB	`local.index > db.index`	WARNING — DB is authoritative; force rehydrate

Step-by-Step Breakdown

Step	Description
1. Trigger	Node startup calls `sync_chain()`, or `auto_sync` timer fires
2. DB query	`OrderingService.get_latest_block()` fetches the latest persisted block from storage
3. Comparison	Compare local chain `(index, hash)` with DB `(index, hash)`
4. Up-to-date	If identical: skip. Normal operations resume immediately
5. Partial sync	If `local < db`: load only the delta blocks. Acquires write lock to prevent race conditions
6. Full rehydration	If hash mismatch or local ahead: discard local, rebuild entirely from DB
7. Statistics reset	`_update_event_statistics()` rebuilds `entity_event_index` from each block
8. OrderingService reset	Sync block counters between chain and ordering service

Error Handling

Condition	Behavior
DB read fails during rehydration	Exception logged, retry on next sync interval
Write lock held too long	Timeout after `lock_timeout` seconds; critical alert via Risk Alerts
`entity_event_index` inconsistent after rehydration	Full index rebuild triggered
Storage backend unreachable	Node enters read-only mode; new events queued but not persisted

Key Classes & Methods

Step	Class / Method	File
Entry point	`SubChain.sync_chain()`	`hierarchical/sub_chain.py`
Latest DB block	`OrderingService.get_latest_block()`	`consensus/ordering/service.py`
Load all blocks	`storage.get_blocks_from_db()`	`adapters/database/sqlite_adapter.py`
Rebuild index	`SubChain._update_event_statistics()`	`hierarchical/sub_chain.py`
Reset OS counters	`SubChain._reset_ordering_service_state()`	`hierarchical/sub_chain.py`

Error Mitigation — rollback triggers rehydration when snapshot fails
System Integrity Validation — validates chain consistency after rehydration