
Architecture

Overview

HomeCore is built as a set of independent Rust crates wired together in the homecore binary. All device communication flows through MQTT. All runtime state flows through two internal event buses.

Physical devices
        │
        ▼
MQTT broker (rumqttd, embedded)
        │  homecore/devices/{id}/state (retained)
        ▼
hc-mqtt-client ───(Event::MqttMessage)────────────────► internal_bus
        │
        ▼
state_bridge ───── reads redb, computes diff ─────────► pub_bus
        │          (Event::DeviceStateChanged)
        ▼
RuleEngine
┌──────────────┐
│ DashMap cache│
│ trigger match│
│ conditions   │
│ actions      │
└──────────────┘
        │
        ├─► pub_bus.publish(RuleFired)
        ├─► MQTT cmd topics
        └─► Notify / CallService

Dual event bus

The core runtime carries two EventBus instances — both are tokio::broadcast channels wrapping Event.

| Bus | Populated by | Contains | Consumed by |
|---|---|---|---|
| internal_bus | hc-mqtt-client | Event::MqttMessage (every raw MQTT packet) | state_bridge, rule engine (for MqttMessage triggers) |
| pub_bus | state_bridge, scheduler, managers | Typed events: DeviceStateChanged, RuleFired, Custom, SystemStarted, DeviceAvailabilityChanged, ModeChanged, TimerStateChanged | Rule engine, API WebSocket broadcaster, hc-api event log |

Why two buses?

The internal_bus carries raw MQTT traffic — high volume, low-level. The pub_bus carries semantically enriched events. Separating them lets the rule engine subscribe efficiently to both without mixing protocols. The MqttMessage trigger reads internal_bus; all other triggers read pub_bus.

EventBus implementation

#[derive(Clone)]
pub struct EventBus {
    tx: broadcast::Sender<Event>,
}

impl EventBus {
    pub fn new(capacity: usize) -> Self { ... }
    pub fn subscribe(&self) -> broadcast::Receiver<Event> { ... }
    pub fn publish(&self, event: Event) -> Result<()> { ... }
}

Capacity defaults to 512 for pub_bus and 1024 for internal_bus. Slow consumers that fall behind receive RecvError::Lagged(n) — the engine logs a warning and continues rather than blocking the fast path.

Engine subscription

The engine runs a tokio::select! loop receiving from both buses:

let mut internal_rx = self.internal_bus.subscribe();
let mut pub_rx = self.pub_bus.subscribe();

loop {
    tokio::select! {
        biased;
        _ = shutdown.changed() => break,

        result = pub_rx.recv() => match result {
            // DeviceStateChanged, Custom, RuleFired, etc.
            Ok(event) => handle_pub_event(event).await,
            Err(broadcast::error::RecvError::Lagged(n)) => tracing::warn!("pub_bus lagged by {n} events"),
            Err(broadcast::error::RecvError::Closed) => break,
        },
        result = internal_rx.recv() => match result {
            // MqttMessage — only for MqttMessage triggers
            Ok(event) => handle_internal_event(event).await,
            Err(broadcast::error::RecvError::Lagged(n)) => tracing::warn!("internal_bus lagged by {n} events"),
            Err(broadcast::error::RecvError::Closed) => break,
        },
    }
}

The biased selector ensures the shutdown signal is always checked first.

raw_bus on AppState

AppState carries two EventBus handles, with semantic meaning that must be respected when adding new background tasks:

| Field | Production value | Subscribers |
|---|---|---|
| event_bus | pub_bus (typed events only) | event log, WS event stream, plugin-registry listener, plugin-offline injector, metrics counter |
| raw_bus | internal_bus (Event::MqttMessage only) | plugin-stream SSE bridge, terminal observer, StreamCache populator |

When adding a new background task in hc-api that filters Event::MqttMessage, subscribe to state.raw_bus. When watching for typed events (PluginCapabilities, RuleFired, DeviceStateChanged, etc.), subscribe to state.event_bus.
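
As a rough sketch of that guidance (the task name and the exact Event::MqttMessage fields are assumptions, not the real API), a raw-traffic consumer looks roughly like this:

```rust
// Hypothetical hc-api background task: it only cares about raw MQTT traffic,
// so it subscribes to state.raw_bus. A typed-event consumer would subscribe
// to state.event_bus instead.
fn spawn_raw_topic_observer(state: AppState) {
    tokio::spawn(async move {
        let mut rx = state.raw_bus.subscribe();
        // (a production task would also handle RecvError::Lagged and keep going)
        while let Ok(event) = rx.recv().await {
            // Field names on Event::MqttMessage are assumed for illustration.
            if let Event::MqttMessage { topic, payload, .. } = event {
                if topic.starts_with("homecore/plugins/") {
                    tracing::debug!("plugin traffic on {topic}: {} bytes", payload.len());
                }
            }
        }
    });
}
```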

In production these are distinct channels (pub_bus vs internal_bus). Test harnesses that use a single merged bus get them populated as clones of the same handle — AppState::new defaults raw_bus = event_bus.clone(), while AppState::new_with_plugins_and_raw_bus takes them separately.

Subscriber-spawn timing

tokio::broadcast does not replay history to late subscribers — messages published before a subscriber existed are never delivered to it. Listeners that need to see retained messages (capability manifests in particular) must therefore be spawned before the publisher fires: spawn_plugin_registry_listener must run before the plugins spawn, and core.start() (which spawns state_bridge) must run before mqtt_client.run(). Both are wired correctly today, but keep the ordering in mind when reordering main.rs.
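
The behaviour is easy to reproduce in isolation — a minimal, self-contained sketch:

```rust
// tokio::broadcast delivers only messages sent after subscribe(); there is no replay.
use tokio::sync::broadcast;

#[tokio::main]
async fn main() {
    // Keep the initial receiver alive so send() has at least one subscriber.
    let (tx, _initial_rx) = broadcast::channel::<&str>(16);

    tx.send("sent before subscribe").unwrap();   // a late subscriber never sees this

    let mut late_rx = tx.subscribe();            // subscribes after the first send
    tx.send("sent after subscribe").unwrap();

    assert_eq!(late_rx.recv().await.unwrap(), "sent after subscribe");
}
```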


Plugin streaming substrate

Plugins emit live progress for long-running actions ("include Z-Wave device", "pair Hue bridge") through a frozen six-stage protocol — progress, item, awaiting_user, warning, complete, error, plus core-injected synthetic canceled and timeout. Events flow on plain MQTT topics:

homecore/plugins/{plugin_id}/commands/{request_id}/events

The plugin SDK's StreamContext handles the envelope shape and the retain-then-clear-on-terminal lifecycle; see the capabilities page for the spec.

SSE bridge (handlers::get_plugin_stream_sse)

The Leptos admin client opens an EventSource against GET /api/v1/plugins/:id/command/:rid/stream. The handler is deliberately on the public router (browsers can't set Authorization on EventSource) and accepts ?token=<jwt> or Authorization: Bearer for programmatic clients. Required scope is plugins:read.

Inside, the handler subscribes to state.raw_bus, filters for Event::MqttMessage on the target topic, and forwards each one to the client as an event: stream SSE chunk. The stream closes on the first terminal stage.
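
A rough shape of that filter-and-forward loop (the Event::MqttMessage fields and the handler plumbing are assumptions; terminal-stage detection is omitted for brevity):

```rust
use axum::response::sse::{Event as SseEvent, Sse};
use futures::Stream;
use std::convert::Infallible;
use tokio_stream::{wrappers::BroadcastStream, StreamExt};

// Turn raw_bus into an SSE stream for one stream topic. Each matching MQTT
// payload is forwarded as an `event: stream` chunk.
fn plugin_stream_sse(
    raw_bus: EventBus,
    stream_topic: String,
) -> Sse<impl Stream<Item = Result<SseEvent, Infallible>>> {
    let rx = raw_bus.subscribe();
    let events = BroadcastStream::new(rx).filter_map(move |item| match item {
        Ok(Event::MqttMessage { topic, payload, .. }) if topic == stream_topic => Some(Ok(
            SseEvent::default()
                .event("stream")
                .data(String::from_utf8_lossy(&payload).into_owned()),
        )),
        _ => None, // other topics and Lagged errors are simply skipped
    });
    Sse::new(events)
}
```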

StreamCache

Fast streaming actions can finish emitting all events before an HTTP client manages to receive the request_id and open the SSE connection. The retained-MQTT clear-on-terminal contract makes the broker unhelpful here: by the time the client subscribes, the bridge has already wiped the retained event so there's nothing to replay.

streaming::StreamCache solves this by mirroring stream events in-process. spawn_stream_cache_populator(raw_bus, cache) subscribes to raw_bus and appends every stream-topic event to a per-request_id ring buffer (capped at 256 events; entries garbage-collected 60 seconds after their terminal stage). When the SSE handler opens, it:

  1. Subscribes to raw_bus first (so it doesn't miss live events).
  2. Reads the cached snapshot for this request_id.
  3. Replays cached events to the client.
  4. Forwards live events, deduping by ts against the snapshot.

This keeps the retained last event as the resilience floor and adds a short replay window on top: late subscribers see the full history they missed, then catch up to live.

Terminal observer + timeout injection

streaming::spawn_terminal_observer subscribes to raw_bus and releases the StreamingRegistry slot the moment a terminal stage lands on any stream topic. This is what makes concurrency: "single" enforcement work — a second invocation of the same action sees the slot freed as soon as the first one finishes.

streaming::schedule_timeout arms a per-request deadline derived from the manifest's timeout_ms. If the deadline fires before the plugin emits a terminal, core publishes a synthetic stage: "timeout" event onto the stream topic so the SSE consumer gets a clean terminal.


State bridge (state_bridge.rs)

The state bridge is the translation layer between raw MQTT and the typed event world.

Flow for each incoming MQTT message:

  1. Receive Event::MqttMessage from internal_bus
  2. Match topic against homecore/devices/{device_id}/state (or /state/partial)
  3. Parse JSON payload
  4. Apply ecosystem router transforms (if a matching profile is loaded)
  5. Read current device state from redb (StateStore)
  6. Compute changed — the set of attributes whose values actually differ
  7. Write new state to redb
  8. Only if !changed.is_empty(): publish Event::DeviceStateChanged to pub_bus

The guard in step 8 is critical for startup performance. On restart, the MQTT broker replays retained messages for all registered devices. Without the guard, every retained message would publish a spurious DeviceStateChanged even when the stored state is identical — causing the rule engine to evaluate all rules for every device at startup (O(devices × rules) work per restart).
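
A minimal sketch of steps 5–8 (the attribute-map shape and store API are assumptions; the diff-and-guard logic is the point):

```rust
use serde_json::Value as JsonValue;
use std::collections::HashMap;

// Step 6: collect the attributes whose values actually differ from the stored state.
fn compute_changed(
    stored: &HashMap<String, JsonValue>,
    incoming: &HashMap<String, JsonValue>,
) -> Vec<String> {
    incoming
        .iter()
        .filter(|(key, value)| stored.get(*key) != Some(*value))
        .map(|(key, _)| key.clone())
        .collect()
}

// Step 8 (the guard): a retained-message replay with identical state yields an
// empty `changed`, so no DeviceStateChanged is published and no rules run.
// if changed.is_empty() { return; }
```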

Topic routing — state vs state/partial

The router matches against (parts[3], parts.get(4)) rather than just parts[3] because topic.split('/') puts state and partial into adjacent parts. A naïve comparison of parts[3] against "state/partial" never fires — every per-attribute partial publish would silently route through the full-replace branch and wipe device.attributes. The routing is now:

| Topic | parts[3] | parts[4] | Handler |
|---|---|---|---|
| …/state | "state" | None | handle_state(partial=false) (full replace) |
| …/state/partial | "state" | Some("partial") | handle_state(partial=true) (merge) |
| …/availability | "availability" | None | handle_availability |
| …/schema | "schema" | None | handle_device_schema |
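
As a self-contained sketch of that decision (the Route enum is illustrative; the real bridge calls the handlers from the table directly):

```rust
#[derive(Debug, PartialEq)]
enum Route {
    StateFull,
    StatePartial,
    Availability,
    Schema,
    Other,
}

// homecore / devices / {device_id} / state [/ partial]
//    [0]        [1]        [2]       [3]      [4]
fn route(topic: &str) -> Route {
    let parts: Vec<&str> = topic.split('/').collect();
    match (parts.get(3).copied(), parts.get(4).copied()) {
        (Some("state"), None) => Route::StateFull,
        (Some("state"), Some("partial")) => Route::StatePartial,
        (Some("availability"), None) => Route::Availability,
        (Some("schema"), None) => Route::Schema,
        _ => Route::Other,
    }
}

// route("homecore/devices/lamp-7/state/partial") == Route::StatePartial
```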

Partial merge uses apply_partial_merge_patch: a JSON null value in the patch deletes that attribute from the stored state. Plugins emitting per-attribute updates must filter null newValues if they don't intend the attribute to be removed.
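
The semantics are those of a JSON merge patch over the attribute map — a minimal sketch (the real apply_partial_merge_patch signature may differ):

```rust
use serde_json::Value as JsonValue;
use std::collections::HashMap;

// A null value in the patch deletes the attribute; any other value overwrites it.
fn apply_partial_merge_patch(
    attributes: &mut HashMap<String, JsonValue>,
    patch: HashMap<String, JsonValue>,
) {
    for (key, value) in patch {
        if value.is_null() {
            attributes.remove(&key);
        } else {
            attributes.insert(key, value);
        }
    }
}
```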

Availability handling

Availability topics (homecore/devices/{id}/availability) are also handled by the bridge. They publish Event::DeviceAvailabilityChanged { device_id, available } to pub_bus.


Rule engine (engine.rs)

In-memory device cache

The engine never reads redb during condition evaluation. Instead, it maintains an Arc<DashMap<String, HashMap<String, JsonValue>>> (device_id → attributes) that is:

  • Pre-populated at startup from the state store via spawn_blocking
  • Updated synchronously on every DeviceStateChanged event before rule evaluation begins

This means DeviceState conditions resolve in ~10 µs (DashMap lookup) rather than ~2–5 ms (redb + spawn_blocking).
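
A sketch of the lookup path (the helper name is illustrative; the cache type matches the description above):

```rust
use dashmap::DashMap;
use serde_json::Value as JsonValue;
use std::{collections::HashMap, sync::Arc};

type DeviceCache = Arc<DashMap<String, HashMap<String, JsonValue>>>;

// DeviceState condition lookup: two hash lookups, no disk I/O, no spawn_blocking.
fn device_attr(cache: &DeviceCache, device_id: &str, attribute: &str) -> Option<JsonValue> {
    cache.get(device_id)?.get(attribute).cloned()
}
```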

RwLock early release

The rules Arc<RwLock<Vec<Rule>>> is held only long enough to clone the current rule list into a local snapshot. All trigger matching and condition evaluation run against the snapshot after the lock is released. Hot-reload never blocks rule evaluation.

// Hold lock briefly, clone snapshot
let rules_snapshot = {
    let guard = self.rules.read().await;
    guard.clone()
};
// Lock released here — hot-reload can now proceed
for rule in &rules_snapshot {
    evaluate_rule(rule, &event, &device_cache).await;
}

Fire history ring buffer

The engine records the last 500 evaluation attempts for every rule in Arc<DashMap<Uuid, VecDeque<RuleFiring>>>. Each RuleFiring contains:

  • timestamp — when the rule was evaluated
  • trigger_type — which trigger variant fired
  • trigger_context — the event data (device_id, attribute, value, etc.)
  • outcome — Fired, ConditionFailed, Cooldown, Paused, RequiredExpressionFailed, TriggerGateFailed, or Skipped
  • conditions — per-condition trace with actual, expected, and reason
  • actions — per-action outcome trace
  • eval_ms — time spent evaluating conditions

The ring buffer is pre-populated at startup from the database so history survives restarts. The API exposes it via GET /api/v1/automations/{id}/history.

ExecutorContext

Each rule firing creates an ExecutorContext that carries all state needed by the action executor:

pub struct ExecutorContext {
    pub rule_id: Uuid,
    pub state: StateStore,
    pub publish: Option<PublishHandle>,
    pub notify: Option<Arc<NotificationService>>,
    pub event_bus: Option<EventBus>, // pub_bus
    pub device_cache: Arc<DashMap<...>>,
    pub delay_registry: Arc<DashMap<String, Arc<tokio::sync::Notify>>>,
    pub rule_vars: Arc<DashMap<(Uuid, String), JsonValue>>,
    pub priv_bools: Arc<DashMap<(Uuid, String), bool>>,
    pub capture_store: Arc<DashMap<(Uuid, String), HashMap<...>>>,
    pub hub_vars: Arc<DashMap<String, JsonValue>>,
    pub trigger_context: TriggerContext,
}

The executor is pure async Rust — it does not call back into the engine. Actions that need to publish to MQTT do so via publish: PublishHandle. Actions that need to emit events do so via event_bus.

Concurrency model

  • Each rule firing is dispatched as a tokio::spawn task (non-blocking from the select loop).
  • An Arc<AtomicUsize> (in_flight) tracks running tasks for graceful shutdown.
  • Per-rule run_mode (Parallel, Single, Restart, or Queued with optional max_queue) uses a per-rule Arc<AtomicUsize> to enforce the policy.
  • Delay actions yield their task without blocking other firings.
  • Parallel { actions } runs sub-actions via tokio::join! within the same task.
  • Cancellable delays register a tokio::sync::Notify in delay_registry keyed by a label; CancelDelays looks up and triggers the notify.
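
A sketch of that last mechanism (the registry shape is taken from ExecutorContext above; the function names are illustrative):

```rust
use dashmap::DashMap;
use std::{sync::Arc, time::Duration};
use tokio::sync::Notify;

type DelayRegistry = Arc<DashMap<String, Arc<Notify>>>;

// A Delay action sleeps until either the timer elapses or its Notify fires.
async fn cancellable_delay(registry: &DelayRegistry, label: String, duration: Duration) {
    let notify = Arc::new(Notify::new());
    registry.insert(label.clone(), notify.clone());
    tokio::select! {
        _ = tokio::time::sleep(duration) => {}   // ran to completion
        _ = notify.notified() => {}              // cancelled early
    }
    registry.remove(&label);
}

// CancelDelays looks up the label and wakes the sleeping delay.
fn cancel_delays(registry: &DelayRegistry, label: &str) {
    if let Some(entry) = registry.get(label) {
        entry.notify_one();
    }
}
```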

Graceful shutdown

When the shutdown watch::Receiver<bool> fires:

  1. The select loop exits.
  2. The engine waits up to drain_timeout_secs (default 10 s) for in_flight to reach zero.
  3. Any tasks still running after the timeout are abandoned (tokio will drop them).

Scheduler (scheduler.rs)

The scheduler runs a 1-minute tick loop and evaluates TimeOfDay, SunEvent, Cron, Periodic, and CalendarEvent triggers. It publishes Event::SchedulerTick to pub_bus — the engine handles it like any other event.

Solar times are computed locally from the [location] lat/lon config using the sunrise crate. No cloud API is called.

Catch-up on restart: At startup the scheduler checks all enabled time-based rules against a configurable window (catchup_window_minutes, default 15). Any trigger whose computed time falls within (now - window, now] fires immediately.
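
A sketch of the window check (chrono types are assumed for illustration):

```rust
use chrono::{DateTime, Duration, Utc};

// Fire at startup if the trigger's computed time fell inside (now - window, now].
fn should_catch_up(trigger_time: DateTime<Utc>, now: DateTime<Utc>, window_minutes: i64) -> bool {
    trigger_time > now - Duration::minutes(window_minutes) && trigger_time <= now
}
```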


Managers

Four subsystems run as independent tokio tasks spawned from Core::start(). The device managers subscribe to internal_bus for MQTT commands and publish to pub_bus for state changes. The PluginManager monitors plugin heartbeats and handles management commands.

| Manager | Device prefix | Purpose |
|---|---|---|
| TimerManager | timer_ | Countdown timer devices with start/pause/resume/cancel/restart commands |
| SwitchManager | switch_ | Virtual on/off boolean switches |
| ModeManager | mode_ | Solar modes (mode_night, mode_day) + named boolean modes from modes.toml |
| PluginManager | — | Per-plugin supervisor, heartbeat monitoring, start/stop/restart, exponential backoff |

All managers persist their state to redb via StateStore so state survives restarts.


PluginManager

The PluginManager supervises managed plugins as independent processes. It runs as a tokio task spawned from Core::start().

Per-plugin supervisor

Each managed plugin gets its own supervisor task that:

  1. Spawns the plugin process
  2. Monitors its heartbeat via homecore/plugins/{id}/heartbeat
  3. Handles start/stop/restart commands
  4. Applies exponential backoff on crashes

MQTT management channel

| Topic | Direction | Purpose |
|---|---|---|
| homecore/plugins/{id}/heartbeat | Plugin → Core | Liveness signal (30–60 s) |
| homecore/plugins/{id}/manage/cmd | Core → Plugin | Management commands |
| homecore/plugins/{id}/manage/response | Plugin → Core | Command responses |

Available commands: ping, get_config, set_config, set_log_level.

Timeout sweep

The PluginManager runs a periodic sweep checking heartbeat timestamps. Plugins that have not sent a heartbeat within 90 seconds are marked offline and a plugin_offline event is published to pub_bus.
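
A sketch of the sweep (the 90-second threshold is from the text above; the timestamp map shape is an assumption):

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

// Return the plugin ids whose last heartbeat is older than the offline threshold.
fn find_offline(last_heartbeat: &HashMap<String, Instant>, now: Instant) -> Vec<String> {
    last_heartbeat
        .iter()
        .filter(|(_, seen)| now.duration_since(**seen) > Duration::from_secs(90))
        .map(|(id, _)| id.clone())
        .collect()
}
```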

REST API

  • GET /api/v1/plugins/:id — plugin status and details
  • PATCH /api/v1/plugins/:id — update plugin metadata
  • POST /api/v1/plugins/:id/start — start a managed plugin
  • POST /api/v1/plugins/:id/stop — stop a managed plugin
  • POST /api/v1/plugins/:id/restart — restart a managed plugin
  • GET /api/v1/plugins/:id/config — read plugin configuration
  • PUT /api/v1/plugins/:id/config — push configuration changes

Rhai scripting boundary

Rhai scripts run synchronously inside tokio::task::spawn_blocking to avoid blocking the async runtime. The boundary is explicit:

  • Condition evaluation (ScriptExpression): sync Rhai call, returns bool
  • Action scripts (RunScript): sync Rhai call, collects side effects (device state changes, MQTT publishes, notifications) into a Vec, then applies them asynchronously after the script returns
  • Topic mapper transforms: sync Rhai call, returns a Dynamic value for payload reshaping

The hc-scripting crate exposes the Rhai engine with the sync feature enabled. The engine is reused across evaluations (not recreated per call), keeping per-evaluation overhead low.
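
A minimal sketch of that boundary for a ScriptExpression condition (the function name and error handling are illustrative):

```rust
use std::sync::Arc;

// The Rhai call itself is synchronous; it runs on the blocking pool so it never
// stalls the async runtime, and only the resulting bool crosses back.
async fn eval_script_condition(engine: Arc<rhai::Engine>, script: String) -> bool {
    tokio::task::spawn_blocking(move || {
        engine.eval::<bool>(&script).unwrap_or_else(|err| {
            tracing::warn!("script condition failed: {err}");
            false // a failing script counts as "condition not met"
        })
    })
    .await
    .unwrap_or(false) // join error (the blocking task panicked)
}
```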


Module map

| File | Responsibility |
|---|---|
| src/main.rs | Binary entry point: parse config, wire all crates, start Core |
| crates/hc-core/src/lib.rs | Core builder, EventBus definition, start() wiring |
| crates/hc-core/src/engine.rs | Rule evaluation, DashMap cache, fire history, RuleEngine::run() |
| crates/hc-core/src/executor.rs | Action dispatch, ExecutorContext, all action type implementations |
| crates/hc-core/src/state_bridge.rs | MQTT→DeviceStateChanged translation, redb writes |
| crates/hc-core/src/scheduler.rs | Time/solar/cron triggers, catch-up on restart |
| crates/hc-core/src/timer_manager.rs | Virtual timer devices |
| crates/hc-core/src/switch_manager.rs | Virtual switch devices |
| crates/hc-core/src/mode_manager.rs | Solar + boolean mode devices |
| crates/hc-core/src/plugin_manager.rs | Plugin supervisor, heartbeat monitoring, management commands |
| crates/hc-core/src/rule_loader.rs | RON rule file loading, UUID write-back, hot-reload watcher |
| crates/hc-mqtt-client/src/lib.rs | rumqttc client, internal_bus publisher, PublishHandle |
| crates/hc-state/src/lib.rs | redb device registry, SQLite history, StateStore |
| crates/hc-api/src/lib.rs | axum router, WebSocket broadcaster, OpenAPI |
| crates/hc-api/src/handlers.rs | All REST handler functions |
| crates/hc-topic-map/src/lib.rs | EcosystemRouter, profile loading, apply_field_map, Rhai transforms |
| crates/hc-auth/src/lib.rs | JWT issuance/validation, Argon2id passwords, MQTT credentials |
| crates/hc-scripting/src/lib.rs | Rhai engine setup, sandboxing, ScriptRuntime |
| crates/hc-notify/src/lib.rs | NotificationService, Pushover/email/Telegram delivery |
| crates/hc-broker/src/lib.rs | rumqttd embedded broker startup, TLS config |