The Shared Substrate

How intelligence has always coordinated — and why AI needs to learn the same lesson
Mike Monteith, Founder & Chief Architect
March 2026

Part I: The Oldest Pattern in the World

Somewhere in southern France, about 40,000 years ago, a hunter walked into a cave and painted a bison on the wall.

We don’t know the hunter’s name. We don’t know if the painting was art or instruction or prayer. But we know what happened next: other humans entered that cave, saw the painting, and changed their behavior because of it. They hunted differently. They moved differently. They knew something they hadn’t known before — not because anyone told them, but because someone had left a mark on a shared surface.

That mark is the oldest technology in the world. Older than the wheel. Older than fire kept in a hearth. A mark left on a shared environment that changes the behavior of whoever finds it next.

In 1959, a French entomologist named Pierre-Paul Grassé watched termites build cathedral-like structures — some taller than a human — without a blueprint, without a leader, and without any termite communicating with any other termite. Each termite simply deposited a pheromone-laced pellet of mud. That deposit changed the local environment. The next termite to arrive sensed the change and responded — adding its own pellet in a way that was stimulated by the marks already there.

No termite knew the plan. The plan emerged from the accumulated marks on the shared substrate.

Grassé coined a word for this: stigmergy. From the Greek stigma (sign, mark) and ergon (work). Work incited by a sign left in the environment.

But here’s what Grassé missed, or at least didn’t say: he didn’t discover something unique to termites. He named the fundamental coordination mechanism of all intelligence. Including ours.


Part II: Forty Thousand Years of Better Blackboards

The blackboard is the medium. It has always been the medium.

Every advance in human coordination technology is the same story: someone built a better blackboard. Not a better message. Not a better messenger. A better shared surface for leaving marks. And each new blackboard solved one limitation of the previous one — while leaving others for the next generation.

~40,000 BC
The Cave Wall
Persistent — the pigment lasted millennia. But you had to go to the wall. The knowledge couldn’t come to you. A hunter in the next valley had no idea the bison painting existed.
Gained: persistence. Missing: portability.
3100 BC
Clay Tablets & Scrolls
A scribe in a Sumerian grain warehouse records the harvest. Months later, another scribe reads it and adjusts planting. They coordinate across time through the shared record. But a copy took nearly as long as the original.
Gained: portability. Missing: replication at scale.
1440 AD
Gutenberg’s Press
The first replication protocol. Luther didn’t coordinate the Reformation by sending personal letters. He modified a shared substrate — printed theses, spread by people he’d never meet. But the page was one-way.
Gained: replication. Missing: bidirectional mutation.
1977
The Bulletin Board & Sticky Note
Stigmergy in its purest modern form. Kanban boards, wikis, shared docs — the individual modifies the medium, others observe and respond. Bidirectional but structureless.
Gained: bidirectional mutation. Missing: structure, ordering, audit.
1970s – Present
Enterprise Systems
SAP maintains a shared substrate: procurement, warehouse, finance, accounting — all reading and writing to a shared environment. Coordination emerges. But enterprise systems are siloed blackboards.
Gained: structure, persistence, transactions. Missing: unity, semantic understanding.
2004 – Present
Social Networks
Stigmergy at planetary scale. Billions coordinating without explicit coordination. But misinformation propagates as efficiently as truth. No coherence guarantee.
Gained: planetary scale, real-time propagation. Missing: coherence, auditability, safety, truth.
2026
AIONdb — The Formal Stigmergic Substrate
Transactional coherence. Temporal ordering. Cryptographic auditability. Physical safety. Neural-symbolic unity. Every property, nothing left out.
All properties gained. Nothing missing.

The pattern across 40,000 years is not the marks. It’s the blackboard. Each generation builds a better shared surface. Each generation solves the last generation’s limitation. And each generation leaves a gap for the next.

The question for our generation is: what properties does the blackboard need now?


Part III: Where AI Went Wrong

In 2023, the AI industry discovered that large language models could use tools, follow multi-step instructions, and act as autonomous agents. The obvious next question was: how do you get multiple AI agents to work together?

The industry’s answer was orchestration.

LangChain, AutoGen, CrewAI, and dozens of similar frameworks took the same approach: a central coordinator receives a task, decomposes it into subtasks, assigns each subtask to a specialized agent, passes messages between them, and assembles the result. An explicit control flow. A state machine. A conductor leading an orchestra.

This is the equivalent of putting a termite queen in the middle of the colony and having her issue individual building instructions to each worker. Nature tried centralized coordination. It works for small colonies. It collapses at scale.

The failure modes are predictable, because they’re the same failure modes that have plagued centralized coordination systems for decades:

One point of failure. If the orchestrator crashes, every agent stops. The work in progress is lost. There is no graceful degradation — just a dead state machine and a pile of half-finished tasks.

Cascading timeouts. Agent A calls Service B, which calls Service C. Service C is slow. Agent A times out. The orchestrator retries Agent A, which queues behind its own still-pending first attempt. The system enters a death spiral of retries and timeouts that amplifies the original failure.

Fragile coupling. Adding a new agent requires modifying the orchestration logic. The agents can’t discover each other’s work. They can’t adapt to a changed environment. They are not autonomous in any meaningful sense — they are subroutines with natural-language interfaces, waiting to be called.

The distributed systems community solved these problems decades ago. Carl Hewitt’s Actor Model (1973). David Gelernter’s tuple spaces (1985). CRDTs. Partition-tolerant consensus protocols. The academic literature is full of robust, proven alternatives to centralized coordination.

But the AI agent ecosystem ignored all of it and reinvented the 1990s enterprise service bus.

The reason is cultural. The people building AI agent frameworks come from the machine learning community, not the distributed systems community. They think in terms of prompts and chains, not in terms of shared state and partition tolerance. They are solving the wrong problem — “how do I wire these agents together?” — when the right problem is “how do I create a shared environment that lets agents coordinate without being wired together at all?”

The right problem is stigmergy.


Part IV: What a Formal Stigmergic Substrate Requires

The cave wall was a blackboard without portability. The tablet was a blackboard without scale. The printing press was a blackboard without bidirectionality. The sticky note was a blackboard without structure. The enterprise database was a blackboard without unity. Social networks were a blackboard without trust.

AI agents operating on physical infrastructure — monitoring hospitals, managing power grids, controlling water treatment plants, coordinating autonomous vehicles — need the next blackboard. And this time, nothing can be left out.

Transactional coherence. When an agent writes a mark to the substrate, every other agent must see a consistent snapshot — never a half-written observation, never a partially updated state. In a hospital, a pharmacy agent checking medication interactions cannot see the discharge order before the attending physician’s notes have been committed. The substrate must make reality coherent before anyone sees it.

In academic terms, this is the problem that Yoshua Bengio identified in his 2019 NeurIPS keynote on “System 2 Deep Learning.” He argued that deep learning excels at fast, reactive pattern matching (System 1, in Daniel Kahneman’s framework) but lacks the infrastructure for slow, deliberate reasoning (System 2) — and critically, that the two systems need to share state without corrupting each other. He described the need. He didn’t describe the mechanism that enforces it.

Temporal ordering. Marks must be globally ordered, not just locally timestamped. When three agents write observations within the same millisecond, the substrate must establish a definitive sequence. In a power grid, the order in which events are recorded determines the causal chain that explains a failure. “What happened first?” is not a question you can answer with wall-clock timestamps on independent machines.

Cryptographic auditability. Every mark must be signed, traceable, and tamper-evident. When a regulator asks “who knew what, and when?” — the answer must be a mathematical proof, not a log file. A chain of cryptographic hashes, each block signed with a digital signature, producing an audit trail that can be independently verified by anyone with standard cryptographic libraries. This is not a nice-to-have for regulated industries. It is the minimum condition for deploying AI in healthcare, finance, energy, and transportation.

Physical safety. When an AI agent writes a mark that triggers physical actuation — opening a valve, switching a breaker, adjusting a dosage — the substrate must independently verify that the physical world is ready before the command reaches the wire. Not by asking the AI if it’s sure. Not by checking a software guardrail. By physically probing the target device at the application protocol layer and confirming it is in a state to safely receive the command.

This is the lesson of the Therac-25 radiation therapy machine, which killed three patients in the 1980s because its designers removed the hardware safety interlocks and trusted software alone. It’s the lesson of the Boeing 737 MAX, which killed 346 people because its automated system trusted a single sensor without physical cross-verification. Software guardrails on software systems are necessary but not sufficient. The containment must be deterministic and physical.

Neural-symbolic unity. The substrate must hold both structured knowledge (what happened — the facts, the relationships, the sequences) and semantic understanding (what it means — the concepts, the patterns, the implications) in the same transactional boundary. When a fact is deleted — a patient invokes their right to be forgotten under GDPR — every neural embedding derived from that fact must physically disappear in the same transaction. Not eventually. Not in a batch job overnight. Atomically. Because the law doesn’t recognize “eventually consistent” as compliance.

This is the problem that Gary Marcus has been articulating for a decade: pure neural systems hallucinate because they manipulate statistical patterns without grounding in structured reality. And it’s the problem that Henry Kautz taxonomized in his 2020 AAAI address, where he identified six patterns of neuro-symbolic integration — none of which described a system where both layers share a transactional substrate and mutations in one cascade to the other.

No previous blackboard — not cave walls, not enterprise databases, not social networks — has provided all five of these guarantees simultaneously. This is the engineering problem I’ve spent 18 years solving.


Part V: The Guardian Cycle

AIONdb is a knowledge graph — a database that stores relationships between things as subject-verb-object triples, queryable through the W3C standard language SPARQL. The name is a play on words: AI-ON-DB. The intelligence doesn’t call the database. It lives on it.

But it is built on a radical architectural choice that makes everything else possible.

Every string that enters the system — every name, every relationship, every value — is converted to a 32-bit integer at the boundary and never touched as a string again until output. A triple is 12 bytes: three integers. The entire query engine operates on integer comparisons. No memory allocation in the critical path. No garbage collection pauses. No cache thrashing.

This single decision — made in 2008, refined across two companies and three codebases — is why AIONdb processes 795,000+ results per second on a laptop. Not a cloud cluster. A laptop.

But speed alone doesn’t make a stigmergic substrate. What makes AIONdb a substrate is the Guardian Cycle: a three-phase execution model that enforces the coherence guarantees that stigmergy requires.

01 — TICK
Ingest & Sequence
New information arrives — a sensor reading, an agent’s observation, a human correction. The system assigns a monotonically increasing sequence number via lock-free WAL broadcast and writes to a durable journal. The new information has an identity and is permanent.
02 — GUARDIAN
Evaluate & Enforce
The phase that doesn’t exist in any other system. The database becomes invisible. The Guardian evaluates every rule against new information: threshold violations, drug interactions, contradictions. Truth is made coherent before anyone sees it.
03 — TOCK
Notify & Propagate
Every observer is notified simultaneously. Pharmacy agents, insurance validators, LLM observers, human dashboards — all see the same coherent snapshot at the same sequence number. A slow observer never blocks a fast one.
04 — ACT
Respond & Recurse
Any observer’s response enters as a new Tick, creating a recursive loop: reality flows in, truth is made coherent, and coherent truth flows out. The cycle repeats. Coordination emerges from the substrate.

In Kahneman and Bengio’s framework, the Guardian is the mechanism that ensures System 1 (fast reactive processing) settles into a consistent state before System 2 (slow deliberate reasoning) observes it. It is the consistency barrier that Bengio’s architecture requires but does not specify.

On March 12, 2026, we demonstrated this with three frontier AI models — Claude, GPT, and Gemini — coordinating through the shared substrate without an orchestrator, without APIs between them, and without any awareness of each other’s existence. Each model wrote observations. Each model read the others’ observations. Coordination emerged from the marks on the shared environment.

Grassé’s termites, with transactional guarantees and a cryptographic audit trail.


Part VI: Why This Matters Now

Enterprise AI value is concentrating at four layers of the stack. I’ve come to think of them as choke points — not in the constraint sense, but in the sense that whoever owns each layer captures the value that flows through it:

System of Record
Who owns the enterprise’s operational data
Control Plane
Who governs, catalogs, and permissions that data
Execution Layer
Who makes structured data usable for AI models
Accelerated Infrastructure
Who makes it all fast enough to matter

Most companies in the AI ecosystem occupy one of these choke points and partner for the rest. A model vendor (Anthropic, OpenAI, Google) provides the execution layer but doesn’t own the data. A data platform (Snowflake, Databricks) owns the storage but doesn’t govern it for AI agents. A cloud provider (AWS, Azure) provides infrastructure but not the semantic layer.

The system of record is the deepest of the four. NVIDIA’s Jensen Huang made the case at GTC 2026: “This is the structured data, the ground truth of business. This is the ground truth of enterprise computing.” Models don’t reason their way to truth; they get coupled to it. Whoever owns the substrate where that coupling happens owns the value.

AIONdb occupies all four in a single system. The knowledge graph is the system of record. The Guardian Cycle is the control plane — governance enforced as physics, not policy. The LLM participant is the execution layer — models living inside the substrate as participants, not calling it from outside as clients. And the integer-interned, zero-allocation architecture is the accelerated infrastructure — 795,000+ results per second without a GPU in sight.

This isn’t a strategy. It’s a consequence of the architecture. When you start from the premise that intelligence coordinates through a shared substrate, and you build the substrate to be coherent, auditable, fast, and safe — you end up occupying the entire stack. Because the stack was never supposed to be separated in the first place.


Part VII: The Three Problems That Disappear

AI Amnesia

Every enterprise running AI pilots today has the same unspoken problem: the AI forgets everything between sessions. Each conversation starts from scratch. Each agent deployment loses its accumulated knowledge when the model restarts, the API times out, or the vendor changes.

The industry is trying to solve this at the model layer — longer context windows, memory features, persistent threads. But expanding the context window to simulate memory is a fundamental architectural error. It forces the enterprise to re-compute its entire institutional history on every inference call, paying a compute tax to the model provider every time. It is the most expensive possible implementation of “remembering.” And it is a vendor lock-in trap: your institutional knowledge becomes inseparable from the specific model’s proprietary weights. Switch vendors and the memory disappears.

Memory is not a model capability. Memory is infrastructure.

In a stigmergic substrate, memory is the environment itself. An agent writes an observation — “the compressor in Room 302 is showing vibration patterns consistent with bearing failure.” That observation is a triple in the graph, durable, queryable, signed. When the agent restarts — or is replaced by a completely different model from a different vendor — the knowledge is still there. The new agent reads the substrate and knows what the old one knew.

The intelligence lives in the substrate, not in the model. You can swap the model the way you swap a light bulb — the room’s wiring doesn’t change.

AI Fragility

Three AI agents coordinate a hospital patient discharge: pharmacy review, insurance validation, transport scheduling. In today’s orchestration frameworks, they’re wired together through a central coordinator. When the insurance company’s API goes down — which it does, regularly — the entire pipeline crashes. The patient waits. A human has to manually restart the workflow.

In a stigmergic substrate, these three agents don’t know each other exist. The pharmacy agent watches the graph for discharge orders and writes its review. The insurance agent watches for pharmacy clearances and writes its authorization. The transport agent watches for authorizations and schedules the ride. They coordinate through the shared environment, exactly as termites coordinate through pheromone marks.

When the insurance agent’s network goes down, the pharmacy and transport agents keep working on other patients. They don’t notice. They don’t need error handling code or retry logic. When the insurance agent’s network returns, it reads the write-ahead log to catch up on what it missed and completes its work.

Zero lines of coordination code. Zero points of failure. The substrate is the coordinator.

AI Liability

An AI agent analyzing power grid load is tricked by a prompt anomaly — the 2026 equivalent of a phishing attack — into confidently recommending that a high-voltage breaker be opened. It fires the command.

In every AI system deployed today, that command reaches the wire. The guardrails are software: prompt engineering, constitutional AI, reinforcement learning from human feedback. Software checking software. The industry has confused “semantic alignment” with “physical safety.” RLHF and constitutional AI can stop a model from generating a toxic word, but they are entirely useless at stopping a hallucinating agent from opening a high-voltage breaker. This is the Therac-25 architecture — and we know how that ended.

In AIONdb, between the AI’s command and the physical wire sits a protocol-specific probe. For a Modbus-controlled breaker, this is a Read Holding Registers command (function code 0x03) that interrogates the physical device. The probe discovers that the breaker is in maintenance mode — a human contractor is working on it. The ReadinessFlag stays locked. The command is rejected at the hardware boundary before it reaches the wire. The AI’s hallucination never becomes a physical action.

The dashboard flashes red: ACTUATION BLOCKED. A cryptographically signed violation record is written to the audit ledger. The AI reads its own failure and adapts its behavior. The power stays on. The contractor goes home safe.

This is not a software guardrail. This is deterministic physical containment — the same principle that kept nuclear reactors safe for 70 years, applied to AI for the first time. And it’s why this system is insurable when others aren’t: an actuary can quantify the failure probability of a deterministic safety system. They cannot quantify the failure probability of an LLM’s judgment.


Part VIII: The Evidence

I’m aware that the preceding sections make extraordinary claims. Here is the evidence.

350B+
Error-free result triples
449 formal test reports. Zero failures.
102
Patent Claims
1 provisional patent, 8 invention families. USPTO 64/006,261.
328/328
W3C SPARQL 1.1
Full standard conformance. No asterisks.
1 Laptop
All Tests, One Machine
2023 MacBook Pro, M2 chip, 32 GB RAM.

Sustained throughput. 50 million queries over 10 continuous hours at 1,361 queries per second, with stable memory (no leaks, no growth, no garbage collection pauses). During that test, the system automatically recovered from 20 deliberate failover injections without human intervention.

Intellectual property. A single comprehensive provisional patent filed with the USPTO (64/006,261), covering 102 claims across 8 invention families: the Guardian Cycle, integer interning, hybrid LSM-Lite storage, zero-copy WASM ABI, selectivity inversion, self-governing compaction, physical actuation safety, and stigmergic agent coordination. The claims are architecturally interdependent — the fast query engine, the Guardian Cycle, and the physical safety probes are legally bound together as a unified system.

Codebase. 193,527 lines of Rust. Not Python glue around someone else’s library. Rust — the language you choose when failure is not an option.


Part IX: The 18-Year Question

In 2008, I asked a question: What would happen if every system and device became contextually aware of my needs and intentions?

I was the Chief Architect for systems that monitored entire provinces — hospitals, transit networks, building portfolios spanning millions of square feet. Every one of those deployments had the same bottleneck: humans manually stitching together systems that should have been aware of each other. I called it the “human middleware” problem. A nurse checking one screen, then walking to another screen, then calling someone on the phone to relay what the first screen said to the person who needed to act on it. That nurse was the middleware. And that middleware doesn’t scale, doesn’t have a backup instance, and eventually goes home.

The first company taught me what the problem was. The second company taught me what the architecture shouldn’t be. The third codebase — AIONdb — is what the architecture should be.

Two companies. Three codebases. 18 years. Not pivots. Convergences. Each one stripped away an assumption that was blocking the real solution.

The first assumption I stripped away was that you need strings. You don’t. Intern everything to integers at the boundary. A triple becomes 12 bytes. The query engine becomes pure arithmetic. Suddenly you’re processing 795,000+ results per second on hardware that costs less than a month of cloud compute.

The second assumption I stripped away was that you need an orchestrator. You don’t. Let agents write to a shared substrate. Let the substrate make truth coherent. Let other agents discover truth through queries. Coordination emerges from the environment. This is how every successful large-scale coordination system in history has worked — from termite colonies to SAP.

The third assumption I stripped away was that safety is a software problem. It isn’t. When AI touches the physical world, the containment must be physical. A protocol-level probe that interrogates the device. A ReadinessFlag that gates the wire. Deterministic, auditable, insurable.

These aren’t features. They’re architectural invariants. You can’t bolt them on after the fact. They have to be in the foundation. That’s why this took 18 years. That’s why it took three codebases. That’s why 193,527 lines of Rust exist — because every one of those lines encodes a lesson about what breaks at scale in systems that matter.


Part X: What Comes Next

At GTC 2026, Jensen Huang put it simply: structured operational data is the ground truth of enterprise computing. The future of AI isn’t smarter models. It’s models coupled to the truth that keeps the business running.

The termites already knew this. The cave painters knew it. The Sumerian scribes knew it. Every enterprise that ever deployed SAP knew it, even if they never had the word for it.

The word is stigmergy. The mark left on the shared environment that changes the behavior of whoever finds it next.

For 40,000 years, we’ve been building better blackboards — each one more persistent, more portable, more scalable, more structured than the last. Cave walls gave way to clay tablets. Clay tablets gave way to printed pages. Printed pages gave way to whiteboards. Whiteboards gave way to databases. Databases gave way to social networks.

Each transition preserved the pattern and added a property. Persistence. Portability. Replication. Bidirectionality. Structure. Scale.

But each transition also left something out. And the thing that was left out became the defining problem of the next era.

AI agents are the next participants in this 40,000-year lineage. They need a blackboard with every property that every previous blackboard provided — and the five that none of them ever did: transactional coherence, temporal ordering, cryptographic auditability, physical safety, and the unity of symbolic truth with semantic understanding.

That blackboard exists. It’s been tested across 350 billion+ result triples. It runs on a laptop. And it’s ready.


Go Deeper

Read the Architecture Overview →

The architecture, the design decisions, the evidence.

Apply for Beta →

Bring your own use case. Work directly with the founder.

Speak with the founder →

Direct access. No sales funnel.


The AI world is having an urgent conversation right now about agents — how they coordinate, how they remember, how they stay coherent across sessions and vendors. We believe AIONdb is the solution the AI world has been looking for.

AIONdb is a continuity engine and shared memory substrate for AI agents. The system was engineered from the memory layer up — with no existing database underneath. It is lock-free, zero-copy, and cryptographically auditable. It has 102 patent-pending claims, and we recently pushed it through 350 billion+ error-free result triples without a single failure. AIONdb gives AI agents a shared, permanent memory that survives across sessions, vendors, and failures.

Mike Monteith is a Canadian technology builder, inventor, and systems architect. For 35 years, Mike has been focused on solving how distributed, autonomous systems coordinate truth. Mike founded and led ThoughtWire, a Canadian company focused on digital twin technology, for over 15 years, building real-time operating platforms for life-critical infrastructure at scale — smart hospitals, large commercial environments, and complex physical systems.

AIONdb is the culmination of a career built on solving some of the most complex technological problems.

AIONdb is protected by a comprehensive provisional patent (USPTO 64/006,261, 2026) with 102 claims across 8 invention families, covering the Guardian Cycle, integer interning, hybrid LSM-Lite storage, zero-copy WASM ABI, selectivity inversion, self-governing compaction, physical actuation safety, and stigmergic agent coordination.


References

Grassé, P.-P. (1959). “La théorie de la stigmergie.” Insectes Sociaux, 6(1), 41–80.
Heylighen, F. (2016). “Stigmergy as a Universal Coordination Mechanism.” Cognitive Systems Research, 38, 4–13.
Gelernter, D. (1985). “Generative Communication in Linda.” ACM TOPLAS, 7(1), 80–112.
Erman, L.D. et al. (1980). “The Hearsay-II Speech-Understanding System.” ACM Computing Surveys, 12(2), 213–253.
Dorigo, M. & Stützle, T. (2004). Ant Colony Optimization. MIT Press.
Bengio, Y. (2019). “From System 1 Deep Learning to System 2 Deep Learning.” NeurIPS Keynote.
Marcus, G. & Davis, E. (2019). Rebooting AI. Pantheon Books.
Kautz, H. (2020). “The Third AI Summer.” AAAI Presidential Address.
Kahneman, D. (2011). Thinking, Fast and Slow. Farrar, Straus and Giroux.
Leveson, N. (2011). Engineering a Safer World. MIT Press.
Leveson, N. & Turner, C. (1993). “An Investigation of the Therac-25 Accidents.” IEEE Computer, 26(7).
Edge, D. et al. (2024). “From Local to Global: A Graph RAG Approach.” Microsoft Research.
Bourtoule, L. et al. (2021). “Machine Unlearning.” IEEE S&P.
Gilbert, S. & Lynch, N. (2002). “Brewer’s Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services.” ACM SIGACT News.

Continue your read

ARCHITECTURE OVERVIEW

A look under the hood

The architecture, the design decisions, and the evidence behind 350 billion+ error-free result triples.

Read the overview →
FAQs

AIONdb Explained

Plain-language answers to the questions buyers actually ask.

See the FAQs →
APPLY FOR BETA

Join the cohort

For organizations with audit-critical, coordination-heavy workloads.

Apply now →