START WITH ONE WORKFLOW — GIVE AI THE CONTEXT IT IS MISSING
    Penumbra
    ← Blog

    Field note

    Why AI Agents Need Ontology

    Give the model a map of the work before you ask it to run the work.

    Abstract domain model showing connected business concepts for AI agents
    Agents need a working model of the business before they can act inside it.

    Most AI agent failures begin one layer below the tool call.

    The agent sees a CRM record, a support thread, a contract clause, a Slack update, and a few tool descriptions. Then it has to infer the business behind those fragments. Sometimes it gets lucky. In production, luck starts looking expensive.

    A real business is full of overloaded words. "Account" can mean the billing entity, the strategic relationship, the workspace, the parent company, or the team currently yelling in the support queue. "Risk" can mean churn risk, security risk, implementation risk, legal risk, or a founder's private sense that this deal is going sideways.

    When those meanings live in people's heads, docs, and one-off prompts, agents guess. The failure mode is rarely dramatic. The agent drafts the wrong email, moves the wrong deal stage, files the ticket under the wrong category, or asks a human for help because it cannot tell what kind of situation it is in.

    That is why AI agents need ontology. Action depends on meaning. Tool access gives the model reach. Ontology gives the model a world to act inside.

    Where ontology sits in the agent stack

    Anthropic's agent guidance draws a useful line: workflows follow predefined code paths, while agents dynamically direct their own process and tool use. That extra autonomy changes the architecture problem. Once the model chooses actions, the system has to tell it what the business objects mean and what counts as a valid move.

    LayerWhat it gives the agentWhat remains unresolved
    PromptTask framing, tone, broad instructions, and temporary context.A prompt cannot govern the company's definition of Account, Renewal Risk, or Approval Required across tools.
    RAGRelevant documents, historical examples, and retrieved notes.Retrieved text still has to be classified into business objects, states, and decisions.
    Tool schemaCallable actions with parameters and descriptions.The schema rarely explains when an action is allowed, which evidence is enough, or who must review it.
    OntologyDomain objects, relationships, states, evidence rules, and action boundaries.The agent can connect context to an operating model instead of reconstructing the business each turn.

    Prompting, retrieval, and tool definitions all matter. They solve different parts of the problem. Ontology answers the question sitting underneath them: what kind of thing is this, how does it relate to the rest of the business, and what should change now that we know it?

    A good agent should spend its reasoning budget on the new situation in front of it. The stable grammar of the business belongs in the system.

    The production symptoms are concrete

    This is where the SEO phrase starts paying rent. "AI agents need ontology" should predict specific failures. If the claim cannot explain what breaks, it is just category language.

    SymptomWhat the agent is missingWhat ontology adds
    The agent uses the wrong object."Account" means different things in sales, billing, product, and support.Canonical business objects and relationships between them.
    The agent calls the right tool at the wrong time.The tool exists, but the action boundary is undefined.State-specific action rules and approval requirements.
    Memory becomes a pile of snippets.The agent retrieves prior text without knowing what changed.Typed memory attached to Accounts, Commitments, Risks, Issues, and Decisions.
    Human review is noisy.Every uncertain action escalates because the system cannot locate the judgment point.Review gates tied to risk, authority, evidence, and reversibility.
    Provenance is vague.The system says it used context, but cannot show which evidence changed which conclusion.A trace from source evidence to object state to action.

    These are boring failures, which is exactly why they matter. They show up in the places where companies actually want agents: customer escalation, renewal support, implementation handoff, RFI response, account research, contract review, and internal operations.

    A worked example: customer escalation

    Take a support agent assigned to customer escalations. The model can summarize a thread and draft a response. That is the easy part. The hard part is knowing what the thread means to the business.

    1. 1

      Classify the situation

      The thread is classified as an Implementation Blocker, Product Issue, Contract Risk, Relationship Escalation, or some combination of those types.

    2. 2

      Attach it to business objects

      The issue connects to an Account, Contract, Stakeholder, open Commitment, affected Product Surface, and current Implementation Phase.

    3. 3

      Update state with evidence

      The missed handoff, contract clause, customer quote, and prior ticket become evidence that changes the state of the Commitment or Risk.

    4. 4

      Select the allowed action

      The agent can draft a response, open a work item, route to an owner, or request review depending on the state and authority rules.

    5. 5

      Show the trace

      A human can see the source evidence, the object state that changed, and the reason the agent chose that action.

    The ontology behind that workflow might be small: Account, Contract, Stakeholder, Commitment, Issue, Risk, Evidence, Action, Owner, Review Gate. Small is the point. The agent now has a world with nouns, state, and consequences.

    The relationship map behind one escalation

    The useful structure is the path from evidence to state to action.

    AccountContractCommitmentIssueEvidenceRiskActionReview Gate
    AccounthasContract
    ContractdefinesCommitment
    IssueproducesEvidence
    EvidencechangesRisk
    RiskgovernsAction
    Actionmay requireReview Gate

    Knowledge graphs for AI agents become useful here because relationships carry operational meaning. Account has Contract. Contract defines Commitment. Commitment creates Work. Work produces Evidence. Evidence changes Risk. Risk changes Action. The graph becomes a decision substrate the agent can traverse.

    The smallest useful ontology

    Start smaller than a company-wide ontology project. A giant diagram usually becomes too large to verify and too abstract to operate. A minimum viable ontology starts with one workflow where the agent already has a job to do.

    • Core objects: the five to ten nouns experts already use when discussing the workflow.
    • Relationships: how those objects depend on, contain, block, create, or govern each other.
    • States: the meaningful statuses an object can occupy, such as Proposed, Blocked, At Risk, Approved, or Superseded.
    • Evidence: the source material that can change an object's state.
    • Actions: what the agent can do around each object.
    • Review gates: when a human must approve, reject, or supply judgment.
    • Provenance: the path from source evidence to object state to agent action.

    That is enough structure to change the system. Memory can update objects instead of accumulating notes. Tools can bind to states instead of floating as generic capabilities. Review can move to the moments where judgment matters. Provenance can explain the path from evidence to action.

    W3C standards like RDF and OWL are useful background because they formalized an old idea that is suddenly practical again: systems need a way to describe things, relationships, and constraints. Most teams can ship their first agent ontology without RDF. The discipline still matters: make the domain explicit enough for software to use.

    What changes after the ontology exists

    BeforeAfter
    The agent retrieves five similar notes.The agent updates the relevant Account, Commitment, Risk, or Decision.
    The prompt says to be careful with renewals.Renewal Window, Commercial Risk, Approval Requirement, and Review Gate are defined objects.
    Every exception goes to a human.Only exceptions crossing authority, risk, or evidence thresholds require review.
    The agent explains itself in prose.The system shows the source evidence, object state, and allowed action.
    Each agent needs its own pile of instructions.Agents share the same domain model and use different tools around it.

    This is the semantic layer for agents. It sits between the model and the business systems, giving the model the terms, constraints, and relationships it needs to act without pretending every workflow is a blank page.

    How Penumbra builds the model behind the agent

    Penumbra starts with the workflow where AI is already close to useful: proposal response, customer escalation, account intelligence, expert research, partner onboarding, RFI review, implementation handoff. The expert describes the work in business language. Penumbra captures the objects, rules, relationships, review standards, and evidence model around that workflow.

    1. 1

      Capture the expert's domain commitments

      What exists in this workflow? Which states matter? Which distinctions separate a good decision from a bad one?

    2. 2

      Turn the commitments into a domain model

      The model defines the objects, relationships, evidence, action rules, and review gates the agent will use.

    3. 3

      Compile the model into working surfaces

      The same model can support extraction, memory, tools, APIs, guardrails, review, and provenance.

    4. 4

      Let people and agents share context

      Humans inspect and improve the model. Agents act through it. The workflow stops depending on private tribal knowledge.

    This is the buyer-useful answer to "why do AI agents need ontology?" Agents need ontology because the model needs a governed representation of the work before it can act on the work. Without that representation, every agent is forced to reverse-engineer the business from scraps.

    Build the model behind the work. Then give the agent tools.


    Read next

    See how Penumbra explains ontology as the working model of a domain

    Explore the Penumbra platform for domain models, tools, memory, and provenance

    Build agents on a real model of your domain

    Read Gundam for Your Brain

    Bring one workflow where AI keeps losing the plot

    Sources

    Anthropic: Building effective agents

    W3C: RDF 1.1 Concepts and Abstract Syntax

    W3C: OWL 2 Web Ontology Language Document Overview