OpsCanvas Whitepaper · January 2026

Cloud
Intelligence Graph

A context graph for cloud operations: a continuously updated, versioned model of what is running, how it is connected, who owns it, and how it changed — without requiring process overhaul or replatforming.

Jason Turim CTO & Co-founder, OpsCanvas

22 pages Full technical paper

6 sections Architecture to agentic operations

Download PDF Read online

Abstract

The problem with cloud operations today is not a lack of data

Modern cloud environments produce enormous volumes of observability data, infrastructure state, and deployment events. The problem is that this data is distributed across dozens of tools, none of which share a common model of what is running, how services depend on each other, or how the environment changed over time. What is missing is not more data — it is provenance and change lineage: the structured, queryable record of what exists, who owns it, when it changed, and why.

Without a shared operational model, context reconstruction becomes the dominant cost in cloud operations. Engineers spend the first hour of every incident rebuilding what changed. Approval workflows route to the wrong owners. Cost anomalies cannot be explained. AI agents cannot reason safely across a cloud environment they cannot see as a coherent whole.

"A context graph turns operational data into operational knowledge. The data already exists. The missing piece is the shared model that makes it queryable."

This paper introduces the Cloud Intelligence Graph: a continuously updated, versioned representation of cloud operational reality. Built on a context graph primitive, it enables safer change, faster incident response, cost accountability, audit-ready governance, and AI agent parallelization — without requiring teams to change how they build or deploy.

Executive Summary

Cloud operations have reached an inflection point

The tools teams rely on for cloud operations were each built to solve a specific problem: IaC for provisioning, observability for signals, CMDBs for inventory, FinOps for cost. But none of these tools was built to answer questions that span all of them — and in practice, making context reconstruction a dominant source of operational inefficiency.

The root cause is structural. Each tool holds a partial view of the environment. IaC captures intent, not reality. Observability captures signals, not ownership. CMDBs capture snapshots, not change lineage. The result is that answering basic operational questions — what changed, who owns this, what does it depend on — requires manual synthesis across four or five tools, every time.

"Context is not a nice-to-have. It is the precondition for safe automation, accountable governance, and effective AI agents in cloud environments."

A context graph addresses this by maintaining a continuously updated, versioned model of the environment: every resource, service, dependency, ownership relationship, and change event, captured in a way that is queryable as a unified whole. The model tracks the five Ws — what changed, who made the change, when it happened, where in the dependency graph it propagates, and why it was authorized.

When the current state becomes legible and versioned, a different class of capabilities becomes possible. Organizations can:

Understand change impact before deployments ship Surface shared dependencies and cross-team coupling before a change reaches production, so approvals are grounded in what the change actually affects.

Reduce incident response time Connect symptoms to dependencies, ownership, and recent change history from the moment an incident begins — instead of spending the first two hours reconstructing what changed.

Attribute cloud spend to ground truth Explain cost shifts in terms of actual changes, scaling events, and configuration updates — not fragile tagging schemes that drift from reality as systems evolve.

Identify savings opportunities that are safe to execute Find dormant environments and orphaned resources, then validate safety through dependency context before taking action — so cost reduction does not come with reliability risk.

Provide audit-ready traceability Answer questions about any point-in-time environment state, who approved each change, and why a resource exists — as a query, not a multi-team investigation.

The context graph does not require replatforming or changes to how teams build and deploy. It is designed to be adoptable alongside existing tooling, slotting into current workflows rather than replacing them.

The operational model also directly enables AI agents operating in cloud environments. Agents that can read a shared, queryable model of the environment — and write change events back to it — can act with meaningful safety guarantees and be governed with the same traceability that applies to human operators.

Paper Contents

What the full paper covers

The complete 22-page paper works through six sections, moving from the structural problems in current cloud operations tooling to a detailed architecture and the operational capabilities it enables:

Why the current stack does not add up. IaC captures intent, not reality. Observability captures signals, not ownership. The gap between what tools know and what operators need to know is where operational cost accumulates.
The business consequences of missing context. How fragmented context drives cloud waste, prolongs incidents, and makes governance expensive — with specific examples of the questions that cannot be answered without a shared operational model.
Context graphs as a category. How context graphs compare to CMDBs, developer portals, observability platforms, and FinOps tools — and why none of those categories addresses the problem of versioned, queryable operational state.
The Cloud Intelligence Graph architecture. Five core primitives (applications, services, environments, shared infrastructure, deployments), data sources, the change lineage model, and how the graph is built and kept current without manual curation.
What this enables. Change safety, security and governance, cost accountability, and durable ownership — described in terms of specific operational scenarios and the questions they make answerable.
The next generation of cloud operations. From on-call-reactive to continuous-oversight operations, the path toward provider-agnostic workload placement, and the role of AI agents in a context-grounded operational model.

Download the full paper or contact Jason directly with questions or discussion.

What a Context Graph Makes Possible

Four capabilities that change when context is queryable

The Cloud Intelligence Graph does not replace existing infrastructure tooling. It makes environments, services, dependencies, ownership, change history, cost, and savings opportunities queryable in a unified way.

Change Safety and Operational Resilience

Surface blast radius before deployments ship. Connect changes to ownership so approvals are informed. Promote and roll back environments as coherent units, not isolated artifacts.

Security, Compliance, and Governance

Accelerate blast radius analysis for security incidents. Enable audit readiness with point-in-time environment state and change history. Enforce governance based on meaning and impact, not brittle rules.

Cost Accountability and Savings

Attribute spend to real dependencies, not fragile labels. Explain cost shifts by correlating them with actual change events. Validate savings actions through dependency context before executing them.

Durable Ownership and Accountability

Make ownership queryable and persistent as teams change. Tie ownership to change lineage and cost responsibility. Remove ambiguity around who must approve, respond, or remediate when something breaks.

CloudIntelligence Graph