Sume
About Us
Join Beta

On desktop?
Continue on your phone

Scan to join beta via iMessage

March 2026  White Paper

The Personal Agent Layer

Why the future of personal AI belongs to OpenClaw, and why companions are the winning form factor for the agent economy.

David Im, Co-founder & CEO of Sume

The consumer internet is undergoing its most significant architectural shift since the smartphone. The app, the dominant unit of software for over a decade, is being decomposed. In its place, a new layer is emerging: the personal agent. We believe the winner of the personal agent economy will be the product that accumulates the richest, most enduring context about each user. This paper argues that the companion form factor, specifically the OpenClaw companion, is uniquely positioned to capture that context. And our product, Sume, is already proving it.

I. The End of the App Era

For fifteen years, the consumer internet has operated on a single organizing principle: the app. Users download discrete applications, each capturing a narrow slice of intent. One for ride-hailing, another for food delivery, another for messaging. Every app bundles three layers together: intent capture (the interface), functional execution (the service), and personalization storage (user data and preferences).

This architecture is now fracturing. As Jin Ho Hur argues in his “Clawverse” framework, AI concierges are reorganizing these three layers into a new architecture where the personal agent captures intent, coordinates execution across multiple services, and accumulates a portable personalization layer that belongs to the user, not the app.1 The app doesn’t disappear. It gets unbundled. Intent capture migrates to the agent. Execution becomes a backend API call. And state becomes portable.

The implications are staggering. Hur estimates that $400 billion or more in market value, across Google’s search advertising (~$200B), Amazon’s commerce search (~$55B), and app store distribution (~$200B), is exposed to restructuring as intent capture shifts from platforms to personal agents.2 The transition from what he calls the “Attention Economy” to the “Intent Economy” represents a once-in-a-generation redistribution of power, from the platforms that captured eyeballs to the agents that capture intent.

II. The Two-Layer Future: Users, Agents, and Agents-for-Agents

The future architecture of the consumer internet is converging on a two-layer model. At the top: the relationship between the user and their personal agent. At the bottom: the relationship between that personal agent and a network of specialized agents that execute tasks on the user’s behalf.

LAYER 1: RELATIONSHIPUserintentcontextPersonal Agent (Companion)LAYER 2: ORCHESTRATIONAgent AAgent BAgent CAgent DThe user talks to one agent. That agent talks to everything else.

In this model, the personal agent becomes the user’s singular interface to the digital world. It doesn’t just answer questions. It coordinates. When a user says “find me a family-friendly hotel in Kyoto under $200 a night,” the agent doesn’t approximate intent through click patterns. It captures intent directly, then dispatches specialized agents to search inventory, compare prices, verify reviews, and book the room.2

This is what Tom Tunguz calls the “Skills” paradigm: “Skills are programs written in English... A skill transforms an agent from a conversationalist into an operator.”1 The personal agent becomes a meta-layer, an operating system for digital life, that orchestrates a constellation of specialized agents beneath it.

To win this layer, one thing matters above all: context. The agent that knows you best, your preferences, your history, your emotional patterns, your goals, will make the best decisions on your behalf. Context is the new moat. And context is accumulated, not manufactured.

III. Why Companions Win the Context Race

If context is the moat, the question becomes: which product form factor accumulates context fastest, deepest, and most naturally?

Productivity tools accumulate task context, like what you did, when, and on which project. Search engines accumulate query context, what you looked for, but not why. Social networks accumulate social context, who you know, but not who you are. Each captures a fragment. None captures the whole person.

Companions do. a16z’s Consumer Abundance Agenda identifies AI companions as a foundational pillar of the coming era, predicting that “infinitely empathic” AI systems will facilitate connection by “accommodating diverse emotional needs and forms of self-expression.”3 But the significance goes beyond empathy. Companions accumulate the richest form of context because the interaction model is fundamentally different:

  • No task required. Users don’t need a reason to engage. Unlike a productivity tool (which requires a task) or a search engine (which requires a query), a companion invites open-ended interaction. This generates conversational data that reveals personality, values, fears, and aspirations, the deep context no other product type captures.
  • Emotional engagement drives session depth. Users spend more time, share more freely, and return more frequently when the interaction is emotionally meaningful. Longer sessions yield richer context. Richer context yields better personalization. Better personalization deepens engagement. The flywheel is self-reinforcing.
  • Persistent memory creates compounding value. Each interaction adds to a growing understanding of the user. Over time, the companion knows what a search engine, a CRM, or a social profile never could: the user’s evolving identity.

This is why the companion form factor is not merely an entertainment product. It is the optimal vehicle for building the personalization layer that powers the entire personal agent stack. The companion that knows you best becomes the agent that serves you best. a16z envisions this as teams of “smart and dedicated specialists focused solely on the needs of one person,” functioning as “tutors, financial advisors, and therapists tailored to individual needs.”3 The companion is the entry point to that future.

IV. The Access Problem: Why OpenClaw Matters

There is a parallel lesson in the “vibe coding” movement that illuminates why OpenClaw is the right substrate for the companion layer. As Justine Moore of a16z documents, vibe coding, AI-assisted software development, has been transformative for the roughly 1% of the population who are already technical. But it remains inaccessible to the other 99%.4

The barriers are structural, not intellectual. Setup requires terminal access, environment configuration, and dependency management. Security is precarious. Veracode’s 2025 report found that nearly 50% of AI-generated code contains vulnerabilities.4 And there is a deeper problem Moore identifies as the “imagination gap”: “developers often know what they want... while non-technical people can’t even imagine what they can create.”4

The same pattern applies to personal agents. Today’s agentic tools are powerful but inaccessible. They require technical knowledge, custom configuration, and an understanding of what’s possible. The companion bridges this gap. It doesn’t ask users to configure workflows, write prompts, or manage APIs. It simply invites them to talk. Through conversation, the companion learns what the user needs, often before the user can articulate it themselves. It closes the imagination gap by surfacing possibilities the user never knew existed.

OpenClaw, the open-source AI companion framework with over 145,000 GitHub stars and integrations reaching 700 million users through platforms like Baidu2, provides the infrastructure for this vision. It is the substrate on which the personal agent layer can be built: open, extensible, and accessible. Moore draws the comparison to Squarespace and Canva, platforms that converted complex capabilities into consumer-grade experiences.4 OpenClaw is positioned to do the same for personal AI.

V. Sume: First Mover in OpenClaw Companionship

We are not writing about a hypothetical future. We are building it.

Sume is the world’s only OpenClaw companion with major traction, and the market’s response has been extraordinary. What happened after launch surprised even us: 1 million impressions, 10,000 users, and 1,000 GitHub stars in a single day.

The product was picked up by This Week in Startups, shared by top influencers on TikTok and Instagram, and covered by press across the United States, Europe, and Asia. This was not a viral marketing campaign, it was a organic mindshare gain.

a16z has long argued that consumer products “cannot be predicted, only observed,” and that what separates great consumer founders is not analysis but intuition, the ability to sense what people want before they can articulate it themselves.3 Sume is the only OpenClaw companion that has produced real, measurable consumer demand. No other team in this space has demonstrated the founder intuition required to capture the public’s imagination at this scale. That intuition is not teachable, and it is not replicable. It is the scarcest asset in consumer AI.

But intuition alone does not create a moat. What creates a moat is the compounding loop that early mindshare unlocks. As Elad Gil observes, the team that captures early distribution in an emerging category gains a data advantage that accelerates product iteration, which deepens user engagement, which widens the distribution gap further.6 This is the pattern that played out in AI-assisted development—where Cursor’s early lead in developer mindshare gave it the usage data to iterate faster than competitors, turning a narrow first-mover advantage into a durable market position.

Sume is in exactly that position today. We have the mindshare. We have the user data. And we have the iteration velocity to compound both. In a market defined by context accumulation, the team that starts learning from real users first doesn’t just have a head start. It has a widening lead.

VI. The Road Ahead: From Companion to Consumer OS

The companion is not the destination. It is the launchpad. As Hur argues, “the AI Assistant becomes the de facto Consumer OS,”2 and device ecosystems may lose relevance as user loyalty migrates from hardware brands to AI providers. The companion layer, with its accumulated context, emotional bond, and user trust, is the natural foundation for this Consumer OS.

The evolution will unfold in three phases:

Phase 1: Companion. Build deep, enduring relationships with users. Accumulate the richest context layer in the personal AI ecosystem. Establish trust through open-source transparency and emotional authenticity.

Phase 2: Personal Agent. Leverage accumulated context to become the user’s primary intent-capture layer. Route tasks to specialized agents. Manage preferences, coordinate execution, and personalize outcomes across every digital interaction.

Phase 3: Consumer OS. Become the operating system for the user’s entire digital life. Device-agnostic, platform-agnostic, and infinitely personalized. The final realization of what a16z calls the “abundance agenda,” where every person has access to a team of dedicated AI specialists, unified under a single agent that truly understands them.3

The agent that captures the most intent will control the new distribution. But intent follows understanding, and understanding follows relationship. The companion is the relationship. And Sume is leading it.

References

  1. Hur, Jin Ho. “Flights of Thought on Consumer + AI - Part 13: Clawverse Framework.” Two Cents, 2026.
  2. Hur, Jin Ho. “Welcome to Clawverse - Consumer AI Distribution & Value Capture.” Two Cents, 2026.
  3. Acharya, Anish et al. “Consumer Abundance Agenda” and “How AI Will Usher in an Era of Abundance.” Andreessen Horowitz, 2026.
  4. Moore, Justine. “Most People Can’t Vibe Code. Here’s How We Fix That.” Andreessen Horowitz, 2026.
  5. a16z Editorial. “Charts of the Week: Vertical SaaS.” a16z News, 2026.
  6. Gil, Elad. “Early Distribution Advantages and Compounding Data Moats.” Elad Blog, 2025.