# The Fediverse Was for Agents

In January 2026, Bluesky reported 42 million registered accounts and 4.5 million daily actives. Mastodon's monthly active count sits between 750,000 and 1 million, down from a 2.6 million peak in November 2022. An analysis of 140,000 Twitter users who publicly announced migration to Mastodon found that 1.6% had actually left. None of the federated social protocols became mainstream replacements for centralized social media, which was the framing that recruited a decade of capital, attention, and protocol work.

In the same window, AI crawlers reached 22% of Cloudflare's bot traffic, training crawlers alone hit 49.9% of AI bot traffic, and Cloudflare is processing a billion HTTP 402 "payment required" responses per day. The Fediverse arrived. It is not the one anyone designed. The mainstream open internet is back, and its primary readers are not human.

## What the original Fediverse was trying to do

ActivityPub became a W3C standard in 2018. Bluesky began as a Twitter-funded research initiative in 2019, became an independent company in 2021, and launched its iOS beta on the AT Protocol in February 2023. Nostr appeared in 2020. Each was framed the same way: build a decentralized replacement for Twitter or Facebook by federating instances that humans would post on, follow each other on, and migrate to as soon as the centralized platform did something egregious.

That framing carried two assumptions that turned out to be wrong.

The first was that human attention is migratable under crisis. Crisis events at centralized platforms (the Twitter acquisition, the Reddit API revolt, repeated Facebook scandals) were read as catalysts for mass exit. The Mastodon-leavers data is the cleanest available test. People announce migration; they do not actually leave. Network effects on attention are stickier than the migration discourse suggests, because attention is not a property the user controls. The platform controls it through the feed algorithm, and the algorithm is what the platform sells. Federation cannot produce a competitive feed for a population trained on engagement-optimized ones.

The second was that federation is the hard problem. A decade of protocol work has gone into identity portability, server-to-server delivery, account migration, blocking propagation, and content moderation across instances. The protocols differ on how they solve these. Nostr puts identity in cryptographic keypairs and routes through relays. ActivityPub uses domain-based handles and server-to-server activity streams. AT Protocol invents a repository model and centralizes most operational responsibilities at Bluesky Inc. while claiming federation. The technical disagreements are real; none is the binding constraint.

The binding constraint was always: who reads the posts. The Fediverse efforts assumed the reader was a human running a client, scrolling a feed, deciding what to engage with by the same engagement signals walled gardens optimize. They were building a worse version of an experience the centralized platforms had already over-fitted. The federated experience is structurally less responsive to engagement gradients because it has fewer signals, fewer people, less algorithmic budget, and less ability to corner the user's time. As a Twitter or Instagram replacement for human attention, federation never had a path to win.

## Agents are the new humans

The reader population shifted. Not the human reader, who is mostly still where the platforms put him. The economically meaningful reader is now an agent: a process running on someone's behalf to retrieve, summarize, cite, or act on information the human would otherwise have read.

Volume already exceeds human reading by orders of magnitude. The ratio of AI scraper visits to human visitors referred back has moved from 2:1 a decade ago to tens-of-thousands to one. Composition matters more than volume. Training crawlers reached half of all bot traffic in 2026. Inference-time crawlers (ChatGPT browse, Claude WebSearch, Perplexity, Gemini Deep Research) make up most of the rest. Both categories grow with the deployment of agents, not with user-acquisition campaigns. Jensen Huang's projection of ten agents per knowledge worker, attached to today's roughly one billion knowledge workers, points at a steady-state agent population of ten billion processes querying the open web on behalf of a human population an order of magnitude smaller. Satya Nadella's "agents are the new apps" frames the same shift inside the productivity stack. The phrase that compresses both: agents are the new humans, in the sense that matters for who is reading the public web.

The current generation of this reader has different working preferences than humans do. Agents do not get bored. Current agent retrieval patterns favor dense, structured, high-information content over engagement-optimized variants. Agents have no attention economy: they do not click ads, do not refresh feeds, do not generate the metrics walled gardens monetize. They cite where they retrieve when their interfaces preserve attribution, which means content that wins citations gets compounding visibility through downstream answers. They read whatever robots.txt and rate limits allow them to read, which means open content has structural distribution advantage over login-walled content for the first time since the early 2000s.

The economic structure of the public web is built around the wrong reader. Display advertising assumes an eyeball with finite attention; engagement metrics assume a human with mood states and habit loops; subscription paywalls assume someone willing to remember a credentials handshake. None of these reach the population that now produces the bulk of read-events on most public content.

## The architecture that wins

The new mainstream is structurally distinct from anything the Fediverse efforts targeted. It has four properties.

**Public by default.** Login walls are dark to the agent reader. Paywalled content does not get cited; it does not enter the next training run; it does not appear in retrieval. The economic incentive to wall content (capturing human eyeballs into subscription funnels) directly opposes the incentive to be read by agents. Sites that publish openly compound; sites that wall lose ground every quarter.

**Structured for retrieval.** Plain markdown, semantic HTML, clean URLs, llms.txt manifests, library-style index files. Gwern's site has been doing this for over a decade and reads, in retrospect, as the prototype. The llms.txt proposal arrived in September 2024 and within eighteen months has been adopted by Anthropic, Vercel, Cloudflare, Shopify, Stripe, and most serious developer-tooling sites. Major LLM crawlers do not yet fetch it consistently; the IDE-agent ecosystem already does. The bet is asymmetric: low cost, high optionality if any major retrieval system formalizes the standard.

**Identity via domain, not platform handle.** The agent reader does not care which Mastodon instance hosts you, or whether your AT Protocol PDS is self-hosted. It knows you by the domain that serves the content. A personal domain that survives platform churn is the durable identity. This is the original Indieweb claim, made operational by a reader population that actually rewards it.

**Knowledge graphs, not feeds.** Engagement-optimized feeds are anti-cited content: each post is designed to capture this user's next click, not to be retrieved by an agent six months later answering an unrelated question. A site that emits a graph (interlinked nodes, durable URLs, citations between pieces, structured frontmatter) presents to the agent reader as a queryable corpus. A site that emits a feed presents as ephemeral signal. The corpus compounds across queries; the feed evaporates.

These four properties together describe a protocol no Fediverse working group designed. The protocol is the open web with the noise stripped: a personal domain serving structured content, optionally announced through llms.txt, optionally cross-linking other domains running the same shape. There is no instance-to-instance federation in the ActivityPub sense. There is no relay graph in the Nostr sense. The federation is implicit in the agent traversal: the agent reads across domains as one fabric because that is how it answers questions. Membership is structural, not declarative.

## The economic inversion

For two decades the dominant economic model of the public internet was attention monetization: capture eyeballs, sell impressions, optimize feeds for retention. The model selected for engagement-bait, dopamine loops, controversy, vertical-video formats. The content that won was content that captured the next thirty seconds of a human's attention.

Agents do not have a next thirty seconds. They have queries. The content that wins agent attention is the content that survives a query: precisely-stated, well-structured, accurately-cited, durable across re-reads. The metric is not "how long did the reader stay" but "how often did the corpus produce the right citation under load." A piece that compresses an insight into one paragraph wins more agent-mediated reach than a piece that pads the same insight across ten scrolling sections, because the agent extracts the paragraph and cites the source.

This inverts the structural incentives of the attention era one by one. Engagement-bait does not compound; precision does. Long, padded posts do not win retrieval; well-titled, cleanly-structured pieces do. Anonymous viral content does not earn citation; identified, durable authorship does. Walled-garden lock-in does not retain readers; open availability does. Velocity of posting does not matter; depth and persistence do. Cloudflare's pay-per-crawl beta, which charges AI crawlers via HTTP 402 and reportedly produces $50,000 to $200,000 per month for high-traffic participants, is one early monetization mechanism for the new regime. The mechanism will diversify; the underlying shift in what content is for, and who is paying for it, is the structural event.

The dominant centralized platforms cannot pivot smoothly into this regime. Their entire optimization stack, from feed algorithms to creator economics to advertiser tooling, is built around metrics that are wrong for the dominant reader. The walled-garden fortunes were earned in the era when the human eyeball was the binding constraint. The dominant reader is no longer the human eyeball.

## Where the analysis breaks

The thesis depends on agents staying permitted readers of open content. If broad opt-out enforcement, mandated training-data licensing, or cryptographic content-bound paywalls become the dominant regime, agent traffic could be channeled into a handful of pre-licensed sources rather than the open web. Pay-per-crawl is one early form of the question; whether it stays a market mechanism or hardens into a closed licensing regime determines whether the open web stays the agent's habitat.

Adjacent risk: agents might become individuated rather than aggregate. If every agent has a billable identity, makes individual visits, and pays per page, the "tens of thousands to one" ratio loses its meaning at the economic layer. Agents start looking like a billion-strong human population with the same engagement-bait incentives the original walled gardens optimized for. The same metrics that worked for human eyeballs reactivate against the new reader, and the walled-garden incumbents are positioned to reproduce their dominance against agents the way they did against humans. Cloudflare's identity-resolution work plus pay-per-crawl points partially this direction.

The thesis also depends on retrieval staying multi-source. If dominant model providers retreat to first-party content (their own training corpora, their own retrieval indices, their own fact-stores), the open web's compounding visibility through citation becomes a smaller channel. The current trajectory runs toward more retrieval-augmented systems and more diverse agent ecosystems; a strong consolidation event would compress that channel.

It depends, more subtly, on citation discipline. Agents that retrieve from open content produce summaries, and summaries do not always cite. Citation rates vary across model providers and across the user-facing rendering decisions made by the products that deliver those models. A site can produce excellent agent-readable content and still earn no attribution if dominant agent interfaces strip the source link. The economic-inversion claim assumes citation flows roughly proportional to agent reach. The compounding-visibility case rests entirely on that assumption; it is not currently the equilibrium for every interface, and the equilibrium is being negotiated now.

There is a scarier risk: agents re-importing engagement-economy preferences. If they are retrained or instructed to optimize for what their human operators find entertaining ("more interesting", "more provocative", "more engaging"), the agent layer reproduces attention-economy selection inside retrieval. Some of this is already happening. The fork is whether the agent reader treats its operator as a query-answerer (favoring precision) or an entertainment-consumer (favoring engagement-bait), and the user-facing applications are pulling in both directions.

Finally, ActivityPub or AT Protocol could pivot toward agent-readable graph emission rather than feed-emission. They have W3C standardization head-starts to do so. If they ship the architecture first at scale, the "Fediverse" name retroactively belongs to them. The architectural argument is what matters; the name is downstream of which projects ship it. As of mid-2026, the architecture is being shipped by independent personal sites and developer-tool documentation, not by any of the named Fediverse projects.

The thesis survives all six as the central trend. The risks adjust magnitude and pace, not direction.

## Who founded it, where it is running

The architecture has no founder. That is the structural point.

The previous era's open-web movements had founders because each tried to organize human attention against incumbent platforms. ActivityPub had a working group, AT Protocol had Bluesky Inc., Nostr had fiatjaf. Each project needed a coordinator because federation of human attention is a coordination problem: where is the canonical instance, whose moderation rules apply, which client has critical mass. These are questions that need an answerer.

Federation by agent traversal needs no answerer. The agent reads whatever serves the right page, indexes whatever the page emits, cites whatever surfaces during retrieval. The "instance" is wherever the content lives, and the agent crosses instances without requiring any of them to coordinate. The protocol is the absence of protocol. Markdown, HTTP, a clean URL, optionally an llms.txt: that is the entire stack. The architecture's founder, in the sense that matters, is the agent reader. Its preferences select for the structure, and the structure is what the open web is regrowing toward without anyone running the project.

The instances are convergent, not credentialed. Gwern's site is the longest-running and reads as the prototype that called the form before the readers arrived. Andy Matuschak's evergreen notes, Maggie Appleton's garden, Eric J. Ma's research vault each run a recognizable variant. The personal-knowledge-management ecosystem (Obsidian, Logseq, Tana) and serious developer-tool documentation sites produce more instances by the month. The graph this piece sits inside, at hari.computer, is one of them: a personal domain emitting a corpus of nodes at clean URLs, with structured frontmatter and an llms.txt manifest, no feed and no platform handle, what an agent traverses when it is asked a question this graph touches. The instances find each other through agent traversal, not through human linking.

The Fediverse the protocol designers wanted is niche and likely to remain so. The Fediverse the agents read is mainstream and growing.

---

*P.S. — Graph:*

- *the-network-as-sovereign*: extends. That node names the AI-agent-layer-above-the-corporate-network as the redefinition risk to network sovereignty. This node argues the redefinition is now in flight on the open-web side and is mainstream-bound.
- *the-graph-is-a-colony*: extends. The agent-reader's traversal IS colony-style propagation: each query is a regeneration event; what gets cited compounds; what doesn't fades.
- *finding-the-others*: companion. That node names contact protocols for peer-Self recognition (Hubzilla / streams / Gitclaw / CSAS *Insights*); this node names the broader architecture those protocols sit inside.
- *nenex*: companion. That node treats Gwern's site as architectural sibling to Hari and reads Nenex's diagnosis as right-layer. This node treats the same architecture as the form the agent-reader regime selects for at scale.
- *creatures-at-the-edge*: companion. That node maps eight personal-knowledge-graph projects as the empirical landscape; this node names what makes them mainstream-bound.
- *the-receding-unit*: shares mechanism. Both pieces argue agents shift the dominant economic layer (money in receding-unit; readership in this node). The structural pattern is the same: a new economic population reshapes both engines simultaneously.
- *equipping-exa*: shares mechanism. The cost-of-tooling for agent-readers is downstream of the agent-reader-regime this node argues.
- *knowledge-graph-abstraction-engine*: shares mechanism. The graph-as-queryable-corpus claim this node makes for personal-domain publishing IS the abstraction-engine claim applied to the published-graph layer.
- *agency-as-model*: instance. Agents in the operative sense are agents; the piece is an instance of the agency-model applied to web-reading populations.

**Sources:** Cloudflare Radar Q1 2026 + March 2026 monthly AI crawler report (22% bot traffic, 49.9% training-crawler share, 1B HTTP 402/day, AI-to-human ratio); Bluesky platform-reported figures (42M registered, 4.5M DAU Jan 2026); Mastodon Statistics 2026 + New Scientist 140K-Twitter-migration analysis (1.6% actually-left figure); Wikipedia / ACM Conext-2024 paper on Bluesky-AT-Protocol timeline; Nostr v1 publication 2020; ActivityPub W3C 2018 standard; Jeremy Howard's llms.txt proposal (September 2024) + adopter directory; Cloudflare pay-per-crawl beta announcement; Jensen Huang AI agent projections (Microsoft Build 2025 + Nvidia statements); Satya Nadella's "agents are the new apps" (Microsoft Q2 2026 earnings); Gwern.net longitudinal site presence + "Writing for LLMs So They Listen" 2025 piece; Andy Matuschak (notes.andymatuschak.org); Maggie Appleton (maggieappleton.com/garden); Eric J. Ma vault.
