When a Content Network Starts Publishing to Itself

📊 Full opportunity report: When a Content Network Starts Publishing to Itself on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A content network with 474 sites is self-publishing heavily to only a few, leaving over half inactive. The imbalance stems from internal supply and placement issues, now being addressed through targeted fixes.

A large automated content network with 474 WordPress sites is primarily publishing to just 8% of its sites, leaving the majority inactive, according to a recent audit. This uneven distribution poses risks to SEO and content diversity, highlighting systemic issues in the network’s supply and placement systems.

The network operates through two separate systems: Stenvrik, which sources and assesses news signals, and DojoClaw, which rewrites and distributes content across the sites. Despite the system’s design to decouple content sourcing from distribution, an audit revealed that 80% of posts were concentrated on only 38 sites, mainly in the technology category, while 249 sites received no posts over 28 days.

This imbalance was caused by two interconnected issues. First, the content matching algorithm repeatedly surfaced the same popular tech sites, limiting opportunities for other sites to publish. Second, the content supply was heavily skewed toward tech topics, with only about 13% of sites in that category, while many other categories such as Home, Health, and Food received little to no content. Addressing these problems involved targeted adjustments to the distribution system, including caps on site outputs and a recency-based ordering system to promote less active sites.

Balancing a 474-site network — ThorstenMeyerAI.com
ThorstenMeyerAI.com
AI & Tooling · Engineering Note
Systems at scale

When a content network starts publishing to itself

A 474-site network quietly collapsed onto 38 of its own favorites while half the catalog went dark. The throughput graph looked fine. The fix wasn’t one thing — it was two causes and a three-part repair across two decoupled systems.

Stenvrik

News-intelligence layer

Ingests hundreds of feeds, scores & geo-tags stories, surfaces what’s trending.

SUPPLY · what’s worth covering
DojoClaw

AI content engine

Rewrites a story in each site’s voice and fans it out across the catalog.

PLACEMENT · where it lands & how it reads
01The symptom

80% of output on 8% of sites

A 28-day audit, bucketed per site, was lopsided in a way the totals had hidden. Every individual placement was “correct” — the aggregate was a slow-motion failure.

Where 28 days of syndication actually landed

474-site catalog · per-site audit
Top 38 sites8% of catalog
80% of all posts
Top 4 sitesall tech titles
200+ articles/week each
249 sites53% of catalog
ZERO posts — half the network dark
02The diagnosis · refuse the obvious
WordPress Explained: Your Step-by-Step Guide to WordPress (2020 Edition)

WordPress Explained: Your Step-by-Step Guide to WordPress (2020 Edition)

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Not one bug — two independent causes

The tempting move is to blame the matcher and move on. The data showed two distinct problems living on two different systems, each needing its own fix.

Cause 1 · DojoClaw

Within-topic concentration

The matcher kept surfacing the same broad tech sites for every tech story, and rotation only shuffled candidates within the matched pool. A site that never entered the pool could never get a turn — fair only among the already-chosen.

Cause 2 · Stenvrik

Supply ≠ demand

53% of supplied content was tech/AI — but only ~13% of sites are. The catalog skews the other way, so those sites starved for on-topic material.

supply
tech/AI content in53%
demand
tech/AI sites in catalog~13%
03The load balancer · flip it
Survival Audit SEO, GEO and AEO: Ignored by AI? Losing visibility on Google? Evaluate and Adjust! (Book 2 - Taking Action)

Survival Audit SEO, GEO and AEO: Ignored by AI? Losing visibility on Google? Evaluate and Adjust! (Book 2 – Taking Action)

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Watch the network rebalance

Each square is one of the 474 sites; color is how much it’s publishing. Toggle the selection logic to see placement spread off the red-hot favorites and into the dark long tail.

Placement simulator

Same matcher relevance gate either way — the only change is how candidates are ordered after it.

38
sites carrying 80% of posts
249
dark sites · zero posts
overloaded
hottest sites at ~30/day
dark · 0 light healthy busy overloaded
04The three-part fix
Mastering GitHub Actions: Advance your automation skills with the latest techniques for software integration and deployment

Mastering GitHub Actions: Advance your automation skills with the latest techniques for software integration and deployment

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Placement, supply, throughput

Two causes meant the fix had to touch both systems — and only then could the ceiling rise without re-concentrating the load.

1

Placement levers

DojoClaw
  • Per-site weekly cap — any site over 25 posts/7d drops from the pool, pushing selection into the long tail (relaxes only if it would starve a fan-out).
  • Global LRU — order by network-wide recency, not just within-topic, so sites idle across the whole network float to the top.
  • Starvation floor — guaranteed by construction: the most-idle eligible site is always within the picks.
2

Supply rebalance

Stenvrik
  • Audited existing feeds for liveness — removed ones returning HTTP 200 but zero items (broken RSS).
  • Added a verified batch across Home, Garden, Health, Food, Fashion, Auto, Science, Pets & more — every feed fetched live first, weighted to the most idle categories.
  • Flagged throttled feeds (big publishers exposing only 1–2 items) for replacement rather than burying the risk.
3

Throughput raise

Scheduler
  • Fan-out width maxSites 5 → 7 — the extra slots land on fresh sites because the cap is now enforcing.
  • Quota depth K 2 → 3 — every category’s daily cap scaled ×1.5.
  • Honest note: a documented ~950/day intent the code never delivered (units quirk) stays gated behind a sign-off.
05What it adds up to
Amazon

website content balancing plugins

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

The scoreboard — with an honest asterisk

The change is behavioral: it shapes future placement, it doesn’t retroactively rescue the month sites sat dark. The proof is in the next weeks of data — which is why the instrumentation is the real deliverable.

Metric
Before
After
Concentration
80% on 38 sites
cap + LRU + floor
Dormant sites
249 (53%)
shrinking ↓
Feed sources
245
271 verified
Daily ceiling
~188/day
~280/day · +49%
Fan-out width
5
7
Why two systems, not one

Supply and placement are genuinely separate concerns. Diagnosing the imbalance meant looking at both sides and seeing they disagreed. A clean boundary made a failure that spanned both legible — good system boundaries organize thought, not just code.

The tradeoff taken

Ordering by load & idleness sacrifices a little topical ranking for dramatically better coverage. All candidates already cleared the relevance gate — so it’s a deliberate trade, not a regression.

ThorstenMeyerAI.com
Stenvrik (news-intelligence) ↔ DojoClaw (content engine) · figures reflect the May 2026 engineering audit & the behavioral changes made in response · the network’s response is being tracked.

Implications of Content Distribution Imbalance

This imbalance can lead to several issues, including a skewed content landscape that favors certain topics and sites, potential SEO penalties for over-published sites, and diminished value for the majority of the network’s sites. The systemic nature of the problem shows how automated systems can inadvertently reinforce biases, risking long-term sustainability and diversity of the content network.

Background of Automated Content Networks and Distribution Challenges

This content network, managed with a division of labor between the news-sourcing system (Stenvrik) and the distribution engine (DojoClaw), was designed to operate independently but efficiently. Prior to the recent issues, the system functioned as intended, but the audit revealed that the distribution was heavily concentrated on a small subset of sites. Similar challenges in automated content management have been observed in other large-scale systems, where balancing supply and demand remains complex and prone to unintended biases.

"The system was quietly publishing to a handful of sites while leaving the rest dormant. It’s a classic case of automation reinforcing itself in unexpected ways."

— Thorsten Meyer, system operator

Unresolved Aspects of the Distribution Imbalance

It remains unclear whether these fixes will fully resolve the imbalance long-term or if further systemic adjustments will be necessary. The ongoing monitoring will determine if the distribution becomes more equitable across the entire network or if new issues emerge as the system adapts.

Next Steps for Restoring Content Balance

The team plans to continue refining the distribution algorithms, including adjusting site caps and recency priorities, and will monitor the network’s output over the coming weeks. Further technical audits are expected to assess whether the changes lead to a more balanced and sustainable publishing environment.

Key Questions

Why is the network publishing mostly to a few sites?

The distribution algorithm favors already active, popular sites, and the supply of content is heavily skewed toward certain topics, causing the system to concentrate posts on a small subset of sites.

Could this imbalance harm the network’s SEO or reputation?

Yes, over-publishing on a few sites can appear spammy to search engines and reduce content diversity, potentially impacting rankings and overall credibility.

Are the fixes guaranteed to solve the problem?

While initial adjustments aim to promote more equitable distribution, ongoing monitoring is necessary to confirm whether the imbalance is fully addressed or if further tuning is needed.

Will other categories besides tech benefit from these changes?

Yes, the goal is to diversify content across all categories by balancing supply and improving distribution algorithms, which should help less active sites in categories like Home, Health, and Food.

Source: ThorstenMeyerAI.com

You May Also Like

The mandate. Why the US conversational- finance surface does not translate to Europe.

The US launches permissionless personal finance surfaces, while Europe’s regulatory framework mandates licensed, consent-based access, shaping market dynamics.

The license. Why the AI content market pays the brand-name corpus and strands the long tail.

Analysis of how licensing favors large publishers, marginalizes small ones, and the potential of collective licensing to address structural inequalities.

The referral. How AI search severs the content-for-traffic contract that funded the open web.

AI search now answers queries directly, ending the traditional referral traffic to publishers. This shift impacts revenue models for publishers of all sizes.

Data retention cleanup assistant for small law firms

Small law firms are testing a new data retention cleanup assistant aimed at simplifying old matter file reviews, with initial validation underway.