- Roko's Basilisk
- Posts
- The Tithonus Curse
The Tithonus Curse
Plus: Flipkart's India push, the BEAD broadband battle, and ByteDance seeks $20B.
Here’s what’s on our plate today:
🧪 AI's Tithonus curse: immortality without renewal.
📰 Flipkart's quick-commerce push, the fight over broadband billions, and ByteDance's record loan.
🛠️ Weekend To-Do: audit a model for drift, read the ML debt paper, set up drift monitoring.
🗳️ Poll: What's the right response to silent model decay?
Let’s dive in. No floaties needed…

AI Agents Are Reading Your Docs. Are You Ready?
Last month, 48% of visitors to documentation sites across Mintlify were AI agents, not humans.
Claude Code, Cursor, and other coding agents are becoming the actual customers reading your docs. And they read everything.
This changes what good documentation means. Humans skim and forgive gaps. Agents methodically check every endpoint, read every guide, and compare you against alternatives with zero fatigue.
Your docs aren't just helping users anymore. They're your product's first interview with the machines deciding whether to recommend you.
That means: clear schema markup so agents can parse your content, real benchmarks instead of marketing fluff, open endpoints agents can actually test, and honest comparisons that emphasize strengths without hype.
Mintlify powers documentation for over 20,000 companies, reaching 100M+ people every year. We just raised a $45M Series B led by @a16z and @SalesforceVC to build the knowledge layer for the agent era.
*This is sponsored content

The Laboratory
TL;DR
Decay, not failure: Machine learning systems degrade silently as the world drifts from their training data, accumulating ‘technical debt’ that stays invisible until a costly mistake forces it into view.
The bodies are real: A 2023 suit alleges that UnitedHealthcare's nH Predict denied elderly care with a claimed 90% error rate, that Zillow's pricing model triggered a $540M+ write-down and 25% layoffs, and that Google Flu Trends overshot the 2013 peak by roughly 140%.
Loops make it worse: Outputs reshape the next training set. Predictive policing keeps sending patrols to the same blocks, and credit scoring is less accurate for thin-file borrowers, hardening bias into data.
Stakes: Gartner says that at least 50% of GenAI projects will be abandoned after proof of concept. Treat models as finished products, not standing obligations, and the harm scales quietly.
AI's Tithonus curse: immortality without renewal
In Tennyson's retelling of the old Greek myth, the goddess Eos begs Zeus to grant her mortal lover, Tithonus, eternal life, but forgets to ask for eternal youth. The wish is granted exactly as worded. Tithonus does not die; he simply ages without end, while his divine companion remains luminous and unchanged beside him, until he is left withered beyond all recognition, a voice without a use, pleading for the release that immortality has permanently set out of reach. Tennyson's reading is unsentimental: to persist without renewal is to decay in place, indefinitely. The myth has found an uncomfortable second life in the way we now build software to last.
A trained machine learning model is a strange kind of artifact. It is designed to endure, and unlike conventional software, it rarely announces the moment it stops working. Ordinary code fails loudly, with an error thrown, a service down, or an alert firing somewhere, while a model fails quietly. It keeps returning answers in the same confident format it always has, while the distance between the world it learned and the world it is now asked to describe widens month by month. Researchers have a name for the residue this leaves behind. They call it 'technical debt,' and a growing body of evidence suggests the bill comes due in ways that are hard to reverse.
What AI technical debt is
In ordinary software development, technical debt is the future cost of a shortcut taken today: code that works now but will demand rewriting as the requirements around it shift. Machine learning inherits that problem and quietly multiplies it, because the shortcuts within a model are not always visible in the code. A widely cited paper from Google argued that real-world machine learning systems incur ongoing maintenance costs through boundary erosion, tangled dependencies, hidden feedback loops, and what its authors called 'undeclared consumers,' meaning other systems that come to rely on a model's outputs without the original team ever knowing. The learning code itself, in their account, is a small box inside a much larger machine made of data pipelines, monitoring, and connective tissue. When any part of that surrounding apparatus drifts, the model drifts with it, often invisibly, until the outputs begin to diverge from reality in ways that matter.
Why models decay when code does not
The deeper trouble runs beneath the infrastructure, because a model can rot even when every line of code around it stays pristine. The phenomenon is called 'model drift,' and its cause is almost banal: the world keeps moving, and the model does not. Its assumptions calcify around a snapshot of reality that recedes further into the past with each passing month.
A pricing algorithm trained before 2020 has no concept of pandemic-era supply shocks. A hiring tool built on a decade of resumes can quietly encode the preferences of a labor market that no longer exists. A diagnostic system cannot recognize a condition that emerged after its training cutoff. In every case, the model keeps producing confident output anchored to a world that has expired, and the people leaning on that output often have no way to know the reference point is gone until something goes badly wrong.
The damage has already surfaced in courtrooms and on balance sheets. A class action filed in 2023 alleges that UnitedHealthcare leaned on a flawed AI tool, nH Predict, to cut off post-acute care, the rehabilitation and nursing care that follows a hospital stay, for elderly Medicare Advantage patients. The plaintiffs claim a 90% error rate, meaning roughly 9 of every 10 denials that patients appealed were ultimately reversed. UnitedHealth disputes that characterization and says the tool does not make coverage decisions, but the litigation has been allowed to proceed, and the gap the complaint describes, between what the model predicted and what physicians judged a body actually needed to recover, is the gap of a system grading a world it no longer understands.
Zillow offers the same lesson in a different register. The company's algorithmic home-buying arm could not keep pace with a rapidly turning housing market, and in late 2021, Zillow shut down the operation, cut roughly 25% of its staff, and absorbed a write-down of more than $540M after its model proved unable to forecast home prices.
Google Flu Trends supplies the earliest cautionary tale of all. The system overshot the 2013 flu peak by roughly 140% because it could not distinguish searches driven by genuine illness from those driven by media coverage of illness. The correlations it had learned slowly lost their meaning as behavior changed, and the model had no way to notice that the ground beneath it had moved.
The feedback loop problem
Past the decay of any single model lies a more structural failure, one in which a system's own outputs reshape the data it will be trained on next. Predictive policing is the most studied case. A model trained on discovered crime, meaning the arrests officers happen to make, sends patrols back to the same neighborhoods, and those patrols generate more recorded incidents, which the model reads as more underlying crime, which justifies sending still more officers, until the map stops measuring the world and starts manufacturing it. A peer-reviewed analysis of these systems showed they are prone to 'runaway feedback loops' that keep directing police to the same areas regardless of the true crime rate, with the distortion growing in proportion to the gap between neighborhoods.
Credit scoring runs a similar loop on a longer fuse. People with thin credit files receive less reliable scores, are denied loans, and thereby miss out on the repayment record that would have sharpened the next assessment. The largest study of real mortgage data, covered by MIT Technology Review, found that predictive tools are meaningfully less accurate for minority and low-income borrowers, a shortfall that traces to the thinness of their credit data rather than to bias inside the algorithm, and one that fairer algorithms alone cannot fix.
Why organizations struggle
The scale of all this is coming into focus as more deployments mature and the gap between launch-day and current performance becomes clearer. Gartner had predicted that at least 30% of generative AI projects would be abandoned after proof of concept by the end of 2025, citing poor data quality, weak risk controls, rising costs, and unclear value. By that deadline, the firm reported the figure had reached at least 50%. Looking further out, Gartner expects organizations to abandon 60% of AI projects through 2026 due to a lack of data foundations needed to keep models current.
The reason AI maintenance is so punishing is that it does not behave like ordinary IT maintenance. Patching a bug means finding the broken code and replacing it. Maintaining a model means asking whether the world itself has shifted in a way that quietly invalidated the model's assumptions, a harder question that rarely raises its hand. Some systems look stable for long stretches, then fail all at once as small accumulated errors cross a threshold, with no visible signal until a consequential mistake arrives. The economics are unforgiving even before AI enters the picture: McKinsey reports that chief information officers put technical debt at 20 to 40% of the value of their entire technology estate, a tax that compounds as systems age, and AI layers on complexities that conventional quality frameworks were never built to hold.
Stewardship over intelligence
Sitting with this limitation has prompted several organizations to stop treating AI systems as finished projects and to start treating them as ongoing obligations. In practice, that means monitoring models for signs of drift, keeping clean pipelines ready to retrain them as conditions change, and building governance that specifies who is accountable when an output causes harm. It also means accepting that these systems carry recurring costs that never taper off after the initial build. Regulators in Europe and elsewhere are reinforcing the same posture, with rules that require companies to demonstrate that their systems remain accurate, fair, and safe long after launch, rather than only at launch. The organizations handling this best are the ones that quietly moved the finish line, from the moment of deployment to the full span of a system's working life.
The cost of immortality without care
What undid Tithonus was the half of the wish that went unspoken: renewal. Eternity only deepened the loss, stretching a body and a purpose long past their natural end. A model can easily outlive the conditions that made it trustworthy, and left alone, it does not retire gracefully. It grows brittle, drifts away from the reality it was trained to read, and produces harm at scale with no alarm to mark the moment it stopped being right. The way out is custodial: letting these systems be questioned, retrained, and sometimes allowed to end when the world they were built to understand has moved past them. The alternative is a kind of machine intelligence that survives its own usefulness, maintained in perfect form while hollowed of function, still answering in the same steady voice long after it has stopped having anything true to say.


Headlines You Actually Need
Flipkart's quick-commerce push: Walmart-backed Flipkart is expanding its quick-commerce operations as Amazon ramps up in India, intensifying the battle for the country's fast-delivery market.
The fight over broadband billions: The BEAD broadband program is becoming a flashpoint, with Trump, Musk, and Bezos all positioned to shape who gets billions in federal funding.
ByteDance's record loan: ByteDance is seeking $20B in its largest-ever offshore loan, a sign of how aggressively the TikTok parent is financing its global AI and infrastructure ambitions.

Build your site on Framer, now with Agents
Framer is a pro website builder trusted by companies like Miro and Perplexity, helping creators, teams, and businesses ship production-ready sites faster than ever.
With AI agents built directly into the canvas, teams can design pages, manage CMS content, write copy, add SEO, and audit for issues — all without leaving the tool where the real site lives.
*This is sponsored content

Friday Poll
🗳️ AI models decay silently as the world moves on. What's the right response? |

Weekend To-Do
Audit a model for drift: Pick any AI tool you rely on and test it against current real-world data; you'll quickly see whether its assumptions still hold.
Read the Google "hidden technical debt" paper: The foundational paper on ML technical debt is the clearest explanation of why models rot even when code doesn't.
Set up basic drift monitoring: Try an open-source tool like Evidently AI to see how teams actually catch model drift before it causes harm.
Meme Of The Day
The Toolkit
Regie.ai: AI sales agent that researches prospects, writes personalized outbound at scale, and handles follow-ups so reps can focus on closing.
Replit AI: A browser-based coding environment with an AI agent that builds, debugs, and deploys apps from prompts, with no local setup required.
Sourcegraph: Code intelligence platform with Cody, an AI assistant that understands your entire codebase so it can answer questions and write code that actually fits.

Rate This Edition
What did you think of today's email? |





