The model is the stale one
2026.162I pointed two frontier models at my own docs to see which one to pay for. Both flagged the correct, freshly-updated pages as stale -- the audit was poisoned by the thing being audited.
ai
all tags →I pointed two frontier models at my own docs to see which one to pay for. Both flagged the correct, freshly-updated pages as stale -- the audit was poisoned by the thing being audited.
Three AI code-review tools sit on the same model. The one that guarantees coverage did it with a for-loop, not a better prompt.
An attention mask drawn in ASCII, and the architectural trade it names.
The old failure mode was reading every line equally. The new one is trusting code that looks right.
An LLM in the middle of a fetch pipeline took orders from the file it was summarizing.
A wallet lets you spend. That's the easy half.
Your code already leaves the machine. The agent just gave it another surface.
Credentials got easier to claim. Artifacts didn't.
The deal was never about the license. It was about the loop.
When quality looks suspicious, people stop producing quality.
Retries fix distributed systems. Repeating yourself fixes attention asymmetry.
The bugs are quieter now. Just wrong assumptions buried in correct-looking code.