The model is the stale one
2026.162I pointed two frontier models at my own docs to see which one to pay for. Both flagged the correct, freshly-updated pages as stale -- the audit was poisoned by the thing being audited.
tooling
all tags →I pointed two frontier models at my own docs to see which one to pay for. Both flagged the correct, freshly-updated pages as stale -- the audit was poisoned by the thing being audited.
Three AI code-review tools sit on the same model. The one that guarantees coverage did it with a for-loop, not a better prompt.
An LLM in the middle of a fetch pipeline took orders from the file it was summarizing.