Prompts drift. It starts innocently -- someone fixes a typo in the system prompt, another engineer adds a sentence to handle an edge case, a product manager requests a tone adjustment. Within three months, the prompt that is running in production bears little resemblance to the prompt that was tested before launch. Nobody can tell you what changed, when it changed, or why. Nobody can revert to a known-good version because nobody recorded what the known-good version was.
This is not a hypothetical scenario. It is the current state of prompt management at most organizations building with language models. Teams that would never deploy application code without version control are deploying prompts by pasting text into dashboards, editing strings in environment variables, and storing "the good one" in a Slack thread from six weeks ago.
The fix is not complicated. It is the same fix that software engineering adopted decades ago: version control, semantic versioning, automated testing, and deployment infrastructure. The tools already exist. The discipline just needs to be applied.