Amazon Staff Are Inventing Busywork to Hit AI Usage Quotas

5 min read 1 source clear_take
├── "Mandating AI usage by volume metrics inevitably produces fake adoption — this is Goodhart's Law in action"
│  ├── Fast Company (Fast Company) → read

Reports that Amazon employees are fabricating tasks to meet internal AI usage targets, asking questions they already know the answers to and running code through AI assistants unnecessarily. The article frames this as a predictable consequence of tracking adoption by volume rather than by outcomes.

│  └── top10.dev editorial (top10.dev) → read below

Argues this is the textbook manifestation of Goodhart's Law: when AI usage metrics become targets tied to performance reviews, employees rationally game the system. The editorial emphasizes this isn't Amazon-specific but rather the inevitable result of the industry's 'mandate phase' of AI adoption.

├── "This reflects industry-wide pressure to justify massive AI investments through adoption numbers"
│  └── top10.dev editorial (top10.dev) → read below

Contextualizes Amazon's mandate within the broader tech industry's shift from 'build AI tools' to 'prove people are using them,' noting hundreds of billions in collective AI investment from Anthropic, OpenAI, Google, and others. Amazon's aggressive internal dogfooding culture and strategic bets on AWS Bedrock, Amazon Q, and Alexa LLM efforts make it the highest-profile case of conflating consumption metrics with productivity.

└── "Amazon's dogfooding culture makes it uniquely susceptible to this kind of metric gaming"
  └── top10.dev editorial (top10.dev) → read below

Notes that while fabricated AI usage isn't exclusive to Amazon, the company's scale and aggressive internal dogfooding culture — where employees are expected to be showcase users of their own products — makes it the most visible example. Amazon's performance review system, which ties employee standing to these adoption signals, creates especially strong incentives to game the metrics.

What happened

Amazon employees are reportedly under mounting pressure to demonstrate increased usage of the company's internal AI tools — most notably Amazon Q, the company's AI coding assistant and enterprise productivity tool. According to reports from Fast Company and corroborated by discussions on Hacker News (where the story scored 164 points), workers across multiple divisions have responded to this pressure in the most predictable way possible: they're making up tasks.

The dynamic is straightforward. Management wants to see AI adoption numbers go up. Those numbers get tracked. Employees, whose performance reviews and standing within the company are influenced by these signals, start generating artificial interactions with AI tools — asking questions they already know the answers to, running code through AI assistants when they'd be faster doing it manually, and creating workflows that exist primarily to register as "AI usage" in whatever dashboard their managers are watching.

This isn't an Amazon-specific failure — it's the inevitable result of measuring tool adoption by volume rather than by outcomes. But Amazon, given its scale and its aggressive internal dogfooding culture, is the highest-profile case yet of what happens when a company confuses AI consumption metrics with AI-driven productivity.

Why it matters

The tech industry is deep into what you might call the "mandate phase" of AI adoption. After two years of breathless investment — Anthropic, OpenAI, Google, and others raising hundreds of billions collectively — the pressure has shifted from "build AI tools" to "prove people are using them." Amazon, which has staked significant strategic bets on its AI portfolio (AWS Bedrock, Amazon Q, internal Alexa LLM efforts), has every incentive to show that its own workforce is a showcase for AI-augmented productivity.

But Goodhart's Law doesn't care about your strategic roadmap. When a measure becomes a target, it ceases to be a good measure. Amazon's situation is a textbook case: the company wants to demonstrate that AI tools make workers more productive, so it tracks usage. Workers respond by generating usage that has nothing to do with productivity. The metric goes up. The actual value delivered stays flat or declines (since time spent on fake tasks is time not spent on real work).

The Hacker News discussion, which drove much of the signal on this story, surfaced a telling range of reactions. Many commenters — including self-identified current and former Amazon employees — noted that this pattern isn't new. Amazon has a long history of metric-driven management culture, from the infamous stack ranking days to warehouse productivity tracking. The AI mandate is just the latest version of a recurring organizational pathology: when leadership wants a number to go up, the number goes up, regardless of whether the underlying reality changes.

Other commenters drew parallels to enterprise software rollouts of the past — Salesforce adoption mandates, Jira ticket velocity tracking, mandatory Confluence documentation. In every case, the pattern is the same: mandate usage → track metrics → employees game metrics → leadership sees adoption curve → reality diverges from dashboard.

What makes the AI version more concerning is the cost structure. Every fake query to an LLM-powered tool burns real compute — GPU cycles, API calls, and inference costs that are non-trivial at Amazon's scale. Unlike clicking through a Salesforce form, fabricated AI usage actually consumes expensive resources. Amazon is essentially paying for its employees to generate training data for a metric that doesn't measure what it claims to measure.

What this means for your stack

If you're an engineering leader or CTO evaluating AI tool adoption in your own organization, Amazon's situation is a cautionary tale with specific, actionable lessons.

First, measure outputs, not inputs. The right question isn't "how many times did your team use Copilot/Cursor/Q this week?" It's "did code review turnaround time decrease? Did bug introduction rate change? Did time-to-first-PR for new hires improve?" If you can't tie AI tool usage to a downstream productivity metric that matters independently, you don't have an adoption strategy — you have a compliance exercise.

Second, let adoption be pull-based, not push-based. The teams and individuals who get genuine value from AI coding assistants tend to be the ones who adopted them voluntarily because the tool solved a real friction point. Mandating usage across the board — including for senior engineers who may legitimately be faster without the tool for certain tasks — creates exactly the perverse incentives Amazon is now experiencing. The better approach: make the tools available, remove friction from onboarding, showcase wins from early adopters, and let organic adoption curves tell you where the tool actually helps.

Third, watch for the "demo day" trap. When AI adoption becomes a performance signal, you'll start seeing AI usage optimized for visibility rather than utility. Engineers will use AI for tasks that are easy to screenshot in a status update rather than tasks where AI genuinely accelerates their workflow. This is the organizational equivalent of teaching to the test — and it produces the same hollowed-out results.

The broader industry context matters here too. Companies like Microsoft (with Copilot), Google (with Gemini for Workspace), and Salesforce (with Einstein) are all running similar internal adoption pushes. Amazon may be the first high-profile case of the gaming problem, but it won't be the last. Any organization tracking AI usage volume as a KPI should treat this story as a leading indicator.

Looking ahead

The irony is thick: a company at the forefront of AI development is discovering that you can't mandate your way to genuine productivity gains. The employees who are fabricating AI tasks aren't lazy or resistant to change — they're rational actors responding to incentive structures. Amazon will likely adjust its approach, possibly shifting to outcome-based metrics or team-level productivity measures that are harder to game. But the broader lesson for the industry is already clear. AI tools deliver real value when they solve real problems. When they become compliance checkboxes, you get exactly what you'd expect: checked boxes and unchanged workflows. The companies that win the AI productivity race won't be the ones with the highest usage dashboards — they'll be the ones who never bothered building those dashboards in the first place.

Hacker News 347 pts 391 comments

Amazon workers under pressure to up their AI usage–so they're making up tasks

→ read on Hacker News

// share this

// get daily digest

Top 10 dev stories every morning at 8am UTC. AI-curated. Retro terminal HTML email.