Why AI Won’t Replace the Human Touch in Performance Reviews
Performance reviews are a perennial headache in many organizations – often seen as bureaucratic, time‐consuming, and stressful for managers and employees alike. With the rise of AI tools, some argue we might automate or even eliminate traditional reviews. Is it realistic (or wise) to have AI do our performance reviews? I believe the answer is no: while AI can assist, it cannot replace the thoughtful human effort that makes reviews valuable. The idea that we can fully offload performance evaluations to algorithms feels as misguided as measuring a developer’s worth by lines of code. In this post, I’ll explore what people are saying on both sides, and why the “AI-only” approach is fraught with issues – backed by insights from industry leaders like Will Larson and others.
The Temptation to Automate Reviews
It’s easy to see the appeal of handing performance reviews over to AI. Annual reviews are “tedious”, “bureaucratic” chores for many managers, and writing self-assessments can be “painful and even cringe-worthy” for employees. AI promises to streamline this process. Proponents envision feeding data into a system and getting a polished evaluation out, sparing everyone the drudgery.
In fact, tools already exist to aggregate data from one-on-one notes, emails, peer feedback, and project metrics into a single coherent summary. The promise is that managers will no longer have to rely on fuzzy memory or scramble for last-minute input – the AI will surface everything important. This could enable more frequent, data-driven check-ins instead of one big yearly ordeal.
As one tech leader noted, “AI may be able to get us to a more frequent in-the-moment coaching and review conversation vs. the annual rigmarole”, though “we cannot take the human out of the equation”.
Advocates also argue AI can improve fairness and consistency. AI systems aren’t subject to the same moods or personal biases as humans. They can sift through large volumes of performance data to flag patterns objectively. For example, an HR tech report cites benefits like minimized bias, identifying top and bottom performers through data, more consistent feedback cycles, and better goal alignment across the company. AI can even detect sentiment in peer reviews or highlight if a manager’s evaluations show unconscious bias.
In theory, an AI-driven process bases evaluations on evidence rather than gut feel, which could make reviews “more efficient but also fairer” when done right.
It’s no wonder some managers have started experimenting with AI for first drafts. News reports reveal that both managers and employees are turning to tools like ChatGPT to help write reviews. They describe it as a way to get past the “terror of staring at a blank page”. Even senior leaders admit to using AI to generate a “shitty first draft” which they then edit. The allure is strong: why not let the machine do the heavy lifting of collating feedback and even phrasing it nicely?
However, as attractive as this sounds, there’s a catch. Several catches, in fact.
The Case Against AI-Only Performance Reviews
Despite the new tools and hype, fully outsourcing performance reviews to AI is risky and problematic. Here are some key reasons why AI alone falls short:
-
Garbage In, Garbage Out: AI is only as good as the information it’s given. If a manager hasn’t been diligently tracking goals, projects, and feedback, there’s little for an AI to synthesize. Any automated summary will be incomplete or skewed. As one Inc. analysis put it, “AI is only as good as the information it’s given. Without consistent tracking, there’s little for AI to synthesize.” In other words, you still have to do the managerial legwork – AI won’t magically fill in the blanks.
-
Loss of Human Insight: Performance reviews, at their best, provide thoughtful, nuanced feedback and coaching. An algorithm scraping data can miss context and the subtleties of human performance. If handled poorly, an AI-driven review might end up “replacing thoughtful feedback from managers with cold data,” stripping away the human element that makes reviews constructive. Numbers and text analysis can’t fully capture how results were achieved or the soft skills a person demonstrated. There’s a risk of focusing only on what’s easy to measure, not what truly matters.
-
Generic and Impersonal Tone: Today’s generative AI is notorious for sounding formulaic. If you let it write an evaluation without heavy editing, the result may be a cookie-cutter review that any employee could tell was auto-generated. As one HR leader observed, “I find ChatGPT writing is visible from space”. Employees value personalized, detailed feedback – not a boilerplate paragraph that could be about anyone. Reviews that feel impersonal can undermine trust in the process. The moment an employee senses their review was just spat out by a bot, you’ve lost credibility as a manager.
-
Accuracy and Hallucinations: AI language models often “confabulate”, i.e. make things up. They might insert accomplishments or incidents that never happened, or gloss over important issues not well-represented in the data. Lee Gonzales, an engineering director who uses ChatGPT for drafts, warned: “Never, ever, ever, ever take what comes out of these models as the truth. They make stuff up.” Relying on the AI without verification could put outright false information in someone’s review – a fast way to destroy the review’s integrity.
-
Biases in, Biases out: While one hope is that AI can reduce human rater bias, the reality is more complicated. AI models learn from historical data, which may contain its own biases. If an AI tool was trained on past performance reviews or biased feedback, it can reinforce those biases rather than eliminate them. And unlike a human, it won’t have the judgment to recognize when an assessment seems unfair or out of context for a person’s situation. Over-reliance on the AI could even “ingrain flaws, rather than fix them” if its suggestions aren’t carefully checked.
In short, an AI-only approach tends to produce superficial evaluations. It might tick the boxes, but it misses the soul of a good review – the insightful, tailored coaching and recognition that help an employee grow.
As management author Will Larson has pointed out in a related context, using simplistic metrics to judge people is a dangerous trap:
“Using productivity metrics to measure individuals… is performative… If you want to blame someone then just go ahead and blame someone; don’t waste your time getting arbitrary metrics to support it.”
When we substitute genuine evaluation with an automated metrics-driven report, we create a false sense of objectivity. It may even give managers “false confidence” while eroding trust with the team. This is akin to measuring a developer’s contribution by lines of code or number of pull requests – easy to count, but utterly disconnected from true impact and prone to distortion. Great performance can’t be reduced to an algorithm.
Why the Human Element Remains Irreplaceable
A performance review is more than a formality – it’s part of the relationship between employer and employee. However advanced our tools become, human judgment and empathy are irreplaceable in this process. Here’s why:
-
Understanding Context: A human manager can synthesize not just data, but context. You know the nuances of the projects your report tackled, the challenges they overcame, and the areas where they grew. You remember the contents of 1:1 conversations, career aspirations discussed, and how shifting business priorities affected goals. An AI won’t intuit these storylines unless you explicitly feed it every detail – and even then, it can’t truly understand the significance. Managers provide narrative and meaning to the raw facts of performance.
-
Alignment with Career Paths: Good evaluations measure someone’s growth against expectations for their role and level. Most companies have a career matrix or ladder defining what increasing impact looks like. As Will Larson, who has designed engineering career ladders at multiple companies, notes: “At each level, people want to know what the expectations are… at their best, career ladders are powerful tools for shaping culture.” It takes a manager’s insight to map an individual’s accomplishments to these expectations. AI might help retrieve examples, but assessing growth is a human judgment call. Larson actually laments that engineers too often dismiss performance systems (ladders, reviews, calibrations) as “bureaucracy” when in fact “these are really powerful systems” that deserve our time and care. In other words, performance management done right can hugely benefit an organization – but it requires managers to engage, not abdicate the responsibility to an app.
-
Trust and Personal Connection: Performance reviews can be emotional. They involve praise, constructive criticism, and sometimes tough messages about areas to improve. Delivering this effectively requires empathy and tact. A manager writing a review will (hopefully) choose words carefully, considering how the employee will feel. AI, in contrast, has no inherent empathy. Even if the content is correct, an AI-generated review can come across as cold or generic. Employees are keenly aware of whether their manager put thought into their evaluation or just regurgitated data. “Employees value personalized, detailed, and constructive input,” not robo-generated platitudes. Ultimately, trust in the review process hinges on the human relationship. As one HR guide bluntly states, “AI can speed up drafting, but it cannot replace a manager’s judgment. Human oversight is necessary to ensure accuracy, fairness, and the correct context.” In a similar vein, an industry commentator on LinkedIn emphasized that the “right mix of AI + human is the key” – technology can assist, but the human touch is non-negotiable.
-
Accountability and Effort: There’s also a principled reason not to let AI do the whole job: respect for the individual being reviewed. Writing a performance review is an act of leadership. It’s how you show your team members that you see them, value their work, and care about their growth. If a manager can’t be bothered to put in the effort – and instead throws a few bullet points into ChatGPT and calls it a day – that laziness will be apparent. One manager told Axios that “not reviewing, iterating and editing an AI response is lazy.” Your employees will likely agree. They deserve the manager’s effort, not a copy-paste robot assessment. Using AI isn’t a shortcut to skip engagement; at best it’s a tool to support a thoughtful process.
Finding the Right Role for AI (Assist, Don’t Replace)
None of this is to say AI has zero place in performance reviews. The consensus among experienced leaders is to use AI as an aid, not an outright replacement. The sweet spot is where AI handles the drudgery while humans handle the judgment and communication.
For example, AI can help comb through a year’s worth of notes and feedback to produce a first-pass summary – what one manager called a “thinking tool” to create a draft. It can highlight trends (e.g. an uptick in sales metrics, or recurring peer comments about teamwork) that you might want to mention. It might even suggest wording for complex feedback, which you as the manager can then refine to fit the person and the message you intend.
By all means, leverage AI to save time and prompt your memory. But the heavy lifting of analysis and decision-making stays with the manager. Lee Gonzales, the engineering director mentioned earlier, said he uses generative AI to help compose a review but then “edits from there,” and crucially, “then he sits down and talks to the person about it.” The AI can generate a polished paragraph, but only the manager can have the performance conversation that gives it meaning.
As the authors of one HR guide put it succinctly: “AI should enhance human judgment, not replace it.” A smart manager treats AI-generated text as a draft – something to critique, adjust, or even discard – rather than an oracle of truth.
To strike this balance, consider a few best practices if you experiment with AI in your review process:
-
Keep Good Records: Invest time throughout the year to document goals, achievements, feedback from others, and your own observations. This creates the rich input AI needs to be useful. If you have nothing but vague impressions, AI won’t magically create substance.
-
Use AI for Summaries, Not Final Judgments: Let the AI compile and summarize data points (e.g. project X launched, sales increased Y%, etc.). But determine the evaluation (e.g. meets expectations, exceeds, or areas to improve) yourself based on all the evidence and context. The AI can suggest phrasing for feedback, but the call on performance rating and narrative should be yours.
-
Review and Personalize the Wording: Never send an AI-written review to an employee as-is. Edit it heavily. Make sure the tone sounds like you, and that it highlights what you genuinely find important about the person’s performance. Add specific examples or anecdotes only you would know. Remove any statements that don’t ring true – AI might have inferred something incorrectly. This is where you prevent those “visible from space” AI tells and ensure the feedback will resonate with the individual.
-
Maintain the Human Conversation: Perhaps most importantly, use the written review as a springboard for a face-to-face (or video) conversation. The real value comes when you discuss accomplishments, clarify expectations, and jointly plan future development. That dialogue is something no auto-generated text can replace. As HR expert Purbita Banerjee noted, AI might enable more frequent check-ins, but it doesn’t eliminate “performance review conversations.” It’s in those conversations that employees feel truly seen and heard.
In embracing AI, don’t lose sight of the purpose of performance reviews: to help people grow, recognize their contributions, and align on how to succeed further. AI can crunch data and draft text, but it can’t have empathy, can’t tailor itself in real-time to how someone reacts, and can’t inspire trust on its own. Those remain human tasks.
Conclusion: AI as a Tool, Not a Replacement
The bottom line is that performance reviews still need a human touch. Much like other people-management processes, there’s a lot of nuance involved that technology alone cannot handle. We can (and should) use better tools – including AI – to gather insights and reduce administrative burden.
Done well, that means more time for managers to focus on the meaningful parts of the review: interpreting the data, making fair judgments, and communicating with care. But done poorly (for example, by letting a chatbot spit out a perfunctory review), AI usage could backfire badly, leaving employees feeling “reviewed in a sloppy manner” – exactly what we want to avoid.
To quote engineering leader Will Larson one more time: performance management systems like reviews are “really powerful” when treated seriously. They direct focus to what the organization values and give people feedback on their impact. If we abdicate that responsibility to an auto-generator, we risk reducing a powerful coaching tool into a shallow, check-the-box exercise. No one wants their hard work to be appraised by a robot on autopilot.
AI can assist in performance reviews, but it’s not a replacement for the manager. The role of AI is that of a helpful editor or analyst – a “glorified assistant” – not the author of the review. The manager must remain the author and owner of the message. Use AI to work smarter, but keep the process human.
Sources
- Megan Morrone, “Managers, employees turn to ChatGPT to write performance reviews,” Axios, Feb 12, 2024.
- Alyshia Hull, “We Know You Want AI to Do Your Performance Reviews. Don’t Let It Do Too Much,” Inc.com, Oct 9, 2025.
- Neelie Verlinden, “AI for Performance Reviews: Make Smarter Talent Decisions,” AIHR, 2026.
- Will Larson, “My skepticism towards current developer meta-productivity tools,” Irrational Exuberance (blog), Nov 18, 2020.
- Jennifer McGrath, “Stripe’s Will Larson on Designing a Performance Management System from Scratch,” DZone (interview), Mar 12, 2019.
- LinkedIn comments on AI in performance reviews (Bryan Ackermann’s post, 2023).
Links:
- https://www.axios.com/2024/02/12/chatgpt-human-resources-performance-reviews
- https://www.inc.com/alyshiahull/the-5-dos-and-donts-of-performance-reviews-in-an-ai-driven-workplace/91243048
- https://www.linkedin.com/posts/backermann_heres-what-happens-when-ai-takes-over-performance%E2%80%93activity-7302360810322833408-sFr9
- https://www.aihr.com/blog/ai-for-performance-reviews/
- https://lethain.com/developer-meta-productivity-tools/
- https://dzone.com/articles/stripes-will-larson-on-designing-a-performance-man
