Future of Work··5 min read

Why Human Judgment Still Matters in Hiring

Every few years, a new tool promises to take the human bias out of hiring and replace it with something more rigorous. The tool always underperforms on the thing that matters most: predicting whether a specific person will succeed in a specific role at a specific company. Here is why that prediction remains stubbornly human.

future of workhiringAIjudgmentexecutive searchtalenthumanMajhi Groupassessment

Manas Majhi
Manas Majhi

Founder, Majhi Group & Majhi OS

Why Human Judgment Still Matters in Hiring

I have been doing executive search long enough to have seen several generations of technology promise to transform the assessment of human candidates. Structured interviews. Personality assessments. Predictive hiring algorithms. Skills-based evaluations. AI sourcing tools. Video interview analysis.

Each of these tools has genuine value in specific contexts. Each of them has also produced a version of the same mistake: the confusion of correlation with causation, of pattern-matching with judgment, of what can be measured with what matters.

I want to be precise about this, because I run a hiring operations company that uses technology throughout the search process, and I don't want to be read as a technophobe defending an old model. AI tools have made parts of what we do meaningfully better. What they have not changed — and I believe cannot change in the near term — is the core judgment that determines whether an executive placement succeeds.

What the judgment actually is

When I assess a candidate for a VP or C-suite role, I am not primarily evaluating their resume. The resume is a starting point, not a conclusion.

What I am evaluating is a much harder set of questions: How does this person think about a problem I put in front of them? What do they reveal about themselves when they're uncertain versus when they're confident? How do they talk about the people they've managed — with specificity and genuine insight, or with generalities that suggest they weren't paying close attention? What does the delta between their public track record and what their references say tell me about their self-awareness?

These are not inputs to a model. They are observations that I am making in real time, filtered through everything I've seen in hundreds of similar situations, and combined into a judgment that is irreducibly subjective — but not arbitrary. Subjective means it depends on a subject. In this case, the subject is an experienced professional who has been wrong, learned from being wrong, and developed a more calibrated instinct over time.

The AI tool does not have that calibration. It has pattern matching against historical data. That is useful. It is not the same thing.

Where technology genuinely helps

I want to be fair to the tools because I use them and they matter.

Technology has made candidate sourcing dramatically more efficient. What used to take a research team a week to compile — a qualified list of candidates in a specific functional area, in a specific geography, at a specific company stage — can now be done in hours with the right tools. That efficiency is real and it allows the human judgment to be applied to a better-curated set of candidates rather than to a noisier, less organized universe.

Technology has also improved the administrative layer of search: scheduling, communication, documentation, status tracking. These are areas where human error and inconsistency create friction, and where systematization improves the client and candidate experience without requiring any of the judgment I've described.

The right frame is: technology is raising the floor of what a search process produces, and freeing human judgment to focus on the ceiling. The sourcing, the scheduling, the first-pass screening — these can be better done with good tools. The final assessment, the candidate advocacy, the negotiation, the offer management — these require humans who know what they're doing.

The fit judgment that cannot be automated

The hardest judgment in executive search — the one where I've been most often right and most consequentially wrong — is cultural and contextual fit.

A candidate can be excellent in an absolute sense and wrong for a specific company at a specific stage. I have seen brilliant executives fail in companies where the culture required a different operating style. I have seen people who looked marginal on paper succeed enormously because the company's specific challenge was the one they were built for.

That judgment — whether this person, in this company, at this moment, with this team — is not derivable from a model. It requires understanding the company's culture from the inside, which means the kind of relationship with the CEO and leadership team that you build over years of work together, not from a brief. It requires understanding the candidate as a person, not as a profile.

When I make that judgment call and I'm right, it produces placements that the client describes as transformative. When I get it wrong — which I have, and which any honest search practitioner will acknowledge they have — the damage is significant: a failed executive hire at the VP level costs a company 12-24 months and anywhere from $500K to $1M+ when you account for the direct cost, the lost productivity, and the team disruption.

The cost of getting it wrong is why the judgment matters, and why the judgment needs to be human. Not because humans don't make errors — we do, regularly — but because the errors we make are recoverable in a way that the errors a model makes are not. When I get it wrong, I can explain why, I can learn from it, and I can do the relationship repair that a failed placement requires. When an algorithm gets it wrong, there's no relationship, no accountability, and no recovery path.

The festival of context

Here's an illustration that has stayed with me.

Several years ago, I was placing a senior finance executive at a company going through a difficult restructuring. On paper, the candidate was strong: the right functional background, the right sector experience, strong references. In the interview process, he was excellent.

What I noticed, and what I flagged to the client before the offer was extended, was a subtle pattern in how he talked about ambiguity. When I asked him about the last time a major plan changed significantly and he had to adapt, the structure of his answer was technically complete but emotionally thin. The words were right. The affect was slightly off — a quality of rehearsal rather than genuine recollection.

I recommended they do one more reference call, with a specific question about how this person handled a major plan failure. The reference, which the candidate had provided, said something between the lines that confirmed my read: "He's excellent when the direction is clear. When things are truly unclear, he needs more structure than you might expect."

That was a deal-changer for a restructuring role. The client went in a different direction.

I could not have told you, before that conversation, exactly what I was looking for. There was no rubric. It was pattern recognition accumulated from hundreds of conversations with people at that level, filtered through the specific requirements of that role and that company. The technology helped me find him. The judgment is why we didn't place him.


Manas Majhi is the founder of Majhi Group and Majhi OS. He has placed C-suite and VP leaders globally and believes the human element in executive search is a feature, not a bug.