ASAPP logo icon.
👋 Want to talk to our generative AI agent?
Click below to experience GenerativeAgent in action
Talk to GenerativeAgent: Try it now
Learn more about GenerativeAgent first
I’m interested in a demo
Table of Contents

Stay up to date

Sign up for the latest news & content.

Published on
June 2, 2026

Why Callers Demand a Human and How to Change Their Mind

Table of Contents

The Reflex to Opt Out

Before the conversation has really begun, the caller has already decided.

"Operator." "Wanna speak to a representative."

This was a real exchange from a call to an insurance company's AI-powered voice line. The bot had just introduced itself—warmly, even with a note of reassurance: "For car rental questions, you can talk to me like a real person. How can I help you today?" — and the caller's response, delivered in two flat sentences, was a refusal. Not a response to what was said, but a preemptive rejection of the entire interaction.

Another caller, on a different day, got the same welcome and said: "Speak to agent. Speak to agent." And then, a moment later, as if the bot had miraculously transformed into a human: "Have my rental car coverage extended." The request was there. The willingness to ask it to the bot was not.

This pattern of opting out before the conversation has a chance to begin is one of the defining challenges of voice AI in customer service. And understanding its psychology reveals something that most AI deployments get wrong.

A Deficit Built on Experience

The urge to demand a human agent is not irrational. It is learned.

Every caller who dials a service line brings with them an accumulated history of phone trees that didn't understand them, IVR menus that looped back on themselves, and automated systems that confidently mishandled their request. They have been trained, by repeated experience, to expect that the bot will fail and to act accordingly.

Researchers at Microsoft who studied early voice assistants found that users arrived at interactions carrying detailed prior expectations, shaped almost entirely by failure. When those expectations weren't met, disengagement was immediate and difficult to reverse. 

The problem wasn't the specific system in front of them; it was the category. "Voice bot" had become a prediction: this is not going to work.

This makes the opening seconds of any voice AI interaction unusually high stakes. The caller has already formed a hypothesis. The first thing the bot says either confirms it or challenges it.

The Limits of Tone

A natural instinct, when you know callers are skeptical, is to try to reassure them. To sound warmer. To acknowledge their possible frustration. To distance the system from the robotic IVRs of the past.

This reasoning led to a real change at one of our insurance industry customers using CXP GenerativeAgent® offering car rental benefits as part of its coverage. Their voice bot had been greeting callers with: "For car rental questions, you can talk to me like a real person. How can I help you today?"

The phrase "talk to me like a real person" was intended to lower the guard. It was replaced with something less awkward: "I understand you have a question about your car rental benefits. How can I help?"

The result, measured across three full days of calls, was essentially flat. More than four in ten callers were still opting out before the first real exchange, nearly identical rates under both versions.

This is worth sitting with. Two different phrasings, one explicitly designed to be more welcoming than the other, produced indistinguishable outcomes. The callers who wanted to speak to a human wanted to speak to a human regardless. Tone was not the problem, and tone was not the fix.

What Callers Are Actually Asking

The "agent request" is often framed as a preference: some people just prefer human contact. And for a small subset of callers, that may be true. But for the majority, something else is happening.

Research on why people resist AI systems has identified a specific mechanism: what one group of consumer researchers called “uniqueness neglect"—the feeling that an automated system cannot account for your particular situation, your specific circumstances, your individual case. When you call about a car rental claim after an accident, you are not calling with a generic question. You want to know about your car, your claim, your options right now. A bot that greets you as if it has no idea who you are, or why you might be calling, immediately signals that uniqueness neglect is warranted. Of course it can't help. It doesn't even know who I am.

The agent request, seen this way, is not a preference for humans. It is a prediction that the bot lacks the context to be useful.

Demonstrating What You Know

The real change at this insurance enterprise customer came when they replaced the static welcome with something structurally different. Instead of a generic greeting, GenerativeAgent now opens each call with what it actually knew about the person on the line:

"Hello! I see you're calling about your DODGE JOURNEY. Your most recent claim is authorized, which means your rental benefits are ready to go! Do you have questions about how to get your rental car, extend your rental, or get reimbursed?"

"Hello! I see you're calling about your JEEP CHEROKEE. For your most recent claim, an independent inspection company will need to visit your vehicle, which usually happens within two or three business days. In the meantime, do you have other questions about car rentals?"

These messages are not longer. They are not warmer. What they are is informed. They demonstrate, before the caller has said a word, that the system knows their vehicle, knows the status of their claim, and has already oriented the conversation around their situation.

The effect on behavior was striking. The rate at which callers asked for a human agent dropped by 10 percentage points. Engagement—callers who stayed on the line and had a real conversation—rose from 54% to 65%. The callers who, under the previous version, had said "operator" before the bot finished speaking were now asking questions.

Grouped bar chart titled 'How a personalized welcome message changed caller behavior.' The chart compares two metrics — the percentage of callers who asked for a human agent in the first two turns (orange bars) and the percentage who engaged substantively with the bot (purple bars) — across three versions of a welcome message. In v1 ('talk to me like a real person'), 44% asked for a human and 49% engaged with the bot. In v2 ('I understand you have a question'), 44% asked for a human and 54% engaged with the bot. In v3 (Dynamic, personalized), 34% asked for a human and 66% engaged with the bot. Arrows from v2 to v3 highlight the improvement in both metrics.
Deploying dynamic, personalized greetings to customers led to a drop in requests for a human agent: from 44% to 34%. The 10 percentage point drop represents a reduction of 23%.

A caller whose Dodge Journey claim was ready to go responded, "Yes... yeah, the rental car now." A caller about a Jeep Cherokee asked, "Can I rent the car from anywhere?" These were not passive participants being dragged through a script. They were people who had arrived expecting to demand a human and had instead decided the bot might actually know something.

A natural experiment in a telecom company, studied by researchers at several Chinese universities and published in 2023, showed the same dynamic in reverse: every time an AI agent failed or struggled in a call, customers became more likely to seek human agents, and prior AI interactions influenced behavior in later service encounters. The first impression of capability casts a long shadow.

The Callers Who Still Opt Out

Not every caller changed their behavior. About a third still asked for a human, even after being greeted with personalized, specific information about their situation.

One caller, after hearing that their FORD F150's rental benefits were authorized and ready to go, responded: "Agent. Agent. Agent."

This is a legitimate outcome. Some callers have made a categorical decision—based on prior experience, personal preference, or the nature of their issue—that they want to speak with a person. Reaching this group through better bot design is neither realistic nor necessarily the right goal. What matters is that this group is smaller, and more clearly defined, than it appears when every caller is given a generic greeting that offers them no reason to stay.

A New Kind of Success

The personalized welcome introduced a behavior that had no analogue in the previous versions: the silent hangup that is actually a resolution.

Under the static messages, a caller who hung up without saying anything had almost certainly disengaged: confused, or unwilling to wait. But under the dynamic welcome, a new pattern emerged. Some callers heard the opening message, got the information they needed—claim status confirmed, benefits ready to go—and disconnected. No question asked, no agent requested. Their call was complete.

"Hello! I see you're calling about your DODGE CHARGER SXT. Your most recent claim is authorized, which means your rental benefits are ready to go! Do you have questions about how to get your rental car, extend your rental, or get reimbursed?"

[Caller disconnects.]

This is easy to misread as a failure. Traditional metrics, including turns per conversation, containment rate, task completion, would flag it as an abandoned call. But for a caller who dialed in to check whether their claim was approved, being told in the first sentence that it is approved is exactly the outcome they wanted. The interaction was short not because it went wrong, but because it went completely right.

Roughly 3% of calls under the dynamic welcome appeared to follow this pattern: silent hangups with no follow-up call within two hours. Small, but meaningful. It represents a category of value that a static welcome message cannot deliver.

The Underlying Principle

Across all three versions of the welcome message, a single principle emerges: Callers are not objecting to the bot because it is a bot. They are objecting because they have no evidence that it can help them.

Change the phrasing, and the objection remains. Give them evidence—specific, concrete, accurate evidence that the system knows who they are and what their situation is—and a meaningful share of those objectors become participants.

The opening of a voice AI interaction is not a formality. It is an audition. The question callers are silently asking the moment the bot begins speaking is: does this thing know what it's doing? A generic greeting, no matter how warmly written, answers that question with a shrug. A greeting that demonstrates real knowledge answers it with a credential.

Most voice bots are still auditioning with the shrug.

Stay up to date

Sign up for the latest news & content.

Loved this blog post?

About the author

Nirmal Mukhi
Chief Architect at ASAPP

Nirmal Mukhi is the Chief Architect at ASAPP, where he builds machine learning capabilities and products. Prior to joining ASAPP, Nirmal held leadership positions in engineering and research at IBM, where he was R&D lead for Watson Education, and served as CTO at TRANSFR. He has over 30 publications (with 4500+ citations), 15 patents, and has appeared on a Discovery Channel documentary about AI.