There's a moment that happens dozens of times a day in sales calls across every industry. A prospect opens their Zoom participant list — the little grid of faces and names on the right side of the screen — and they see it: Otter.ai. Or Fireflies. Or Fathom. A name that isn't a person. A name that ends in ".ai".
They don't say anything. They never do. They just know.
They know this conversation is being recorded. They know it's being transcribed. They know it's being fed to a machine that will extract information from it. And they know the person on the other side of this call deployed that machine — without asking.
The trust cost nobody talks about
Enterprise sales trainers spend enormous energy teaching reps to build rapport in the first three minutes of a call. The psychology is well-established: people buy from people they trust. Trust is built through vulnerability and candor — through the feeling that a conversation is private.
A recording bot breaks that feeling the moment it appears in the participant list. The prospect's guard goes up. They start choosing words more carefully. The candid admission about budget constraints that would have helped you tailor your pitch? Gone. The honest "we're also talking to your competitor" that would have helped you sharpen your positioning? Never said.
You've traded a chance at a real conversation for a transcript.
This cost is especially acute in specific sales contexts:
- Early-stage discovery calls, where the whole point is to get a prospect to articulate a pain point they haven't fully admitted to themselves yet. People don't do that on record.
- Late-stage negotiations, where both sides are exploring creative deal structures. Nobody floats a number they might regret in front of a bot that's writing everything down.
- Executive conversations, where seniority correlates directly with sensitivity to surveillance. A VP who would never consent to being recorded casually on the street definitely noticed your Otter.ai in the participant list.
None of this shows up in your CRM. The deal just stalls and eventually goes cold, and the post-mortem conclusion is "prospect wasn't ready" when the actual cause was "prospect stopped being honest with us in week two."
How participants react — and why they stay silent
The counterargument you'll hear from teams that use recording bots: "If it were a problem, people would just say something." This is wrong, and the reason it's wrong is worth understanding.
When someone sees a recording bot in a meeting they didn't organize, they face a social dilemma. Asking the host to remove it is an accusation — it implies you have something to hide. Most people, especially in professional contexts, will not do something that makes them look paranoid or obstructive. So they say nothing. They just adjust their behavior.
The research on this is decades old. The mere presence of a camera — even a camera that isn't recording anything — changes how people behave. It's called the observer effect, and it applies as much to a name in a Zoom participant list as it does to a CCTV camera in a break room. The awareness of being observed is sufficient. The observer doesn't need to be watching.
Legal note: In many US states and most of the EU, recording a call without all-party consent is illegal regardless of your intent. Meeting bots typically handle this with a consent banner, but that banner itself is a signal — to your prospect — that they're about to be recorded. The consent prompt doesn't neutralize the trust cost. It amplifies it.
What "local audio processing" actually means
The alternative to a cloud-hosted recording bot is processing audio on the device where the meeting is happening. Here's what happens with a cloud-based meeting bot:
- A bot participant joins your call via the meeting platform's API.
- The bot receives the audio stream from the meeting server.
- That audio is sent to a third-party cloud service for transcription.
- The transcript is processed by an AI model.
- The output is returned to you through the vendor's web app or integration.
There are four or five different parties who touch your audio before you see a summary. The meeting platform. The bot vendor. The transcription provider. The AI provider. Potentially a CRM integration. Every hop is a data exposure surface.
With local audio processing, the path is different:
- The macOS system audio is captured by the local app using the operating system's audio APIs.
- The audio is processed on-device, in real time, using a lightweight model running locally.
- When a response is needed from a large language model, only the text of the prompt is sent to an AI provider. Not the audio. Not a recording. Just the question.
- The audio itself is never stored, never uploaded, and never shared with anyone.
The difference matters at every level. It matters for compliance (no audio leaves your jurisdiction). It matters for security (no audio sits in a third-party S3 bucket). And it matters for the conversation itself — because there is nothing to join, nothing to announce, nothing that appears in your prospect's participant list.
The participant list problem is architectural
Meeting bots announce themselves because they have to. They join calls via the meeting platform's API, which requires presenting as a participant. Zoom, Google Meet, and Microsoft Teams don't have a "silent observer" mode. If a piece of software wants to receive the audio from a call, it needs to be in the call. And if it's in the call, it's in the participant list.
The only way to avoid this is to not be in the call at all.
Sway doesn't join your call. It runs as a native macOS application that captures system audio — the same audio that goes to your headphones — and processes it locally. From the meeting platform's perspective, it doesn't exist. From your prospect's perspective, there's no extra participant, no bot name, no recording notice.
Why this matters more for sales than any other use case
Post-call transcription tools — Otter, Fathom, Fireflies — are genuinely useful for teams that need structured notes, CRM entries, or searchable records of customer conversations. That's a real workflow problem and they solve it reasonably well. The tradeoff is the trust cost, and for many teams, especially those with internal meetings or calls where the counterparty has already signed a data processing agreement, that cost is acceptable.
But for sales teams doing outbound calls, prospecting, and deal-stage conversations with people who haven't yet decided to buy from you, the calculus is different. The person on the other side of that call hasn't committed to anything yet. They're still deciding whether to trust you. Putting a bot in the room — even a helpful one, even a legal one — tips that calculus in the wrong direction.
Real-time AI assistance, delivered privately and locally, changes what's possible. Instead of reviewing a transcript after the call and wishing you'd said something different, you have suggestions in the moment — talking points, objection responses, competitive context — available while the conversation is still happening. And none of it required asking your prospect's permission, because none of it was visible to them.
That's not a feature. It's a different category of tool.
No bot. No record. Just real-time AI.
Sway runs locally on your Mac. No meeting bot joins your call. Your audio stays on your device. Free 7-day trial.
