SM> saswatbuilds
> CASE STUDY · Voice AI Agents

Podit — AI Voice Event Agent

A hybrid voice + text agent that plans, schedules, and protects your calendar

ROLE: AI engineer — voice architecture, agent graph, integrationsWHEN: 2025STATUS: LIVE

Podit is a hybrid voice and text AI agent for intelligent event planning and scheduling. Users talk or type to plan events; the agent detects scheduling conflicts, books into Google Calendar, and respects personal guardrails like sleep and work hours so it never schedules over them. The voice loop runs at sub-500ms latency, fast enough to feel like a real conversation rather than a delayed assistant.

> The problem

The pain this had to solve

Scheduling is one of the most common things people want to hand to an assistant, but most voice agents fail at exactly the part that matters: they book over conflicts, ignore personal boundaries like sleep and work hours, and lag badly enough that the conversation feels broken. A scheduling agent that double-books or wakes you up is worse than no agent at all.

Podit needed to be conversational and fast, but also disciplined — it had to reason over a live calendar, catch conflicts before committing, and honor each user’s preferences as hard constraints, all while staying responsive enough on voice to feel natural.

> The approach

What I built — the architecture

Hybrid voice + text

One agent serves both a voice channel (via Twilio) and a text channel, so users can switch between speaking and typing without losing context.

Sub-500ms voice loop

Tuned the speech-to-response pipeline — streaming, model selection, and pruned tool calls — to keep round-trip latency under 500ms so the conversation feels live.

Conflict detection

Before booking, the agent reads the live Google Calendar and checks for overlaps, flagging conflicts and proposing alternatives instead of double-booking.

Dynamic guardrails

User preferences such as sleep and work hours are enforced as dynamic guardrails, so the agent never schedules into protected time even when asked carelessly.

Agent orchestration

A LangGraph state graph (on OpenAI models) sequences understanding, calendar reads, conflict checks, and confirmation, with shared state persisted in Supabase and a React Native client.

BUILT WITH
LangChainLangGraphOpenAITwilioReact NativeSupabase
> The result

What it delivered

<500ms
voice round-trip latency
Voice + text
hybrid agent, one shared context
Calendar-integrated
live Google Calendar reads + writes
Guardrail-aware
respects sleep / work hours, no double-booking

Podit holds real-time voice conversations at sub-500ms latency, reads and writes a live Google Calendar, catches conflicts before booking, and enforces sleep/work-hour guardrails so it never double-books or schedules over protected time — a scheduling assistant users can actually trust with their calendar.

A dependable engineer who can be trusted with complex, high-stakes work

Ajay S., Founder
> RELATED
SERVICE
Voice AI Agents service
I build real-time conversational voice agents that answer and place calls, hold natural multi-turn conversations, take actions in your systems, and hand off to a human when it matters — with sub-second latency so callers never feel like they are talking to a robot.
ARTICLE
AI Voice Agent Cost in 2026: Build, Per-Minute & Total Cost of Ownership
A custom AI voice agent costs $5,000–$25,000 to build plus ~$0.05–$0.35/min to run in 2026 — build tiers, per-minute economics, and ROI.
ARTICLE
Retell vs Vapi vs Bland vs Twilio: How to Choose a Voice AI Stack (2026)
Vapi, Retell, and Bland are voice-AI orchestration platforms; Twilio is the telephony beneath them. How they compare on latency, control, and cost.
ARTICLE
How to Build a Voice AI Agent: An Architecture Walkthrough (2026)
An engineer’s end-to-end guide to a real-time voice AI agent: telephony, STT, turn-taking, the LLM, TTS, barge-in, human handoff, and the latency budget.

Want results like this for your team?

Tell me what you want to automate. On a free 30-minute call I’ll tell you straight whether it’s worth building, roughly what it costs, and how I’d approach it — no pitch, no obligation.

Book my free 30-min AI scoping call
Free · 30 min · no obligation · reply within 1 business day