Vai al contenuto
[ BLOG / AGENTI AI ]

AI booking agent for a multi-specialty clinic: the case

Illustrative scenario based on industry-typical patterns: how to design and build an AI agent for booking management at a clinic with 8 practices in 12 weeks.

Andrea Barberi 9 min

Editorial note: the scenario described in this article is an industry-typical pattern, built on elements common to many real projects in the private healthcare sector that we have studied. It is not a specific case from an Obsidian client: for our real cases we serve the privacy of the client first, even in anonymous form. When we have real cases publishable with authorization, they will be recognizable as such.

A multi-specialty clinic in central Italy, eight practices, seven medical specializations, a switchboard that in March was losing 30% of calls due to saturation. Let’s look at how to set up an AI booking agent project in this context, because the pattern applies to numerous Italian private healthcare facilities.

The context

The multi-specialty clinic in the scenario:

  • Eight practices distributed across two locations
  • Seven specializations: cardiology, orthopedics, gynecology, dermatology, ophthalmology, physiatry, psychology
  • Typical monthly volume: 4,500 bookings (60% new, 40% rescheduling or cancellations)
  • Switchboard with 3 part-time operators
  • Call peaks concentrated in 2 morning hours (8:30-10:30) and 1 evening hour (17:30-18:30)
  • Estimated missed call rate of 25-35% during peaks
  • Clinical business management system: Italian vertical software with partial REST APIs
  • Online payment system active since 2023 for service pre-payment

The concrete pain point reported by management: missed calls = patients going to competitors, low retention, overloaded operators with annual turnover of 40%.

What is available (project constraints)

  • Budget: €40-55k initial, €800-1,500/month at steady state
  • Timeline: go to production within 12 weeks (commercial need: autumn season for services)
  • Internal team: zero. Medical and administrative management + 1 part-time external IT contact
  • Compliance: GDPR Article 9 (health data), voice recording retention with specific rules, no extra-EU transfer of patient data
  • Channels: voice call (primary), WhatsApp (secondary), website (tertiary, already covered by a basic system)
  • Non-negotiable constraint: the agent CANNOT confirm bookings autonomously for specialists with variable schedules (cardiology, orthopedics). Only for visits with open availability can it close the full loop
  • Non-negotiable constraint: every call handled by the AI agent must be opt-in: at the start the patient hears “if you prefer a human operator, say ‘operator’” and routing is immediate

The chosen approach

Three crucial scope decisions, made in the first two weeks:

Decision 1: voice-first, not chatbot. 75% of bookings arrive by phone, especially given the clinic’s demographic (average patient age 52). Building a well-made WhatsApp chatbot first would have covered only 20% of the problem. The voice-first choice costs more (voice handling is much more complex than text) but it is where the pain is.

Decision 2: assisted agent, not autonomous. The AI agent does not replace operators for complex cases. It handles:

  • Visits with standard schedules (dermatology check-ups, ophthalmology routine visits, psychology first interviews)
  • Rescheduling and cancellations of existing bookings
  • Informational questions (hours, exam preparation, standard costs)

Specialties with dynamic schedules (cardiology with urgent echos, orthopedics with assessments requiring triage) remain with human operators. Complex calls are transferred with context (the agent passes the operator a textual summary of what has already been said).

Decision 3: privacy-first technical stack.

  • Voice: Twilio (EU provider, Germany datacenter) for call handling
  • ASR (speech-to-text): Whisper self-hosted on an EU instance
  • LLM: Claude 3.5 Sonnet via Anthropic Europe API + DPA with zero-retention agreement
  • TTS (text-to-speech): ElevenLabs (EU provider, Italian voice cloned from a consenting operator)
  • Storage: PostgreSQL on Hetzner Helsinki
  • Logs and analytics: no US service (no Datadog, no hosted Sentry), self-hosted Grafana Loki

The technical design costs more than a “convenient” integration with US services, but it is the only one the clinic’s DPO would have approved. GDPR Article 9 compliance for health data is not something to play with.

Execution in 12 weeks

Weeks 1-2: discovery and architecture

  • Shadowing an operator for 3 full days, recording (with consent) a sample of 100 real calls
  • Mapping of the 12 most frequent intents (representing 85% of volume)
  • Definition of technical architecture and routing flow
  • Setup of dev + staging environments

Weeks 3-5: core development and prompt engineering

  • Twilio + ASR + LLM bridge
  • Slot filling system to collect patient information (name, fiscal code via letter-by-letter dictation, requested specialty, preferred time slot)
  • Iterative prompt engineering on 200+ real scenarios derived from the recordings
  • Integration with clinical business management system API (read availability, write booking)
  • Human operator escalation logic (trigger schedule)

Weeks 6-7: voice cloning and internal testing

  • Recording of 40 minutes of speech from the consenting operator for voice cloning
  • TTS pipeline setup with cloned voice
  • Internal testing with the clinic team: each person calls 10 times simulating different cases
  • Iteration on tone, pauses, interruption handling

Weeks 8-9: testing with pilot patients

  • Activation on a secondary clinic number
  • Communication to 500 selected patients (“the clinic is testing a new automatic voice booking service, you can try it at this number”)
  • Structured feedback collection on the first 200 calls
  • Adaptation of prompts and flow on issues that emerged (e.g. regional dialects that the ASR struggled with, fiscal codes read in non-standard ways)

Weeks 10-11: partial roll-out

  • Activation on the main number, but only as a secondary line (it rings if the human switchboard is busy)
  • Intensive monitoring: every call handled by the AI agent is reviewed within 24h by the quality team
  • Quick adjustments on all anomalous cases

Week 12: full go-live

  • The AI agent answers directly as the first level
  • Operators handle only escalated calls or those where the patient said “operator”
  • Setup of management dashboard with live KPIs

The results

90 days after full go-live (the numbers are illustrative of industry-typical patterns, not specific):

  • Missed calls: from 30% to 4% (the agent always picks up, it does not saturate)
  • Booking completion rate via AI: 62% of total calls (above the initial 50% target)
  • Operator escalation rate: 38% (of which 22% by patient choice “operator”, 16% for cases the agent understands it cannot handle)
  • Clinic NPS on booking experience: from +12 to +34 (on a panel of 800 patients interviewed post-booking)
  • Average operator time per call: from 4’12” to 5’45” (they only handle complex cases, which require more time per call but fewer total calls)
  • Weekly operator hours on the phone: from 110 to 65 (reallocated to other non-telephone administrative activities)

What would make a difference in similar projects

Three things that, with hindsight, can be done better in scenarios like this.

1. More time on internal staff training. Operators are initially scared of AI (“it will replace us”). Two 2-hour sessions are the minimum, but four sessions distributed over two weeks would have been needed to properly explain what the agent does and does not do, and how escalation works from their side. Skipping this step costs avoidable operational friction in the first month.

2. Management dashboard visible from day 1. Setting up the dashboard only from week 10 is too late. Having a demo dashboard with simulated data already in week 3-4 helps management clarify earlier which KPIs they really want to monitor, avoiding refactors of the final dashboard.

3. More attention to regional Italian prompts. Central Italy has linguistic nuances that agents handle well on standard speech but struggle with on stronger accents from elderly patients. Collecting more regional speech samples in week 1-2 is an investment that pays off immediately.

Transferable lessons learned

1. Voice-first is 3-4x more complex than chatbot, but it is where the pain is in private healthcare. You cannot “start with WhatsApp and then think about voice”. The two channels have different architectures and the jump is not incremental.

2. Privacy first is not an obstacle, it is an asset. The EU-only and self-hosted stack choices added 15-20% complexity but made the project sellable internally to the clinic and with the DPO. In regulated sectors privacy-first technical design is an enabler, not an added cost.

3. Immediate opt-in to speak with a human is non-negotiable. Technologically the agent could handle 85% of cases. But the patient’s right to speak with a human if they prefer is fundamental (even more so in healthcare). Without this initial trust, adoption rates would have been much lower.

4. Quality review in the first weeks is the multiplier. Reviewing every call handled by the AI for the first 2-4 weeks is burdensome but is what differentiates a project that works from a project that fails in production. Almost all critical prompt fixes emerged from human review, not from automatic analysis.

5. Voice cloning requires explicit consent and easy opt-out. The sustainable practice is cloning the voice of a consenting operator with written agreement and opt-out at any time: the cloned voice of a real team member makes the agent more “natural” than standard synthetic voice, but consent must be managed formally.

FAQ

How much does a voice AI agent cost for a clinic of these dimensions?

In line with this scenario, €40-60k initial + €800-1,500 per month at steady state for clinics of 5-12 practices with volumes of 3,000-6,000 monthly bookings. The main items in the recurring costs: telephony costs (Twilio or equivalent), LLM costs, premium TTS costs (ElevenLabs or similar), monitoring and maintenance. Typical payback: 8-14 months.

Can it be done with a US-only cloud stack (Twilio, OpenAI, ElevenLabs cloud) to reduce costs?

Technically yes, reducing setup costs by 20-30% and marginally the monthly costs. Operationally, for Italian health data, the compliance risk is significant: extra-EU transfer of Article 9 GDPR data requires a specific DPO assessment. Most Italian DPOs in private healthcare today do not sign off. So technically possible, practically difficult.

How long does it take to train the agent on our specific facility?

In projects like this, the 12 weeks include training on the specific clinic. Iterative prompt engineering on the first 100-200 recorded calls is the heart of the work. It is not “training” in the ML sense (no fine-tuning of the model, too costly and rigid), but the construction of a system prompt + flow specific to the facility.

Does the AI agent also work for smaller clinics (2-3 practices)?

Below 1,500 monthly bookings the ROI takes longer to achieve (24+ months) and it is probably better to start from a preconfigured SaaS solution rather than a custom agent. Above 2,500 monthly bookings, custom becomes economically sensible. The 1,500-2,500 range should be evaluated case by case.

How is the case “the patient wants to cancel the booking 24h in advance” handled?

In the described scenario, the AI agent autonomously handles cancellations up to 48h in advance (immediately releases the slot, confirms via SMS to the patient, writes to the business management system). For cancellations under 48h the clinic’s penalty rule kicks in and the agent escalates to the operator for case-by-case handling (sometimes there are legitimate medical reasons, sometimes not, the evaluation remains human).

Conclusion

A voice AI agent for bookings in private healthcare is a project achievable in 12 weeks with a €40-60k budget, if the scope analysis in the first weeks is done well. The critical part is not the AI technology (now mature) but the decisions on what the agent handles and what it does not, and on how the human-AI coexistence is communicated to patients and staff.

If you run a private healthcare facility or a multi-specialty clinic and you are evaluating adopting an AI agent for bookings, let’s talk. We can do a specific analysis of your case before getting into scope.

To go deeper: the pillar page AI agents, the page dedicated to the AI booking agent for clinics, and the healthcare sector page for other specific applications in the sector.

Tags: agenti-aisanita-privatacase-studyscenarioprenotazionivoice-ai