Executive Summary
Vertical Insure, a leading provider of embedded insurance solutions for vertical software platforms, sought to modernize their customer support operation by deploying an enterprise AI support agent. Their internal estimate for a safe, validated deployment was twelve months. Swept AI completed it in six weeks.
Before launch, Swept AI audited the AI agent and uncovered critical failure modes that would have reached customers undetected: the system fabricated financial figures, merged unrelated policy types into single answers, and constructed plausible but incorrect contact information. These errors appeared in confident, well-formatted responses that would have been difficult for customers or support staff to identify as wrong.
Through a structured six-week engagement, Swept AI restructured the knowledge base, built a domain-specific test suite of several hundred real customer scenarios, implemented targeted guardrails, and supervised the deployment through production launch.
The results:
- 60-70% automation rate on real customer inquiries, validated against Swept's test suite
- Zero customer-facing hallucinations since launch
- Six-week deployment versus a projected twelve-month internal timeline
- Ongoing monitoring that catches regressions before they impact users
Vertical Insure now operates a production AI support system with confidence, visibility, & continuous oversight.
Company Background
Vertical Insure makes insurance easy by embedding protection directly into software platforms used for sports registration, event management, travel bookings, and more. They handle approximately 1,500 customer inquiries per month. In insurance, incorrect answers create liability and erode trust. There is little margin for error.
When Vertical Insure decided to implement an AI customer support agent, they recognized the deployment required more scrutiny than the vendor's standard onboarding process could provide.
The Problem: AI Systems Fail in Ways That Are Difficult to Detect
Often, Enterprise AI support agents are sold as production-ready out-of-the-box.
And although Vertical Insure's AI agent appeared polished during initial demos, Swept AI's evaluation revealed a different picture.
Finding 1: Cross-Contamination Between Policy Types
The AI merged information from unrelated insurance products into single responses. Answers were confident and well-formatted, but factually incorrect. A customer asking about youth sports coverage might receive guidance that incorporated irrelevant details from event insurance policies.
Finding 2: Fabricated Financial Information
The AI generated dollar amounts that did not exist in any source material. Outdated fees, inferred benefit totals, and invented costs appeared in responses. These figures looked legitimate but were hallucinated.
Finding 3: Poor Website Data Extraction
The vendor marketed the ability to "learn from your website instantly." Testing showed the AI correctly extracted only approximately 2.5% of relevant website content. It could not distinguish authoritative pages from incidental content, and combined website fragments with help articles unpredictably.
Finding 4: Fabricated Contact Information
Left unchecked, the AI constructed plausible but incorrect email addresses. Formats like support@companyname or help@companyname sounded professional but did not route to real inboxes.
Finding 5: Historical Ticket Ingestion Would Degrade Performance
Vertical Insure initially planned to import thousands of historical support conversations to train the AI. Swept AI's analysis showed this approach would overweight edge cases, teach outdated practices, and reduce accuracy on common questions.
The AI looked polished on the surface. Swept AI showed us exactly what was happening using our data.
Swept AI's Approach
Swept AI's engagement followed a structured methodology designed to identify risks before deployment and establish ongoing monitoring afterward.
1. Knowledge Base Restructuring
Vertical Insure's help content was spread across multiple systems with inconsistent formatting. Swept AI consolidated and reorganized this material into a structure the AI could reliably interpret. This eliminated ambiguity that caused the AI to blend information from unrelated sources.
2. Test Suite Development
Swept AI built a comprehensive test suite of several hundred scenarios drawn from real customer inquiries. These tests covered common questions, edge cases, and potential failure modes identified during initial evaluation.
3. Hazard Detection
Swept AI ran the AI agent through the full test suite and documented every failure mode. This included correct answers, incorrect answers, partially correct answers, and cases where the AI expressed inappropriate confidence.
4. Iterative Optimization
Swept AI worked through three to five optimization cycles per week. Each cycle involved adjusting the AI's configuration, grounding data, or guardrails, then re-running the test suite to measure impact. Changes that improved one area were validated against the full suite to ensure they did not create regressions elsewhere.
5. Guardrail Implementation
Based on the hazard detection findings, Swept AI implemented specific protections: financial figures were restricted to approved values only, contact information was limited to verified addresses, website ingestion was disabled to eliminate unpredictable source contamination, and escalation pathways were configured for topics outside the AI's validated scope.
6. Dashboard and Visibility
Swept AI built a custom audit dashboard that showed Vertical Insure exactly how the AI performed across the test suite. This provided visibility that the underlying platform did not offer.
7. Supervised Launch
Vertical Insure went live only after all major risks were addressed and performance reached consistently high accuracy across the test suite. Swept AI validated the final configuration and confirmed that escalation pathways functioned correctly.
8. Ongoing Monitoring
Swept AI continues to monitor Vertical Insure's AI agent in production. This catches performance drift and regressions that the vendor's system does not surface.
That dashboard changed everything. We could finally see how the AI agent behaved, not just hope it would actually work.
Results
Deployment Timeline
Vertical Insure reached production deployment in under six weeks. Without Swept AI's structured approach, the Vertical Insure team estimated this process would have taken them at least a year internally and involved discovering problems through customer complaints.
If we had launched without Swept AI, we would have found these issues the hard way, with real customers and not in time.
Lessons for AI Deployment
Vertical Insure's experience illustrates several principles that apply broadly to AI deployments in customer-facing contexts.
Vendor demos do not represent production behavior
Demo environments are optimized for common cases. Production environments surface edge cases, data conflicts, and failure modes that demos do not reveal. Systematic testing is required to understand how an AI system will actually perform.
AI systems require domain-specific validation
Generic accuracy metrics do not capture whether an AI system handles the specific questions that matter for a particular business. Test suites must be built from real customer behavior, not synthetic benchmarks.
Guardrails must be designed around observed failures
Effective guardrails address specific failure modes discovered through testing. General-purpose safety features do not catch domain-specific errors like fabricated policy details or incorrect contact information.
Ongoing monitoring is necessary
AI systems drift over time as underlying models change, knowledge bases are updated, and edge cases accumulate. Continuous monitoring catches regressions before they impact customers.
The cost of undetected errors is high
In regulated industries and high-stakes contexts, incorrect AI responses create liability, erode trust, and generate support escalations that exceed the cost of proper validation.
Conclusion
Vertical Insure successfully deployed an AI customer support agent by treating the implementation as a validation problem, not just an integration task. Swept AI's audit identified failure modes that would have reached customers, and the subsequent optimization process eliminated those risks.
The engagement produced a system that automates a majority of support inquiries while maintaining the accuracy standards required in insurance. Ongoing monitoring ensures that performance remains stable as the underlying AI continues to evolve.
60-70%
Automation Rate
The AI agent now handles 60-70% of customer inquiries with high-quality responses. This figure is based on Swept AI's domain-specific test suite, not vendor-provided metrics.
Zero
Customer-Facing Errors
Zero customer-facing hallucinations since launch. The guardrails and validation process eliminated fabricated figures, incorrect contact info, and cross-contaminated guidance.
Reduced
Tier-1 Workload
Reduced Tier-1 support workload without adding headcount. The support team now focuses on complex cases, claims issues, and exceptions that require human judgment.
Ongoing
Visibility
Vertical Insure receives alerts when AI performance drifts, enabling proactive intervention before issues impact customers.
