Ayaan

300 points · 2 submissions

Submissions

with AWS Kiro

+100

🎙️ VoiceGauntlet: Break your voice agent before the public does. 💥 Most teams ship voice agents after only testing the "happy path." Failures usually appear in production when users get angry, adversarial, or manipulative—resulting in broken policies, leaked data, or ignored escalations. 🚨 VoiceGauntlet is a spec-driven red-team harness that solves this. It turns a Kiro requirements.md spec into adversarial voice-agent attack scenarios, pressure-tests an ElevenLabs agent against those exact requirements, and sends the hardening tasks.md back into the same Kiro workflow. 🔄 Spec in. Attack out. Fix back. 🛠️ 💻 How we used Kiro (For Development & The Core Feature): First, we used Kiro for the entire development process of VoiceGauntlet. Every feature started as a Kiro spec (requirements → design → tasks) before a single line of code was written, using Kiro's AI agent to systematically implement our architecture. For the app's core functionality, Kiro isn’t just documentation; it is the source of truth. VoiceGauntlet uses a local MCP bridge to read your project’s actual requirements.md. It parses the acceptance criteria, turns them into ~20 adversarial test callers, and runs the attack. Once a failure is isolated, it generates a structured hardening task and writes it directly back to the Kiro spec folder as tasks.md. 📝 🗣️ How we used ElevenLabs (The Voice Substrate): We built the attack workflow entirely around the ElevenLabs voice-agent stack. VoiceGauntlet uses ElevenLabs Agents, the Simulate Conversation API, and specialized evaluation criteria for requirement-level checking. The underlying live-listen architecture is built around ElevenLabs signed URLs and WebSockets. (Note: For this hackathon demo, the calling stage utilizes a fast simulation mode to keep the visual attack loop tight and easily recordable). ⚡ 🎯 The end-to-end loop: 1️⃣ Product requirements are written in Kiro. 2️⃣ VoiceGauntlet reads that spec via MCP. 3️⃣ It generates hostile callers from the acceptance criteria. 4️⃣ It attacks the ElevenLabs voice agent. 5️⃣ It isolates the highest-risk failure and maps it to a specific requirement. 6️⃣ It generates the exact Kiro-friendly hardening markdown. 7️⃣ The task returns to Kiro as tasks.md. Don’t just test your agent. Pressure-test its requirements. 🛡️

Repo Demo

X LinkedIn

Submitted 23 Apr 2026

Hack #1: Firecrawl

with Firecrawl

+200

Meet Switchboard: the AI that actually picks up the phone and negotiates for you. 📞🤯 Most AI tools just give you a list of 20 links and tell you to figure it out. That’s useless. I built an autonomous sourcing desk that does the dirty work: 1️⃣ You give it a 30-second voice brief. 2️⃣ @firecrawl silently scrapes the live web for vendors, pricing, and hidden contact info. 3️⃣ @elevenlabs voice agents LITERALLY call the businesses, sit on hold, ask your questions, and pressure-test the options. 4️⃣ You get one vetted winner handed to you. AI shouldn't just browse the market. It should WORK the market for you. Live at switchboard.czarflix.me. If you think AI making outbound phone calls is the crazy future we were promised, drop a like.

Repo Demo