Hack #5: Kiro · AWS Kiro
23 Apr, 15:38
Auditorium is an AI-powered application that transforms any story, whether user-uploaded or AI-generated, into a fully immersive cinematic audio drama with character voices, background music, and dynamic sound effects. Reading long-form content can feel slow and passive, while traditional audiobooks often lack immersion and emotional depth. This project addresses that by converting text into a rich episodic audio experience that feels closer to a movie for your ears, making storytelling more accessible, engaging, and shareable.

I used Kiro's spec-driven development approach to design and implement the entire workflow, from story parsing and scene structuring to the audio generation pipelines. By defining clear specifications, I was able to leverage Kiro's AI agent to systematically build and iterate on features. Kiro's rigorous testing also helped keep development close to bug-free.

For audio generation, I used multiple ElevenLabs APIs:

- Text-to-Speech for expressive, multi-character voice generation
- Sound Effects API to add contextual environmental sounds
- Music generation to create cinematic background scores

These were orchestrated together to automatically produce cohesive, scene-aware audio dramas, showcasing how combining structured development (Kiro) with advanced audio AI (ElevenLabs) can unlock new storytelling experiences.
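To make the orchestration step concrete, here is a minimal sketch of how a structured scene could be expanded into the three kinds of generation jobs described above. It is an illustration under assumptions, not Auditorium's actual code: the `Scene`, `Line`, and `AudioJob` types, the `plan_scene` and `plan_episode` functions, and the voice identifiers are all hypothetical, and no ElevenLabs API is called here. In a real pipeline, each planned job would be handed to the matching service (Text-to-Speech, Sound Effects, or Music).

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Line:
    speaker: str  # character name, or "NARRATOR" (hypothetical convention)
    text: str


@dataclass
class Scene:
    title: str
    mood: str  # e.g. "tense"; used here to build the music prompt
    lines: List[Line]
    sound_cues: List[str] = field(default_factory=list)


@dataclass
class AudioJob:
    kind: str   # "music", "sfx", or "tts"
    scene: str  # title of the scene this job belongs to
    prompt: str
    voice: str = ""  # only set for "tts" jobs


def plan_scene(scene: Scene, voices: Dict[str, str]) -> List[AudioJob]:
    """Expand one scene into one music job, its sound-effect jobs, and one TTS job per line."""
    jobs = [AudioJob("music", scene.title, f"{scene.mood} cinematic background score")]
    for cue in scene.sound_cues:
        jobs.append(AudioJob("sfx", scene.title, cue))
    for line in scene.lines:
        # Speakers without an assigned voice fall back to the narrator voice.
        voice = voices.get(line.speaker, voices["NARRATOR"])
        jobs.append(AudioJob("tts", scene.title, line.text, voice))
    return jobs


def plan_episode(scenes: List[Scene], voices: Dict[str, str]) -> List[AudioJob]:
    """Plan every scene in order, so the job list follows the story."""
    return [job for scene in scenes for job in plan_scene(scene, voices)]


if __name__ == "__main__":
    # Placeholder voice identifiers, not real ElevenLabs voice IDs.
    voices = {"NARRATOR": "voice_narrator", "MARA": "voice_mara"}
    storm = Scene(
        title="The Storm",
        mood="tense",
        lines=[Line("NARRATOR", "Rain hammered the deck."), Line("MARA", "Hold on!")],
        sound_cues=["rolling thunder", "creaking ship timbers"],
    )
    for job in plan_episode([storm], voices):
        print(job.kind, "|", job.prompt, "|", job.voice)
```

Keeping planning separate from generation means the scene structure can be reviewed, or checked against a spec, before any audio is produced, and each job kind can be retried on its own.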
