550 points · 4 submissions
with Cursor
SHADOWPRO Boxing · AI Corner Coach: a browser-native, "Zero-Keyboard" boxing coach that transforms your device into an intelligent corner man. By merging real-time computer vision with low-latency generative voice, it provides the same immediate feedback you’d get from a human trainer in a professional gym. Video: https://www.youtube.com/watch?v=wULXzrlJQSw
Submitted 14 May 2026
with v0
https://www.youtube.com/watch?v=59D7wbcGWYc - video Redesigning the San Diego Zoo website with ElevenLabs offers a chance to transform a static informational site into an immersive, "living" auditory ecosystem. By leveraging ElevenLabs’ low-latency streaming and expressive voice cloning, you can turn a browser window into a sensory gateway to the wild. I focused on three pillars: Atmospheric Immersion, Personalized AI Guides, and Multilingual Accessibility. 1. The "Wild Whisper" Ambient Hero Header Instead of just a video loop, the hero section becomes a generative soundscape. The Tech: Use ElevenLabs’ Sound Effects API (or integrated audio generation) to blend realistic animal calls with a soothing AI-generated narrator. The Experience: As users hover over different regions of a stylized park map (e.g., Panda Ridge or Africa Rocks), the background audio shifts. If they hover over the Pandas, a gentle voice narrates real-time "Panda Facts" in a tone that mimics a peaceful forest. Dynamic Lighting/Audio: The voice and background sounds change based on the user's local time. At night, the site adopts "Dark Mode" with the sounds of nocturnal animals and a hushed, whispered AI narration. 2. "Adopt-a-Voice" Interactive Wildlife Guides Move away from standard chatbots. Give every major animal exhibit a unique "Voice." The Tech: Use ElevenLabs Voice Design to create distinct "personalities" for different animals. Rex the Lion: A deep, gravelly, authoritative voice. Pip the Penguin: A high-pitched, energetic, and fast-talking voice. The UI: Each animal page features a "Talk to [Animal Name]" button. Users can ask questions about their diet or conservation status. The API Implementation: Use the ElevenLabs Conversational AI (Agents). Integrate it with a Large Language Model (LLM) like Gemini to process the zoo's database, then stream the response through the ElevenLabs API so the animal "talks back" with human-like emotion and inflection. 3. Real-Time "Live Cam" Narration The San Diego Zoo is famous for its live cams. Let’s make them accessible and engaging for people who can't watch closely. The Concept: An "Audio-Only Mode" for the live cams. The Tech: Use an AI vision model to analyze the live feed (detecting activity, like a tiger waking up) and send a text description to the ElevenLabs WebSocket for instant, low-latency speech synthesis. The Result: A user working at their desk can listen to a "Radio Wildlife" broadcast that says, "You’re listening to the Tiger Cam. Right now, the Malayan tiger is pacing near the watering hole—it looks like it’s getting ready for a midday drink." 4. The "Global Conservationist" (Instant Dubbing) San Diego Zoo is a global hub for conservation. The website should reflect that by breaking language barriers instantly. The Tech: Use ElevenLabs AI Dubbing API. The Feature: When the zoo posts a new video interview with a conservationist in the field, the website offers a "Listen in your language" toggle. The Impact: Instead of subtitles, the video is dubbed in real-time into 30+ languages, retaining the original speaker’s emotional tone. This makes the zoo’s mission accessible to a kid in Brazil or a researcher in Japan as if the speaker were native to them. 5. Sensory-First UI: Audio-Guided Navigation For an ultra-modern, accessible feel, implement a "Eyes-Free" navigation mode. Haptic & Audio Cues: When a user scrolls past a section, a short, expressive AI voiceover provides a 3-second "vibe check" of what’s in that section. Personalized Tickets: When a user buys a ticket, they receive an "Audio Ticket" narrated by their favorite animal voice, giving them personalized tips for their visit (e.g., "Hey there! I'm Omo the Hippo. Make sure you visit me before 10:00 AM; that's when I'm most active!").
with Zed
Sirens of the Shore is classic "tower defense" type game with some interesting twists found only on the high seas! The ancient Sirens protect their homeland from being plundered by the greedy pirates, and they use their enchanting voices (via ElevenLabs AI) to change the physics of their world, and drive the pirates ships into the dangerous rocks! I thought it would be interesting to use the ElevenLabs api to inflluence the physics of the game and wasn't sure it was possible, but it came out quite nice. I used the power of ZED ai development to create a unique weapon system, gameplay, and make it fast and efficient for every computer as a browser-based game. What should have taken me a few months I was able to complete in days. I have never made any serious game before, certainly not to this caliber. The catchy siren music and pirate music is ElevenLabs music. The Pirates try to find their way ashore, to plunder the land and take as they wish. Who will win in this classic battle? The sounds of the Sirens or the brute force of the Pirates?! It's up to you to fight as the Sirens and use your voices to win the battle, and save your land! Video: https://www.youtube.com/watch?v=BMzXTuc-EeY
with AWS Kiro
For the first time in nearly two millennia, the oldest written magic in the world speaks again. These ancient Greek magical rituals, silent on papyrus for over 1,700 years, are now vocalized through cutting-edge AI voice synthesis. Every barbarous name, every divine invocation, every word of power that once echoed through the temples of Greco-Roman Egypt can now be heard, felt, and experienced in your browser. This is not just a translation, it is a resurrection of sound itself. This project combines scholarly translation, modern AI voice synthesis, real-time audio analysis, and procedural visual design to create an immersive experience that honors the original texts while leveraging cutting-edge technology. Every element, from the papyrus texture to the glowing text, is crafted to transport you to the world of ancient magical practice. Every vox magica (magical word) is spoken aloud using ElevenLabs multilingual text-to-speech AI. Ancient Greek pronunciation is reconstructed using modern scholarly consensus. As each word is spoken, it glows in amber/gold tones that pulse in perfect synchrony with the audio.
Submitted 7 May 2026
Submitted 30 Apr 2026
Submitted 24 Apr 2026