Hack #2: Cloudflare · Cloudflare
1 Apr, 14:45
Pidgyn is a dating app where language barriers don't exist. Record a voice bio, browse profiles worldwide, and hear everyone in your own language spoken in a clone of their actual voice. What it does Pidgyn lets you date anyone on earth regardless of what language they speak. When you sign up, you record a voice bio and Pidgyn instantly clones your voice. Other users can browse your profile and tap "Hear their voice in English" (or whatever their language is) to hear your bio translated and spoken aloud in your cloned voice. When two people match, they chat with real-time message translation. Voice messages go through the full pipeline: your speech is transcribed, translated, and re-spoken in your cloned voice in the other person's language. The result: it sounds like you're fluently speaking a language you don't know. How it uses Cloudflare Workers handle all API routing, speech-to-text processing, and orchestration between Cloudflare AI and ElevenLabs. Durable Objects power two critical pieces of stateful infrastructure: UserDirectory — a single global DO that manages all user profiles, interest tracking, mutual matching, and smart profile discovery (sorted by language diversity, voice bio presence, and clone status). ChatRoom — per-match DOs that manage WebSocket connections, message persistence, and the full voice message pipeline (STT, translation, TTS) within the DO itself. Workers AI runs two models on the edge: @cf/openai/whisper for speech-to-text (voice bio transcription and voice message transcription) @cf/meta/llama-3.1-8b-instruct for translation between 15 languages, with @cf/meta/m2m100-1.2b as fallback How it uses ElevenLabs Instant Voice Cloning — When a user records their voice bio, the browser converts the WebM recording to WAV via an AudioContext-based transcoder, then sends it to ElevenLabs' IVC API. The cloned voice ID is stored on their profile and used for all TTS output. Text-to-Speech (Flash v2.5) — Every "Hear in [language]" button and every voice message in chat uses ElevenLabs TTS with the speaker's cloned voice ID, so the output sounds like them speaking the target language. Why dating? Every translation demo uses the same example: a chatroom. But chatrooms don't have stakes. Dating does. You're hearing someone's voice for the first time, deciding if you're interested, starting a conversation. The emotional weight makes the technology feel real. And the viral angle writes itself: "I went on a date with someone who doesn't speak my language." Tech Stack Cloudflare Workers (routing, orchestration) Cloudflare Durable Objects (UserDirectory, ChatRoom) Cloudflare Workers AI (Whisper STT, Llama 3.1 8b translation) ElevenLabs Instant Voice Cloning ElevenLabs Flash v2.5 TTS Single-file HTML frontend served via Cloudflare Pages Links Live demo: https://app.pidgyn.workers.dev GitHub: https://github.com/reddxmanager/pidgyn
