IA de Texto a Voz Jobs
Más detalles: ¿Qué funcionalidades específicas necesita la aplicación? Deslizar para emparejar, Mensajería automatizada, Cargar fotos y videos ¿Cómo deberían subir contenido las creadoras? Manual ¿Qué tipo de integración de IA prefieres para la mensajería? Chatbot con capacidad de autoaprendizaje El objetivo es q las creadoras no tengan q escribir todo el dia con 10 o 20 personas para tener ingresos q la ia tiene la conversación genere atracción y envíe fotos y videos de acuerdo a la conversación y pagos
POSITION BRIEF: Discover Henderson – AI Platform & Systems Manager Discover Henderson is building a next generation tourism platform powered by AI. We are seeking a technical operator who can oversee our entire digital ecosystem — including our website, automations, data systems, and our AI concierge, Ava. This role ensures the platform runs smoothly, evolves continuously, and delivers an exceptional experience for visitors and local partners. ⭐ Role Title AI Platform Manager / No Code Systems Integrator (Wix + Voiceflow + + Airtable) ⭐ Role Summary You will manage and optimize the full Discover Henderson platform, including the website, partner onboarding systems, automations, data flows, and Ava — our AI concierge. Your job is to maintain stability, improve funct...
Goal: Create a FULLY AUTOMATED process that takes a male audio file and converts it into a female voice. What you must do: 1) Take the male audio I provide 2) Convert it into a female voice 3) Upload the final audio into a Google Drive folder 4) Add the Google Drive link in your competition entry 5) Explain clearly what software/tools you will use 6) Explain clearly how you will automate the FULL process from start to finish Important: - The automation must run locally - The final voice must sound perfectly natural and human - The female voice must correctly reproduce the multiple emotions, tone and intonations from the original audio - The result must NOT sound robotic or AI-generated - The automation must be able to process multiple audio files - Do NOT clean the original audio more...
Goal: Create a FULLY AUTOMATED process that takes a male audio file and converts it into a female voice. What you must do: 1) Take the male audio I provide 2) Convert it into a female voice 3) Upload the final audio into a Google Drive folder 4) Add the Google Drive link in your competition entry 5) Explain clearly what software/tools you will use 6) Explain clearly how you will automate the FULL process from start to finish Important: - The automation must run locally - The final voice must sound perfectly natural and human - The female voice must correctly reproduce the multiple emotions, tone and intonations from the original audio - The result must NOT sound robotic or AI-generated - The automation must be able to process multiple audio files - Do NOT clean the original audio more...
I need a ten-minute YouTube video built entirely with AI-driven 3D animated characters. The piece must carry a professional, serious tone—think corporate explainer rather than cartoon—while still feeling visually engaging. Precise, frame-accurate lip sync is critical. Whether you connect a pre-recorded voice-over I supply or generate a natural-sounding AI voice yourself, the mouth movements have to match flawlessly throughout the full ten minutes. Please use whichever tools you trust—Unreal Engine’s MetaHuman Animator, Blender with FaceWare, or other reliable AI lip-sync solutions—as long as the final result looks polished and on beat. I will provide the script, branding assets, and any reference footage once we begin. Your deliverables are: • A 1920&t...
Siamo uno studio commercialistico alla ricerca di un consulente freelance specializzato in intelligenza artificiale, con esperienza nell’analisi dei processi aziendali e nella progettazione di soluzioni personalizzate. L’obiettivo è individuare come integrare l’AI nella nostra organizzazione per migliorare efficienza, automazione e qualità del lavoro, nel rispetto di riservatezza, sicurezza dei dati e normative applicabili. Attività richieste: Analisi del contesto organizzativo e dei processi interni dello studio. Individuazione delle aree in cui l’AI può apportare valore concreto. Proposta di soluzioni personalizzate e realistiche per le nostre esigenze. Eventuale automazione di attività ripetitive o documentali. Definizion...
We are looking for a HIGH-LEVEL AI conversation engineer to help polish and optimize a live AI phone ordering system already running on Twilio + n8n + ElevenLabs + OpenAI. IMPORTANT: The backend infrastructure and payment loop are already mostly working. We are NOT looking for someone to rebuild the platform. We specifically need someone strong in: - AI prompting - conversational flow optimization - reducing hesitation/repetition - human-like ordering behavior - interruption handling - “pay now” behavior - upsell timing logic - manager escalation behavior - fallback/recovery logic - bilingual conversation flow (English/Spanish later) - voice AI optimization in ElevenLabs Current flow: Call → AI order → Stripe payment link → payment confirmation → receip...
I’m launching a Shopify-based T-shirt line and need the entire store built around an AI-driven buying experience. Shoppers must be able to: • Pick their nationality so the on-screen model automatically adjusts skin tone, facial features, and accent. • Choose size, color, fabric, and—because we specialize in Tees—our core Casual style (I may add athletic or formal later). The same AI engine will also generate short spoken videos (15–60 sec) for TikTok, Reels and YouTube Shorts. Each clip should rotate through three themes—product descriptions, promotional hooks and authentic-sounding customer reviews—ready for me to post straight from Shopify’s dashboard. Scope of work 1. Configure and brand a new Shopify store, including payment, shipping...
I’m building a real-time speech-to-text application for Tamil and need a full mobile solution that runs smoothly on both Android and iOS. The core requirement is low-latency live transcription that recognises the major dialects of Tamil—Madurai, Kongu, Nellai, Chennai and Sri Lankan variations—so users hear their words appear on-screen almost instantly, regardless of accent. My priority is accuracy and speed, followed by an interface that keeps the mic open, shows streaming text, and lets users copy, save or share the transcript once they stop speaking. If you can add useful extras such as offline mode, punctuation handling, or a light / dark theme switcher, feel free to mention them. When you respond, focus on your relevant experience: the speech-to-text engines you&rs...
Possiedo già un canale YouTube dedicato a video ai attualmente tratto argomenti di spiritualità, ma vorrei pian piano spostarmi su video avatar ai che trattano argomenti di salute e benessere, con un focus specifico su Alimentazione e dieta. Cerco una sola persona che segua l’intero flusso creativo fin dall’inizio: • ricerca e stesura degli script • v.o. • montaggio completo con grafiche, musica royalty-free e sottotitoli • ottimizzazione SEO (titoli, descrizioni, tag, miniature) • pubblicazione programmata e analisi delle performance Mi aspetto puntualità nelle consegne, capacità di lavorare in autonomia e voglia di crescere. Quando il canale inizierà a generare entrate rilevanti, il tuo ruolo evolver&agrav...
I am looking for a freelance developer or team to create a local AI avatar system with real-time voice interaction and facial/lip synchronization for Latin American Spanish. Currently, we already have a basic avatar that can display responses, but it does not speak or animate facial movements naturally. The goal is to build an avatar that can: Speak directly using AI-generated voice (TTS) Synchronize mouth/facial movements with speech Simulate realistic modulation using at least the 5 main vowel mouth shapes (visemes/phonemes) Run locally (offline or local server environment) Allow flexible integration with different AI providers Work primarily in Latin American Spanish Main requirements: • Local execution The system must run locally using CPU/GPU resources. Cloud dependence shou...
I want to bring everyday Hindi-English-Marathi conversations into one streamlined Android app. The idea is simple: I point the camera at a street sign or menu, the app grabs the text with Optical Character Recognition, instantly translates it, and then reads the result back to me through a clear Text-to-Speech engine. The same flow should work when I speak or type a phrase—I receive a fast, accurate translation plus an optional audio playback so I can mimic the correct pronunciation on the spot. Even for communication with auto rickshaw driver, sabzi mandi, hawkers etc. I want to make it as a communication tool Core flow • Capture text with OCR from photos or the live camera view • Translate bi-directionally between Hindi, English and Marathi • Convert transl...
**AI-Powered YouTube Thumbnail & Hook Generator (ChatGPT Image 2.0 & Gemini 3.1 Pro)** ## **Project Description:** I am looking for an expert Full-Stack Developer to build a web application that automates the creation of high-converting YouTube thumbnails and viral "hook" titles. This platform will cater to content creators by turning simple ideas into cinematic, "click-worthy" visuals using a specialized template system. ### **Key Technical Requirements:** * **Image Generation:** Integration with **ChatGPT Image 2.0 (GPT Image 2)** API. The app must support high-resolution (2K) output and **multi-turn editing (inpainting)** to allow users to add or edit elements on the same image. * **Text & Logic:** Integration with **Gemini 3.1 Pro** API to generate...
I have a series of educational modules that need lively, AI-generated voiceovers. The goal is an energetic tone that keeps learners engaged while still sounding clear and professional. Here’s what I need from you: 1. Select or create an AI voice that fits an upbeat classroom style—nothing robotic or monotone. 2. Generate polished audio for each lesson script I supply (roughly 8-10 minutes per lesson, delivered as high-quality WAV or MP3). 3. Tweak pacing, emphasis, and pronunciation so technical terms come through naturally and the material flows. 4. Deliver a separate, neatly labeled file for every lesson plus a master list of any pronunciation rules you had to set. I’m open to whichever tool—ElevenLabs, PlayHT, Amazon Polly, Google Cloud TTS, or anot...
realistic AI voice generation. It allows users to create natural-sounding speech from text in multiple languages and voices. Here is a breakdown of what makes their platform stand out: Key Features Text-to-Speech (TTS): Converts written text into high-quality, human-like audio. It excels at capturing nuance, emotion, and natural pacing, making it sound less robotic than traditional TTS engines. Voice Cloning: Allows you to upload a short sample of a real human voice and create a digital replica. You can then use that cloned voice to read any text. Voice Library: A community-driven library where you can browse and use thousands of pre-made voices suited for different tones, ages, and styles. Speech-to-Speech (STS): Lets you speak into a microphone and transform your audio into another...
Im after someone that is a master at ai Claude etc I have alot of tasks to be done that im to busy to try do myself Website creation, seo, email signature animations, google my business review help and alot more
Artículos recomendados solo para ti
How user testing can make your product great
Get your product into the hands of test users and you'll walk away with valuable insights that could make the difference between success and failure.