
Abierto
Publicado
•
Termina en 3 días
Pagado a la entrega
Our client runs a call center with Nigerian agents calling US businesses. They need a Python script that makes the agents' accent sound more neutral and clear to American ears in real time during live phone calls. The hard part is already done. There is a fully built Node.js server that handles the live call audio. We just need you to write one Python script that plugs into it. What the script needs to do: Receive live audio from the server, process it through a voice conversion model to soften the Nigerian accent, and send the processed audio back. The whole process must happen in under 150ms so the conversation feels natural with no delay. Tech details: Audio comes in as mulaw 8kHz chunks. You can resample internally for better model performance then convert back before sending out. RVC is the preferred model but we are open to alternatives that meet the latency requirement. Script runs on a RunPod RTX A5000 GPU. Before we hire: You must send a before and after audio demo showing accent softening on any Nigerian English voice sample. You do not need our server or infrastructure to do this. Just run a Nigerian English audio sample through your model and send both files. This is how we evaluate all candidates. Deliverable: One working Python script tested on GPU with latency confirmed under 150ms. Timeline: 2 to 3 days after demo is approved. Please include your demo files and your fixed price in your proposal.
ID del proyecto: 40359128
106 propuestas
Abierto para recibir ofertas
Proyecto remoto
Activo hace 2 días
Fija tu plazo y presupuesto
Cobra por tu trabajo
Describe tu propuesta
Es gratis registrarse y presentar ofertas en los trabajos
106 freelancers están ofertando un promedio de $172 USD por este trabajo

⭐⭐⭐⭐⭐ Create a Python Script to Neutralize Nigerian Accent in Real-Time Calls ❇️ Hi My Friend, I hope you are doing well. I've reviewed your project requirements and noticed you're looking for a Python script to enhance the clarity of Nigerian accents during live calls. Look no further; Zohaib is here to assist you! My team has successfully completed 50+ similar projects for audio processing. I will build a Python script that integrates with your existing Node.js server, ensuring the audio processing occurs under 150ms for a seamless conversation. ➡️ Why Me? I bring 5 years of solid experience in audio processing and Python development, specializing in real-time applications. My expertise includes voice conversion, audio analysis, and low-latency processing. Additionally, I have a strong grip on GPU programming and audio modeling techniques, ensuring a high-quality solution for your project. ➡️ Let's have a quick chat to discuss your project in detail and I can show you samples of my previous work. Looking forward to discussing this with you in chat. ➡️ Skills & Experience: ✅ Python Programming ✅ Audio Processing ✅ Real-Time Systems ✅ Voice Conversion ✅ Low-Latency Solutions ✅ GPU Programming ✅ Audio Analysis ✅ Node.js Integration ✅ Signal Processing ✅ Model Optimization ✅ RVC Model Familiarity ✅ Sample Rate Conversion Waiting for your response! Best Regards, Zohaib
$150 USD en 2 días
7,9
7,9

Hi there, I will deliver a single GPU-tested Python audio-processing script that plugs into your existing Node.js live-audio pipeline, accepts mulaw 8kHz chunks, runs a low-latency RVC-style voice conversion to soften Nigerian accents, and returns processed audio within a 150ms budget , tested on RunPod RTX A5000. - Deliverable: Python script (WebSocket/UDP client) that receives mulaw 8kHz, resamples, runs the model on GPU, converts back to mulaw 8kHz, and streams output. - Deliverable: Before/after demo files (Nigerian English sample) and a latency report proving <150ms round-trip on RTX A5000. - Deliverable: Integration notes, Dockerfile, and a short rollback/staged-deploy plan with unit tests and CI smoke checks to validate audio quality and minimal downtime. Skills: ✅ mulaw 8kHz audio ✅ RVC / voice conversion model ✅ real-time streaming (WebSocket/UDP) integration ✅ GPU deployment on RunPod / Docker ✅ latency tuning, profiling and audio quality validation Certificates: ✅ Microsoft® Certified: MCSA | MCSE | MCT ✅ cPanel® & WHM Certified CWSA-2 I am available to start immediately; Do you have a preferred RVC checkpoint or may I select a public checkpoint and provide the before/after demo using that model? Also, should the script integrate over WebSocket or UDP with your Node.js server? Price: $180 , delivery: 1 day after demo approval. Best regards,
$180 USD en 1 día
6,7
6,7

Hi, I'm thrilled to bid for your project requiring Python - an area in which I have substantial skills. In the past, I've worked on projects similar to your call center scenario, leveraging my knowledge in AI and automation. I understand you're looking to develop a script for voice accent softening which is indeed an intriguing task, and I'm thoroughly interested. While focusing on making the solution robust and reliable, I assure you my code will be clean, tested, and production-ready. Feel free to share more details or any questions so that we can think over the project scope. Looking forward to a fruitful discussion.
$190 USD en 10 días
6,5
6,5

I have carefully reviewed your requirement for a real-time Nigerian accent softening script and have strong experience in Python-based audio processing, GPU optimization, and voice conversion models. I can deliver a Python script that plugs into your Node.js server, receives live mulaw 8kHz audio, processes it through an RVC or similar low-latency model, and returns the softened audio with under 150ms latency on your RTX A5000 GPU. Approach: Resample incoming audio for optimal model performance. Process via a fine-tuned voice conversion model. Convert back to mulaw 8kHz and send output. Measure and ensure end-to-end latency <150ms. I will provide a before-and-after audio demo using a Nigerian English voice sample for your evaluation before starting full integration. WE WILL FOLLOW AGILE METHODOLOGY, PROVIDE 2 YEARS OF FREE ONGOING SUPPORT AND COMPLETE SOURCE CODE, AND ASSIST FROM DEMO TO FINAL DEPLOYMENT. I eagerly await your positive response. Thanks
$1.000 USD en 7 días
6,2
6,2

With my advanced Python skills and extensive experience in developing and implementing Machine Learning algorithms, I am confident that I'm the right person for this project. My team has worked on numerous projects involving A.I and Machine Learning, for instance, voice recognition, natural language processing, and sentiment analysis, showcasing our ability to seamlessly handle your audio processing needs. We will ensure that your agent's Nigerian accent sounds more neutral and agreeable without any noticeable delay through the use of an efficient neural network model. Our proficiency with Python and a deep understanding of frameworks such as Django and Flask will enable us to write a robust script that effectively plugs into your existing Node.js server. It is also worth mentioning that we are familiar with speech data handling using libraries like spacy, speech-recognition, and wavfile, which perfectly aligns with the coding requirements of this project.
$300 USD en 7 días
6,3
6,3

Hi there, I’ve carefully reviewed your project and am excited about the opportunity to work with you. With over 6 years of experience in Python based audio processing and real time speech conversion, I specialize in delivering ultra low latency voice transformation pipelines. I am confident I can build the accent softening module you need for seamless US business calls. Here’s my approach: Build an optimized Python pipeline that receives mulaw 8kHz audio and resamples for clearer model inference Set up a GPU accelerated RVC or alternative model tuned specifically for Nigerian to neutral accent softening then return clean audio under 150ms I am available to start immediately and aim to deliver the full project within 3 days after demo approval. Additional instructions or notes optional: I will test latency directly on comparable hardware I will provide clean before and after demo files Best regards, Jushua
$155 USD en 6 días
5,5
5,5

I have thoroughly understood your requirements for the Python script to soften Nigerian accents in real-time during live phone calls. The script will need to receive live audio, process it through a voice conversion model, and send the processed audio back with a latency of under 150ms. I am confident in my ability to deliver the working Python script tested on GPU within 2 to 3 days after the demo is approved. I am eager to showcase my skills by providing the before and after audio demo as per your request. Looking forward to discussing the project scope and adjusting the budget accordingly. Please go through my profile, which showcases 15 years of experience. Your satisfaction is my top priority. Let's discuss the job details and get started right away.
$158 USD en 5 días
5,8
5,8

⭐Hi, I’m ready to assist you right away!⭐ I believe I’d be a great fit for your project since I have extensive experience in Python development, particularly in audio engineering and deep learning models. With a keen eye for detail and a track record of successful projects, I am confident in delivering exceptional results for accent softening in real-time audio streams. Utilizing my expertise in Linux, NLP, and ML algorithms, I will ensure that the processed audio maintains a natural flow with minimal latency. Solving the challenge of making Nigerian accents clearer for American ears is crucial in enhancing communication quality for your call center. My proficiency in audio processing, speech synthesis, and signal processing will guarantee a seamless integration of the Python script with your existing Node.js server. If you have any questions, would like to discuss the project in more detail, or would like to know how I can help, we can schedule a meeting. Thank you. Maxim
$150 USD en 5 días
5,4
5,4

I can help you. The primary technical hurdle is the 150ms latency limit while starting with a low-fidelity 8kHz source. Converting 8kHz mulaw to the 40kHz/48kHz required by RVC models usually introduces significant "phase clicking" or metallic artifacts during real-time chunking. I will implement a lightweight RVC inference engine optimized with TensorRT or TorchScript to keep the GPU processing time under 50ms, leaving ample headroom for network overhead. To solve the hidden problem of robotic audio quality from low-bitrate input, I’ll apply a specific overlap-add (OLA) windowing technique to the chunks. This ensures seamless transitions between audio packets and prevents the jitter typically seen in high-speed voice conversion on RunPod environments.
$140 USD en 7 días
5,2
5,2

With my diverse skill set, I'm uniquely positioned to tackle the accent softening requirements of your Python project. My background in Data Science and Machine Learning encompasses extensive experience with Python, the key programming language for this task. I have hands-on experience in audio engineering and signal processing that will be paramount in creating an efficient voice conversion model to meet your needs. My particular expertise is in machine learning and deep learning solutions using various models including RVC, which you mentioned as your preferred model. However, flexibility is also important, and I won't hesitate to explore alternative models that guarantee low latency and high-quality output—ultimately aiming for a smoother conversation flow between Nigerian agents and US businesses. To supplement my technical proficiency, my professional experience includes delivering projects within strict timelines across industries such as healthcare and fintech. This demonstrates not only my ability to produce results but to do so efficiently. As an AWS-certified expert, I appreciate the significance of optimized systems that ensure minimal delay—an aspect I'll bring into your project for maximum efficacy. With all this combined, I am confident that, given the chance to prove myself through the demo teste
$220 USD en 3 días
4,6
4,6

Hello,\n\nI can develop a Python script to process live audio for accent neutralization in real-time. My approach involves integrating a voice conversion model that meets your latency requirements. I will ensure the script runs efficiently on the provided GPU. Could you share a sample Nigerian English voice recording for the demo? This will help me refine the accent softening for your specific needs. Regards, Muhammad Azeem
$150 USD en 3 días
4,8
4,8

With my extensive background in Linux and Python, I am confident that I can deliver the innovative solution you need for your call center's accent softening project. Additionally, I have ample experience with audio processing, including the use of voice conversion models, which will be crucial for this particular task. As a problem solver, I'm always attentive to the specific needs of my clients and adept at finding bespoke solutions tailored to their unique challenges. For your project, using RunPod RTX A5000 GPU to efficiently process mulaw 8kHz audio chunks is part of my regular toolkit. To ensure I can meet your anticipated timeline of 2 to 3 days, I will dedicate myself fully to deliver a tested and optimized python script that performs flawlessly within your previously stipulated latency limit. If you choose me, you are choosing expertise which can provide you access to efficient code delivery without compromising quality or timeliness. Let me help transform your call center's function into one that caters even better to your US-based clientele.
$140 USD en 7 días
4,7
4,7

As a seasoned Full-Stack Developer with over 7 years of experience in developing and automating applications, I'm confident of being the perfect solution for your Python related needs, including the task you have described. My expertise encompasses various aspects of backend development, including Python (Django, FastAPI), Node.js, and REST & GraphQL APIs that provide a solid foundation for handling live audio through your server. Moreover, my command over Docker and Kubernetes can further optimize the efficiency of the RunPod RTX A5000 GPU making accent transformation within 150ms plausible. My depth with real-time audio processing and handling is another asset I bring to the table. With your specific inclination towards using RVC voice conversion model for accent neutralization, I'm well-versed with it and also aware of alternative models that can be equally efficient while maintaining low latency. To back up my claims, I am keen on performing a demo of transformaing a Nigerian English accent audio snippet for you – a priority criterion for your decision-making process.
$150 USD en 5 días
4,6
4,6

Hello, My name is SunZhen, and I have hands-on experience building real-time audio processing pipelines and integrating AI models into low-latency production systems. Your setup is clear, and I’m confident I can deliver a Python module that cleanly plugs into your existing Node.js audio server. For this task, I’ll implement a GPU-optimized pipeline using RVC (Retrieval-based Voice Conversion) with streaming-friendly buffering. The script will: • Receive mulaw 8kHz audio chunks • Convert → resample (16k/32k) for model processing • Apply accent softening via a tuned voice conversion model • Stream processed audio back in mulaw 8kHz format • Maintain sub-150ms latency using chunk-based inference, async I/O, and CUDA optimization I will focus heavily on minimizing latency (model warmup, chunk size tuning, and efficient resampling) to ensure natural conversation flow without noticeable delay. For the demo, I will provide: • Original Nigerian English audio sample • Processed version with accent neutralization • Measured latency benchmarks Timeline: 2–3 days after demo approval Fixed Price: $200 I can start immediately and deliver a clean, production-ready Python script with clear integration points for your Node.js server. Best regards, SunZhen
$100 USD en 2 días
4,3
4,3

Greetings! I am excited to propose my expertise for your project requiring a Python script to enhance the accents of Nigerian agents during live phone calls in your call center. With the Node.js server infrastructure already in place, my focus will be on seamlessly integrating the voice conversion model to deliver real-time, neutral-sounding audio to American ears. I am confident in my abilities to meet the stringent latency requirements and ensure a smooth and natural conversation experience for your agents and clients. I look forward to the opportunity to showcase my skills and provide a demo of the accent softening capabilities before proceeding further. Regards, Yogesh Kumar
$100 USD en 7 días
4,4
4,4

Hi there, I’ve read your Nigerian-English accent softening need for real-time call center audio and I’m confident we can achieve sub-150ms latency with a lightweight Python module that plugs into your existing Node.js server. I specialize in real-time audio processing, ML-driven voice conversion, and GPU-accelerated inference, including RVC-style models and efficient resampling pipelines. I’ll design a single Python script that receives mulaw-encoded 8 kHz packets, resamples internally for model performance, runs the voice-conversion pass on a RTX A5000-enabled RunPod, and streams back the transformed audio with minimal padding. Plan: - Input: live mulaw 8kHz chunks, decode to PCM, optional up/down-sample to a target rate for the model. - Inference: run a lightweight voice-conversion model (RVC or equivalent) tuned for Nigerian English neutrality with the strict latency bound. Use batch-free streaming so latency remains near real-time. - Output: re-encode to mulaw 8kHz and forward to the Node.js server with synchronized timestamps to avoid buffer jitter. - Validation: measure end-to-end latency on GPU, target under 150ms, with robust error handling and fallbacks. Deliverables: - One working Python script tested on GPU with confirmed latency <= 150ms and clear logging. - A demo kit: before/after audio sample processing results for Nigerian English voice (you can provide your sample or I can run a provided one). I’ll include a short README with test commands and how to run in
$75 USD en 3 días
4,3
4,3

Hey, real-time voice processing specialist here. I've created demo before/after audio samples and the actual Python script that plugs into your Node.js server. The script handles mulaw 8kHz decode, resamples to 16kHz for RVC processing, neutralizes the accent, and converts back, all under 150ms on the A5000. I have the zip ready, please DM and see the samples and let me know if you want to move forward.
$120 USD en 7 días
4,1
4,1

Hi there, Strong alignment with this project comes from experience delivering real-time audio processing and AI-based voice transformation systems where low latency and clarity were critical. Clear understanding of your requirement to process live mulaw 8kHz audio, apply accent softening using RVC or similar models, and return optimized audio within strict sub-150ms latency. Hands-on expertise with Python, PyTorch, real-time audio streaming, and GPU optimization on RunPod ensures efficient processing, resampling, and seamless Node.js integration. Risk stays controlled through latency benchmarking, optimized inference pipelines, and consistent audio quality validation across streams. Available to start immediately happy to share a demo with before/after samples and discuss next steps. Recent work: https://www.freelancer.com/u/chiragardeshna Regards Chirag
$140 USD en 7 días
4,4
4,4

Hi, I see you already have a Node.js pipeline built and just need a Python layer that performs real‑time accent softening on mulaw 8 kHz audio while hitting sub‑150ms latency. That tells me the integration points and timing constraints are the real focus here. I’ve implemented similar low‑latency voice‑conversion flows before, including an RVC-based transformer that delivered consistent <110ms round‑trip processing on an A5000 after optimizing resampling and GPU batching. A key risk here is avoiding latency spikes caused by resampling and model warm‑up; most juniors miss this. I typically preload weights on GPU, pin memory, and run a streaming inference path so the pipeline stays stable across long calls. I’ll build a Python script that receives each mulaw chunk, decodes and resamples it, runs the chosen RVC model with streaming inference, then re‑encodes to mulaw before sending it back. I’ll also benchmark each stage to ensure the 150ms ceiling is never crossed. Before final delivery, I can prepare the required before/after demo using a Nigerian English sample and provide timing logs. Thanks, John allen.
$155 USD en 1 día
4,1
4,1

⭐⭐⭐⭐⭐ ✅Hi there, hope you are doing well! I have worked on similar real-time audio processing projects where Python scripts integrated with existing systems to modify voice characteristics smoothly and quickly. The key to successfully completing your project is optimizing latency to keep audio delay under 150ms while maintaining natural-sounding accent softening. Approach: ⭕ I will develop a Python script that receives mulaw 8kHz audio chunks from your Node.js server, resamples for model input, applies an efficient voice conversion model like RVC or an alternative with proven low latency, then converts and sends back the processed audio maintaining under 150ms round-trip delay. ❓ Can you provide a sample Nigerian English voice clip to test my demo model? I am confident my experience with real-time audio pipelines and GPU-accelerated Python voice models will deliver a seamless accent softening solution for your call center clients. Looking forward to working with you. Best regards, Nam
$200 USD en 3 días
3,8
3,8

Houston, United States
Forma de pago verificada
Miembro desde ene 31, 2010
$30-250 USD
$30-250 USD
$30-250 USD
$250-750 USD
$30-250 USD
$30-250 USD
₹3000-8000 INR
$30-250 USD
$42 USD
$250-750 USD
$10-30 USD
$30-250 USD
€30-250 EUR
$42 USD
$30-250 AUD
$250-750 USD
$1500-3000 USD
$30-250 USD
$30-250 USD
$10-30 USD
£25 GBP
₹750-1250 INR /hora
$10-30 USD
$30-40 USD
$250-750 AUD