AI/ML Engineer

All India 3 to 7 Yrs 1 month ago

Job Description

Designing, developing, and fine-tuning deep learning models for voice synthesis such as TTS and voice cloning.
Implementing and optimizing neural network architectures like Tacotron, FastSpeech, WaveNet, or similar.
Collecting, preprocessing, and augmenting speech datasets.
Collaborating with product and engineering teams to integrate voice models into production systems.
Performing evaluation and quality assurance of voice model outputs.
Researching and staying current on advancements in speech processing, audio generation, and machine learning.

Bachelors or Masters degree in Computer Science, Electrical Engineering, or related field.
Strong experience with Python and machine learning libraries like PyTorch, TensorFlow.
Hands-on experience with speech/audio processing and relevant toolkits such as Librosa, ESPnet, Kaldi.
Familiarity with voice model architectures like TTS, ASR, vocoders.
Understanding of deep learning concepts and model training processes.

Experience with deploying models to real-time applications or mobile devices.
Knowledge of data labeling, voice dataset creation, and noise handling techniques.
Experience with cloud-based AI/ML infrastructure such as AWS, GCP.
Contributions to open-source projects or published papers in speech/voice-related domains. As an AI/ML Engineer at a US Based IT MNC working on Enterprise class voice solutions for a reputed client, your role involves developing, training, and refining AI models for voice synthesis, voice cloning, speech recognition, and/or voice transformation. You will be responsible for:
Designing, developing, and fine-tuning deep learning models for voice synthesis such as TTS and voice cloning.
Implementing and optimizing neural network architectures like Tacotron, FastSpeech, WaveNet, or similar.
Collecting, preprocessing, and augmenting speech datasets.
Collaborating with product and engineering teams to integrate voice models into production systems.
Performing evaluation and quality assurance of voice model outputs.
Researching and staying current on advancements in speech processing, audio generation, and machine learning.

Bachelors or Masters degree in Computer Science, Electrical Engineering, or related field.
Strong experience with Python and machine learning libraries like PyTorch, TensorFlow.
Hands-on experience with speech/audio processing and relevant toolkits such as Librosa, ESPnet, Kaldi.
Familiarity with voice model architectures like TTS, ASR, vocoders.
Understanding of deep learning concepts and model training processes.

Experience with deploying models to real-time applications or mobile devices.
Knowledge of data labeling, voice dataset creation, and noise handling techniques.
Experience with cloud-based AI/ML infrastructure such as AWS, GCP.
Contributions to open-source projects or published papers in speech/voice-related domains.