Linkedin Logo

AI/ML Engineer

Linkedin

All India 3 to 7 Yrs 1 month ago

Job Description

As an AI/ML Engineer at a US Based IT MNC working on Enterprise class voice solutions for a reputed client, your role involves developing, training, and refining AI models for voice synthesis, voice cloning, speech recognition, and/or voice transformation. You will be responsible for:

  • Designing, developing, and fine-tuning deep learning models for voice synthesis such as TTS and voice cloning.
  • Implementing and optimizing neural network architectures like Tacotron, FastSpeech, WaveNet, or similar.
  • Collecting, preprocessing, and augmenting speech datasets.
  • Collaborating with product and engineering teams to integrate voice models into production systems.
  • Performing evaluation and quality assurance of voice model outputs.
  • Researching and staying current on advancements in speech processing, audio generation, and machine learning.

Qualifications required for this role include:

  • Bachelors or Masters degree in Computer Science, Electrical Engineering, or related field.
  • Strong experience with Python and machine learning libraries like PyTorch, TensorFlow.
  • Hands-on experience with speech/audio processing and relevant toolkits such as Librosa, ESPnet, Kaldi.
  • Familiarity with voice model architectures like TTS, ASR, vocoders.
  • Understanding of deep learning concepts and model training processes.

Preferred qualifications for this position include:

  • Experience with deploying models to real-time applications or mobile devices.
  • Knowledge of data labeling, voice dataset creation, and noise handling techniques.
  • Experience with cloud-based AI/ML infrastructure such as AWS, GCP.
  • Contributions to open-source projects or published papers in speech/voice-related domains. As an AI/ML Engineer at a US Based IT MNC working on Enterprise class voice solutions for a reputed client, your role involves developing, training, and refining AI models for voice synthesis, voice cloning, speech recognition, and/or voice transformation. You will be responsible for:
  • Designing, developing, and fine-tuning deep learning models for voice synthesis such as TTS and voice cloning.
  • Implementing and optimizing neural network architectures like Tacotron, FastSpeech, WaveNet, or similar.
  • Collecting, preprocessing, and augmenting speech datasets.
  • Collaborating with product and engineering teams to integrate voice models into production systems.
  • Performing evaluation and quality assurance of voice model outputs.
  • Researching and staying current on advancements in speech processing, audio generation, and machine learning.

Qualifications required for this role include:

  • Bachelors or Masters degree in Computer Science, Electrical Engineering, or related field.
  • Strong experience with Python and machine learning libraries like PyTorch, TensorFlow.
  • Hands-on experience with speech/audio processing and relevant toolkits such as Librosa, ESPnet, Kaldi.
  • Familiarity with voice model architectures like TTS, ASR, vocoders.
  • Understanding of deep learning concepts and model training processes.

Preferred qualifications for this position include:

  • Experience with deploying models to real-time applications or mobile devices.
  • Knowledge of data labeling, voice dataset creation, and noise handling techniques.
  • Experience with cloud-based AI/ML infrastructure such as AWS, GCP.
  • Contributions to open-source projects or published papers in speech/voice-related domains.

Posted on: April 7, 2026