AI Platform Engineer
PROCIGMA BUSINESS SOLUTIONS PRIVATE LIMITED
All India, Gurugram • 2 months ago
Experience: 4 to 8 Yrs
PREMIUM
Deal of the Day
--:--:--
7 Days Free Trial
Upgrade to CVX24 Premium
- Free Resume Writing
-
Get a Verified Blue tick
- See who viewed your profile
- Unlimited chat with recruiters
- Rank higher in recruiter searches
- Get up to 10× more recruiter visibility
- Auto-forward profile to 10 top recruiters
- Receive verified recruiter messages directly
- Unlock hidden jobs, not visible to free users
$0
Activate
$0
A small token amount will be charged to verify.
Get Refund in 48 Hours.
After free-trial 6 Months subscription will be auto Activated @ $
1
(Cancel Anytime).
Free Earplugs Delivery Only after Payment of Rs. 99 for Five Consecutive Months.
Enter Your Details
Job Description
As an MLOps Engineer, you will be responsible for the following key responsibilities:
- Designing and implementing MLOps pipelines for training, validation, deployment, and monitoring of machine learning models.
- Developing and maintaining infrastructure for data versioning, model registries, and experiment tracking using tools like MLflow, LakeFS, and Airflow.
- Integrating orchestration tools such as Kubeflow, Ray, and Airflow to support automated workflows and distributed training.
- Collaborating with data scientists and software engineers to ensure seamless model handoff and deployment.
- Building APIs and SDKs to abstract infrastructure complexity and enable self-service model development.
- Implementing monitoring and alerting systems for model drift, performance degradation, and system health.
- Supporting on-prem and cloud-based deployments using technologies like Kubernetes, HPC clusters, and AWS.
Qualifications required for this role include:
- Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
- 3+ years of experience in software development, preferably in AI/ML infrastructure or data platforms.
- Proficiency in Python and/or TypeScript/JavaScript.
- Experience with backend frameworks such as FastAPI, Flask, and frontend libraries like React and Vue.
- Familiarity with cloud services (preferably AWS), containerization (Docker), and orchestration (Kubernetes).
- Strong understanding of RESTful APIs, CI/CD pipelines, and Git-based workflows.
Preferred qualifications for the role may include:
- Experience with distributed training frameworks like Ray and Ray Tune.
- Knowledge of model explainability, monitoring, and rollback strategies.
- Exposure to hybrid cloud/on-prem infrastructure and HPC environments.
- Prior work on internal platforms or developer tools. As an MLOps Engineer, you will be responsible for the following key responsibilities:
- Designing and implementing MLOps pipelines for training, validation, deployment, and monitoring of machine learning models.
- Developing and maintaining infrastructure for data versioning, model registries, and experiment tracking using tools like MLflow, LakeFS, and Airflow.
- Integrating orchestration tools such as Kubeflow, Ray, and Airflow to support automated workflows and distributed training.
- Collaborating with data scientists and software engineers to ensure seamless model handoff and deployment.
- Building APIs and SDKs to abstract infrastructure complexity and enable self-service model development.
- Implementing monitoring and alerting systems for model drift, performance degradation, and system health.
- Supporting on-prem and cloud-based deployments using technologies like Kubernetes, HPC clusters, and AWS.
Qualifications required for this role include:
- Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
- 3+ years of experience in software development, preferably in AI/ML infrastructure or data platforms.
- Proficiency in Python and/or TypeScript/JavaScript.
- Experience with backend frameworks such as FastAPI, Flask, and frontend libraries like React and Vue.
- Familiarity with cloud services (preferably AWS), containerization (Docker), and orchestration (Kubernetes).
- Strong understanding of RESTful APIs, CI/CD pipelines, and Git-based workflows.
Preferred qualifications for the role may include:
- Experience with distributed training frameworks like Ray and Ray Tune.
- Knowledge of model explainability, monitoring, and rollback strategies.
- Exposure to hybrid cloud/on-prem infrastructure and HPC environments.
- Prior work on internal platforms or developer tools.
Skills Required
Airflow
APIs
Kubernetes
AWS
Python
JavaScript
Flask
cloud services
containerization
Docker
orchestration
RESTful APIs
hybrid cloud
developer tools
MLOps
machine learning models
data versioning
model registries
experiment tracking
MLflow
LakeFS
orchestration tools
Kubeflow
Ray
SDKs
monitoring systems
alerting systems
TypeScript
backend frameworks
FastAPI
frontend libraries
React
Vue
CICD pipelines
Gitbased workflows
distributed training frameworks
Ray Tune
model explainability
rollback strategies
onprem infrastructure
HPC environments
internal platforms
Posted on: March 11, 2026
Relevant Jobs
Step 2 of 2