High-Performance Network Engineer (InfiniBand)
Aptly Technology
All India, Delhi • 2 months ago
Experience: 5 to 9 Yrs
Job Description
As an InfiniBand Engineer, your role involves designing, deploying, and supporting high-performance, low-latency network infrastructures in data center and HPC environments. Your responsibilities include:
- Designing, implementing, and managing large-scale InfiniBand (IB) fabrics
- Configuring and troubleshooting InfiniBand switches and adapters (e.g., Mellanox / NVIDIA IB platforms)
- Performing fabric bring-up, subnet management, partitioning, and performance tuning
- Monitoring and optimizing network performance, latency, throughput, and congestion control
- Integrating InfiniBand with Ethernet-based networking environments
- Supporting RDMA technologies (RoCE, iWARP) and GPUDirect environments
- Collaborating with system, storage, and compute teams to support AI/ML and distributed workloads
- Performing firmware upgrades, patching, and capacity planning
- Troubleshooting Layer 2 / Layer 3 networking issues
- Maintaining documentation, network diagrams, and SOPs
To qualify for this role, you should have:
- 5+ years of networking experience with solid fundamentals in TCP/IP, routing, and switching
- Hands-on experience with InfiniBand technologies (HDR/NDR preferred)
- Experience with NVIDIA / Mellanox Technologies switches and adapters
- Solid understanding of RDMA, congestion control, QoS, and low-latency tuning
- Experience with subnet managers (OpenSM) and fabric diagnostic tools
- Solid understanding of BGP, OSPF, EVPN-VXLAN, MPLS
- Experience in HPC, AI/ML cluster networking environments
- Familiarity with Linux networking and troubleshooting tools
- Experience with automation (Python, Ansible) is a plus
Preferred qualifications include:
- Experience supporting large GPU clusters
- Knowledge of storage networking (NVMe-oF, parallel file systems)
- Experience with monitoring tools and telemetry systems
- Networking certifications (CCNP/CCIE or equivalent)
Key competencies for this role include:
- Strong analytical and troubleshooting skills
- Ability to work in high-performance, mission-critical environments
- Excellent documentation and communication skills
- Proactive problem-solving mindset
Skills Required
advanced networking technologies
HPC
TCP/IP
routing
switching
RDMA
QoS
BGP
OSPF
MPLS
Ansible
monitoring tools
CCNP
CCIE
InfiniBand Engineer
InfiniBand fabrics
data center networking
AI/ML clusters
InfiniBand (IB) fabrics
Mellanox / NVIDIA IB platforms
RDMA technologies
RoCE
iWARP
GPUDirect environments
HDR/NDR
NVIDIA / Mellanox Technologies switches
adapters
congestion control
low-latency tuning
subnet managers (OpenSM)
fabric diagnostic tools
EVPN-VXLAN
AI/ML cluster networking environments
Linux networking
automation (Python)
large GPU clusters
storage networking
NVMe-oF
parallel file systems
telemetry systems
Posted on: March 11, 2026