Job Details
Role: Network Architect
Location: Remote
Experience: 12 Years
This is a hands-on architecture position focused on the development and deployment of ultra-high-speed, resilient, and scalable interconnects for GPU-accelerated data centers and compute clusters. Outstanding problem-solving abilities, a comprehensive understanding of network security protocols and standards, routing, switching, and automation, and a deep understanding of fundamental network theory are also critical to success.
Top Skills:
Core Network Architecture & Design Skills
Data Center fabric/interconnect architecture (HPC, AI, GPU clusters)
InfiniBand, Ultra Ethernet, high-performance networking protocols
Hybrid Data Center network design & scalability
Dark fiber deployments, low-latency routing, global scale interconnects
Key Responsibilities:
Lead the architecture, design, and deployment of global-scale data center interconnects and fabrics for HPC, AI, and GPU computing clusters.
Develop high-performance data center fabric using InfiniBand, Ultra Ethernet and related technologies.
Optimize carrier interconnects, intra- and inter-DC routing, and dark fiber deployments to ensure low latency and high reliability.
Partner with system, OS, GPU, and HPC teams to deliver scalable, highly available networks for extreme-performance workloads.
Implement network monitoring, telemetry, troubleshooting, and continuous performance improvement processes.
Drive technology selection, vendor engagement, and lifecycle management for Data Center hardware and software.
What We're Looking For:
Minimum 6-8 years of experience in building, managing, and supporting large-scale hybrid Data Center networks.
Experience developing automation pipelines with Python, Ruby, Go, or other languages used in infrastructure automation.
Subject-matter expertise (SME) in networking technologies: InfiniBand, TCP/UDP, IPv4/IPv6, BGP/MP-BGP, VPN, L2 switching, EVPN, VXLAN, Segment Routing, MPLS, IS-IS, DWDM, etc.
Experience automating SDN/NFV/NFVI infrastructure
Experience using an automated configuration management system (Ansible, Salt, etc.)