Job Details
Title: Network HPC Engineer
Location: Ashburn, VA (onsite 5 days a week)
Duration: 12 months
Job Description:
The client has adjusted this role to focus on 60% networking skills, 30% Linux/CI/CD, and 10% HPC.
Key Responsibilities:
o Design and deploy high-throughput, low-latency network systems to support a global infrastructure footprint.
o Troubleshoot and optimize Linux-based networking stacks across thousands of nodes.
o Tune Linux kernel networking parameters for performance (IRQ affinity, socket buffers, MTU, etc.).
o Automate network provisioning, monitoring, and diagnostics using Python, Ansible, Terraform, or equivalent tools.
o Implement and manage L2/L3 network topologies, including EVPN-VXLAN, BGP, OSPF, and static routing.
o Support and troubleshoot InfiniBand, RoCEv2, SR-IOV, or SmartNIC-based deployments.
o Analyze performance metrics to identify and resolve packet loss, congestion, jitter, and other network anomalies.
o Integrate telemetry and logging solutions using tools such as Prometheus, Grafana, and sFlow/NetFlow.
o Collaborate with security and platform teams to implement network segmentation, ACLs, and policy enforcement.
o Participate in design reviews, capacity planning, and incident response for critical infrastructure.
Minimum Qualifications:
o Bachelor's or Master's degree in Computer Science, Electrical Engineering, or equivalent experience.
o 5+ years of experience as a Network Engineer working in Linux-dominant environments.
o Strong understanding of the TCP/IP stack, multicast, DNS, DHCP, NAT, and QoS.
o Hands-on experience with network configuration on Linux systems (e.g., Netplan, systemd-networkd, NetworkManager).
o Proven skills in scripting and automation (e.g., Python, Bash, Git).
o Experience deploying and managing enterprise-grade switches, routers, and NICs (e.g., Arista, Juniper, Mellanox, Broadcom).
o Ability to troubleshoot across the physical, data link, and network layers using tools such as tcpdump, iperf, ethtool, and nmap.
Preferred Qualifications:
o Experience in hyperscale infrastructure, supporting AI/ML or cloud-scale services.
o Familiarity with internal tools or similar platforms at scale.
o Understanding of network security principles, including firewalls, VPNs, microsegmentation, and secure provisioning.
o Experience with zero-touch provisioning, out-of-band management, and network bootstrap pipelines.
Best Regards,
Rajesh (Ken)
Cymansys Solutions LLC
Email:
Tel.