Sr AWS Developer (HPC)

  • Dallas, TX
  • Posted 11 hours ago | Updated 10 hours ago

Overview

Hybrid
Depends on Experience
Accepts corp to corp applications
Contract - W2
Contract - Independent

Job Details

We are seeking a highly skilled and experienced Sr AWS Developer specializing in High-Performance Computing (HPC) to join our team. This role involves designing and implementing advanced computing solutions on AWS EC2 to support R&D efforts in GPU-accelerated and data-heavy applications. The ideal candidate will have a strong background in cloud computing, particularly AWS services, and a proven track record in deploying scalable and efficient HPC workloads. This position offers a hybrid schedule, with time split between the office and remote work.

Responsibilities

  • Design and deploy HPC workloads on AWS EC2, focusing on GPU-accelerated instances like P4d and G5, as well as high-throughput CPU instances.
  • Optimize the performance, networking (including EFA and placement groups), and cost-efficiency of HPC clusters.
  • Develop and maintain infrastructure as code using Terraform to ensure reproducibility and scalability of cloud resources.
  • Collaborate with R&D teams to prototype and test new solutions, ensuring they meet research needs and performance criteria.
  • Provide expert technical input on AWS architecture decisions, helping to guide the strategic direction of cloud infrastructure.
  • Conduct performance tuning and troubleshooting of HPC environments to maximize efficiency and minimize costs.

Qualifications

  • 3–5+ years of hands-on experience with AWS, specifically with EC2 and related services.
  • Demonstrated expertise in High-Performance Computing (HPC) environments and GPU-based instances.
  • Strong proficiency in Linux environments and automation scripting using Python and Bash.
  • Experience with AWS Batch or similar distributed compute orchestration tools.
  • Background in scientific computing, machine learning, or simulation workloads is preferred.
  • Understanding of network performance tuning, especially Elastic Fabric Adapter (EFA) and related AWS networking technologies.
  • Proven ability to develop infrastructure as code, preferably using Terraform.
  • Excellent problem-solving skills and ability to work collaboratively in a dynamic R&D setting.
  • Strong communication and documentation skills, capable of effectively articulating technical challenges and solutions to varied audiences.
  • Ability to manage multiple projects simultaneously and meet project deadlines.

Pay Range: $70 - $75 per hour

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About GDH