We are seeking a Senior AWS HPC Engineer to join our team focused on high-performance computing solutions. You will be responsible for deploying and managing Open OnDemand portals integrated with AWS services to optimize HPC workloads. Join us to contribute to cutting-edge cloud and HPC projects and enhance client computing capabilities.
This position offers a remote setup with the flexibility to work from any location in Georgia, whether that is your home, our well-equipped offices in Tbilisi and Batumi, or a coworking space in Kutaisi.
Responsibilities
- Deploy, configure, and customize the Open OnDemand web portal for HPC workloads
- Integrate Open OnDemand with AWS services such as EC2, S3, FSx, ParallelCluster, and IAM
- Collaborate with AWS and HPC teams to ensure seamless user access and workload execution
- Implement best practices for performance, scalability, and cost optimization
- Develop automation scripts and infrastructure as code using Terraform, CloudFormation, or Ansible
- Document architecture, configurations, and operational processes
- Maintain security standards through Identity and Access Management integration
- Monitor HPC environment performance and troubleshoot issues
- Optimize Linux system configurations for HPC workloads
- Coordinate with cross-functional teams to support client requirements
- Evaluate new technologies and recommend improvements for HPC infrastructure
Requirements
- 3+ years of experience managing HPC environments
- Proven hands-on experience deploying and maintaining Open OnDemand
- Strong knowledge of Linux systems administration
- Experience with AWS HPC services and cloud infrastructure
- Proficiency in scripting languages such as Bash, Python, or Ruby
- Familiarity with authentication and user management systems like LDAP, SSO, or Keycloak
- Ability to implement Identity and Access Management (IAM) policies
- Experience developing infrastructure as code (Terraform, CloudFormation, or Ansible)
- Strong problem-solving and communication skills
- English language proficiency at B2 level or higher
Nice to have
- AWS certification such as Solutions Architect, SysOps, or DevOps
- Experience with AWS ParallelCluster or similar HPC orchestration tools
- Knowledge of Slurm or other workload managers
We offer/Benefits
We connect like-minded people
- Delivering innovative solutions to industry leaders, making a global impact
- Enjoyable working environment, whether it is the vibrant office or the comfort of your own home
- Opportunity to work abroad for up to two months per year
- Relocation opportunities within our offices in 55+ countries
- Corporate and social events
We invest in your growth
- Leadership development, career advising, soft skills and well-being programs
- Certifications, including Google Cloud Platform, Azure and AWS
- Unlimited access to LinkedIn Learning and getAbstract
- Free English classes with certified teachers
We cover it all
- Participation in the Employee Stock Purchase Plan
- Monetary bonuses for engaging in the referral program
- Comprehensive medical & family care package
- Five trust days per year (sick leave without a medical certificate)
- Benefits package (sports activities, a variety of stores and services)
EPAM Georgia is a team of innovators united by a passion for technology. Our dynamic and inclusive culture helps us make a positive impact on our communities, clients, and employees. Here you will collaborate with multinational teams, contribute to cutting-edge projects, deliver creative solutions, and have the opportunity to learn. Our people are at the heart of our success, and we are proud to give them solid ground on which to develop and grow.
Why Choose Us
- Best Place to Work 2024
- Sitecore's Partner Experience Awards 2024