Sr. Software Development Engineer

Overview

USD 104,230.00 per year
Full Time

Skills

Embedded Systems
Machine Learning (ML)
Artificial Intelligence
Software Development
Problem Solving
Conflict Resolution
Communication
Continuous Integration and Development
Software Deployment
Cloud Computing
Server Administration
Computer Hardware
MySQL
Database
Dashboard
Quality Assurance
Automated Testing
GitHub
Jenkins
Ansible
Scripting
Python
Bash
DevOps
Orchestration
Docker
Kubernetes
Management
Provisioning
Scalability
Lifecycle Management
GPU
Workflow
Grafana
Git
Version Control
Continuous Integration
Continuous Delivery
System Administration
Linux
Shell Scripting
Collaboration
Agile
Innovation
Software Engineering
Computer Science
Military
Law
Recruiting

Job Details

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

THE ROLE:

The AI/ML Frameworks team is hiring an MTS Software Development Engineer to build and maintain scalable DevOps infrastructure that accelerates AMD's AI software development. You will lead CI/CD pipeline design, manage Kubernetes deployments, and automate systems with Ansible and Python. This role supports GPU environments, integrates monitoring tools, and enables rapid, reliable software delivery across teams.

THE PERSON:

The ideal candidate is a skilled engineer with a strong background in DevOps, site reliability, or infrastructure engineering. They are proficient in Kubernetes, CI/CD tools, scripting (Python/Bash), and infrastructure automation frameworks such as Ansible. Experience working with GPU compute environments and integrating automated test workflows is highly valued. This person thrives in collaborative, fast-paced environments and can drive technical execution with minimal oversight. They bring a problem-solving mindset, strong communication skills, and a passion for building reliable, scalable systems.

KEY RESPONSIBILITIES:
  • CI/CD Pipeline Development: Design, implement, and manage efficient continuous integration and delivery pipelines using Buildkite, GitHub Actions, and Jenkins to enable rapid and reliable software deployment.
  • Kubernetes Infrastructure Management: Deploy and maintain robust Kubernetes-based environments across both on-premise and cloud platforms to support scalable service orchestration.
  • Infrastructure Automation: Automate provisioning, configuration, and management of infrastructure using Ansible, Python, and Bash to improve system consistency and reduce manual intervention.
  • Service Deployment with Helm: Administer application and service deployment in Kubernetes using Helm charts for consistent and repeatable release processes.
  • GPU Server Support: Configure, manage, and maintain GPU-based compute environments including lifecycle automation and hardware-level test integration.
  • Database and Observability Integration: Interact with MySQL databases to support dynamic data updates and integrate data sources into Grafana dashboards for monitoring and insights.
  • Cross-Functional Collaboration: Work closely with development teams, SREs, and project stakeholders to ensure system-level alignment and high-impact delivery.
  • Quality Assurance Enablement: Integrate automated testing frameworks into CI pipelines to ensure code quality, stability, and performance across development cycles.

PREFERRED EXPERIENCE:
  • DevOps Tools & Automation: Proficient with Buildkite, GitHub Actions, Jenkins, Ansible, and scripting languages like Python and Bash for streamlining DevOps workflows.
  • Containerization & Orchestration:Strong experience with Docker, Kubernetes, and Helm for deploying and managing scalable, containerized applications.
  • Infrastructure as Code (IaC): Hands-on experience automating infrastructure provisioning and configuration to ensure reproducibility and scalability across environments.
  • GPU-Based Compute Environments: Familiarity with GPU server lifecycle management and integration of GPU resources into CI test workflows for performance-critical applications.
  • Monitoring & Observability: Experience using tools like Checkmk, Prometheus, and Grafana to monitor infrastructure health and application performance.
  • Version Control & Collaboration: Advanced knowledge of Git-based version control, including branching strategies and CI/CD integration for collaborative development.
  • Linux & System Administration: Solid background in Linux environments, including shell scripting and system-level troubleshooting across distributed systems.
  • Agile & Cross-Disciplinary Collaboration: Comfort working in Agile teams and partnering with software, infrastructure, and product teams to drive consistent delivery and innovation.

ACADEMIC CREDENTIALS:

  • Bachelor's Degree in computer/software engineering, Computer Science, or related discipline

#LI-JG1

Benefits offered are described: AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.