Software Engineer, Reliability Engineering, AiDP

Austin, TX, US • Posted 4 days ago • Updated 1 day ago
Full Time
On-site

Job Details

Skills

  • Artificial Intelligence
  • Cloud Computing
  • Reliability Engineering
  • Computer Science
  • Python
  • Java
  • Kubernetes
  • Docker
  • Orchestration
  • Open Source
  • Management
  • Continuous Integration
  • Continuous Delivery
  • Performance Analysis
  • FOCUS
  • Big Data
  • Apache Spark
  • Apache Flink
  • Generative Artificial Intelligence (AI)
  • Machine Learning (ML)
  • Linux
  • Database

Summary

The Applied Machine Learning team in the AI and Data Platform org has been at the forefront of accelerating digital transformation through machine learning across Apple's enterprise ecosystem. We build and operate ML, GenAI, Inference, and Data Platforms and Services that provide a comprehensive suite of capabilities, serving business-critical needs across Apple's enterprise. We work on interesting and hard challenges related to scale and performance across a diverse set of open-source and cutting-edge technologies.

We are looking for a talented engineer to join our team and bring a passion for building and operating large-scale platforms and distributed systems that leverage cutting-edge open-source technologies across hybrid cloud environments. As a software engineer in AiDP reliability engineering, you will work on one or more projects related to GenAI, ML, Inference, and Big Data platforms.

  • BS/MS in computer science or equivalent experience.
  • 2+ years of programming experience in at least one of the following languages: Python, Java, or Go.
  • 2+ years of experience with Kubernetes, Docker, or another container orchestration framework.

  • Ability to read and explain open-source codebases.
  • Experience deploying and managing CI/CD pipelines.
  • Strong expertise in troubleshooting complex production issues.
  • Ability to understand complex architectures and comfort working with multiple teams.
  • Ability to conduct performance analysis and troubleshoot large-scale distributed systems.
  • Highly proactive, with a keen focus on improving the uptime and availability of our mission-critical services.
  • Experience with big data technologies (Spark, Flink, Iceberg) or emerging GenAI/ML technologies (such as Ray, MLflow, or model serving).
  • Experience with Linux, database, and security concepts.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90733111
  • Position Id: aa97f6e72bc594b35795993f4ae5f19c