Overview
On Site
Depends on Experience
Contract - W2
Contract - 1 Year(s)
Skills
Access Control
Agile
Amazon Web Services
Artificial Intelligence
Cloud Computing
Collaboration
Command-line Interface
Communication
Computer Networking
Continuous Delivery
Continuous Integration
Databricks
Debugging
DevOps
Distributed Computing
Finance
GitHub
Good Clinical Practice
Google Cloud Platform
Grafana
Health Care
Identity Management
Kanban
Linux
Machine Learning (ML)
Machine Learning Operations (ML Ops)
Management
Mentorship
Microsoft Azure
Python
Scrum
Storage
Teamwork
Terraform
Unity
Virtual Private Cloud
Job Details
Role: Senior Databricks AI Platform Admin
Location: Alpharetta, GA (Hybrid)
Note: Only on W2
Job Description
We are looking for a Senior Databricks AI Platform SRE to join our Platform SRE team. This role will be critical in designing, building, and optimizing a scalable, secure, and developer-friendly Databricks platform to enable Machine Learning (ML) and Artificial Intelligence (AI) workloads at enterprise scale
You will partner with ML engineer, data scientists, platform teams, and cloud architects to automate infrastructure, enforce best practices, and streamline the end-to-end ML lifecycle using modern cloud-native technologies.
Required Skills:
- Proven experience with Terraform for building and managing infrastructure.
- Strong programming skills in Python and Java
- Hands-on experience with cloud networking, identity and access management, key vaults, monitoring, and logging in Azure
- Hands on experience with Databricks (Workspace management, Clusters, Jobs, MLFlow, Delta Lake, Unity Catalog, Mosaic AI)
- Deep understanding of Azure or AWS infrastructure (e.g. IAM, VNets/VPC, Storage, Networks, Compute, Key management, monitoring)
- Strong experience in distributed system design, development and deployment using agile/devops practices.
- Experience with CI/CD pipelines (GitHub Actions, or similar)
- Experience implementing monitoring and observability using Prometheus, Grafana or Databricks-native solutions.
- Good communication skills, excellent teamwork experience, ability to mentor and develop more junior developers, including participating in constructive code reviews
Preferred Skills:
- Experience in multi-cloud environments (AWS/Google Cloud Platform) is a bonus
- Experience in working in highly regulated environments (finance, healthcare, etc.) is desirable
- Experience with Databricks REST APIs and SDKs
- Knowledge of MLFlow, Mosaic AC, & MLOps tooling
- Working with teams using Scrum, Kanban or other agile practices
- Proficiency with standard Linux command line and debugging tools
- Azure or AWS Certifications
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.