Job Title: Databricks Administrator (Public Sector TX HHSC)
Visa: GC-EAD & EAD
Location: Austin, TX (Hybrid/Onsite Local Candidates Only)
Duration: 780 Hours (Through 08/31/2026) + Extensions
Must-Have Requirements (Strict Screening)
Public Sector Experience (MANDATORY NO EXCEPTIONS)
Client Domain: Health & Human Services / Medicaid / Benefits Systems
Must meet Residency Requirements (Local to Austin, TX)
Include Job Number + Candidate Name in submission
Strong ATS keyword alignment required (AI-based screening by client)
Mandatory forms must be completed (Required = Mandatory | Preferred = Value Add)
Role Overview
The Databricks Administrator will be responsible for administration, configuration, governance, and optimization of the Databricks platform to support enterprise-scale data engineering, analytics, and AI/ML workloads.
This role plays a critical part in ensuring platform reliability, security, performance tuning, cost optimization, and compliance across cloud-based data ecosystems.
Technical Stack & Responsibilities
Cloud Platform & Databricks
AWS Cloud (Primary): EC2, S3, IAM, VPC, CloudWatch
Databricks Workspace Administration
Cluster Management & Configuration (Autoscaling, Instance Pools)
Job Scheduling & Workflow Orchestration
Big Data & Processing
Apache Spark (Core): Spark SQL, DataFrames, RDDs
Performance Tuning (Partitioning, Caching, Query Optimization)
Distributed Data Processing
Identity, Access & Security
IAM (AWS Identity & Access Management)
SCIM (User Provisioning)
Role-Based Access Control (RBAC)
Encryption (At-rest & In-transit)
Secrets Management (Databricks Secrets, AWS Secrets Manager)
Data & Storage Integration
AWS S3 (Primary Data Lake Storage)
Data Lake / Lakehouse Architecture
Delta Lake (ACID transactions, schema enforcement)
Data Ingestion & Integration Pipelines
Databricks Platform Features
Databricks SQL Analytics
Notebooks (Python, SQL, Scala)
Job Orchestration & Scheduling
Unity Catalog (Data Governance & Access Control)
Monitoring & Performance
Platform Monitoring: Databricks Metrics, AWS CloudWatch
Cluster Health Monitoring & Troubleshooting
Query Performance Optimization
Logging & Alerting
DevOps & Automation
Infrastructure as Code (Terraform, AWS CloudFormation)
CI/CD Pipelines (Jenkins, GitHub Actions, Azure DevOps)
Version Control (Git)
Scripting (Python, Bash)
Cost Optimization
Cluster Cost Management (Auto-termination, Rightsizing)
Usage Monitoring & Budget Controls
Workload Optimization Strategies
AI / ML Enablement (Preferred)
Databricks ML
MLflow (Experiment Tracking, Model Registry)
Support for AI/ML pipelines
Minimum Qualifications
Years
Requirement
Details
8+
Required
Databricks Administration (AWS Environment)
8+
Required
Cluster Configuration & Workspace Management
8+
Required
IAM, SCIM, RBAC Access Control
8+
Required
Apache Spark (Performance Tuning & Troubleshooting)
8+
Required
AWS S3 Integration & Data Lake Architecture
8+
Required
Cluster Policies & Governance
8+
Required
Databricks SQL, Notebooks, Job Scheduling
8+
Required
Monitoring, Logging & Performance Optimization
8+
Required
Data Security, Encryption & Compliance
8+
Required
DevOps Tools (Terraform, CI/CD, Automation)
4+
Preferred
Enterprise / Government Environment
4+
Preferred
Unity Catalog
4+
Preferred
Cost Optimization Strategies
4+
Preferred
AI/ML (Databricks ML, MLflow)
4+
Preferred
Lakehouse Architecture
4+
Preferred
Python / SQL / Scala
Key Skills (ATS Keywords)
Databricks Administration, AWS, Apache Spark, Delta Lake, Unity Catalog, IAM, RBAC, SCIM, S3, Data Lake, Lakehouse, Terraform, CI/CD, MLflow, Databricks SQL, Performance Tuning, Cluster Management, Data Governance, Cost Optimization