Overview
Skills
Job Details
Cloudera Hadoop / Spark / Airflow Admin:
Administer Cloudera Hadoop clusters (HDFS, YARN, Hive, Impala) using Cloudera Manager; ensure high availability and performance.
Perform OS and platform patching/upgrades for Hadoop, Spark, and Airflow components with rollback plans.
Manage Spark workloads: tune executors, memory, and dynamic allocation for optimal performance.
Install, configure, and maintain Apache Airflow (Scheduler, Webserver, Workers); ensure DAG reliability and SLA compliance.
Implement security and compliance: Kerberos, LDAP/AD integration, Ranger policies, TLS certificates.
Monitor and troubleshoot platform health using Cloudera Manager, PrometheGrafana, and log analysis.
Automate operational tasks with Ansible, Python, and shell scripting for provisioning, patching, and config management.
Handle incident and change management: root cause analysis, documentation, and on-call support.
Perform capacity planning and resource optimization for multi-tenant clusters.
Collaborate with data engineering and infrastructure teams to ensure platform stability and scalability.