o Install, Upgrade, Configure, Tune and Patch Cloudera Manager, CDH and all CDH Services
o Install/Upgrade R and R Packages
o Install/Upgrade Python and Python Packages
o Install/Upgrade SparkR
o Resource management configuration and tuning
o Add and Configure Master, Worker and Edge Nodes
o Add Data Science Boxes (Linux Boxes) and Configure Hadoop Services
o Install, Configure and Manage RStudio/JupyterHub/Dataiku
o Disaster recovery (DR) and related activities, such as configuring additional replication and backup
o Install, Configure and Upgrade Rundeck
o Create/Update and Manage reports for metrics and performance
o Work with Shell Scripts, Python Scripts, Ansible, Java
o Build Ansible roles for new product enhancements
o Debug code/scripts to fix bugs/issues
o Maintain scripts/code in Stash / GitHub
· Host Preparation
o Prepare the Host as per standards
o Prepare file system / mount points
o Install/Upgrade required services (LDAP, DNS, KDC, HAProxy, etc.)
o Install required packages
· Working with data delivery teams to set up new Hadoop users: setting up Linux users, setting up Kerberos principals, and testing HDFS, Hive, Pig and MapReduce access for the new users
· Develop new processes for better maintenance of the environments.
· Collaborating with development and data science teams to troubleshoot and resolve their issues
· Point of Contact for Vendor escalation.
· Execute and provide feedback on operational policies, procedures, processes and standards
· Support any process that needs attention in the Production environment.
· Automate manual tasks
· Process Support
· Develop Infrastructure documentation.
If interested, please email your updated resume along with your current location and visa status.
mshah at qualityitsource dot com