CLOUD OPERATIONS LEAD

Overview

depending on experience
Full Time

Skills

System Monitoring
Workflow
High Availability
Incident Management
Root Cause Analysis
Operational Efficiency
Documentation
Management
IaaS
Issue Resolution
Regulatory Compliance
Communication
Splunk
DNS
Dragon NaturallySpeaking
Scripting
Python
Bash
Windows PowerShell
Configuration Management
Ansible
Progress Chef
Puppet
Training
Cloud Computing
Microsoft Azure
Terraform
Amazon Web Services
Orchestration
Docker
Kubernetes
Computer Networking
Firewall
DNS Administration
Continuous Integration
Continuous Delivery
Jenkins
Grafana
JIRA
ServiceNow
Salesforce.com
DICE
MIT
Military
Collaboration
Partnership
Law

Job Details

City/State:
Yonkers, New York
Grant Funded:
No
Department:
IT - Technology & Cloud Services
Work Shift:
Day
Work Days:

Scheduled Hours:

Scheduled Daily Hours:

Pay Range:

Role Description: The Cloud Operations Lead is responsible for managing the team that is responsible for the systems, monitoring, and maintenance of cloud-based platforms, systems and services to ensure seamless operation, high availability, and optimal performance. This team's responsibilities include automating workflows, troubleshooting issues, and enabling reliable operations within a cloud infrastructure environment.

Responsibilities:
  • Oversee the monitoring, management, and troubleshooting of cloud environments and systems to ensure high availability, reliability, and cost-efficiency using tools like Prometheus, Grafana, Terraform, and Kubernetes.
  • Act as the primary point of contact between the organization and the contracted team, providing technical guidance, aligning team efforts, and collaborating across dev and ops teams.
  • Lead incident management processes, including root cause analysis, escalation, and the implementation of preventive measures to minimize downtime and operational risks.
  • Drive automation initiatives to streamline cloud operations, reduce manual intervention, and enhance operational efficiency through Infrastructure as Code (IaC) and container orchestration.
  • Maintain comprehensive documentation of cloud infrastructure, operational procedures, and troubleshooting guides while staying updated on emerging cloud technologies to improve practices.

Skillset:
  • A highly technical manager who has overseen Cloud Operations teams and is hands on and has remained current with their technical skills.
  • Expertise in managing and optimizing cloud infrastructure for reliability and performance.
  • Proficiency in implementing monitoring and alerting systems for proactive issue resolution.
  • Knowledge of security best practices and compliance requirements for cloud operations.
  • Strong collaboration and communication skills to work effectively with cross-functional teams.
  • Expertise in monitoring and observability tools like Prometheus, Grafana, or Splunk.
  • Solid knowledge of networking concepts, load balancers, firewalls, and DNS in cloud environments.
  • Proficiency in scripting and automation using Python, Bash, or PowerShell.
  • Experience with container orchestration tools like Kubernetes and Docker.
  • Familiarity with configuration management tools like Ansible, Chef, or Puppet.

Recommended Certifications and Training:
  • AWS Certified SysOps Administrator
  • AWS Cloud Operations Certification
  • AWS certified partitioner.
  • Kubernetes Administrator (CKA)
  • Terraform Associate Certification
  • Docker Certified Associate

Tech Stack:
  • Public cloud platform services (AWS and Azure)
  • Infrastructure as Code tools (e.g. Terraform and AWS CloudFormation)
  • Containerization and Orchestration tools (e.g. Docker and Kubernetes)
  • Networking Tools -Virtual Private Clouds (VPCs), firewalls, and DNS management
  • CI/CD Tools (e.g. Jenkins)
  • Security Tools
  • Monitoring and Observability tools (e.g. Prometheus, Grafana, Datadog)
  • Ticketing Tools (e.g. Jira and ServiceNow)


Estimated salary 122,500

SF-DICE-MIT

Montefiore Health System, Inc. is an equal employment opportunity employer. Montefiore Health System, Inc. will recruit, hire, train, transfer, promote, layoff and discharge associates in all job classifications without regard to their race, color, religion, creed, national origin, alienage or citizenship status, age, gender, actual or presumed disability, history of disability, sexual orientation, gender identity, gender expression, genetic predisposition or carrier status, pregnancy, military status, marital status, or partnership status, or any other characteristic protected by law.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Montefiore Health System Inc