Overview
Skills
Job Details
Hope you are doing great!!
Currently, we have a job opening of Cloud Operations Lead in Rockville, MD/Princeton, NJ for a Full Time Permanent role with our client. If you are interested then please reply me with your updated resume
Key Responsibilities:
Cloud Infrastructure Management:-
Oversee and ensure the high availability, reliability, and performance of cloud infrastructure.
-
Act as the primary escalation point for cloud-related issues.
-
Optimize cloud resource configurations to meet performance, cost, and operational goals.
-
Implement monitoring tools and practices to track infrastructure health and performance.
-
Perform root cause analysis and implement preventive measures for recurring issues.
-
Lead major and potential incident resolution with timely updates to stakeholders.
-
Conduct change impact analysis and ensure due diligence for all cloud platform modifications.
-
Lead, mentor, and support a team of cloud engineers and administrators.
-
Coordinate day-to-day team activities in alignment with business priorities.
-
Facilitate professional development opportunities for team members.
-
Collaborate with cross-functional teams, business units, and vendors to deliver cloud solutions.
-
Identify automation opportunities to streamline operations and reduce manual effort.
-
Develop and implement Infrastructure as Code (IaC) using Terraform, CloudFormation, or similar tools.
-
Manage CI/CD pipelines and DevOps tools such as Jenkins, Azure DevOps, GitHub, GitLab.
-
Oversee deployment, configuration, and lifecycle of cloud-native services (e.g., Lambda, EKS, CodeBuild, CodePipeline).
-
Administer a wide range of cloud services: EC2, ELB, VPC, Route 53, FSx, EFS, CloudTrail, SQS, SNS, Kinesis, etc.
-
Manage identity and access controls (IAM, SSO, Organizations, Permission Sets).
-
Oversee logging, monitoring (CloudWatch, Log Insights), and cloud-native analytics.
-
Ensure effective use of auto-scaling, load balancers, and backup/recovery tools.
-
Coordinate application migrations and maintain cloud-native database systems (e.g., DynamoDB, RDS).
-
Implement and maintain cloud security controls and policies.
-
Ensure compliance with industry regulations (e.g., GDPR, HIPAA).
-
Manage security tools and services: AWS GuardDuty, WAF, Inspector, Macie, KMS, Key Vault.
-
Collaborate with InfoSec teams for audit readiness and incident response.
-
Monitor cloud usage and resource allocation to prevent overprovisioning.
-
Implement cost optimization strategies using native tools like AWS Cost Explorer.
-
Perform capacity planning and forecasting to support future growth.
-
Design and maintain comprehensive disaster recovery (DR) and business continuity plans.
-
Conduct regular DR testing to validate readiness and recovery procedures.
-
Maintain up-to-date documentation on infrastructure, configurations, and processes.
-
Generate regular reports on performance, incidents, cost metrics, and compliance.
-
Ensure documentation is accessible to stakeholders and audit-ready.
Platform-Specific Operations:
AWS:-
Manage services like EC2, EKS, Lambda, RDS, DynamoDB, CloudFormation, GuardDuty, and more.
-
Oversee AWS-native networking, storage, monitoring, automation, and security services.
-
Manage subscriptions, resource groups, Azure DevOps, storage, networking, automation, and monitoring.
-
Administer Azure AD, licensing, pipelines, GitHub integration, and Azure Automation.
-
Oversee infrastructure services, VCN, Golden Gate, native firewalls, GuardDuty equivalents, and more.
Qualifications & Experience:
Required:-
Bachelor's degree in Computer Science, IT, Electrical Engineering, or a related field; advanced degree preferred.
-
Strong experience in system/cloud administration and senior-level technical leadership.
-
Proficiency in AWS, with working experience across Azure and Oracle Cloud platforms.
-
Expertise in cloud operations, architecture, and service management.
-
Skilled in automation and scripting (PowerShell, Python, Terraform, Ansible, CloudFormation).
-
Proficiency with Windows/Linux servers, networking (DNS, DHCP, LAN/WAN), and VMware/Active Directory.
-
Strong interpersonal and communication skills.
-
Experience with vendor management and contract negotiation.
-
Certifications: AWS Certified Solutions Architect (Associate/Professional), Azure Architect Expert.
-
Familiarity with ITIL and ITSM frameworks.
-
Experience with CI/CD tools (Jenkins, Git, Azure DevOps).
-
Strong analytical and problem-solving skills.
-
Proactive, self-motivated, and capable of working both independently and collaboratively.
Note: VBeyond is fully committed to Diversity and Equal Employment Opportunity.