Job Title: Principal Cloud Operations Architect
Location: Rockville MD (Onsite)
Duration/Term: Long Term Contract
Job Description
We are seeking an experienced Cloud Architect Operations to oversee the daily operations of the organization's cloud infrastructure and ensure optimal performance, reliability, scalability, and security. The ideal candidate will have strong expertise in cloud platforms, infrastructure automation, system administration, and incident management, along with the ability to lead a team of cloud engineers and administrators.
This role requires deep technical knowledge in cloud architecture, infrastructure management, monitoring, and automation, combined with leadership skills to manage operational excellence and continuous improvement of cloud environments.
Responsibilities
- Oversee the management, maintenance, and optimization of cloud infrastructure to ensure high availability and reliability.
- Configure and manage cloud resources to meet performance, scalability, and cost optimization objectives.
- Implement and maintain cloud monitoring and alerting solutions to track infrastructure health and performance.
- Demonstrate strong hands-on expertise with AWS cloud services, Terraform, and Ansible for Infrastructure as Code (IaC).
- Manage and support Windows and Linux operating systems in cloud environments.
- Lead and mentor a team of cloud engineers and administrators, ensuring productivity and technical growth.
- Coordinate daily operational activities and ensure alignment with organizational priorities.
- Lead the response to cloud-related incidents, ensuring timely resolution and minimal business disruption.
- Conduct root cause analysis (RCA) and implement preventive measures.
- Identify opportunities to automate repetitive operational tasks using automation scripts and IaC practices.
- Continuously improve cloud operations processes and operational efficiency.
- Ensure adherence to cloud security policies, standards, and best practices.
- Implement and maintain security controls to protect cloud resources and sensitive data.
- Ensure compliance with regulatory standards such as GDPR and HIPAA.
- Monitor cloud usage and implement cost optimization and resource utilization strategies.
- Perform capacity planning to support business growth and future infrastructure needs.
- Develop and maintain disaster recovery and business continuity plans for cloud infrastructure.
- Conduct regular testing and validation of disaster recovery procedures.
- Collaborate with IT teams, business stakeholders, and vendors to deliver scalable cloud solutions.
- Maintain detailed documentation of cloud architecture, configurations, operational procedures, and incident reports.
- Provide regular reporting on cloud performance, incidents, and operational metrics.
Qualifications
- Bachelor s degree in Computer Science, Information Technology, Electrical Engineering, or a related field.
- Advanced certifications or professional training in cloud technologies are preferred.
- Strong understanding of cloud architecture, infrastructure management, and operational best practices.
- Experience with cloud monitoring tools, automation frameworks, and system administration.
- Knowledge of cloud security principles and regulatory compliance frameworks.
- Strong communication skills with the ability to collaborate with both technical and non-technical stakeholders.
- Experience in vendor management and cloud technology evaluation.
Experience
- Proven experience in system administration and cloud operations environments.
- Experience working in a senior technical role or leadership position managing cloud infrastructure.
- Hands-on experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
- Experience implementing Infrastructure as Code and automation frameworks.
- Experience managing cloud incidents, operational monitoring, and performance optimization.
- Experience working with DevOps practices and CI/CD pipelines.
- Familiarity with ITIL or IT service management frameworks is preferred.
Key Skills
AWS, Cloud Architecture, Cloud Operations, Terraform, Ansible, Infrastructure as Code (IaC), Windows Server, Linux Administration, Cloud Monitoring, Incident Management, Root Cause Analysis, Automation, PowerShell, Python, Cloud Security, Cost Optimization, Capacity Planning, Disaster Recovery, Business Continuity Planning, DevOps, CI/CD, Jenkins, Git, ITIL, Vendor Management, Cloud Compliance (GDPR, HIPAA)
VDart Group is a global leader in technology, product, and talent solutions, serving clients including Fortune 500 companies across 13 countries. With over 4,000 professionals, we deliver innovation and results across industries. Committed to People, Purpose, and Planet, we are recognized for our sustainable practices through our EcoVadis Bronze Medal and UN Global Compact membership.