Overview
Skills
Job Details
Hope you are doing well. Below is the Job Description, kindly go through it and please let me know if you are interested.
Job Title: Senior DevOps Engineer
Duration: 6 Months C2H
Location: Raleigh, NC - 5days/week onsite
Job Description
We are seeking a seasoned Senior DevOps Engineer to architect and maintain the infrastructure and deployment pipelines for our mission-critical Banking Foundation Platform. In this role, you will be the cornerstone of our reliability engineering practice, ensuring our hybrid event-driven and batch processing systems achieve exceptional availability, performance, and security. You will work across the entire stack, from Kubernetes orchestration to database performance monitoring, in a complex, high-transaction environment.
Key Responsibilities:
Architect, implement, and manage cloud infrastructure on Azure using Infrastructure-as-Code (Terraform/Puppet/ Ansible)
Design and maintain robust GitLab CI/CD pipelines for microservices, data pipelines, and infrastructure deployments
Manage and optimize Azure Kubernetes Service (AKS) clusters for 50+ microservices with a focus on auto-scaling, resource efficiency, and resilience
Implement and advance our monitoring, alerting, and observability stack (Prometheus, Grafana, Application Insights) to provide deep insights into application and infrastructure performance
Ensure platform security and compliance through Azure Active Directory, Key Vault, and network security configurations (NSGs, Private Endpoints)
Automate operational procedures and disaster recovery strategies to ensure high availability for banking operations
Collaborate with development teams to troubleshoot complex performance issues across the application stack
Required Qualifications:
8+ years of DevOps/SRE experience with proven expertise in Azure cloud services
Deep hands-on experience with Kubernetes (AKS preferred) in production environments at scale
Expertise in building and maintaining CI/CD pipelines (GitLab CI/CD preferred) with integrated security and quality gates
Strong proficiency in Infrastructure-as-Code using Terraform, Ansible, or similar technologies
Experience implementing and managing observability stacks (PrometheGrafana/Application Insights)
Solid understanding of networking, security, and compliance in cloud environments
Proficiency in scripting languages (Bash, Python, or PowerShell)
Ideal Candidate Possesses:
Proactive, can-do attitude with a passion for automation and continuous improvement
Excellent problem-solving skills with ability to diagnose complex distributed system issues
Strong communication skills for collaborating with development teams and proposing technical solutions
Ability to design for reliability, scalability, and performance under high-load conditions
Financial services or regulated industry experience (preferred)
This role is critical for maintaining the platform's reliability and performance while enabling rapid, safe deployment of new features across our microservices and data pipeline ecosystem. Position emphasize the need for strong communication, problem-solving skills, and the ability to work in a dynamic, fast-paced environment building mission-critical banking application.