Role Summary
Senior DevOps Engineer with 8+ years of hands-on experience designing, building, and operating end‑to‑end CI/CD platforms across hybrid environments (AWS and on‑premises). Proven expertise in automation, reliability engineering, progressive delivery, security‑by‑design, and platform standardization to enable high‑quality, low‑risk software delivery at scale.
Key Responsibilities
CI/CD Platform & Environment Strategy
- Design, implement, and operate a standardized CI/CD framework supporting Dev, QA, PartnerLab, Staging, and Production environments
- Define promotion workflows with enforced quality gates and artifact immutability
- Establish PartnerLab as a dedicated integration and validation environment with no direct promotion path to Production
- Enable environment parity across AWS and on‑premises systems
Progressive Delivery & Release Engineering
- Implement feature flags, canary deployments, blue‑green strategies, and phased rollouts
- Enable automated rollback based on health checks, error rates, and SLO breaches
- Support release traceability from commit through production deployment
Test Automation & Quality Engineering
- Integrate unit, integration, regression, security, and performance testing into CI/CD pipelines
- Enforce automated quality gates prior to environment promotion
- Support manual validation workflows with controlled access, observability, and test artifacts
Database & Data Automation
- Automate database schema versioning, migrations, rollbacks, and validation
- Implement lower‑environment refresh pipelines sourced from production data
- Enforce data masking and PII anonymization for all non‑production environments
- Validate data integrity and consistency post‑refresh
Observability, Reliability & Operations
- Define and enforce observability standards across logs, metrics, and traces
- Implement service health dashboards, alerting, and incident signals
- Integrate deployment health into automated release decisions
- Support on‑call readiness, incident response, and post‑incident learning
Security, Governance & Compliance
- Embed security scanning, secrets management, and access controls into pipelines
- Enforce least‑privilege IAM, credential rotation, and artifact integrity verification
- Align CI/CD workflows with enterprise change management and audit requirements
Required Technical Skills
Cloud & Infrastructure
- AWS (mandatory): ECS, EKS, Lambda, RDS, IAM, CloudFormation, CloudWatch
- Hybrid infrastructure experience with on‑prem VM, bare‑metal, and internal networking platforms
- Terraform for modular, reusable, and policy‑compliant infrastructure provisioning
CI/CD & Platform Engineering
- GitHub Enterprise & GitHub Actions (workflow design, reusable templates, runners, environments)
- CI/CD orchestration across hybrid AWS and on‑prem topologies
- Artifact versioning, promotion, and immutability strategies
Containers & Orchestration
- Docker image design, optimization, and security hardening
- Kubernetes (EKS + on‑prem) deployment patterns, scaling, and lifecycle management
- Helm‑based deployment standardization
Testing, Analysis & Release Safety
- Automated testing frameworks for unit, integration, regression, and performance
- Static and dynamic analysis tools (code quality, security, dependency scanning)
- Feature flag management platforms (or equivalent internal capability)
Database & Data Management
- Hands‑on expertise with Oracle and Microsoft SQL Server (mandatory)
- Schema migration tooling and automated rollback strategies
- Data masking, anonymization, and controlled refresh automation
Observability & Reliability Engineering
- Metrics, logging, and tracing using Prometheus, Splunk, New Relic, Grafana, CloudWatch, OpenTelemetry, and ELK
- SLO‑driven alerting and deployment health evaluation (Uptrends & PagerDuty)
- Automated rollback and failure containment mechanisms
Security & Secrets Management
- Secrets management using HashiCorp Vault, AWS Secrets Manager, or equivalent
- Secure pipeline design with controlled credential access
- Compliance‑ready logging, approvals, and traceability
Soft Skills & Delivery Expectations
- Experience operating in regulated or financial services environments
- Strong written documentation, runbooks, and architectural clarity
- Ability to collaborate with application, infrastructure, security, and QA teams
- Comfortable working in onshore enterprise delivery models
Core Technical Deliverables
Hybrid CI/CD Platform
- Reusable GitHub Actions pipeline templates for AWS and on‑prem deployments
- Secure secrets integration and environment‑specific controls
- Automated tagging, versioning, and artifact promotion
Infrastructure as Code
- Terraform modules for compute, networking, storage, and access control
- Automated provisioning of build agents and deployment runners
- Consistent configuration standards across environments
Database Automation
- End‑to‑end migration, rollback, and validation pipelines
- Automated lower‑environment refresh with sanitized production data
- PII compliance enforcement and consistency checks
Validation & Quality Enablement
- Embedded automated test execution per environment
- Manual validation support with controlled access and observability
- Enforced quality gates before promotion
Observability & Release Safety
- Unified dashboards across AWS and on‑prem platforms
- Deployment health alerts and regression detection
- Progressive delivery with traceable rollback execution