This company operates a family of highly specialized electronic trading platforms that support fast, reliable, and regulated financial transactions. They design and run the technology that powers these markets, including the infrastructure, software systems, and connectivity that enable participants to trade efficiently and securely. Their work spans on-premises data centers and cloud environments, with a strong emphasis on performance, compliance, and operational resilience.
They are searching or a Senior DevOps Engineer to take ownership of their administering Kubernetes clusters and onboarding applications across both on-prem and cloud environments. You will build and maintain CI/CD pipelines, implement secure identity and access workflows, and establish observability, networking, and storage tooling for the platform. The position also involves automating operational processes, troubleshooting complex incidents, and contributing to SRE practices while collaborating closely with network, security, and application teams in a fast-paced, highly technical environment. This is a hybrid opportunity in West Windsor Township, NJ.
Required Skills & Experience - 5+ years of experience in cloud engineering, DevOps, or infrastructure roles.
- 3+ years of experience with Kubernetes app manifests (Kustomize/ Helm), networking, and GitOps for Kubernetes.
- Strong hands-on experience with AWS (EKS, Load Balancers, S3, VPC, RDS, ECS, CloudWatch) and on-prem.
- Proficiency in scripting languages: Shell, Bash, and Python.
- Bonus points for experience in financial services or trading environments, but not required.
What You Will Be Doing - Lead end-to-end Kubernetes application onboarding, including environment preparation, production-readiness assessments, and ongoing operational support across EKS and on-prem clusters.
- Implement secure access patterns, designing and managing authentication and authorization workflows using Keycloak and AWS IAM.
- Build and maintain CI/CD automation, creating Jenkins pipelines for container image builds and integrating with multiple image registries.
- Develop and support platform observability and infrastructure tooling, covering monitoring, container storage, and networking components.
- Handle incident response and SRE practices, including root-cause analysis, documentation, SLIs, error budgets, alerting improvements, and participation in on?call rotations.
The Offer You will receive the following benefits:
- Medical, Dental, and Vision Insurance
- 401K Retirement Savings Plan
- Life and Disability Benefits
- Paid Parental Leave
Applicants must be currently authorized to work in the US on a full-time basis now and in the future.