Overview
Skills
Job Details
My name is Mohammed Tousif, and I am a Recruiter with Russell Tobin. I came across your resume and was hoping to discuss your current employment situation in more detail. I have included a position below that you may be a great fit for. If you are not looking for an immediate opportunity, I would still love to connect. Also, if you are not interested in the position, please feel free to pass this opportunity along to your friends and colleagues who might be interested. Responsibilities
Use the core Site Reliability Engineering principles of change management, monitoring, emergency response, capacity planning, and production readiness reviews to run the platform.
Build infrastructure and drive projects that break things with the aim to improve the robustness of production systems
Build holistic visibility into SLIs, SLOs, SLAs, dependency graphs, past performance of software, network, and system to ensure that we can continue to scale without increasing operational burden or toil.
Step back to observe patterns and develop innovative tools and automation to minimize toil. Use those learnings to drive the best operational practices.
Share on-call responsibilities with other teammates and own the improvement of the team's on-call practices
Unblock, support, and effectively communicate across teams to achieve results
Assist in cost engineering efforts to ensure effi cient use of cloud resources.
Requirements
Infrastructure-as-code best practices (preferably using Terraform and/or Crossplane) Source control best practices using Git A deep commitment to
security fundamentals such as attack surface minimization, blast radius minimization and least-privilege authorization
Deep understanding of VPC fundamentals including subnets, routing and security groups VPC peerings and Transit Gateway routing (including cross-region routing)
Redundant Site-to-Site VPN best practices DirectConnect best practices AWS ALB confi guration including automated confi guration via a Kubernetes Load Balancer Controller DNS management on AWS Route53 including automated confi guration via a Kubernetes External DNS controller
Internal DNS management including the use of Route53 resolvers Network observability and alerting using cloud services such as Datadog Experience with operating in PCI-compliant environments is an advantage Experience with PrivateLink is an advantage
Experience with BGP is a nice to have Experience with Kubernetes manifests and network meshes such as Cilium is an advantage
Mohd Tousif
Senior Associate - COE
- Ext. 0268
420 Lexington Ave, 30th Fl.
New York, NY 10170