SRE Architect - 100% remote - Full Time Only

Overview

Remote
Full Time

Skills

Application Performance Monitoring (APM) tools: New Relic/Dynatrace

Job Details

Please share your Resume to or

you can call me directly at +1-

Roles & Responsibilities:

  • 18+ years of Development and Operations experience building and running production applications with uptime over 99%, or related experience and/or training, or an equivalent combination of education and experience
  • 8+ years of experience as an SRE Architect running large Reliability & Observability programs for large, complex infrastructure deployments and distributed systems for major Banking customers.
  • Keen eye for industry trends; tries out newer tools and infrastructure to improve the execution and/or operability of current systems
  • Strong hands-on coding experience in one or more programming languages, such as Java.
  • Good understanding of Observability (monitoring, logging, tracing, metrics) and Chaos engineering concepts.
  • Proficiency with an Application Performance Monitoring (APM) tool (New Relic/Dynatrace) for monitoring, logging, and tracing, and with Splunk for log monitoring.
  • Expert-level, hands-on knowledge of cloud platforms such as PCF.
  • Should have implemented solutions around Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for services.
  • Should have supported Production Incidents (PIs) on a company's mission-critical applications: troubleshooting, debugging, and diagnosing operational issues and driving them to closure.
  • Understanding of software delivery life cycles, particularly Agile/Lean and DevOps
  • Proven experience handling large-scale, growing infrastructure across Data Centers and heterogeneous Cloud platforms
  • Experience as a service owner managing a large, geographically diverse group of stakeholders
  • Ability to work with a creative, fast-growing engineering team and motivate its members to deliver their best work